Skip to main content

Currently Skimming:

Commercializing Auditory Neuroscience
Pages 5-14

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.
Select key terms on the right to highlight them within pages of the chapter.


From page 5...
... This ambitious endeavor will require a decade of work by a large team of specialists and a network of highly skilled collaborators supported by substantial financial resources. To date, from 2002 to 2006, we have developed the core technology, determined a viable market direction, secured financing, assembled a team, and developed and executed a viable, sustainable business model that provides incentives (expected return on investment)
From page 6...
... The basic model-building began in 1998 and continued through 2002, just when personal computers crossed the 1 GHz mark, which meant that, for the first time in history, it was possible to build working, real-time models of real brainsystem components, in software, on consumer computer platforms. Early demonstrations in 2001 and 2002 included high-resolution, real-time displays of the cochlea; binaural spatial representations, such as interaural time differences (ITDs)
From page 7...
... , a characterization process, a grouping process, a selection process, and Inverse FCT. · FCT provides a high-quality spectral representation of the sound mixture, with sufficient resolution and without introducing frame artifacts, to allow the characterization of components of multiple sound sources.
From page 8...
... sampled um er- ving usic audio Wind, Noise Ov Spectr y Mo m, ns ainstr, Noise wds y astF Cro, hor, cognitive ned systems ansF -Stationar Sirens the PA atter of Stationar P Non Random, Architecture In 2 udio A FIGURE
From page 9...
... FCT is updated with every audio sample, which allows for resolution of glottal pulses, as necessary, to compute periodicity measures on a performant basis as a cue for grouping voice components. Because of the way FFT is often configured, it provides poor spectral resolution at low frequencies; very often, the following processor (such as a back FIGURE 3 Comparison of fast Fourier transform and Fast Cochlea TransformTM.
From page 10...
... The boundaries between the two signals are much better defined, which results in high performance in the subsequent grouping and separation steps. Characterization Process The polyphonic pitch algorithm is capable of resolving the pitch of multiple speakers simultaneously and detecting multiple musical instruments simulta
From page 11...
... Spatial localization is valuable for stream separation and locating sound sources, when stereo microphones are available. Figure 6 shows the response of binaural representations to a sound source positioned to the right of the stereo microphone pair.
From page 12...
... Bottom panel: isolated voice. Source: Audience Inc.
From page 13...
... and what our system must do to be commercially viable (e.g., compute with conventional digital representations, use fast silicon hardware, provide inverse spectral transformation)
From page 14...
... New York: Oxford University Press.


This material may be derived from roughly machine-read images, and so is provided only to facilitate research.
More information on Chapter Skim is available.