J Am Acad Audiol 1999; 10(06): 304-318
DOI: 10.1055/s-0042-1748502
Original Article

Acoustic-Phonetic Approach toward Understanding Neural Processes and Speech Perception

Dawn Burton Koch
Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
,
Therese J. McGee
Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
,
Ann R. Bradlow
Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
,
Nina Kraus
Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
Departments of Neurobiology, Physiology, Otolaryngology, and Northwestern University, Evanston, Illinois
› Author Affiliations

Abstract

This review paper describes an “acoustic-phonetic” experimental approach aimed at understanding normal and abnormal speech perception processes from both a behavioral and an electrophysiologic perspective. First, we consider the relevant acoustic characteristics of speech and identify a set of acoustic-phonetic classes that represent the parameters most important for making an acoustic signal sound like speech. Second, we review what is known about the neurophysiologic representation of acoustic-phonetic speech parameters in animal and human subjects. Third, we describe how an acoustic-phonetic approach has been useful in understanding the biologic basis of some auditory learning problems in children and in characterizing the behavioral and neurophysiologic changes resulting from speech-sound training. Finally, we discuss these findings and how they may expand the diagnostic and rehabilitative repertoire of practicing audiologists.

Abbreviations: F0 = fundamental frequency; F1, F2, etc. = first formant, second formant, etc; fMRI = functional magnetic resonance imaging; JND = just noticeable difference; MMN = mismatch negativity



Publication History

Article published online:
29 April 2022

© 1999. American Academy of Audiology. This article is published by Thieme.

Thieme Medical Publishers, Inc.
333 Seventh Avenue, 18th Floor, New York, NY 10001, USA

 
  • REFERENCES

  • Bradlow AR, Pisoni DB, Akahane-Yamada R, Tohkura Y. (1997). Training Japanese listeners to identify English /r/ and /1/: some effects of perceptual learning on speech production. J Acoust Soc Am 101:2299–2310.
  • Bradlow AR, Kraus N, Nicol T, McGee T, Cunningham J, Carrell TD. (1998). Speech-sound perception in normal and learning-disabled children: effect of lengthened CV transition duration. J Acoust Soc Am 103:2985.
  • Bradlow AR, Kraus N, Nicol T, McGee T, Cunningham J, Carrell TD. (1999). Effect of lengthened transition duration on discrimination and neural representation of synthetic CV syllables by normal and learning-disabled children. Submitted.
  • Buchwald J, Halas E, Schramm S. (1966). Changes in cortical and subcortical unit activity during behavioral conditioning. Physiol Behav 1:11–22.
  • Carney LH, Geisler CD. (1986). A temporal analysis of auditory-nerve fiber responses to spoken stop consonant-vowel syllables. J Acoust Soc Am 79:1896–1914.
  • Carr CE, Heiligenberg W, Rose GJ. (1986). A time-comparison circuit in the electric fish midbrain. I. Behavior and physiology. J Neurosci 6:107–119.
  • Creutzfeldt 0, Hellweg F-C, Schreiner C. (1980). Thalamocortical transformation of responses to complex auditory stimuli. Exp Brain Res 39:87–104.
  • Delgutte B, Kiang NYS. (1984a). Speech coding in the auditory nerve: I. Vowel-like sounds. J Acoust Soc Am 75:866–878.
  • Delgutte B, Kiang NYS. (1984b). Speech coding in the auditory nerve: III. Voiceless fricative consonants. J Acoust Soc Am 75:887–896.
  • Delgutte B, Kiang NYS. (1984c). Speech coding in the auditory nerve: V Vowels in background noise. J Acoust Soc Am 75:908–918.
  • Deng L, Geisler CD. (1987). Responses of auditory-nerve fibers to nasal consonant-vowel syllables. J Acoust Soc Am 82:1977–1988.
  • Depireux DA, Simon JZ, Klein DJ, Shamma SA. (1998). Representation of dynamic complex spectra in primary auditory cortex. Assoc Res Otolaryngol Abstr 21:148.
  • de Ribaupierre F, Goldstein M, Yeni-Komshian G. (1972). Cortical coding of repetitive acoustic pulses. Brain Res 46: 205–225.
  • Eggermont J. (1990). The Correlative Brain: Theory and Experiment in Neural Interaction. Studies of Brain Function. Vol. 16. Berlin: Springer-Verlag.
  • Eggermont J. (1991). Rate and synchronization measures of periodicity coding in cat primary auditory cortex. Hear Res 56:153–167.
  • Eggermont J. (1995). Representation of a voice onset time continuum in primary auditory cortex of the cat. J Acoust Soc Am 98:911–920.
  • Eggermont J. (1997). Firing rate and firing synchrony distinguish dynamic from steady state sound. Neuroreport 8:2709–2713.
  • Eggermont J, Smith GM. (1996). Neural connectivity only accounts for a small part of neural correlation in auditory cortex. Exp Brain Res 110:379–391.
  • Elliott L, Hammer M. (1988). Longitudinal changes in auditory discrimination in normal children and children with language-learning problems. J Speech Hear Disord 53:467–474.
  • Elliott L, Hammer M, Scholl M. (1989). Fine-grained auditory discrimination in normal children and children with language-learning problems. J Speech Hear Res 32:112–119.
  • Frisina RD, Walton JP, Karcich KJ. (1994). Dorsal cochlear nucleus single neurons can enhance temporal processing capabilities in background noise. Exp Brain Res 102:160–164.
  • Geisler CD. (1989). The responses of models of “high-spontaneous” auditory-nerve fibers in a damaged cochlea to speech syllables in noise. J Acoust Soc Am 86:2192–2205.
  • Geisler CD, Gamble T. (1989). Responses of “high-spontaneous” auditory-nerve fibers to consonant-vowel syllables in noise. J Acoust Soc Am 85:1639–1652.
  • Halas E, Beardsley J, Sandie M. (1970). Conditioned neuronal responses at various levels in conditioning paradigms. Electroencephalogr Clin Neurophysiol 28:468–477.
  • Harrison RV. (1998). An animal model of auditory neuropathy. Ear Hear 19:355–361.
  • Harrison R, Stanton S, Ibrahim D, Nagasawa A, Mount R. (1993). Neonatal cochlear hearing loss results in developmental abnormalities of the central auditory pathways. Acta Otolaryngol 113:296–302.
  • Heil P, Irvine DR. (1997). First-spike timing of auditory-nerve fibers and comparison with auditory cortex. J Neurophysiol 78:2438–2454.
  • Irvine DR, Rajan R. (1996) Injury- and use-related plasticity in the primary sensory cortex of adult mammals: possible relationship to perceptual learning. Clin Exp Pharmacol Physiol 23:939–947.
  • Johnson K. (1997). Acoustic and Auditory Phonetics. Cambridge, MA: Blackwell.
  • Kiang NY, Moxon EC. (1974). Tails of tuning curves of auditory-nerve fibers. J Acoust SocAm 55:620–630.
  • King C, Nicol T, McGee T, Kraus N. (1997). Aggregate responses in the left and right thalamus and cortex to auditory stimulation. Assoc Res Otolaryngol Abstr 20:104.
  • King C, Nicol T, McGee T, Kraus N. (in press). Thalamic asymmetry is related to acoustic signal complexity. Neurosci Lett.
  • Klein SK, Kurtzberg D, Brattson A, Kreuzer JA, Stapells DR, Dunn MA, Rapin I, Vaughan HG Jr. (1995). Electrophysiologic manifestations of impaired temporal lobe auditory processing in verbal auditory agnosia. Brain Lang 51:383–405.
  • Koch D, Tremblay K, Dunn I, Dinces E, Carrell T, Kraus N. (1997). Speech-evoked Nl and mismatch neurophysiology responses in cochlear-implant users and normal listeners. Assoc Res Otolaryngol Abstr 20:80.
  • Koch DB, Bradlow AR, Bellis TJ, Cunningham J, Kraus N. (1999). Effects of age on speech-sound representation and perception in normal-hearing women. Assoc Res Otolaryngol Abstr 22:286.
  • Kowalski N, Depireux DA, Shamma SA. (1996a). Analysis of dynamic spectra in ferret primary auditory cortex. I. Characteristics of single-unit responses to moving ripple spectra. J Neurophysiol 76:3503–3523.
  • Kowalski N, Depireux DA, Shamma SA. (1996b). Analysis of dynamic spectra in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary dynamic spectra. J Neurophysiol 76:3524–3534.
  • Kowalski N, Versnel H, Shamma SA. (1995). Comparison of responses in the anterior and primary fields of the ferret cortex. J Neurophysiol 73:1513–1523.
  • Kraus N, Disterhoft J. (1982). Response plasticity of single neurons in rabbit auditory association cortex during tone-signalled learning. Brain Res 246:205–215.
  • Kraus N, McGee T, Carrell T, Sharma A. (1995a). Neurophysiology bases of speech discrimination. Ear Hear 16:19–37.
  • Kraus N, McGee T, Carrell T, King C, Tremblay K, Nicol T. (1995b) Central auditory system plasticity associated with speech discrimination training. J Cogn Neurosci 7:25–32.
  • Kraus N, McGee TJ, Carrell TD, Zecker TG, Nicol TG, Koch DB. (1996). Auditory neurophysiologic responses and discrimination deficits in children with learning problems. Science 273:971–973.
  • Kraus N, McGee T, Ferre J, Hoeppner JA, Carrell TD, Sharma A, Nicol T. (1993). Mismatch negativity in the neurophysiologic/behavioral evaluation of auditory processing deficits: a case study. Ear Hear 14:223–234.
  • Kraus N, McGee T, Littman T, Nicol T, King C. (1994a). Nonprimary auditory thalamic representation of acoustic change. J Neurophysiol 72:1270–1277.
  • Kraus N, McGee T, King C, Littman T, Nicol T (1994b). Discrimination of speech-like signals in auditory thalamus and cortex. J Acoust Soc Am 96:2758–2768.
  • Kurtzberg D, Stapells DR, Wallace IF. (1988). Event-related potential assessment of auditory system integrity: implications for language development. In: Vietze P, Vaughan Jr HG, eds. Early Identification of Infants with Developmental Disabilities. Philadelphia: Grune and Stratton, 160–180.
  • Ladefoged P. (1993). A Course in Phonetics. New York: Harcourt Brace Jovanovich.
  • McClaskey C, Pisoni D, Carrell T. (1983). Transfer of training of a new linguistic contrast in voicing. Percept Psychophys 34:323–330.
  • McGee T, Kraus N, King C, Nicol T. (1996). Acoustic elements of speechlike stimuli are reflected in surface recorded responses over the guinea pig temporal lobe J Acoust SocAm 99:3606–3614.
  • Merzenich M, Grajski K, Jenkins W, Recanzone G, Peterson B. (1991). Functional cortical plasticity: cortical network origins of representational changes. Cold Spring Harbor Symp Quant Biol 55:873–887.
  • Merzenich MM, Jenkins WM, Johnston P, Schreiner C Miller SL, Tallal P. (1996). Temporal processing deficits of language-learning impaired children ameliorated by training. Science 271:77–81.
  • Micelli G. (1982). The processing of speech sounds in a patient with cortical auditory disorder. Neuropsychologia 20:5–20.
  • Miller MI, Sachs MB. (1983). Representation of stop consonants in the discharge patterns of auditory-nerve fibers. J Acoust Soc Am 74:502–517.
  • Musiek FE, Lamb L. (1994). Central auditory assessment: an overview. In: Katz J, ed. Handbook of Clinical Audiology. 4th Ed. Baltimore: Williams & Wilkins 197–211.
  • Neville H, Schmidt A, Kutas M. (1983). Altered visual-evoked potentials in congenitally deaf adults. Brain Res 266:127–132.
  • Neville HJ, Bavelier D, Corina D, Rauschecker J, Kami A, Lalwani A, Braun A, Clark V, Jezzard P, Turner R. (1998). Cerebral organization for language in deaf and hearing subjects: biological constraints and effects of experience. Proc Natl Acad Sci U S A 95:922–929.
  • Olds J, Disterhoft J, Segal M, Kornblith C, Hirsh R. (1972). Learning centers of rat brain mapped by measuring latencies of conditioned unit responses. J Neurophysiol 35:202–219.
  • Palmer AR. (1990). The representation of the spectra and fundamental frequencies of steady-state single- and double-vowel sounds in the temporal discharge patterns of guinea pig cochlear-nerve fibers. J Acoust SocAm 88:1412–1426.
  • Palmer CV, Nelson CT, Lindley GA. (1998). The functionally and physiologically plastic adult auditory system. J Acoust Soc Am 103:1705–1721.
  • Peretz I, Kolinsky R, Tramo M, Labrecque R, Hublet C, Demeurisse G, Belleville S. (1994). Functional dissociations following bilateral lesions of auditory cortex. Brain 117:1283–1301.
  • Phillips DP. (1993). Representation of acoustic events in the primary auditory cortex. J Exp Psychol Hum Percept Perform 19:203–216.
  • Phillips DP. (1995). Central auditory processing: a view from auditory neuroscience. Am J Otol 16:338–352.
  • Phillips DP, Farmer ME. (1990). Acquired word deafness and the temporal grain of sound representation in the primary auditory cortex. Behav Brain Res 40:85–94.
  • Phillips DP, Hall SE. (1990). Response timing constraints on the cortical representation of sound time structure. J Acoust Soc Am 88:1403–1411.
  • Pickett JM. (1980). The Sounds of Speech Communication. Baltimore: University Park Press.
  • Pickles J. (1988). An Introduction to the Physiology of Hearing. New York: Academic Press.
  • Pisoni DB, Aslin RN, Perey AJ, Hennessy BL. (1982). Some effects of laboratory training on identification and discrimination of voicing contrasts in stop consonants. J Exp Psychol Hum Percept Perform 8:297–314.
  • Rajan R, Irvine DR. (1998). Neuronal responses across cortical field Al in plasticity induced by peripheral auditory organ damage. Audiol Neuro-Otol 3:123–144.
  • Rauschecker JP. (1997). Processing of complex sounds in the auditory cortex of cat, monkey, and man. Acta Otolaryngol Suppl 532:34–38.
  • Rauschecker JP. (1998). Parallel processing in the auditory cortex of primates. Audiol Neuro-Otol 3:86–103.
  • Rauschecker JP, Tian B, Hauser M. (1995). Processing of complex sounds in the macaque nonprimary auditory cortex. Science 268:111–114.
  • Recanzone G, Schreiner C, Merzenich M. (1993). Plasticity in the frequency representation of primary auditory cortex following discrimination training in adult owl monkeys. J Neurosci 13:87–104.
  • Rhode WS. (1995). Interspike intervals as a correlate of periodicity pitch in cat cochlear nucleus. J Acoust Soc Am 97:2414–2429.
  • Rhode WS, Greenberg S. (1994). Encoding of amplitude modulation in the cochlear nucleus of the cat. J Neurophysiol 71:1797–1825.
  • Robertson D, Irvine DR. (1989). Plasticity of frequency organization in auditory cortex of guinea pigs with partial unilateral deafness. J Comp Neurol 282:456–471.
  • Rosen S. (1992). Temporal information in speech: acoustic, auditory and linguistic aspects. Phil Trans R Soc Lond 336:367–373.
  • Rouiller E, Ribaupierre Y de, Toros-Morel A, Ribaupierre F de. (1981). Neural coding of repetitive clicks in the medial geniculate body of cat. Hear Res 5:81–100.
  • Sachs MB, Voigt HF, Young ED. (1983). Auditory nerve representation of vowels in background noise. J Neurophysiol 50:27–45.
  • Salvi RJ, Henderson D, Fiorino F, Colletti V, eds. (1996). Auditory System Plasticity and Regeneration. New York: Thieme Medical.
  • Samra K, Riklan M, Levita E, Zimmerman J, Waltz JM, Bergmann L, Cooper IS. (1969). Language and speech correlates of anatomically verified lesions in thalamic surgery for parkinsonism. J Speech Hear Res 12:510–540.
  • Schreiner CE, Langner G. (1988). Coding of temporal patterns in the central auditory nervous system. In: Edelman GM, Gail WE, Cowan WM, eds. Auditory Function. New York: John Wiley & Sons, 337–361.
  • Shamma SA, Versnel H. (1995). Ripple analysis in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary spectral profiles. Auditory Neurosci 1:255–270.
  • Sharma A, Kraus N, Carrell T, Thompson C. (1994). Physiologic bases of pitch and place of articulation perception: a case study. J Acoust Soc Am 95:3011.
  • Sinex DG. (1993). Auditory nerve fiber representation of cues to voicing in syllable-final stop consonants. J Acoust Soc Am 94:1351–1362.
  • Sinex DG, Geisler CD. (1983). Responses of auditory-nerve fibers to consonant-vowel syllables. J Acoust Soc Am 73:602–615.
  • Sinex DG, Geisler CD. (1984). Comparison of the responses of auditory nerve fibers to consonant-vowel syllables with predictions from linear models. J Acoust Soc Am 76:116–121.
  • Sinex DG, McDonald LP. (1988). Average discharge rate representation of voice onset time in the chinchilla auditory nerve. J Acoust Soc Am 83:1817–1827.
  • Sinex DG, Narayan SS. (1994). Auditory-nerve fiber representation of temporal cues to voicing in world-medial stop consonants. J Acoust Soc Am 95:897–903.
  • Steinschneider M, Arezzo JC, Schroeder C, Vaughan HG Jr. (1994). Speech evoked activity in primary auditory cortex: effects of voice onset time. Electroencephalogr Clin Neurophysiol 92:30–43.
  • Stevens K. (1980). Acoustic correlates of some phonetic categories. J Acoust Soc Am 68:836–842.
  • Takahashi N, Kawamura M, Shinotou H, Hirayama K, Kaga K, Shindo M. (1992). Pure word deafness due to left hemisphere damage. Cortex 28:295–303.
  • Tallal P, Miller SL, Bedi G, Byma G, Wang X, Nagarajan SS, Schreiner C, Jenkins WM, Merzenich MM. (1996). Language comprehension in language-learning impaired children improved with acoustically modified speech. Science 271:81–84.
  • Tremblay K, Kraus N, Carrell T, McGee T. (1997). Central auditory system plasticity: generalization to novel stimuli following listing training. J Acoust Soc Am 102:3762–3773.
  • Tremblay K, Kraus N, McGee TJ. (1998). The time course of auditory perceptual learning: neurophysiological changes during speech-sound training. Neuroreport 9:3557–3560.
  • Turner CW, Holte L. (1987). Discrimination of spectral-peak amplitude by normal and hearing-impaired subjects. JAcoust SocAm 81:445–451.
  • Turner CW, Robb MP. (1987). Audibility and recognition of stop consonants in normal and hearing-impaired subjects. JAcoust SocAm 81:1566–1573.
  • Viemeister NF, Plack CJ. (1993). Time analysis. In: Yost WA, Popper AN, Fay RR, eds. Human Psychophysics. New York: Springer-Verlag, 116–154.
  • Voigt HF, Sachs MB, Young ED. (1982). Representation of whispered vowels in discharge patterns of auditory-nerve fibers. Hear Res 8:49–58.
  • Wang X, Merzenich MM, Beitel R, Schreiner CE. (1995). Representation of a species-specific vocalization in the primary auditory cortex of the common marmoset: temporal and spectral characteristics. J Neurophysiol 74:2685–2706.
  • Weinberger MN, Hopkins W, Diamond DM. (1984). Physiological plasticity of single neurons in auditory cortex of the cat during acquisition of the pupillary conditioned response: I. Primary field (AI). Behav Neurosci 98:171–188.
  • Young ED, Sachs MB. (1979). Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. J Acoust Soc Am 66:1381–1403.
  • Zeng FG, Turner CW. (1990). Recognition of voiceless fricatives by normal and hearing-impaired subjects. J Speech Hear Res 33:440–449.