Speech Recognition as a Practice Tool for Dysarthria

Susan Koch Fager

doi:10.1055/s-0037-1602841

RSS-Feed abonnieren

Bitte kopieren Sie die angezeigte URL und fügen sie dann in Ihren RSS-Reader ein.

https://www.thieme-connect.de/rss/thieme/de/10.1055-s-00000076.xml

PDF herunterladen

Semin Speech Lang 2017; 38(03): 220-228
DOI: 10.1055/s-0037-1602841

Review Article

Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.

Speech Recognition as a Practice Tool for Dysarthria

Autor*innen

Susan Koch Fager

¹Institute for Rehabilitation Science and Engineering, Madonna Rehabilitation Hospitals, Lincoln, Nebraska.

Weitere Informationen

Publikationsverlauf

Publikationsdatum:
15. Juni 2017 (online)

Auch verfügbar auf

Lizenzen und Reprints

Abstract

Recovery of speech in dysarthria requires an extensive amount of time and practice. Speech recognition (SR) technology may support long-term practice and speech recovery efforts for individuals with dysarthria. However, SR technology development has been focused on typical (neurologically intact) speakers to support writing. This article describes the history and development of SR technology, how it has been used by individuals with dysarthria, and includes a case study illustration of the use of a novel SR technology as a speech practice tool. Case study participants included two individuals with differing onsets and dysarthria due to traumatic brain injury. Results indicated that both were able to make acoustic/perceptual changes during speech practice sessions, and one participant demonstrated generalization of changes to habitual speech. Limitations and future directions of current SR technology as a speech practice tool are discussed.

Keywords

Dysarthria - speech recognition - speech practice

References
1 Yorkston K, Beukelman D, Strand E, Hakel M. Management of Motor Speech Disorders in Children and Adults. Austin, TX: Pro-ed; 2010

Suche in Google Scholar
Download RIS citation
2 Schmidt RA, Lee TD. Motor Learning and Performance. From Principles to Application. 5th ed. Champaign, IL: Human Kinetics; 2013

Suche in Google Scholar
Download RIS citation
3 Maas E, Robin DA, Austermann Hula SN. , et al. Principles of motor learning in treatment of motor speech disorders. Am J Speech Lang Pathol 2008; 17 (03) 277-298

Crossref PubMed Suche in Google Scholar
Download RIS citation
4 Jordan FM. Whatever happened after the “return from silence”?. Brain Inj 1994; 8 (03) 277-283

Crossref PubMed Suche in Google Scholar
Download RIS citation
5 Workinger MS, Netsell R. Restoration of intelligible speech 13 years post-head injury. Brain Inj 1992; 6 (02) 183-187

Crossref PubMed Suche in Google Scholar
Download RIS citation
6 Rosen K, Yampolsky S. Automatic speech recognition and a review of its functioning with dysarthric speech. J Augment Altern Commun 2000; 16: 48-60

Suche in Google Scholar
Download RIS citation
7 Venkatagiri HS. Speech recognition technology applications in communication disorders. Am J Speech Lang Pathol 2002; 11: 323-332

Crossref Suche in Google Scholar
Download RIS citation
8 Fried-Oken M. Voice recognition device as a computer interface for motor and speech impaired people. Arch Phys Med Rehabil 1985; 66 (10) 678-681

PubMed Suche in Google Scholar
Download RIS citation
9 Coleman C, Meyers L. Computer recognition of the speech of adults with cerebral palsy and dysarthria. Augment Altern Commun 1991; 7: 34-42

Crossref Suche in Google Scholar
Download RIS citation
10 Brown C, Cavalier AR. Voice recognition technology and persons with severe mental retardation and severe physical impairment: learning, response differentiation, and affect. J Spec Educ Technol 1992; 11 (04) 196-206

Crossref Suche in Google Scholar
Download RIS citation
11 Dabbagh HH, Damper RI. Text composition by voice: design issues and implementations. Augment Altern Commun 1985; 1 (02) 84-93

Crossref Suche in Google Scholar
Download RIS citation
12 Noyes J, Frankish C. Speech recognition technology for individuals with disabilities. Augment Altern Commun 1992; 8 (04) 297-303

Crossref Suche in Google Scholar
Download RIS citation
13 Ferrier LJ, Jarrell N, Carpenter T, Shane HC. A case study of a dysarthric speaker using the Dragon Dictate voice recognition system. J Comput Users Speech Hear 1992; 8: 33-53

Suche in Google Scholar
Download RIS citation
14 Blaney B, Wilson J. Acoustic variability in dysarthria and computer speech recognition. Clin Linguist Phon 2000; 14 (04) 307-327

Crossref Suche in Google Scholar
Download RIS citation
15 Ferrier L, Shane H, Ballard H, Carpenter T, Benoit A. Dysarthric speakers' intelligibility and speech characteristics in relation to computer speech recognition. Augment Altern Commun 1995; 11: 165-174

Crossref Suche in Google Scholar
Download RIS citation
16 Doyle PC, Leeper HA, Kotler AL. , et al. Dysarthric speech: a comparison of computerized speech recognition and listener intelligibility. J Rehabil Res Dev 1997; 34 (03) 309-316

PubMed Suche in Google Scholar
Download RIS citation
17 Thomas-Stonell CN, Kotler A, Leeper H, Doyle P. Computerized speech recognition: influence of intelligibility and perceptual consistency on recognition accuracy. Augment Altern Commun 1998; 14: 51-56

Crossref Suche in Google Scholar
Download RIS citation
18 Hux K, Rankin-Erickson J, Manasse N, Lauritzen E. Accuracy of three speech recognition systems: case study of dysarthric speech. Augment Altern Commun 2000; 16: 186-196

Crossref Suche in Google Scholar
Download RIS citation
19 Kotler A, Thomas-Stonell N. Effects of speech training on the accuracy of speech recognition for an individual with a speech impairment. Augment Altern Commun 1997; 13: 71-80

Crossref Suche in Google Scholar
Download RIS citation
20 Manasse NJ, Hux K, Rankin-Erickson JL. Speech recognition training for enhancing written language generation by a traumatic brain injury survivor. Brain Inj 2000; 14 (11) 1015-1034

Crossref PubMed Suche in Google Scholar
Download RIS citation
21 Raghavendra P, Rosengren E, Hunnicutt S. An investigation of different degrees of dysarthric speech as input to speaker-adaptive and speaker-dependent recognition systems. Augment Altern Commun 2001; 17: 265-275

Crossref Suche in Google Scholar
Download RIS citation
22 Wan V, Carmichael J. Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data. INTERSPEECH 9th European Conference on Speech Communication and Technology, September 4–8, 2005 in Lisboa, Portugal

Download RIS citation
23 Polur PD, Miller GE. Effect of high-frequency spectral components in computer recognition of dysarthric speech based on a Mel-cepstral stochastic model. J Rehabil Res Dev 2005; 42 (03) 363-371

PubMed Suche in Google Scholar
Download RIS citation
24 Omar S, Morales C, Cox S. Modeling errors in automatic speech recognition for dysarthric speakers. EURASIP J Adv Signal Process 2009; 20 (09) 1-14

Suche in Google Scholar
Download RIS citation
25 Hawley M. Speech recognition as an input to electronic assistive technology. Br J Occup Ther 2002; 65: 15-20

Crossref Suche in Google Scholar
Download RIS citation
26 Hawley MS, Enderby P, Green P. , et al. A speech-controlled environmental control system for people with severe dysarthria. Med Eng Phys 2007; 29 (05) 586-593

Crossref PubMed Suche in Google Scholar
Download RIS citation
27 Judge S, Robertson Z, Hawley M, Enderby P. Speech-driven environmental control systems—a qualitative analysis of users' perceptions. Disabil Rehabil Assist Technol 2009; 4 (03) 151-157

Crossref PubMed Suche in Google Scholar
Download RIS citation
28 Hird K, Hennessey NW. Facilitating use of speech recognition software for people with disabilities: a comparison of three treatments. Clin Linguist Phon 2007; 21 (03) 211-226

Crossref PubMed Suche in Google Scholar
Download RIS citation
29 Kitzing P, Maier A, Ahlander VL. Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders. Logoped Phoniatr Vocol 2009; 34 (02) 91-96

Crossref PubMed Suche in Google Scholar
Download RIS citation
30 Maier A, Haderlein T, Stelzle F. , et al. Automatic speech recognition systems for the evaluation of voice and speech disorders in head and neck cancer. EURASIP J Audio Speech Music Process 2009; 20 (10) 926-951

Suche in Google Scholar
Download RIS citation
31 McHenry M, LaConte SM. Computer speech recognition as an objective measure of intelligibility. J Med Speech-Lang Pathol 2010; 18 (04) 99-103

Suche in Google Scholar
Download RIS citation
32 Abad A, Pompili A, Costa A. , et al. Automatic word naming recognition for an on-line aphasia treatment system. Comput Speech Lang 2013; 27 (06) 1235-1248

Crossref Suche in Google Scholar
Download RIS citation
33 Palmer R, Enderby P, Hawley M. Addressing the needs of speakers with longstanding dysarthria: computerized and traditional therapy compared. Int J Lang Commun Disord 2007; 42 (01) (Suppl. 01) 61-79

Crossref PubMed Suche in Google Scholar
Download RIS citation
34 Watson CS, Reed DJ, Kewley-Port D, Maki D. The Indiana Speech Training Aid (ISTRA). I: comparisons between human and computer-based evaluation of speech quality. J Speech Hear Res 1989; 32 (02) 245-251

Crossref PubMed Suche in Google Scholar
Download RIS citation
35 Bunnell H, Yarrington D, Polikoff JB. STAR: Articulation Training for Young Children. INTERSPEECH 6th International Conference on Spoken Language Processing, Oct. 16–20, 2000, Beijjing, China

Download RIS citation
36 Ryalls J. Comparison of two computerised speech training systems: Speech Viewer and ISTRA. J Speech Lang Pathol Audiol 1989; 13 (03) 53-56

Suche in Google Scholar
Download RIS citation
37 Hosom J, Jakobs T, Baker A, Fager S. Automatic speech recognition for assistive writing in speech supplemented word prediction. Interspeech 2010; xx: 2674-2677

Suche in Google Scholar
Download RIS citation
38 Fager SK, Beukelman DR, Jakobs T, Hosom JP. Evaluation of a speech recognition prototype for speakers with moderate and severe dysarthria: a preliminary report. Augment Altern Commun 2010; 26 (04) 267-277

Crossref PubMed Suche in Google Scholar
Download RIS citation
39 Yorkston K, Beukelman D, Hakel M, Dorsey M. Sentence Intelligibility Test [Computer software]. Lincoln, NE: Madonna Rehabilitation Hospitals; 2007

Suche in Google Scholar
Download RIS citation
40 Bachu RG, Kopparthi S, Adapa B, Barkana BD. Separation of voiced and unvoiced using zero crossing rate and energy of the speech signal. Paper presented at: American Society for Engineering Education (ASEE) Zone Conference Proceedings; June 24–25, 2008, Pittsburgh, PA

Download RIS citation
41 Shete DS, Patil SB, Patil SB. Zero crossing rate and energy of the speech signal of Devanagari script. IOSR-JVSP 2014; 4 (01) 1-5

Suche in Google Scholar
Download RIS citation

Ähnliche Zeitschriften

RSS-Feed abonnieren

Teilen / Bookmarken

Speech Recognition as a Practice Tool for Dysarthria

Autor*innen

Publikationsverlauf

Abstract

Keywords

References