Key Words
cochlear implants - functional magnetic resonance imaging - hearing - infants - language - observation - parents
INTRODUCTION
Benefits of cochlear implantation during the critical period are evident in terms of speech recognition and language performance of children fitted with a cochlear implant (CI) (e.g., [McConkey Robbins et al, 2004]; [Nikolopoulos et al, 2004]; [Svirsky et al, 2004]; [Sharma and Dorman, 2006]; [Geers and Nicholas, 2013]; [Tobey et al, 2013]; [Castellanos et al, 2014]). Although studies show a general trend of improved auditory and speech-language performance after implantation, there is considerable variability in individual outcomes ([Zeng, 2004]; [Zaidman-Zait and Most, 2005]; [Kirk et al, 2006]; [Geers et al, 2008]; [Pisoni et al, 2008]; [Lazard et al, 2010]; [Niparko et al, 2010]; [Geers et al, 2011]). “Despite significant research effort, there is no reliable and accurate presurgical predictor of postsurgical performance in cochlear implants” ([Zeng, 2004], p. T5). Although various factors such as age at onset of hearing loss, preimplant auditory status, age at implantation and stimulation, surgical and postsurgical procedures, postimplant intervention, and family involvement contribute to this variability ([McKay, 2005]; [Tomblin et al, 2005]; [Wake et al, 2005]; [Finley et al, 2008]; [Geers et al, 2008]; [Gilley et al, 2008]; [Green et al, 2012]; [Tobey et al, 2013]), a single factor that will be able to explain a significant portion of the variability has not yet been identified conclusively ([Zeng, 2004]). Understanding the neural mechanisms that support hearing, speech and language functions in successful CI users could facilitate accurate individual prognosis ([Lazard et al, 2010]). Preimplant neuroimaging techniques may have the potential to improve prediction accuracy of postimplant performance outcomes ([Zeng, 2004]; [Petersen et al, 2013]; [Smalt et al, 2013]). Activation of certain regions in the brain has been found to correlate with behavioral measures of auditory and speech-language performance ([Patel et al, 2007]; [Eisner et al, 2010]; [Morillon et al, 2010]; [Horowitz-Kraus et al, 2013]; [Nagels et al, 2013]). For instance, [Eisner et al (2010)] used functional magnetic resonance imaging (fMRI) to study brain regions responsible for analyzing CI-simulated speech stimuli in normal hearing individuals. The fMRI technique utilizes differences in magnetic properties of oxygenated blood, deoxygenated blood, and tissue to acquire images of “the brain in action.” Participants in the [Eisner et al (2010)] study listened to noise-vocoded speech stimuli spectrally shifted either upward (from low to high frequencies) or downward (from high to low frequencies) during fMRI. Participants displayed greater activation in the left superior temporal sulcus and inferior frontal gyrus in response to the upward, more learnable condition; thus, implicating these regions in speech perception and comprehension of CI-simulated speech. Other auditory stimuli have also been used in fMRI studies of brain activation. [Tan et al (2013)] used a combination of structural and functional magnetic resonance imaging (MRI) data to delineate regions that effectively classified between the brains of infants with normal hearing and those with severe to profound hearing loss. The functional imaging portion of their experiment used a passive story listening task. They highlighted several brain regions that maximally differentiated between the two groups including regions in the parietal, frontal, and limbic lobes. Other fMRI experiments have also used the story listening task because of the feasibility and relevance of its use with young children ([Ahmad et al, 2003]; [Schmithorst et al, 2006], [2007]; [Vannest et al, 2009]; [Szaflarski et al, 2012]; [Horowitz-Kraus et al, 2013]; [Deshpande et al, 2016]).
A recent study by our group ([Deshpande et al, 2016]) identified neuroimaging biomarkers of postimplant oral language performance in young CI recipients. Eleven children with severe to profound hearing loss passively listened to narratives while inside the scanner. They obtained CI(s) after the fMRI task. Participants’ speech-language performance was measured 2 yr postimplant using standardized clinician-administered tests. A correlation analysis (controlled for preimplant hearing levels and age at implantation) performed between preimplant fMRI activation and postimplant oral language performance as measured by the Clinical Evaluation of Language Fundamentals-Preschool, Second Edition (CELF-P2; [Wiig et al, 2004]) revealed brain regions including the angular gyrus, medial frontal gyrus, and the cingulate gyrus. The results of this study indicate that (a) greater activation in certain brain regions may suggest better postimplant outcomes and (b) fMRI can possibly be used as a predictor of postimplant oral language outcomes.
Measures in the [Deshpande et al (2016)] study were clinician-administered. [Yoshinaga-Itano et al (1998)] have succinctly summarized the need and appropriateness of using parental reports in studying child language. They stated three distinct advantages of using parental observations/reports: In a home-based setting—(a) parents have a greater understanding of their child’s language abilities, (b) familiarity with people and surroundings elicits better responses from children, and (c) children’s speech/language performance is not unfairly affected by fatigue. In an extensive review of studies on pediatric cochlear implantation, [Thoutenhoofd et al (2005)] emphasized the need to observe children “in their day-to-day lives, after implantation, rather than in clinical tests” (p. 243). Parents have the unique opportunity to observe their children’s auditory and language performance every day. This rich dataset must be tapped into to better understand children’s auditory and speech-language status ([Hyde et al, 2011]). Other researchers have also suggested the use of parental observations and reports to supplement clinician-administered tests (e.g., [Knoors et al, 2003]; [Lin et al, 2008]). [Ching et al (2010)] found significant positive correlations between language performance as measured by parental reports and that measured by clinician-administered tests suggesting the validity of the use of parent reports/observations in children with hearing loss. Parental observations/reports like the Auditory Skills Checklist (ASC; [Meinzen-Derr et al, 2007]) and the SKI-HI Language Development Scale (SKI-HI LDS, 2nd edition; [Watkins and Tonelson, 2004]) have been used to study auditory and language performance of children with normal hearing, hearing loss, and developmental delays (e.g., [Meinzen-Derr et al, 2007]; [Wiley et al, 2008]; [Nowakowski et al, 2009]; [Ruggirello and Mayer, 2010]; [Meinzen-Derr et al, 2011]).
Because information obtained from parental observations of children’s auditory and language performance is vital, neuroimaging biomarkers predicting such information may prove to be useful in guiding postimplant rehabilitation plans. The aim of this study was to identify preimplant neuroimaging biomarkers that predict children’s postimplant auditory and language outcomes as measured by parental observations. ASC and SKI-HI LDS were specifically chosen for this investigation as they are well-suited to study auditory hierarchy and language performance in young CI recipients as witnessed by their primary caregiver.
MATERIALS AND METHODS
Participants
All study procedures were carried out following approval from Cincinnati Children’s Hospital Medical Center (CCHMC) Institutional Review Board. Twelve children with bilateral severe to profound hearing loss were recruited for the present study (from a larger dataset recruited for an NIH study—R01-DC07186). Preimplant minimum response levels (MRLs) were calculated for all participants as a four-frequency average (500, 1000, 2000, and 4000 Hz) using sound field audiometry. The mean preimplant MRL of participants was 104.42 dB HL. All participants received fMRI scanning before 24 mo of age, under sedation with propofol. After the fMRI procedure, all participants received CI(s) before 36 mo of age. Two years postimplant, ASC and SKI-HI scores were obtained at a follow-up visit to the hospital. ASC scores were available on all 12 children whereas SKI-HI LDS scores were available on 11 of the 12 participants. Children with metallic or electronic implants; gestational age < 36 weeks; birth weight less than the 25th percentile; head circumference less than the 5th percentile or greater than the 95th percentile; neurological disorders and/or anatomical malformations; or cognitive impairments were excluded. Participants underwent cochlear implantation at a mean age of 19.4 mo (range: 12–36 mo). All children received Cochlear’s™ Nucleus® Freedom implants and used the advanced combination encoder (ACE) strategy for their everyday implant maps. The demographic data of participants is presented in [Table 1]. The group consisted of an equal number of male and female participants. They received the implant either in the right ear (n = 3), or the left ear (n = 1), or both ears (n = 8). Additional information about the side of first implant for children who received bilateral sequential implants is noted under the “ear of implant” column. “R-L” denotes acquisition of right implant before the left implant and vice versa.
Table 1
Demographic Data of CI Participants
Participant
|
Gender
|
Preimplant MRLs (dB HL)
|
Ear of Implant
|
Age at First Implant (mo)
|
ASC Score
|
SKI-HI LDS RLA (mo)
|
SKI-HI LDS ELA (mo)
|
P1
|
Female
|
117
|
Bilateral sequential
|
20
|
65
|
38
|
34
|
R-L
|
P2
|
Male
|
73
|
Bilateral sequential
|
20
|
60
|
38
|
38
|
R-L
|
P3
|
Female
|
105
|
Right
|
20
|
59
|
38
|
38
|
P4
|
Male
|
115
|
Bilateral sequential
|
13
|
59
|
34
|
30
|
R-L
|
P5
|
Male
|
106
|
Bilateral simultaneous
|
12
|
58
|
34
|
30
|
P6
|
Female
|
106
|
Bilateral sequential
|
18
|
28
|
23
|
21
|
L-R
|
P7
|
Male
|
90
|
Right
|
25
|
21
|
—
|
—
|
P8
|
Female
|
113
|
Bilateral simultaneous
|
13
|
60
|
30
|
30
|
P9
|
Female
|
95
|
Bilateral simultaneous
|
36
|
64
|
42
|
38
|
P10
|
Male
|
98
|
Bilateral sequential
|
20
|
65
|
46
|
46
|
L-R
|
P11
|
Male
|
117
|
Left
|
14
|
26
|
5
|
3
|
P12
|
Female
|
118
|
Right
|
22
|
37
|
26
|
21
|
Note: ELA = Expressive Language Age; RLA = Receptive Language Age.
PreImaging Procedures
Per CCHMC protocol, after initial identification of hearing loss, all children received a re-evaluation of hearing status via auditory brainstem response and otoacoustic emission testing. Next, possible CI candidates were referred for a clinical MRI of the auditory nerve to ensure structural integrity and to rule out other abnormality affecting hearing. At this point, parents of these children were informed about the current fMRI study by their attending otolaryngologist. Interested parents were explained the entire protocol in detail and their questions were answered by one of the investigators. Although the clinical MRI procedure requires sedation (propofol: 200–300 μg/kg/min with an 8% sevofluorane induction), children cannot be sedated for research purposes at our institution. However, the Institutional Review Board does permit the addition of an fMRI study at the end of the clinical MRI procedure provided that not >15 min of additional sedation is required. The fMRI procedures, including additional sedation, were described to parents as part of the informed consent process. If parents agreed to participate in the study, written informed consent was obtained before the clinical MRI. The fMRI experiment was conducted at the end of the clinical MRI after the magnetic resonance images were reviewed by the neuroradiologists and determined to have no abnormalities in auditory or language areas of the brain. Note that the use of propofol for sedation during fMRI with auditory stimulation has previously been reported by our group ([Patel et al, 2007]; [DiFrancesco et al, 2013]; [Deshpande et al, 2016]).
Imaging Procedures
fMRI was performed on a Siemens 3T Trio scanner. All participants were between 9 and 23 mo during fMRI scanning. Blood oxygen level dependent (BOLD) hemodynamic response was obtained in response to three auditory conditions: speech, noise, and silence. The speech condition consisted of stories read in a female voice (36 sentences; 18 segments of two sentences each) whereas the noise condition consisted of narrowband noise presented randomly at octave frequencies from 250 to 4000 Hz. Three comparisons (or contrasts) were possible with the presentation of the three-condition paradigm: speech versus silence, noise versus silence, and speech versus noise. During fMRI, stimuli were presented binaurally via high fidelity magnetic resonance-compatible headphones (Avotec Inc.). The headphones were a part of the Avotec Silent Scan Audio System (SS3100). The intensity of stimulus presentation was set at 10–15 dB SL with respect to MRLs of each participant ([Tan et al, 2013]; [Deshpande et al, 2016]). The duration of each “stimulus” condition (when speech, noise, or silence was presented) was 5 sec while the “no stimulus” condition (when the scanner was active) lasted for 6 sec. Each stimulus condition was preceded by 1 sec of silence for acoustic demarcation ([Tan et al, 2013]; [Deshpande et al, 2016]).
During MRI, rapid switching of magnetic gradients within the scanner creates a loud noise up to 110 dBA ([Hall, 2006]; [Gaab et al, 2007]). We used a silent acquisition method called Hemodynamics Unrelated to Sounds of Hardware (HUSH; [Schmithorst and Holland, 2004]) which utilizes the delay in hemodynamic response ([Frahm et al, 1996]; [Krüger et al, 1996]; [Buxton et al, 1998]; [Logothetis et al, 1999]) to facilitate evaluation of neuronal activity in auditory research by minimizing the confounding effects of scanner noise. [Figure 1] depicts representation of six sequential time points (1 through 6) during fMRI acquisition using the HUSH technique. At time point 1, noise was presented for 5 sec; hemodynamic response was initiated in the brain; image acquisition stayed off; the scanner was silent and hence did not interfere with stimulus presentation. No stimulus was presented during the next 6 sec (time point 2); the hemodynamic response reached its peak approximately 5 sec after initiation of the auditory stimulus; and the scanner captured this activity with three sequential brain image volumes (2 sec each). In [Figure 1], (1 + 5) seconds during time points 1, 3, and 5 represents an additional 1 sec of silence for acoustic demarcation and to allow for hemodynamic response to return to equilibrium. The total fMRI scan time for each participant was limited to <11 min.
Figure 1 Block diagram of HUSH story listening paradigm showing the duration and order of presentation of auditory stimuli with the corresponding hemodynamic changes and image acquisition parameters at six different time points.
Cochlear Implantation and Postimplant Procedures
Participants received CI(s) after the fMRI scanning (see [Table 1] for age at implantation). Auditory and language tests—including SKI-HI LDS and ASC—were administered 2 yr after implantation. All participants enrolled in this study used verbal language as the primary mode of communication, and test scores were obtained based on parental participation/observation.
The ASC is a criterion-referenced tool administered to assess the functional auditory skills of young children with hearing loss. It consists of 35 questions on a continuum of detection–discrimination–identification–comprehension. Each domain has different number of questions: detection (nine questions), discrimination (seven questions), identification (seven questions), and comprehension (12 questions). Examples of questions in each domain are as follows: “Does your child localize correct sound source?” (detection), “Does your child discriminate minimal-pair words?” (discrimination), “Does your child identify his/her name when called?” (identification), and “Does your child follow two-step directions?” (comprehension). Both parent responses and clinician observations are combined to arrive at the answers to the above questions. Finally, the tester assigns a rating based on the answers as either 0 (does not have the skill); 1 (emerging skill development); or 2 (consistently demonstrates the skill). Thus, the score for a given child can range from 0 to 70 with each domain contributing to the total maximum possible score as follows: detection (18), discrimination (14), identification (14), and comprehension (24). It takes approximately 10 min to administer the complete test. A high correlation (r = 0.90, p < 0.0001) between ASC and the Infant Toddler-Meaningful Auditory Integration Scale ([Zimmerman-Phillips et al, 2001]) validated the ASC as a tool for early evaluation of functional auditory skills. A high internal consistency between questions and between domains was also observed (Cronbach’s alpha >0.91).
The SKI-HI LDS was primarily developed for caregivers as a tool to evaluate and monitor their children’s language skills. It assesses receptive and expressive language skills of children with hearing loss from birth to 5 yr of age. It gives an age-related norm based on the LDS score. Language performance is given credit irrespective of the modality used (e.g., sign language). The test consists of 20 units which delineate language skills at 2 mo intervals from 0 to 2 yr of age; 4 mo intervals from 2 to 4 yr of age; and 6 mo intervals from 4 to 5 yr of age. Each unit is divided into receptive and expressive skills. Intertester and intratester reliability measures have been performed during test development of the LDS. Internal consistency measures ensure unit-level and item-level reliability. Concurrent and construct validity were rated high based on comparisons with Receptive-Expressive Emergent Language Scale ([Bzoch and League, 1994]) and coefficient of reproducibility, respectively. Caregivers administer this test every 6 mo while observing their child for 1 week. Caregivers/parents are expected to get familiarized with the test items—a parent advisor can help them reach this goal—before marking an observed behavior. It is advised that the parents mark the language behavior as a plus (+) as soon as it is observed. If the child obtains pluses for >50% of the items on a unit, the child receives credit for the entire unit, and parents can proceed to the next unit. Caregivers are blind to the age range that each unit represents, and thus eliminating observer’s bias and ensuring validity of observations. Guidelines on identifying units relevant to the child’s language ability are provided to the caregivers.
For the present study, the total score on the ASC (maximum possible score: 70) and the receptive and expressive language age on the SKI-HI LDS (maximum receptive and expressive language age: 54–60 mo) were considered as outcomes of interest. The age range obtained for SKI-HI LDS receptive and expressive language skills was converted into a mean age for data analysis.
Data Analysis
Raw functional data from each participant were obtained along with time-series information to ensure validity of observations ([Szaflarski et al, 2006]). A pyramid iterative coregistration algorithm ([Thévenaz et al, 1998]) was used to realign fMRI data because of subject motion. Following motion correction, fMRI time series data were transformed to the Anterior-Posterior Commissure alignment plane. Preprocessing also included correction for global intensity variation. The general linear model implemented in an in-house software developed at Cincinnati Children’s Hospital ([Schmithorst and Dardzinski, 2000]) was used to create unthresholded z-maps for each contrast ([Worsley et al, 2002]). Because there were three auditory conditions, three contrasts were obtained: speech versus silence, noise versus silence, and speech versus noise. No significant activation was detected in regions of interest related to hearing or language for the contrast of speech versus noise, so this contrast was not considered for further analysis. The subjects’ functional and anatomical images were normalized to the CCHMC Imaging Research Center infant template (available as a free download at https://irc.cchmc.org/software/infant.php) in the Statistical Parametric Mapping (Wellcome Trust Centre for Neuroimaging, London, UK) software. It has been shown that pediatric or adult templates may not be suitable for normalization of infant functional data ([Altaye et al, 2008]). Hence the infant template, which includes brain image data from 76 infants in the age range of 9–15 mo, was used for the existing data. The Pick Atlas ([Maldjian et al, 2003]), developed at the fMRI Laboratory of the Wake Forest University School of Medicine (available as a free download at http://fmri.wfubmc.edu/software/PickAtlas), was also normalized to the infant template. Finally, data were transformed into the Montreal Neurological Institute (MNI) space ([Friston et al, 1995]). Both positive and negative activated voxels from the first-level general linear model analysis were included in subsequent analyses ([Schmithorst and Holland, 2004]; [Patel et al, 2007]; [Deshpande et al, 2016]). In the second-level analysis, participants’ functional contrast maps were correlated with postimplant scores on parental observation reports in MATLAB (MathWorks, Natick, MA). Initially, a linear regression was performed with the contrast maps and test scores as formulated in [Equation (1)] to control for effects of MRLs and age at implantation ([Wake et al, 2005]; [Gilley et al, 2008]).
where y is the original language test score, β1, β2, and b are the parameters in the regression model, ŷ is the score on either parental observation reports (ASC and SKI-HI LDS) after regressing out MRLs and age at implantation. ŷ was used for subsequent correlation analysis instead of the original language test scores as it reflected the corrected test score after controlling for the two variables (MRLs and age at implantation). A nonparametric correlation analysis was performed and the Spearman’s correlation coefficient was calculated in addition to the p-value for each voxel. Regression maps were corrected for multiple correlations by controlling for the family-wise error rate using the AlphaSim program ([Ward, 2000]). Default parametric values were used during the AlphaSim p-value correction method (as described in [Deshpande et al, 2016]). The regression maps were overlaid on the infant template ([Altaye et al, 2008]) and visualized in MRIcro (http://www.mricro.com) ([Rorden, 2005]). For each significantly correlated cluster, Brodmann areas and co-ordinates of the center of the cluster have been reported in the MNI framework.
RESULTS
[Table 2] summarizes means and standard deviations (SDs) of scores and language ages obtained by participants on ASC and SKI-HI LDS, respectively. The mean ASC score was 50.17 (SD = 16.90, range = 21–65, maximum possible score = 70). A mean ASC score of 50.17/70 on the detection–discrimination–identification–comprehension continuum indicates that participants in the present study achieved most of the skills required for auditory detection, discrimination, and identification. Auditory comprehension skills of most participants were emergent (see ASC subscale scores in [Table 3] for details). On the SKI-HI LDS, participants obtained a mean receptive language age of 32.18 mo (SD = 11.23, range = 5–46 mo, maximum possible age range = 54–60 mo) and a mean expressive language age of 29.91 mo (SD = 11.64, range = 3–46 mo, maximum possible age range = 54–60 mo). The mean age of participants at the time of SKI-HI LDS test administration was 45.91 mo (SD = 6.94, range = 37–61 mo). These data indicate that as a group, participants in the present study lagged by 13.73 mo in their receptive language skills and by 16 mo in their expressive language skills 2 yr after implantation as compared with the normative data.
Table 2
Descriptive Data of Participants on ASC and SKI-HI LDS
|
Mean
|
SD
|
N
|
ASC
|
50.17
|
16.90
|
12
|
SKI-HI LDS receptive language age (mo)
|
32.18
|
11.23
|
11
|
SKI-HI LDS expressive language age (mo)
|
29.91
|
11.64
|
11
|
Table 3
ASC Subscale Scores for All Participants (The Number in Parenthesis in the Top Row Indicates the Maximum Possible Score on Each Subscale)
Participants
|
ASC Detection (18)
|
ASC Discrimination (14)
|
ASC Identification (14)
|
ASC Comprehension (24)
|
ASC Total (70)
|
P1
|
18
|
14
|
14
|
19
|
65
|
P2
|
18
|
12
|
14
|
16
|
60
|
P3
|
18
|
14
|
14
|
13
|
59
|
P4
|
18
|
14
|
14
|
13
|
59
|
P5
|
18
|
13
|
14
|
13
|
58
|
P6
|
16
|
10
|
2
|
0
|
28
|
P7
|
13
|
4
|
4
|
0
|
21
|
P8
|
18
|
11
|
14
|
17
|
60
|
P9
|
18
|
13
|
14
|
19
|
64
|
P10
|
18
|
14
|
14
|
19
|
65
|
P11
|
14
|
6
|
5
|
1
|
26
|
P12
|
16
|
10
|
9
|
2
|
37
|
Results from the correlation of the BOLD contrasts with parent-reported language skill measures, and the fMRI contrast maps are described next.
Comparison of Speech and Silence
[Figure 2] and [Table 4] depict brain regions graphically and in tabular format, where significant correlations were found between the fMRI contrast of speech versus silence and the two behavioral measures. Correlation coefficients, corresponding Brodmann areas, and laterality of the clusters are reported in [Table 4]. The cluster size and the MNI co-ordinates of the center of the cluster are also listed. The MNI co-ordinates represent the location of a centroid in three dimensions: Lateral–Medial, Anterior–Posterior, and Superior–Inferior (x, y, and z, respectively). A positive “x” value indicates that the centroid is located in the right hemisphere whereas a negative “x” value indicates that the centroid is located in the left hemisphere. Similarly, positive “y” indicates anterior and negative “y” indicates posterior location with respect to the anterior commissure (AC, a white matter tract connecting the two hemispheres). Finally, a centroid with positive “z” value is located superior to the AC and a negative “z” value is located inferior to the AC. The speech versus silence contrast did not reveal significant correlations between preimplant fMRI activation and the ASC or the receptive language subtest of SKI-HI LDS. However, two clusters with significant positive correlation with postimplant SKI-HI LDS expressive language age were identified—supramarginal gyrus in the parietal lobe (Spearman’s correlation coefficient r = 0.89, corrected p via AlphaSim = 0.0008) and cingulate gyrus in the limbic lobe (r = 0.83, corrected p = 0.0014). [Figure 2] displays maps of Spearman’s correlation between preimplant fMRI activation and postimplant scores on SKI-HI LDS (expressive language subtest) for the speech versus silence BOLD contrast. For all figures, the identified clusters can be visualized in three separate sections: coronal, sagittal, and transverse. The coronal and transverse sections are presented in the radiological orientation, that is, the left side of the image represents the right side of the brain and vice versa. The color bar at the bottom of the figure represents correlation values from –1 to 0 (bottom half) and 0 to 1 (top half). For [Figure 2], bright colors (e.g., yellow) of the identified clusters depict a positive correlation value closer to 1.
Figure 2 Spearman’s correlation map depicting clusters of significant correlation (corrected p < 0.05) between the SKI-HI expressive language age and brain activation to the speech versus silence BOLD contrast in n = 11 CI participants.
Table 4
Clusters Revealed by Correlation between Preimplant BOLD Activation and Postimplant Scores on SKI-HI LDS Reveals the Two Clusters Listed Previously
BOLD Contrast: Speech vs. Silence
|
Assessment Tool
|
Brodmann Areas
|
Predictive Regions
|
Cluster Size
|
Laterality
|
MNI Co-ordinates (x, y, z)
|
Correlation Coefficient (r)
|
Corrected p Value
|
ASC
|
—
|
—
|
—
|
—
|
—
|
—
|
—
|
SKI-HI LDS receptive language
|
—
|
—
|
—
|
—
|
—
|
—
|
—
|
SKI-HI LDS expressive language
|
BA 40
|
Parietal lobe, supramarginal gyrus
|
45
|
Right
|
62, −58, 33
|
0.89
|
0.0008
|
BA 24
|
Limbic lobe, cingulate gyrus
|
42
|
Right
|
2, -2, 28
|
0.83
|
0.0014
|
Note: Preimplant MRLs and age at implantation were controlled for. AlphaSim was implemented post hoc to correct for multiple correlations. Significant Brodmann areas and cluster sizes are listed along with the MNI coordinates of peak correlation in the identified hemisphere for the speech vs. silence BOLD contrast.
Comparison of Noise and Silence
[Figures 3] and [4] represent maps of the Spearman’s correlations between the preimplant fMRI activation and postimplant scores on ASC and SKI-HI LDS for the noise versus silence BOLD contrast. [Table 5] lists the regions of high correlation between preimplant activation for the noise versus silence contrast and parents’ responses for these measures of auditory and language behaviors postimplant. Corresponding Brodmann areas, cluster sizes, laterality, MNI coordinates of the centroid of the cluster, and Spearman’s correlation coefficients are also tabulated here. A total of five clusters were identified to correlate significantly with the postimplant behavioral measures after the corrections for multiple comparisons. The left precuneus, an area in the parietal lobe, was the only brain region that correlated (r = 0.95, corrected p = 0.0011) with postimplant ASC scores. Four clusters correlated with the SKI-HI LDS, of which two correlated with the receptive subtest (right middle frontal gyrus, r = –0.90, corrected p = 0.0140; and right sub gyral, r = –0.88, corrected p = 0.0001) and two with the expressive subtest (left middle temporal gyrus, r = –0.86, corrected p = 0.0016; and right middle occipital gyrus, r = –0.86, corrected p = 0.0005). Note that for the noise versus silence contrast, brain regions that correlated with the SKI-HI LDS displayed a negative correlation value ([Figure 4]). For [Figure 4], the reader is directed to the bottom half of the color bar which indicates negative values of correlation. A dark blue color indicates correlation coefficient values closer to zero whereas light blue and green colors indicate correlation coefficient values closer to −1. The actual correlation coefficient values are reported in [Table 5].
Figure 3 Regression maps depicting clusters of strong correlation (corrected p < 0.05) between fMRI activation and ASC scores for the noise versus silence BOLD contrast. The color bar denotes the correlation coefficient value (from −1 to +1). Image views and their orientations are indicated at the top and bottom of the figure, respectively.
Figure 4 Correlation maps depicting clusters of strong correlation (corrected p < 0.05) between fMRI activation and SKI-HI LDS subtests for the noise versus silence BOLD contrast. The color bar denotes the correlation coefficient value (from −1 to +1). Image views and their orientations are indicated at the top and bottom of the figure, respectively.
Table 5
Correlation between Preimplant BOLD Activation and Postimplant Scores on ASC and SKI-HI LDS Reveals the Previously Mentioned Clusters
BOLD Contrast: Noise vs. Silence
|
Assessment Tool
|
Brodmann Areas
|
Predictive Regions
|
Cluster Size
|
Laterality
|
MNI Co-ordinates (x, y, z)
|
Correlation Coefficient (r)
|
Corrected p Value
|
ASC
|
BA 7
|
Parietal lobe, Precuneus
|
44
|
Left
|
−14, −62, 58
|
0.95
|
0.0011
|
SKI-HI LDS receptive language
|
BA 10, 11
|
Frontal lobe, middle frontal gyrus
|
31
|
Right
|
46, 46, −2
|
−0.90
|
0.0140
|
BA 19
|
Occipital lobe, subgyral
|
52
|
Right
|
38, –66, −7
|
−0.88
|
0.0001
|
SKI-HI LDS expressive language
|
BA 21
|
Temporal lobe, middle temporal gyrus
|
40
|
Left
|
−50, –66, 8
|
−0.86
|
0.0016
|
BA 19
|
Occipital lobe, middle occipital gyrus
|
46
|
Right
|
38, –78, −2
|
−0.86
|
0.0005
|
Note: Preimplant MRLs and age at implantation were controlled for in the second-level regressions. AlphaSim was implemented post hoc to adjust the significance threshold for multiple correlations. MNI coordinates and cluster sizes are listed along with Brodmann areas associated with the coordinates and hemisphere activated for the noise vs. silence BOLD contrast.
DISCUSSION
Use of Parent Reports
Structured parent questionnaires have been used to assess various linguistic and nonlinguistic behaviors in young children (e.g., [Coplan et al, 1982]; [Glascoe, 2000]; [Rescorla and Alley, 2001]; [Wolraich et al, 2003]; [Meinzen-Derr et al, 2007]). Use of parental reports is recommended in very young children or children with additional disabilities ([Kim et al, 2010]). [Yoshinaga-Itano et al (1998)] have emphasized the need for home-based parental observation to obtain holistic information about children’s speech-language abilities. Responses may be obtained in a more natural environment and are not limited by physical or temporal constraints. [Sharma et al (2009)] reported the clinical profile of a young girl who received a CI at 7.4 yr of age. Her cortical auditory evoked potential responses 6 mo postimplant revealed an abnormal polyphasic waveform. Her scores on Infant Toddler-Meaningful Auditory Integration Scale, a parental report of auditory behavior, correlated with CAEP responses and were also poor. On the other hand, [Kang et al (2010)] found that despite absent electric compound action potential or poor auditory steady state responses, CI recipients in their study received acceptable scores on parent observation reports. This variability indicates the need for identifying additional neuroimaging biomarkers (e.g., as identified by fMRI) to predict auditory/language behaviors based on parent observations.
The aim of the present study was to use fMRI to identify preimplant neuroimaging biomarkers that predict children’s postimplant auditory and language outcomes as measured by parental observation/reports, specifically the ASC and the SKI-HI LDS. Voxel clusters that correlated significantly with either ASC or SKI-HI LDS were identified in the vicinity of the temporal, parietal, and occipital junction. The different regions in this area are responsible for, among other functions, auditory processing of language, attention to sounds, extraction of meaning, and higher level processing and integration of information ([Stoeckel et al, 2009]; [Regev et al, 2013]; [Seghier, 2013]). This junction has been implicated in the processing of speech and nonspeech auditory stimuli ([Belin et al, 2000]). In addition, clusters in “extra-auditory” regions were also identified and are discussed in greater detail in the next section.
Neuroimaging Biomarkers
Given the auditory nature of the stimuli, temporoparietal lobe involvement was expected in the present study. The supramarginal gyrus (BA 40) in the right parietal lobe exhibited a strong positive correlation with SKI-HI LDS expressive language age (r = 0.89, corrected p = 0.0008) for the speech versus silence contrast. The left middle temporal gyrus (BA 21) also correlated with postimplant SKI-HI LDS expressive language age (Spearman’s r = −0.86, corrected p = 0.0016) for the noise versus silence contrast. The middle temporal and supramarginal gyri have been identified previously in neuroimaging studies of various language measures ([Ahmad et al, 2003]; [Schmithorst et al, 2006]; [Catani et al, 2007]; [Binder et al, 2009]; [Turkeltaub and Coslett, 2010]; [Horowitz-Kraus et al, 2013]; [Li et al, 2013]; [Regev et al, 2013]; [Seghier, 2013]; [Smalt et al, 2013]; [Tan et al, 2013]; [Specht, 2014]). Notably, [Tan et al (2013)] identified multiple brain regions that differentiated between normal and impaired hearing using a combined structural and functional image analysis approach. Implementing this approach on MRI/fMRI image data from 39 infants/toddlers with normal and impaired hearing, the authors identified 16 regions of interest (ROIs) that effectively classified between the two groups. These ROIs included the supramarginal and middle temporal gyri. In the present study, the correlation of these regions with ASC and SKI-HI LDS underlines the role of these regions in predicting postimplant auditory and language performance of young CI recipients as observed by the primary caregiver.
Brain regions other than the primary and associative auditory cortices such as the precuneus, the cingulate gyrus, and areas of the visual cortex have been identified in comparisons of brain activations between normal hearing individuals and those with hearing loss/CI ([Naito et al, 2000]; [Giraud et al, 2001a],[b]; [Kang et al, 2004]; [Green et al, 2008]; [Smalt et al, 2013]; [Tan et al, 2013]). The cingulate cortex has been implicated in processing of speech ([Xu et al, 2005]; [Obleser et al, 2007]), comprehension of narratives ([Schmithorst et al, 2006]), attending to relevant stimuli, and conflict resolution ([Weissman et al, 2005]). Blood flow changes in the precuneus in response to speech stimuli have been noted in normal hearing individuals as well as CI recipients ([Wong et al, 1999]; [Giraud et al, 2001a]; [Moteki et al, 2014]). Also, visual association areas (e.g., subgyral; BA 19) have been implicated in response to passive narratives ([Karunanayaka et al, 2007]) and “may be relevant to the future use of functional neuroimaging to guide predictions about speech and language outcomes in HI infants who receive a cochlear implant” ([Tan et al, 2013], p. 427).
[Lee et al (2005)] retrospectively studied the preimplant F-18 fluorodeoxyglucose positron emission tomography images of 11 congenitally hearing impaired children in the age range of 5–7 1/2 yr at cochlear implantation. They divided their participants in two groups based on the Korean version of the Central Institute for the Deaf (CID) sentence test: those with 2-yr postimplant K-CID score of >65% (the better performing group) and those with 2-yr postimplant K-CID score of <45% (the poorer performing group). They found greater metabolic activity in the frontoparietal regions of the good group and greater metabolic activity in the visual occipital regions of the poor group. Although the limited sample size prevented further subdivision of the group based on postimplant behavioral measures, the areas of significant correlation found in the current study are consistent with the [Lee et al (2005)] study. Five of the seven clusters identified as having strong correlations with the postimplant ASC score and SKI-HI LDS language age, were localized to the frontal, parietal, or occipital lobes. The correlation of extra-auditory regions with postimplant behavioral measures in the present study is further evidence of the predictive value of these regions in auditory and language performance.
Hemispheric Laterality
In this study, observation of significantly correlated clusters revealed no consistent pattern of laterality in response to the different auditory stimuli. Previously, leftward activation of temporo-parieto-occipital areas for speech stimuli has been noted (e.g., [Vigneau et al, 2006]; [Turkeltaub and Coslett, 2010]; [Seghier, 2013]). Fewer studies have found a rightward activation of speech-language areas in response to speech stimuli ([Belin et al, 2000]; [van Ettinger-Veenstra et al, 2010]; [Lazard et al, 2010]; [Smith et al, 2011]). However, a finding of mixed laterality (bilateral activation) especially in children predominates the fMRI literature ([Schmithorst et al, 2006]; [Holland et al, 2007]; [Yeatman et al, 2010]; [Moteki et al, 2014]). The results of a study by our group ([Deshpande et al, 2016]) displayed clusters of significant correlation in the temporo-parieto-occipital junction bilaterally in response to similar speech and noise stimuli as used in the present study. Split test-wise, current results show one cluster in the left precuneus that was predictive of the ASC score whereas five clusters in the right hemisphere (supramarginal gyrus, cingulate gyrus, middle frontal gyrus, subgyral, and middle occipital gyrus) which were predictive of the SKI-HI LDS language age. Only one cluster was identified in the left hemisphere that was predictive of postimplant SKI-HI LDS language age. This cluster correlated with the SKI-HI LDS expressive language age and was found in the middle temporal gyrus (BA 21). These results suggest that different brain regions in both hemispheres of the central auditory system of children with hearing loss may be predictive of auditory and language performance 2 yr after implantation.
Differences between “Speech versus Silence” and “Noise versus Silence” Contrasts
In the present study, comparison of the regression maps across different contrasts revealed that some clusters that correlated with ASC and SKI-HI LDS for the noise versus silence contrast failed to correlate with the behavioral measures for the speech versus silence contrast. For instance, a group of voxels in the parietal lobe that correlated strongly with ASC scores for the noise versus silence contrast did not present itself on the speech versus silence contrast for the same test ([Figures 2] and [3]). Only two clusters were identified on the speech versus silence contrast whereas five were identified on the noise versus silence contrast. In one of the landmark neuroimaging studies of CI users and normal hearing listeners in response to speech and noise, [Wong et al (1999)] found that the noise-like stimulus used in their study (multitalker babble) activated bilateral temporal regions in the CI group only. They attributed this finding to the lack of habituation to noise by the CI group. In the present study, the presence of bilateral clusters for the noise versus silence contrast may indicate a different coding strategy employed by young CI users in response to noise. The lack of habituation and the employment of a “coarse” coding strategy in response to noise may explain the presence of multiple clusters of significant correlation for the noise versus silence contrast ([Wong et al, 1999]). Another explanation may be related to the parameters of data analysis. Evaluation of the group composite maps revealed that although some clusters did not appear on the correlation maps, they were activated on the composite maps. This finding suggests that the minimum cluster size restriction in AlphaSim may have prevented some clusters from appearing on the correlation maps. Several studies in the fMRI literature have used a smaller cluster size (e.g., [Shomstein and Yantis, 2004]; [2006]; [Napadow et al, 2005]; [Phan et al, 2006]).
Positive and Negative Correlations
In the present study, positive correlations were found between two brain regions (right supramarginal gyrus and right cingulate gyrus) and the SKI-HI LDS expressive language age for the speech versus silence contrast. Similarly, one cluster (left precuneus) was identified that correlated positively with postimplant ASC scores for the noise versus silence contrast. These positive correlations suggest that a greater preimplant activation in specific brain regions may predict improved auditory and language performance in young CI recipients as measured by parental observations/reports.
Some correlation maps also displayed a negative correlation between the preimplant fMRI activation and postimplant test scores. For instance, both the SKI-HI LDS receptive and expressive subtest language ages correlated negatively with fMRI activation for the noise versus silence contrast. In other words, greater fMRI activation in response to noise indicated a lower language age 2 yr postimplant. This phenomenon was observed across different brain regions including the occipital lobe, the temporal lobe, and the frontal lobe.
Previous studies have found both structural and functional changes in the occipital region of children with hearing loss (e.g., [Lee et al, 2005]; [Smith et al, 2011]). In an F-18 fluorodeoxyglucose positron emission tomography imaging study of 11 CI candidates between 5 and 7 1/2 yr of age, [Lee et al (2005)] observed greater metabolic activity in the occipital regions of children who performed poorly on speech perception measures after implantation. In the current study, a simple explanation for the observed negative correlations may be that greater activation in certain extra-auditory areas may be detrimental to future auditory/language development and indicate that these regions are engaging in auditory processes that are not augmentative to recognition and processing of speech stimuli.
A negative correlation was also observed between fMRI activation in the middle temporal gyrus and SKI-HI LDS expressive language age for the noise versus silence contrast. Previous studies (e.g., [Chou et al, 2006a],[b]) have found a negative correlation between the middle temporal gyrus and semantic association in children. The negative correlation between activation in the middle temporal gyrus and relatively low semantic auditory stimuli may arise from “more extensive access to semantic representations in order to identify overlapping features” ([Booth et al, 2007], p. 775). The low semantic quality inherent to the narrow band noise stimuli used in the present study may have resulted in the negative correlations observed in the noise versus silence contrast.
Finally, a negative correlation was observed between fMRI activation in the right middle frontal gyrus (BA 10) and SKI-HI LDS receptive language age for the noise versus silence contrast. This observation is consistent with previous findings wherein activation in the right middle frontal gyrus was negatively correlated with reading, writing, phonological, and decoding skills in children ([Hoeft et al, 2007]). Decreased performance accuracy on semantic judgment and picture naming has also been identified with increased activation in the middle frontal gyrus ([Blumenfeld et al, 2006]).
Limitations and Future Directions
In the present study, a sample size of 12 may be perceived as a limitation. [Desmond and Glover (2002)] reported that fMRI studies with a sample size of 12 yielded adequate power to detect a significant differences in populations in fMRI studies. Although the present study had 12 participants, a post hoc power analysis was conducted using nQuery Advisor (nQuery Advisor 6.0, Statistical Solutions, Cork, Ireland) to investigate if the results could be extended to the general population. Power was calculated for each cluster and each contrast based on the mean voxel value of clusters that correlated significantly with ASC and SKI-HI LDS. Such analysis based on estimated slope and variance yielded power of up to 68% in certain clusters. The above results indicate that the present study had moderate power to detect a significant correlation between fMRI activation in relevant clusters and behavioral measures (scores/language ages on ASC and SKI-HI LDS).
In the present study, two major variables that affect postimplant auditory/language performance were controlled viz. preimplant auditory status and age at implantation. These variables directly affect auditory/language development and account for a large proportion of variance in postimplant outcomes ([Wake et al, 2005]; [Gilley et al, 2008]). However, because of the limited sample size, additional variables that may have affected postimplant behavioral outcomes (e.g., hearing aid use before implantation, duration or frequency of postimplant speech-language intervention) could not be controlled. Future studies may consider evaluating contribution of multiple variables affecting postimplant speech-language and auditory outcomes in a structural equation model (SEM).
One variable that could not be controlled for because of the limited sample size was side of CI and bilateral versus unilateral CI. Literature is still emerging on the benefits of early bilateral implantation for long-term speech and language outcomes ([van Hoesel, 2004]; [Litovsky et al, 2006]; [Brown and Balkany, 2007]). Therefore, we might expect bilateral CI patients in our cohort to experience better outcomes 2 yr after CI than unilateral implant patients. However, the parent report measures are not likely to be sensitive to the specific benefits of bilateral implantation such as better perception of speech in noise or better localization. We did not investigate unilateral, bilateral, or side of implant as covariates in this study because of the sample size limitation.
As early as the 1990s, [Kampfe et al, (1993)] emphasized the importance of education, guidance, and counseling in a successful rehabilitation program before, during, and after implantation. According to them, listing the benefits and limitations of the CI, discussing other possible alternatives, and providing opportunities for dialogue with other parents of CI recipients will help parents make an informed decision about cochlear implantation. If parents are provided evidence-based counseling about postimplant outcomes before implantation, it helps them set realistic expectations from the implant and prepare for the rigorous rehabilitation process. Parents with a greater understanding of the realistic benefits of CI report higher satisfaction with the CI (e.g., [Meadow-Orlans et al, 2003]). Both measures used in the present study (SKI-HI LDS and ASC) may be viewed as indirect measures of parental expectations in that they do not include questions such as “What is your expectation out of the CI?” Instead, these measures of behavioral observation include items that parents attribute to the children’s auditory/language performance. Results of the present study suggest that direct measures of parental expectation/satisfaction (e.g., [Zaidman-Zait and Most, 2005]) may be used in conjunction with fMRI in the future to help establish realistic expectations for parents considering a CI for their child.
CONCLUSIONS
The aim of the present study was to use fMRI to identify preimplant neuroimaging biomarkers that predict children’s postimplant auditory and language outcomes as measured by parental observation/reports, specifically the ASC and the SKI-HI LDS. A regression analysis was implemented between preimplant fMRI activation (in response to speech and noise) and postimplant scores on ASC and SKI-HI LDS. Clusters of significant correlation were identified in auditory as well as extra-auditory brain regions. The results of this study indicate that (a) it is possible to identify neuroimaging biomarkers of auditory and language performance before implantation and (b) brain activation in specific regions correlates with postimplant auditory and language outcomes as measured by parental observation/reports. These results may be helpful in directing efforts toward validating fMRI as an objective predictor of postimplant auditory/language performance of young CI recipients.
Abbreviations
ASC:
Auditory Skills Checklist
BOLD:
blood oxygen level dependent
CCHMC:
Cincinnati Children’s Hospital Medical Center
CI:
cochlear implant
CID:
Central Institute for the Deaf
fMRI:
functional magnetic resonance imaging
HUSH:
Hemodynamics Unrelated to Sounds of Hardware
MRI:
magnetic resonance imaging
MRLs:
minimum response levels
MNI:
Montreal Neurological Institute
SD:
standard deviation
SKI-HI LDS:
SKI-HI Language Development Scale