Keywords:
Reading disorders - systematic review - evidence-based medicine - meares-irlen syndrome
- colored lenses
Palavras-chave:
Transtorno da leitura - revisão sistemática - medicina baseada em evidência - síndrome
de meares-irlen - lentes coloridas
The scotopic sensitivity syndrome was identified in 1980 by the clinical picture of
complaints of perceptual dysfunction according to the light source, luminance, intensity,
wavelength and color contrast[1]. Subsequently, individuals with similar symptoms were also identified with reading
difficulties due to poor adaptation to color contrasts (light and dark) and distorted
graphic images. All these signs were labeled the Irlen syndrome (IS)[2].
This visual alteration was subdivided into six groups: people with photophobia (sensitivity
to brightness); distortion of the fundus (difficulty of adaptation between contrasts
such as light and dark); graphical distortions during reading (sensation of movement
of the letters); decreased visual field (clear image in the center with peripheral
cloudiness); difficulty in ocular fixation during reading; and change in depth perception[2]. Studies have reported that the scotopic sensitivity syndrome, or IS, (and even
Meares-Irlen syndrome), may result in a slow, ineffective and poorly comprehended
ability to read, as well as causing fatigue and tension to the reader[3],[4].
The pathophysiology of IS is claimed to be associated with changes in the visual cortex
and deficits of the magnocellular system, which is important during reading, and for
the acquisition of information from the visual system on movement[3],[5],[6]. Some studies have claimed that IS has a genetic component and is related to specific
biomarkers, affecting both sexes, and manifesting itself in different degrees of impairment[7],[8].
Current treatment recommends the use of filters and colored lenses in order to reshape
the light spectrum, reducing the contrast between light and dark to facilitate visual
and retinal photoreceptor adaptation. According to the data at the time of publication,
patients would be able to improve their comprehension of reading, attention, sensation
of depth, fatigue symptoms, among others, by wearing these colored lenses[2],[6]. Despite these reports[9], little data about the level of evidence for IS, or the treatments proposed by its
advocates, are available. Finally, evidence for the IS itself and even the legitimacy
of the therapeutic approach became questionable after the emergence of data regarding
the effect of the lenses being indistinguishable from placebo[10].
Thus, the objective of the present study was to carry out a systematic review, with
wide coverage on IS existence, the use of colored lenses as a therapeutic approach
and its effectiveness, evaluating the available data according to their level of evidence.
METHODS
A systematic review was performed according to the PRISMA[11] statement. Databases searched covered PubMed, Embase, PsycNET, ERIC, Cochrane, Clinical
Trials, LILACS and ScieLO, without time span constraints. In order to achieve wider
coverage, we first used (“irlen” OR “Irlen” OR “Irlen-Meares”) in PubMed and then
“Irlen” as the unique search term for “All Fields”. The same strategy was performed
for the other databases.
Selection of studies
Inclusion criteria: studies having an IS diagnosis; studies where colored lenses (or
filters) were used as an intervention. Exclusion criteria: reviews; when IS was not
the study subject; idiom that none of the authors could read; letters to the editor,
comments or merely specialists’ point of view; non-peer reviewed articles, grey literature
and unindexed or predatory journals.
Data extraction
Two investigators independently extracted data (EM, JSM). The following information
was retrieved from the studies: first author's name, publication year, population
characteristics (type, size, sex, age), the existence of a comparison group, study
type, presence of sample size calculations, tools used for IS diagnosis, intervention
(when possible), color filter wavelength measurement, study outcomes, reading rate
modification observed, risk of bias, main conclusions and level of evidence.
Level of evidence
Retrieved articles were classified according to the Oxford Center for Evidence-Based
Medicine's (OCEBM)[12] level of evidence. Three independent reviewers (JSM, RVGV, ESNG) determined the
level of evidence classification individually. Although discrepancies were rare, the
highest level of evidence was used on ties. Data extraction and summaries of data
were performed by two independent groups of two reviewers (EM, JSM and RVGV, ESNG).
Ties were dismissed by RW.
The OCEBM classification comprises five levels of evidence for each type of study
target (treatment/prevention, prognosis, differential diagnosis and diagnosis/symptom
prevalence study). Level I therapeutic studies comprise systematic reviews with meta-analysis
of randomized clinical trials (Ia) and well-designed individual randomized clinical
trials (Ib). Level II comprises systematic reviews of cohort studies (IIa), individual
cohort studies, randomized clinical trials with less than 80% follow-up (IIb) and
ecological studies (IIc). Level 3 comprises reviews of case-control (IIIa) and individual
case-control (IIIb) studies. Level 4 covers case-control and low-quality cohort studies
(absence of blinding or presence of biases and trend of results). Finally, level 5
consists of expert opinions, without critical evaluation, or based on physiology,
database search or “first principles”.
In this classification, there is no inclusion of descriptive or opinion studies. Thus,
we overestimated these types of works for level 5 in order to include them as potential
sources of information. This adjustment was based on the World Health Organization
criteria for the level of evidence[13], however the OCEBM was the main instrument.
RESULTS
Systematic review and summary of evidence
There are no MeSH terms in PubMed for Irlen Syndrome, Meares-Irlen Syndrome or Scotopic
Sensitivity Syndrome; the Emtree® suggests Irlen as a referred term, not an entry term, as does PsycNET and the Cochrane
Database.
[Figure 1] outlines our study selection process in a PRISMA flowchart[11].
Figure 1 Flow diagram of study selection according to the PRISMA statement.
The complete summary of findings is available as supplemental material due to space
constraints. It was not possible to perform a meta-analysis due to great variations
in study types, critical biases, extremely heterogeneous groups and generally low
standards, as will be demonstrated below. Nevertheless, summarized data is shown in
the [Table]. The main findings are presented in the [Table], ordered by publication date.
Table
Summary of evidence (resumed version).
|
Reference
|
Population
|
Study type
|
Diagnostic tool
|
Intervention
|
Main findings
|
Level of evidence
|
|
Population
|
Comparison group
|
|
O'Connor et al. 1990[14]
|
92 Children
|
Non-scotopic
|
CC
|
IDPS NARA FRI
|
PCOF DCOF
|
More visual comfort. Gained reading rate. Filter improved reading.
|
3
|
|
Scheiman et al. 1990[15]
|
Varied (age 10 to 49) n = 39
|
No control
|
CH
|
IDPS
|
|
More visual comfort. Gained reading rate. Filter improved reading.
|
4
|
|
Blaskey et al. 1990[16]
|
Varied (age 9 to 51) 12 male; 18 female
|
Non-scotopic
|
CCH
|
IDPS
|
IF
|
Self-declared more comfort. No gain in reading. Irlen filter group showed no significant gains.
|
4
|
|
Robinson, Conway 1990[17]
|
Varied (age 9 to 15) 33 males, 11 female
|
No control
|
CH
|
IDPS NARA SPAS
|
IF
|
Improvements on Irlen tests. No improvements on NARA and SPAS. Filter improved reading accuracy and comprehension, but not rate.
|
4
|
|
Martin et al. 1993[18]
|
7th graders n = 60
|
Normal readers
|
CCH
|
LILP NARA RPM NWT
|
IL/COF
|
No difference among groups. No gain in reading. No improvement with IL/COF.
|
4
|
|
Carrol et al. 1994[19]
|
Varied (age 10 to 20) n = 64
|
23 normal readers
|
CC
|
IDPS
|
Dark adaptation
|
No difference among groups. Inconclusive.
|
4
|
|
Evans et al. 1994[20]
|
Varied (mean age 22; age 7 to 12) n = 82
|
11 normal readers
|
CC
|
PGT SRVST
|
COF
|
No difference with lenses. No differences in PGT among groups.
|
4
|
|
|
|
|
NARA
|
|
Borderline significance supporting lenses.
|
|
|
Lopez et al. 1994[21]
|
Varied (children) 39
|
15 with “academic problems”
|
CC
|
ISSST
|
PCOF
|
No difference in PCOF users. No difference in academic performance. More research needed.
|
4
|
|
Sawyer et al. 1994[22]
|
Varied (age 7 to 15) n = 271
|
86 children without referred reading problems
|
CC
|
SRTa
|
PCOF
|
Differences inside the instrument variation. No improvement in reading. Very low effect; other refractive problems as the cause; more investigation needed.
|
4
|
|
Wilkins et al. 1994[23]
|
Varied (age 9 to 15) n = 37
|
Children with reported failing in reading, with and without lenses.
|
PCT
|
ISSST NARA
|
Intuitive Colorimeter®
|
No differences on NARA and other symptoms evaluated. No gain in reading. Lenses ameliorate symptoms of IS.
|
2
|
|
Evans et al. 1995[24]
|
Children aged 12 n = 42
|
26 nonresponsive to COF
|
CC
|
ISSST SRTb NARA PGT
|
Intuitive Colorimeter®
|
No differences on NARA or symptoms. No gain in reading. Suggests caution on data interpretation due to high similarity among case and control groups.
|
4
|
|
Adult and children
|
|
|
Interview
|
|
No differences on WRAT-R.
|
|
|
Spafford et al. 1995[25]
|
4 children, 4 adults
|
8 matched controls
|
CC
|
WRAT-R
|
COF
|
Lens color not critical for reading. Inconclusive; use of any unproven therapy could delay appropriate treatment.
|
4
|
|
Evans et al. 1996[26]
|
Reports on sample from previous study (25)
|
Participants report failing in reading, with/ without lenses.
|
CC
|
ISSST PGT Optometrics
|
COF
|
No difference in pattern glare and IS. Ocular motor anomalies correlate to IS. Method psychophysically primitive; optometric anomalies are priority in treatment.
|
4
|
|
Ciuffreda et al. 1997[27]
|
Adults (age 18 to 39) 2 males; 6 females
|
No
|
OB/CR
|
LILP
|
COF
|
No positive effect on steady-state accommodation. No improvement on accuracy; participants in fact needed other vision intervention.
|
4
|
|
Evans et al 1999[28]
|
Varied (age 4 to 73) N=323
|
No
|
RP/EP
|
ISSST IO IC
|
PCOF
|
Patient perception of improvement. 73% still wearing tinted lenses 1,5 yrs after.
|
4
|
|
Robinson, Foreman 1999[29]
|
Children (age 9 to 13) n = 113
|
35 controls with reading problems
|
PCT
|
ISSST/LILP NARA
|
IL
|
No difference among groups. No modification related to other groups. No improvements at all.
|
2
|
|
Robinson, Foreman 1999[30]
|
Same as previous (32) n = 88
|
28 controls from previous study (32)
|
PCT
|
ISSST/LILP NARA
|
IL
|
Statistical difference among selected groups. Improvements on some tasks.
|
2
|
|
Robinson et al 2000[31]
|
Varied (age 9 to 13) n = 158
|
125 referred 33 screening
|
PP
|
ISSST IS
|
-
|
84% of parents with IS No difference among groups Prevalence of symptoms on males
|
4
|
|
Bouldoukian et al. 2002[32]
|
Varied (age 7 to 40) n = 33
|
With/without overlay
|
CC
|
IO WRRT
|
IO
|
Increased reading rate due to IO. Increased rate due to practice on same test. IO improved rate of reading. No placebo effect.
|
2
|
|
Evans, Joseph 2002[33]
|
University students n = 113
|
13 participants without complaints
|
CC
|
IO WRRT
|
IO
|
More than 90% was 3,8% faster with IO. 21 participants stated that IO worsened perception. One-third of the sample shows benefit on reading (>5%) with colored lenses. Prevalence in adults equal in children.
|
4
|
|
Scott et al. 2002[34]
|
Children (age 10 to 12) n = 94
|
63 without IS criterion
|
CC
|
IO, WRRT, LRT
|
IO
|
No difference among groups with and without IO. Supports the beneficial effects of IO.
|
4
|
|
Children
|
|
|
|
|
No significant difference among groups; no increase in reading speed.
|
|
|
Northway, 2003[35]
|
(age 9 to 15) n = 64
|
With/without lenses
|
OB/RP
|
IO, WRRT, DEM
|
IO
|
Improvement only on WRRT, not DEM. Visual symptomatic children found a preferred colored overlay of benefit.
|
4
|
|
Waldie, Wilkins, 2004[36]
|
Children (age mean = 9.4) n = 23
|
-
|
OB
|
IO
|
IO
|
No significant difference among groups
|
4
|
|
Kriss, Evans, 2005[37]
|
Children (age 7 to 12) n = 64 dyslexic
|
32 dyslexic; with / without IO
|
CC with 2x2 mixed factorial
|
IO, WRRT
|
IO
|
Control group w. higher scores in WRRT; 34% of dyslexic group up to 8% faster on WRRT
with IO. IS affected normal readers. Although no significant difference in prevalence, more
IS in dyslexic children.
|
4
|
|
Hollis, Allen, 2006[38]
|
Adults (age 18 to 58) n = 58
|
-
|
OB/PP
|
IO WRRT PGT
|
IO
|
No benefit of IO for normal. Self reported symptom. Reading speed test recommended.
|
4
|
|
Riddel et al. 2006[39]
|
Children (age 9 to 16) 10 (6 male; 4 female)
|
-
|
OB/CS
|
Previous IO users; VEP
|
IO
|
No significant differences among groups. No objective VEP differences can be measured
in all children who claim benefit from the use of colored lenses.
|
4
|
|
Kruk et al. 2008[40]
|
Children (age 9 to 10) n = 36
|
18 non-dyslexic
|
OB/PP
|
IRPS, LILP, WRAT
|
No intervention
|
No difference among normal readers and dyslexic. IS diagnosis was not an indicator of visual deficit subtype of dyslexia. No relationship between IS and reading performance.
|
4
|
|
Children
|
|
|
|
|
NARA and SDMT improved in all groups.
|
|
|
(age 7 to 11)
|
|
|
|
|
IDPS improved in placebo and experimental, and decreased in control.
|
|
|
Mitchell et al. 2008[10].
|
n = 49 (35 male, 14 female)
|
With/without lenses
|
PCT
|
IDPS, NARA, SDMT, IC
|
IO
|
No difference among placebo and experimental groups.
|
2
|
|
|
|
|
|
|
Clear definition of visuoperceptual reading disabilities impaired the possible conclusion
and experimental design.
|
|
|
Nichols et al. 2009[41]
|
University students (mean age 23.4) n = 74 (7 male)
|
-
|
OB/PP
|
IO, WRRT, LADS
|
-
|
24% revealed IS. LADS occurred within IS. More research is needed.
|
4
|
|
Adults
|
|
|
|
|
No improvement in reading.
|
|
|
Taub et al. 2009[42]
|
N = 60
|
No referred symptoms of IS
|
OB/PP
|
Visagraph®
|
Colored lenses
|
No difference among groups. The IS symptoms were related to binocular/accommodative vision disorder.
|
4
|
|
Ritchie et al. 2011[43]
|
Children (primary school) n = 75
|
-
|
OB/PP
|
LILT, WRRT, MMSE, GORT
|
IO
|
No gain in reading. No difference among groups for all tests. More orthoptic problem in IS group.
|
3
|
|
Vidal-López 2011[44]
|
Secondary students (mean age 12) n = 54 (28 males)
|
27 paired individuals without IS
|
CC
|
PGT, IO, ISSST, PROLEC, SI, VS-SDT
|
IO
|
No differences with IO users. Attributional bias. Did not support the visual stress
theory. Suggested strong motivational effects.
|
3
|
|
Ritchie et al. 2012[45]
|
IS children (mean age 9) n = 18
|
10 non-IS
|
CC/CH (1yr)
|
WRRT, MMSE, GORT
|
IO
|
No difference among groups. No improvements with IO after 1yr. No benefits to reading. No effects in short or long term.
|
4
|
|
Chang et al. 2014[46]
|
IS students (age 8 to 34) n = 34
|
11 dyslexic
|
CC
|
Not specified
|
IO Korea
|
Improvement needed on IS diagnosis. Lenses efficacy should be measured. Objectively testing for the syndrome. Further studies required.
|
4
|
|
Kim et al. 2015[47]
|
Varied (age 13 to 41) n = 15
|
-
|
OB/PP
|
Self-reported IS
|
IO
|
Brain reacts different with and without lenses. Temporal regions activate after lenses.
|
4
|
|
Loew et al. 2015[48]
|
Non-clinical samples (age 21 to 60) n = 24 (9 males, 15 females)
|
With/without lenses.
|
OB
|
Self-reported; WRRT; PROLEC; IO
|
Tinted lenses
|
No differences in reading speeds among participants with and without lenses. Fluorescent lighting can affect readers at all levels of proficiency.
|
4
|
|
Alanazi et al. 2016[49]
|
Varied (age 18 to 30) n = 450
|
Female medical students
|
EP
|
Self-reported
|
Not defined
|
2% dyslexic 6% IS 33% dyslexic and IS
|
4
|
|
Garcia et al. 2017[50]
|
Children (age 9 to 12) n = 68 (36 male, 32 female)
|
-
|
OB/PP
|
IRPS/OO, WRRT
|
ISOs
|
No significant gain in reading. 9 individuals reported 6% increase in reading speed on WRRT. Filters improved reading rate.
|
4
|
IDPS: Irlen Differential Perceptual Schedule®, ISSST: Irlen Scotopic Sensitivity Screening
Test®; IRPS: Irlen Reading Perceptual Scale®; IRPS/OO: IRPS Optimum Overlay®; ISOs:
Irlen Spectral Overlays set®; LILP: Licensed Irlen Lens Practitioner; IO: Intuitive
Overlays®; IC: Intuitive Colorimeter®; PGT: Pattern Glare Test; WRRG: Wilkins Rate
of Reading Test; NARA: Neale Analysis of Reading Ability; SPAS: Students’ Perception
of Ability Scale; FRI: Formal Reading Inventory; RPM: Raven's Progressive Matrices;
NWT: Nonsense Word Test; SRVST: Simulated Reading Visual Search Test; SRTa: Salford
Reading Test; SRTb: Suffolk Reading Test; WRAT-R: Wide Range Achievement Test-Revised;
AIRA: Aston Index Reading Age; LRT: London Reading Test; DEM: Developmental Eye Movement
test; VEP: Visual Evoked Potential; SDMT: Symbol Digit Modalities Test; Visagraph®;
MMSE: Mini-Mental State Examination; GORT: Gray Oral Reading Test; PROLEC®; SI: Stress
Inductor; VS-SDT: Visual Stimuli based on Signal Detection Theory; DRT: Dyslexia Research
Trust UK; COF: Colored Overlay Filters; PCOF: Preferred Colored Overlay Filter; DCOF:
Different-Colored Overlay Filters; IF: Irlen Filters; IL/COF: Irlen Filters/Colored
Overlay Filters; IO: Intuitive Overlays®; CC: Case-Control; RCT: Randomized Control
Trial; PCT: Placebo Controlled Trial; NRCC: Non-Randomized Controlled Cohort; CH:
Cohort; CCH: Controlled Cohort; OB: Observational study; PP: Prospective; RP: Retrospective;
EP: Epidemiologic; CR: Case report; CS: Case series; PL: Previous use of Irlen (or
colored) lenses; IIP: Irlen Institute Patient.
Summary
The interval time between all publications varied from 1989 to 2018. The sum of narrative
reviews, letters, comments or responses comprised 44 articles, about one-third of
all references regarding IS, which corroborates the controversy on the subject. The
same proportion of original articles was retrieved for the summary (n = 45).
From the studies summarized, 21 were published in Ophthalmology or Optometry journals;
and 27 studies were published in the same four journals. One author[17] appears in 11 of the 45 studies; another one[23] appears in 10 and owns the property of several patents related to the diagnosis
and intervention in IS; a third author appears in six studies.
The sum of all participants studied was 3,963 (mean/SD of 90.07 ± 132.8), with an
irregular distribution between male and female participants, as well their ages or
the presence of individuals with comorbidities, irregular distribution of methods,
such as the absence of an ophthalmic or optometric evaluation in many cases, as well
as other characteristics, revealing the heterogeneity. However, removing study participants
from the strictly epidemiological articles reduced the main sample to 2,281 subjects
(with mean/SD of 57.03 ± 47.87); five of these studies[28],[29],[33],[49],[52] comprised a total of 1,682 participants (with mean/SD of 420.5 ± 250.7). Even with
these adjustments, the whole sample remained heterogeneous.
Along with that, as three different studies[25],[26],[30] shared the same group of patients and part of the data, they could not be summed
for effect size. It is worth noting that they shared other characteristics described
in several parts in the published articles. Other studies[28]
–
[31] partially used the samples from the same group in the same location for slightly
different analyses.
Study types were varied and, although some of them claimed a certain design, it is
arguable whether they followed the mentioned design. Only two studies[44],[45] provided sample size estimations in their design in order to find populational significance;
15 studies did not have proper control groups, ranging from undiagnosed or self-referred
asymptomatic individuals, to the declared lack of control group where needed.
A large study on IS prevalence was performed with 450 participants, aged 18 to 30[49], all female, and the authors did not perform any of the procedures for IS detection.
Instead, a survey was spread among a nonrandomized population. Another large prevalence
study[51] comprised 486 males and 265 females from ages 7 to 17; yet another[28], analyzed data from 323 participants aged 4 to 73 years and, finally, a study[31] covered 158 participants aged 7 to 13. Even with considerable sample sizes, their
heterogeneity is a matter of concern.
Ten tests for IS diagnosis were identified, where seven were proprietary and patented;
in some cases, it was difficult to discriminate when the tests were interchangeable,
were improved versions of the same test or new ones[53]. Along with these, 21 other tests were performed either for tool comparison or confirmation
of the presence of IS. Only two studies[10],[44] performed psychophysical tests. Nevertheless, the use of validated tests was rarely
reported.
Interventions were, in most cases, the use or not of colored lenses. However, this
was a confounding factor, because colored lenses were one of the identification tools
proposed to determine IS. Fifteen studies carried some kind of wavelength measurement
for the color filter instead of simply declaring a color for the filter. Of these,
four used physical measurements[23],[32],[48],[54] and the other 11 studies used a patented method for a color search instead of instrumental
wavelength measurements.
The main conclusions were unequivocal about the positive effects of the lenses or
as supportive for IS in 14 of the 45 studies. On the other hand, 14 studies claimed
the opposite; five stated that studies were inconclusive and four suggested further
studies. Diverse conclusions emerged from the other eight studies.
The level of evidence according to the OCEBM is shown in [Figure 2].
Figure 2 Level of evidence of the articles.
A total of five articles were classified as level 2 of evidence and three as level
3. As mentioned above, comments and letters were pushed up in their classification
to level 5 of evidence in order to consider specialized opinion.
The main findings of the upper level classified articles are presented below.
Level 1
Griffiths et al.[9] presented a comprehensive systematic review of the literature and was not limited
to IS, but to the effect of colored lenses in different situations. In this article,
51 studies covering 244 patients were analyzed. Inclusion criteria were randomized
clinical trials and exclusions were the absence of a control group and absence of
a formal reading measurement. Of the studies analyzed, 15 were related to the use
of lenses with colored filters for reading in IS. It was concluded that the use of
colored lenses to improve reading in individuals with IS could not be proven based
on the existing literature, regardless of the type of color system used for the study.
In addition, two trials with the lowest risk of bias failed to demonstrate any improvement
in reading through the use of the prescribed lenses[29],[30]. None of the studies with low risk of bias or high statistical value corroborated
the aid of lenses in reading. The majority of studies were subject to ‘high’ or ‘uncertain’
risk of bias in one or more key aspects of study design or outcome. Studies at lower
risk of bias provided less support for the benefit of colored lenses/overlays on reading
ability. While many studies reported improvements with colored lenses, the effect
size was generally small and/or similar to the improvement found with a placebo condition.
Galushka and Schulte-Körne[5] performed a systematic review and meta-analysis that aimed at the development of
guidelines for the diagnosis and treatment of reading and/or spelling disorders in
children and adolescents in Germany. Among several results, they showed robust data
from a metaanalysis where Irlen lens efficacy was not confirmed. They claimed that
IS is a non-symptom-oriented intervention with or without the lack of evidence. They
also state, within their higher grade of recommendation, that Irlen lenses should
not be used.
Level 2
Wilkins et al.[23] performed a double-blind placebo-controlled trial of colored filtered lenses in
children. Like the previous study, a strong selection bias for experimental and control
groups was seen when the authors claimed they selected children “who reported benefit”
from the study subject. The study also showed a very low size effect and assessment
data based solely on the children's opinions and perceptions. Finally, the authors
themselves reported no differences in assessment between group and control; however,
they reported less frequent symptomatology among the experimental group.
In the studies by Robinson and Foreman[29],[30], 113 individuals aged between 9-13 years, with poor reading and IS, were recruited
and randomly allocated to one of three experimental groups: with properly prescribed
(optimal or ideal) colored lenses (n = 38), with a blue lens (n = 41) or with a placebo
lens, similar to the ideal lenses but not improving visual symptoms (n = 34). In the
study, there was also a control group (n = 35) with poor reading skills but no IS.
This control group was recruited from a local school, different from the experimental
group, thus creating a potential selection bias. Although described as a 20-month
long-term placebo-controlled study, the participants only started using the recommended
lenses after the first three months. In the beginning no difference between groups
in any reading measurement was observed. The group with ideal lenses presented with
values slightly lower than the others. After three months, all groups showed improvements
on reading tests, with no significant difference among them. It was suggested that
this result occurred due to reading practice and not to the lenses themselves. Although
there was improvement in textual comprehension in the group with optimal lenses, comparisons
between groups in a parallel group study should be analyzed carefully[6]. As well, both studies were carried out with selection bias for the studied groups.
They used several self-reported instruments for the main assessments in reading and,
finally, reported a nonsignificant increase in the rate of reading[29] and a failure to find significant improvement for the experimental groups[30].
Bouldoukian and colleagues[32] carried out a randomized control trial with 33 participants, comprising children
and adults for testing the effects of colored Irlen lenses. Even though it was claimed
to be a randomized control trial, the study had major flaws in design. The group selection
was biased by inclusion of individuals who knew their condition; very low size effect
with misleading percent data; arguable randomization protocol; and assessment data
based on self-reported tasks and merely asked preferences.
Level 3
O'Connor et al.[14] carried out a study in 92 children with reading disabilities, classified as scotopic
or non-scotopic. The diagnosis was made using the Irlen Differential Perceptual Schedule
and participants were randomly assigned to one of six treatment groups using colored
or clear overlays. The authors showed significant reading improvements among scotopic
children with the preferred colored overlay filter in comparison with those not using
filters, whereas non-scotopic children showed no change. The study had serious selection
bias where children were pointed to the study by teachers who had identified reading
difficulties. Although the authors claimed double-blind procedures, it is arguable
whether they carried them out properly. Also, it is worth noting that the study lacked
validated instruments for assessment.
Ritchie et al.[43] examined 75 children with below-average reading ability measured by validated instruments
and evaluated by an Irlen-certified professional. Fourteen children were excluded
from the study because they could not participate in the screening test for the syndrome.
Of the remaining 61 children, 47 were diagnosed with SI (77%). Three of these 47 were
removed from the main analysis because they knew their ideal lens color. All the children
were tested using prescribed filters, placebo and a clear filter. In both the control
and the experimental groups, the filters had no significant effect on reading. The
three children who were not in the blinding showed a significant improvement in the
reading test, indicating a significant placebo effect, even though it was considered
biased and non-representative, as well as a possible bias of analysis once a positive
result was removed from the main analysis. Of the children diagnosed with the syndrome,
44 participated in a parallel study, in which 22 received a colorless filter and 22
received the ideal filter. There was no significant difference between the two groups
in any of the readings. The same group was followed for a year; 22 children (30%)
were still using the filter or the lens and, those who were still available for analysis
showed evident reading deterioration. Ritchie et al. also used the same group of patients
in a sequel study in order to follow up the group. Data showed no pattern to distinguish
the IS children who continued treatment from those who did not. The authors also report
that in their study, like others, about 50% of the colored lens users abandoned the
device after one year.
Vidal-López[44] tested the theories of two models of colored lens effects for reading speed: attribution
bias theory and the visual stress theory associated with reading. The attribution
bias consists of the placebo effect that the lenses may possess. A visual stress model
(or cortical hyperexcitability) was proposed based on standard contrast tests. This
test causes symptoms of visual stress in some individuals, which are mitigated by
the use of specific lenses[54]. According to this theory, lenses can normalize electrical discharge patterns in
hyperexcitable neurons. The experimental group consisted of individuals with high
scores on tests and had a preferred color for reading. The control group consisted
of individuals with low scores and who did not have a preferred reading lens. Only
the reading speed from the control group with colored lenses showed a statistically
significant improvement. The results did not corroborate the visual stress theory
since the improvement in reading speed was not higher in the experimental group during
the visual stress test. In addition, the data supported the idea that the use of colored
lenses may have induced changes in the individual's reading pattern (even if statistically
not significant), as proposed by the attribution bias theory. Alternatively, lenses
may have changed the individual's motivation and expectation (placebo effect), leading
to more engagement in the reading task, transferring his disability to an external
cause (the lenses).
DISCUSSION
In Griffiths et al.[12], the authors cite the book “The Irlen Revolution”, in which the author argues that
only the colored filters provided by the Irlen Institute are effective in treating
the syndrome, but there is no scientific basis for such an assertion. In addition,
in the studies found on the Irlen Institute, a consistent definition of the disease
was applied, as well as a diagnostic procedure involving test materials provided by
the professionals trained by the institute. The book by Helen Irlen could not be accessed
by the authors of this study, nor is there any evidence on the efficacy of Irlen lenses
alone.
Sample size, group selection and study type
The summaries presented herein show the enormous heterogeneity in group selection,
sample size estimations and validity, as well as major flaws in study design. In this
particular case, it is remarkable that the control group had the worst measured reading
and visual parameters in comparison with all the experimental groups[29]. It was also related that studies had their groups selected from special schools
or from teachers’ criteria[14]. Another finding was that language proficiency testing was scarce, with a notable
case where the control group, which had English as their first language, was more
likely to be the participants who read significantly faster with their overlay[33]. The data showed the control group had only 39% of individuals with English as their
first language and the experimental group with the overlay comprised 67% English speakers.
Also, a common feature was the fragility or even absence of tests for the control
group and severe imbalance among the case and control groups. Self-reported reading
discomfort or difficulties were widespread among the studies presented here.
The IS diagnosis was a major focus of criticism. More than the vagueness of IS symptoms[55], some authors affirmed that when visual perceptual distortions were reported when
reading, and such symptoms were alleviated by colored filters, then the individual
was affected by the condition[24].
The IS diagnosis is generally based on tests developed by Irlen and colleagues[56], comprising three steps: (1) a questionnaire of 32 questions about ocular and reading
symptoms; (2) a series of visual tasks, such as responding to questions about visual
distortions while observing images; (3) an assessment of the degree of improvement
of the presented symptoms and improvement in the visual and reading tasks while using
the colored lenses[23]. At the end of the reading task, all subjects were asked: How do your eyes feel?
Did you have any difficulties with the print? Did you have any difficulties with the
light? Is there anything you would like to add? The test remains unpublished.
Despite the claims of validity for IS diagnostic tools, the most cited validation
studies raise several concerns. Only four studies performed validation tests, two
of them are unpublished dissertations[57],[58], and two[59],[60] do not include critical aspects of the validation processes. Data from these articles
do not support proper validation by methodological limitations such as the construction
of the tool and its premises, sample size estimation, validation of the construct,
internal scores, accuracy and reliability, even when compared with other scales of
perception[8]. It is recognized, even by supporters of the tests[37], that the severe limitations to addressing estimates of prevalence provided by Irlen,
due to her not providing any data or diagnostic criteria, result in the lack of an
objective diagnostic test. Hence, it is not surprising that strong criticism relies
specifically on IS detection.
The method has low sensitivity and low specificity[9]. Even readers without the syndrome, if they have poor reading, can be diagnosed
with the syndrome. The opposite is also true: individuals who may have the syndrome
to a lesser degree may not be diagnosed. Therefore, the method is criticized as hyper-inclusive[5]. Some of these tests were restricted to individuals who underwent the training program
at the Irlen Institute and became certified Irlen screeners[44]. In this case, even if it was a validated test, many of the tests performed within
these studies should be regarded as invalid or flawed by the lack of such certification.
Another point of criticism is the fact that a significant part of the test for IS
is based upon the patients’ preferences for a specific color. It was specified by
Evans et al.[28] that when the patient showed preference for a colored overlay, then this color should
be adopted for reading if one found it helpful. In other words, the preference for
a color might define the treatment. This has profound effects, especially on children.
Moreover, selecting individuals for study, with the objective of evaluating the effect
of the lenses, induces a selection bias. Irlen Syndrome is diagnosed on the basis
of either the sustained voluntary use of an overlay or an immediate improvement (of
more than 5%) on the Wilkins Rate of Reading Test[37]. Wilkins developed this test to isolate and measure the effect of visual factors
on reading[37]. Most reading tests are designed to evaluate high-level reading skills, but not
the contribution of visual factors to reading. However, the Wilkins Rate of Reading
Test is criticized for being devised to be visually crowded and to maximize visual
discomfort by simulating the striped effect, which is known to cause a pattern glare
effect. Visual discomfort is maximized by reducing the spacing between words in order
to achieve this effect[35]. This may affect reading performance in the presence of eye movement control difficulties.
In this sense, when separating an experimental group with IS individuals and a control
group without IS, the experimental group inherently respond to the lenses in the diagnostic
method. This scenario occurred in many studies[15]
–
[18]. Nevertheless, even with this flagrant bias, most of the studies did not present
statistically significant differences between the control and experiment groups, with
adequate lenses or placebo[15],[16],[20].
Interventions
Among the arsenal of possible lenses, there are 55 different colors that can be prescribed
in combinations of pairs, trios or even quartets for the final prescription. That
is, there are a total of 368,830 possible combinations. Thus, there is a considerable
random effect present in this modality of treatment, which cannot be considered valid
for clinical practice and, therefore, is seriously questioned[4]. The lenses are prescribed using the Intuitive Colorimeter®, an instrument developed by Wilkins and patented by the Medical Research Council[37].
Another point of concern is the fact that no wavelength measurement is performed in
order to determine the actual color of the filter. Only in four studies was this procedure
conducted. The tones of blue or pink or yellow are a continuum along the spectrum
and very precise measurements are possible. Furthermore, the environmental luminance
was barely considered in a few studies, which undoubtedly interfered in the final
color received by the patients’ retina. Under any psychophysical test, this omission
is considered a fatal flaw for the experiment and is like saying that a certain drug
dosage is not measured before administration. The feasibility of such experimental
control and the possibility of variable control was not observed in the studies.
Studies conclusion
Many studies[33]
–
[37] contradicted their own results, even claiming that, although not statistically significant,
their study had shown improvements in reading speed in children with severe self-reported
reading difficulties, when compared with other participants. There were also statements
that, when no test detected significant differences in reading speed between the groups,
the “obvious reason” was ascribed to the test[48].
More than fallacies, many of them resort to highly-speculative mechanisms in order
to find possible, but far from feasible, mechanisms. Regarding the most accepted underlying
mechanisms for IS, two models are proposed as a possible role of the lenses when any
improvement is detected. The most identified is the placebo effect. Individual characteristics
about the perception of being involved in a study for their own reading deficiency
possibly increases motivation to carry out the reading task presented. In addition,
the participants transfer their disability to some external resolution (the lenses),
easily applicable, and not internal resolution (more reading practice, slow reading
etc.)[44].
Another possible mechanism is to alleviate cortical neuronal hyperexcitability present
in individuals with the syndrome. Clinical studies have failed to identify such a
mechanism and the effect of lenses on it. Neuroimaging studies[47], however, show results that may support this theory. In one case report[53], a cortical response was verified in the child identified with IS. However, the
study showed a low level of scientific evidence, several experimental flaws and has
not yet been reproduced. Finally, the anatomical and functional knowledge about parvocellular,
magnocellular and koniocellular pathways, the mechanisms underlying visual information
processing, is robust and well established. The magnocellular pathway is insensible
to colors and to the visible spectrum, being responsible for stereopsis. These facts
expose the unfounded use of colored filters and reinforce that the reduction of contrast
could worsen the reading ability[61].
Overall features
Concerns are expressed[62] about the validity of IS and Irlen lenses due to the lack of evidence, and the fact
that colored overlays achieve no significant results. Experiments on reading problems
involving suppression of distractors, background pattern removal or foreground clutter
have some evidence of function[63]. The IS advocates have been urged to publish their data in order to make them available
to the scientific community, if this evidence, in fact, exists[61].
It is worth noting that most of the IS and Irlen lenses advocates generally rely on
potentially outdated research for their argument. Several improvements in the understanding
of visual attention and its neural correlates have been attained. From the 122 articles
retrieved in the present study, only 15 were published in the second decade of the
2000s. Even so, data from these previous studies ignored findings from their time[64] and still claimed improvements from colored lenses for everyone.
Another point to consider is the fact that a large bulk of data has been produced
about the influence of environmental colors on cognitive tasks, perceptual performance
and the sleep cycle, to mention a few. It is remarkable that IS advocates changed
their position slightly[33], shifting from the controversial lacking evidence denomination of IS to Meares-Irlen
syndrome/visual stress and then to a vaguer term, visual stress, supposedly as the
same condition. This suggests that academic discussions about IS maybe confined to
a restricted community. Not surprisingly, the strongest data challenging the findings
surrounding IS were presented outside this community. Even so, the heterogeneity emerges
as a critical issue when analyzing these studies, as shown below.
Taking everything in account, the American Academy of Ophthalmology and the American
Academy of Pediatrics published a guideline, in 2014[65], for clinical decision making. After 34 years of research, since IS was described
in 1980, scientific evidence on disease diagnosis and lens therapy still show invalidities
to corroborate the applicability of such processes. In this way, both academies do
not recommend the use of colored lenses for treating reading deficits, similar to
data presented by Galuschka and Schulte-Körne[5], and from another study by these authors, on reading disabilities[66]. Finally, the dilemma posed by Sagan[67] on the persistence of selected conclusions is applicable to IS and its colored lenses
advocates in a way that indicates that stronger evidence is still needed.
In conclusion, after more than three decades since the description of IS, despite
the quality of the current evidence regarding the subject it does not allow us to
completely refute the disease in terms of symptomatology. The present study found
data arguing that the screening and diagnostic tests are at least questionable, if
not valid. Despite the IS treatments and the claims of its harmlessness or, at least,
innocuous effects, more evidence-based medicine is needed. The prescription of colored
lenses specifically for this spectrum of symptoms should not be recommended. Adequate
scientific evidence is the only way to dismiss the doubts about the real usefulness
of such treatment. Also, IS supporters have not been successful in either the recognition
and identification of IS per se, or with possible and effective treatments. In fact, the clinical descriptions do
not seem definitive and, therefore, any proposal of a therapeutic approach will have
methodological difficulties. Under the light of the scientific method, the rhetoric
controversy only favors the colored arguments of IS advocates. To our knowledge, we
have presented the most comprehensive review on IS to date. We conclude that the use
of colored lenses or overlays to ameliorate reading difficulties cannot be endorsed,
and that any benefits reported by individuals in clinical settings are likely to be
the result of placebo, practice or the Hawthorne effect, consistent with previous
reviews and advice from several associations. Therefore, it is still necessary to
develop an accurate diagnosis of IS to develop further therapeutic approaches.