Key words
ultrasound - adenocarcinoma - Crohn’s disease - strain ratio - ex-vivo
Introduction
Differentiation between benign and malignant tissue is a major challenge in medical
imaging. Tissue samples are usually necessary for a reliable diagnosis, but they may
be difficult to obtain. Pathological changes may alter tissue structure and hence
its elastic properties. New imaging modalities have been introduced as potential tools
for the discrimination between malignant lesions and benign tissue based on changes
in elasticity or strain distribution patterns [1]. Real-time elastography (RTE) is an ultrasound-based technique in which the distribution
of strain is mapped when soft tissue is exposed to repetitive stress. The method was
described by Ophir et al. and referred to as quasi-static elastography [2]. Several methods for registering strain or elasticity using different forces to
induce tissue deformation leading to recordable displacements or shear waves have
subsequently emerged [3]
[4]. These techniques have been used to characterize tissue in focal lesions, contracting
organs, such as the heart [5]
[6], stomach [7], and pancreas [8], or in liver fibrosis [9]
[10]. Hard tissue is characterized by low strain, while softer areas show higher strain.
RTE has shown promising results for differentiating between malignant and benign lesions
in breast lumps [11], prostate tumors [12]
[13], thyroid nodules [14]
[15]
[16], lymph nodes [17]
[18] and the pancreas [8]
[19]. Both malignant tumors and chronic inflammation may lead to the formation of soft
tissue fibrosis [20]
[21]. Fibrosis formation in malignant tumors is called desmoplasia and may contribute
to increased tissue hardness through several mechanisms involving tumor cells, stromal
cells and the extracellular matrix [22]. Malignant tumors are also usually characterized by increased interstitial pressure,
a feature that decreases quickly in devascularized tissue [23]
[24]
[25].
In this study, RTE was performed on surgically resected bowel specimens containing
benign or malignant lesions using one categorical classification system and two semi-quantitative
methods for relative strain assessment. Our aim was to evaluate whether RTE could
differentiate lesions caused by inflammation from malignant neoplastic lesions using
qualitative and semi-quantitative methods for strain assessment. Furthermore, we examined
whether strain ratio (SR) measurements were affected by changes in the elasticity
dynamic range (E-dyn). Finally, we also wanted to correlate elastography results to
a histological semi-quantification of fibrosis, inflammation parameters and tumor
description.
Material and methods
Patients referred for elective bowel resection were recruited for this feasibility
study from May 2008 to June 2009. All patients signed an informed consent to participate.
The study was approved by the Regional Committee for Medical and Health Research Ethics
in Western Norway and was conducted according to the Helsinki Declaration.
38 lesions from 27 patients were examined, 16 sections from Crohn’s stenoses in 9
different patients, 18 sections from adenocarcinomas in 16 patients and 4 sections
from adenomas in 3 different patients. One cancer patient had a synchronous adenoma
and another had two synchronous adenocarcinomas. Two adenomas were separate lesions
from the same patient. In one patient with a multilobular tumor, two different tumor
sections were included as separate lesions. In the Crohn’s disease specimens, separate
skip lesions as well as the proximal and distal end of some longer stenoses were included
as individual lesions.
The observer was not blinded to the indication for surgery, but the final diagnosis
was confirmed by histopathology after the examination and image evaluation.
Newly resected specimens were collected from the operating room, rinsed in water and
subsequently examined by ultrasound with RTE in a specially designated chamber. A
2-cm layer of hard paraffin wax covered the bottom of the chamber in order to limit
US reflections and to provide hold for the needles used for fastening the specimens.
A 1.5-cm layer of flexible agar (Agar No.1, 1 %, Oxoid Ltd, Basingstoke Hants, UK)
was placed on top of this to make a flexible and echo-free background for ultrasound
scanning. The scanning plane of a captured elastogram was marked on the specimen with
3 identically colored needles ([Fig. 1]). The specimen was then fixed in a 4 % formalin solution for 3 – 7 days before a
pathological examination was performed to obtain a histological confirmation of diagnosis.
Fig. 1 The surgical preparations were examined in a transparent plexiglass chamber with
the bottom covered with solid paraffin wax and a separate layer of 1.5 cm of 1 % agar.
The tissue was fixed to the agar layer with needles (yellow color) and the sections
examined with B-mode US and elastography were marked with three needles with the same
head-color (red, black, green) in order to define the position of the lesions and
the scanning direction.
Abb. 1 Die Untersuchung der chirurgischen Präparate erfolgte in einer transparenten Plexiglaskammer,
deren Boden mit einer festen Paraffinschicht und einer 1,5 cm dicken separaten Schicht
bestehend aus 1 % Agar bedeckt war. Das Gewebe wurde mit Nadeln (gelbe Farbe) auf
der Agarschicht fixiert und die mittels Graustufenultraschall und Elastografie untersuchten
Bereiche wurden mit drei gleichfarbigen Nadeln (Kopf in rot, schwarz, grün) markiert,
um die Position der Läsionen und die Messrichtung festzulegen.
Ultrasound equipment
A Hitachi Hi Vision 900 ultrasound scanner with software version V16 – 04 STEP 2 and
a L54 M linear probe with frequencies 9 – 13 MHz (Hitachi Medical Corporation, Tokyo,
Japan) were used in the study. The elastography modality was developed for free-hand
application, and small repetitive compression/decompression movements of the probe
were used to provide stress and induce tissue strain. With this method, relative strain
values in different areas are calculated based on the assumption that stress is evenly
distributed over the region of interest (ROI). A color map showing the strain in individual
image elements as compared to the mean strain of the whole ROI is produced. The color
distribution depends on the selected level of the E-dyn. A graphic indicator on the
display shows when there is sufficient strain to produce an elastogram. The applied
strain imaging algorithm records changes in echo positions between consecutive radio
frequency frames using the extended combined autocorrelation method (ECAM), which
is an extension of the original quasi-static elastography [26]
[27].
We used the following parameter settings for RTE: Frame reject: 6, noise reject: 4,
persistence: 2 – 5, smoothing: 2 – 3. The VAS and categorical scores were determined
using E-dyn level 4 (default level). SR examinations were also recorded at E-dyn levels
2 and 6. The frame rate varied between 9 and 14 depending on the selected ROI size.
Methods for strain assessment
We applied a categorical classification based on the color distribution within the
lesion as originally described for pancreatic lesions by Janssen et al. [28]. In this classification, three basic patterns of color distribution are recognized:
Type 1 is relatively homogeneous, type 2 has 2 or 3 different colors and type 3 has
a honeycomb pattern. The colors present are represented by the subsequent letters
usually indicating a declining proportion in the lesion: A = blue, B = green/yellow
and C = red. 32 lesions (92 %) were classified according to this system and fell into
five categories: 1B: homogeneously green, 2BA: green > blue, 2AB: blue > green, 1A:
homogeneously blue and 2 A-B-C: presence of blue, green and red without one of the
colors being classified as more prominent.
A visual analog scale (VAS) consisted of a 100-mm line where the observer marked the
assumed level of the lesion’s hardness as assessed by elastography at E-dyn level
4. If the lesion was regarded as equal to the surrounding tissue, the mark was set
at 50 mm. Lesions appearing harder were marked between 50 and 100 mm and softer lesions
were marked between 0 and 50 mm. Assessments using the VAS score and visual categorical
score were repeated by a trained observer blinded to the clinical data using video
clips of the elastography seen in Windows Media Player (Microsoft Inc., Ca, USA).
The results were compared to those obtained in the original scoring by the non-blinded
observer.
The third strain evaluation method was a semi-quantitative measurement of SR comparing
the reference tissue (B) and the lesion (A) using the following formula[]:
SR is a built-in scanner software function and was first introduced in breast scanning
and was called the “fat-lesion ratio”. It compares the mean strain in a user-selected
portion of the reference tissue with a similarly selected part of the lesion, assuming
that both areas have been subjected to the same amount of stress. We selected circular
areas of reference tissue at the same depth and with approximately the same size as
the lesion for all measurements ([Fig. 2]). In most cases, the central area of the lesion was selected for calculation (area
A). However, if the lesions had reproducible differences in tissue hardness, the hardest
area was selected as area A. Pericolic or mesenteric fat and connective tissue as
well as parts of the normal bowel wall could be included in the reference area. SR
measurements were performed on representative image frames with sufficient signal
quality. Measurements were usually repeated 3 times on E-dyn levels 2, 4 and 6, respectively,
thus yielding 9 SR measurements per lesion. However, only 3 SR measurements were recorded
in three lesions. (E-dyn scale: 1 – 8).
Fig. 2 Split-screen images showing B-mode ultrasound image on the right side and B-mode
image with elastogram on the left. The displayed colors represent the degree of strain
in the tissue covered by the region of interest. The strain ratio is calculated by
the mean strain in the reference tissue (area B) divided by the mean strain in the
lesion area (area A). The B-mode image is used for placing the areas A and B for strain
ratio calculation. Images represent: a: Adenoma (SR 1.58), b: Adenocarcinoma (SR 1.80) and c: Crohn's lesion (SR 3.73).
Abb. 2 Splitscreen-Darstellungen zeigen das B-Bild auf der rechten Seite und das B-Bild
mit Elastogramm zur Linken. Die dargestellten Farben stellen den Strain-Grad des Gewebes
in der „Region of Interest“ an. Die Strain-Ratio wird berechnet aus dem mittleren
Strain des Referenzgewebes (Bereich B) geteilt durch den mittleren Strain der Läsion
(Bereich A). Das B-Bild wird benötigt, um die Bereiche A und B zu platzieren und die
Strain-Ratio zu berechnen. Die Bilder stellen dar: a: Adenom (SR 1,58), b: Adenokarzinom (SR 1,80) und c: Crohn-Läsion (SR 3,73).
Pathological evaluation
One dedicated pathologist conducted all of the pathological examinations. The pathologist
was assisted by the person who performed the ultrasound to ensure the correct orientation
of tissue planes according to the previous US elastography examination during the
gross examination of the formalin-fixed specimens. Whole mount slices with a thickness
of 2 mm were taken according to the needle marks, inked for orientation and processed
by standard procedures. Haematoxylin-eosin-stained sections were investigated for
the following parameters of inflammation in cases of Crohn’s disease: ulceration,
granulation tissue, fissure ulcers, fistula, abscess, cryptitis, and lymphoplasmacytic
inflammation. A fibrosis score was determined in all cases. Lymphoplasmacytic inflammation
and fibrosis were graded semi-quantitatively as absent (0), slight (1), moderate (2),
or severe (3).
The fibrosis score was based on a semi-quantitative visual evaluation performed by
one pathologist. In order to validate this score, the same pathologist determined
a second score for all lesions one year after the first scoring. A Kappa score of
0.37 was obtained, indicating fair reproducibility of the fibrosis score.
Statistical analysis
The VAS scores were found to be within the limits of normal distribution for adenomas,
Crohn’s lesions and adenocarcinomas as tested with the Shapiro-Wilk test, and comparisons
between the groups were based on mean values with standard deviations in each group. When
evaluating SR differences, the median of repeated SR measurements in each lesion was
selected for further quantitative analysis. When comparing the three diagnostic groups
of lesions, the mean of these SR medians was calculated within each group. The median
SR results from Crohn’s lesions and adenocarcinomas were not distributed normally.
A log transformation was attempted, but did not yield a normal distribution. We had
included one to three separate lesions from the surgical specimens due to skip lesions
in Crohn’s lesions and some synchronous neoplastic lesions. Consequently, the untransformed
data were analyzed using a generalized estimating equations (GEE) model allowing analysis
of both related and unrelated observations. In this model, SR and VAS results for
Crohn’s lesions and adenomas were compared to the corresponding results for adenocarcinomas,
which served as a reference. For comparison of SR measurements at E-dyn levels 2,
4 and 6, we used a Spearman’s rho correlation and Wilcoxon-Mann-Whitney test for paired
differences between the medians of three recorded SR measurements at each E-dyn level.
The significance level was adjusted to apply for 3 measurements: p = 0.05/3 = 0.017. This
test also provided information about the repeatability of SR measurements in the individual
lesions.
The interobserver agreement for categorical scores was assessed using Cohen’s Kappa,
which expresses the agreement controlled for agreement by chance and is considered
a conservative measurement of interobserver agreement. The Kappa values were interpreted
in the following manner: 0 = no agreement, 0 – 0.20 = slight agreement, 0.21 – 0.40 = fair
agreement, 0.41 – 0.60 = moderate agreement, 0.61 – 0.80 = substantial agreement,
and 0.81–-1 = almost perfect agreement [29]. Continuous scores (VAS) were compared using correlation (Pearson’s) and limits
of agreement [30]. Statistical analyses were performed using SPSS 17 and IBM-SPSS 19 (Chicago, Ill,
USA).
Results
Categorical evaluation of strain images
The qualitative classification of color distribution in the elastogram showed reduced
strain in Crohn’s lesions as well as adenocarcinomas indicating increased tissue hardness
compared to surrounding tissue. Elastograms of an adenoma, an adenocarcinoma and a
Crohn’s lesion are shown in [Fig. 2a-c]. [Fig. 3] shows the distribution of qualitative strain image categories within each group
of lesions evaluated by observer 1. 29 of 33 scored cases were described as category
2AB or 2 BA.
Fig. 3 A categorical visual score based on the color distribution in the lesion of interest.
The bars indicate the number of lesions in each diagnostic group categorized by observer
1.
Abb. 3 Eine kategorische visuelle Bewertung aufgrund der Farbverteilung in der Läsion, die
von Interesse ist. Die Säulen geben die Anzahl der Herdläsionen in jeder diagnostischen
Gruppe an, kategorisiert durch Beobachter 1.
Semi-quantitative evaluation of strain images
The VAS evaluation showed no significant difference between Crohn’s lesions and adenocarcinomas,
while adenomas had a significantly lower VAS score indicating softer tissue. SR quantification
also could not demonstrate any difference between Crohn’s lesions and adenocarcinomas.
However, the four adenomas had a significantly lower SR than the adenocarcinomas.
The mean SR as well as median SR, intra-assay standard deviation, and VAS score for
each diagnostic group are summarized in [Table 1] and the GEE model details are listed for VAS and SR in appendices 1 and 2, respectively.
Appendix 1.
tissue strain by visual analog scale (VAS), n = 35
(GEE, linear model)
|
test of model effects
|
|
estimated marginal means
|
95 % confidence interval
|
coefficient
|
95 % confidence interval
|
p-value
|
diagnostic group
|
|
|
|
|
< 0.001
|
adenoma (n = 4)
|
59.8
|
(54.8, 64.7)
|
–15.472
|
(–23.910, –7.035)
|
|
Crohn’s lesion (n = 13)
|
81.6
|
(78.5, 84.8)
|
6.393
|
(–1.037, 13.823)
|
adenocarcinoma (n = 18)
|
75.2
|
(68.5, 82.0)
|
0
|
(reference)
|
Interobserver validation of categorical and semi-quantitative elastography
For the categorical evaluation, cine loops of 28 of the original 33 lesions were re-evaluated
by an observer blinded to the clinical data. Five cases were excluded from re-evaluation
because only still images were stored and not the original video files. A comparison
between observer 1 (original) and observer 2 (blinded) yielded a Kappa value of 0.38
for the categorical scale, indicating fair interobserver agreement. The second observer
also repeated the VAS scoring in 30 of the 35 cases that had previously been scored
by observer 1. There was a significant correlation between the two observers (Pearson’s
r = 0.55, p = 0.002). The correlation plot and limits of agreement between observer
1 and 2 are shown in [Fig. 4]. Moderately good correlations were found with better interobserver agreement and
correlation for higher values of VAS than for mid-range values.
Fig. 4 Correlation plot and limits of agreement plot of VAS score agreement between observer
1 and observer 2. Observer 2 was blinded to the clinical data. The agreement improved
with higher VAS scores.
Abb. 4 Korrelationsplot und „Limits of agreement“-Plot der Übereinstimmung von Beobachter
1 und 2 bei der VAS-Auswertung. Beobachter 2 kannte die klinischen Daten nicht (Verblindung).
Die Übereinstimmung verbesserte sich mit höherer VAS-Punktzahl.
Intraobserver validation of strain ratio measurements
The comparison between triplets of SR recorded in separate loops at different levels
of E-dyn is presented in [Table 2]. We found moderate to good correlations (Spearman’s rho = 0.47 – 0.82) between the
separate recordings with different E-dyn ranges. A comparison of the median SR showed
no significant differences between recorded SR measurements between E-dyn levels 2
and 6 or between E-dyn levels 4 and 6. However, there was a significant difference
in SR at E-dyn levels 2 and 4. The intra-assay standard deviations for all SR measurements
in individual lesions are listed according to diagnosis in [Table 1]. These values showed higher measurement variation for Crohn’s lesions than for adenocarcinomas.
Appendix 2.
tissue strain measured by strain ratio (SR), n = 38
(GEE, linear model)
|
test of model effects
|
|
estimated marginal means
|
95 % confidence interval
|
coefficient
|
95 % confidence interval
|
p-value
|
diagnostic group
|
|
|
|
|
< 0.001
|
adenoma (n = 4)
|
1.31
|
(1.20, 1.42)
|
–1.202
|
(–1.755, –0.649)
|
|
Crohn’s lesion (n = 16)
|
3.33
|
(0.79, 5.88)
|
0.821
|
(–0.783, 3.425)
|
adenocarcinoma (n = 18)
|
2.51
|
(1.95, 3.08)
|
0
|
(reference)
|
Assessment of fibrosis
The number of included patients, scanned sections, the macroscopic length and the
categorical fibrosis score for each of the three diagnostic groups are presented in
[Table 3]. There was no significant difference in the macroscopic length of Crohn’s stenoses
and the maximum length of the adenocarcinomas. The adenomas were smaller than the
adenocarcinomas and Crohn’s lesions. In two separate evaluations, one pathologist
visually categorized the fibrosis in 37 sections of the same lesions as 2 = “moderate
fibrosis” in 25 and 17 cases, respectively. Category 1 = “slight fibrosis” was used
7 and 13 times, respectively. The two separate evaluations with categorical fibrosis
scores 0 – 3 had a Kappa value of 0.37. In the Crohn’s lesions, the fibrosis score
indicated variations in fibrosis in different wall layers. The mean fibrosis scores
were as follows: submucosa –2.44, subserosa –1.75 and proper muscle –1.50.
Table 1
Mean and median strain ratio (SR) and visual analog scale for the three diagnostic
entities: adenoma, Crohn’s lesion and adenocarcinoma. The SR intra-assay standard
deviation is greater for Crohn’s lesions than for adenocarcinomas indicating a larger
measurement variation.
diagnosis
|
mean strain ratio[1] (SD)
|
median strain ratio (min, max)
|
SR intra-assay standard deviation
|
Mean VAS[2]
0 – 100 mm (SD)
|
adenoma (n = 4)
|
1.31 (0.58)
|
1.25 (0.68, 2.06)
|
0.35
|
59.8 (10.4)
|
Crohn’s lesion (n = 16)
|
3.33 (5.21)
|
2.09 (1.31, 22.75)
|
1.23
|
81.7 (6.3)
|
adenocarcinoma (n = 18)
|
2.51 (1.17)
|
2.18 (0.81, 5.34)
|
0.46
|
75.2 (13.3)
|
1 Mean SR: Crohn’s lesion vs. adenocinoma: p = 0.537. Mean SR: Adenoma vs. adenocarcinoma:
p < 0.001.
2 VAS: Crohn’s lesion vs. adenocarcinoma: p = 0.092. VAS: Adenoma vs. adenocarcinoma:
p < 0.001.
Assessment of inflammation in Crohn’s specimens
The patients diagnosed with Crohn’s disease all had pain as the main symptom with
stenotic findings in the small or large bowel confirmed by radiological imaging or
endoscopy. Eight of nine patients were non-responders to high-dose steroids or TNF-antagonists.
Two of the patients who underwent surgery in 2008 were reluctant to use TNF-antagonists
or prednisolone. Two patients who underwent surgery only 3 and 8 months after diagnosis
had suffered substantial weight loss of 10 and 15 kg, respectively. One Crohn’s patient
was operated on under the suspicion of cancer in the ascending colon, although histopathology
had not confirmed this prior to surgery. This patient had only received a 10-day course
of antibiotics prior to surgery. All Crohn’s lesions exhibited transmural inflammatory
changes. Lymphoplasmacytic inflammation was assessed as slight in 4/16 cases, moderate
in 8/16 cases and severe in 4/16 cases. Other inflammatory parameters were mucosal
ulcerations in 13/16 (81 %) cases, granulation tissue in 12/16 (75 %) cases, cryptitis
in 7/16 (44 %) cases, fissure ulcers in 4/16 (25 %) cases, fistulas in 3/16 (19 %)
cases and abscesses in 0/16 cases. Lesions with high overall inflammatory activity
did not differ from lesions with low overall inflammatory activity by fibrosis score.
Furthermore, there was no difference in SR or VAS between lesions with overall high
or low inflammatory activity. Inflammatory changes in the adenocarcinomas and adenomas
were not specifically evaluated. However, none of these specimens exhibited transmural
chronic inflammatory changes. Only one adenocarcinoma specimen also contained an area
of acute inflammation and abscess formation according to the original pathology report.
Strain evaluation by tumor stage and grade
The SR and VAS results were compared by tumor stage according to the TNM classification
for the 18 examined sections of malignant lesions. The results are summarized in [Table 4]. There was a tendency for a higher fibrosis score for tumors staged as T3 or T4. We
found no significant differences in SR and VAS between lesions staged as T3 or T4
compared to T1 and T2, but the included numbers were low. The tumor grade was described
as highly differentiated (> 95 % glandular structures) in 2 cases, moderately differentiated
(50 – 95 % glandular structures) in 9 cases and poorly differentiated (< 5 % glandular
structures) in 7 cases. When highly and moderately differentiated tumors were considered
as one group (n = 11) and compared to poorly differentiated tumors (n = 7), we found
no significant differences in the SR or VAS scores. The number of lesions given fibrosis
score 2 was 8/11 (73 %) for the highly and moderately differentiated adenocarcinomas
and 5/7 (71 %) for the poorly differentiated adenocarcinomas.
Table 2
Comparison of median strain ratio measurements of the same lesions with elasticity
dynamic range (E-dyn) levels 2, 4 and 6. The differences were not normally distributed.
The exact two-tailed Wilcoxon-Mann-Whitney test was used for the comparison of paired
differences. The significance level for differences was set to p ≤ 0.05/3 = 0.017
to adjust for multiple comparisons.
E-dyn levels
compared
|
Pairs
n
|
median SR (min-max)
|
Spearman’s correlation:
rho (p-value)
|
P-value of paired differences
|
E-dyn 4
E-dyn 2
|
34
|
2.25 (0.68 – 8.28)
1.74 (0.81 – 30.79)
|
0.47 (0.005)
|
0.008
|
E-dyn 4
E-dyn 6
|
35
|
2.25 (0.68 – 8.28)
2.06 (0.33–-22.75)
|
0.55 (0.001)
|
0.849 (n.s)
|
E-dyn 2
E-dyn 6
|
34
|
1.74 (0.81–-30.79)
2.06 (0.33 – 22.75)
|
0.82 (< 0.001)
|
0.022 (n.s)
|
Discussion
Differentiation between neoplastic and inflammatory lesions by RTE
RTE demonstrated low strain indicating harder tissue in resected lesions from both
colorectal carcinomas and Crohn’s disease. The hypothesis that strain imaging could
differentiate malignant neoplastic lesions from benign lesions caused by inflammation
was based upon observations that malignant tumors in other organs tended to become
harder than benign lesions [14]
[19]
[31]
[32]
[33]. Pathologists frequently observe increased fibrosis within and in the vicinity of
malignant lesions. A similar process also occurs in chronic inflammation in which
case fibroblasts and possibly other cells produce collagen-rich interstitium resulting
in scar tissue [34]. Such fibrogenic activity seems to be a dominant feature in stenotic lesions of
Crohn’s disease. A second contributor to the hard appearance of malignant tumors may
be high interstitial pressure [23]
[25]
[35]
[36], which decreases quickly after devascularization. Hence, its probable contribution
to differences in tissue elasticity was not measureable in our study on ex-vivo tissue
specimens. Crohn’s disease is usually subjected to surgery because of intractable
stenosis not responding to medical therapy. All Crohn’s lesions in our study had extensive
fibrosis, as demonstrated by histology. Hence, increased tissue hardness was visualized
in both Crohn’s lesions and adenocarcinomas. However, it was not possible to detect
any strain differences between benign transmural lesions caused by chronic inflammation
and malignant tumors.
Strain evaluation and fibrosis in adenocarcinomas
The number of adenocarcinomas in the study was limited. Hence, we could not find a
statistically significant difference in SR or VAS score, but there was a tendency
to increase the fibrosis score and SR with increasing tumor stage. With respect to
tumor grading, we would expect poorly differentiated tumors to have a higher fibrosis
score. However, no significant strain differences were found, even if both the SR
and VAS scores showed an insignificant tendency towards higher strain for poorly differentiated
tumors. The low number of lesions included may be the cause for this (type 2 error),
and larger studies are warranted.
Strain assessment
A majority of published papers on RTE have used visual categorical scoring based on
color distribution in the lesion as the main strain evaluation tool. In this study,
we applied a categorical classification based on a distribution pattern and predominance
of separate colors within the lesions. This classification was introduced by Janssen
et al. [28] for the evaluation of pancreatic lesions using endoscopic ultrasonography, and up
to five of the original categories were used in our study. The qualitative scoring
system could not reliably separate the diagnostic entities in this study. Observer
1 classified 50 % of the adenomas, 50 % of the Crohn’s lesions and 35 % of the adenocarcinomas
as 2BA, indicating more green than blue. Category 1B, with a homogeneously green presentation,
was only used for benign lesions, but only 3 of 16 benign lesions were categorized
as 1B. The interobserver reproducibility of categorical scoring was fair (Kappa: 0.38)
when the cine loops were re-evaluated by a second observer.
Elastograms were also evaluated with a continuous VAS score, anticipated to be more
fine-grained than the categorical score, and comprising an aspect of strain comparison
between the lesion and the surrounding reference tissue. Because none of the lesions
were perceived as softer than the reference tissue, the lower half of the scale was
not utilized. VAS evaluation had a moderate correlation (r = 0.55) between the original
and an observer blinded to clinical data. The difference between the observers was
larger for low scores (moderately hard and mixed lesions) than for high scores (harder
lesions) ([Fig. 4]). This coincides with an observation that two observers frequently scored the same
lesions as 2AB (inhomogeneous, blue > green) and 2BA (inhomogeneous, green > blue),
correspondingly, on the categorical scale.
The SR is based on a computer-based comparison of mean strain in two selected areas,
and may thus seem to be more objective than the VAS and the categorical scale. However,
the selection of calculation areas is still user-dependent, and the recorded strain
may be subject to significant variations especially in heterogeneous reference areas.
Uneven application of stress within the elastogram ROI may also influence the SR results.
Hence, SR evaluation is usually quoted as a semi-quantitative method. The SR data
([Table 1]) demonstrates a large spread in strain measurements within the groups of Crohn’s
lesions and adenocarcinomas. The variation between cases in the same group may express
measurement variation and also reflect a biological diversity in tissue strain for
that entity.
Reproducibility of SR measurements
The E-dyn level of RTE influences the color presentation of strain distribution profoundly
[37]. Consequently, it is crucial to keep this setting on a standardized level for the
visual comparison of strain images in different cases. Median SR recordings at E-dyn
4 vs. 6 and E-dyn 2 vs. 6 were not significantly different. A significant difference
was found for the SR recorded at E-dyn 2 vs. 4. This was an unexpected and probably
unreliable result since such a difference was not found with a larger E-dyn span between
2 and 6. We have previously shown that SR measurements are independent of E-dyn level
in a standardized liver-tissue-mimicking phantom [38]. Therefore, we interpreted the observed SR measurement variability as the result
of the general variability of this method in separate recordings of the same lesions
using a free-hand application, and not as a consequence of the selection of different
levels of E-dyn. More dedicated studies may be required to confirm the relationship
between E-dyn level and SR measurements in biological tissues.
Relevance
The differentiation between colorectal tumors and Crohn’s lesions predominantly in
the small intestines may seem to have limited clinical value. However, these lesions
were chosen as a model for lesions caused by benign inflammation and malignant neoplasms
in the bowel wall. We selected Crohn’s lesions because they are transmural and expand
the bowel wall in a similar manner to neoplasias. Rustemovic et al. have studied patients
with ulcerative colitis and compared them to patients with Crohn’s disease and healthy
controls in vivo [39]. They found a significantly higher SR in Crohn’s patients than in ulcerative colitis
patients with active rectal disease. In the case of Crohn’s disease, the patients
with active disease also had a significantly higher SR than those in remission, indicating
a substantial contribution from increased interstitial pressure. In the case of ulcerative
colitis, the inflammation is limited to the mucosa and submucosa and does not create
the transmural inflammation and fibrosis characteristic for Crohn’s disease. From
our point of view, the contribution of endoscopic RTE in the differentiation between
Crohn’s disease and ulcerative colitis may not provide more than visualization of
the transmural inflammation resulting in bowel wall thickening clearly visible by
B-mode EUS. However, further studies may investigate whether a possible clinical application
of EUS-elastography may be the evaluation of polyps and pseudo-polyps in ulcerative
colitis, and whether a thickened bowel wall with increased tissue hardness may indicate
the development of malignancy in a patient with ulcerative colitis under surveillance.
Very few adenomas were included in this study, as the main object was to evaluate
Crohn’s lesions and adenocarcinomas. However, we found that adenomas appeared to have
higher strain and, correspondingly, lower strain ratios than both adenocarcinomas
and Crohn’s lesions. The difference in strain properties between adenomas and adenocarcinomas
in the rectum is clinically relevant, and endorectal ultrasonography (ERUS) with elastography
can potentially be a valuable preoperative diagnostic tool [33]. ERUS is currently acknowledged as the best method for selecting patients for minimally
invasive rectal surgery such as endoscopic mucosal resection or transanal endoscopic
microsurgery [40]
[41]. The preoperative classification of early rectal adenocarcinomas and adenomas remains
challenging and there is still a need for improved methods in the staging of these
tumors.
Limitations of the study
This study was performed on surgically resected specimens, in which the contribution
of interstitial pressure as a source of tissue hardness is eliminated. Hence, the
ex-vivo setup may underestimate tissue hardness in vascularized tissue. However, all
surgical specimens were examined under the same conditions, and the contribution from
interstitial pressure should be reduced in a similar way for all samples. A second
limitation is that the primary observer was not blinded to the clinical diagnosis.
A second observer, blinded to clinical data, re-evaluated the elastography and B-mode
video loops. The second evaluation was performed during a short time span (days) while
the original scoring was performed after each examination spanning a period of 12
months. This may have caused some drifting in visual scoring over time, and may partly
explain the moderate agreement for VAS and the fair agreement for the categorical
score. The low number of adenomas included in this study (4) limits the validity of
the statistical difference found between adenomas and adenocarcinomas.
Conclusion
Free-hand RTE of newly resected surgical specimens could not differentiate stenotic
lesions caused by chronic inflammation in Crohn’s disease from adenocarcinomas on
the basis of differences in semi-quantitative strain measurements. The SR and VAS
score showed significant strain differences between adenocarcinomas and a limited
number of adenomas. Pathological evaluation showed increased tissue fibrosis in both
adenocarcinomas and Crohn’s lesions. The interobserver reproducibility of the visual
evaluation using a VAS score and categorical scale was fair to moderate. The intra-assay
reproducibility of SR was more reliable, but the potential influence of E-dyn settings
on SR measurements could not be unequivocally affirmed. No significant differences
in strain assessment were found on the basis of inflammation grade in Crohn’s lesions
and by tumor stage or grade in adenocarcinomas. Ultrasound elastography is a promising
clinical method, but increased strain may be observed in malignant and benign lesions
and further studies on strain data of different lesions in vivo are warranted.
Abbreviations
RTE:
real-time elastography
SR:
strain ratio
E-dyn:
elasticity dynamic range
VAS:
visual analog scale
Appendix
Table 3
Descriptive data for the diagnostic entities adenocarcinoma, Crohn’s lesion and adenoma
including the number of patients, number of lesions, macroscopic length and number
of lesions categorized using the 0 – 3 fibrosis score by the pathologist. One patient
had both an adenocarcinoma and an adenoma and was counted twice. A total of 27 patients
were included.
diagnosis
|
patients
|
lesions
|
macroscopic length (min-max) (cm)
|
fibrosis score (0 – 3)
|
Score
|
n
|
adenocarcinoma
|
16*
|
18
|
3.75 (1.3 – 7.5)
|
Score 0
|
0
|
Score 1
|
5
|
Score 2
|
13
|
Score 3
|
0
|
Crohn’s lesion
|
9
|
16
|
3.50 (2.0 – 15.0)
|
Score 0
|
0
|
Score 1
|
2
|
Score 2
|
10
|
Score 3
|
4
|
adenoma
|
3
|
4
|
2.35 (1.0 – 3.5)
|
Score 0
|
2
|
Score 1
|
0
|
Score 2
|
2
|
Score 3
|
0
|
Table 4
Fibrosis score, strain ratio and VAS score for adenocarcinomas according to tumor
stage.
adenocarcinomas,
T-stage
|
n
|
fibrosis score = 1
|
fibrosis score = 2
|
median SR
(min-max)
|
VAS score
(mean, SD)
|
T1
|
2
|
1
|
1
|
1.73 (1.62 – 1.84)
|
62.5 (4.95)
|
T2
|
2
|
1
|
1
|
1.56 (0.81 – 2.31)
|
72.0 (16.97)
|
T3
|
8
|
3
|
5
|
3.00 (1.19 – 5.34)
|
81.1 (11.05)
|
T4
|
6
|
0
|
6
|
2.41 (1.59 – 3.18)
|
72.2 (15.29)
|