Key words
mammographic density - breast cancer - risk
Schlüsselwörter
mammografische Dichte - Mammakarzinom - Erkrankungsrisiko
Introduction
Breast cancer risk prediction has improved a lot over the last decades. Both genetic
and nongenetic risk factors have been studied in many case-control and cohort studies
[1]. It is thought that about 20–25 % of all familial breast cancer cases can be explained
by known genetic susceptibility variants [2]. Familial breast cancer risk remains one of the most important risk factors for
breast cancer, with a relative risk (RR) of about 1.8–2.6 in case a first degree relative
has been diagnosed with breast cancer [3]. Over the last decade the importance of mammographic density as one of the major
risk factors became more and more evident and has been shown to modify the risk for
breast cancer with an odds ratio (OR) between 3 and 6 [1], [4], [5]. In addition to a pure estimate of breast cancer risk, mammographic density
(MD) has been shown to indicate the increase of breast cancer risk under hormone replacement
therapy (HRT) and to indicate the decrease of breast cancer risk under a chemoprevention
with tamoxifen [6], [7]. Recent reports showed a link of MD to some genetic variation [8] and to family history of breast cancer as well [9], however mammographic density has not yet been implemented in early detection programs
or screening for breast cancer risk prediction not for women without nor for women
with a family history of breast cancer [10].
Even though its repeated validation as a breast cancer risk factor in multiple case
control and cohort studies, MD has been criticized, because there is no standardized
measurement method for MD and most of the methods are subjective, such as the Wolfeʼs
patterns, using four categories [11], [12], Boydʼs classification, with six categories [13], and subjective assessment of the percentage density by a reader, with values between
0 and 100 % [14]. In addition to these completely subjective methods, several computer-assisted methods
have been developed, such as Madena and Cumulus
[15], [16], [17]. Specifically, these computer programs assess MD as the proportion of the area with
dense breast tissue in relation to the whole breast area on a mammogram. These methods
have served to date as the gold standard for assessing the percentage mammographic
density (PMD). Recently some automated computer measurement methods of breast cancer
risk from mammograms have been investigated in some studies [18], [19], [20].
Using computer-assisted thresholding methods such as Madena or Cumulus – together with the PMD – other measures of the two-dimensional mammograms are obtained:
the total breast area, the nondense area and the dense area. The PMD is calculated
by dividing the dense area by the total breast area. It has been hypothesized that
the absolute dense area (DA) is an indicator of breast cancer risk as well, because
a higher amount of dense breast tissue could directly correlate to a higher probability
of one of the cells within the dense area to progress to a malignant cell. Furthermore
DA has not been as strongly associated with BMI, thus maybe providing some additional
information about breast cancer risk, that is more independent from BMI [21], [22].
Therefore the aim of our study was to assess the percent mammographic density as a
risk factor in a recent German case-control study for breast cancer and to assess
whether breast cancer risk can be described more accurately by adding the measurement
of the dense area to the prediction model.
Patients and Methods
The Bavarian Breast Cancer Cases and Controls (BBCC) studies are case-control studies
conducted in northern Bavaria (a state in the southeast of Germany) and are part of
the Breast Cancer Association Consortium (BCAC). The BBCC1 study is a case-control
and cohort study that aimed at the investigation of genetic and nongenetic biomarkers
and their influence on breast cancer risk and prognosis. BBCC1 is part of the studies
[23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33]. Likewise the BBCC2 study, on which is reported in this article for the first time,
had the same study aims and is part of the BCAC as well. Patients were recruited at
the University Breast Center
Franconia between 2002 and 2010. All patients were hospital-based and healthy controls
were from the same ZIP code area and were invited by newspaper advertisements. They
had to be ≥ 18 years of age, willing to complete a standardized questionnaire on medical
history and breast cancer risk factors as well as lifestyle factors and to contribute
peripheral blood for germline DNA extraction. All patients were furthermore required
to be willing to contribute a flash frozen breast cancer tumor sample for further
molecular analysis of the tumor and adjacent healthy breast tissue. A total of 619
patients with invasive breast cancer and 468 healthy individuals were recruited.
Patients with breast cancer were eligible for the mammographic density analysis, if
a mammogram from the time of the initial diagnosis was available. All of these cancer
cases were incident. Control individuals were eligible if a mammogram was available
for them within 6 months before or after completion of the questionnaire. For both,
cases and controls, information about HRT use was not taken from the questionnaire,
but from the mammography findings, as this information is routinely documented at
the time of mammography. The final study population for which mammograms were available
consisted of 561 breast cancer patients and 361 healthy control individuals.
Assessment of mammographic percent density
The computer-based threshold density assessments and breast area measurements were
carried out by two readers with explicit training in the method used, and averages
of both measurements were taken for analysis. Mammographic images were digitized using
the CAD PRO Advantage® film digitizer (VIDAR®, Herndon, Virginia, USA). This digitizer
is capable of registering signals over the full range of film opacity, from clear
to fully black, without saturating. It has a very high precision level at 570 dots
per inch (DPI), a bit depth of 32 bits gray scale input mapped to a 16-bit (65 536)
gray scale output, a clinical optical range of 0.05–4.20, and a spatial resolution
of 44 µm. Mammograms were digitized regardless whether they were analogue films or
printout of modified digital mammograms. For assessment of the density fraction and
the dense area, the reader used the Madena software program, Version X (Eye Physics,
LLC, Los Alamitos, California, USA) [17]. The software program assigns a pixel value of 0 to the darkest (black) shade in
the image and a value of 255 to the lightest (white) shade, with linear shades of
gray being assigned linear intermediate values. The total area of the breast is outlined
using a computerized outlining tool, and the total number of pixels within the outline
is counted. The density assessment is carried out as follows: the reader outlines
a region of interest (ROI), which includes the entire breast but excludes the pectoralis
muscle, prominent veins, and fibrous strands. The reader then uses a tinting tool
to apply a color to dense pixels with gray levels at or above a threshold of x ≤ 255.
The reader sets the best threshold at which all pixels ≥ x within the ROI can be considered
to represent mammographically dense breast areas. The software estimates the total
number of pixels and the number of tinted pixels within the ROI (= dense area). The
dense area represents the
count of the tinted pixels within the ROI. Percentage density, or the fraction (%)
of the breast with densities, is the ratio of the dense area to the total breast area.
The mammograms were read in random order by the same two observers, who were unaware
of any previous classifications or pathological findings.
Statistical considerations
Characteristics of breast cancer cases and controls are presented as means and standard
deviations or frequencies and percentages. P-values for the appropriate statistical
tests indicate the distribution between cases and controls. The Welch t-test was used
for continuous characteristics, the χ2 test for categorical characteristics and the Wilcoxon rank-sum test for ordinal but
not continuous characteristics.
The association between mammographic measures (PMD and DA) and breast cancer risk
was analyzed with logistic regression models. PMD and DA were categorized on the basis
of quartiles for the purpose of comparison with earlier studies [24], [34]. Unadjusted analyses with PMD or DA as single predictor and adjusted analyses with
PMD or DA and the well-known risk factors age (continuously), BMI (continuously),
parity (ordinally) and menopausal and HRT status (categorically; “premenopausal”,
“postmenopausal and HRT usage”, “postmenopausal and no HRT usage”) as predictors were
performed. Furthermore, unadjusted and adjusted analyses with models including both
mammographic measures as predictors were performed to study the additional predictive
value of each mammographic measure, especially whether DA improves the risk prediction
in addition to MD. Odds ratios (OR) and their 95 % confidence intervals were estimated
for
the mammographic measures. The area under the receiver operating characteristic curve
(AUC) was calculated for each regression model to compare the predictive strength.
The AUC ranges from 0.5 (random prediction) to 1 (perfect prediction). The relative
goodness of fit of two nested models was tested with the likelihood ratio test. A
significant test result means that the model with more predictors predicts significantly
better breast cancer case-control status than the model with fewer predictors.
Additionally, the mammographic measures were analyzed as continuous predictors to
show independently of the above chosen categories how breast cancer risk changes when
mammographic measures vary. Therefore, logistic regression models with mammographic
measures as cubic spline predictors with 2 knots and adjusting predictors as above
were used. The models with which the figures were constructed were fitted without
patients with MD and DA, respectively, beyond the 10th and 90th percentile to avoid
an unreliable curve shape at the outer ranges of the measurements. The model comparisons
in the text, however, were based on cubic spline models fitted by the whole data set.
All of the tests were two-sided, and a p-value < 0.05 was regarded as statistically
significant. Calculations were carried out using the R system for statistical computing
(version 2.13.1; R Development Core Team, Vienna, Austria, 2011).
Results
A total of 561 cases and 376 controls were analyzed in this study. Patient characteristics
are summarized in [Table 1]. Healthy control individuals were more likely to have a lower BMI than breast cancer
patients (25.5 kg/m2 vs. 26.9 kg/m2). Among the healthy controls 21.9 % (n = 70) were premenopausal and 78.1 % (n = 249)
were peri- or postmenopausal. Among the breast cancer patients 21.2 % (n = 114) were
premenopausal and 78.8 % (n = 423) were peri- or postmenopausal. Within the group
of postmenopausal patients 55.4 % (n = 138) of the controls were HRT users and 37.4 %
(n = 158) of the breast cancer patients. There were no differences concerning the
parital status or with regards to age ([Table 1]). Age was different between the groups as the patient groups were not age-matched
but for the study aim, this was taken into consideration by adjusting for it in the
multivariate analysis.
Table 1 Characteristics of cases and control individuals.
Risk factor
|
All (n = 937) mean (SD) or n (%)
|
Cases (n = 561) mean (SD) or n (%)
|
Controls (n = 376) mean (SD) or n (%)
|
p-value
|
Age at mammogram (years)
|
59.3 (11.7)
|
60.8 (12.1)
|
57.0 (10.5)
|
< 0.0 001
|
BMI
|
26.9 (5.5)
|
26.9 (5.5)
|
25.5 (4.6)
|
< 0.0 001
|
Parity
|
|
|
|
0.32
|
|
153 (16.3 %)
|
88 (15.9 %)
|
65 (17.4 %)
|
|
|
220 (23.8 %)
|
138 (25.0 %)
|
82 (22.0 %)
|
|
|
371 (40.1 %)
|
205 (37.1 %)
|
166 (44.5 %)
|
|
|
182 (19.7 %)
|
122 (22.1 %)
|
60 (16.1 %)
|
|
Menopausal and HRT status
|
|
|
|
< 0.0 001
|
|
184 (21.5 %)
|
114 (21.2 %)
|
70 (21.9 %)
|
|
|
296 (34.6 %)
|
158 (29.4 %)
|
138 (43.3 %)
|
|
|
376 (43.9 %)
|
265 (49.4 %)
|
111 (34.8 %)
|
|
As PMD is the proportion of the dense area to the whole breast area, the values of
the dense area were highly correlated with the PMD (Spearmanʼs ρ = 0.70, [Fig. 1]). It seems that for women with a small PMD and a small DA, the correlation is somewhat
stronger than for women with a high PMD and a large DA.
Fig. 1 Association between percent mammographic density in percent (PMD) and the dense area
in pixels (DA).
Percent mammographic density and dense area were associated with those risk factors
that are commonly known to be associated with mammographic density with the expected
effect sizes and directions ([Table 2]). BMI was stronger correlated with PMD (ρ = −0.56) than with DA (ρ = −0.11). Likewise
Association with age was stronger for PMD (ρ = −0.45) than for DA (ρ = −0.29). The
association for parity was of similar strength.
Table 2 Correlation between patient charactistics and mammographic measures. Spearmanʼs correlation
coefficient ρ is shown.
Risk factor
|
All (n = 937)
|
Cases (n = 561)
|
Controls (n = 376)
|
|
Percent mammographic density
|
Age at mammogram
|
− 0.45
|
− 0.44
|
− 0.45
|
BMI
|
− 0.56
|
− 0.55
|
− 0.56
|
Parity
|
− 0.12
|
− 0.12
|
− 0.13
|
Dense area (DA)
|
0.70
|
0.68
|
0.70
|
|
Dense area
|
Age at mammogram
|
− 0.29
|
− 0.24
|
− 0.31
|
BMI
|
− 0.11
|
− 0.10
|
− 0.08
|
Parity
|
− 0.10
|
− 0.04
|
− 0.19
|
We built several logistic regression models to assess their ability to predict case-control
status. The models were then compared in order to answer the primary study aim, whether
the parameter DA contributes to a better predictive value of the model as assessed
by AUC and compared by the likelihood ratio test.
In the multivariate analysis including commonly known risk factors and mammographic
density, quartiles of mammographic density, the predictive value concerning breast
cancer risk was confirmed. The first quartile comprised women with mammographic densities
from 0 to 21 % and was our reference group for all comparisons ([Table 3]). The OR for women with a mammographic density within quartile 2 (PMD from 21–32 %),
quartile 3 (PMD from 33–49 %) and quartile 4 (PMD more than 49 %) were 1.64 (95 %
CI: 1.06–2.52), 1.57 (95 % CI: 1.00–2.46) and 2.12 (95 % CI: 1.25–3.62). The AUC for
this model was 0.66 as compared to 0.65 for the model including only commonly known
risk factors. The predictive value was significantly better for the first model (p = 0.03,
likelihood ratio test). [Fig. 2] shows continuous ORs for PMD estimated by a cubic spline regression model. The log
OR-curve agrees with the categorical OR from above. The
predictive power of both PMD models coincide (both AUCs were 0.66), i.e., the chosen
categories seem to be sensible.
Fig. 2 Percent mammographic density (PMD) as continuous predictor for breast cancer. Continuous
odds ratio (OR) estimated by a cubic spline logistic regression model is shown. The
ORs are adjusted for age, BMI, parity, and menopausal and HRT status. Median PMD (= 33)
is used as baseline (i.e., log OR = 0). Vertical lines indicate PMD quartiles.
Table 3 Percent mammographic density (PMD) as a risk factor for breast cancer. Odds ratios
(OR) with 95 % confidence intervals in brackets and corresponding p-values are shown.
PMD categories (quartiles)
|
OR unadjusted
|
p-value
|
OR adjusted for age, menopausal status and HRT-usage, BMI, parity
|
p-value
|
Q1 (< 21)
|
1 (reference)
|
–
|
1 (reference)
|
–
|
Q2 (21–32)
|
1.25 (0.86, 1.82)
|
0.25
|
1.64 (1.06, 2.52)
|
0.03
|
Q3 (33–49)
|
0.93 (0.64, 1.35)
|
0.71
|
1.57 (1.00, 2.46)
|
0.05
|
Q4 (> 49)
|
0.76 (0.53, 1.10)
|
0.15
|
2.12 (1.25, 3.62)
|
< 0.01
|
Similar analysis for the dense areas showed the following adjusted OR for the quartiles
([Table 4]). Q2: OR = 0.92 (95 % CI: 0.60–1.39), Q3: OR = 1.10 (95 % CI: 0.72–1.70) and Q4:
OR = 0.75 (95 % CI: 0.49–1.15). Overall there seemed to be no correlation between
the DA and breast cancer risk. The AUC remained 0.65 regardless of whether DA was
included in the prediction model or not. The model comparison with the likelihood
ratio test yielded p = 0.29. [Fig. 3] shows continuous ORs for DA. As before, the log OR-curve agrees with the categorical
OR, and both DA models had the same AUC value.
Fig. 3 Dense area (DA) as continuous predictor for breast cancer. Continuous odds ratio
(OR) estimated by a cubic spline logistic regression model is shown. The ORs are adjusted
for age, BMI, parity, and menopausal and HRT status. Median DA is used as baseline
(i.e., log OR = 0). Vertical lines indicate DA quartiles.
Table 4 Dense area (DA) as a risk factor for breast cancer. Odds ratios with 95 % confidence
intervals in brackets and corresponding p-values are shown.
DA categories (quartiles)
|
OR unadjusted
|
p-value
|
OR adjusted for age, menopausal status and HRT-usage, BMI, parity
|
p-value
|
Q1
|
1 (reference)
|
–
|
1 (reference)
|
–
|
Q2
|
0.80 (0.55, 1.17)
|
0.25
|
0.92 (0.60, 1.39)
|
0.68
|
Q3
|
0.88 (0.60, 1.28)
|
0.50
|
1.10 (0.72, 1.70)
|
0.65
|
Q4
|
0.56 (0.39, 0.81)
|
< 0.01
|
0.75 (0.49, 1.15)
|
0.19
|
Including both mammogram measurements, PMD and DA, improved the prediction model further.
The AUC for a prediction model including commonly known risk factors, PMD and DA was
0.68. Comparing the models with the likelihood ratio test showed that this model was
better than the model that only included risk factors (p < 0.001), better than the
model that included risk factors and DA (p < 0.001) and better than the model including
risk factors and PMD (p < 0.01). The additional predictive value of DA became apparent
by the highly significant ORs for PMD when DA is considered as adjusting variable
([Table 3]) and by the significant OR for DA when PMD is considered as adjusting variable ([Table 4]).
Discussion
In this breast cancer case-control study we could confirm mammographic density as
a risk factor in a German population. In addition to the biomarker percent mammographic
density, the absolute dense area on the mammogram was examined with regard to breast
cancer risk prediction. DA alone was not predictive, but the addition of DA to commonly
known risk factors and PMD did improve the prediction of breast cancer risk.
To our knowledge this is the second breast cancer case-control study from Germany
which has examined mammographic characteristics with regards to breast cancer risk.
Previously we reported on a population of patients in another study, which was larger
and the individuals were recruited in an earlier time period [24]. With regard to adjusted OR the two studies have very comparable results. Patients
with a PMD within the highest quartile have an OR of 2.12 (95 % CI: 1.25–3.62) in
the recent study and 2.3 (95 % CI: 1.5–3.6) in the older study [24]. The unadjusted OR are somewhat different ([Table 3]). However PMD is highly correlated with several factors that are known to be correlated
with breast cancer risk and PMD (e.g. BMI, age, parity, HRT use) [35], [36]. Therefore these differences could be explained by differences between both
populations concerning these parameters. Comparing the recent and the older study
with other breast cancer risk studies of mammographic density, the risk levels as
assessed by our group in Germany seem to be slightly lower than in other published
reports [4].
The primary aim of this analysis was to assess in how far absolute DA of a mammogram
can or cannot improve the estimation of breast cancer risk. It is known that mammographic
dense areas are highly correlated to both, the amount of epithelial breast cells and
the amount of extracellular matrix [37], [38]. Therefore the DA might have a higher correlation to the absolute count of epithelial
cells, and it has been hypothesized that the correlation of breast cancer risk with
mammographic characteristics could be explained stochastically.
An Australian study examined a similar question [39] like our study and found DA to be a better predictor than PMD with regards to breast
cancer risk. Estimates of the OR concerning PMD were very similar to our study, however
this correlation was not maintained when adding dense area to the prediction model
and PMD lost its predictive value [39]. Our findings show that DA helps to improve the overall estimation of breast cancer
risk, however when used alone without PMD there was no association with breast cancer
risk. As both PMD and DA are assessed together during the measurement of a mammogram
both values can easily be used for a prediction model. Therefore we suggest to use
both mammographic characteristics and not only one as implied by Stone et al. [39].
Although we could show that the inclusion of DA in a prediction model improves the
strength of the prediction of the case control status, different from other studies
[34], [40], [41], [42], [43] we could not show that the dense area without including PMD into the logistic regression
model was associated with breast cancer risk. All other parameters (age, menopausal
status, HRT use, BMI and parital status) however were correlated with DA in an expected
way ([Table 2]) [21].
There are some strengths and some weaknesses to this study that should be taken into
consideration when interpreting our results. One of the weaknesses could be the limited
power due to sample size. Mammographic density and dense area are known to be influenced
by a series of confounders that are associated with mammographic density and breast
cancer risk as well. In our study some of the subgroups were rather small, and therefore
some of the described effects could be due to chance. However when comparing the OR
for PMD quartiles after adjustment for these risk factors there seemed to be an effect
size that was comparable to our first study [24]. Our cases were hospital-based and the controls were invited by newspaper advertisements.
This could be the reason that we found more HRT users in controls than in cases. Other
studies by our group could show that awareness of breast cancer risk factors is associated
with a higher willingness to take
part in breast cancer prevention trials [44], [45]. Therefore women who are aware of HRT as a risk factor for breast cancer could be
more willing to serve as controls for a breast cancer case control study. Strengths
of this study are the assessment of mammographic density and the availability of biomaterials.
Every mammogram was assessed using a computer-assisted method and each mammogram was
assessed by two different readers, who were not aware of the other readerʼs results.
The average of both measurements was taken for analysis. This may reduce measurement
inaccuracies. Another strength of this study is that biomaterials of all patients
are available and can be used to answer further molecular questions concerning breast
cancer risk genes or gene expression within the tumor, however this is not part of
the analysis we present here. It is known however that the availability of frozen
tumors is associated with
slightly larger tumors, as a sample is more difficult to obtain from smaller tumors.
This as well could have influenced the characteristics of the patient population.
Conclusion
The use of imaging characteristics of the breast for breast cancer risk estimation
is established and used in many case-control and cohort studies. Besides percent mammographic
density, there seem to be more characteristics that are either detectable by automated
methods [18] or to be found in the third dimension [46]. One parameter that is easy to assess together with percent mammographic density
is the dense area. The use of dense area alone should be investigated in further studies,
as our results were conflicting with other studies. It seems to be reasonable to include
the measurement of dense area into a breast cancer risk prediction model, as it improves
the strength of a prediction model.