Abstract

Objective. Endometrial cancers have historically been classified by histomorphologic appearance, which is subject to interobserver disagreement. As molecular and biomarker testing has become increasingly available, the prognostic significance and accuracy of histomorphologic diagnoses have been questioned. To address these issues for a large, prospective cohort study, we provide the results of a centralized pathology review and biomarker analysis of all incidental endometrial carcinomas occurring between 1976 and 2012 in the Nurses’ Health Study. Methods. Routine histology of all () cases was reviewed for histomorphologic diagnosis. Cases were subsequently planted in a tissue microarray to explore expression of a variety of biomarkers (e.g., ER, PR, p53, PTEN, PAX2, AMACR, HNF1β, Napsin A, p16, PAX8, and GATA3). Results. Histologic subtypes included endometrioid (87.2%), serous (5.6%), carcinosarcoma (3.9%), clear cell (1.7%), and mixed type (1.7%). Biomarker results within histologic subtypes were consistent with existing literature: abnormal p53 was frequent in serous cases (74%), and HNF1β (67%), Napsin A (67%), and AMACR (83%) expression was frequent in clear cell carcinomas. Our dataset also allowed for examination of biomarker expression across non-preselected histologies. The results demonstrated that (1) HNF1β was not specific for clear cell carcinoma, (2) TP53 mutations occurred across many histologies, and (3) GATA3 was expressed across multiple histotypes, with 75% of positive cases demonstrating high-grade features. Conclusions. Our findings establish the subtypes of endometrial cancer occurring in the Nurses’ Health Study, corroborate the sensitivity of certain well-established biomarkers, and call into question previously identified associations between certain biomarkers (e.g., HNF1B) and particular histotypes.

1. Introduction

Endometrial cancers have historically been subclassified by their histomorphologic appearance alone [1, 2]. However, histomorphologic classification suffers from interobserver agreement issues, particularly in the case of high-grade endometrial carcinomas (e.g., FIGO grade 3 endometrioid, clear cell carcinoma, serous carcinoma, and carcinosarcoma) [37]. Further, in the molecular era, morphologic subtyping is considered insufficient for determining prognosis. Most famously, The Cancer Genome Atlas molecularly classified endometrial tumors into four categories which were prognostically significant irrespective of histomorphology [8].

Despite the inherent limitations in morphologic analysis of tumors, histomorphologic diagnosis largely determines inclusion in cancer registries as well as in clinical and epidemiologic research [9]. These diagnoses may suffer from changes in diagnostic categories over time, an absence of centralized pathology review, and, importantly, the absence of corroborative biomarker (immunohistochemical) data.

The application of biomarkers (i.e., immunohistochemical staining) assists pathologists in rendering more accurate diagnoses. A recent study found that up to 40% of cases initially diagnosed as high-grade endometrial endometrioid adenocarcinomas may be reclassified as uterine serous carcinomas based on a panel of four immunostains (p53, p16, estrogen receptor (ER), and mammaglobin) [10]. Immunostains in routine practice, however, are often not pursued due to a pathologist’s degree of morphologic certainty or the possibility of ambiguous results.

Further, biomarkers have their own limitations. Biomarker sensitivity and specificity have largely been determined by research conducted on highly selected cohorts of tumors, typically including only those with unambiguous morphologies. Thus, the sensitivity and specificity of markers tends to shift once the markers are applied in the clinical setting and to cases with less prototypical morphologies. For example, hepatocyte nuclear factor-1beta (HNF1β) expression was initially thought to have a specificity of 93 to 100% for clear cell carcinoma [1114]; however, overtime, the specificity has been questioned as expression has been documented in a subset of serous and endometrioid carcinomas [1517].

In this study, we attempt to address two of the aforementioned issues. First, we perform a centralized review of the incidental endometrial cancer cases arising in the Nurses’ Health Study, a large, prospective cohort study with long-term follow-up. Secondly, we attempt to systematically apply a set of biomarkers to incidental endometrial cancer cases, thus exploring expression across a spectrum of subtypes. We explore biomarkers that are currently well-established in their interpretation (e.g., p53) as well as more novel biomarkers (e.g., GATA3). Further, we explore the limitations and benefits of applying biomarkers retrospectively in the case of a large epidemiologic study, especially the feasibility of interpretation in the setting of tissue microarrays.

2. Materials and Methods

This study was approved by the Human Studies Review Board at Brigham and Women’s Hospital. The protocol for this study was approved by the Human Research Committees at Brigham and Women’s Hospital, Boston, MA, USA.

2.1. Case Selection

Using data from the Nurses’ Health Study [18, 19], a prospective cohort of 121,700 female nurses initiated in 1976, all incidental endometrial cancers diagnosed between 1976 and 2012 were identified.

When available, paraffin-embedded tissue and hematoxylin and eosin stained slides from the patient’s endometrial biopsy or hysterectomy were obtained. Only cases with paraffin blocks for the creation of tissue microarrays were included. Full hematoxylin and eosin sections of the original clinical diagnostic blocks underwent central pathology review by a pathologist (GM), and a histologic diagnosis was rendered for each case using current diagnostic criteria [1]. Additional information regarding case ascertainment and tissue microarray construction has been published elsewhere [20].

For each specimen, a representative paraffin-embedded tissue block was chosen, and three 0.6 mm cores were planted in a tissue microarray. Serial sections of each microarray were used for marker studies as below.

2.2. Immunohistochemical Studies

For each marker studied, four-micron thick sections of each tissue microarray were stained in the following sequence: hematoxylin and eosin, marker replicate 1, pankeratin AE1/AE3 (Dako Cat#M3515 at 1 : 200 dilution), and marker replicate 2. The following immunostains were performed using the Leica Bond III staining platform using the following primary antibodies: estrogen receptor (ER, murine monoclonal ERID5 (Dako) at 1 : 300 dilution), progesterone receptor (PR, murine monoclonal PgR636 (Dako) at 1 : 150 dilution), p53 (murine monoclonal PAb 1801 (Leica Biosystems) at 1 : 300 dilution), AMACR (rabbit monoclonal Z2001 (Zeta Corp) at 1 : 50), ARID1A (rabbit polyclonal HPA005456 (Sigma) at 1 : 200 dilution), GATA3 (murine monoclonal L50-823 (Biocare) at 1 : 100 dilution), HNF1β (rabbit polyclonal HPA002083 (Sigma) at 1 : 500 dilution), Napsin A (murine monoclonal NCL-L-Napsin A (Leica) at 1 : 400 dilution), p16 (murine monoclonal E6H4 (Ventana) at 1 : 300 dilution), PAX2 (polyclonal rabbit Z-RX2 (InVitrogen) at 1 : 300 dilution), PAX8 (rabbit polyclonal 10336-1-AP (Proteintech) at 1 : 600 dilution), and PTEN (murine monoclonal 6h2.1 (Millipore) at 1 : 100).

2.3. Marker Interpretation

All stained slides were digitally captured at 40x magnification by a Hamamatsu nanozoomer whole slide digital scanner. For each marker, hematoxylin and eosin and keratin-stained images of matched tissue microarray sections were used to confirm the presence of tumor tissue and to discriminate tumor cells from background as needed. Duplicate stains for each marker were independently scored for marker specific signal within tumor cells (JW and GM) (see Table 1 for scoring methodology). Discordant replicates were resolved by rereview. Markers scored on continuous scales were averaged across the duplicate runs for final data analysis. Technical failures due to tissue dropout, high background, or low signal were excluded as noninformative on a marker-by-marker basis.

Due to the limited tumor represented on the tissue microarrays, p53 was only interpreted as abnormal when staining was diffuse and strong; a “null” phenotype was not reported due to the possibility of failed staining or regional variability in tumor expression and thus the possibility of false positives [21].

2.4. Case Inclusion and Exclusion Criteria

Some patients had separate endometrial biopsies and hysterectomy specimens containing tumor. After staining, only unique specimens (1 per patient) were included in the final analysis, with preference for hysterectomy specimens over biopsies. Nonepithelial tumors were excluded. Carcinosarcoma and mixed carcinoma cases were included.

2.5. Covariates

Age at diagnosis (continuous) and body mass index at diagnosis [22] (continuous, kg/m2) were assessed from the last biennial Nurses’ Health Study questionnaire prior to cancer diagnosis.

2.6. Statistical Analyses

Statistical analysis was performed in SYSTAT (v13.1, Systat Software, Inc., San Jose, CA). One-way ANOVA was used to independently compare age and body mass index with the diagnostic tumor subtypes. -tests, with separate variances, were employed to compare biomarker expression across age and body mass index. Results with a value < 0.05 were considered statistically significant. Hierarchical clustering of bimodally scored (normal vs. abnormal) marker results was performed using Ward’s linkages and Jaccard similarity coefficient distance metric. Bimodal scoring of the continuous percentages of estrogen and progesterone receptor was done across a 10% threshold.

3. Results

Pathology materials were received for 472 patients with reported endometrial cancer. After application of the inclusion criteria, 360 patients had adequate tissue for inclusion in the tissue microarray (343 hysterectomy specimens and 17 biopsy/curettages).

3.1. Histomorphologic Review

Histomorphologic diagnoses were as follows: 314 endometrioid (87.2%), 20 serous (5.6%), 14 carcinosarcoma (3.9%), 6 clear cell (1.7%), and 6 mixed endometrioid plus nonendometrioid type (1.7%). Within the endometrioid group, 244 were grade 1, 43 were grade 2, and 27 were grade 3.

3.2. Biomarker Results within Histomorphologic Diagnoses (Table 2)

Clear cell carcinomas frequently expressed AMACR, HNF1B, and Napsin A (83.3%, 66.7%, and 66.7%, respectively). Serous carcinomas commonly displayed diffuse, strong p53 staining (mutant phenotype; 73.7%) and p16 diffuse positivity (70%). Endometrioid tumors demonstrated high rates of ER (92.7%) and PR (85.1%) expression, with the highest ER and PR expression noted in FIGO grade 1 endometrioid tumors.

3.3. Biomarker Expression across Histomorphologic Diagnoses

GATA3 was expressed across histotypes (16 cases total; 10 endometrioid (62.5%), 3 serous (18.8%), 2 carcinosarcoma (12.5%), and 1 clear cell (6.25%)). GATA3 expression correlated positively with expression of HNF1β () and p16 () and absence of staining for PAX2 (), ER (), and PR (). Rereview of the GATA3 positive tumors revealed an absence of morphologic findings consistent with mesonephric carcinomas. Further, 12 of the 16 cases showed high-grade morphology with ambiguous cytology (Figure 1). GATA3 and PAX8, respectively, markers of mesonephric and Müllerian origin, were not always mutually exclusive. 79.6% of cases (262/329) were positive for only one of the two markers, 15.8% (52/329) were negative for both, and 4.6% of cases (15/329) were positive for both.

Diffuse, strong p53 expression (mutant phenotype) was seen across histotypes. Of p53 mutant tumors (37 total), 37.8% were serous, 32.4% were endometrioid, 13.5% were carcinosarcoma, 8.1% were clear cell carcinoma, and 8.1% were mixed type. HNF1β showed similar lack of correlation with a specific histologic type with 71.9% of positive cases being endometrioid, 10.5% serous, 7.0% carcinosarcoma, 7.0% clear cell carcinoma, and 3.5% mixed type. Stated another way, 33.3% of carcinosarcomas, 66.7% of clear cell carcinomas, 13.7% of endometrioid carcinomas, 40% of mixed type tumors, and 31.6% of serous carcinomas stained positively for HNF1β.

Mutant p53 staining had variable correlation with aberrant p16 staining, a previously reported association especially in serous cancers [23]. In our overall cohort, p53 and p16 stainings were discordant in 8.9% of cases (31/347). Of these cases, 48.3% (15/31) demonstrated mutant p53 with only patchy p16 staining.

3.4. Marker Expression Patterns

Hierarchical clustering of biomarker results from 240 cases with complete data shows two main clades corresponding to PTEN and P53 mutant classes (Figure 2).

3.5. Age, Body Mass Index, and Biomarker Expression

Age at diagnosis was found to be statistically different across histologic categories (, Table 3), with endometrioid type tumors presenting on average at a younger age ( y) than other subtypes. Body mass index did not differ significantly across diagnostic categories (, Table 3).

When body mass index and age were compared with biomarker expression (Table 4), several significant associations were found. Body mass index was significantly lower in women whose tumors expressed HNF1β (26.9 vs. 30.5 kg/m2, ), Napsin A (25.1 vs. 30.0 kg/m2, ), and mutant p53 (26.0 vs. 30.3 kg/m2, ). Body mass index was significantly higher in women whose tumors expressed ER (30.3 vs. 27.2 kg/m2, ) and PR (30.4 vs. 28.2 kg/m2, ). The loss of expression of PAX2 was associated with younger age at diagnosis (68.6 vs. 70.5, ).

4. Discussion

Our study addresses two key issues. First, we performed a centralized review of the incidental endometrial cancers in the Nurses’ Health Study. Secondly, through the administration of biomarkers to a large, incidental cohort, we explored biomarker expression across a spectrum of endometrial tumors with variable morphologies.

As was expected, most incidental cases of endometrial cancer in the Nurses’ Health Study (87.2% of cases) were “type 1 endometrial carcinomas” (i.e., endometrioid adenocarcinomas arising secondary to estrogen stimulation). Such “type 1” cancers account for the majority of endometrial tumors and typically arise in obese, post-menopausal women [2426]. “Type II endometrial carcinomas” (i.e., predominantly nonendometrioid carcinomas arising in the setting of TP53 mutation or 1p deletion) [2527], were less common in our cohort, reflecting their lower prevalence in the general population.

Our biomarker exploration of these tumors confirmed the preexisting associations between type I and type II endometrial carcinomas with hormonal receptors and p53 expression. Low-grade endometrioid carcinomas (type I) expressed high rates of ER and PR, supporting the hormonal pathway to carcinogenesis. Likewise, our type 2 tumors (nonendometrioid) demonstrated frequent p53 overexpression (mutant phenotype). Such findings are not novel, but do support the feasibility and reliability of applying biomarkers to large epidemiologic studies in a retrospective fashion.

While we confirmed preexisting biomarker expression pattern within histotypes, the advantage of our study was the ability to assess biomarkers across an incidental cohort of histomorphologic subtypes. Multiple marker-histotype correlations were found to be less specific than previously thought. Specifically, while p53 abnormalities are seen in 73.7% of serous cancers, 62.2% of all p53 abnormal cancers had a nonserous histology. This experience is mirrored in other purely marker-driven endometrial cancer classification systems such as the genomic-based TCGA classification schema, where molecular and histopathologic classes only partly overlap, and some unique molecular phenotypes of clinical interest (such as polymerase E mutation) cannot be reliably identified by histology alone [8, 16].

Our study additionally contributes to the growing evidence that HNF1β is a nonspecific marker [1517]. While two-thirds of our clear-cell cases stained with HNF1β, 31.6% of serous carcinomas and 13.7% of endometrioid carcinomas expressed HNF1β, making HNF1β staining an unreliable marker in the clear cell versus serous/endometrioid differential. Thus, we conclude that HNF1β, on its own, should not be considered a specific marker for clear cell carcinoma in clinical practice.

Our findings regarding expression of GATA3 are also of interest. Prior studies demonstrate GATA3 be a highly sensitive and specific marker for mesonephric lineage in lesions of the lower female genital tract [28, 29]. In our study, GATA3 positive staining in an unselected sample of uterine tumors did not correspond well to any single histotype. Notably, none of the cases, even those of low-grade cytology, displayed the classic morphologic features of mesonephric carcinomas (e.g., small round tubules, cuboidal or flatted epithelium, and angulated vesicular nuclei). Furthermore, 12 of 16 positive cases displayed high-grade or ambiguous cytologic features, raising the possibility that poorly differentiated endometrial tumors may express GATA3 not due to lineage but rather due to gain of function mutations. This is further demonstrated by our finding that 4.6% of cases demonstrated positivity for both PAX8 and GATA3, two stains that should be mutually exclusive of each other.

An unsupervised self-organizing dendrogram of marker results in our tumors gives an indication of how combinatorial marker trends can define tumor subgroups (Figure 2). Two major clusters are evident: a p53 mutant arm and a PTEN/PAX2 arm, corresponding to “type II” and “type I” endometrial cancers, respectively. Further, there is a p16/p53 subgroup distinct from a Napsin A/AMACR/HNF1β class. Of note, GATA3 expression clustered with both clear cell markers and/or p53/p16 expression. This clustering provides additional evidence that GATA3 expression likely denotes aberrant expression/gain of function mutations in high-grade malignancies arising from mutations in TP53 rather than mesonephric lineage.

Given our unique dataset, we were able to compare biomarker expression across participants’ ages and body mass index. As expected, p53 expression was associated with significantly lower body mass index, and ER/PR expression was associated with significantly higher body mass index (additional ER/PR findings in this cohort are reported elsewhere [20]). Additionally, endometrioid-type tumors tended to present at earlier ages than other subtypes. These findings are consistent with common mechanisms of endometrial carcinogenesis in which either hormonal stimulation (typically in the setting of obesity) [30] or p53 mutation leads to neoplastic growth [25, 26, 31, 32].

Our study did highlight some of the necessities and limitations of working with tissue microarrays. Of note, when performing a centralized pathology review, we found it essential to render a morphologic diagnosis on whole H&E slides only, not on the tissue microarray preparations. This is due to the limited amount of tumor present on the tissue microarray. Secondly, due to the occasional absence of internal controls noted in the tissue microarray tissues, we recommend running separate positive controls on each tissue microarray assay. We also recommend replicate stain reads to control for idiosyncratic runs, ideally with the replicates reviewed independently by mutually blinded pathologists. The main limitation of using tissue microarrays in our study was the inability to reliably interpret p53 protein null phenotypes caused by rare nonsense mutations [33]. Thus, more of our cases likely had aberrant p53 staining than we could reliably conclude from our tissue microarray preparations.

In this study, we provide a centralized review of the incidental endometrial cancers in the Nurses’ Health Study as well as the biomarker expression results across this cohort. This data is largely applicable to future clinical/epidemiologic studies that derive data on endometrial cancers form the Nurses’ Health Study. However, we were also able to test the hypothesis that biomarker studies in which cases are preselected by pathologists likely lead to overestimated sensitivities and specificities. In the case of GATA3, p53, and HNF1β, we found this hypothesis to be true. We believe our approach of testing biomarkers across an incidental population of cancers gives a more realistic sense of the utility of such markers when applied to incidental disease. Further, our findings mimic those of The Cancer Genome Atlas and other data collected from next-generation sequencing in which there is much overlap in the mutational profiles of tumor categories as defined by histomorphology. We conclude that expression of a single biomarker (e.g., HNF1β and GATA3) should not be interpreted as diagnostic of a particular tumor type without taking into consideration the histology. We would recommend that, when in doubt of the histology, a panel of immunostains, including stains redundant for a diagnosis (e.g., PTEN, ARID1a, and ER for endometrioid adenocarcinoma; HNF1β, Napsin A, and AMACR for clear cell carcinoma; and p16 and p53 for serous carcinoma), be performed rather than relying on a single immunostain. Lastly, we show that tissue microarrays have some limitations; however, the results of biomarker studies in this setting are largely reliable and reproducible.

Data Availability

The Nurses’ Health Study data are housed within the Harvard School of Public Health and are not publicly available without an IRB and application for access.

Additional Points

Highlights. (1) This study characterizes all incidental endometrial cancers in the Nurses’ Health Study. (2) Our findings call into question specificity of several commonly used biomarkers. (3) We demonstrate that tissue microarrays are reliable and reproducible in retrospective studies.

Disclosure

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication. Jaclyn C. Watkins present address is Department of Pathology, Massachusetts General Hospital, 55 Fruit St., Boston, MA 02114, USA.

Conflicts of Interest

The authors have no conflicts of interest to report.

Authors’ Contributions

Jaclyn C. Watkins is the primary author and involved in immunohistochemical and statistical data collection, analyses, and interpretation. Michael J. Downing is involved technically in tissue microarray construction and immunohistochemical staining and provided edits to the final manuscript. Marta Crous-Bou is involved in epidemiologic portion of data collection and provided edits to final manuscript. Evan L. Busch is involved in epidemiologic portion of data collection and provided edits to the final manuscript. Maxine Chen is involved in epidemiologic portion of data collection and provided edits to the final manuscript. Immaculata De Vivo is the principal collaborator from the Nurses’ Health Study and assisted in the study design and edits to the final manuscript. George L. Mutter is the principal investigator; designed study, collected data, and performed statistical analyses; and provided final edits to the manuscript. All authors have approved the final article.

Acknowledgments

The authors would like to thank the participants and staff of the Nurses’ Health Study for their valuable contributions as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. We also thank the Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, and Harvard Medical School. Finally, we would like to acknowledge the members of the De Vivo and Mutter laboratories for their assistance. The Nurses’ Health Study is supported by the National Cancer Institute-National Institutes of Health (UM1 CA186107 and P01 CA87969). This study was supported by a grant from the National Cancer Institute (2 R01 CA082838-10). ELB and MMC were supported in part by a grant from the National Cancer Institute (5T32CA009001). Additional project support was provided by Departmental (Pathology, BWH) Research Funds (GLM).