Main

Human papillomavirus (HPV) is the primary cause of oropharyngeal cancer (OPSCC), accounting for 40–80% of cases in the western world (D’Souza et al, 2007; Termine et al, 2008; Adelstein et al, 2009; Dayyani et al, 2010). The incidence of HPV-associated OPSCC has increased significantly over the past 40 years, and continues to rise (Adelstein et al, 2009; Chaturvedi et al, 2011).

HPV-associated OPSCC commonly presents at advanced stage and is usually poorly differentiated (Adelstein et al, 2009; Sethi et al, 2012). Although both factors would be expected to result in poor outcomes, studies report significantly better long-term survival in most HPV-positive OPSCC patients compared with those with HPV-negative disease of matched stage (Licitra et al, 2006; Fakhry et al, 2008; Worden et al, 2008; Lassen et al, 2009; Ang et al, 2010; Dayyani et al, 2010). The explanation for this remains unclear and may be multifactorial. Human papillomavirus-positive tumours have improved response to chemotherapy/radiotherapy compared with HPV-negative tumours, possibly modulated through the expression of wild-type p53 and Rb tumour suppressor genes; patients with HPV-positive cancers have a lower risk of developing secondary primary tumours, probably due to the absence of field cancerisation; in addition, HPV status inversely correlates with biomarkers of poor prognosis such as EGFR (Westra et al, 2008; Adelstein et al, 2009; Hong et al, 2010a).

A potential explanation for survival benefit may be that tumour-infiltrating T lymphocytes (TILs), which permeate many HPV-positive OPSCC, confer a protective effect through an adaptive host immune response directed against viral antigens, and HPV-16 E7-specific T cells have been reported in OPSCC patients (Albers et al, 2005; Wansom et al, 2010, 2012). Although TIL abundance is associated with improved clinical outcomes in a number of other tumour types, their role in head and neck malignancy is not well documented (Galon et al, 2006; Uppaluri et al, 2008; Gooden et al, 2011).

Several studies have reported that the survival advantage of HPV-driven tumours is independent of treatment modality, whether surgery, radiation, or chemoradiotherapy (or combinations thereof), raising the possibility that some HPV-positive patients receive unnecessary treatment (Licitra et al, 2006; Fakhry et al, 2008; Adelstein et al, 2009; Lassen et al, 2009; Ang et al, 2010; Hong et al, 2010b). This has led to suggestions that treatment be de-intensified to reduce therapy-related morbidity (Adelstein et al, 2009), and a number of clinical trials are testing this currently. However, within the HPV-positive OPSCC population, there remains a significant minority that respond poorly to treatment and have a poor prognosis. Ang et al (2010) identified three different ‘at-risk’ groups for patients with oropharyngeal cancer and found that HPV-positive patients who were heavy smokers and had advanced nodal disease (>10 pack-years smoking with N2b disease or above) were at an ‘intermediate’ risk of death. These factors have subsequently been validated independently and have been used as exclusion criteria in some de-escalation trials (Ang et al, 2010; Granata et al, 2012). Other authors have also described an association between heavy smoking and poor outcome in patients with HPV-positive tumours (Adelstein et al, 2009; Gillison et al, 2012; Sethi et al, 2012); however, there is still no widely accepted strategy for identifying high-risk HPV-positive patients.

In this study, we assess the relative contributions of HPV status and pathological, molecular, and immunological features in survival from OPSCC. We correlate them with known clinical predictors of outcome to formulate a predictive tool for OPSCC patients with the ultimate aim of facilitating stratification of patients for treatment decisions.

Patients and methods

We retrospectively identified 442 consecutively treated patients with OPSCC (University Hospital Southampton (UHS; 2000–2010), n=316; Poole NHS Foundation Trust and Bart’s and the London NHS Trust (PFT/BLT; 2000–2006), n=126. Patients were excluded from further analysis if they had follow-up of <6 months, had received either no treatment or treatment without curative intent, or had an unknown cause of death. Following these exclusions, 402 patients remained. Archived pathology material was available for 274 patients who were subsequently analysed further. Demographics and survival of patients without pathology material were comparable to those with pathology material (Supplementary Information). All patients were analysed according to REMARK guidelines (McShane et al, 2005). Data were obtained from patient records on age, sex, site, smoking (non-, ex- (stopped smoking >6 months previously), light smokers (defined as <10 pack year history (Ang et al, 2010; Gillison et al, 2012)), heavy smokers (>10 pack year history)), alcohol (non-/ex-drinker, current drinker >10 units per week), T stage, N stage, treatment modality, treatment failure, tumour recurrence, date, and cause of death. Data on smoking and alcohol consumption can be difficult to accurately obtain in retrospective studies, often because of poor documentation. In this study, smoking data were more accurately recorded in the medical records than alcohol consumption, with reliable data available in 87% and 74% of cases, respectively. Demographics are shown in Table 1. Ethical approval was obtained (UKCRN 8130; ISRCTN 71276356; and REC references 09/H0501/90 and 07/Q0405/1).

Table 1 Baseline characteristics of patients in the retrospective patient cohorts (i.e., at the time of diagnosis)

Treatment

A treatment summary is included in Table 1. The centres involved in this study differed in their approach to treatment of OPSCC, with more patients being surgically treated at UHS. This allowed a direct comparison of survival according to treatment modality. Patients undergoing chemoradiotherapy received 2–6 doses of platinum-based chemotherapy (either cisplatin or carboplatin) combined with 64–66 Gy of radiotherapy given in 32–33 fractions. No patients received accelerated fractionation or intensity-modulated radiotherapy. Neoadjuvant chemotherapy was 1-2 doses of either cisplatin or carboplatin, in conjunction with 5-flurouracil. Of the surgically treated patients, 78% received post-operative radiotherapy. Full details are given in Supplementary Information.

Histology and immunochemistry

Tumour histology was reviewed by pathologists (GJT, SJ, and KP) blinded to treatment and outcome, according to the 1998 UK Royal College of Pathologist Guidelines, and a suitable paraffin block was selected. From a haematoxylin/eosin (H+E)-stained section, data were obtained on grade (well, moderate, or poorly differentiated). A prominent lymphocytic infiltrate (TIL) was scored under low-power magnification ( × 2·5 objective) as high (diffuse; present in >80% of tumour/stroma), low (weak/absent; present in <20% of tumour/stroma), or moderate (patchy; present in 20–80% of tumour/stroma) as described (Marsh et al, 2011). Concordance between pathologists in identifying TILlow tumours was >95%. Typical examples of TILhigh and TILlow tumours are shown in Figure 1D and Supplementary Information.

Figure 1
figure 1

Kaplan–Meier curves for OPSCC mortality in combined patient cohorts. (A) Kaplan–Meier curves for OPSCC mortality according to HPV status (log-rank test, P<0·001). (B) Immunochemistry (p16) and ISH (HPV) showing an example of an HPV-positive tumour. (C) Kaplan–Meier curves for OPSCC mortality according to abundance of infiltrating lymphocytes (TIL; high, moderate or low; log-rank test, P<0·001). (D) H&E-stained sections showing examples of OPSCC with high and low TIL infiltrates. Tumour islands are arrowed. In TILhigh tumours, lymphocytes fill the stromal compartment and infiltrate into tumour islands. (E) Kaplan–Meier curves for OPSCC mortality stratified according to HPV status and TIL levels. Patients with HPV+/TILlow tumours show similar survival to HPV-negative patients. (F) Immunochemistry showing typical examples of HPV-positive OPSCC with high and low TIL. Tumour is positive for p16, T lymphocytes for CD3.

Tissue microarrays (TMAs) were constructed using triplicate, randomly selected, paraffin-embedded 1-mm tumour cores (Aphelys Minicore 2, Mitogen, Harpenden, UK). Automated immunostaining (Ventana XT, Ventana, Tucson, AZ, USA) was performed in a CPA-accredited clinical cellular pathology department using antibodies optimised to national diagnostic standards (NEQAS). The antigenic targets were as follows: p16 (CINtec, Roche, Burgess Hill, UK), EGFR (Dako, Ely, UK), T-lymphocyte markers (CD3, pan T cells; CD4, helper T cells; CD8, cytotoxic T cells (all from Novocastra, Milton Keynes, UK); FoxP3, regulatory T cells (eBioscience, Hatfield, UK), and B lymphocytes (CD20) (Novocastra). TILs were quantified using a Zeiss AxioCam MRc5 microscope (Zeiss, Cambridge, UK) and Zeiss Axiovision software (version 4·8·1·0) (Zeiss). An average intratumoural TIL score/high-power field ( × 400) was calculated across representative areas of each triplicate core to allow for intratumoural heterogeneity (i.e., an average of three counts across the tumour). EGFR and p16 were scored as positive (>50% tumour cells positive) or negative (50% tumour cells positive).

In situ hybridisation and HPV classification

HPV in situ hybridisation (ISH) was performed using Genpoint HPV DNA Probe Cocktail (Dako), which recognises the 13 most prevalent high-risk HPV genotypes, including HPV-16 and -18. Cases were scored as negative or positive. Tumours positive both for p16 immunochemistry and HPV ISH were classified as HPV positive (Robinson et al, 2010).

Microarray data analysis

Affymetrix U133 Plus 2.0 CEL files for 46 HNSCC patient samples (24 survivors and 22 non-survivors) were downloaded from a publicly available microarray data set (http://bioinformatics.picr.man.ac.uk/vice/PublicProject.vice?pid=361) and normalised in BioConductor-R (version 2.15.1) (Bioconductor, Seattle, WA, USA). Gene-set enrichment analysis was performed using the GSEA 2.7 software package (Broad Institute of MIT and Harvard, Cambridge, MA, USA). All 15,031 genes differentially expressed between survivors and non-survivors were rank ordered and tested for enrichment of gene sets upregulated during T-cell differentiation (Best et al, 2013).

Statistics

The primary endpoint was death from OPSCC (i.e., disease-specific survival (DSS)). An OPSCC-specific survival time was measured from the date of diagnosis until date of death from OPSCC or date last seen alive. For those that died of other causes, survival from OPSCC was censored at the time of death. Patients in whom the cause of death was unknown, or who received either no treatment or treatment with palliative intent, were excluded from all survival analyses. Kaplan–Meier plots (with log-rank tests) and both unadjusted and adjusted Cox proportional hazards models were used to analyse the survival data. Correlations between factors were assessed by cross-tabulation and the use of either Fisher’s exact test or χ2-test where appropriate.

To evaluate the prognostic performance of each factor (i.e., the 3-year OPSCC death rate), we calculated the detection rate (DR; also known as sensitivity) and false-positive rate (FPR; also known as 1−specificity). Detection rate is the proportion of patients who died from OPSCC with marker-positive results; FPR is the proportion of patients who did not die from OPSCC with marker-positive results. Likelihood ratios (DR/FPR) were obtained, which indicate the strength of a marker (i.e., maximising the DR while minimising the FPR; Deeks and Altman, 2004).

A prognostic model for 3-year mortality was developed using binary logistic regression, with a backwards selection procedure (Royston et al, 2009). Variables were retained in the model if they were significant at the 5% level. The UHS cohort was used as the ‘training’ set to develop the model, with the PFT/BLT cohort acting as the ‘validation’ set. A Receiver-operating characteristic (ROC) curve of the linear predictor was used to identify the optimum cut point (balancing both sensitivity and specificity; Royston et al, 2009). The prognostic ability of the final model was then assessed through the calculation of DR and FPR values in the ‘validation’ set (Heagerty et al, 2000). All statistical tests were two sided and a P-value of 0.05 was considered significant. Statistical analyses were performed using SPSS for Windows version 19.0 (IBM, Portsmouth, UK).

Results

Clinical, pathological, and immunochemical characteristics at the time of diagnosis are shown in Table 1. Human papillomavirus-positive tumours were defined as those positive for both p16 and HPV ISH (Robinson et al, 2010). Similar to previous studies, 94% of p16-positive cases were HPV ISH positive and 89% of HPV ISH-positive cases were p16 positive (Robinson et al, 2010).

Of the 274 patients included in the study, 149 (54.4%) were classified as HPV positive. Human papillomavirus status could not be established in four cases (insufficient tumour tissue remaining in the paraffin block); thus, further analysis was performed on 270 patients (median follow-up 58 months and minimum follow-up 8 months). There were 82 deaths from OPSCC over the study period. Consistent with previous studies, patients with HPV-positive tumours had significantly improved DSS compared with those that were HPV negative (Figure 1A; hazard ratio (HR)=0.33 (0.21–0.53)). Table 2A shows the HRs for OPSCC mortality for each factor separately. Improved survival was significantly associated with HPV-positive tumours, a younger age at diagnosis, non-, ex-, or light smokers (defined as <10 pack year history (Ang et al, 2010; Gillison et al, 2012)), early T-stage tumours (T1/2), early N-stage tumours (N0–N2a), tumours that were EGFR negative, and a high level of TILs (assessed on an H+E-stained section). When a multivariate analysis was performed (adjusting for age, T stage, N stage, and smoking status), HPV status and TIL levels remained statistically significant markers of OPSCC mortality (Table 2A).

Table 2A HRs for disease-specific death from OPSCC

In unstratified OPSCC, high TIL levels predicted for survival (Figure 1C; Table 2A) and correlated significantly with HPV-positive tumours (P<0.001). The percentage of HPV-positive patients with high, moderate, and low TIL was 49%, 36%, and 15% respectively (16%, 38%, 46%, respectively, in HPV-negative tumours). Figure 1D and F show typical examples of HPV-positive tumours with high and low TIL levels. Demographics of these groups are shown in Supplementary Information.

There was a highly significant difference between the Kaplan–Meier curves for HPV-positive OPSCC DSS according to TIL levels (P<0.001; Figure 1E); patients with HPV-positive/TILlow tumours showed similar DSS to HPV-negative patients (Figure 1E) with an adjusted HR of 1.01 (P=0.98; Table 2A). TIL levels predicted similarly for progression-free survival (PFS; log-rank P<0.001; data not shown); patients with HPV-positive/TILlow tumours also showed similar PFS to HPV-negative patients with an adjusted HR of 0.64 (P=0.23). The adjusted HRs for HPV-positive tumours with moderate and high TIL levels indicated a 49% and 79% reduction in OPSCC mortality compared with HPV-negative tumours (adjusted HRs, 0·51 and 0·21, respectively; P=0.17 and P=0.02; Table 2B). In HPV-negative tumours, patients with TILhigh tumours did not show significantly improved survival (adjusted HR, 0.47, P=0.17) over those with TILlow tumours.

Table 2B HRs for disease-specific death in OPSCC stratified by HPV status

Similar to previous studies, we found that heavy smoking reduced survival benefit in HPV-positive patients (HR, 5.81 (2.38–14.16)). We therefore assessed whether there was an association between heavy smoking and low TIL levels; low TIL levels did not correlate with heavy smoking (>10 pack-years) and there was no significant difference in the proportion of TILlow tumours according to the amount smoked (percentage of TILlow tumours: <10 pack year history 14.1%, >10 pack year history 22.5%, P=0.15).

We performed more complex analysis of the immune infiltrate to establish whether this improved prognostication in HPV-positive tumours. We quantified intratumoural T-lymphocyte number (CD3) and T-cell subsets (CD4, CD8, and Foxp3). We performed ROC analysis (death from OPSCC at 3 years) to determine the predictive value of T-cell number and/or subtype, including subtype ratios (CD4:CD8 and CD8:FoxP3), in comparison with simple H+E evaluation of TIL infiltrate. Immunochemical quantification did not outperform the simpler scoring method (Figure 2A and B, and Supplementary Information).

Figure 2
figure 2

Receiver-operating characteristic (ROC) curve for 3-year mortality in HPV-positive OPSCC to determine the predictive value of T-cell number and/or subtype. (A) Immunochemistry showing representative example of an OPSCC stained with T-cell markers (CD3, pan T cells; CD4, helper T cells; CD8 cytotoxic T cells; and FoxP3, regulatory T cells). (B) ROC curve for 3-year HPV-positive OPSCC mortality using TIL levels (scored on H+E-stained section), compared with cell counts following immunochemistry for T-cell subtypes (CD3+, CD4+, CD8+, and Foxp3+), including subtype ratios (CD4:CD8, and Foxp3:CD8). Immunochemical quantification did not outperform the simpler scoring method.

To evaluate whether high TIL levels reflected a bystander or effector response, we tested a publicly available HNSCC microarray data set (Thurlow et al, 2010) for enrichment of gene sets defined as upregulated in naive, effector, or memory T cells (Best et al, 2013). High expression of gene sets associated with CD8+ T-cell effector function were significantly enriched (enrichment score (ES)=0.70, P=0.046) in patients with better survival (Figure 3 and Supplementary Information). Genes upregulated in naive or late memory CD8+ T cells were overrepresented in patients with poor survival but at non-significant levels, possibly reflecting their TILlow status (ES=0.30, P=0.46; data not shown).

Figure 3
figure 3

T cell gene-set enrichment in HNSCC. Gene-set enrichment analysis of publicly available gene sets describing effector CD8+ T cells. Genes to the left and right of the rank-ordered list are enriched in HNSCC patients with early (survival <3 years, mean 16 months) or late mortality (survival >3 years, mean 49 months), respectively. The rank-ordered list of genes describing effector CD8+ T cells is shown in Supplementary Information.

Table 3 shows the prognostic performance of each factor (using DR and FPR) for predicting 3-year OPSCC mortality in HPV-positive and HPV-negative patients. Commonly reported prognostic factors are included. In HPV-positive OPSCC, TIL levels were the best-performing factor, with a likelihood ratio (LR) for TILlow tumour of 3.3, followed by heavy smoking and advanced T stage (LR 2.78 and 2.44, respectively). In HPV-negative OPSCC, advancing age, advanced stage, and poor differentiation were the predictive factors with the highest LRs (Table 3).

Table 3 LRs for 3-year death from OPSCC

HPV-positive patients had received different types of therapy (cohorts combined: surgery n=55, chemoradiotherapy n=65, and radiotherapy n=24). When adjusted for stage and age, there were no significant treatment-dependent survival benefits (HR for DSS relative to surgery: radiotherapy 0·76, P=0.79 and chemoradiotherapy 1·25, P=0.84).

Using the UHS cohort as a training set, we used logistic regression to develop a predictive model for survival from HPV-positive OPSCC (at 3 years). The initial variables entered into the model were as follows: age, T stage, N stage, grade, HPV status, TIL level, alcohol history, EGFR, and smoking history. The final model included TIL levels (low vs moderate or high), T stage (T1/2 vs T3/4), and smoking status (heavy vs light-, ex-, or non-smoker). The regression parameter estimates were used to create a prognostic score (equation: −4.948+2.768 (if a current heavy smoker)+ 2.310 (if TILlow)+2.928 (if T3/4)). ROC analysis was performed and produced an area under the curve of 0·87 (Figure 4A). A cut-off score of −0.945 was selected based on this ROC curve (DR, 72.7%; FPR 10.2%; high risk, n=14) and applied to the PFT/BLT (test; high risk, n=7) cohort. This score was highly predictive of 3-year mortality, with 66.7% of dead patients test-positive (DR=67%) and 5.6% of alive patients test-positive (FPR=5.6%). The resultant LR for prediction of 3-year mortality was 11.9 (PFT/BLT, AUROC, 0·82; Figure 4B). Replacing TIL levels with counts or ratios of T-lymphocyte subsets did not improve this model.

Figure 4
figure 4

Receiver-operating characteristic (ROC) curves for 3-year HPV-positive OPSCC mortality. (A) ROC curve for 3-year HPV-positive OPSCC mortality derived from the UHS patient cohort (training set) using a model containing TIL levels, smoking, and T stage. The line of no effect is also shown (dashed; area under the curve (AUROC, 50%). This prognostic model was highly predictive of HPV-positive OPSCC mortality, with an AUROC of 0·87. A cut-off score of −0.945 was selected (circled in the figure) based on this ROC curve (DR, 72.7%; FPR ,10.2%) and applied to the PFT/BLT (test) cohort. (B) ROC curve for 3-year HPV-positive OPSCC mortality derived from the PFT/BLT patient cohort (test set) using the predictive model containing TIL levels, smoking, and T stage derived from the UHS training set. The line of no effect is also shown (dashed; (AUROC, 50%)). The cut-off score of 0.945 identified in the training set is circled in the figure. This score was highly predictive of 3-year mortality (AUROC, 0·82), with 66.7% of dead patients testing positive (DR=67%) and 5.6% of alive patients testing positive (FPR=11%). The resultant LR for prediction of 3-year mortality was 11.9.

Discussion

Patients with HPV-positive OPSCC have significantly better survival, although the biological reason for this remains unclear (Adelstein et al, 2009). To reduce therapy-associated morbidity, it may be possible to de-intensify treatment, particularly as several studies have shown survival benefit to be independent of therapy choice (Licitra et al, 2006; Fakhry et al, 2008; Worden et al, 2008; Adelstein et al, 2009; Lassen et al, 2009; Ang et al, 2010; Dayyani et al, 2010; Hong et al, 2010b). However, a significant minority of HPV-positive OPSCC patients experience treatment failure and have poor outcome; it has been suggested that future clinical investigations concentrate on this group in order to improve outcome (and also to avoid treatment de-intensification in these patients; Adelstein et al, 2009). Currently, there is no widely accepted strategy for the identification of poor prognosis in HPV-positive patients before therapy.

A plausible explanation for the differences in survival between HPV-positive and HPV-negative patients is that virally driven tumours provoke an adaptive immune response directed against tumour-expressed viral antigens: immune responses against the foreign viral antigen are less likely to be suppressed; there is no central immunological tolerance to confound the immune system’s attempts to control cancer. HPV-16-specific CD8-positive T cells have been detected in the blood of HPV-positive OPSCC patients and, more recently, isolated from tumours, implicating a role in the anti-tumour response (Albers et al, 2005; Heusinkveld et al, 2012; Wansom et al, 2012). Several recent studies have further demonstrated anti-tumour immunity in HPV-positive OPSCC. Liang et al (2012) found better survival in HPV-positive patients with seropositivity to E6/E7. Furthermore, infiltration of HPV-positive HNSCC by PD-1-expressing T lymphocytes has been shown to be a favourable prognostic factor (Badoual et al, 2013).

In several solid tumours, quantification of densities of various T-cell subsets, including CD8+, CD4+ Treg cells (FoxP3+CD4+), or subset ratios, have been suggested to improve predictive power over that of absolute T-cell number (CD3+; Gooden et al, 2011). Cytotoxic CD8+ T cells are the principal anti-tumour effector cells, and their abundance has been shown to be a predictor of positive outcome in several tumour types, particularly colorectal cancer, suggesting that the adaptive immune system has a role in suppressing tumour progression (Galon et al, 2006). Our analysis of a publicly available HNSCC microarray data set showed enrichment of genes associated with CD8+ T-cell effector function in patients with improved survival, arguing that the T cells are not simply ‘innocent bystanders’ of the biological events in the tumour but take an active role in tumour recognition.

We conducted this study to examine the effects of HPV status and TIL levels on survival in OPSCC patients. Similar to other large studies, HPV-positive tumours were associated with significantly improved survival (3-year survival; HPV-positive 82% vs HPV-negative 56%, P<0.001); pathological and clinical features of HPV-positive tumours were consistent with these series. The HPV-positive OPSCC were generally of late stage and were poorly differentiated (both P<0.001). HPV-positivity inversely correlated with smoking, and with EGFR expression (all P<0.005). TIL levels also correlated significantly with the HPV status (P<0.001), with around 85% of HPV-positive OPSCC tumours containing high or moderate levels of TIL. Taking into account the array data, this is likely to reflect an adaptive anti-tumour response as the mechanism for improved outcome. TIL levels stratified HPV-positive OPSCC patients into those with good and poor prognosis. The 3-year survival for HPV-positive/TILhigh tumours was 96% compared with 76% for HPV-positive/TILmod and 59% for HPV-positive/TILlow. The HPV-positive/TILlow tumours had similar survival to HPV-negative tumours (3-year survival, 56%). We also quantified densities of CD3-, CD4-, CD8-, and FoxP3-positive T cells and carried out ROC analysis to determine the predictive value of cell numbers or subset ratios. These more complex analyses did not outperform the simpler scoring method performed on H+E-stained sections. It is possible that lymphocyte counts performed on randomly selected TMA cores may not be representative of the tumour as a whole. However, TMA cell counts significantly correlated with whole-section TIL levels, arguing against this (e.g., total (CD3+) T-lymphocyte count: TILlow 42.9, TILmod 60.4, and TILhigh 96.3, P<0.001).

In keeping with previously published studies (Badoual et al, 2006; Bron et al, 2013), we found that infiltration with FoxP3-positive regulatory T cells was associated with improved survival (Figure 2B). However, we feel that the increased numbers seen in HPV-positive tumours simply reflect an increase in overall T-lymphocyte numbers in these tumours. Indeed, the proportion of Foxp3-positive cells was reduced in HPV-positive tumours, compared with those that were HPV-negative (Supplementary Information). Interestingly, TIL levels did not predict for survival in HPV-negative tumours.

Several studies have reported potential clinical and molecular predictors for identifying poor prognosis in patients with HPV-positive tumours. The HPV-positive light- or non-smokers have the best outcome in OPSCC (Hafkamp et al, 2008; Kumar et al, 2008; Gillison et al, 2012; Sethi et al, 2012). We also found that HPV-positive current heavy smokers had reduced survival benefit compared with non-, ex-, or light smokers (3-year survival; HPV-positive/current heavy smoker 63% vs HPV-positive/non-, ex-, current light smoker 94%; Supplementary Information). There was no significant association between heavy smoking and low TIL levels, arguing against TIL levels acting as a surrogate marker for heavy smoking. However, it is possible that poor documentation in retrospective records may be responsible for this lack of correlation, and this should be examined prospectively. We calculated LRs to indicate the strength of a marker (maximising the DR while minimising the FPR). In HPV-positive patients, low TIL levels and smoking status were the best predictors of mortality at 3 years (LRs 3·3 and 2·8, respectively). This was reflected in our prognostic model where, in the ‘training’ set (UHS patients) only TIL levels, smoking, and T stage were significant (AUROC analysis 0·87). When this model was applied to the ‘validation set’ (PFT/BLT cohort), it was found to be highly predictive (DR 66.7%; FPR 5.6%; LR for 3-year mortality 11.9). This LR is better than other currently used predictive tests. For example, the unrehydrated guaiac fecal occult blood test used in screening for colorectal carcinoma has an LR of 6.4 (DR 64% and FPR 10%) (Allison et al, 2007). The 3-year survival of low-risk HPV-positive patients (score <−0.945) was 94% compared with 36.8% for those that were classified as patients with high risk (score >−0.945, P<0.001).

Ang et al (2010) have suggested a predictive model for HPV-positive OPSCC where patients who are heavy smokers (>10 pack year history) and have advanced nodal disease (>N2b) are at increased risk of death. We compared this model with our own, and calculated both LRs and 3-year survival rates. The LR for the Ang model in our combined cohort was 4.8 (DR 61.1% and FPR 12.8%; compared with LR=8.6 (DR=70.6% and FPR=8.2%) using our model) and the 3-year survival was 91.5% for low-risk patients and 50% for high-risk patients (compared with 94% and 36.8%, respectively, using our model). Clearly, both models have significant predictive power. Interestingly, we found no association between TIL levels and nodal status in HPV-positive OPSCC (P=0.43).

Retrospective studies have intrinsic limitations. In this study, cases were excluded if there was insufficient archival material for full pathological and immunochemical analysis. However, demographics and survival of excluded patients did not differ significantly from those included in the study (Supplementary Information). Most patients in this study came from the same geographical area of England, and we cannot therefore assess the effect of geographical, genetic, or ethnic influences on our findings. The number of patients with HPV-positive/TILlow tumours was relatively small and, thus, any conclusions drawn must perhaps be treated with some caution. However, this may simply reflect the fact that the vast majority of HPV-positive tumours are associated with a prominent immune response. Similarly, it could be argued that the heterogenous treatments received by patients in this study complicates the analysis. However, the uniformly improved survival of HPV-positive patients, regardless of treatment modality, argues that the reason for this improved prognosis is related to an intrinsic feature of these tumours. Prospective analysis of our prognostic model will be informative, and a multicentre validation study is currently under development.

In summary, we show that TIL levels predict for survival in OPSCC patients. High TIL levels are significantly associated with HPV status, suggesting that the reason for improved survival in most HPV-positive OPSCC is the presence of an adaptive host anti-tumour immune response. Survival in patients with HPV positive, TILlow tumours, is not significantly different than in those with HPV-negative disease. A prognostic model based on low TIL levels, heavy smoking, and late T stage is extremely effective at identifying a group of HPV-positive patients with poor survival. Tumour-infiltrating lymphocyte evaluation can be performed quickly on diagnostic biopsy, and more complex analysis of the T-cell infiltrate does not improve on the predictive power of the simpler scoring method. This is an effective tool for identifying HPV-positive OPSCC patients who are likely to respond poorly to treatment, and is highly relevant for the clinical and pathological evaluation of HPV-positive OPSCC.