In a study of 114 epidemiologically linked Zambian transmission pairs, we evaluated the impact of human leukocyte antigen class I (HLA-I)–associated amino acid polymorphisms, presumed to reflect cytotoxic T lymphocyte (CTL) escape in Gag and Nef of the virus transmitted from the chronically infected donor, on the plasma viral load (VL) in matched recipients 6 mo after infection. CTL escape mutations in Gag and Nef were seen in the donors, which were subsequently transmitted to recipients, largely unchanged soon after infection. We observed a significant correlation between the number of Gag escape mutations targeted by specific HLA-B allele–restricted CTLs and reduced VLs in the recipients. This negative correlation was most evident in newly infected individuals, whose HLA alleles were unable to effectively target Gag and select for CTL escape mutations in this gene. Nef mutations in the donor had no impact on VL in the recipient. Thus, broad Gag-specific CTL responses capable of driving virus escape in the donor may be of clinical benefit to both the donor and recipient. In addition to their direct implications for HIV-1 vaccine design, these data suggest that CTL-induced viral polymorphisms and their associated in vivo viral fitness costs could have a significant impact on HIV-1 pathogenesis.
Control of viral replication after infection has been attributed, at least in part, to CTL (CD8+) activity (1). Although all infected individuals can mount HIV-specific CTL responses, sustained, near-complete, viral control is only evident in a few who are recognized as aviremic controllers (2). Progression to AIDS occurs in the vast majority of untreated HIV infected patients, and this loss of immune control is partly caused by CTL escape mutations (3). Escape mutations modify a viral genome during infection for optimal replication in the absence of an immune response. Thus, CTLs targeting structurally or functionally conserved epitopes might benefit the host by diminishing the ability of the virus to develop escape mutations while at the same time retaining replicative fitness. Independent studies have demonstrated that CTLs directed against Gag, a relatively conserved protein with structurally critical domains, correlate with improved clinical markers of disease progression (4, 5). A recent study of >500 antiretroviral therapy naive HIV-1 patients in South Africa demonstrated that the best level of viral control occurred in patients who targeted Gag-specific CTL epitopes that are presented by specific HLA class I (HLA-I) alleles (6). In many cases, such targeting leads to the selection of specific HLA-I–associated amino acid polymorphisms that result in reduced epitope recognition (1). It remains unclear whether the clinical benefit observed in these individuals is caused by the quality of the immune responses, to viral fitness constraints resulting from CTL escape, or to both.
Evidence that it is a combination of these factors that results in reduced plasma viral load (VL) is supported by studies in individuals who carry certain HLA class I alleles and who control HIV well in the absence of antiretroviral therapy. HLA-B*57 and HLA-B*27, in particular, have been consistently associated with improved markers of clinical outcome and slower rates of progression to AIDS (7). HIV-specific CTLs restricted by these two HLA molecules dominate in individuals expressing these alleles (8), with Gag being the major CTL target. Distinct qualitative differences have been demonstrated in CTLs restricted by HLA-B*57 (9) or HLA-B*27 (10). Recent studies demonstrate reversion of CTL escape mutations upon transmission into a new host in the absence of HLA-mediated selection pressure (11, 12). These in vivo findings have been supported by in vitro studies showing a reduced replication capacity of viruses with certain CTL escape mutations (13, 14). Nevertheless, none of these mutations have been shown to negatively affect VL or modify other biological markers of clinical outcome in chronic infection. One explanation may be that CTL escape mutations, which reduce the immunological suppression of virus replication (3), can simultaneously reduce the replication capacity of the virus (11, 13, 14). In other words, although a CTL escape mutation may result in a reduced viral replication capacity, it is not likely that this will result in a lower VL in the chronically infected host, where this mutation was induced, because the virus is gaining a fitness advantage by avoiding CTLs. Such fitness effects may therefore only be evident upon transmission to a recipient lacking the same restricting HLA-I molecule.
In this report, we took advantage of the HLA-I–associated HIV-1 polymorphisms that had been defined in Gag and Nef from studies of a large South African cohort with chronic clade C infection (unpublished data). We used this information to study—in a cohort of epidemiologically linked, clade C–infected Zambian HIV-1 transmission pairs—the effects of mutations induced in the transmitted virus by donor CTLs on an early VL set point in the newly infected recipient. Our findings suggest that HLA-B–restricted mutations in Gag, but not Nef, result in a reduced VL upon transmission, and directly demonstrate for the first time in vivo that a clinically significant viral fitness cost is induced when the virus escapes immune recognition by donor CTLs.
Results And Discussion
To identify HLA-I–associated HIV-1 subtype C polymorphisms, we performed HLA-I typing on 701 untreated South African patients with chronic HIV-1 infection and sequenced the gag and nef genes of their circulating virus (unpublished data) (12, 15). Associations were corrected for viral phylogeny, as this approach was shown to significantly improve the specificity of HLA-I associations (16, 17). We limited our initial analysis to HLA-I viral polymorphisms that occurred within or flanked (within five amino acids of the N or C termini of the epitope) known HLA-I–restricted CTL epitopes. Such HIV-1 polymorphisms are more likely to represent CTL escape and less likely to represent confounding secondary compensatory mutations (11, 13, 14). These analyses resulted in the identification of 20 mutations within Gag (18 within p24) and 11 within Nef (Table I).
The majority of the mutations within Gag (17 out of 20; 85%) and Nef (8 out of 11; 73%) were HLA-B restricted. Of those mutations that have been fully evaluated, all (five in Gag and two in Nef) were formally shown to be CTL escape variants (12, 15, 18–20).
We then applied these HLA-I–associated polymorphisms, found in chronic infection, to the analyses of early HIV-1 transmission within a cohort of 114 epidemiologically linked Zambian transmission pairs. These couples were initially identified as HIV-1 discordant (i.e., one partner was HIV-1 infected and their spouse was uninfected). Despite counseling and condom provision, transmission from the chronically infected partner to their spouse continues, but at a significantly reduced rate (∼8% per year) (21). Epidemiological linkage of the transmission pairs was established through sequence analysis of the regions encoding gp41 and Gag (22). Population-based sequencing of gag and nef genes was performed on DNA RT-PCR amplified from plasma virus RNA obtained ∼6 mo after the estimated date of infection. A phylogenetic tree, constructed to determine the relationships of HIV-1 gag gene sequences between the two cohorts studied in this paper and other clade C sequences derived from the Los Alamos National Laboratory HIV Molecular Immunology Database (Fig. 1), demonstrated that Southern African sequences tend to cluster together, distinct from those prevalent in representative countries in East Africa, South America, and Asia.
These data suggest that sequence analyses derived from one country may be closely applicable to viruses found in neighboring countries in this region. This is supported by previous experiments that have reported the identical escape mutations selected in geographically distinct but otherwise similar clade B–infected cohorts (12).
After HLA-I typing of both partners from each of the 114 epidemiologically linked Zambian transmission pairs, plasma VL was determined from the same samples that viral gag and nef gene sequences were derived. We examined the relationship of amino acid polymorphisms in Gag and Nef sequences to early VL in the newly infected partners of this Zambian transmission pair cohort. We reasoned that early VL (after seroconversion) would be more reflective of the characteristics of the virus transmitted from the donor than VL seen later during chronic infection, when reversion and compensatory mutations might occur. Indeed, the majority of HIV-1 polymorphisms (579 out of 610; 94.9%) were identical in the donor and recipient at 6 mo after transmission. Gag mutations were more likely to be unchanged in the recipient when compared with Nef mutations, because the latter had a greater tendency to exist as the clade C consensus amino acid in the recipient (97.1 vs. 91.3%, respectively; P = 0.002).
Next, we analyzed the relationship between the numbers of transmitted HLA-I–linked Gag and Nef mutations within or flanking known CTL epitopes (Table I) to early VL in recipient partners. Accumulation of HLA-I–associated polymorphisms in Nef was not linked to either a higher or lower VL in recipients (ρ = 0.01; NS; Fig. 2 A).
In contrast, a higher number of HLA-I–associated mutations within Gag was significantly associated with lower VL (ρ = −0.27; P = 0.01; Fig. 2 B).
Although limiting the HLA-I–associated HIV-1 polymorphisms to those occurring within or flanking known CTL epitopes strengthens the case that these changes represent CTL escape mutations, it potentially introduces selection bias into our dataset. We therefore included all HLA-I–associated polymorphisms (P < 0.001; q < 0.2) whether or not they were coupled with known CTL epitopes (Table I; and Table S1, available at http://www.jem.org/cgi/content/full/jem.20072457/DC1). This provided an additional 16 mutations in Gag and 12 mutations in Nef. Unlike what was observed for mutations associated with known CTL epitopes, we saw a more even distribution of the HLA-I restriction of these mutations for both Gag (seven HLA-A, four HLA-B, and four HLA-C restricted) and Nef (four restricted by each HLA-I allele). No associations were seen when comparing the numbers of Nef mutations with VL (ρ = 0.01; NS; Fig. 2 C), consistent with the previous analysis, and there was no longer a significant association with the total number of Gag mutations and VL (ρ = −0.11; NS; Fig. 2 D). In contrast, when we focused the analysis on HLA-B–associated Gag polymorphisms, these remained significantly associated with diminished VL (ρ = −0.24; P = 0.02; Fig. 2 F). HLA-A– and HLA-C–associated Gag polymorphisms were not associated with a lower recipient VL (ρ = 0.08; NS; unpublished data). Similarly, HLA-B–associated Nef mutations were not associated with either positive or negative VL changes in the recipients (ρ = 0.02; NS; Fig. 2 E).
These results (Fig. 2 F) support previous studies demonstrating that the HIV-1–specific CTL responses with the greatest impact on VL are HLA-B restricted (23) and are directed against Gag epitopes associated with CTL escape mutations (6), perhaps reflecting the selection pressure imposed by these effective responses. Thus, a confounding factor in this analysis of the in vivo fitness cost of escape mutations may be the new HIV-1–specific CTL responses, certainly present a 6 mo after infection, which could by themselves reduce VL. This would imply that newly infected individuals lacking the HLA-B molecules that drive CTL escape mutations in Gag (Table I) would benefit most from the receipt of virus with detrimental escape mutations in Gag. When only these latter recipients were included in the analysis, the association between the increasing number of Gag mutations and reduced VL was much stronger (ρ = −0.57; P = 0.0003; Fig. 3 A).
This finding also suggests that the recipients who are able to effectively target Gag themselves would derive less absolute benefit from CTL-induced Gag mutations because of the equipoise between CTL-derived virus suppression and the fitness costs associated with escape mutations. Indeed, we did not see any association between the number of Gag mutations and VL in recipient carriers of HLA-B alleles that select for the Gag CTL escape mutations described in Table I (ρ = 0.07; NS; Fig. 3 B). To determine whether newly induced Gag-specific CTL responses could be contributing to VL control in this latter recipient group, the number of HLA-B–restricted Gag epitopes that could potentially be targeted in each individual was quantified and compared with VL. In line with previous findings (4, 6), the greater the number of Gag CTL epitopes potentially targeted by these recipients, the greater the decrease in VL (ρ = −0.28; P = 0.047; Fig. 3 C). This latter finding in recipients with Gag-specific CTLs capable of inducing mutations is also similar to a recently published paper in which an inverse relationship was demonstrated between VL and HLA-I–associated polymorphisms in Gag, Pol, and Nef in chronically infected hosts (24). Our analysis expands on this study because CTL escape mutations induced by the recipient's immune system were almost nonexistent at the time point analyzed (unpublished data). Therefore, it is likely that the potency of CTLs associated with mutations in Gag plays an important role in early viral control, even if these mutations do not occur.
Because certain HLA-B alleles are associated with improved disease outcomes (7, 23), it was important to consider the possibility that the significant associations with VL were driven by Gag mutations restricted only by those HLA-I alleles previously associated with a lack of disease progression (e.g., HLA-B*57 or -B*5801). We therefore stratified the data according to which HLA-I–associated Gag mutation was present in recipients who lacked HLA-B alleles that drive escape (Fig. 4).
Although none of the VL associations with any individual HLA-I allele were statistically significant, certain patterns could be seen. All 12 HLA-B–associated Gag mutations were linked to lower VLs in HLA-mismatched recipients (compared with only one out of five HLA-A– or HLA-C–associated mutations; P = 0.002). Of note, this trend was not unique to HLA-B alleles associated with good clinical markers, but instead was also observed with HLA-B*07, -B*35, -B*41, and -B*44. These data suggest that it is number of CTL epitopes targeted by any individual HLA allele (as assessed by the frequency of Gag mutations induced) that drives its association with lower VL. Notably, 13 out of 20 (65%) of the mutations in Gag (Table I) were restricted by HLA molecules (HLA-B*42, -B*57, -B*5801, and -B*8101) previously associated with lower VLs and a better clinical outcome (7, 23).
Previous studies have examined VL differences based on CTL escape that occurs during the course of chronic infection. These analyses are complicated by the fact that the occurrence of a CTL escape mutation in an individual abolishes immunological suppression of viral replication in that individual. Indeed, escape from an HLA-B*27–restricted Gag response results in a higher VL (3) despite the fact that it is associated with reduced replicative capacity in vitro (14). It is therefore difficult to assess whether any particular escape mutation results in a viral fitness cost in chronically infected patients. This current study is unique in that the identity of the transmitted virus was known in each of the individuals early after infection. This allowed us to observe the impact of HLA-I–induced mutations on virus replication after it entered a new immunogenetic environment where the fitness cost of the mutations would be most pronounced. Importantly, we demonstrate two different mechanisms of viral control in newly infected recipients. Similar to previous experiments (6), those recipients expressing certain HLA-B alleles likely control VL by effective CTL Gag targeting. In contrast, newly infected recipients who lack HLA-B alleles associated with Gag polymorphisms can derive significant benefit from the transmission of viruses containing Gag mutations induced in partners with potent HLA-B–restricted responses.
One possible explanation for the finding that mutations in Gag, but not Nef, affect viral fitness is that the former protein (and in particular p24) must make multiple interactions with other Gag molecules during the assembly of both immature viral capsids and mature viral cores, as well as with cellular components such as cyclophilin A (25). This relative structural inflexibility may allow the immune system, through its targeting of Gag, to significantly affect viral replication, and would provide a strong mechanistic rationale for the repeated observation that Gag-specific CTLs correlate with biological markers of improved clinical outcome (4–6).
It is not clear why HLA-B–restricted responses directed against HIV-1 play such an important role in immunopathogenesis, although several groups have demonstrated this association (6, 8, 23). It is possible the HLA-B alleles just happen to target important regions in p24 with a higher binding affinity (26). Furthermore HLA-A, -B, and -C are differentially expressed depending on the cell type (27), and it is plausible that certain antigen-presenting cells preferentially express HLA-B molecules. In addition to CTL function, HLA-B Bw4 interacts with NK cells, and this was recently demonstrated to play an important role in HIV-1 control (28).
The ∼10-fold VL reduction associated with increasing numbers of escape mutations (fewer than two versus more than six Gag mutations; Fig. 3 A) observed in this report may be clinically significant, because even a 2.5-fold lower VL among recent HIV-1 seroconverters was associated with a benefit in AIDS-free survival (29). Compensatory mutations (11, 13) or reversions (30) occurring after our analyzed time point could dampen the impact of CTL escape on viral fitness constraints. Follow-up VL data were available for 39 recipients at ∼1 yr after seroconversion or 6 mo after the first evaluation. Although the VL significantly increased at the second time point compared with the first (70,928 vs. 21,961 copies/ml, respectively; P < 0.0001 using the Wilcoxon signed-rank test), it was higher in all patients irrespective of the number of Gag polymorphisms or whether or not the patients carried the “protective” HLA-I alleles. Therefore, it is difficult to know whether the increase in plasma VL seen after the first year of infection is caused by the development of compensatory mutations, the loss of immune control, or other factors. Nevertheless, these HLA-I–induced mutations are certainly present at the earliest point of infection and would be expected to also influence peak VL. Because much of the CD4 depletion occurs before seroconversion (31), these viral fitness mutations may contribute to long-lasting clinical benefit even if their biological impact is lost at some point after seroconversion. It would have been interesting to evaluate the impact of viral fitness mutations on CD4 T cells; however, this clinical test was not available at the time of the study.
Although these findings need to be replicated in other cohorts, they strengthen the argument for CTL targeting of Gag antigens as an important mechanism for sustained viral control. Moreover, if a virus encoding escape mutations in gag were to be transmitted to a new host in the absence of fully compensatory mutations (as observed in this report), these new recipients would likely benefit as well. These data imply that for CTL-based HIV vaccines to effectively control VL, they must simultaneously target multiple Gag epitopes, thereby ensuring that fitness constraints prevent the virus from easily mutating. Thus, efforts to identify other CTL-induced HIV-1 mutations in structurally important regions of Gag (or other proteins), and to define mechanisms by which these mutations influence VL set point and disease progression will ultimately benefit the design of an AIDS vaccine.
Materials And Methods
The South African cohort consisted of 680 patients with chronic HIV-1 infection from Durban, South Africa (6, 23). The Zambian participants included 114 transmission pairs, shown to have epidemiologically linked HIV-1, from the Zambia-Emory HIV Research Project (21, 22). All patients in both cohorts were antiretroviral therapy naive. Zambian linked recipients were identified at 5.6 mo (mean; median = 3.1 mo) after the estimated time of infection, at which time plasma samples were obtained from both the donor and the recipient. All research protocols were approved by the ethics committees in Durban, South Africa and Lusaka, Zambia, and by the Emory University and Oxford University Institutional Review Boards.
Viral RNA from Zambian samples was extracted from 200 μl of plasma using a robotic system with the MagNA Pure LC Total Nucleic Acid Isolation Kit (Roche). 10 and 1 μl RNA were amplified using a one-step RT-PCR high fidelity kit (Invitrogen) for the RT and first-round amplification steps using gene-specific primers. Gag primers included (forward) 5′-ATTTGACTAGCGGAGGCTAGAA-3′ and (reverse) 5′-GACAGGTGTAGGTCCTACTAATACTGTACC-3′. Nef primers included (forward) 5′-AATAGAGTTAGGCAGGGATAC-3′ (32) and (reverse) 5′-GCACTCAAGGCAAGCTTTATTGAGGCTTA-3′. For nested PCR reactions, primer sequences for gag amplification were (inner forward) 5′-TTTGACTAGCGGAGGCTAGAAGGA-3′ and (inner reverse) 5′-GTATCATCTGCTCCTGTGTCTAAGAGAGC-3′. The primer sequences for nef gene amplification were (inner forward) 5′-GAGAGACTTCATATTGGTTGCAGCG-3′ (a gift from S. Mallal, Murdoch University, Murdoch, Australia) and (inner reverse) 5′-AAAGCAGCTGCTTATATGCAGCATCT-3′. PCR products were purified using Montage MultiScreen PCR96 filtration plates (Millipore), and amplicons were sequenced by Macrogen. Sequences were analyzed using Sequencher 4.7 software (Gene Codes Corp.). Multiple peaks were recorded using International Union of Pure and Applied Chemistry codes whenever the lower peak level exceeded one third of the level of the maximum peak, and nucleotide sequences were then converted to amino acid sequences using the universal code. We also performed gag sequencing of multiple single genomes from both the donor and the recipient (mean = 20 clones in each group) in a subset of 11 transmission pairs. The consensus sequencing method mirrored the major polymporphisms transmitted from the donor to the recipient (unpublished data). HIV-1 sequencing of South African plasma samples was performed in a similar fashion (unpublished data).
Plasma VL was determined at the Emory Center for AIDS Research Virology Core Laboratory using the Amplicor HIV-1 Monitor Test (version 1.5; Roche).
HLA class I typing.
Using genomic DNA prepared from whole blood or buffy coats (QIAamp blood kit; QIAGEN), HLA class I genotyping relied on a combination of PCR-based techniques, including PCR with sequence-specific primers (Invitrogen) and reference-strand conformation analysis (Invitrogen), as described previously (33). Ambiguities were resolved by sequencing-based typing using kits (Abbott Molecules, Inc.) designed for capillary electrophoresis and the ABI 3130xl DNA Analyzer (Applied Biosystems).
Calculation of HLA-I–associated HIV polymorphisms.
We used a method similar to those previously described (16, 34) to identify HLA-I–associated HIV polymorphisms. This approach corrected for the phylogenetic structure of the sequences. Additionally, we included multiple HLA predictors, allowing for the elimination of spurious associations caused by HLA linkage disequilibrium. In brief, for both Gag and Nef, a maximum likelihood phylogenetic tree was constructed from the corresponding sequences. For every HLA allele, amino acid position, and amino acid at that position, two generative—or directed graphical—models of the observed presence or absence of the amino acid in each sequence were created: one represented the null hypothesis that the observations are generated by the phylogenetic tree alone, and the other represented the alternative hypothesis that additional escape or reversion takes place because of HLA pressure in the subjects for which the sequences are observed. The likelihood of the observations was then maximized over the parameters of both models using an expectation-maximization algorithm, and a p-value was computed using a likelihood ratio test based on those likelihoods. To increase power, the tests were binarized such the presence or absence of a given HLA allele was correlated with the presence or absence of a given amino acid. HLA polymorphism pairs were also analyzed only when the actual or expected count in every cell of the corresponding two-by-two contingency table was greater than or equal to three. For every amino acid at each position, the HLA allele with the strongest association (and its corresponding p-value) was added to the list of identified associations. The analysis was repeated after removing individuals having or possibly having this HLA allele. This procedure was iterated until no HLA allele yielded an association with P < 0.05. A q-value statistic, estimating the proportion of false positives among the associations identified, was computed for each association by repeating this analysis on null data. These q-value statistics were estimated separately for Gag and Nef associations using one million null tests per protein. Associations with q-values less than or equal to 0.2 were deemed significant.
Analysis of HLA-I–associated HIV-1 polymorphisms in linked Zambian transmission pairs.
HIV-1 gag and nef sequences were analyzed in 88 and 96 out of 114 Zambian transmission pairs, respectively. HIV-1 polymorphisms were defined based on associations calculated in the South African cohort. Only predicted CTL escapes and attractions were analyzed. HLA-I associations that were either not present in the Zambian cohort or were associated with less than or equal to one mutation were not included in the analysis. In cases where more than one HLA-I was associated with a specific polymorphism, the linked amino acid also associated with a known CTL epitope was used. If no CTL association was present, then the HLA-I allele demonstrating the greatest association with the HIV-1 polymorphism was used. The donor sequence was treated as the transmitted sequence, except in cases where the donor encoded an escape mutation and the recipient encoded the clade C consensus amino acid (69 out of 1,004, or 6.9%, of the mutations).
Comparisons between VL and the number of mutations were performed by the nonparametric Spearman rank correlation and Mann-Whitney U tests.
Online supplemental material.
Table S1 lists HLA-I allele–associated HIV-1 polymorphisms not occurring within or near known CTL epitopes.
© 2008 Goepfert et al. This article is distributed under the terms of an Attribution–Noncommercial–Share Alike–No Mirror Sites license for the first six months after the publication date (see http://www.jem.org/misc/terms.shtml). After six months it is available under a Creative Commons License (Attribution–Noncommercial–Share Alike 3.0 Unported license, as described at http://creativecommons.org/licenses/by-nc-sa/3.0/).
We thank the Zambia-Emory HIV Research Project participants and those in Durban who made possible the collection of blood samples for this study.
This work was supported by National Institutes of Health (NIH) grants AI-64060 (E. Hunter, P.A. Goepfert, and R.A. Kaslow) and AI-46995 (P.J.R. Goulder), the International AIDS Vaccine Initiative (S. Allen and E. Hunter), the Wellcome Trust (P.J.R. Goulder), the Medical Research Fund UK (P. Matthews and A. Prendergast), Microsoft Research (D. Heckerman and J.M. Carlson), and the Emory Center for AIDS Research Virology Core through a NIH center grant (P30 AI050409). E. Hunter is a Georgia Research Alliance Eminent Scholar, and P.J.R. Goulder is an Elizabeth Glaser Pediatric AIDS Foundation scientist.
The authors have no conflicting financial interests.