Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Molecular Evolution of HIV-1 CRF01_AE Env in Thai Patients

  • Samatchaya Boonchawalit ,

    Contributed equally to this work with: Samatchaya Boonchawalit, Duangrat Jullaksorn

    Affiliation Thailand-Japan Research Collaboration Center on Emerging and Re-emerging Infections (RCC-ERI), Nonthaburi, Thailand

  • Duangrat Jullaksorn ,

    Contributed equally to this work with: Samatchaya Boonchawalit, Duangrat Jullaksorn

    Affiliation National Institute of Health, Department of Medical Sciences (DMSc), Ministry of Public Health (MOPH), Nonthaburi, Thailand

  • Jiraporn Uttiyoung,

    Affiliation Regional Medical Science Center Chiangrai, DMSc, MOPH, Chiangrai, Thailand

  • Amara Yowang,

    Affiliation Regional Medical Science Center Chiangrai, DMSc, MOPH, Chiangrai, Thailand

  • Nongkran Krathong,

    Affiliation Mae-Soay Hospital, Chiangrai, Thailand

  • Sununta Chautrakul,

    Affiliation Mae-Soay Hospital, Chiangrai, Thailand

  • Akifumi Yamashita,

    Affiliation Graduate School of Life Science, Tohoku University, Miyagi, Japan

  • Kazuyoshi Ikuta,

    Affiliation Research Institute for Microbial Diseases, Osaka University, Osaka, Japan

  • Amornsak Roobsoong,

    Affiliation Mae-Soay Hospital, Chiangrai, Thailand

  • Sangkom Kanitvittaya,

    Affiliation Regional Medical Science Center Chiangrai, DMSc, MOPH, Chiangrai, Thailand

  • Pathom Sawanpanyalert,

    Affiliation National Institute of Health, Department of Medical Sciences (DMSc), Ministry of Public Health (MOPH), Nonthaburi, Thailand

  • Masanori Kameoka

    mkameoka@biken.osaka-u.ac.jp

    Affiliations Thailand-Japan Research Collaboration Center on Emerging and Re-emerging Infections (RCC-ERI), Nonthaburi, Thailand, Research Institute for Microbial Diseases, Osaka University, Osaka, Japan

Abstract

Background

The envelope glycoproteins (Env), gp120 and gp41, are the most variable proteins of human immunodeficiency virus type 1 (HIV-1), and are the major targets of humoral immune responses against HIV-1. A circulating recombinant form of HIV-1, CRF01_AE, is prevalent throughout Southeast Asia; however, only limited information regarding the immunological characteristics of CRF01_AE Env is currently available. In this study, we attempted to examine the evolutionary pattern of CRF01_AE Env under the selection pressure of host immune responses.

Methodology/Principal Findings

Peripheral blood samples were collected periodically over 3 years from 15 HIV-1-infected individuals residing in northern Thailand, and amplified env genes from the samples were subjected to computational analysis. The V5 region of gp120 showed highest variability in several samples over 3 years, whereas the V1/V2 and/or V4 regions of gp120 also showed high variability in many samples. In addition, the N-terminal part of the C3 region of gp120 showed highest amino acid diversity among the conserved regions of gp120. Chronological changes in the numbers of amino acid residues in gp120 variable regions and potential N-linked glycosylation (PNLG) sites are involved in increasing the variability of Env gp120. Furthermore, the C3 region contained several amino acid residues potentially under positive selection, and APOBEC3 family protein-mediated G to A mutations were frequently detected in such residues.

Conclusions/Significance

Several factors, including amino acid substitutions particularly in gp120 C3 and V5 regions as well as changes in the number of PNLG sites and in the length of gp120 variable regions, were revealed to be involved in the molecular evolution of CRF01_AE Env. In addition, a similar tendency was observed between CRF01_AE and subtype C Env with regard to the amino acid variation of gp120 V3 and C3 regions. These results may provide important information for understanding the immunological characteristics of CRF01_AE Env.

Introduction

Human immunodeficiency virus type-1 (HIV-1) is characterized by extensive genetic heterogeneity [1], and is divided into four groups, M (major), O (outlying), N (new or non-M, non-O) and P (pending). The viruses in group M are further classified into many subtypes and circulating recombinant forms (CRFs). Among them, subtypes A, B, C, D and G, as well as CRF01_AE and CRF02_AG, are the major subtypes and CRFs, which are responsible for the worldwide HIV-1 pandemic [2]. While subtype B of HIV-1 is the predominant subtype in the Americas, Europe and Australia, there is a growing epidemic of non-B subtypes and CRFs in Africa and Asia. CRF01_AE is prevalent throughout Southeast Asia [2], and is responsible for more than 80% of infection cases in Thailand [3].

The heterogeneity of the HIV-1 genome is mainly attributed to the error-prone nature of viral reverse transcriptase [4]. In the reverse transcription process, a G to A hypermutation is introduced into proviral DNA by the APOBEC3 family of cytosine deaminases [5], [6], [7], [8]. Although several lines of evidence have demonstrated that APOBEC3 family protein-mediated hypermutation plays an important role in the host defense mechanism against HIV-1 in a clinical setting [9], [10], [11], [12], [13], a sub-lethal level of APOBEC3 activity probably affect viral evolution, which facilitates viral escape from immune responses and antiretroviral therapy (ART) [14], [15].

The envelope glycoproteins (Env), gp120 and gp41, of HIV-1 play a central role in viral transmission to target cells, and mediate attachment and incorporation of the virus into the cells through specific interaction with the CD4 receptor and chemokine receptors. In addition, Env is a major target of humoral immune responses against HIV-1 [16], [17]. Env gp120 and gp41 are the most variable HIV-1 proteins with typical intersubtype and intrasubtype differences soaring to 35% and 20%, respectively [1]; therefore, the humoral immune responses against Env potentially somewhat vary among different subtypes and CRFs. For example, the replication of many clinical isolates of subtype B, C and D is neutralized by a human monoclonal antibody against the CD4 binding site (CD4BS) of Env gp120, IgG1 b12, which was established from an HIV-1 subtype B-infected individual [18], while most CRF01_AE viruses are resistant to IgG1 b12-mediated neutralization [19], [20], [21]. In addition, recently established, broadly neutralizing human monoclonal antibodies derived from HIV-1 subtype A-infected individuals recognize conserved regions of the V2 and V3 regions [22], while several broadly neutralizing, human or murine monoclonal antibodies elicited by the HIV-1 subtype B antigen recognize CD4BS or V3 region of Env gp120 [18], [23], [24], [25], [26], [27]. These results imply that the Env of different subtypes and CRFs show different antigenicity.

The introduction of mutations into HIV-1 Env, including those involved in the N-linked glycosylation of particular amino acid residues, leads to a reduction of the susceptibility to neutralizing antibodies [28], [29], [30]; therefore, the mutation-driven evolution of HIV-1 Env plays an important role in conferring viral escape from humoral immune responses. In order to study the evolutionary pattern of CRF01_AE Env under host immune pressure, we examined the changes of HIV-1 Env amino acid sequences derived from chronically CRF01_AE-infected Thai patients over 3 years.

Methods

Ethics statement

This study was conducted with the approval from the ethics committee of the Department of Medical Sciences, Ministry of Public Health of Thailand and with written informed consent from the patients.

Study participants and sample collection

Peripheral blood samples were collected every 3 months from April 2008 to January 2011, namely 12 times in total, from 9 drug-naïve, HIV-1-infected patients as well as from 6 HIV-1-infected patients on ART. All patients were infected with HIV-1 CRF01_AE viruses [31], and were negative for hepatitis B and C viruses at the time of enrollment. Among 9 drug-naive patients, 4 patients started ART during the project.

Measurement of CD4 count and viral load

As clinical markers, the CD4 count and viral load of the patients were monitored during the study period. The CD4 count was measured every 3 months by flow cytometric analysis at Chiangrai Prachanukoh Hospital, according to the manufacturer's protocol (Beckman Coulter, Fullerton, California, USA). In addition, the viral load was measured every 6 months as follows. Viral RNA was extracted from a plasma sample using High Pure System Viral Nucleic Acid (Roche, Basel, Switzerland). The viral load was then measured using the Cobas AmpliPrep/Cobas TaqMan HIV-1 version 5.1 Assay (Roche).

Amplification of viral genomic fragment encoding a full length Env precursor gp160

Plasma was isolated from peripheral blood samples by centrifugation for 10 min at 2000 rpm. In addition, peripheral blood mononuclear cells (PBMC) were isolated by density gradient centrifugation using Ficoll-Paque (GE Healthcare, Buckinghamshire, UK). Prior to RNA extraction, viral particles were concentrated from 1-2 ml of plasma by ultracentrifugation for 1 hour at 65,000 rpm using TLA-100.3 rotor with Optima TLX ultracentrifuge (Beckman Coulter). RNA and DNA were then extracted from the concentrated viral particles and PBMC using the QIAamp viral RNA mini-kit and the QIAamp DNA blood mini-kit (Qiagen, Hilden, Germany), respectively. Viral RNA was reverse transcribed to cDNA using the SuperScript III First-Stand Synthesis kit (Invitrogen, Carlsbad, California, USA) with the reverse primer, K-env-R1, 5′-CCAATCAGGGAAGAAGCCTTG-3′ [corresponding to nucleotide (nt) 8736 to 8716 of CRF01_AE reference strain, CM240 (GenBank accession no. U54771)]. The HIV-1 genomic fragment, encoding full-length Env precursor gp160, Rev and Vpu as well as partial fragments of Tat and Nef, was then amplified by nested PCR using BIO-X-ACT DNA polymerase (Bioline, Luckenwalde, Germany) and one of two primer sets, as follows. As the first set of primers, N-env-F1; 5′-TTAGAGGAGCTTAAAAATGAAGC-3′ (nt 5193 to 5215) and N-env-R1; 5′-TTAAAAAGAAGCTAAGATCAAAAGC-3′ (nt 8638 to 8614) were used for the first PCR, and N-env-F2; 5′- GAATTGGGTGTCAACATAGCAGAATAGGC-3′ (nt 5344 to 5372) and N-env-R2; 5′- TATCTAGATCTTGAGATACTGCTCC-3′ (nt 8485 to 8461) were used for nested PCR. As the second set of primers, K-env-F1; 5′- CTAGAGCCCTGGAATCATCCG-3′ (nt 5419 to 5439) and K-env-R1; 5′- CCAATCAGGGAAGAAGCCTTG-3′ (nt 8736 to 8716) were used for the first PCR, and K-env-F2; 5′-CGAGGAACTCCTCAGAGCAG-3′ (nt 5563 to 5582) and K-env-R2; 5′-TCTTGTGCTCTCAGCCAGAC-3′ (nt 8549 to 8530) were used for nested PCR. If the PCR using the first set of primers failed to amplify viral genomic fragment, the second set of primers was used. The PCR conditions were as follows. For the 1st PCR using the first set of primers, one cycle of 1 min at 94°C for denaturation; 10 cycles of 20 sec at 94°C for denaturation, 30 sec at 48°C for annealing and 5 min at 68°C for extension; 20 cycles of 10 sec at 94°C for denaturation, 30 sec at 48°C for annealing and 5 min at 68°C for extension with cycle elongation of 10 sec for each cycle; and a final extension cycle of 10 min at 68°C were carried out. For the nested PCR using the first set of primers, one cycle of 1 min at 94°C for denaturation; 30 cycles of 10 sec at 94°C for denaturation, 30 sec at 52°C for annealing and 4 min at 68°C for extension with cycle elongation of 5 sec for each cycle; and a final extension cycle of 5 min at 68°C were carried out. For the 1st and nested PCR using the second set of primers, annealing temperatures were changed to 60°C and 58°C, respectively. If a viral gene fragment failed to be amplified from the cDNA generated from viral RNA even after multiple attempts, it was amplified instead from DNA extracted from PBMC. In order to examine the genomic fragment of the major viral population in a sample, PCR products amplified at the end-point dilution of cDNA or DNA templates were subjected to sequencing analysis.

Sequencing and data analysis

Sequencing analysis of the amplified HIV-1 genomic fragment was carried out using the BigDye Terminator v3.1 Cycle Sequencing kit with an ABI PRISM 3130XL genetic analyzer (Applied Biosystems, Foster City, California, USA), and data were assembled using SeqScape v2.5 software (Applied Biosystems). The deduced amino acid sequences derived from an individual patient were then aligned with the viral sequence derived from the earliest sample collected in April 2008, using the ClustalW algorithm [32] with slight manual adjustment, followed by the numbering of amino acid residues according to their position in the HXB2 Env (Genbank accession no. K03455). Pairwise genetic distances between two amino acid sequences derived from an individual patient were calculated by using the p-distance model with 1,000 bootstrap replicates, conducted with the MEGA5 software package [33]. The diversity of each amino acid residue among 10 amino acid sequences derived from an individual was evaluated by calculating the Shannon index [34] with a program available at the website, http://www.gen-info.osaka-u.ac.jp/~uhmin/study/consensus/index.html. The nonsynonymous to synonymous substitution rate (dN/dS ratio) was estimated by calculating the numbers of synonymous substitutions per synonymous site (dS) and nonsynonymous substitutions per nonsynonymous site (dN) based on the Tamura-Nei method using the HyPhy program in the MEGA5 software package [33], [35]. Finally, the potential N-linked glycosylation (PNLG) site was evaluated using N-Glycosite (http://www.hiv.lanl.gov/).

Nucleotide sequence accession numbers

The nucleotide sequences of the viral gene fragment encoding full-length Env precursor gp160 have been deposited in the GenBank database under accession numbers JN388081-JN388230.

Results

Sample collection from 15 Thai patients infected with HIV-1 CRF01_AE viruses.

The molecular evolution of CRF01_AE Env, which was presumably driven by the selection pressure of humoral immune responses, was studied in this report. To this end, we periodically collected 10-12 viral sequences encoding full-length Env precursor gp160 for 3 years from 15 Thai patients chronically infected with HIV-1 CRF01_AE viruses, and performed computational analysis on 10 selected viral sequences derived from each patient (Table S1). We attempted to study viral sequences derived from plasma viral RNA; however, if a viral gene fragment failed to be amplified from viral RNA, that derived from proviral DNA was instead studied. Proviral DNA is generally considered to be more heterogeneous than viral RNA; however, the concordance of viral sequences between viral RNA and the major population of proviral DNA was observed previously [36], [37], [38]. Indeed, homogeneous viral sequences were amplified at the end-point dilution of proviral DNA samples (data not shown). The CD4 count and viral load were measured as clinical markers during the study (Tables S2 and S3). The CD4 counts of most patients were relatively stable, but fluctuated over 3 years (Table S2). In addition, the CD4 counts of 4 drug-naive patients gradually declined to <200 cells/mm3, and subsequently increased in response to ART (Table S2). The viral loads of the patients on ART were close to or under the detection limit throughout the study, whereas these of drug-naive patients gradually increased during the study (Table S3). These results suggested that the clinical conditions of most patients were relatively stable over 3 years, but those of some patients gradually deteriorated during the study.

Variability of CRF01_AE Env gp120 and gp41 over 3 years

After the establishment of HIV-1 infection, the amino acid diversity of HIV-1 Env increases during the course of disease progression [39]. In order to evaluate the molecular evolution of the functional regions of CRF01_AE Env under the pressure of humoral immune responses, we studied the variability of amino acid sequences in the conserved and variable regions of Env gp120 as well as in gp41. To this end, pairwise genetic distances (p-distances) between the amino acid sequences of an Env region in the earliest and one of the later samples derived from an individual were determined, and were plotted together to compare the variability of each Env region over 3 years. The results showed that the pairwise genetic distances of the amino acid sequences in gp120 variable regions, V1-V5, were higher than in other regions, including the conserved regions, C1-C5, of gp120 and gp41, as expected (Fig. 1). The V5 region showed the highest variability in samples derived from 8 individuals, CR2, CR3, CR14, CR15, CR11, CR17, CR25 and CR29, whereas the V1/V2 or V4 region(s) showed higher variability than the V5 region in samples derived from 7 individuals, CR10, CR8, CR12, CR19, CR28, CR36 and CR38 (Fig. 1). V3 region showed relatively low variability among variable regions (Fig. 1). In addition, C3 showed the highest variability among the conserved regions of gp120 (Fig. 1). Pairwise genetic distances of amino acid sequences in 5 conserved regions are also shown in Figure S1. Finally, gp41 showed lower variability than gp120 in all samples (Fig. 1). There were two patterns of change in the amino acid variability of Env gp120 and gp41 over 3 years. One was a gradually increasing pattern (example; CR3, V1/V2 regions), while the other was a fluctuating pattern of variability during 3 years (example, CR2, C3 region) (Figs. 1 and S1).

thumbnail
Figure 1. Variability of amino acid sequences in Env gp120 regions and gp41.

Nine amino acid sequences derived from samples collected between July 2008 and January 2011 from an individual were aligned with the amino acid sequence derived from the earliest sample collected in April 2008 from a corresponding individual. Aligned sequences were then divided into variable and conserved regions of gp120 and gp41. Pairwise genetic distances (p-distance) between the sequences derived from the earliest and one of the later samples was determined as described in Methods. The data obtained for all Env regions were plotted together in chronological order of sampling, as indicated from 1 vs 2 to 1vs 10. Patient IDs, Env regions and the status of treatment are denoted above, below and beside the panels, respectively.

https://doi.org/10.1371/journal.pone.0027098.g001

High amino acid diversity in the N-terminal part of C3 region and in entire V5 region

C3 and V5 regions showed the highest variability among the conserved and variable regions of gp120, respectively (Figs. 1 and S1). We next attempted to determine amino acid residues with high diversity within these regions by calculating the Shannon index of diversity. The Shannon index is a diversity index used to measure diversity in categorical data. We calculated the index for each amino acid residue in the C3 and V5 regions of gp120 among 10 viral sequences derived from an individual. The results showed that the N-terminal, but not C-terminal part of the C3 region contained amino acid residues with Shannon index values of 0.5-2.5 in many samples, indicating that high amino acid diversity was observed in the N-terminal part of the C3 region (Fig. 2). In addition, most amino acid residues, except a few amino acid residues at N- and C-termini, of the V5 region showed high diversity (Fig. 2). A large amino acid insertion was found in the middle part of the V5 region of gp120 in a few samples, and these amino acid residues also showed high diversity (Fig. 2).

thumbnail
Figure 2. Amino acid residues with high diversity in the C3 and V5 regions of gp120.

The Shannon index was calculated for each amino acid residue in the C3 and V5 regions of gp120 among 10 amino acid sequences derived from an individual, as described in Methods. The data obtained from 15 patient samples were then plotted together. Amino acid numbering is based on the HXB2 Env gp120.

https://doi.org/10.1371/journal.pone.0027098.g002

Variations in the length of gp120 variable regions and the number of PNLG sites

The length of the variable regions of gp120 as well as the N-linked glycosylation of particular amino acid residues affect the protein structure and lead to changes in the neutralization susceptibility of HIV-1 Env [17], [30], [40]; therefore, these factors are important for the molecular evolution of HIV-1 Env. We studied the changes in the numbers of amino acid residues and PNLG sites in the functional regions of gp120 and gp41. Average numbers of amino acid residues and PNLG sites in each Env region are shown in Table 1. The amino acid numbers did not significantly change over 3 years in the C1, C2, V3, C4 and C5 regions of gp120 as well as in gp41 (data not shown). In contrast, the numbers of amino acid residues in the V1/V2, C3, V4 and V5 regions of gp120 varied among samples derived from different individuals, and the numbers changed over 3 years (Fig. 3). A moderate correlation was observed between the variability and the chronological change of amino acid numbers in V1/V2, V4 and V5 regions (Figs. 1 and 3). Namely, if the chronological changes, either increasing or decreasing, in amino acid numbers were frequent (Fig. 3), the variability of the Env regions were high in some samples (examples; V1/V2 regions of CR2, V4 region of CR11, V5 region of CR14) (Fig. 1). The number of PNLG sites changed significantly in the V1/V2 and V4 regions of gp120 over 3 years, whereas only a few chronological changes were observed in the number of PNLG sites in the C1, C2, V3, C3, C4 and V5 regions of gp120 as well as in gp41 (Fig. 4 and data not shown). A moderate correlation was again observed between the variability and the chronological change in the number of PNLG sites in the V1/V2, C3 and V4 regions of gp120 (Figs. 1, S1 and 4).

thumbnail
Figure 3. Changes in the number of amino acid residues in the gp120 regions.

The numbers of amino acid residues in the gp120 regions were counted manually for 10 amino acid sequences derived from an individual. The data obtained from 15 patient samples were then plotted together in chronological order of sampling, as indicated from 1 to 10. Env regions are denoted above the panels, whereas patient IDs and the status of treatment are denoted below the panels.

https://doi.org/10.1371/journal.pone.0027098.g003

thumbnail
Figure 4. Changes in the number of PNLG sites in the gp120 regions.

PNLG sites in the gp120 regions were evaluated using N-Glycosite, and the number of the sites was then counted manually for 10 amino acid sequences derived from an individual. The data obtained from15 patient samples were then plotted together in chronological order of sampling, as indicated from 1 to 10. Env regions are denoted above the panels, whereas patient IDs and the status of treatment are denoted below the panels.

https://doi.org/10.1371/journal.pone.0027098.g004

thumbnail
Table 1. Average numbers of amino acid residues and potential N-linked glycosylation (PNLG) sites in Env regions.

https://doi.org/10.1371/journal.pone.0027098.t001

Detection of the amino acid residues potentially under positive selection

We next attempted to determine the amino acid residues in gp120 and gp41 under positive selection. To this end, the dN/dS ratio was calculated, and an amino acid residue with a ratio greater than 1 was considered to be potentially under positive selection. Since insertion and deletion mutations were frequently introduced into the V1/V2, V4 and V5 regions of gp120 over 3 years in many samples (Fig. 3), we failed to estimate reliable dN and dS values following the alignment of amino acid sequences for these variable regions. In contrast, the dN/dS ratios of the amino acid residues in the V3 and five conserved regions of gp120 as well as in gp41 were successfully determined. We found that the dN/dS ratios were greater than 1 at 6, 5, 3, 17, 7, 0 and 9 amino acid residues in the C1, C2, V3, C3, C4 and C5 regions of gp120 and gp41, respectively, in samples derived from at least one individual (Table 2). The C3 region of gp120 contained several amino acid residues with a dN/dS ratio greater than 1 (Table 2). The dN/dS ratios of amino acid residues in C3 region are shown in Figure 5. Our results showed that several amino acid residues in the N-terminal part of C3 region showed high diversity (Fig. 2), and some of these amino acid residues were potentially under positive selection (Fig. 5).

thumbnail
Figure 5. Amino acid residues potentially under positive selection in the C3 region of Env gp120.

The dN/dS ratio was estimated for each amino acid residue in the C3 region of gp120 among 10 amino acid sequences derived from an individual, as described in Methods. The data obtained from 15 patient samples were then plotted together. Amino acid numbering is based on the HXB2 Env gp120.

https://doi.org/10.1371/journal.pone.0027098.g005

thumbnail
Table 2. Amino acid residues potentially under positive selection.*

https://doi.org/10.1371/journal.pone.0027098.t002

Involvement of APOBEC3 activity in amino acid substitutions due to the positive selection of Env

The sub-lethal level of APOBEC3 activity is proposed to be involved in viral evolution; therefore, we attempted to study the possible involvement of APOBEC family protein-mediated G to A mutation in the positive selection of Env amino acid residues. APOBEC3G and APOBEC3F are involved in GG-AG [41] and GA-AA mutations [42], respectively. We manually detected GG-AG, GA-AA, GC-AC and GT-AT mutations, and found one of these mutation patterns 44 times over 3 years at 21 out of 47 amino acid residues with a dN/dS ratio greater than 1 (45%) in the C1, C2, V3, C3, C4 and C5 regions of gp120 and gp41 (Table 3). G to A mutations were more frequently detected in samples derived from individuals on ART, including CR17 and CR19, compared to samples derived from drug-naive individuals including CR2 and CR14 (Table 3), suggesting the accumulation of viruses with G to A mutations under therapeutic pressure and limited viral replication. Nevertheless, these results suggested that APOBEC3 activity was involved, at least in part, in mutations due to the positive selection of Env gp120 and gp41.

thumbnail
Table 3. Association of positive selection and potential APOBEC3-related mutation.

https://doi.org/10.1371/journal.pone.0027098.t003

Discussion

The molecular evolution of CRF01_AE Env was studied using viral gene fragments periodically collected from 15 chronically HIV-1-infected Thai patients over 3 years. It was previously reported for subtype B Env that the V1/V2 regions of gp120 are under positive selection in vivo [43], and the expansion of V1/V2 regions along with the accumulation of PNLG sites reduces the susceptibility of viruses to autologous neutralizing antibody [28]. The V3 region of subtype B gp120 shows strong immunogenicity, and several neutralizing monoclonal antibodies have been established [23], [24], [27]. In addition, the positive selection of amino acid residues in the V3 region is reported [44], [45], [46]. In contrast, the V3 region of subtype C gp120 is conserved, and the molecular evolution of C3-V4 regions is observed under the selection pressure of autologous serum antibodies [29]. In addition, the V5 region, cooperating in part with the V3 region, of subtype C gp120 is involved in generating an escape variant against humoral immune responses [47]. We first attempted to identify the region of CRF01_AE Env that showed high variability over 3 years, and found that the V1/V2, V4 and V5 regions of CRF01_AE gp120 showed high variability (Fig. 1). The gp120 region that showed highest variability was different among patients, suggesting that the V1/V2, V4 or V5 regions differentially counteracts with humoral immune responses. In contrast to these regions, the V3 region showed relatively low variability among the variable regions of gp120 (Fig. 1). The vast majority of CRF01_AE Env clones derived from plasma samples were CCR5-tropic [48]. In addition, CCR5-tropic CRF01_AE viruses with a low positive charge in the V3 region show low neutralization susceptibility to anti-V3 antibodies [49]. Considering these reports together, the V3 region of CCR5-tropic CRF01_AE viruses may not be under strong selection pressure of humoral immune responses in chronically infected, asymptomatic patients.

Molecular evolution of the C3 region of subtype C gp120 was previously reported [50]. An alpha helix domain located in the N-terminal part of the C3 region of subtype C gp120 shows high variation and is under positive selection, whereas that of subtype B gp120 is relatively conserved [1], [51]. In addition, the quaternary structure of this region differs between subtype B and C Env [52], and the C3 region of subtype C, but not of subtype B gp120 is suggested to be under immune pressure. Furthermore, mutations in the C3 region, cooperating with those in the variable regions, of subtype C gp120 affect the neutralization susceptibility of viruses to autologous neutralizing antibodies [53]. Our results showed that the C3 region of CRF01_AE gp120, similar to that of subtype C gp120, showed high amino acid variation (Figs. 1 and S1), and was suggested to be under the selection pressure of autologous immune responses.

Viral evolution in patients on ART has been reported previously [45]. Consistent with the previous report, high variability was observed in the variable regions of gp120 derived not only from drug-naïve patients, but also from patients on ART (Fig. 1). We observed not only a gradually increasing pattern, but also a fluctuating pattern in the chronological change in the variability of Env amino acid sequences over 3 years (Fig. 1), indicating two possibilities: that the variability was generated by the evolution of a trace of escape mutants against ART or was due to viral quasispecies which acquired divergence prior to the onset of ART. The frequent appearance of stop codons in the C2, V3 and C3 regions of gp120 in proviral DNA derived from successfully treated patients is reported [54]. In contrast to the previous report, premature stop codons were not frequently detected in Env gp120 and gp41 derived from patients on ART (data not shown). A possible explanation for this discrepancy might be the difference in efficacy, including the potency and the level of adherence, of ART between patients enrolled in our and other studies.

Potential APOBEC3 family protein-mediated G to A mutations were detected among the mutations due to the positive selection of several Env amino acid residues (Table 3), suggesting the role of a sub-lethal level of APOBEC3 activity in viral evolution. APOBEC3G introduces missense or nonsense mutations into viral DNA during the RT process and leads to a diminishment of viral infectivity, whereas HIV-1 Vif counteracts this activity [5], [7], [8]; therefore, we consider that an approach to inhibit Vif function or stimulate APOBEC3 activity has potential for therapy.

Finally, HIV-1 Env, gp120 and gp41 are the major targets among viral proteins of humoral immune responses to viral infection; therefore, they are candidates of vaccine antigens. In order to develop an effective vaccine, it is important to understand the immunological characteristics of target viral proteins; however, information regarding the molecular evolution of CRF01_AE Env is still limited. CRF01_AE is a major circulating recombinant form of HIV-1 prevalent in Southeast Asia, including Thailand, and HIV vaccine trials have been conducted in Thailand [55], [56]. We hope that our results may provide useful information for understanding the immunological characteristics of CRF01_AE Env as well as for designing effective vaccine antigens.

Supporting Information

Figure S1.

Variability of amino acid sequences in the conserved regions of Env gp120. Pairwise genetic distances (p-distance) was determined, as described in the legend to Figure 1. Patient IDs, Env regions and the status of treatment are denoted above, below and beside the panels, respectively.

https://doi.org/10.1371/journal.pone.0027098.s001

(TIF)

Table S2.

Changes in the CD4 count of study participants.

https://doi.org/10.1371/journal.pone.0027098.s003

(DOC)

Table S3.

Changes in the viral load of study participants.

https://doi.org/10.1371/journal.pone.0027098.s004

(DOC)

Acknowledgments

We are grateful to Drs. Yoshitake Nishimune and Shigeyuki Hamada (RCC-ERI, Research Institute for Microbial Diseases, Osaka University) for their valuable help with this study. The manuscript was proofread by Medical English Service (Kyoto, Japan).

Author Contributions

Conceived and designed the experiments: SB DJ MK. Performed the experiments: SB DJ JU A. Yowang NK SC. Analyzed the data: SB DJ A. Yamashita MK. Contributed reagents/materials/analysis tools: DJ A. Yamashita KI AR SK PS. Wrote the paper: SB MK.

References

  1. 1. Gaschen B, Taylor J, Yusim K, Foley B, Gao F, et al. (2002) Diversity considerations in HIV-1 vaccine selection. Science 296: 2354–2360.
  2. 2. Hemelaar J, Gouws E, Ghys PD, Osmanov S (2011) Global trends in molecular epidemiology of HIV-1 during 2000-2007. AIDS 25: 679–689.
  3. 3. Arroyo MA, Phanuphak N, Krasaesub S, Sirivichayakul S, Assawadarachai V, et al. (2010) HIV type 1 molecular epidemiology among high-risk clients attending the Thai Red Cross Anonymous Clinic in Bangkok, Thailand. AIDS Res Hum Retroviruses 26: 5–12.
  4. 4. Mansky LM, Temin HM (1995) Lower in vivo mutation rate of human immunodeficiency virus type 1 than that predicted from the fidelity of purified reverse transcriptase. J Virol 69: 5087–5094.
  5. 5. Sheehy AM, Gaddis NC, Choi JD, Malim MH (2002) Isolation of a human gene that inhibits HIV-1 infection and is suppressed by the viral Vif protein. Nature 418: 646–650.
  6. 6. Harris RS, Bishop KN, Sheehy AM, Craig HM, Petersen-Mahrt SK, et al. (2003) DNA deamination mediates innate immunity to retroviral infection. Cell 113: 803–809.
  7. 7. Mangeat B, Turelli P, Caron G, Friedli M, Perrin L, et al. (2003) Broad antiretroviral defence by human APOBEC3G through lethal editing of nascent reverse transcripts. Nature 424: 99–103.
  8. 8. Zhang H, Yang B, Pomerantz RJ, Zhang C, Arunachalam SC, et al. (2003) The cytidine deaminase CEM15 induces hypermutation in newly synthesized HIV-1 DNA. Nature 424: 94–98.
  9. 9. Biasin M, Piacentini L, Lo Caputo S, Kanari Y, Magri G, et al. (2007) Apolipoprotein B mRNA-editing enzyme, catalytic polypeptide-like 3G: a possible role in the resistance to HIV of HIV-exposed seronegative individuals. J Infect Dis 195: 960–964.
  10. 10. Land AM, Ball TB, Luo M, Pilon R, Sandstrom P, et al. (2008) Human immunodeficiency virus (HIV) type 1 proviral hypermutation correlates with CD4 count in HIV-infected women from Kenya. J Virol 82: 8172–8182.
  11. 11. Kieffer TL, Kwon P, Nettles RE, Han Y, Ray SC, et al. (2005) G-->A hypermutation in protease and reverse transcriptase regions of human immunodeficiency virus type 1 residing in resting CD4+ T cells in vivo. J Virol 79: 1975–1980.
  12. 12. Jin X, Brooks A, Chen H, Bennett R, Reichman R, et al. (2005) APOBEC3G/CEM15 (hA3G) mRNA levels associate inversely with human immunodeficiency virus viremia. J Virol 79: 11513–11516.
  13. 13. Pace C, Keller J, Nolan D, James I, Gaudieri S, et al. (2006) Population level analysis of human immunodeficiency virus type 1 hypermutation and its relationship with APOBEC3G and vif genetic variation. J Virol 80: 9259–9269.
  14. 14. Jern P, Russell RA, Pathak VK, Coffin JM (2009) Likely role of APOBEC3G-mediated G-to-A mutations in HIV-1 evolution and drug resistance. PLoS Pathog 5: e1000367.
  15. 15. Wood N, Bhattacharya T, Keele BF, Giorgi E, Liu M, et al. (2009) HIV evolution in early infection: selection pressures, patterns of insertion and deletion, and the impact of APOBEC. PLoS Pathog 5: e1000414.
  16. 16. Wyatt R, Sodroski J (1998) The HIV-1 envelope glycoproteins: fusogens, antigens, and immunogens. Science 280: 1884–1888.
  17. 17. Pantophlet R, Burton DR (2006) GP120: target for neutralizing HIV-1 antibodies. Annu Rev Immunol 24: 739–769.
  18. 18. Barbas CF, 3rd , Collet TA, Amberg W, Roben P, Binley JM, et al. (1993) Molecular profile of an antibody response to HIV-1 as probed by combinatorial libraries. J Mol Biol 230: 812–823.
  19. 19. Binley JM, Wrin T, Korber B, Zwick MB, Wang M, et al. (2004) Comprehensive cross-clade neutralization analysis of a panel of anti-human immunodeficiency virus type 1 monoclonal antibodies. J Virol 78: 13232–13252.
  20. 20. Trkola A, Pomales AB, Yuan H, Korber B, Maddon PJ, et al. (1995) Cross-clade neutralization of primary isolates of human immunodeficiency virus type 1 by human monoclonal antibodies and tetrameric CD4-IgG. J Virol 69: 6609–6617.
  21. 21. Utachee P, Jinnopat P, Isarangkura-Na-Ayuthaya P, de Silva UC, Nakamura S, et al. (2009) Phenotypic studies on recombinant human immunodeficiency virus type 1 (HIV-1) containing CRF01_AE env gene derived from HIV-1-infected patient, residing in central Thailand. Microbes Infect 11: 334–343.
  22. 22. Walker LM, Phogat SK, Chan-Hui PY, Wagner D, Phung P, et al. (2009) Broad and potent neutralizing antibodies from an African donor reveal a new HIV-1 vaccine target. Science 326: 285–289.
  23. 23. Gorny MK, Xu JY, Gianakakos V, Karwowska S, Williams C, et al. (1991) Production of site-selected neutralizing human monoclonal antibodies against the third variable domain of the human immunodeficiency virus type 1 envelope glycoprotein. Proceedings of the National Academy of Sciences of the United States of America 88: 3238–3242.
  24. 24. Javaherian K, Langlois AJ, LaRosa GJ, Profy AT, Bolognesi DP, et al. (1990) Broadly neutralizing antibodies elicited by the hypervariable neutralizing determinant of HIV-1. Science 250: 1590–1593.
  25. 25. Wu X, Yang ZY, Li Y, Hogerkorp CM, Schief WR, et al. (2010) Rational design of envelope identifies broadly neutralizing human monoclonal antibodies to HIV-1. Science 329: 856–861.
  26. 26. Posner MR, Hideshima T, Cannon T, Mukherjee M, Mayer KH, et al. (1991) An IgG human monoclonal antibody that reacts with HIV-1/GP120, inhibits virus binding to cells, and neutralizes infection. Journal of immunology 146: 4325–4332.
  27. 27. Goudsmit J, Boucher CA, Meloen RH, Epstein LG, Smit L, et al. (1988) Human antibody response to a strain-specific HIV-1 gp120 epitope associated with cell fusion inhibition. AIDS 2: 157–164.
  28. 28. Sagar M, Wu X, Lee S, Overbaugh J (2006) Human immunodeficiency virus type 1 V1-V2 envelope loop sequences expand and add glycosylation sites over the course of infection, and these modifications affect antibody neutralization sensitivity. J Virol 80: 9586–9598.
  29. 29. Moore PL, Gray ES, Choge IA, Ranchobe N, Mlisana K, et al. (2008) The c3-v4 region is a major target of autologous neutralizing antibodies in human immunodeficiency virus type 1 subtype C infection. J Virol 82: 1860–1869.
  30. 30. Wei X, Decker JM, Wang S, Hui H, Kappes JC, et al. (2003) Antibody neutralization and escape by HIV-1. Nature 422: 307–312.
  31. 31. Jullaksorn D, Boonchawalit S, Uttiyoung J, Soonthornsata B, Yowang A, et al. (2010) Sustained appearance of drug resistance-associated mutations in HIV-1 CRF01_AE protease and reverse transcriptase derived from protease inhibitor-naive Thai patients. Southeast Asian J Trop Med Public Health 41: 347–357.
  32. 32. Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673–4680.
  33. 33. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, et al. (2011) MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Molecular biology and evolution 28: 2731–2739.
  34. 34. Shannon CE (1948) A Mathematical Theory of Communication. The Bell System Technical Journal 27: 379–423, 623-656.
  35. 35. Pond SL, Frost SD, Muse SV (2005) HyPhy: hypothesis testing using phylogenies. Bioinformatics 21: 676–679.
  36. 36. Curran R, Ball JK (2002) Concordance between semen-derived HIV-1 proviral DNA and viral RNA hypervariable region 3 (V3) envelope sequences in cases where semen populations are distinct from those present in blood. Journal of medical virology 67: 9–19.
  37. 37. Ball JK, Holmes EC, Whitwell H, Desselberger U (1994) Genomic variation of human immunodeficiency virus type 1 (HIV-1): molecular analyses of HIV-1 in sequential blood samples and various organs obtained at autopsy. The Journal of general virology 75(Pt 4): 67–79.
  38. 38. Karlsson AC, Lindback S, Gaines H, Sonnerborg A (1998) Characterization of the viral population during primary HIV-1 infection. AIDS 12: 839–847.
  39. 39. Shankarappa R, Margolick JB, Gange SJ, Rodrigo AG, Upchurch D, et al. (1999) Consistent viral evolutionary changes associated with the progression of human immunodeficiency virus type 1 infection. J Virol 73: 10489–10502.
  40. 40. Wyatt R, Kwong PD, Desjardins E, Sweet RW, Robinson J, et al. (1998) The antigenic structure of the HIV gp120 envelope glycoprotein. Nature 393: 705–711.
  41. 41. Beale RC, Petersen-Mahrt SK, Watt IN, Harris RS, Rada C, et al. (2004) Comparison of the differential context-dependence of DNA deamination by APOBEC enzymes: correlation with mutation spectra in vivo. Journal of molecular biology 337: 585–596.
  42. 42. Liddament MT, Brown WL, Schumacher AJ, Harris RS (2004) APOBEC3F properties and hypermutation preferences indicate activity against HIV-1 in vivo. Current biology : CB 14: 1385–1391.
  43. 43. Lamers SL, Sleasman JW, She JX, Barrie KA, Pomeroy SM, et al. (1993) Independent variation and positive selection in env V1 and V2 domains within maternal-infant strains of human immunodeficiency virus type 1 in vivo. J Virol 67: 3951–3960.
  44. 44. Yamaguchi Y, Gojobori T (1997) Evolutionary mechanisms and population dynamics of the third variable envelope region of HIV within single hosts. Proc Natl Acad Sci U S A 94: 1264–1269.
  45. 45. Frost SD, Gunthard HF, Wong JK, Havlir D, Richman DD, et al. (2001) Evidence for positive selection driving the evolution of HIV-1 env under potent antiviral therapy. Virology 284: 250–258.
  46. 46. Bagnarelli P, Mazzola F, Menzo S, Montroni M, Butini L, et al. (1999) Host-specific modulation of the selective constraints driving human immunodeficiency virus type 1 env gene evolution. J Virol 73: 3764–3777.
  47. 47. Rong R, Li B, Lynch RM, Haaland RE, Murphy MK, et al. (2009) Escape from autologous neutralizing antibodies in acute/early subtype C HIV-1 infection requires multiple pathways. PLoS pathogens 5: e1000594.
  48. 48. Utachee P, Jinnopat P, Isarangkura-Na-Ayuthaya P, de Silva UC, Nakamura S, et al. (2009) Genotypic Characterization of CRF01_AE env Genes Derived from Human Immunodeficiency Virus Type 1-Infected Patients Residing in Central Thailand. AIDS Res Hum Retroviruses 25: 229–236.
  49. 49. Naganawa S, Yokoyama M, Shiino T, Suzuki T, Ishigatsubo Y, et al. (2008) Net positive charge of HIV-1 CRF01_AE V3 sequence regulates viral sensitivity to humoral immunity. PLoS ONE 3: e3206.
  50. 50. Dieltjens T, Loots N, Vereecken K, Grupping K, Heyndrickx L, et al. (2009) HIV type 1 subtype A envelope genetic evolution in a slow progressing individual with consistent broadly neutralizing antibodies. AIDS Res Hum Retroviruses 25: 1165–1169.
  51. 51. Archary D, Gordon ML, Green TN, Coovadia HM, Goulder PJ, et al. (2010) HIV-1 subtype C envelope characteristics associated with divergent rates of chronic disease progression. Retrovirology 7: 92.
  52. 52. Gnanakaran S, Lang D, Daniels M, Bhattacharya T, Derdeyn CA, et al. (2007) Clade-specific differences between human immunodeficiency virus type 1 clades B and C: diversity and correlations in C3-V4 regions of gp120. J Virol 81: 4886–4891.
  53. 53. Rong R, Gnanakaran S, Decker JM, Bibollet-Ruche F, Taylor J, et al. (2007) Unique mutational patterns in the envelope alpha 2 amphipathic helix and acquisition of length in gp120 hypervariable domains are associated with resistance to autologous neutralization of subtype C human immunodeficiency virus type 1. J Virol 81: 5658–5668.
  54. 54. Mens H, Pedersen AG, Jorgensen LB, Hue S, Yang Y, et al. (2007) Investigating signs of recent evolution in the pool of proviral HIV type 1 DNA during years of successful HAART. AIDS Res Hum Retroviruses 23: 107–115.
  55. 55. Rerks-Ngarm S, Pitisuttithum P, Nitayaphan S, Kaewkungwal J, Chiu J, et al. (2009) Vaccination with ALVAC and AIDSVAX to prevent HIV-1 infection in Thailand. N Engl J Med 361: 2209–2220.
  56. 56. Currier JR, Ngauy V, de Souza MS, Ratto-Kim S, Cox JH, et al. (2010) Phase I Safety and Immunogenicity Evaluation of MVA-CMDR, a Multigenic, Recombinant Modified Vaccinia Ankara-HIV-1 Vaccine Candidate. PLoS ONE 5: e13983.