Massively parallel pyrosequencing highlights minority variants in the HIV-1 env quasispecies deriving from lymphomonocyte sub-populations

  • Gabriella Rozera1,

    Affiliated with

    • Isabella Abbate1Email author,

      Affiliated with

      • Alessandro Bruselles1,

        Affiliated with

        • Crhysoula Vlassi2,

          Affiliated with

          • Gianpiero D'Offizi2,

            Affiliated with

            • Pasquale Narciso2,

              Affiliated with

              • Giovanni Chillemi3,

                Affiliated with

                • Mattia Prosperi1,

                  Affiliated with

                  • Giuseppe Ippolito4 and

                    Affiliated with

                    • Maria R Capobianchi1

                      Affiliated with


                      DOI: 10.1186/1742-4690-6-15

                      Received: 10 November 2008

                      Accepted: 12 February 2009

                      Published: 12 February 2009



                      Virus-associated cell membrane proteins acquired by HIV-1 during budding may give information on the cellular source of circulating virions. In the present study, by applying immunosorting of the virus and of the cells with antibodies targeting monocyte (CD36) and lymphocyte (CD26) markers, it was possible to directly compare HIV-1 quasispecies archived in circulating monocytes and T lymphocytes with that present in plasma virions originated from the same cell types. Five chronically HIV-1 infected patients who underwent therapy interruption after prolonged HAART were enrolled in the study. The analysis was performed by the powerful technology of ultra-deep pyrosequencing after PCR amplification of part of the env gene, coding for the viral glycoprotein (gp) 120, encompassing the tropism-related V3 loop region. V3 amino acid sequences were used to establish heterogeneity parameters, to build phylogenetic trees and to predict co-receptor usage.


                      The heterogeneity of proviral and viral genomes derived from monocytes was higher than that of T-lymphocyte origin. Both monocytes and T lymphocytes might contribute to virus rebounding in the circulation after therapy interruptions, but other virus sources might also be involved. In addition, both proviral and circulating viral sequences from monocytes and T lymphocytes were predictive of a predominant R5 coreceptor usage. However, minor variants, segregating from the most frequent quasispecies variants, were present. In particular, in proviral genomes harboured by monocytes, minority variant clusters with a predicted X4 phenotype were found.


                      This study provided the first direct comparison between the HIV-1 quasispecies archived as provirus in circulating monocytes and T lymphocytes with that of plasma virions replicating in the same cell types. Ultra-deep pyrosequencing generated data with some order of magnitude higher than any previously obtained with conventional approaches. Next generation sequencing allowed the analysis of previously inaccessible aspects of HIV-1 quasispecies, such as co-receptor usage of minority variants present in archived proviral sequences and in actually replicating virions, which may have clinical and therapeutic relevance.


                      The error prone nature of HIV-1 reverse transcriptase, combined with the high replicative activity of the virus, results, in each infected individual, in the formation of many genetically related viral variants referred to as quasispecies, in which most viral sequences differ from all others. This variability is the substrate for the selective pressure exerted by drugs or by the immune system, leading to the continuous evolution of HIV-1 in the infected host [1, 2]. The most variable part of the HIV-1 genome is the region coding for the V3 loop of HIV-1 surface glycoprotein (gp120) that is involved in the coreceptor binding [3]. Shortly after primary infection, viral heterogeneity is relatively low and progressively increases in the absence of treatment [4, 5]. During the natural history of the infection, compartmentalized viral replication in different cell types may contribute to virion diversity, and ultimately may determine the segregation of viral clusters in different body sites [69]. While recent reports show that in patients treated with antiviral drugs HIV-1 quasispecies present in monocytes may evolve in clusters segregated from viral quasispecies harboured by lymphocytes [10, 11], most HIV-1 compartmentalization studies have focused mainly on proviral DNAs in lymphomonocyte populations [1016].

                      However, HIV-1 proviral DNA represents an archive of viral variants, including those acquired in the past, that may not necessarily reflect the viral population replicating at every given time, which makes the evaluation of how the different cell sources impact the circulating HIV-1 quasispecies rather difficult.

                      Cell-derived antigens acquired during the budding process serve as markers of the virus cellular origin [1721]. Consequently, using cell type-specific antibodies when studying plasma virions may aid in identifying viral populations originating in vivo from different cellular sources [1922]. In the present study, we analyzed five patients who experienced therapy interruptions after prolonged periods of highly active antiretroviral therapy (HAART). Stringent inclusion criteria are detailed in the Materials and Methods section. Considering that, as recently shown, both activated immature monocyte/macrophage (CD36 positive) and CD4 T cell (CD26 positive) compartments contribute to viral load [22], proviral V3 quasispecies harboured by these cells at therapy interruption (T0) were compared to quasispecies present in circulating viral RNA genomes one month later (T1). Monocyte- and lymphocyte-enriched cell sources were obtained by immunosorting with anti-CD36 and anti-CD26 monoclonal antibodies; the same antibodies were used to sort circulating virions originated from the same cell lineages as previously described [19].

                      To study the V3 quasispecies, an innovative and powerful technology was used: the ultra-deep pyrosequencing, performed with the 454 Life Sciences platform (GS-FLX, distributed by Roche). By this approach it is possible to analyze simultaneously thousands of clonally amplified PCR amplicons, increasing the probability of identifying minority variants, as already shown in [2325] for rare HIV drug resistance mutations.

                      After sequencing, heterogeneity parameters were calculated for both proviral and circulating virion amino acid sequences. Phylogenetic analysis was performed to identify the genetic relationship between viral genomes from different sources. Co-receptor usage was deduced from the V3 region sequence of each variant.

                      Materials and methods


                      The enrolled subjects were 5 chronically HIV-1 infected patients who underwent therapy interruption after prolonged treatment with effective HAART (overall extent ≥ 5 years, with at least 2 consecutive years before enrollment). Therapeutic regimens included combinations of two NRTIs and either an NNRTI or a ritonavir-boosted PI. Pt. 2 underwent a therapy interruption cycle 3 years before the present study.

                      The eligibility criteria to interrupt HAART were: CD4+>350/microliter; nadir CD4+>100/microliter; absence of virological failure during the last HAART course (≥ 2 years); CDC classification: A or B. The project was approved by the Institutional Ethics Committee, and the patients agreed to participate by signing an informed consent. Demographic, clinical and virological data are reported in Table 1. Plasma viremia was determined using the Versant HIV RNA test, version 3.0 (Siemens Medical Solutions).
                      Table 1

                      Demographic, clinical and virological features of the study patients





                      Time of



                      Total time on



                      NADIR CD4











































                      > 5,70





















                      * M, male; F, female.

                      ** T0: at the time of HAART interruption

                      *** T1: one month after HAART interruption

                      All patients were infected with HIV-1 subtype B, as determined by sequence analysis of env region, according to the Los Alamos genotyping algorithm.

                      Immunosorting of lymphomonocyte subpopulations and of HIV-1 virions

                      Monocytes and T lymphocyte cells were purified by immunomagnetic sorting (Miltenyi Biotec, Bologna, Italy) from freshly isolated peripheral blood mononuclear cells (PBMC) by positive selection using anti-CD36 (clone CLB-703, Monosan) antibody for monocytes and anti CD26 (clone M-A261, Pharmingen) antibody for T-cells.

                      HIV-1 virions from patients' plasma, originating from either monocytes or T lymphocytes, were sorted by immobilized antibody capture (IAC), using the same antibodies used for PBMC sorting (anti CD36 and anti CD26 antibodies) as described elsewhere [21]. To rule out the possibility that soluble molecules present in the fluids could inhibit virus binding to specific MAbs, IAC was performed on HIV-1 purified according to published procedures [19] by applying 50 μl of purified virion preparations, diluted as appropriate to contain 50,000–100,000 RNA copies/ml, on 96 wells PRO-BIND Assay Plates (Becton Dickinson) coated with the specific Mabs. As IAC yield for each antibody ranged between 5 and 10% of input virus, to obtain sufficient amounts of virions to undergo ultra-deep pyrosequencing, HIV-1 captured by 10 different wells coated with either anti-CD36 and or anti-CD26 were pooled.

                      To validate the method used to sort the viral quasispecies originating from the two cellular sources, two HIV-1 strains, genetically distinct and with different co-receptor usage (i.e. the reference R5 strain HIV-1 BaL and a clinical X4 isolate) were grown on monocytes derived macrophages (MDM) and on PHA activated CD4+ T lymphocytes, respectively. The R5 and the X4 viral preparations were mixed at a ratio of 1:9 and applied to 2 sets of immuno-capture wells, coated with anti-CD36 and anti-CD26 antibodies, respectively. Then the immuno-captured virions were analyzed by ultra-deep pyrosequencing. By this approach, only 0.51% of virions captured by anti-CD36 actually represented virions with a V3 sequence matching that of the X4 strain grown on CD4+ T lymphocytes; in parallel, only 0.12% of virions captured by anti-CD26 segregated with the R5 strain grown on MDM. These results indicate that the cell lineage-specific antibodies in fact captured the virions originating from the corresponding cellular subpopulation, with high specificity (see Additional file 1).

                      To further establish the specificity of the virion immunocapture method, binding to the control antibody anti-CD19 was evaluated. The proportion of virions captured by anti-CD19 was <0.001% for R5 strain grown on MDM and 0.195% for the X4 strain grown on CD4+ T lymphocytes. The proportion of plasma virions from the study patients captured by the control antibody was consistently low (in the range of 0.3–0.7%).

                      Nucleic acid extraction, RT and PCR conditions

                      Total DNA extraction from CD36 and CD26 cells was performed by using a DNA blood kit (Qiagen, Hilden, Germany). HIV-1 RNA from immunocaptured virions underwent extraction using the QIAamp Viral RNA kit (Qiagen). Retrotranscription was performed by random hexamer extension with M-MuLV Reverse Transcriptase (Roche) for 1 h at 42°C followed by 15 min at 65°C.

                      For V3 region amplifications, two rounds of 35 cycles (95°C for 2 min, 95° for 30 sec, annealing at 60°C for 30 sec, extension at 72°C for 45 sec and final elongation at 72°C for 5 min.) were carried out using a proof-reading enzyme (Fast Start High fidelity enzyme, Roche): outer sense primer ATGGGATCAAAGCCTAAAGCCATGTG (position 6556–6581 in HXB2), outer antisense primer AGTGCTTCCTGCTGCTCCCAAGAACCCAAG (position 7822–7792 in HXB2), inner sense primer GCCTCCCTCGCGCCATCAG TGGCAGTCTAGCAGAAGAAG (position 7010 to 7029 in HXB2) and inner antisense primer GCCTTGCCAGCCCGCTCAG CTGGGTCCCCTCCTGAGG (position 7332 to 7315 in HXB2). The inner primers included 5' extensions (underlined) which provided binding sites for pyrosequencing (see below).

                      To maximize the number of templates undergoing ultra-deep pyrosequencing, we pooled a number of 2 to 10 different PCR reactions for each sample type.

                      In addition, in order to avoid possible cross-contamination of the amplicons, each sample was handled in separate time frames, and accurate decontamination was performed after each manipulation of the nucleic acids.

                      Ultra-deep pyrosequencing

                      Ultra-deep pyrosequencing was carried out with the 454 Life Sciences platform (GS-FLX, Roche Applied Science). PCR products were clonally amplified on capture beads in water-in-oil emulsion micro-reactors, and pyrosequencing was performed by using one of 16 lanes of a 70 × 75 mm PicoTiterPlate for each sample, following the standard approach for PCR amplicons sequencing. For each sample an SFF file was obtained, from which nucleotide sequence data were extracted.

                      UDPS error rate estimation and correction algorithm

                      To measure the accuracy of the ultra-deep pyrosequencing, a plasmid clone containing the region of interest was sequenced in parallel by ultra-deep pyrosequencing and by the Sanger method. The plasmid clone was obtained from a patient's sample by inserting a PCR amplicon spanning nucleotide positions 6,989 to 7,667 (reference strain HXB2) into a pCR4-TOPO vector (Invitrogen Corp.). Sanger sequencing of the clone was performed on ABI Prism 310, using the BigDye Terminator cycle sequencing kit, following the manufacturer's instructions (Applied Biosystems Warrington, UK). Any differences between the two methods were considered to be a GS-FLX sequencing error. Because it has been previously reported that the pyrosequencing error rate is higher in regions with nucleotide repeats of three or more identical bases, defined as homopolymeric [26], we determined the error rates separately in homopolymeric and in non-homopolymeric regions. The error rate in homopolymeric regions (3 to 5 identical nucleotides) was 0.0097 ± 0.0056 (mean ± SEM), whereas in non-homopolymeric regions it was 0.0024 ± 0.0009; the overall error rate was 0.0043 ± 0.0016.

                      To be noted, the plasmid used for the evaluation contained a highly polymeric region of 6 adjacent adenines (A), that was read as an incomplete extension of 5 A-homopolymer (Additional file 2). However, this particular pattern was present only in CD36-virus from one patient, where it represented 21% of the total clones from this source. Of these, only 3% were read well, whereas the remaining 18% showed a 5 A-homopolymer. The latter situation introduced a stop codon in the sequences, determining their elimination during the subsequent correction process of the sequences. Therefore the error appeared to have no consequence on the overall results.

                      Based on these considerations, and with the aim of considering only the sequences leading to functional products, we adopted the following correction algorithm. Nucleotide sequences from each sample were divided into two separate files, one for the forward reads and one for the reverse ones. These sequences were then translated into amino acids with EMBOSS [27] using all possible frames; only those translated with the right open reading frame (ORF) were retained.

                      Multiple alignments of the amino acid sequence files were then constructed using the software MUSCLE [28] (default options) and trimmed at the 5' and 3' termini to include only a region potentially covered by both forward and reverse reads. The env region resulting from this trimming consisted of about 66 amino acids and encompassed the V3 loop (-16 before and +15 after V3 loop).

                      To reduce the error rate, sequences containing ambiguous bases (Ns) were also discarded; this reduced the error rate due to the possible presence of reads coming from multi-templated beads [29]. Then, for each sample we compared the forward sequence datasets with the reverse ones and clustered them, reporting only identical matches between the two.

                      The application of the correction algorithm led to a reduction in the number of sequences for each sample type that underwent quasispecies analysis (see Additional file 3); the unique variants for each sample type obtained after de-replication were used for the construction of the phylogenetic trees.

                      All the sequences obtained from the patients have been compared to the sequences of reference HIV strains present in the laboratory to rule out possible contamination with these amplicons.

                      Heterogeneity parameters calculation and construction of phylogenetic trees

                      After the application of the above mentioned sequence correction algorithm, the resulting amino acid sequences were used for quasispecies analysis. Although nucleotide sequences provide more information to the end of heterogeneity and phylogenetic analyses, the choice to use amino acid sequences was forced by the limitation of existing bioinformatic tools, designed for medium-small scale size of data sets. However, we were able to perform a limited phylogenetic analysis based on nucleotide sequences from the patient displaying the smallest number of unique variants (i.e. pt 4), obtaining a tree shape substantially equivalent to that based on amino acid sequences (not shown). To assess diversity, the mean genetic distance of amino acid sequences was calculated by PROTDIST using Jones-Taylor-Thornton matrix and with an in-house written code. Quasispecies complexity was calculated using normalized Shannon (S) entropy [21]. The neighbor-joining method was used to construct individual phylogenetic trees for each patient. Bootstrap analysis of 1,000 replicates was used to place approximate confidential limits on individual branches. All the algorithms for quasispecies analysis were included in the MEGA package Version 4.0.1. Although the individual patients' trees are built using only unique variants in each sample type (CD36-provirus, CD26-provirus, CD36-virus, CD26-virus), the relative abundance of sequences included in the clusters and their mean PSSM score were referred to all the variants in each sample type.

                      Prediction of coreceptor usage by position specific score matrices (PSSM)

                      PSSM analysis for HIV-1 subtype B was applied to the V3 amino acid sequences obtained by ultra-deep pyrosequencing to obtain a score for co-receptor usage prediction as described elsewhere [30]. In general, the higher the score, the higher the probability that the given V3 sequence uses CXCR4 co-receptor. The 95th and 5th percentiles for X4 and R5 tropism are > -2.88 and < -6.96, respectively. As reference, the score of the R5 strain HIV-1 BaL is -12.96, while the score of the X4 strain HIV-1 HXB2 is +3.47.


                      V3 loop heterogeneity in HIV from monocytes and T lymphocytes

                      The characteristics of the 5 study patients are reported in Table 1. Four types of samples were analyzed: 1) proviral DNA from monocytes (CD36-provirus); 2) proviral DNA from T lymphocytes (CD26-provirus); 3) viral RNA from anti CD36-captured virions (CD36-virus); 4) viral RNA from anti CD26-captured virions (CD26-virus). For 2 patients (Pt. 1 and Pt. 2) the analysis was restricted to provirus, since the rebounding HIV-1 viremia was too low to perform immuno-capture and subsequent quasispecies analysis.

                      The GS-FLX sequencing platform generated an average of 13,456 reads per sample, determining a median coverage of 12,609 reads per nt. A correction algorithm, substantially based on the translation in amino acid sequences and on the retention of only coding sequences, was applied. The number of filtered amino acid sequences used for quasispecies analysis is reported in Additional file 3. The application of this algorithm restricted the analysis only to sequences with a phenotypic significance and at the same time reduced the computational burden, allowing us to use conventional softwares for phylogenetic analysis.

                      The number of unique variants, diversity and complexity of HIV-1 V3 loop for CD36-provirus, CD26-provirus, CD36-virus and CD26-virus are shown in Table 2. Diversity was significantly correlated with complexity (r = 0.72, p = 0.0017, in Pearson correlation test). All these parameters in the monocyte compartment (including proviral and viral HIV-1) were generally higher than those found in T-lymphocytes. In fact, with the exception of one patient (Pt.1), in all patients unique proviral sequences deriving from CD36 cells were more numerous than those harboured by CD26 cells; similarly, with the exception of one patient (Pt 4), the diversity was higher in CD36 provirus as compared to CD26 provirus. Concerning circulating virions, the situation was more variable. On the whole, mean diversity of proviral+viral sequences of CD36-derived HIV-1 was significantly higher than CD26-derived HIV-1 (0.060 ± 0.015 vs 0.022 ± 0.006, p = 0.027 in Student's t test).
                      Table 2

                      Amino acid heterogeneity of V3 region of HIV-1 proviral quasispecies harboured by monocytes (CD36) and T lymphocytes (CD26) and of plasma virions replicating in the two cell types after HAART interruption.


                      Sample type


                      N. of unique variants*







                      0.0795 ± 0.0005





                      0.0197 ± 0.0001






                      0.1446 ± 0.0005





                      0.0031 ± 0.0001






                      0.0508 ± 0.0003





                      0.0084 ± 0.0002






                      0.0428 ± 0.0010





                      0.0298 ± 0.0003






                      0.0226 ± 0.0007





                      0.0457 ± 0.0003






                      0.0113 ± 0.0002





                      0.0208 ± 0.0002






                      0.0585 ± 0.0003





                      0.0411 ± 0.0004






                      0.0728 ± 0.0005





                      0.0042 ± 0.0001


                      Sequences obtained by ultra-deep pyrosequencing were filtered through the correction algorithm and amino acid sequences obtained were analyzed to establish diversity and complexity, as described in the Materials and Methods section.

                      * Number of unique sequences obtained after dereplication

                      ** Diversity: p distance ± SEM

                      *** Complexity: normalized Shannon entropy

                      Individual phylogenetic analysis of HIV-1 env region from monocytes and T lymphocytes

                      The repertoire of V3 amino acid sequences from each patient was also used to build individual phylogenetic trees that are shown in Figures 1, 2, 3, 4 and 5.
                      Figure 1

                      Individual phylogenetic tree of HIV-1 V3 amino acid sequences from Pt.1. Proviral quasispecies harboured by monocytes (red circle = CD36-provirus) and T lymphocytes (green circle = CD26-provirus) of Pt.1 were analyzed after long term suppression of viremia. Nucleotide sequences obtained by ultra-deep pyrosequencing were translated in amino acid sequences and filtered through the correction algorithm described in the Material and Methods section. Unique sequences for each sample type were used to build phylogenetic trees with the neighbour-joining method. Bootstrap values ≥ 80% are shown. The cellular sources of proviral and viral sequences are indicated by coloured bullets. Sample type-specific cluster with bootstrap values ≥ 80% are encircled with the corresponding colours. PSSM score was calculated for each of these clusters, and the corresponding values are included in the insert, together with their relative abundance in the corresponding sample type. Bars indicate p distance scale.

                      Figure 2

                      Individual phylogenetic tree of HIV-1 V3 amino acid sequences from Pt.2. Proviral quasispecies harboured by monocytes (red circle = CD36-provirus) and T lymphocytes (green circle = CD26-provirus) of Pt.2 were analyzed, as in Fig. 1.

                      Figure 3

                      Individual phylogenetic tree of HIV-1 V3 amino acid sequences from Pt.3. Proviral quasispecies harboured by monocytes (red circle = CD36-provirus) and T lymphocytes (green circle = CD26-provirus) of Pt.3 were analyzed as in Fig. 1. In addition the analysis also included from the same patient virions rebounding into the circulation one month after therapy interruption and sorted by immunocapture with monoclonal antibodies to CD36 (blue circle = CD36-virus) and CD26 (purple circle = CD26-virus).

                      Figure 4

                      Individual phylogenetic tree of HIV-1 V3 amino acid sequences from Pt.4. Proviral quasispecies harboured by monocytes (red circle = CD36-provirus) and T lymphocytes (green circle = CD26-provirus), virions rebounding into the circulation one month after therapy interruption (blue circle = CD36-virus) and CD26 (purple circle = CD26-virus), from Pt.4 were analyzed as in Fig. 3.

                      Figure 5

                      Individual phylogenetic tree of HIV-1 V3 amino acid sequences from Pt.5. Proviral quasispecies harboured by monocytes (red circle = CD36-provirus) and T lymphocytes (green circle = CD26-provirus), virions rebounding into the circulation one month after therapy interruption (blue circle = CD36-virus) and CD26 (purple circle = CD26-virus), from Pt.5 were analyzed as in Fig. 3.

                      On the whole, consistent with heterogeneity data, the phylogenetic trees showed a more heterogeneous quasispecies in provirus harboured by CD36 cells than in provirus harboured by CD26 cells. Both monocytes and T lymphocytes might contribute to virus rebounding in the circulation after one month from therapy interruptions, although other virus sources might be also involved. In 3 out of 5 patients (Fig. 1, 2 and 4) monophyletic clusters of sequences deriving from each of the cell source could be highlighted. On the basis of the phylogenetic relationships between virions shed into the circulation and the proviral sequences archived in the two cellular sources, 3 distinct situations could be identified in the 3 patients where this analysis could be performed: i) in Pt. 4 (Fig. 4) the circulating virus (carrying either CD26 or CD36 on its surface) segregated with provirus harboured by T lymphocytes, implying that, apparently, the provirus harboured by monocytes before therapy interruption did not contribute to viremia rebound, at least at this time point; ii) in Pt. 3 (Fig. 3) both archived and replicating virus sequences were interspersed, preventing the identification of reservoir-specific proviral clusters; iii) in Pt. 5 (Fig. 5), the situation was similar to Pt. 3, but in addition, a segregating cluster of circulating virions displaying the CD36 surface marker could be identified that seemed not to be related to the provirus harboured by either circulating cell compartment. It is possible that the lack of separation of proviral quasispecies from CD36 and CD26 cells in patients 3 and 5 simply reflects a mere artefact. However, in our opinion, the contamination of CD36 provirus with lymphocyte-derived viruses is unrealistic, since data from Table 2 suggest that CD36 provirus from these two patients had greater variability (as both n. of variants and diversity) than the corresponding CD26 provirus, thereby ruling out that the CD36 provirus was contaminated by CD26 provirus. The proportionally fewer proviral sequences in monocytes is a more likely explanation. In fact, in the present study as in a previous one from our group [31], proviral load in monocytes was about 1/5 of that present in T lymphocytes. However, also to this regard the higher variability of CD36 provirus renders unlikely that a mere disproportion between the proviral quasispecies from the two cell compartments may fully explain the lack of separation.

                      Co-receptor usage prediction of viral quasispecies

                      The sequence repertoire obtained by ultra-deep pyrosequencing was also used to obtain a prediction of co-receptor usage, by PSSM analysis of V3 loop amino acid sequences. The distribution of variants according to their score value for each patient is shown in Figures 6, 7, 8, 9 and 10. Some general findings could be identified. For both proviral and circulating viral genomes in monocytes and T lymphocytes, the results indicated a predominant CCR5 coreceptor usage (R5 phenotype). However, the intra-patient range of PSSM score values was rather wide, particularly in the archived proviral sequences, highlighting the presence of minority sequences (from <0.1% to 4.2%) with a clear CXCR4 coreceptor usage (X4 phenotype) only in archived sequences from at least one cellular reservoir in 4 patients (Fig 6, 7, 9, 10), while only one patient (Pt. 3, Fig. 8) harboured exclusively R5 populations. None of the patients showed X4 variants in circulating virus, as the minimum value obtained by PSSM prediction was -2.99.
                      Figure 6

                      PSSM score distribution of V3 amino acid sequences from Pt.1. The PSSM score distribution of all the detected variants from CD36- and CD26-provirus present in Pt.1 at the moment of therapy interruption are shown. Vertical orange lines indicate 95th and 5th percentiles for CXCR4 and CCR5 predicted co-receptor usage (> -2.88 and < -6.96 respectively).

                      Figure 7

                      PSSM score distribution of V3 amino acid sequences from Pt.2. The PSSM score distributions of all the variants from CD36- and CD26- provirus detected in Pt.2 at the moment of therapy interruption are shown. Vertical orange lines indicate percentiles for predicted co-receptor usage as in Fig. 6.

                      Figure 8

                      PSSM score distribution of V3 amino acid sequences from Pt.3. The PSSM score distribution of all variants from CD36- and CD26-provirus detected in Pt.3 at the moment of therapy interruption are shown together with those from CD36- and CD26- captured virus present one month after therapy stop. Vertical orange lines indicate percentiles for predicted co-receptor usage as in Fig. 6.

                      Figure 9

                      PSSM score distribution of V3 amino acid sequences from Pt.4. The PSSM score distribution of all variants from CD36- and CD26-provirus detected in Pt.4 at the moment of therapy interruption are shown together with those from CD36- and CD26-captured virus present one month after therapy stop. Vertical orange lines indicate percentiles for predicted co-receptor usage as in Fig. 6.

                      Figure 10

                      PSSM score distribution of V3 amino acid sequences from Pt.5. The PSSM score distribution of all variants from CD36- and CD26-provirus detected in Pt.5 at the moment of therapy interruption are shown together with those from CD36- and CD26-captured virus present one month after therapy stop. Vertical orange lines indicate percentiles for predicted co-receptor usage as in Fig. 6.

                      In addition, we calculated the mean score for the sample-specific clusters segregating with bootstrap values ≥ 80% to see if the phylogenetic segregation could be associated with different biological properties, namely coreceptor usage. Individual scores of segregating clusters from each sample type are shown in the inserts of Fig. 1, 2, 3, 4, 5, together with their relative abundance in the corresponding sample type. It is noteworthy that two clusters of CD36-provirus, identified in Pt. 2 and Pt. 4 (Fig. 2 and 4), displayed PSSM scores clearly predicting X4 phenotype (+3.73 and +3.76). However, these clusters represented only 1.4% and 4.2%, respectively, of all CD36-provirus sequences in the corresponding patients. We also found a single cluster of CD26-provirus in Pt. 4, including only 1.2% of CD26-provirus sequences, with a mean PSSM score of -3.75, when mean CD26 provirus score was -8.40.

                      Concerning circulating virus, in Pt. 5 less negative score values were observed for CD26-virus (-4.85) as compared to CD36-virus (-8.43). In addition, in this patient the segregating cluster of CD36-virus presented a mean score value (-9.02) (Fig. 5) more negative than the mean values of CD36-virus. Also in this case, this cluster included only a minority (6.7%) of all CD36-virus sequences.


                      This study provided a direct comparison between the HIV-1 quasispecies archived in monocytes and T lymphocytes with that present in circulating virions replicating in the same cell types after one month from therapy interruption, exploiting the presence of cell lineage markers on viral envelope. In fact, this method proved to be highly specific in sorting the virions originating from monocytes and CD4 T lymphocytes, respectively. The amount of plasma virions captured by anti-CD26 and -CD36 antibodies ranged between 5 and 10%. However, a similar proportion of virions immuno-captured by these two antibodies did not necessarily mean that the amount of virions carrying these molecules was similar. In fact, a number of factors may explain this apparent paradox, including the asymmetric position of cell-derived molecules on the virion envelope, and the different affinity of the monoclonal antibodies used for the immuno-capture, as claimed in previous study [32].

                      The analysis took advantage of ultra-deep pyrosequencing based on the powerful technology of massively parallel sequencing that enabled us to perform a very detailed analysis of viral quasispecies. The data set obtained for each patient is some order of magnitude higher than any previously studied with conventional approaches based on cloning of PCR products or on limiting dilution PCR [11, 1315]. This allowed us to analyze previously inaccessible aspects of HIV-1 quasispecies, such as coreceptor usage of minority variants present in archived proviral sequences and in actually replicating virions.

                      One of the main findings of this study is that the heterogeneity of provirus associated with monocytes/macrophages after prolonged HAART is generally higher than that of T lymphocytes, even though in this study the proviral load in monocytes was about 1/5 of that present in T lymphocytes, in agreement with our previous findings [31].

                      It has been previously shown that during therapy, HIV-1 variants harboured by monocytes may evolve more rapidly than those harboured by T cells [11, 14], eventually leading to segregation of viral quasispecies present in the two different cell types. This is probably due to the fact that monocytes and T lymphocytes may show different sensitivity to individual antiviral drugs, with residual ongoing viral replication in monocytes also in the presence of suppressive therapy. However, a greater diversity in HIV quasispecies harboured by monocyte/macrophages may be also due to other possibilities, including larger numbers of locally replicating quasispecies prior to treatment. In favour of this hypothesis recent data by Joos et al. [33] showed that rebounding virus after therapy interruption was phylogenetically older than virus present at the moment of therapy start. These findings argued against the possibility that rebounding virus during therapy interruptions originated from viral populations undergoing low-level persistent replication during suppressive therapy Instead, they are consistent with the hypothesis of virus reactivation from latent reservoirs.

                      R5 variants are generally predominant in our patients, even long after primary infection. This is in accordance with numerous studies showing that most of the patients' isolates, even at late stages of infection, are generally R5/monocytotropic or R5/X4 dual-tropic [34, 35]. Furthermore, the results of the present study indicated that circulating monocytes and T lymphocytes may harbour minor provirus variants predictive of CXCR4 usage, as previously shown ex vivo for tissue resident macrophages or in vitro for monocytes-derived-macrophages [36, 37]. The fact that the two patients displaying clusters of sequences predictive of CXCR4 usage in proviral HIV DNA in monocytes have been on fully suppressive HAART for almost the entire time of the infection, although intriguing, is not completely unexpected, as in a recent study where X4 viruses (either pure or dual/mixed R5/X4) were recognized in 17.2% of patients studied early after seroconversion [38].

                      It is not possible to deduce information on the replicative competence of archived provirus from these molecular data. In fact, one limitation of the next generation pyrosequencing technology is that the clonal sequences may not be physically separated from the whole quasispecies. Hence, it would not be possible to conduct phenotypic analysis of clones unless the synthetic approach was used to re-construct the viral genomes to undergo biological characterization.

                      The small number of patients under study and the variable profile of viral quasispecies impair the possibility to make general conclusions from the present results. Nevertheless, the higher HIV-1 heterogeneity in monocytes and the presence of distant variants suggests that this compartment, although quantitatively inferior to the CD4 T cell reservoir [31], may represent a possible source of viral diversity and contribute to escape mechanisms in regards to both immunity and antiviral therapy. This idea may be clinically relevant in the light of the recent therapeutic approaches involving co-receptor antagonists [39] that required a deep profiling of the co-receptor usage [40].

                      Moreover, we could obtain direct evidence that both T lymphocytes and monocytes do actually contribute to virus rebounding in the circulation early after therapy interruption. However, at the same time we found that minor replicating clusters may derive from cellular sources different from the main circulating reservoirs. This is in agreement with the results of a recent study where quasispecies analysis of residual viremia present in patients on HAART was established and compared to those found in resting T CD4 cells [41].


                      The combined use of immuno-capture of circulating virions associated with the ultra-deep sequence analysis of V3-containing env region may be a powerful tool to investigate viral dynamics, useful for exploring the contribution of different viral reservoirs to replicating virus along the natural history of the infection, and for identifying co-receptor usage in minority viral populations harboured by different cell lineages.



                      We thank G. Franchini and M. Malkovsky for helpful discussions and for critically reading the paper. C. Nisii is also acknowledged for editing the manuscript. This work has been partially supported by Grants to M.R.C. from Istituto Superiore di Sanita' (National AIDS Project), and by the Italian Ministry of Health (Fondi Ricerca Corrente and Ricerca Finalizzata).

                      Authors’ Affiliations

                      Laboratory of Virology, INMI L. Spallanzani
                      Clinical Department, INMI L. Spallanzani
                      Consorzio Interuniversitario per le Applicazioni di Supercalcolo per l'Università e la Ricerca (CASPUR)
                      Scientific Direction, INMI L. Spallanzani


                      1. Fisher AG, Ensoli B, Looney D, Rose A, Gallo RC, Saag MS, Shaw GM, Hahn BH, Wong-Staal F: Biologically diverse molecular variants within a single HIV-1 isolate. Nature 1988, 334: 444–447.View ArticlePubMed
                      2. Coffin JM: HIV population dynamics in vivo: implications for genetic variation, pathogenesis, and therapy. Science 1995, 267: 483–489.View ArticlePubMed
                      3. Hwang SS, Boyle TJ, Lyerly HK, Cullen BR: Identification of the envelope V3 loop as the primary determinant of cell tropism in HIV-1. Science 1991, 253: 71–74.View ArticlePubMed
                      4. Troyer RM, Collins KR, Abraha A, Fraundorf E, Moore DM, Krizan RW, Toossi Z, Colebunders RL, Jensen MA, Mullins JI, Vanham G, Arts EJ: Changes in Human Immunodeficiency Virus Type 1 Fitness and Genetic Diversity during Disease Progression. J Virol 2005, 79: 9006–9018.View ArticlePubMed
                      5. Nowak P, Karlsson AC, Naver L, Bohlin AB, Piasek A, Sönnerborg A: The selection and evolution of viral quasispecies in HIV-1 infected children. HIV Med 2002, 3: 1–11.View ArticlePubMed
                      6. Poss M, Rodrigo AG, Gosink JJ, Learn GH, De Vange Panteleef D, Martin HL, Bwayo J, Kreiss JK, Overbaugh J: Evolution of Envelope Sequences from the Genital Tract and Peripheral Blood of Women Infected with Clade A Human Immunodeficiency Virus Type 1. J Virol 1998, 72: 8240–8251.PubMed
                      7. Haggerty S, Stevenson M: Predominance of distinct viral genotypes in brain and lymph node compartments of HIV-1-infected individuals. Viral Immunol 1991, 4: 123–131.View ArticlePubMed
                      8. Hughes ES, Bell JE, Simmonds P: Investigation of the dynamics of the spread of human immunodeficiency virus to brain and other tissues by evolutionary analysis of sequences from the 17 gag and env genes. J Virol 1997, 71: 1272–1280.PubMed
                      9. Epstein LG, Kuiken C, Blumberg BM, Hartman S, Sharer LR, Clement M, Goudsmit J: HIV-1 V3 domain variation in brain and spleen of children with AIDS: tissue-specific evolution within host-determined quasispecies. Virology 1991, 180: 583–590.View ArticlePubMed
                      10. Zhu T, Muthui D, Holte S, Nickle D, Feng F, Brodie S, Hwangbo Y, Mullins JI, Corey L: Evidence for human immunodeficiency virus type 1 replication in vivo in CD14(+) monocytes and its potential role as a source of virus in patients on highly active antiretroviral therapy. J Virol 2002, 76: 707–716.View ArticlePubMed
                      11. Fulcher JA, Hwangbo Y, Zioni R, Nickle D, Lin X, Heath L, Mullins JI, Corey L, Zhu T: Compartmentalization of Human Immunodeficiency Virus Type 1 between Blood Monocytes and CD4 + T Cells during Infection. J Virol 2004, 78: 7883–7893.View ArticlePubMed
                      12. Chun TW, Davey RT, Ostrowski M, Shawn Justement J, Engel D, Mullins JI, Fauci AS: Relationship between pre-existing viral reservoirs and the re-emergence of plasma viremia after discontinuation of highly active anti-retroviral therapy. Nat Med 2000, 6: 736–737.View Article
                      13. Imamichi H, Crandall KA, Natarajan V, Jiang MK, Dewar RL, Berg S, Gaddam A, Bosche M, Metcalf JA, Davey RT, Clifford Lane H: Human immunodeficiency virus type 1 quasi species that rebound after discontinuation of highly active antiretroviral therapy are similar to the viral quasispecies present before initiation of therapy. J Infect Dis 2001, 183: 36–50.View ArticlePubMed
                      14. Llewellyn N, Zioni R, Zhu H, Andrus T, Xu Y, Corey L, Zhu T: Continued evolution of HIV-1 circulating in blood monocytes with antiretroviral therapy: genetics analysis of HIV-1 in monocytes and CD4+ T cells of patients with discontinued therapy. J Leukoc Biol 2006, 80: 1118–26.View ArticlePubMed
                      15. Lambotte O, Chaix ML, Gubler B, Nasreddine N, Wallon C, Goujard C, Rouzioux C, Taoufik Y, Delfraissy JF: The lymphocyte HIV reservoirs in patients on long-term HAART is a memory of virus evolution. AIDS 2004, 18: 1147–58.View ArticlePubMed
                      16. Delobel P, Sandres-Sauné K, Cazabat M, L'Faqihi FE, Aquilina C, Obadia M, Pasquier C, Marchou B, Massip P, Izopet J: Persistence of distinct HIV-1 populations in blood monocytes and naive and memory CD4 T cells during prolonged suppressive HAART. AIDS 2005, 19: 1739–50.View ArticlePubMed
                      17. Capobianchi MR, Fais S, Castilletti C, Gentile M, Ameglio F, Dianzani F: A simple and reliable method to detect cell membrane proteins on infectious human immunodeficiency virus type 1 particles. J Infect Dis 1994, 169: 886–9.PubMed
                      18. Saarloos M, Sullivan BL, Czerniewski MA, Parameswar KD, Spear GT: Detection of HLA-DR associated with monocytotropic, primary, and plasma isolates of human immunodeficiency virus type 1. J Virol 1997, 71: 1640–1643.PubMed
                      19. Lawn SD, Roberts BD, Griffin GE, Folks TM, Butera ST: Cellular Compartments of Human Immunodeficiency Virus Type 1 Replication In Vivo: Determination by Presence of Virion-Associated Host Proteins and Impact of Opportunistic Infection. J Virol 2000, 74: 139–145.View ArticlePubMed
                      20. Abbate I, Dianzani F, Turriziani O, Antonelli G, D'Offizi G, Galati V, Pierdominici M, Pandolfi F, Capobianchi MR: Changes in host cell molecules acquired by circulating HIV-1 in patients treated with HAART. AIDS 2001, 15: 11–16.View ArticlePubMed
                      21. Abbate I, Cappiello G, Longo R, Ursitti A, Spanò A, Calcaterra S, Dianzani F, Antinori A, Capobianchi MR: Cell membrane proteins and quasispecies compartmentalization of CSF and plasma HIV-1 from aids patients with neurological disorders. Infect Genet Evol 2005, 5: 247–53.View ArticlePubMed
                      22. Toossi Z, Mayana-Kizza H, Lawn SD, Hirsch CS, Lupo LD, Butera ST: Dinamic Variation in the Cellular Origin of HIV Type 1 during Treatment of Tuberculosis in Dually Infected Subjects. AIDS Research and Human Retroviruses 2007, 23: 93–100.View ArticlePubMed
                      23. Wang C, Mitsuya Y, Gharizadeh B, Ronaghi M, Shafer RW: Characterization of mutation spectra with ultra-deep pyrosequencing: Application to HIV-1 drug resistance. Genome Res 2007, 17: 1195–201.View ArticlePubMed
                      24. Hoffmann C, Minkah N, Leipzig J, Wang G, Arens MQ, Tebas P, Bushman FD: DNA bar coding and pyrosequencing to identify rare HIV drug resistance mutations. Nucl Acids Res 2007, 35: e91.View ArticlePubMed
                      25. Mitsuya Y, Varghese V, Wang C, Liu TF, Holmes SP, Jayakumar P, Gharizadeh B, Ronaghi M, Klein D, Fessel WJ, Shafer RW: Minority HIV-1 Variants in Antiretroviral Naive Persons with Reverse Transcriptase Codon 215 Revertant Mutations. J Virol 2008, 82: 10747–55.View ArticlePubMed
                      26. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Brian C, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer MLI, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Jing Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Eugene W, Myers Ew, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Shally H, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM: Genome sequencing in microfabricated high-density picolitre reactors. Nature 2005, 437: 376–80.PubMed
                      27. Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 2000, 16: 276–7.View ArticlePubMed
                      28. Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 2004, 19: 5–113.
                      29. Huse SM, Huber JA, Morrison HG, Sogin ML, Welch DM: Accuracy and quality of massively parallel DNA pyrosequencing. Genome Biol 2007, 8: R143.View ArticlePubMed
                      30. Jensen MA, Li FS, van 't Wout AB, Nickle DC, Shriner D, He HX, McLaughlin S, Shankarappa R, Margolick JB, Mullins JI: Improved coreceptor usage prediction and genotypic monitoring of R5-to-X4 transition by motif analysis of HIV-1 env V3 loop sequences. J Virol 2003, 77: 13376–13388.View ArticlePubMed
                      31. Garbuglia AR, Calcaterra S, D'Offizi G, Topino S, Narciso P, Lillo F, Girardi E, Capobianchi MR: HIV-1 DNA burden dynamics in CD4 T cells and monocytes in patients undergoing a transient therapy interruption. J Med Virol 2004, 74: 373–81.View ArticlePubMed
                      32. Fais S, Capobianchi MR, Abbate I, Castilletti C, Gentile M, Cordiali Fei P, Ameglio F, Dianzani F: Unidirectional budding of HIV-1 at the site of cell-to-cell contact is associated with co-polarization of intercellular adhesion molecules and HIV-1 viral matrix protein. AIDS 1995, 9: 329–335.PubMed
                      33. Joos B, Fischer M, Kuster H, Pillai SK, Wong JK, Böni J, Hirschel B, Weber R, Trkola A, Günthard HF, Swiss HIV Cohort Study: HIV rebounds from latently infected cells, rather than from continuing low-level replication. Proc Natl Acad Sci USA 2008, 105: 16725–16730.View ArticlePubMed
                      34. Lehmann C, Daumer M, Boussaad I, Sing T, Beerenwinkel N, Lengauer T, Schmeisser N, Wyen C, Fätkenheuer G, Kaiser R: Stable coreceptor usage of HIV in patients with ongoing treatment failure on HAART. J Clin Virol 2006, 37: 300–4.View ArticlePubMed
                      35. Kupfer B, Sing T, Schuffer P, Hall R, Kurz R, McKeown A, Schneweis KE, Eberl W, Oldenburg J, Brackmann HH, Rockstroh JK, Spengler U, Daumer MP, Kaiser R, Lengauer MB: Fifteen years of env C2V3C3 evolution in six individuals infected clonally with human immunodeficiency virus type 1. J Med Virol 2007, 79: 1629–39.View ArticlePubMed
                      36. Jayakumar P, Berger I, Autschbach F, Weinstein M, Funke B, Verdin E, Goldsmith MA, Keppler OT: Tissue Resident Macrophages Are Productively Infected Ex Vivo by Primary X4 Isolates of Human Immunodeficiency Virus Type 1. J Virol 2005, 79: 5220–5226.View ArticlePubMed
                      37. Chen S, Tuttle DL, Oshier JT, Knot HJ, Streit WJ, Goodenow MM, Harrison JK: Transforming growth factor-β1 increases CXCR4 expression, stromal-derived factor-1α-stimulated signalling and human immunodeficiency virus-1 entry in human monocyte-derived macrophages. Immunology 2005, 114: 565–574.View ArticlePubMed
                      38. De Mendoza C, Rodriguez C, Garcìa F, Eiros JM, Ruìz L, Caballero E, Aguilera A, Leiva P, Colomina J, Gutierrez F, Del Romero J, Aguero J, Soriano V, on behalf of the Spanish HIV Seroconverter Study Group: Prevalence of X4 tropic viruses in patients recently infected with HIV-1 and lack of association with transmission of drug resistance. J Antimicrobiol Chemotherapy 2007, 59: 698–704.View Article
                      39. Repik A, Richards KH, Clapham PR: The promise of CCR5 antagonists as new therapies for HIV-1. Curr Opin Investig Drugs 2007, 8: 130–9.PubMed
                      40. Bushman FD, Hoffmann C, Ronen K, Malani N, Minkah N, Rose HM, Tebas P, Wang GP: Massively parallel pyrosequencing in HIV research. AIDS 2008, 22: 1411–1415.View ArticlePubMed
                      41. Bailey JR, Sedaghat AR, Kieffer T, Brennan T, Lee PK, Wind-Rotolo M, Haggerty CM, Kamireddi AR, Liu Y, Lee J, Persaud D, Gallant JE, Cofrancesco J Jr, Quinn TC, Wilke CO, Ray SC, Siliciano JD, Nettles RE, Siliciano RF: Residual Human Immunodeficiency Virus Type 1 viremia in some patients on antiretroviral therapy is dominated by a small number of invariant clones rarely found in circulating CD4 + T cells. J Virol 2006, 80: 6441–6457.View ArticlePubMed


                      © Rozera et al. 2009

                      This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://​creativecommons.​org/​licenses/​by/​2.​0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.