Discovery and full genome characterization of two highly divergent simian immunodeficiency viruses infecting black-and-white colobus monkeys (Colobus guereza) in Kibale National Park, Uganda
Retrovirology volume 10, Article number: 107 (2013)
African non-human primates (NHPs) are natural hosts for simian immunodeficiency viruses (SIV), the zoonotic transmission of which led to the emergence of HIV-1 and HIV-2. However, our understanding of SIV diversity and evolution is limited by incomplete taxonomic and geographic sampling of NHPs, particularly in East Africa. In this study, we screened blood specimens from nine black-and-white colobus monkeys (Colobus guereza occidentalis) from Kibale National Park, Uganda, for novel SIVs using a combination of serology and “unbiased” deep-sequencing, a method that does not rely on genetic similarity to previously characterized viruses.
We identified two novel and divergent SIVs, tentatively named SIVkcol-1 and SIVkcol-2, and assembled genomes covering the entire coding region for each virus. SIVkcol-1 and SIVkcol-2 were detected in three and four animals, respectively, but with no animals co-infected. Phylogenetic analyses showed that SIVkcol-1 and SIVkcol-2 form a lineage with SIVcol, previously discovered in black-and-white colobus from Cameroon. Although SIVkcol-1 and SIVkcol-2 were isolated from the same host population in Uganda, SIVkcol-1 is more closely related to SIVcol than to SIVkcol-2. Analysis of functional motifs in the extracellular envelope glycoprotein (gp120) revealed that SIVkcol-2 is unique among primate lentiviruses in containing only 16 conserved cysteine residues instead of the usual 18 or more.
Our results demonstrate that the genetic diversity of SIVs infecting black-and-white colobus across equatorial Africa is greater than previously appreciated and that divergent SIVs can co-circulate in the same colobine population. We also show that the use of “unbiased” deep sequencing for the detection of SIV has great advantages over traditional serological approaches, especially for studies of unknown or poorly characterized viruses. Finally, the detection of the first SIV containing only 16 conserved cysteines in the extracellular envelope protein gp120 further expands the range of functional motifs observed among SIVs and highlights the complex evolutionary history of simian retroviruses.
Simian immunodeficiency viruses (SIV) are primate lentiviruses that naturally infect at least 45 different African non-human primate (NHP) species [1, 2]. It is now well established that zoonotic transmission of SIVs from chimpanzees (Pan troglodytes troglodytes) and gorillas (Gorilla gorilla gorilla) as well as from sooty mangabeys (Cercocebus atys) led to the emergence of human immunodeficiency virus type 1 (HIV-1) and type 2 (HIV-2), respectively [3–5]. Despite high divergence among SIVs, each primate species is typically infected with one or more species-specific viruses. However, there are also numerous examples of cross-species transmission and recombination [6–10]. Interestingly, different animals from a single primate species can be infected by more than one SIV. For example, mandrills (Mandrillus sphinx) in Gabon and southern Cameroon are infected by two different SIVs, SIVmnd-1 or SIVmnd-2, possibly due to their geographic separation by the Ogoué River [8, 11]. Even in the absence of physical barriers, two distinct SIVs can co-circulate in a single species living in a small geographical area, as observed for mustached monkeys (Cercopithecus cephus) in Cameroon that are infected by SIVmus-1 or SIVmus-2 . Recently, it has been reported that mustached monkeys in Gabon are also infected with a highly divergent SIVmus, demonstrating that the same monkey subspecies can harbor at least three distinct SIVs .
Phylogenetic analysis shows that all SIVs cluster in a single clade within the mammalian lentiviruses, indicating descent from a single common ancestor . Only African Old World monkeys and apes from sub-Saharan Africa, but not Asian Old World monkeys or New World monkeys, are naturally infected with SIV, suggesting that SIV originated in sub-Saharan Africa after the landmass separation and migration that gave rise to New World and Asian primate lineages [14, 15]. Old World monkeys (Cercopithecidae) are separated into two distinct subfamilies, Colobinae and Cercopithecinae, which diverged approximately 18 million years ago (MYA) . The Colobinae are further divided into African (Colobini) and Asian (Presbytini) groups and African colobines consist of two genera: Colobus and Procolobus. SIVcol, isolated from a black-and-white colobus (Colobus guereza) in Cameroon, represents the only SIV infecting the Colobus genus for which full-length sequences are available , with partial sequences being available from black colobus (Colobus satanas satanas) from Bioko (49). The Procolobus genus includes red colobus (subgenus Piliocolobus) and olive colobus (subgenus Procolobus) and full-length SIV sequences are available from both groups from Tai forest in the Ivory Coast (Procolobus badius badius and Procolobus verus, respectively) . In addition, SIV sequences are available from Tephrosceles red colobus (Procolobus rufomitratus tephrosceles) in Kibale National Park, Uganda , Temminck’s red colobus (Procolobus badius temminckii) from The Gambia  and Tshuapa red colobus (Procolobus tholloni) from the Democratic Republic of Congo , demonstrating the large geographic distribution and diversity of SIV in colobines. Interestingly, SIVcol is highly divergent from other known SIVs, possibly reflecting ancient divergence of host lineages.
Colobine NHPs are distributed throughout equatorial Africa; however, no full-length SIV sequences have been isolated from East African monkeys. Although the density and variety of East African NHPs are high, particularly at Kibale National Park (KNP) in Uganda, this region has been undersampled. To gain insight into the diversity of SIVs infecting Colobus monkeys from East Africa, we screened nine black-and-white colobus (Colobus guereza occidentalis) from KNP (Figure 1) for the presence of novel SIVs. KNP is the same park in which we previously recovered partial polymerase (pol) sequences from a divergent SIV infecting red colobus monkeys (Procolobus rufomitratus tephrosceles). Here, we report the discovery and full genome characterization of two novel SIVs identified in Kibale black-and-white colobus (C. guereza), tentatively named SIVkcol-1 and SIVkcol-2. These two viruses were discovered in three and four animals, respectively, using “unbiased” deep-sequencing, a method that does not rely on homology to previously characterized genomes and is thus more sensitive for detecting divergent sequences. Furthermore, we establish phylogenetic relationships to other known SIVs and functionally characterize both viruses.
Discovery of novel SIVs from KNP, Uganda
Blood plasma from nine black-and-white colobus monkeys (BWC) from Kibale National Park, Uganda (Figure 1), was screened for the presence of SIVs using a combination of deep-sequencing and serology. The sequencing approach applied in this study is “unbiased” in that it uses random hexamers for priming and therefore does not rely on homology to known sequences. This approach is therefore less biased than specific PCR and may be more likely to discover novel and divergent viruses, as previously demonstrated by the discovery of other novel RNA viruses in monkeys from the same national park [24–26]. On average, around 456,000 reads per sample were generated. A query of reads against the GenBank database using the basic local alignment search tool blastn, locally implemented on the University of Wisconsin-Madison’s Condor High Throughput Computing cluster, revealed the presence of SIV reads in seven of nine animals, ranging from 0.2% (2,220 reads) to 7.9% (79,375 reads) of total reads (Table 1). For two of those animals (BWC01 and BWC07), enough reads were present to de novo assemble SIV genomes covering the entire coding region. A query against the NCBI GenBank database revealed that the two viruses shared between 73% and 77% nucleotide identity with SIVcol, an SIV that was previously discovered in black-and-white colobus monkeys from Cameroon. Furthermore, a pairwise comparison between the two new viruses revealed that the genomes were distinct from each other, sharing only 72% nucleotide identity (Table 2). This is comparable with the two distinct SIV variants infecting mustached monkeys in Cameroon, SIVmus-1 and SIVmus-2, which share 73% nucleotide identity. For consistency with established nomenclature, the viruses were tentatively named SIVkcol-1 and SIVkcol-2, reflecting their origin from KNP as well as their relation to SIVcol [GenBank sequence accession numbers KF214240 and KF214241].
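To make the pairwise nucleotide identity figures above concrete, the following is a minimal Python sketch of one common way to compute percent identity between two aligned sequences. It is not the procedure used in this study: the function name, the choice to skip gapped columns, and the toy sequences are all assumptions made for illustration.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: gapped columns are excluded from the denominator
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical aligned fragments; these are not real SIV sequences.
toy_a = "ATGGCA-TTAGC"
toy_b = "ATGACAGTTGGC"
identity = percent_identity(toy_a, toy_b)
print(f"{identity:.1f}% identity")
```

Other conventions (for example, counting gaps as mismatches) give different values, so the convention should be stated whenever identities are compared across studies.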
To determine the frequency of SIV infection for each variant, we mapped reads to the previously assembled SIVkcol-1 and SIVkcol-2 genomes. Among the nine black-and-white colobus, three were infected with SIVkcol-1 and four with SIVkcol-2. Interestingly, no co-infections were observed in any BWC in our study and none of the variants was restricted to any one social group. The sequence of the de novo-assembled SIVkcol-1 and SIVkcol-2 genomes was confirmed by generating four overlapping amplicons covering the entire ORF followed by deep-sequencing on the Illumina MiSeq. Both SIVs contain genomic structures similar to those of complex retroviruses, including all three structural (gag, pol and env) as well as various accessory genes (vif, vpr, tat, rev and nef), thus resembling the genome organization of SIVcol. No additional accessory genes previously reported for other SIVs, like vpx or vpu, were present in either of the two novel BWC SIVs .
Antibody reactivity against SIV proteins was observed in two of nine BWC (Table 1). In the HIV-2 specific WB, BWC01 and BWC03 showed antibody responses whereas in the HIV-1/-2 InnoLIA assay, only the plasma of BWC03 was weakly seroreactive. While antibodies in the HIV-2 WB were specific for the p26 matrix protein, antibodies in the HIV-1/-2 InnoLIA assay reacted against the HIV-2 Env protein gp36 (Table 1). Both animals that showed seroreactivity were SIV-RNA positive, but we were also able to recover SIV-specific reads from five additional animals that did not show any antibody reactivity in these two assays. Taken together, the random hexamer-based detection of SIV-RNA in blood was more sensitive than the detection of cross-reactive antibodies to HIV-1 or HIV-2 proteins using the HIV-2 WB and HIV-1/-2 InnoLIA assays and suggests that prevalence estimates solely based on those methods could underestimate overall SIV prevalence.
Functional motifs in Gag and Env proteins
All primate lentiviruses characterized to date contain at least 18 conserved cysteine residues in the extracellular subunit of gp120; however, SIV strains can also contain up to four additional cysteine pairs, generally located in the variable domains of gp120 (Figure 2). SIVcol, isolated from a black-and-white colobus from Cameroon, contains 18 conserved cysteine residues with no additional cysteine pairs (Figure 2). While the same cysteine architecture was also conserved for SIVkcol-1, SIVkcol-2 contains only 16 conserved cysteine residues in the extracellular gp120. Specifically, conserved cysteine residues 15 and 18 in the C-terminal half of Env, known to form a disulfide bond in HIV-1, were missing. The same unusual cysteine architecture was conserved among all four SIVkcol-2 positive animals (BWC02, BWC03, BWC04, BWC08).
PT/SAP and YPXL are two binding site motifs within the SIV Gag p6 protein that have been identified to be crucial for lentiviral budding [29–31]. The presence of one motif can compensate for the absence of the other, but both motifs can also be present at the same time [32, 33]. Both SIVkcol-1 and SIVkcol-2 are missing the PT/SAP motif and only contain a singular YPXL motif, thus resembling SIVcol.
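The motif screen described above can be illustrated with a short Python sketch. This is an assumption-laden example rather than the method used in the study: it reads PT/SAP as "P, then T or S, then A, then P" and YPXL as "Y, P, any residue, L", and the p6-like peptide is invented for demonstration.

```python
import re

# Zero-width lookaheads so that overlapping occurrences are also reported.
PTSAP = re.compile(r"(?=P[TS]AP)")   # PTAP or PSAP
YPXL = re.compile(r"(?=YP[A-Z]L)")   # X = any single residue


def budding_motifs(protein: str) -> dict:
    """Return 0-based start positions of each late-domain motif."""
    protein = protein.upper()
    return {
        "PT/SAP": [m.start() for m in PTSAP.finditer(protein)],
        "YPXL": [m.start() for m in YPXL.finditer(protein)],
    }


# Hypothetical peptide, not a real Gag p6 sequence.
toy_p6 = "LQSRPEAAYPDLRSLFG"
result = budding_motifs(toy_p6)
print(result)
```

A virus resembling SIVcol in this respect would yield an empty list for PT/SAP and at least one position for YPXL.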
Sequence similarity and phylogenetic analyses
In order to compare genomes of the two newly discovered SIVs from KNP to other previously characterized SIVs infecting the Colobini, we performed a similarity analysis of concatenated Gag, Pol, Vif, Env and Nef protein sequences (Figure 3). Across the genome, SIVs infecting BWC monkeys (SIVkcol-1, SIVkcol-2 and SIVcol) are more similar to each other than to those infecting red and olive colobus (SIVwrc and SIVolc, respectively), possibly reflecting divergence between the host genera Colobus and Procolobus (Table 2). Interestingly, although SIVkcol-1 and SIVkcol-2 were both isolated from BWC monkeys from Kibale, SIVkcol-1 is consistently more similar to SIVcol from Cameroon than to SIVkcol-2, particularly in Env. Confirming results from previous studies, a 200 aa region in the N-terminal half of Pol (approximately positions 700 to 900 in the simplot alignment) shares the highest similarity between the Colobus and Procolobus SIVs [18, 19]. Using the NCBI conserved domain and protein classification database, we identified this region as the reverse transcriptase (RT) domain (cd01645) .
Pairwise comparisons of nucleotide identity across the entire coding region further illustrate the divergence between SIVs isolated from Colobus and Procolobus as well as the closer relationship of SIVkcol-1 and SIVcol (Table 2). Although we were unable to assemble full SIV genomes for every infected animal, we obtained consensus sequences covering the entire Gag protein from all seven SIV-infected BWC (three infected with SIVkcol-1, four infected with SIVkcol-2), allowing us to further assess inter-host genetic diversity between different variants of the same virus [GenBank sequence accession numbers KF214242-KF214246]. Overall, SIVkcol-1 was slightly more diverse than SIVkcol-2, sharing 88.7 ± 5.3% nucleotide identity among strains (Table 3), whereas isolates of SIVkcol-2 were 93.9 ± 5.9% identical (Table 4). We also identified two SIVkcol-2 strains (from BWC03 and BWC07) exhibiting ≤2% divergence across Gag, potentially indicating epidemiologically linked infections. The fact that both animals belonged to the same social group further supports the idea that close contact between those two animals resulted in direct transmission of the virus.
To estimate phylogenetic relationships of the two novel SIVs to other known SIVs, we constructed separate evolutionary trees for the gag, pol, env and nef genes. In all four phylogenies, SIVkcol-1 and SIVkcol-2 formed a highly supported distinct lineage with SIVcol, with SIVkcol-2 in a distinct branch ancestral to SIVcol and SIVkcol-1, similar to the genetic relationships described above in the similarity plot analysis (Figure 4). In the env and nef trees, the ancestor of the BWC SIVs diverged at the root of the SIV tree. In the gag tree, the colobus SIVs clustered weakly with procolobine SIVs from western red colobus, while the other procolobine SIV, SIVolc, clustered with all other SIVs, though with low posterior support. In the pol tree, the colobus SIVs shared a common ancestor with the procolobine SIVs (SIVolc and SIVwrc) and the SIVsun/l’hoest and SIVmnd lineages.
To estimate the age of the novel SIVs in relation to other SIVs, we determined TMRCA using Bayesian inference. The relaxed molecular clock used in this analysis was based on the adjusted SIV substitution rate that was previously determined for divergence of the Bioko monkey SIVs and used a 308-bp alignment of conserved pol sequences. The root of the tree is estimated to be 40,323 years before present (ybp) (95% highest posterior density (HPD) = 24,406 - 61,988 ybp) and is thus comparable to that inferred for the Bioko monkey SIV phylogenies (49,129 ybp; 95% HPD = 29,078 - 71,268 ybp) (Figure 5, Table 5). The split between SIVkcol-1/col and SIVkcol-2 occurred at least 10,657 ybp (95% HPD = 5,215 - 18,146 ybp). Despite the use of a strong geological calibration point for our molecular dating estimates, we acknowledge that considerable debate exists about the accuracy of SIV TMRCA estimates and suggest that dates should be regarded as minimum estimates.
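For readers unfamiliar with the 95% HPD intervals quoted above, the sketch below shows how such an interval can be derived from a set of posterior samples: it is the narrowest window that contains the requested share of the samples. This is a generic illustration under the assumption of a unimodal posterior, not the Tracer implementation, and the simulated node ages are hypothetical rather than output from this study.

```python
import math
import random


def hpd_interval(samples, mass=0.95):
    """Return the narrowest interval containing `mass` of the samples."""
    if not 0 < mass <= 1:
        raise ValueError("mass must be in (0, 1]")
    ordered = sorted(samples)
    n = len(ordered)
    if n == 0:
        raise ValueError("need at least one sample")
    k = min(n, max(1, math.ceil(mass * n)))  # number of samples inside the window
    # Slide a window of k consecutive ordered samples and keep the narrowest one.
    start = min(range(n - k + 1), key=lambda i: ordered[i + k - 1] - ordered[i])
    return ordered[start], ordered[start + k - 1]


# Hypothetical right-skewed posterior of node ages (years before present).
rng = random.Random(1)
toy_ages = [rng.lognormvariate(10.6, 0.25) for _ in range(5000)]
low, high = hpd_interval(toy_ages)
print(f"95% HPD: {low:,.0f} - {high:,.0f} ybp")
```

For a skewed posterior, this interval is shifted toward the mode relative to an equal-tailed interval, which is why HPD bounds are not symmetric around the point estimate.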
Although colobine primates are distributed throughout equatorial Africa, no full-length SIVs from this subfamily have been obtained from East Africa, potentially influencing our current understanding of the diversity and evolutionary history of SIV. We therefore screened nine BWC monkeys (Colobus guereza occidentalis) from KNP in Uganda, a park known for its exceptionally high density and diversity of NHPs , for the presence of SIVs using a combination of deep-sequencing and serological testing. Here, we report the discovery and characterization of two novel SIVs, tentatively named SIVkcol-1 and SIVkcol-2, in three and four animals, respectively. The new viruses are divergent from each other as well as from the previously discovered SIVcol from Cameroon and are both circulating within the same host population in KNP.
Traditionally, methods to recover complete or partial SIV genomes have relied on PCR using consensus primers. The design of those primers is based on regions conserved among different SIV lineages, and amplified products are used to further characterize the virus and to confirm serological results. Here, we report for the first time a random hexamer-based deep-sequencing approach to identify novel SIVs. This approach does not rely on homology to known sequences and is thus more sensitive for detecting divergent sequences, as previously demonstrated by the discovery of other novel RNA viruses in monkeys from the same national park [24–26]. Within the BWC population at KNP, no animals were found to be co-infected with SIVkcol-1 and SIVkcol-2. While we cannot exclude that this is due to the small sample size, infection with one virus could also protect against infection with the other, potentially through establishment and cross-reactivity of adaptive immune responses. Additional sampling as well as in vitro experiments will be necessary to clarify the interactions between these viruses.
There was a strong discordance between infection data obtained by random hexamer-based deep-sequencing and serological testing, with 7/9 and 2/9 of animals being vRNA- and antibody-positive, respectively. One possible explanation could be the high divergence of SIVkcol-1 and SIVkcol-2 to HIV-1 and HIV-2, thus limiting cross-reactivity with HIV antigens used in the HIV-2 WB and the HIV-1/-2 InnoLIA assays. Similar serological results were observed for SIVcol-infected BWC from Cameroon supporting this hypothesis . Recently, SIV lineage-specific ELISAs and flow-cytometry based assays have been successfully employed to detect specific SIV antibody responses [22, 38]. Although these assays might have higher specificity, they must be regularly updated when new lineages are discovered and not every lineage-specific peptide can be successfully synthesized . The serological assays used in this study have been successfully applied in the past to detect infection with divergent SIVs [38–41], however we also acknowledge that SIVcol lineage-specific assays might have resulted in a higher sensitivity compared to the serological assays employed in our study.
Another explanation for the low frequency of serological detection could be that the majority of BWC were acutely infected with SIV and thus had not mounted an antibody response at the time of sampling, although this might be unlikely given that samples were collected over a range of more than seven months. Overall, we believe that the use of blood as a sample source in combination with random hexamer-based deep-sequencing allows for a reliable assessment of SIV infection in wild NHP. Furthermore, studies relying only on serological data to either determine SIV prevalence or to discover new viruses may potentially be underestimating the occurrence and diversity of those viruses, particularly for NHPs infected with highly divergent SIVs.
BWC are the third NHP species found to be infected with two distinct SIV variants. Mandrills were the first species reportedly infected with two different SIVs. However, mandrill populations harboring SIVmnd-1 and SIVmnd-2 were geographically separated by the Ogoué River in Gabon, possibly explaining the presence of two divergent variants within this population [8, 11]. A second species, mustached monkeys (C. cephus) from Cameroon, are also infected by at least two distinct viral variants. Interestingly, SIVmus-1 and SIVmus-2 were detected in samples collected within a radius of 5 km which is comparable to the area in which we collected samples from BWC in Kibale, thus providing a second example for SIVs co-circulating within a geographically confined NHP population.
The newly discovered SIVkcol-1 and SIVkcol-2 are most closely related to SIVcol and form a BWC-specific SIV lineage. For the 3′ genomic region (env and nef), the BWC SIVs originate near the root of the SIV tree, suggesting that SIV was introduced to the Colobus genus after the divergence from the Procolobus. In contrast, in the gag region the BWC SIVs cluster weakly with the procolobine SIV from western red colobus monkeys, but both lineages are divergent from the procolobine SIV in olive colobus monkeys. Likewise, in the pol gene the BWC SIVs share a common ancestry with Procolobus-infecting SIVs and the SIVsun/lst and SIVmnd-1 lineages, although this observation should be viewed with caution due to the high degree of divergence characterized by the long branch length. Furthermore, the high similarity observed by similarity plot analysis in the N-terminal half of Pol between Colobus and Procolobus genera might not necessarily be reflective of a common ancestry but rather represent high conservation of the essential RT domain, thereby obscuring ancestral relationships. Sequences from additional colobus and procolobus monkeys may be needed to further clarify the phylogenetic relationships of the 3′ genomes of colobine SIVs and their ancestral origins.
Surprisingly, although both SIVkcol-1 and SIVkcol-2 were isolated from the same group of BWC, SIVkcol-1 is more closely related to SIVcol from Cameroon. Since all three strains were isolated from the same subspecies (Figure 1), recent gene flow between the different strains as well as ancestral polymorphisms could explain their unusual relationship. These explanations are particularly feasible given the large population size and relatively continuous range of this subspecies across Central Africa. Based on our estimates, SIVcol and SIVkcol-1 diverged from SIVkcol-2 around 10,600 ybp, potentially explaining the high amount of divergence observed between these viruses. More sampling, covering populations across the range of this subspecies (C. guereza occidentalis) as well as other C. guereza subspecies across equatorial Africa, should resolve uncertainties about the evolutionary history of SIVs infecting this species.
Two different binding site motifs within the SIV Gag p6 protein have been identified to be crucial for lentiviral budding: PT/SAP and YPXL [29–31]. While the presence of one motif can compensate for the absence of the other, both motifs can also be present at the same time, although the significance of maintaining two binding site motifs is unknown [32, 33]. The majority of SIVs contain either both motifs or a singular PT/SAP motif in the Gag p6 protein . Only three viruses, SIVdeb from De Brazza’s monkeys (Cercopithecus neglectus), SIVden from Dent’s monkeys (Cercopithecus denti) and SIVcol from black-and-white colobus, have a singular YPXL motif. Both SIVkcol-1 and SIVkcol-2 are missing the PT/SAP motif and only have the YPXL motif, thereby confirming the singular presence of the YPXL motif among Colobus-infecting SIVs while also expanding the number of viruses exclusively using this budding motif.
All primate lentiviruses characterized to date contain at least 18 conserved cysteine residues in the extracellular subunit of the envelope protein, and this is also referred to as the “18 Cys state”. Covalent disulfide bridges formed by cysteine pairs determine the tertiary structure of gp120 and are therefore essential for envelope function, including binding to the host cell receptor CD4. We believe that SIVkcol-2 is the first primate lentivirus that contains only 16 cysteines (Figure 2). Bibollet-Ruche et al. have speculated that the conservation of 18 cysteine residues across all SIVs represents an ancestral state of primate lentiviruses and that different SIV lineages have eventually added cysteine pairs over time. The original “18 Cys state” has been independently conserved in Cercopithecus SIVs, SIVcpz (which is a recombinant containing a Cercopithecus envelope), as well as in SIVcol and SIVkcol-1. Currently, we are uncertain what led to SIVkcol-2 having only 16 cysteine residues, and whether this is a result of “losing” a cysteine pair or whether the “16 Cys state” existed before the “18 Cys state” and thus represents an ancestral state. Further sampling of BWC as well as other NHP will be required to confirm the presence of the “16 Cys state” in other NHP species. Additional studies will also be required to determine if the missing disulfide bond affects the antigenic structure of gp120.
Our results demonstrate that SIV diversity in black-and-white colobus is greater than previously appreciated, and that divergent SIVs can co-circulate in the same colobine population. The success of our “unbiased” molecular detection methods and our finding of two novel viruses in East African NHP indicate that using similar methods in similarly under-sampled settings is likely to be a fruitful avenue for future research. Also, our results show that deep sequencing has some advantages over traditional serological approaches, especially for the detection of unknown or poorly characterized viruses. Additional sampling of BWC across Africa is needed to confirm both the ubiquity of the unusual cysteine architecture observed in the envelope of SIVkcol-2 and the full extent of phylogenetic diversity among SIVs infecting the colobine primates.
All animal procedures followed the guidelines of the Weatherall Report on the use of NHPs in research and received approval from the Uganda Wildlife Authority, the Uganda National Council for Science and Technology, and the University of Wisconsin Animal Care and Use Committee prior to initiation of the research, and materials were shipped in accordance with international regulations (CITES permit 002290). The study was conducted in KNP, western Uganda, a semi-evergreen, montane forest (795 km²) at the foothills of the Rwenzori Mountains, notable for its diversity and density of NHPs. As part of a larger study of wild primate health and infection, from January 27th to July 8th, 2010, nine black-and-white colobus monkeys (all adult or subadult) were anesthetized with a combination of ketamine (5.11 ± 1.79 mg/kg) and xylazine (1.05 ± 0.12 mg/kg) or medetomidine (0.10 ± 0.08 mg/kg) administered intramuscularly using a variable-pressure air rifle (Pneudart, Inc, Williamsport, PA, USA). Blood was drawn from the femoral vein into an evacuated plasma preparation tube (Becton, Dickinson and Company, Inc, Franklin Lakes, NJ, USA) and kept cool until processing. Animals were then given the reversal agent atipamezole (0.32 ± 0.19 mg/kg) and released after recovery back to their social group without incident. Seven of the nine black-and-white colobus belonged to five different social groups with overlapping home ranges. The social groups of the remaining two monkeys were unknown. All samples were collected within an area of approximately 15 km². Blood was separated into components using centrifugation in a field laboratory and frozen immediately in liquid nitrogen for storage and transport.
RNA extraction and deep-sequencing
From each animal, one ml of plasma was centrifuged at 5,000 × g (4°C, 5 min) with subsequent filtration of the supernatant through a 0.45-μm filter (Millipore, Billerica, MA, USA) to remove residual host cells. Viral RNA was then isolated using the Qiagen QIAamp MinElute virus spin kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions, but omitting carrier RNA. The eluted RNA was treated with DNase I (DNA-free, Ambion, Austin, TX, USA) and double stranded DNA was generated using the Superscript double-stranded cDNA Synthesis kit (Invitrogen, Carlsbad, CA, USA), primed with random hexamers. The DNA was purified using the Agencourt Ampure XP system (Beckman Coulter, Brea, CA, USA) and approximately one ng of DNA was subjected to simultaneous fragmentation and adaptor ligation (“tagmentation”) with the Nextera DNA Sample Prep Kit (Illumina, San Diego, CA, USA). One sample (BWC01) was also subjected to tagmentation with the Roche/454 Titanium-compatible Nextera kit (Epicentre Biotechnologies, Madison, WI, USA). DNA was subsequently cleaned using the Agencourt Ampure XP system, PCR amplified (15 cycles) to add Illumina- and Roche/454-compatible adaptors onto each fragment, and cleaned again with the Agencourt Ampure XP system. DNA fragments were then sequenced using the Illumina MiSeq (Illumina, San Diego, CA, USA) or the Roche/454 GS Junior instruments (Roche 454 Life Sciences, Branford, CT, USA).
Sequence data were analyzed using CLC Genomics Workbench 5.5 (CLC bio, Aarhus, Denmark). Low quality (<q30) and short reads (<100 bp) were removed and the remaining reads were subjected to de novo assembly using the following parameters: automatic word and bubble size; mismatch cost = 2; insertion cost = 3; deletion cost = 3; length fraction = 0.5; similarity fraction = 0.8. Assembled contiguous sequences (contigs) and singleton reads were queried against GenBank databases nt and nr using the basic local alignment search tools blastn and blastx, respectively.
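The read-filtering step above can be sketched in a few lines of Python. This is only an approximation of the idea, not the CLC Genomics Workbench algorithm: it assumes Phred+33 quality encoding and interprets the Q30 cutoff as a threshold on mean read quality, which may differ from how the software trims or discards reads. The function names and example reads are hypothetical.

```python
def mean_phred(quality: str, offset: int = 33) -> float:
    """Mean Phred score of a FASTQ quality string (Phred+33 assumed)."""
    if not quality:
        raise ValueError("empty quality string")
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def keep_read(seq: str, quality: str, min_q: float = 30, min_len: int = 100) -> bool:
    """Keep a read only if it is long enough and its mean quality is high enough."""
    if len(seq) != len(quality):
        raise ValueError("sequence and quality strings differ in length")
    return len(seq) >= min_len and mean_phred(quality) >= min_q


# Hypothetical reads: (sequence, quality). 'I' encodes Q40 and '5' encodes Q20.
reads = [
    ("A" * 120, "I" * 120),  # long, high quality
    ("A" * 50, "I" * 50),    # too short
    ("A" * 120, "5" * 120),  # low quality
]
kept = [keep_read(seq, qual) for seq, qual in reads]
print(kept)
```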
In order to deep-sequence SIV genomes in infected individuals, we designed four overlapping PCR amplicons of approximately 2.5 kb each covering the entire SIV coding genome. Primers for each amplicon were based on the sequences obtained through de novo assembly of singleton reads. Viral RNA was prepared from 1 ml of plasma as described above, except that carrier RNA was added during the extraction. Viral RNA was then reverse transcribed and amplified using the SuperScript III High Fidelity One-Step RT-PCR kit (Invitrogen, Life Technologies, Carlsbad, CA). The reverse transcription-PCR conditions were as follows: 50°C for 30 min; 94°C for 2 min; 40 cycles of 94°C for 15 sec, 55°C for 30 sec, and 68°C for 3 min; and 68°C for 5 min. Following PCR, amplicons were purified from excised gel slices (1% agarose) using a Qiagen MinElute Gel Extraction kit (QIAGEN, Valencia, CA). Each amplicon was quantified using Quant-IT HS reagents (Invitrogen, Life Technologies, Carlsbad, CA), and all amplicons from a single viral genome were pooled together at equimolar ratios. Each pool was then quantitated and approximately 50 ng of each was used in a tagmentation reaction with Nextera DNA Sample Prep Kit (Illumina, San Diego, CA, USA). Final libraries representing each genome were characterized for average size using a DNA high sensitivity chip on a 2100 bioanalyzer (Agilent Technologies, Loveland, CO) and quantitated with Quant-IT HS reagents. Libraries were sequenced on the Illumina MiSeq as described above. Primer sequences employed are available upon request.
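The equimolar pooling step can be worked through numerically. The sketch below converts a double-stranded DNA concentration in ng/µL to nM using the standard approximation of 660 g/mol per base pair, then computes the volume of each amplicon that delivers the same molar amount. The amplicon names, concentrations, and the 100 fmol target are hypothetical values chosen for illustration; they are not taken from the study.

```python
AVG_BP_MASS = 660.0  # g/mol per base pair of dsDNA (standard approximation)


def ng_per_ul_to_nM(conc_ng_per_ul: float, length_bp: int) -> float:
    """Convert a dsDNA concentration from ng/µL to nM (1 nM = 1 fmol/µL)."""
    return conc_ng_per_ul * 1e6 / (AVG_BP_MASS * length_bp)


def pooling_volumes(amplicons: dict, target_fmol: float) -> dict:
    """Map amplicon name -> volume (µL) that contributes `target_fmol` to the pool."""
    return {
        name: target_fmol / ng_per_ul_to_nM(conc, length)
        for name, (conc, length) in amplicons.items()
    }


# Hypothetical measurements: name -> (ng/µL, length in bp).
toy_amplicons = {
    "amplicon_1": (33.0, 2500),
    "amplicon_2": (16.5, 2500),
}
volumes = pooling_volumes(toy_amplicons, target_fmol=100.0)
for name, vol in volumes.items():
    print(f"{name}: {vol:.1f} µL")
```

Because all four amplicons in the study are of similar length (approximately 2.5 kb), equimolar pooling is nearly equivalent to pooling equal masses, but the molar calculation remains correct if amplicon lengths differ.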
Plasma samples were screened for HIV/SIV antibodies by using the Innogenetics INNO-LIA HIV-1/-2 Score assay (Innogenetics NV, Gent, Belgium) capable of detecting SIVcpz; HIV-1 groups M, N, and O; and other divergent SIVs [38–41]. The following recombinant proteins and peptides are used as antibody targets in the INNO-LIA assay: sgp120 (HIV-1); gp41 (HIV-1); p31, p24, p17 (HIV-1 proteins capable of cross-reacting with HIV-2 antibodies); sgp105 (HIV-2); gp36 (HIV-2). Samples were further tested for the presence of antibodies by using an HIV-2-based Western blot (WB) test (MP Biomedicals, Singapore), targeting the following proteins and peptides: gp125, gp80, p68, p56, p53, gp36 and p26. Both serologic assays have been previously shown to have good sensitivity in identifying divergent SIVs [38, 39, 41].
Viral sequence and phylogenetic analyses
Nucleotide sequences of gag, pol, envelope (env) and nef were codon aligned individually for all known SIVs with complete genomes using ClustalW in the alignment editor program in MEGA v5.10 and edited manually. The best-fitting model of nucleotide substitution for each alignment was inferred using the maximum likelihood (ML) method, with goodness of fit measured by the Bayesian information criterion in MEGA v5.10. The best-fitting nucleotide substitution model for the phylogenetic alignments was inferred to be the GTR model with discrete gamma-distributed among-site rate variation and a proportion of invariant sites. Phylogenetic relationships were inferred using Bayesian analysis with the BEAST v1.6.2 program. Statistical support for the inferred Bayesian trees was assessed by posterior probabilities. For the Bayesian analyses, an uncorrelated lognormal relaxed molecular clock model was used, and each analysis consisted of two independent runs of 50 × 10⁶ Markov chain Monte Carlo (MCMC) generations, with sampling every 5,000 generations and a Yule tree prior. Convergence of the MCMC was assessed by calculating the effective sample size (ESS) of the runs using the program Tracer v1.5 (http://beast.bio.ed.ac.uk/Tracer). All parameter estimates had ESSs > 1,200. The tree with the maximum product of the posterior clade probabilities was chosen from the posterior distribution of 10,001 sampled trees, after discarding the first 1,000 sampled trees as burn-in, with the program TreeAnnotator version 1.6.2. The amino-acid similarity of the novel SIVs with related SIV lineages was determined across Gag, Pol, Env and Nef using SimPlot v3.5.1 (ref) following TranslatorX alignment (MAFFT) without Gblocks cleaning.
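The sampling figures quoted above can be checked with a few lines of arithmetic. This is a sketch under one assumption: that the initial state (generation 0) is written to the tree log along with every 5,000th generation, which is what makes a chain of 50 × 10⁶ generations yield 10,001 rather than 10,000 trees. The function names are illustrative only.

```python
def logged_states(chain_length, log_every):
    """Number of states written to the log, counting the initial state 0."""
    return chain_length // log_every + 1


def retained_after_burn_in(n_logged, burn_in):
    """Number of logged states left once the first burn_in states are discarded."""
    return n_logged - burn_in


n_trees = logged_states(50_000_000, 5_000)       # trees in the posterior sample
n_kept = retained_after_burn_in(n_trees, 1_000)  # trees summarized after burn-in
burn_in_fraction = 1_000 / n_trees               # share of the sample discarded

print(n_trees, n_kept, round(burn_in_fraction, 3))
```

Under that assumption the burn-in of 1,000 trees corresponds to roughly the first 10% of each chain.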
Time to the most recent common ancestor (TMRCA) for the new SIV sequences was inferred with the BEAST v1.6.2 program using a 308-bp alignment of all cdp of 89 SIV taxa, a relaxed molecular clock with an uncorrelated lognormal rate distribution, a Yule tree prior, and the HKY nucleotide substitution model with gamma-distributed rates and an estimated proportion of invariable sites. TMRCAs were inferred by calibrating the molecular clock using the estimated 10,000-year-old separation of the drill (Mandrillus leucophaeus) SIVs on mainland Africa from those on Bioko Island, Equatorial Guinea, as previously described. Each run used 50 million MCMC generations, and chain convergence and mixing, effective sample sizes (ESS), and Bayes factors were assessed using the program Tracer v1.5. All ESSs were greater than 400. Maximum clade credibility (MCC) trees were obtained using TreeAnnotator after a burn-in of the first 1000 trees. MCC trees were viewed using the program FigTree v1.3.1.
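To make the ESS criterion concrete, the following sketch estimates an effective sample size from a parameter trace as N / (1 + 2 Σ ρ_k), summing autocorrelations ρ_k until the first non-positive value. This is a simple textbook-style estimator written for illustration. It is an assumption that it approximates what Tracer reports, and it is not Tracer's own algorithm. Both traces are hypothetical.

```python
def effective_sample_size(trace):
    """Estimate ESS as n / (1 + 2 * sum of positive-lag autocorrelations).

    The sum is truncated at the first lag whose autocorrelation is not
    positive, a simple heuristic for avoiding noise at long lags.
    """
    n = len(trace)
    mean = sum(trace) / n
    var = sum((x - mean) ** 2 for x in trace) / n
    if var == 0:
        raise ValueError("trace is constant; autocorrelation is undefined")
    rho_sum = 0.0
    for lag in range(1, n):
        cov = sum(
            (trace[i] - mean) * (trace[i + lag] - mean) for i in range(n - lag)
        ) / n
        rho = cov / var
        if rho <= 0:
            break
        rho_sum += rho
    return n / (1 + 2 * rho_sum)


# Hypothetical traces: one that alternates every step (no positive
# autocorrelation) and one that stays in each state for a long time.
well_mixed = [0, 1] * 50
sticky = [0] * 50 + [1] * 50

print(effective_sample_size(well_mixed))
print(effective_sample_size(sticky))
```

A chain that lingers in one region of parameter space produces far fewer effectively independent samples than its raw length suggests, which is why thresholds such as ESS > 400 are applied to the sampled parameters and not to the number of logged states.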
Apetrei C, Robertson DL, Marx PA: The history of SIVS and AIDS: epidemiology, phylogeny and biology of isolates from naturally SIV infected non-human primates (NHP) in Africa. Front Biosci. 2004, 9: 225-254. 10.2741/1154.
Locatelli S, Peeters M: Cross-species transmission of simian retroviruses: how and why they could lead to the emergence of new diseases in the human population. Aids. 2012, 26: 659-673. 10.1097/QAD.0b013e328350fb68.
Gao F, Bailes E, Robertson DL, Chen Y, Rodenburg CM, Michael SF, Cummins LB, Arthur LO, Peeters M, Shaw GM, Sharp PM, Hahn BH: Origin of HIV-1 in the chimpanzee Pan troglodytes troglodytes. Nature. 1999, 397: 436-441. 10.1038/17130.
Hirsch VM, Olmsted RA, Murphey-Corb M, Purcell RH, Johnson PR: An African primate lentivirus (SIVsm) closely related to HIV-2. Nature. 1989, 339: 389-392. 10.1038/339389a0.
Van Heuverswyn F, Li Y, Neel C, Bailes E, Keele BF, Liu W, Loul S, Butel C, Liegeois F, Bienvenue Y, Ngolle EM, Sharp PM, Shaw GM, Delaporte E, Hahn BH, Peeters M: Human immunodeficiency viruses: SIV infection in wild gorillas. Nature. 2006, 444: 164. 10.1038/444164a.
Bibollet-Ruche F, Galat-Luong A, Cuny G, Sarni-Manchado P, Galat G, Durand JP, Pourrut X, Veas F: Simian immunodeficiency virus infection in a patas monkey (Erythrocebus patas): evidence for cross-species transmission from African green monkeys (Cercopithecus aethiops sabaeus) in the wild. J Gen Virol. 1996, 77: 773-781. 10.1099/0022-1317-77-4-773.
Jin MJ, Rogers J, Phillips-Conroy JE, Allan JS, Desrosiers RC, Shaw GM, Sharp PM, Hahn BH: Infection of a yellow baboon with simian immunodeficiency virus from African green monkeys: evidence for cross-species transmission in the wild. J Virol. 1994, 68: 8454-8460.
Souquiere S, Bibollet-Ruche F, Robertson DL, Makuwa M, Apetrei C, Onanga R, Kornfeld C, Plantier JC, Gao F, Abernethy K, White LJ, Karesh W, Telfer P, Wickings EJ, Mauclere P, Marx PA, Barre-Sinoussi F, Hahn BH, Muller-Trutwin MC, Simon F: Wild Mandrillus sphinx are carriers of two types of lentivirus. J Virol. 2001, 75: 7086-7096. 10.1128/JVI.75.15.7086-7096.2001.
Beer BE, Foley BT, Kuiken CL, Tooze Z, Goeken RM, Brown CR, Hu J, St Claire M, Korber BT, Hirsch VM: Characterization of novel simian immunodeficiency viruses from red-capped mangabeys from Nigeria (SIVrcmNG409 and -NG411). J Virol. 2001, 75: 12014-12027. 10.1128/JVI.75.24.12014-12027.2001.
Bailes E, Gao F, Bibollet-Ruche F, Courgnaud V, Peeters M, Marx PA, Hahn BH, Sharp PM: Hybrid origin of SIV in chimpanzees. Science. 2003, 300: 1713. 10.1126/science.1080657.
Takehisa J, Harada Y, Ndembi N, Mboudjeka I, Taniguchi Y, Ngansop C, Kuate S, Zekeng L, Ibuki K, Shimada T, Bikandou B, Yamaguchi-Kabata Y, Miura T, Ikeda M, Ichimura H, Kaptue L, Hayami M: Natural infection of wild-born mandrills (Mandrillus sphinx) with two different types of simian immunodeficiency virus. AIDS Res Hum Retroviruses. 2001, 17: 1143-1154. 10.1089/088922201316912754.
Aghokeng AF, Bailes E, Loul S, Courgnaud V, Mpoudi-Ngolle E, Sharp PM, Delaporte E, Peeters M: Full-length sequence analysis of SIVmus in wild populations of mustached monkeys (Cercopithecus cephus) from Cameroon provides evidence for two co-circulating SIVmus lineages. Virology. 2007, 360: 407-418. 10.1016/j.virol.2006.10.048.
Liegeois F, Boue V, Mouacha F, Butel C, Ondo BM, Pourrut X, Leroy E, Peeters M, Rouet F: New STLV-3 strains and a divergent SIVmus strain identified in non-human primate bushmeat in Gabon. Retrovirology. 2012, 9: 28. 10.1186/1742-4690-9-28.
Gifford RJ: Viral evolution in deep time: lentiviruses and mammals. Trends Genet. 2012, 28: 89-100. 10.1016/j.tig.2011.11.003.
Bibollet-Ruche F, Bailes E, Gao F, Pourrut X, Barlow KL, Clewley JP, Mwenda JM, Langat DK, Chege GK, McClure HM, Mpoudi-Ngole E, Delaporte E, Peeters M, Shaw GM, Sharp PM, Hahn BH: New simian immunodeficiency virus infecting De Brazza’s monkeys (Cercopithecus neglectus): evidence for a cercopithecus monkey virus clade. J Virol. 2004, 78: 7748-7762. 10.1128/JVI.78.14.7748-7762.2004.
Perelman P, Johnson WE, Roos C, Seuanez HN, Horvath JE, Moreira MA, Kessing B, Pontius J, Roelke M, Rumpler Y, Schneider MP, Silva A, O’Brien SJ, Pecon-Slattery J: A molecular phylogeny of living primates. PLoS Genet. 2011, 7: e1001342. 10.1371/journal.pgen.1001342.
Grubb P, Butynski TM, Oates JF, Bearder SK, Disotell TR, Groves CP, Struhsaker TT: Assessment of the diversity of African primates. Int J Primatol. 2003, 24: 1301-1357.
Courgnaud V, Pourrut X, Bibollet-Ruche F, Mpoudi-Ngole E, Bourgeois A, Delaporte E, Peeters M: Characterization of a novel simian immunodeficiency virus from guereza colobus monkeys (Colobus guereza) in Cameroon: a new lineage in the nonhuman primate lentivirus family. J Virol. 2001, 75: 857-866. 10.1128/JVI.75.2.857-866.2001.
Liegeois F, Lafay B, Formenty P, Locatelli S, Courgnaud V, Delaporte E, Peeters M: Full-length genome characterization of a novel simian immunodeficiency virus lineage (SIVolc) from olive Colobus (Procolobus verus) and new SIVwrcPbb strains from Western Red Colobus (Piliocolobus badius badius) from the Tai Forest in Ivory Coast. J Virol. 2009, 83: 428-439. 10.1128/JVI.01725-08.
Goldberg TL, Sintasath DM, Chapman CA, Cameron KM, Karesh WB, Tang S, Wolfe ND, Rwego IB, Ting N, Switzer WM: Coinfection of Ugandan red colobus (Procolobus [Piliocolobus] rufomitratus tephrosceles) with novel, divergent delta-, lenti-, and spumaretroviruses. J Virol. 2009, 83: 11318-11329. 10.1128/JVI.02616-08.
Locatelli S, Lafay B, Liegeois F, Ting N, Delaporte E, Peeters M: Full molecular characterization of a simian immunodeficiency virus, SIVwrcpbt from Temminck’s red colobus (Piliocolobus badius temminckii) from Abuko Nature Reserve, The Gambia. Virology. 2008, 376: 90-100. 10.1016/j.virol.2008.01.049.
Ahuka-Mundeke S, Ayouba A, Mbala-Kingebeni P, Liegeois F, Esteban A, Lunguya-Metila O, Demba D, Bilulu G, Mbenzo-Abokome V, Inogwabini BI, Muyembe-Tamfum JJ, Delaporte E, Peeters M: Novel multiplexed HIV/simian immunodeficiency virus antibody detection assay. Emerg Infect Dis. 2011, 17: 2277-2286. 10.3201/eid1712.110783.
Johnston AR, Gillespie TR, Rwego IB, McLachlan TL, Kent AD, Goldberg TL: Molecular epidemiology of cross-species Giardia duodenalis transmission in western Uganda. PLoS Negl Trop Dis. 2010, 4: e683. 10.1371/journal.pntd.0000683.
Lauck M, Sibley SD, Lara J, Purdy MA, Khudyakov Y, Hyeroba D, Tumukunde A, Weny G, Switzer WM, Chapman CA, Hughes AL, Friedrich TC, O'Connor DH, Goldberg TL: A novel hepacivirus with an unusually long and intrinsically disordered NS5A protein in a wild Old World primate. J Virol. 2013, 87: 8971-8981. 10.1128/JVI.00888-13.
Lauck M, Sibley SD, Hyeroba D, Tumukunde A, Weny G, Chapman CA, Ting N, Switzer WM, Kuhn JH, Friedrich TC, O'Connor DH, Goldberg TL: Exceptional simian hemorrhagic fever virus diversity in a wild African primate community. J Virol. 2013, 87: 688-691. 10.1128/JVI.02433-12.
Lauck M, Hyeroba D, Tumukunde A, Weny G, Lank SM, Chapman CA, O'Connor DH, Friedrich TC, Goldberg TL: Novel, divergent simian hemorrhagic fever viruses in a wild Ugandan Red colobus monkey discovered using direct pyrosequencing. PLoS ONE. 2011, 6: e19056. 10.1371/journal.pone.0019056.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.
Leonard CK, Spellman MW, Riddle L, Harris RJ, Thomas JN, Gregory TJ: Assignment of intrachain disulfide bonds and characterization of potential glycosylation sites of the type 1 recombinant human immunodeficiency virus envelope glycoprotein (gp120) expressed in Chinese hamster ovary cells. J Biol Chem. 1990, 265: 10373-10382.
Freed EO: Viral late domains. J Virol. 2002, 76: 4679-4687. 10.1128/JVI.76.10.4679-4687.2002.
Strack B, Calistri A, Craig S, Popova E, Gottlinger HG: AIP1/ALIX is a binding partner for HIV-1 p6 and EIAV p9 functioning in virus budding. Cell. 2003, 114: 689-699. 10.1016/S0092-8674(03)00653-6.
von Schwedler UK, Stuchell M, Muller B, Ward DM, Chung HY, Morita E, Wang HE, Davis T, He GP, Cimbora DM, Scott A, Krausslich HG, Kaplan J, Morham SG, Sundquist WI: The protein network of HIV budding. Cell. 2003, 114: 701-713. 10.1016/S0092-8674(03)00714-1.
Martin-Serrano J, Zang T, Bieniasz PD: HIV-1 and Ebola virus encode small peptide motifs that recruit Tsg101 to sites of particle assembly to facilitate egress. Nat Med. 2001, 7: 1313-1319. 10.1038/nm1201-1313.
Puffer BA, Parent LJ, Wills JW, Montelaro RC: Equine infectious anemia virus utilizes a YXXL motif within the late assembly domain of the Gag p9 protein. J Virol. 1997, 71: 6541-6546.
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Jackson JD, Ke Z, Lanczycki CJ, Lu F, Marchler GH, Mullokandov M, Omelchenko MV, Robertson CL, Song JS, Thanki N, Yamashita RA, Zhang D, Zhang N, Zheng C, Bryant SH: CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011, 39: D225-D229. 10.1093/nar/gkq1189.
Ma D, Jasinska A, Kristoff J, Grobler JP, Turner T, Jung Y, Schmitt C, Raehtz K, Feyertag F, Martinez Sosa N, Wijewardana V, Burke DS, Robertson DL, Tracy R, Pandrea I, Freimer N, Apetrei C: SIVagm infection in wild African green monkeys from South Africa: epidemiology, natural history, and evolutionary considerations. PLoS Pathog. 2013, 9: e1003011. 10.1371/journal.ppat.1003011.
Worobey M, Telfer P, Souquiere S, Hunter M, Coleman CA, Metzger MJ, Reed P, Makuwa M, Hearn G, Honarvar S, Roques P, Apetrei C, Kazanji M, Marx PA: Island biogeography reveals the deep history of SIV. Science. 2010, 329: 1487. 10.1126/science.1193550.
Wyand MS, Manson K, Montefiori DC, Lifson JD, Johnson RP, Desrosiers RC: Protection by live, attenuated simian immunodeficiency virus against heterologous challenge. J Virol. 1999, 73: 8356-8363.
Ndongmo CB, Switzer WM, Pau CP, Zeh C, Schaefer A, Pieniazek D, Folks TM, Kalish ML: New multiple antigenic peptide-based enzyme immunoassay for detection of simian immunodeficiency virus infection in nonhuman primates and humans. J Clin Microbiol. 2004, 42: 5161-5169. 10.1128/JCM.42.11.5161-5169.2004.
Hu J, Switzer WM, Foley BT, Robertson DL, Goeken RM, Korber BT, Hirsch VM, Beer BE: Characterization and comparison of recombinant simian immunodeficiency virus from drill (Mandrillus leucophaeus) and mandrill (Mandrillus sphinx) isolates. J Virol. 2003, 77: 4867-4880. 10.1128/JVI.77.8.4867-4880.2003.
Liegeois F, Courgnaud V, Switzer WM, Murphy HW, Loul S, Aghokeng A, Pourrut X, Mpoudi-Ngole E, Delaporte E, Peeters M: Molecular characterization of a novel simian immunodeficiency virus lineage (SIVtal) from northern talapoins (Miopithecus ogouensis). Virology. 2006, 349: 55-65. 10.1016/j.virol.2006.01.011.
Switzer WM, Parekh B, Shanmugam V, Bhullar V, Phillips S, Ely JJ, Heneine W: The epidemiology of simian immunodeficiency virus infection in a large number of wild- and captive-born chimpanzees: evidence for a recent introduction following chimpanzee divergence. AIDS Res Hum Retroviruses. 2005, 21: 335-342. 10.1089/aid.2005.21.335.
Goldberg TL, Paige SB, Chapman CA: The Kibale EcoHealth project: exploring connections among human health, animal health, and landscape dynamics in western Uganda. New Directions in Conservation Medicine: Applied Cases of Ecological Health. Edited by: Aguirre PD AA, Ostfeld RS. 2012, New York: Oxford University Press, 452-465.
Ting N: Mitochondrial relationships and divergence dates of the African colobines: evidence of Miocene origins for the living colobus monkeys. J Hum Evol. 2008, 55: 312-325. 10.1016/j.jhevol.2008.02.011.
Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214. 10.1186/1471-2148-7-214.
This work was funded by NIH grant TW009237 as part of the joint NIH-NSF Ecology of Infectious Disease program and the UK Economic and Social Research Council and in part by National Institutes of Health grants R01 AI084787 and R01 AI077376-04A1; it was also supported by National Center for Research Resources grant RR000167 and the Office of Research Infrastructure Programs (ORIP) grant P51OD011106. The research was conducted, in part, at a facility constructed with support from Research Facilities Improvement Program grants RR15459-01 and RR020141-01. This research was performed using resources and the computing assistance of the UW-Madison Center for High Throughput Computing (CHTC) in the Department of Computer Sciences. The CHTC is supported by UW-Madison and the Wisconsin Alumni Research Foundation, and is an active member of the Open Science Grid, which is supported by the National Science Foundation and the U.S. Department of Energy’s Office of Science. Use of trade names is for identification only and does not imply endorsement by the U.S. Department of Health and Human Services, the Public Health Service, or the Centers for Disease Control and Prevention. The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the Centers for Disease Control and Prevention.
The authors declare that they have no competing interests.
Conceived and designed the experiments: DHO, TLG, TCF, CAC. Performed the experiments: ML, SDS, WMS, AS, BT. Analyzed the data: ML, WMS. Wrote the paper: ML, WMS, TLG, DHO, TCF, NT. Conducted study in the field: DH, AT, GW, TLG. All authors read and approved the final manuscript.
Lauck, M., Switzer, W.M., Sibley, S.D. et al. Discovery and full genome characterization of two highly divergent simian immunodeficiency viruses infecting black-and-white colobus monkeys (Colobus guereza) in Kibale National Park, Uganda. Retrovirology 10, 107 (2013). https://doi.org/10.1186/1742-4690-10-107
- Simian immunodeficiency virus
- Old World primate
- Colobus guereza
- Next-generation sequencing
- Virus discovery