Functional conservation of HIV-1 Gag: implications for rational drug design
© Li et al.; licensee BioMed Central Ltd. 2013
Received: 14 July 2013
Accepted: 21 October 2013
Published: 31 October 2013
HIV-1 replication can be successfully blocked by targeting gag gene products, offering a promising strategy for new drug classes that complement current HIV-1 treatment options. However, naturally occurring polymorphisms at drug binding sites can severely compromise HIV-1 susceptibility to gag inhibitors in clinical and experimental studies. Therefore, a comprehensive understanding of gag natural diversity is needed.
We analyzed the degree of functional conservation in 10862 full-length gag sequences across 8 major HIV-1 subtypes and identified the impact of natural variation on known drug binding positions targeted by more than 20 gag inhibitors published to date. Complete conservation across all subtypes was detected in 147 (29%) out of 500 gag positions, with the highest level of conservation observed in capsid protein. Almost half (41%) of the 136 known drug binding positions were completely conserved, but all inhibitors were confronted with naturally occurring polymorphisms in their binding sites, some of which correlated with HIV-1 subtype. Integration of sequence and structural information revealed one drug binding pocket with minimal genetic variability, which is situated at the N-terminal domain of the capsid protein.
This first large-scale analysis of full-length HIV-1 gag provided a detailed mapping of natural diversity across major subtypes and highlighted the considerable variation in current drug binding sites. Our results contribute to the optimization of gag inhibitors in rational drug design, given that drug binding sites should ideally be conserved across all HIV-1 subtypes.
KeywordsHIV subtype Gag inhibitor Matrix Capsid Nucleocapsid Drug binding site Natural polymorphism Amino acid conservation
A curative therapy or preventive vaccine for HIV-1 infected patients remains elusive to date. Standard HIV treatment is confronted with the emergence of viral resistance to existing drug classes, necessitating the development of inhibitors with new mechanisms of action . The gag polyprotein, essential for HIV-1 morphogenesis, comprises four major domains (matrix, capsid, nucleocapsid, p6) and two small spacer peptides (p1, p2) . Recently, HIV-1 inhibitors that target different stages of virion morphogenesis demonstrated promising antiviral activity, mainly by inhibiting capsid assembly, disrupting nucleocapsid binding with viral RNA/DNA or blocking proteolytic processing of polyproteins during maturation [2–5].
HIV-1 subtype B isolates were predominantly used for the in vitro experiments. Non-B subtypes however account for 90% of HIV-1 infections worldwide  and amino acid (AA) compositions can differ up to 30% between subtypes . Recently, treatment failure of patients in a phase II clinical study of the maturation inhibitor bevirimat was attributed to natural polymorphisms at drug binding positions, showing up in subtype-specific patterns . Studies that extensively investigate the implications of HIV-1 diversity for gag-directed drug development are lacking to date. In this large-scale analysis, we examined the distribution of naturally occurring sequence variability in full-length gag sequences of major HIV-1 subtypes. Moreover, we evaluated the impact of HIV-1 subtypes on the conservation of gag drug binding positions and multisite binding pockets published to date.
We analyzed 10862 full-length gag sequences that fulfilled the quality criteria, encompassing 8 HIV-1 group M subtypes and CRFs: A1 (n = 1648), B (n = 4131), C (n = 2780), D (n = 443), F1 (n = 35), G (n = 49), CRF01_AE (n = 1714) and CRF02_AG (n = 62). Sequences were sampled from 61 countries between 1981 and 2012. Additional file 1: Table S1 summarizes more than 50 gag inhibitors including their binding sites, target protein, mechanism of action, HIV-1 subtypes and PDB data. These candidate inhibitors were either small organic molecules or peptides and primarily targeted the capsid or nucleocapsid proteins. A total of 136 gag positions were reported as drug binding positions, of which 53 interacted with more than one inhibitor.
Natural polymorphism proportions in gag domains and drug binding positions across 8 HIV-1 subtypes and CRFs (%)
The inter- and intra-subtype diversity of gag AA sequences in 8 HIV-1 subtypes and CRFs (%)
The pairwise AA diversity of gag domains in 8 HIV-1 subtypes and CRFs (%)
Discussion and conclusions
To our knowledge, our large-scale analysis provided the first detailed mapping of functional conservation of gag across major HIV-1 subtypes, with implications for the rational design of gag inhibitors. With more than 50 gag inhibitors published to date, targeting virion morphogenesis is considered a potential new drug class for HIV-1 treatment . A clinical proof-of-concept was demonstrated in a phase II clinical trial of the maturation inhibitor bevirimat , which blocks proteolytic processing at the capsid-p2 cleavage site . Lack of response was observed in 50% of patients and attributed to naturally occurring polymorphisms in the p2 region . A single polymorphism V370A is sufficient for a 40-fold reduction in bevirimat drug susceptibility , with A370 representing the consensus amino acid in several non-B subtypes. Natural diversity was also observed to affect drug effectiveness of other experimental gag inhibitors [13–15]. Polymorphisms T190I, E230D and I256V, for instance, reduced drug susceptibility to the benzodiazepine and benzimidazole compounds . Moreover, known HIV vaccine candidates containing subtype B gag gene in HIV-derived vectors did not show sufficient protective efficacies in several large-scale clinical trials . The high diversity of gag and env genes within and between subtypes can contribute to the challenges of designing a global HIV vaccine neutralizing all HIV-1 subtypes . For the development of HIV vaccine and a potential new drug class targeting virion morphogenesis , an assessment of gag functional conservation and polymorphisms at known drug binding positions is warranted.
We found that 23.4% of drug binding positions in the full-length gag showed natural polymorphisms in non-B subtypes which could not be detected in subtype B. More importantly, all gag inhibitors had at least one polymorphic binding position irrespective of subtype. We also found levels of gag intra- and inter-subtype diversity (9.04% and 17.0%) that exceeded diversity estimates of key viral enzymes (< 7% and < 11%) targeted by standard HIV-1 treatment . However, the most conserved gag protein capsid has the same level of intra-subtype diversity as integrase (~5%) , favoring it as a conserved drug target.
Another potential drug target is the nucleocapsid protein, containing two critical zinc-finger domains for binding with viral RNA genomes . Our conservation analysis mapped the conserved nucleocapsid regions to zinc-finger domains (Figures 2 and 5) and confirmed previous findings of absolute conservation of CCHC motifs at zinc-coordinating positions . However, we detected considerable variation at other positions, which may alter drug binding and affect antiviral activity. Furthermore, nucleocapsid inhibitors tend to suffer from limited specificity and high toxicity due to the ubiquitous presence of zinc finger domains in many human proteins .
Matrix inhibitors with broad spectrum antiviral activities were recently reported, but mutations at drug binding positions significantly reduced their effectiveness [23, 24]. We also observed many natural variants at their drug binding sites (Additional file 1: Table S2), suggesting that further optimization of matrix inhibitors is needed.
Studies that analyzed genetic variability and drug binding site heterogeneity in gag using large-scale sequence populations are lacking. Previously, small subtype B sequence datasets were used to characterize gag conservation (n = 125)  or positive selective pressure (n = 635) . Polymorphisms at drug binding sites of capsid inhibitor PF-3450074  and conservation of nucleocapsid zinc-finger domains  were also reported using fewer than 200 sequences. The only large-scale analysis that we found  quantified the drug binding site conservation of a single matrix inhibitor and lacked information on subtype-specific variations. By contrast, we presented here a large-scale and integrative analysis using 10862 full-length gag sequences, 136 gag inhibitor drug binding positions and 14 PDB structures. Natural polymorphisms of full-length gag were detected across 8 major HIV-1 subtypes and a robust estimation of functional conservation was performed using CI analysis, which incorporated biochemical similarities between amino acids (Additional file 3). This sequence analysis predicted three conserved drug targets in gag (Figure 2) which were confirmed by existing structural knowledge (Figure 5).
This study is limited in that it neither addressed how to optimize known gag inhibitors nor quantified the impact of newly identified polymorphisms on antiviral activities of investigated inhibitors. We collected all available PDBs of gag-inhibitor structures from the RCSB protein data bank, but more crystallized complexes are needed to reveal novel mechanisms of action. Moreover, the limited number of available gag sequences for subtypes F1, G and CRF02_AG (n < 100) may have affected the identification of polymorphic positions, but consistent conservation patterns were observed in gag regardless of HIV-1 subtype (Figure 2). While we attempted to be as comprehensive as possible, additional inhibitors may have been reported. Conservation of their binding positions can nevertheless be deduced from our full-length gag analysis. Future studies are also needed to address whether interactions between gag and protease can affect gag drug binding sites, leading to compromised drug activities of gag inhibitors .
In conclusion, our study presented a comprehensive mapping of functional conservation in gag and strengthened the idea of capsid as a potential target for HIV-1 therapeutics. Increased knowledge on HIV-1 natural diversity in drug binding pockets contributes to rational design of gag inhibitors and it remains a challenge to design gag inhibitors with drug binding sites conserved across HIV-1 subtypes.
We retrieved 12543 gag sequences spanning all 1500 base pairs from the HIV Los Alamos database (http://www.hiv.lanl.gov). Sequences were aligned against the HXB2 reference and manually curated using Seaview 4.3 . Hypermutated sequences were detected using the Los Alamos hypermut tool . HIV-1 subtype was determined by the Rega  and COMET subtyping tools (http://comet.retrovirology.lu/). Sequence quality was ensured by excluding duplicates and sequences with internal stop-codons, hypermutations, more than 1% ambiguous nucleotides, discordant subtype classification or an identical combination of patient code, sampling year and country. The analysis was restricted to the major subtypes and circulating recombinant forms (CRFs) characterizing the global HIV-1 subtype distribution . For each individual subtype, amino acids that differed from the corresponding consensus AA and with prevalence ≥ 0.5% were defined as polymorphisms . PDB data of protein-inhibitor complexes were collected from the RCSB Protein Data Bank , summarized in Additional file 1. The AA sequences in each PDB were aligned against the HXB2 reference. Drug binding pockets were defined by protein positions within a minimum Euclidean distance of less than 5Å between atoms of inhibitors and non-hydrogen atoms of residues . Information on known gag candidate inhibitors and binding positions was retrieved from more than 50 publications, summarized in Additional file 1.
To quantify the degree of positional conservation, a conservation index (CI) was calculated for each position by averaging pairwise scores between all AAs using the BLOSUM62 substitution matrix. Adapted from Karlin and Brocchieri , the conservation index (CI) of position x is calculated as: , where x i is the amino acid at position x in the ith sequence of the multiple sequence alignment (MSA), N is the number of sequences in the MSA and S(x i , x j ) is the substitution score of BLOSUM62 between amino acids x i and x i . Given that denominators cannot be zero, a linear transformation was applied to S(x i , x j ) by adding the absolute value of the minimum score | min(S)| + 1. CI measures were scaled between 0 and 1, with a CI value of 0 indicating that AA variation was absent at that position. A highly conserved position was identified if its CI is below 0.01 for each HIV-1 subtype, a cutoff which corresponds approximately to a cumulative polymorphism prevalence below 1% (Additional file 3). The Mann–Whitney U test was performed to compare CI distributions. Performance of the CI method is evaluated in Additional file 3 and our Matlab toolbox for sequence analysis is available in Additional file 4.
circulating recombinant form
multiple sequence alignment
protein data bank.
We thank Supinya Piampongsant, Fossie Ferreira, Andrea Clemencia Pineda, Ricardo Khouri, Mónica Eusébio, Soraya Maria Menezes and Dan Clements for technical assistance and valuable contributions to the analysis. The research leading to these results has been supported in part by the European Community's Seventh Framework Programme (FP7/2007-2013) under the project "Collaborative HIV and Anti-HIV Drug Resistance Network (CHAIN)" grant agreement n° 223131; by the Fonds voor Wetenschappelijk Onderzoek – Flanders (FWO) grant G.0611.09. GL acknowledged his funding from China Scholarship Council. AV acknowledged his JSPS funding. KT received funding from the Fonds voor Wetenschappelijk Onderzoek – Flanders (FWO).
- Engelman A, Cherepanov P: The structural biology of HIV-1: mechanistic and therapeutic insights. Nat Rev Microbiol. 2012, 10: 279-290. 10.1038/nrmicro2747.PubMed CentralView ArticlePubMedGoogle Scholar
- Waheed AA, Freed EO: HIV type 1 Gag as a target for antiviral therapy. AIDS Res Hum Retroviruses. 2012, 28: 54-75. 10.1089/aid.2011.0230.PubMed CentralView ArticlePubMedGoogle Scholar
- Bocanegra R, Rodriguez-Huete A, Fuertes MA, Del Alamo M, Mateu MG: Molecular recognition in the human immunodeficiency virus capsid and antiviral design. Virus Res. 2012, 169: 388-410. 10.1016/j.virusres.2012.06.016.View ArticlePubMedGoogle Scholar
- Dau B, Holodniy M: Novel targets for antiretroviral therapy: clinical progress to date. Drugs. 2009, 69: 31-50. 10.2165/00003495-200969010-00003.View ArticlePubMedGoogle Scholar
- Salzwedel K, Martin DE, Sakalian M: Maturation inhibitors: a new therapeutic class targets the virus structure. AIDS Rev. 2007, 9: 162-172.PubMedGoogle Scholar
- Hemelaar J, Gouws E, Ghys PD, Osmanov S: Isolation W-UNfH, Characterisation: Global trends in molecular epidemiology of HIV-1 during 2000–2007. AIDS. 2011, 25: 679-689. 10.1097/QAD.0b013e328342ff93.PubMed CentralView ArticlePubMedGoogle Scholar
- Gaschen B, Taylor J, Yusim K, Foley B, Gao F, Lang D, Novitsky V, Haynes B, Hahn BH, Bhattacharya T, Korber B: Diversity considerations in HIV-1 vaccine selection. Science. 2002, 296: 2354-2360. 10.1126/science.1070441.View ArticlePubMedGoogle Scholar
- Adamson CS, Sakalian M, Salzwedel K, Freed EO: Polymorphisms in Gag spacer peptide 1 confer varying levels of resistance to the HIV- 1 maturation inhibitor bevirimat. Retrovirology. 2010, 7: 36-10.1186/1742-4690-7-36.PubMed CentralView ArticlePubMedGoogle Scholar
- Blair WS, Pickford C, Irving SL, Brown DG, Anderson M, Bazin R, Cao J, Ciaramella G, Isaacson J, Jackson L, et al: HIV capsid is a tractable target for small molecule therapeutic intervention. PLoS Pathog. 2010, 6: e1001220-10.1371/journal.ppat.1001220.PubMed CentralView ArticlePubMedGoogle Scholar
- Smith PF, Ogundele A, Forrest A, Wilton J, Salzwedel K, Doto J, Allaway GP, Martin DE: Phase I and II study of the safety, virologic effect, and pharmacokinetics/pharmacodynamics of single-dose 3-o-(3',3'-dimethylsuccinyl)betulinic acid (bevirimat) against human immunodeficiency virus infection. Antimicrob Agents Chemother. 2007, 51: 3574-3581. 10.1128/AAC.00152-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Nguyen AT, Feasley CL, Jackson KW, Nitz TJ, Salzwedel K, Air GM, Sakalian M: The prototype HIV-1 maturation inhibitor, bevirimat, binds to the CA-SP1 cleavage site in immature Gag particles. Retrovirology. 2011, 8: 101-10.1186/1742-4690-8-101.PubMed CentralView ArticlePubMedGoogle Scholar
- Lu W, Salzwedel K, Wang D, Chakravarty S, Freed EO, Wild CT, Li F: A single polymorphism in HIV-1 subtype C SP1 is sufficient to confer natural resistance to the maturation inhibitor bevirimat. Antimicrob Agents Chemother. 2011, 55: 3324-3329. 10.1128/AAC.01435-10.PubMed CentralView ArticlePubMedGoogle Scholar
- Goudreau N, Lemke CT, Faucher AM, Grand-Maitre C, Goulet S, Lacoste JE, Rancourt J, Malenfant E, Mercier JF, Titolo S, Mason SW: Novel inhibitor binding site discovery on HIV-1 capsid N-terminal domain by NMR and X-ray crystallography. ACS Chem Biol. 2013Google Scholar
- Bartonova V, Igonet S, Sticht J, Glass B, Habermann A, Vaney MC, Sehr P, Lewis J, Rey FA, Krausslich HG: Residues in the HIV-1 capsid assembly inhibitor binding site are essential for maintaining the assembly-competent quaternary structure of the capsid protein. J Biol Chem. 2008, 283: 32024-32033. 10.1074/jbc.M804230200.View ArticlePubMedGoogle Scholar
- Lemke CT, Titolo S, von Schwedler U, Goudreau N, Mercier JF, Wardrop E, Faucher AM, Coulombe R, Banik SS, Fader L, et al: Distinct effects of two HIV-1 capsid assembly inhibitor families that bind the same site within the N-terminal domain of the viral CA protein. J Virol. 2012, 86: 6643-6655. 10.1128/JVI.00493-12.PubMed CentralView ArticlePubMedGoogle Scholar
- Schiffner T, Sattentau QJ, Dorrell L: Development of prophylactic vaccines against HIV-1. Retrovirology. 2013, 10: 72-10.1186/1742-4690-10-72.PubMed CentralView ArticlePubMedGoogle Scholar
- Stephenson KE, Barouch DH: A global approach to HIV-1 vaccine development. Immunol Rev. 2013, 254: 295-304. 10.1111/imr.12073.PubMed CentralView ArticlePubMedGoogle Scholar
- Rhee SY, Liu TF, Kiuchi M, Zioni R, Gifford RJ, Holmes SP, Shafer RW: Natural variation of HIV-1 group M integrase: implications for a new class of antiretroviral inhibitors. Retrovirology. 2008, 5: 74-10.1186/1742-4690-5-74.PubMed CentralView ArticlePubMedGoogle Scholar
- Yufenyuy EL, Aiken C: The NTD-CTD intersubunit interface plays a critical role in assembly and stabilization of the HIV-1 capsid. Retrovirology. 2013, 10: 29-10.1186/1742-4690-10-29.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang H, Curreli F, Zhang X, Bhattacharya S, Waheed AA, Cooper A, Cowburn D, Freed EO, Debnath AK: Antiviral activity of alpha-helical stapled peptides designed from the HIV-1 capsid dimerization domain. Retrovirology. 2011, 8: 28-10.1186/1742-4690-8-28.PubMed CentralView ArticlePubMedGoogle Scholar
- Goudreau N, Coulombe R, Faucher AM, Grand-Maitre C, Lacoste JE, Lemke CT, Malenfant E, Bousquet Y, Fader L, Simoneau B, et al: Monitoring binding of HIV-1 capsid assembly inhibitors using (19)F ligand-and (15)N protein-based NMR and X-ray crystallography: early hit validation of a benzodiazepine series. ChemMedChem. 2013, 8: 405-414. 10.1002/cmdc.201200580.View ArticlePubMedGoogle Scholar
- Thomas JA, Gorelick RJ: Nucleocapsid protein function in early infection processes. Virus Res. 2008, 134: 39-63. 10.1016/j.virusres.2007.12.006.PubMed CentralView ArticlePubMedGoogle Scholar
- Zentner I, Sierra LJ, Fraser AK, Maciunas L, Mankowski MK, Vinnik A, Fedichev P, Ptak RG, Martin-Garcia J, Cocklin S: Identification of a small-molecule inhibitor of HIV-1 assembly that targets the phosphatidylinositol (4,5)-bisphosphate binding site of the HIV-1 matrix protein. ChemMedChem. 2013, 8: 426-432. 10.1002/cmdc.201200577.View ArticlePubMedGoogle Scholar
- Alfadhli A, McNett H, Eccles J, Tsagli S, Noviello C, Sloan R, Lopez CS, Peyton DH, Barklis E: Analysis of small molecule ligands targeting the HIV-1 matrix protein-RNA binding site. J Biol Chem. 2013, 288: 666-676. 10.1074/jbc.M112.399865.PubMed CentralView ArticlePubMedGoogle Scholar
- Frahm N, Kaufmann DE, Yusim K, Muldoon M, Kesmir C, Linde CH, Fischer W, Allen TM, Li B, McMahon BH, et al: Increased sequence diversity coverage improves detection of HIV-specific T cell responses. J Immunol. 2007, 179: 6638-6650.View ArticlePubMedGoogle Scholar
- Snoeck J, Fellay J, Bartha I, Douek DC, Telenti A: Mapping of positive selection sites in the HIV-1 genome in the context of RNA and protein structural constraints. Retrovirology. 2011, 8: 87-10.1186/1742-4690-8-87.PubMed CentralView ArticlePubMedGoogle Scholar
- Zentner I, Sierra LJ, Maciunas L, Vinnik A, Fedichev P, Mankowski MK, Ptak RG, Martin-Garcia J, Cocklin S: Discovery of a small-molecule antiviral targeting the HIV-1 matrix protein. Bioorg Med Chem Lett. 2013, 23: 1132-1135. 10.1016/j.bmcl.2012.11.041.PubMed CentralView ArticlePubMedGoogle Scholar
- Fun A, Wensing AM, Verheyen J, Nijhuis M: Human immunodeficiency virus Gag and protease: partners in resistance. Retrovirology. 2012, 9: 63-10.1186/1742-4690-9-63.PubMed CentralView ArticlePubMedGoogle Scholar
- Gouy M, Guindon S, Gascuel O: SeaView version 4: A multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 2010, 27: 221-224. 10.1093/molbev/msp259.View ArticlePubMedGoogle Scholar
- Rose PP, Korber BT: Detecting hypermutations in viral sequences with an emphasis on G –> A hypermutation. Bioinformatics. 2000, 16: 400-401. 10.1093/bioinformatics/16.4.400.View ArticlePubMedGoogle Scholar
- Pineda-Pena AC, Faria NR, Imbrechts S, Libin P, Abecasis AB, Deforche K, Gomez-Lopez A, Camacho RJ, de Oliveira T, Vandamme AM: Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools. Infect Genet Evol. 2013Google Scholar
- Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The protein data bank. Nucleic Acids Res. 2000, 28: 235-242. 10.1093/nar/28.1.235.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang Z, Li Y, Lin B, Schroeder M, Huang B: Identification of cavities on protein surface using multiple computational approaches for drug binding site prediction. Bioinformatics. 2011, 27: 2083-2088. 10.1093/bioinformatics/btr331.View ArticlePubMedGoogle Scholar
- Brocchieri L, Karlin S: Conservation among HSP60 sequences in relation to structure, function, and evolution. Protein Sci. 2000, 9: 476-486.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.