
Genome-wide association studies on HIV susceptibility, pathogenesis and pharmacogenomics


Susceptibility to HIV-1 and the clinical course after infection show substantial heterogeneity between individuals. Part of this variability can be attributed to host genetic variation. Initial candidate gene studies have revealed interesting host factors that influence HIV infection, replication and pathogenesis. Recently, genome-wide association studies (GWAS) were utilized for unbiased searches at a genome-wide level to discover novel genetic factors and pathways involved in HIV-1 infection. This review gives an overview of findings from the GWAS performed on HIV infection, within different cohorts, with variable patient and phenotype selection. Furthermore, novel techniques and strategies in research that might contribute to a complete understanding of virus-host interactions and their role in the pathogenesis of HIV infection are discussed.


There is considerable heterogeneity in HIV-1 susceptibility and in disease progression rates after infection. Certain people are relatively resistant to HIV-1 infection and remain uninfected despite multiple exposures to HIV-1, while others are infected upon first exposure. After seroconversion, some individuals progress to AIDS in as little as 2 years, while others remain symptom-free for more than 15 years. This variation between individuals is determined by both viral and host factors.

The emergence of HIV-1 variants that use coreceptor CXCR4 rather than CCR5 in the course of infection is associated with an accelerated CD4+ T-cell decline and more rapid progression to AIDS [1, 2]. Other evidence that viral factors may influence the clinical course of HIV-1 infection comes from a cohort of long-term nonprogressors (LTNPs) who were all infected with an HIV-1 variant that was attenuated due to a deletion in the viral nef gene [3].

The first polymorphisms in host genetic factors that affected HIV-1 infection and disease were determined using candidate gene studies, in which genetic variants of host factors that were already known or suspected to play a role in HIV-1 pathogenesis and immune regulation were tested for association with HIV-1 infection and/or disease progression. These studies identified several important host polymorphisms associated with HIV-1 infection and pathogenesis [4-14]. The human leukocyte antigen (HLA) type is a strong example of a host factor that is associated with HIV-1 disease course. HLA-B*5701 and HLA-B27 are more prevalent among LTNPs whereas HLA-B35 is associated with an accelerated progression to AIDS [15-17]. Another important host factor polymorphism is a 32 basepair deletion in CCR5 (CCR5Δ32), the major coreceptor for HIV-1. This deletion, which results in a truncated protein product that is no longer expressed on the cell surface, provides nearly complete protection against HIV-1 infection in individuals homozygous for this deletion [18-20]. Individuals carrying the heterozygous CCR5Δ32 genotype have sufficient CCR5 expression on the cell surface to support infection; however, this heterozygous genotype is associated with delayed disease progression after HIV-1 infection [18, 21, 22].

In the case of CCR5, the association between the genetic polymorphism and disease progression has even resulted in the development of new antiviral strategies to block CCR5 in HIV-1 infected individuals [23]. These developments illustrate the potential of host genetic research to combat HIV-1 infection and AIDS. However, even when combined, these genetic variants explained only a small fraction of the variability of HIV-1 control between individuals.

The more recent genome-wide association studies (GWAS) offer a hypothesis-free analysis to scan the complete human genome for additional factors without a priori knowledge about their role in complex diseases. Following the completion of the human genome sequence in 2003 [24], the HapMap project was launched [25], in which commonly occurring genetic variations were identified along the complete genome and across several populations. These variations, mostly changes of a single nucleotide, are called single-nucleotide polymorphisms (SNPs). In most of the genome, combinations of SNPs are found together in blocks with strong linkage disequilibrium (LD), which makes it possible to cover almost the whole genome on the chips used in GWAS by selecting “tagSNPs” that represent the LD blocks [26]. In this review, we will discuss GWAS performed on HIV-1, focusing on the differences in cohorts and phenotypes used.
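To make the idea of LD between two SNPs concrete, the following minimal sketch computes the commonly used r^2 statistic from phased haplotypes. It is an illustration only: the function name `ld_r2`, the 0/1 allele coding and the toy haplotypes are assumptions made for this example and are not taken from any of the studies discussed here.

```python
from typing import Sequence, Tuple


def ld_r2(haplotypes: Sequence[Tuple[int, int]]) -> float:
    """Return r^2 between two biallelic SNPs from phased haplotypes.

    Each haplotype is a pair (a, b), where 0 is one allele and 1 is the
    other allele at SNP A and SNP B respectively (hypothetical coding).
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n            # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n            # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                               # LD coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a SNP is monomorphic")
    return d * d / denom


# Toy haplotypes, invented for illustration.
perfect_ld = [(0, 0)] * 6 + [(1, 1)] * 4               # alleles always travel together
no_ld = [(0, 0), (0, 1), (1, 0), (1, 1)] * 5           # alleles combine at random

print(ld_r2(perfect_ld))
print(ld_r2(no_ld))
```

When r^2 is close to 1, genotyping one of the two SNPs is nearly as informative as genotyping both, which is the rationale for tagSNP selection; when r^2 is close to 0, one SNP says little about the other.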


Genome-wide association studies on HIV-1

Host genetic factors that associate with viral load control

The first reported GWAS was performed on 486 individuals from the Euro-CHAVI cohort and used HIV RNA viral load at set point, which is known to be predictive of disease progression, as the phenotype [27] (see Table 1 for an overview of all GWAS published to date). In the association analysis, using linear regression, two loci were genome-wide significantly associated with viral load at set point. Without an a priori hypothesis, a stringent correction for multiple tests in GWAS is required to avoid false-positive errors. The current standard for genome-wide significance in GWAS is a P-value below 5 x 10^-8. One of the two loci is tagged by SNP rs2395029 near the HLA complex 5 gene (HCP5), a gene that is localized within the MHC class I region. SNP rs2395029 is in nearly absolute LD with HLA-B*57, which was already known to be protective against disease progression as described above. The other SNP, rs9264942, is located 35 kb upstream of the HLA-C gene. It was shown that variation within the 3’ UTR of HLA-C regulates binding of the microRNA hsa-miR-148 to its target site, resulting in differential expression of the HLA-C gene [28, 29]. Despite the fact that the HCP5 SNP and the −35 HLA-C SNP were in moderate LD, Fellay et al. showed an independent effect of each of these variants on the viral load set point variation between individuals. This first GWAS additionally identified a set of seven SNPs that were in high LD and located close to ring finger protein 39 (RNF39) and zinc ribbon domain-containing protein 1 (ZNRD1) to be associated with progression to a CD4+ T cell count below 350 cells/µl [27]. The replication study by Catano et al. [30] suggested that the apparent effect of these SNPs on HIV-1 disease progression might be explained by very strong LD between these SNPs and HLA-A10.
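As a rough illustration of the two ideas in this paragraph, the sketch below shows how a genome-wide significance threshold follows from a Bonferroni correction and how a single SNP can be tested against a quantitative phenotype by simple linear regression under an additive model. The assumption of about one million independent tests is the usual rationale for the 5 x 10^-8 threshold; the function names, the 0/1/2 dosage coding and the toy viral load values are invented for this example and do not reproduce the analysis of Fellay et al.

```python
import math
from typing import Sequence, Tuple


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold that keeps the family-wise error at alpha."""
    return alpha / n_tests


def additive_regression(dosages: Sequence[int],
                        phenotype: Sequence[float]) -> Tuple[float, float]:
    """Least-squares slope and t-statistic for phenotype ~ allele dosage.

    dosages: number of copies (0, 1 or 2) of the tested allele per individual.
    phenotype: e.g. log10 viral load at set point (toy values below).
    """
    n = len(dosages)
    mean_x = sum(dosages) / n
    mean_y = sum(phenotype) / n
    sxx = sum((x - mean_x) ** 2 for x in dosages)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dosages, phenotype))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual sum of squares and standard error of the slope (n - 2 degrees of freedom).
    rss = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(dosages, phenotype))
    se = math.sqrt(rss / (n - 2) / sxx)
    return slope, slope / se


# Assumption: roughly one million independent common variants are tested.
threshold = bonferroni_threshold(0.05, 1_000_000)

# Toy data, invented for illustration: each allele copy lowers the viral load.
toy_dosages = [0, 0, 0, 1, 1, 1, 2, 2]
toy_log_vl = [4.8, 4.6, 5.0, 4.1, 4.3, 3.9, 3.4, 3.6]
slope, t_stat = additive_regression(toy_dosages, toy_log_vl)

print(f"threshold = {threshold:.1e}")
print(f"slope = {slope:.2f} log10 copies/ml per allele, t = {t_stat:.1f}")
```

In a real analysis the t-statistic would be converted to a P-value and compared with the threshold, and covariates such as ancestry components would normally be added to the regression; both steps are left out here to keep the sketch short.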

Table 1 Description of GWAS on HIV infection and the most interesting signals discovered

In a follow-up study, Fellay et al. [33] performed a GWAS on an extended population (n = 2362) to identify additional genetic variants that could explain the variability of HIV-1 control between individuals. As expected, this study confirmed the association of the HCP5 SNP (rs2395029, P = 4.5 x 10^-35) and the −35 HLA-C SNP (rs9264942, P = 5.9 x 10^-32) with viral load at set point. In addition to these already known variants, the authors identified other, independent loci in the MHC that were associated with viral load control. Examples are rs9468692, located in the 3’ region of the TRIM10 gene, and the non-synonymous coding SNP rs8192591, located in the 9th exon of the NOTCH4 gene.

Dalmasso et al. [31] also used viral load as a disease phenotype in their GWAS, but evaluated plasma HIV-RNA during primary infection rather than at set point. This study analyzed 605 seroconverters in a case–control study, comparing 45 long-term HIV controllers, with an RNA viral load below 400 copies/ml for more than 10 years, with the rest of the HIV-infected individuals. The protective allele of SNP rs10484554, located in the region between HLA-C and HLA-B, was genome-wide significantly (P = 3.58 x 10^-9) over-represented among the long-term HIV controllers. These authors identified HCP5 rs2395029 to be most strongly associated with HIV-1 DNA levels in the first patient samples drawn at the time of enrolment during primary infection (P = 6.72 x 10^-7). These HIV-1 DNA levels were considered as a marker of the HIV reservoir in their seroconverters. This SNP was also associated with HIV controller status, thereby confirming the results from the first GWAS by Fellay et al.

Viral load was also used as the phenotype in the multinational HIV Controllers study [35]. This large cohort of HIV-infected individuals was divided into elite and viremic controllers (n = 1526), which are seropositive individuals who are able to control viral load to levels below 50 or 2,000 copies of viral RNA/ml plasma respectively, and HIV-1 progressors (n = 2648), who failed to ever control viremia without therapy. Over 300 SNPs were identified to be genome-wide significantly associated with viral load (P < 5.0 x 10^-8), and all were located within the MHC gene region on chromosome 6. Only four of these SNPs were independently associated with viremic control: the already known SNPs −35 HLA-C (P = 2.8 x 10^-35) and HCP5 rs2395029 (P = 9.7 x 10^-26), and two novel SNPs, rs4418214, located near MICA (P = 1.4 x 10^-34), and rs3131018 in PSORS1C3 (P = 4.2 x 10^-16), a gene that has been implicated in psoriasis. Interestingly, the authors identified several specific amino acids in the HLA-B peptide binding groove to be even more strongly associated with viral load control than any SNP found in the GWAS, or any of the HLA alleles.

Host genetic factors that associate with HIV-1 disease progression

Two GWAS were performed in the Genomics of Resistance to Immunodeficiency Virus (GRIV) cohort to look for genetic associations with extreme phenotypes in HIV-1 infection in either LTNPs [32] or rapid progressors (RP) [34]. The nonprogression GRIV GWAS compared 275 LTNPs to a control group of 1352 seronegative individuals and found HCP5 rs2395029 to be most strongly associated with nonprogression after HIV-1 infection (P = 6.8 x 10^-10). Associations were found mainly with genetic variation on chromosome 6, and this GWAS again confirmed the association between HIV-1 control and the HCP5 and ZNRD1 loci identified in the Euro-CHAVI cohort. More recently, this GWAS was reanalyzed to specifically identify genetic variants that associate with LTNP without elite control of the viral load [39]. To this end, the authors compared 697 uninfected individuals with 186 LTNPs, excluding elite controller patients with a viral load below 100 copies/ml. SNP rs2234358 in the CXCR6 gene was identified to be associated with LTNP and this association could be replicated in three independent European studies (P = 9.7 x 10^-10).
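The case–control comparisons described above can be illustrated with a basic allelic association test. The sketch below is a generic chi-square test on a 2 x 2 table of allele counts (1 degree of freedom), not the statistical model used in the GRIV analyses; the function name and all counts are hypothetical and chosen only for illustration.

```python
import math
from typing import Tuple


def allelic_chi_square(case_alt: int, case_ref: int,
                       control_alt: int, control_ref: int) -> Tuple[float, float]:
    """Pearson chi-square test (1 df) on a 2 x 2 table of allele counts.

    Returns (chi-square statistic, P-value).
    """
    table = [[case_alt, case_ref], [control_alt, control_ref]]
    total = sum(sum(row) for row in table)
    row_sums = [sum(row) for row in table]
    col_sums = [table[0][j] + table[1][j] for j in range(2)]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_sums[i] * col_sums[j] / total
            chi2 += (table[i][j] - expected) ** 2 / expected
    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical allele counts: 200 cases (400 alleles) and 1000 controls (2000 alleles).
chi2, p_value = allelic_chi_square(case_alt=60, case_ref=340,
                                   control_alt=100, control_ref=1900)
print(f"chi2 = {chi2:.1f}, P = {p_value:.2e}")
print("genome-wide significant" if p_value < 5e-8 else "not genome-wide significant")
```

Because each individual contributes two alleles, the table holds twice as many observations as there are participants; this simple allelic test also assumes Hardy-Weinberg proportions and ignores population structure, which real GWAS pipelines correct for.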

The only genome-wide analysis of RP to date [34] revealed several interesting loci outside the MHC region in a case–control study of 85 HIV-1-infected patients who had experienced rapid disease progression and who were compared with 1352 seronegative individuals. SNPs rs4118325 (P = 6.1 x 10^-7), in the vicinity of PRMT6, and rs1522232 in SOX5 (P = 1.8 x 10^-6) were amongst the top SNPs that were associated with rapid progression after HIV-1 infection. These associations were, however, not significant after correction for multiple testing. The mechanism of action of these two SNPs is unknown. Although analysis of RP yields unique loci, these individuals are underrepresented in most cohorts. This low number of RP could be an explanation for the lack of genome-wide significant signals, and indicates the difficulty of replicating signals in other RP cohorts.
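The effect of a small case group on the chance of reaching genome-wide significance can be sketched with a textbook power approximation for comparing allele frequencies between cases and controls. The group sizes of 85 cases and 1352 controls are taken from the text; the allele frequencies of 0.20 and 0.10 are purely hypothetical, and the normal approximation used here is a simplification rather than the method of any cited study.

```python
from statistics import NormalDist


def allelic_power(p_case: float, p_control: float,
                  n_cases: int, n_controls: int,
                  alpha: float = 5e-8) -> float:
    """Approximate power of a two-sided z-test comparing allele frequencies.

    Each individual contributes two alleles. The (negligible) probability of
    rejecting in the wrong direction is ignored.
    """
    variance = (p_case * (1 - p_case) / (2 * n_cases)
                + p_control * (1 - p_control) / (2 * n_controls))
    z_effect = abs(p_case - p_control) / variance ** 0.5
    z_critical = NormalDist().inv_cdf(1 - alpha / 2)
    return NormalDist().cdf(z_effect - z_critical)


# Hypothetical allele frequencies; group sizes as described in the text.
power_small = allelic_power(0.20, 0.10, n_cases=85, n_controls=1352)
# Same hypothetical effect with a ten-fold larger case group.
power_large = allelic_power(0.20, 0.10, n_cases=850, n_controls=1352)

print(f"power with 85 cases:  {power_small:.3f}")
print(f"power with 850 cases: {power_large:.3f}")
```

Under these assumed frequencies even a two-fold difference in allele frequency is unlikely to pass the genome-wide threshold with 85 cases, whereas the same difference would be detected almost always with ten times as many cases.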

A multi-stage GWAS in US seroconverters compared RP (n = 51), moderate progressors (n = 57) and LTNPs (n = 48) [37]. The genetic variant rs17762192, upstream of PROX1, a negative regulator of IFN-γ expression in T cells [46], was associated with slower progression to AIDS (P = 6.2 x 10^-7). Although this SNP upstream of PROX1 was not genome-wide significantly associated with slower progression to AIDS, the locus could be replicated in an independent population of 590 HIV-infected seroconverters.

Troyer et al. [41] identified a cluster of SNPs in the gene PARD3B to be associated with delayed progression to clinical AIDS (P = 3.4 x 10^-9) in a GWAS amongst US seroconverters (n = 755). One of the PARD3B variants in this cluster could be confirmed in two European cohorts of rapid progressors. PARD3B interacts with members of the SMAD family, which are known to interact directly with HIV-1 [47]. The HCP5 rs2395029 signal was not found to be associated with survival time to clinical AIDS in this cohort.

Another GWAS on HIV-1 disease progression was performed in the Amsterdam Cohort Studies (ACS) [43]. In this study in 404 HIV-infected individuals, the association of SNPs with survival time to AIDS diagnosis and AIDS-related death was tested. Although not genome-wide significant, SNP rs152363 showed a trend towards association with disease progression after HIV-1 infection in both the ACS and the GRIV cohort (P = 3.5 x 10^-6). Furthermore, this GWAS showed that in the ACS the HCP5 rs2395029 SNP was also significantly associated with delayed progression to AIDS and AIDS-related death, although the effect was notably reduced when viral load at set point was included as a covariate in multivariate analysis [48].

Results from GWAS on disease progression show that replication in cohorts with alternative phenotypes has proven to be challenging. Moreover, not many prospective seroconversion cohorts exist that have sufficient follow up time between the moment of HIV-1 infection and disease progression, which complicates these survival studies.

GWAS in African populations

Human genetic studies of disease in general have focused almost exclusively on individuals of European ancestry. Until 2011, more than 75% of studies in the catalog of published GWAS analyzed individuals of European ancestry [49]. Remarkably, less than 5% of the GWAS focused on individuals of African descent. For HIV studies this is discouraging, since the HIV-infection prevalence rate is particularly alarming in sub-Saharan Africa. An obvious reason for this inequality in population coverage in GWAS is the availability of study populations with DNA samples and documented phenotypes. In addition, African populations are characterized by lower levels of LD [50], which has led to poor coverage of the whole genome by proxy SNPs on the initial chips used in GWAS. Because of this predominance of European ancestry in GWAS, and the different genetic make-up of other populations, reflected in differences in HLA class I allele frequencies and in LD between populations, associations of genetic variations that were identified in Caucasian HIV-infected individuals can be absent from other populations. Indeed, HCP5 rs2395029 was not found to be associated with viral load at set point in an African population [51].

The first published GWAS in a non-European population searched for associations between more than 500,000 SNPs and viral load at set point in 515 African Americans [38]. Although no loci were genome-wide significantly associated with viral load at set point, one of the strongest associations was a SNP tagging the HLA-B*5703 allele. Individuals carrying the HLA-B*5703 allele have a significantly lower viral load at set point (P = 5.6 x 10^-10), thereby again emphasizing the important association between HLA-B*57 and the control of viral load after HIV infection, both in individuals of African and European ancestry.

In sub-Saharan Africa, children are infected with HIV predominantly through mother-to-child transmission (MTCT). Joubert et al. [36] conducted a GWAS in a Malawi cohort to identify the host genetic factors associated with vertical transmission of HIV. In this study, HIV-negative (n = 126) and HIV-positive (n = 100) children of HIV-infected mothers were compared. The top signal (rs8069770) was not significant after correction for multiple testing (P = 3.79 x 10^-5). This could be due to the limited sample size, which left the study powered to detect only large genetic effects. SNP rs8069770 is located within HS3ST3A1, a gene involved in heparan sulfate biosynthesis, which interestingly is very abundantly expressed in the placenta. In another population of individuals from Malawi, a country with a high prevalence of HIV-1 infection, a GWAS was performed to identify host determinants of HIV-1 susceptibility [40]. Unfortunately, no single SNP yielded a significant P-value after correction for multiple testing when the authors compared 848 high-risk seronegatives with 531 HIV-1 seropositive individuals. Failing to detect a genome-wide significant signal could be due to the difficulty of quantifying the level of HIV-1 exposure in this population. Although the studied individuals are assumed to be in a high-risk category as they are attending STI clinics in a region with a high HIV-1 prevalence and incidence, no actual data were collected on exposure details (e.g. number of partners and type of sexual contacts). Furthermore, HIV-1 susceptibility can be strongly influenced by other factors, like circumcision status of the male partner, concurrent STIs and viral load level of the donor.

A recent GWAS selected participants from two cohorts of African HIV-1 serodiscordant heterosexual couples [45], thereby increasing the probability that the HIV-1 negative partners have a risk for HIV-1 acquisition. In this study 496 HIV-1 infected individuals were compared to 302 matched HIV-1 uninfected individuals with similar documented HIV-1 exposure. Nevertheless, after correction for multiple testing, no SNPs were significantly associated with HIV-1 susceptibility, or with viral load at set point among the subset of HIV-1 infected participants.

Alternative phenotypes in GWAS

Most GWAS on HIV infection have focused on viral load control and disease progression. However, alternative phenotypes might be interesting to identify additional host genetic factors for HIV replication and pharmacogenomics.

We performed a genetic association analysis on in vitro HIV-1 susceptibility of primary monocyte-derived macrophages [42]. The top signal identified in this study, while not genome-wide significant, was rs12483205, an intronic SNP in DYRK1A (P = 2.2 x 10^-5). This SNP also appeared to be associated with HIV-1 disease progression in two independent cohort studies.

The use of antiretroviral drugs has been associated with severe toxicities, including hypersensitivity reactions, neurotoxicity and liver damage [52, 53]. Several candidate gene studies on the association between genetic determinants of people infected with HIV-1 and clinical toxicity resulting from different antiretroviral drugs have been performed. The study by Chantarangsu et al. [44] was the first to conduct a GWAS on therapy side effects in HIV-infected patients. The study was performed in 72 HIV-infected Thai patients with nevirapine-induced rash, compared with 77 nevirapine-tolerant patients, and candidate genes were followed up in an additional group of patients. SNPs rs1265112 and rs746647, within coiled-coil α-helical rod protein 1 (CCHCR1) on chromosome 6, were found to be significantly associated with susceptibility to allergic skin reactions after nevirapine use (P = 1.2 x 10^-8). Since the CCHCR1 gene is located close to both the HLA-B and HLA-C loci, the association with rash might be explained by LD between CCHCR1 and HLA-B*3505, a previously identified genetic marker for nevirapine-induced rash [54].


The first GWAS on HIV infection identified variants in the HLA-region to be most dominantly associated with viral load at set point [27], which was successfully replicated in other cohorts [30-33, 35], using viral load control and disease progression as phenotypes. GWAS that used clinical disease progression as a phenotype, such as LTNP, survival time to AIDS diagnosis and AIDS-related death, identified additional genetic variants outside the HLA-region [32, 34, 37, 39, 41, 43]. However, not all of these signals could be replicated in other studies and they need confirmation. Multiple determinants may account for the observed variability in results from different GWAS, especially those not using viral load at set point. For example, variability in the phenotype studied, differences in the genotyping platform that was used, ancestry of the study population, gender, transmission route of infection and choices of statistical tests may influence the outcomes of these studies. Furthermore, the host factors involved in HIV infection that have been identified up to now explain only a small fraction of the observed heritability. Several explanations for this “missing heritability” have been proposed, such as additional common variants of small effect, low-frequency variants (MAF < 5%), which have not been covered well by current GWAS, and epigenetic effects. Data from the 1000 Genomes Project could help to identify additional associated rare variants. The 1000 Genomes Project aimed to sequence the full genome of approximately 2500 individuals, not only Caucasian individuals but also individuals from African populations from Malawi, the Gambia and Ghana. These sequences will allow for a detailed characterization of human genome sequence variation, and a further increase in sample size will likely also reveal new genetic variants.
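The minor allele frequency (MAF) cut-off mentioned above can be illustrated with a short calculation from genotype counts. This is a minimal sketch: the function names and the genotype counts are invented for this example, and the 5% boundary is simply the value quoted in the text.

```python
def minor_allele_frequency(n_hom_ref: int, n_het: int, n_hom_alt: int) -> float:
    """MAF of a biallelic SNP from counts of the three genotypes."""
    n_alleles = 2 * (n_hom_ref + n_het + n_hom_alt)
    if n_alleles == 0:
        raise ValueError("no genotypes supplied")
    alt_freq = (n_het + 2 * n_hom_alt) / n_alleles
    # The minor allele is whichever of the two alleles is rarer.
    return min(alt_freq, 1 - alt_freq)


def frequency_class(maf: float, threshold: float = 0.05) -> str:
    """Label a variant using the MAF < 5% boundary quoted in the text."""
    return "low-frequency" if maf < threshold else "common"


# Hypothetical genotype counts in 1000 individuals.
maf_a = minor_allele_frequency(n_hom_ref=900, n_het=95, n_hom_alt=5)
maf_b = minor_allele_frequency(n_hom_ref=950, n_het=48, n_hom_alt=2)

print(f"SNP A: MAF = {maf_a:.4f} ({frequency_class(maf_a)})")
print(f"SNP B: MAF = {maf_b:.4f} ({frequency_class(maf_b)})")
```

Variants in the low-frequency class are carried by few individuals in any one cohort, which is one reason why larger pooled sample sets and sequencing-based approaches are proposed for detecting them.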

By combining clinical data from all cohorts that have longitudinal data, a large sample set will be created which will give the opportunity to identify novel polymorphisms outside the HLA-gene region that have a small effect, a low frequency, or a recessive association, thereby explaining only a marginal portion of the observed variance. The International HIV Acquisition Consortium (IHAC) has initiated the collection of longitudinal clinical data from all cohorts that have GWAS data available [55]. It will, however, be a challenge to combine all of the clinical data that were collected by the different cohorts. Other opportunities lie in more in-depth analyses of the available GWAS data, focusing on pathway analysis and gene-gene interactions, or in combining GWAS data with other genome-scale data sets, such as RNAi screens or gene expression profiling. These analyses may support the discovery of additional variants that did not survive the stringent multiple testing correction thresholds in the discovery studies due to limited power.

Advances in sequencing technologies will enable whole-genome sequencing (WGS) to rapidly develop and overtake the position of GWAS in genomic research. Sequencing the complete genome of cases will make it possible to capture the rare variants that might be an explanation for the missing heritability in common diseases and directly identify the causal variant.

Until WGS is inexpensive enough to be used for large sample sizes, careful selection of individuals is essential. One approach involves the selection of individuals from each extreme of the phenotype distribution. For HIV-1, the sequencing of the complete genome of hemophiliacs who are known to have been highly exposed to HIV-1, but who have remained uninfected, has been initiated [56]. Many technical challenges in WGS still need to be overcome. Analysis techniques need to be developed in order to cope with the millions of variants that are identified per genome. Moreover, accurately calling all of the small insertions and deletions is a tremendous challenge.

Whole-exome sequencing, a strategy to selectively sequence the coding regions, can be a more cost-effective alternative to identify host genetic markers that are associated with disease. However, there are drawbacks to this technique, as only a small number of SNPs that are associated with traits are located in, or occur in high LD with, protein-coding regions of genes. The vast majority of trait-associated SNPs fall in intergenic regions and noncoding introns [49, 57].

The shift of genetic research from GWAS to WGS or exome sequencing might be a particularly positive development for genomics in African populations, since the causal variants are genotyped directly, without the need for high LD structures. As a result, a smaller sample size might be needed to identify rare, causal variants, making it possible to collect smaller numbers of samples from individuals from different subpopulation structures throughout the African continent.


Several host genetic determinants of HIV-1 infection and pathogenesis have been identified in the last decades, either by the classical candidate-gene approaches or, in the last four years, with the help of GWAS. In all these studies, variants in the HLA-region and the coreceptor CCR5 were the most consistent and had the largest effect sizes. While these polymorphisms may have a large effect on the disease course in the individual, these genetic markers were only able to explain a small fraction of the overall observed differences in HIV-1 infection and disease progression in the population. The missing genetic variations may be identified by combining GWAS data sets of cohorts to increase power and by looking into additional phenotypes. Furthermore, in the next couple of years the use of whole-genome sequencing will most likely allow the identification of novel rare variations that are associated with HIV-1 susceptibility and disease progression and the unraveling of pathways that are causally involved in these phenotypes. Hopefully, the ongoing genetic research will contribute substantially to the understanding of the pathogenesis of HIV-1 infection and thereby lead to the development of new strategies to combat the AIDS epidemic worldwide.


  1. Connor RI, Sheridan KE, Ceradini D, Choe S, Landau NR: Change in coreceptor use correlates with disease progression in HIV-1-infected individuals. J Exp Med. 1997, 185: 621-628. 10.1084/jem.185.4.621.

  2. Koot M, Keet IPM, Vos AHV, De Goede REY, Roos MTL, Coutinho RA, Miedema F, Schellekens PTA, Tersmette M: Prognostic value of human immunodeficiency virus type 1 biological phenotype for rate of CD4+ cell depletion and progression to AIDS. Ann Intern Med. 1993, 118: 681-688.

  3. Deacon NJ, Tsykin A, Solomon A, Smith K, Ludford-Menting M, Hooker DJ, McPhee DA, Greenway AL, Ellett A, Chatfield C, et al: Genomic structure of an attenuated quasi species of HIV-1 from a blood transfusion donor and recipients. Science. 1995, 270: 988-991. 10.1126/science.270.5238.988.

  4. Smith MW, Dean M, Carrington M, Winkler C, Huttley GA, Lomb DA, Goedert JJ, O'Brien TR, Jacobson LP, Kaslow R, et al: Contrasting genetic influence of CCR2 and CCR5 variants on HIV-1 infection and disease progression. Science. 1997, 277: 959-965. 10.1126/science.277.5328.959.

  5. Ioannidis JPA, O'Brien TR, Rosenberg PS, Contopoulos-Ioannidis DG, Goedert JJ: Genetic effects on HIV disease progression. Nature Med. 1998, 4: 536. 10.1038/nm0598-536.

  6. Kostrikis LG, Huang Y, Moore JP, Wolinsky SM, Zhang L, Guo Y, Deutsch L, Phair J, Neumann AU, Ho DD: A chemokine receptor CCR2 allele delays HIV-1 disease progression and is associated with a CCR5 promotor mutation. Nature Med. 1998, 4: 350-353. 10.1038/nm0398-350.

  7. van Rij RP, De Roda Husman AM, Brouwer M, Goudsmit J, Coutinho RA, Schuitemaker H: Role of CCR2 genotype in the clinical course of syncytium-inducing (SI) or non-SI human immunodeficiency virus type 1 infection and in the time to conversion to SI virus variants. J Infect Dis. 1998, 178: 1806-1811. 10.1086/314522.

  8. Winkler C, Modi W, Smith MW, Nelson GW, Wu X, Carrington M, Dean M, Honjo T, Tashiro K, Yabe D, et al: Genetic restriction of AIDS pathogenesis by an SDF-1 chemokine gene variant. Science. 1998, 279: 389-393. 10.1126/science.279.5349.389.

  9. Mummidi S, Ahuja SS, Gonzalez E, Anderson SA, Santiago EN, Stephan KT, Craig FE, O'Connell P, Tryon V, Clark RA, et al: Genealogy of the CCR5 locus and chemokine system gene variants associated with altered rates of HIV-1 disease progression. Nature Med. 1998, 4: 786-793. 10.1038/nm0798-786.

  10. van Rij RP, Broersen S, Goudsmit J, Coutinho RA, Schuitemaker H: The role of a stromal cell-derived factor-1 chemokine gene variant in the clinical course of HIV-1 infection. AIDS. 1998, 12: F85-F90. 10.1097/00002030-199809000-00002.

  11. Nakayama EE, Hoshino Y, Xin X, Liu H, Goto M, Watanabe N, Taguchi H, Hitani A, Kawana-Tachikawa A, Fukushima M, et al: Polymorphism in the Interleukin-4 promotor affects acquisition of Human Immunodeficiency Virus Type 1 Syncytium-Inducing Phenotype. J Virol. 2000, 74: 5452-5459. 10.1128/JVI.74.12.5452-5459.2000.

  12. Kwa D, van Rij RP, Boeser-Nunnink B, Vingerhoed J, Schuitemaker H: Association between an interleukin-4 promoter polymorphism and the acquisition of CXCR4 using human immunodeficiency virus type 1 variants. AIDS. 2003, 17: 981-985. 10.1097/00002030-200305020-00006.

  13. Nakayama EE, Meyer L, Iwamoto A, Persoz A, Nagai Y, Rouzioux C, Delfraissy JF, Debre P, McIlroy D, Theodorou I, et al: Protective Effect of Interleukin-4–589T Polymorphism on Human Immunodeficiency Virus Type 1 Disease Progression: Relationship with Virus Load. J Infect Dis. 2002, 185: 1183-1186. 10.1086/339825.

  14. Wichukchinda N, Nakayama EE, Rojanawiwat A, Pathipvanich P, Auwanit W, Vongsheree S, Ariyoshi K, Sawanpanyalert P, Shioda T: Protective effects of IL4-589T and RANTES-28G on HIV-1 disease progression in infected Thai females. AIDS. 2006, 20: 189-196. 10.1097/01.aids.0000199830.64735.6f.

  15. Navis M, Schellens I, van Baarle D, Borghans J, van Swieten P, Miedema F, Kootstra N, Schuitemaker H: Viral Replication Capacity as a Correlate of HLA B57/B5801-Associated Nonprogressive HIV-1 Infection. J Immunol. 2007, 179: 3133-3143.

  16. Migueles SA, Sabbaghian MS, Shupert WL, Bettinotti MP, Marincola FM, Martino L, Hallahan CW, Selig SM, Schwartz D, Sullivan J, et al: HLA B*5701 is highly associated with restriction of virus replication in a subgroup of HIV-infected long term nonprogressors. Proc Natl Acad Sci U S A. 2000, 97: 2709-2714. 10.1073/pnas.050567397.

  17. Gao X, Bashirova A, Iversen AK, Phair J, Goedert JJ, Buchbinder S, Hoots K, Vlahov D, Altfeld M, O'Brien SJ, et al: AIDS restriction HLA allotypes target distinct intervals of HIV-1 pathogenesis. Nat Med. 2005, 11: 1290-1292. 10.1038/nm1333.

  18. Dean M, Carrington M, Winkler C, Huttley GA, Smith MW, Allikmets R, Goedert JJ, Buchbinder SP, Vittinghoff E, Gomperts E, et al: Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene. Science. 1996, 273: 1856-1862. 10.1126/science.273.5283.1856.

  19. Liu R, Paxton WA, Choe S, Ceradini D, Martin SR, Horuk R, MacDonald ME, Stuhlmann H, Koup RA, Landau NR: Homozygous defect in HIV-1 coreceptor accounts for resistance of some multiply-exposed individuals to HIV-1 infection. Cell. 1996, 86: 367-377. 10.1016/S0092-8674(00)80110-5.

  20. 20.

    Samson M, Libert F, Doranz BJ, Rucker J, Liesnard C, Farber C-M, Saragosti S, Lapouméroulie C, Cognaux J, Forceille C, et al: Resistance to HIV-1 infection in caucasian individuals bearing mutant alleles of the CCR-5 chemokine receptor gene. Nature. 1996, 382: 722-725. 10.1038/382722a0.

  21.

    Rappaport J, Cho YY, Hendel H, Schwartz EJ, Schachter F, Zagury JF: 32 bp CCR-5 gene deletion and resistance to fast progression in HIV-1 infected heterozygotes. Lancet. 1997, 349: 922-923. 10.1016/S0140-6736(05)62697-9.

  22.

    De Roda Husman AM, Koot M, Cornelissen M, Brouwer M, Broersen SM, Bakker M, Roos MTL, Prins M, De Wolf F, Coutinho RA, et al: Association between CCR5 genotype and the clinical course of HIV-1 infection. Ann Intern Med. 1997, 127: 882-890.

  23.

    Dorr P, Westby M, Dobbs S, Griffin P, Irvine B, Macartney M, Mori J, Rickett G, Smith-Burchnell C, Napier C, et al: Maraviroc (UK-427,857), a potent, orally bioavailable, and selective small-molecule inhibitor of chemokine receptor CCR5 with broad-spectrum anti-human immunodeficiency virus type 1 activity. Antimicrob Agents Chemother. 2005, 49: 4721-4732. 10.1128/AAC.49.11.4721-4732.2005.

  24.

    International Human Genome Sequencing Consortium: Finishing the euchromatic sequence of the human genome. Nature. 2004, 431: 931-945. 10.1038/nature03001.

  25.

    International HapMap Consortium: The International HapMap Project. Nature. 2003, 426: 789-796. 10.1038/nature02168.

  26.

    Steemers FJ, Gunderson KL: Whole genome genotyping technologies on the BeadArray platform. Biotechnol J. 2007, 2: 41-49. 10.1002/biot.200600213.

  27.

    Fellay J, Shianna KV, Ge D, Colombo S, Ledergerber B, Weale M, Zhang K, Gumbs C, Castagna A, Cossarizza A, et al: A whole-genome association study of major determinants for host control of HIV-1. Science. 2007, 317: 944-947. 10.1126/science.1143767.

  28.

    Thomas R, Apps R, Qi Y, Gao X, Male V, O'hUigin C, O'Connor G, Ge D, Fellay J, Martin JN, et al: HLA-C cell surface expression and control of HIV/AIDS correlate with a variant upstream of HLA-C. Nat Genet. 2009, 41: 1290-1294. 10.1038/ng.486.

  29.

    Kulkarni S, Savan R, Qi Y, Gao X, Yuki Y, Bass SE, Martin MP, Hunt P, Deeks SG, Telenti A, et al: Differential microRNA regulation of HLA-C expression and its association with HIV control. Nature. 2011, 472: 495-498. 10.1038/nature09914.

  30.

    Catano G, Kulkarni H, He W, Marconi VC, Agan BK, Landrum M, Anderson S, Delmar J, Telles V, Song L, et al: HIV-1 disease-influencing effects associated with ZNRD1, HCP5 and HLA-C alleles are attributable mainly to either HLA-A10 or HLA-B*57 alleles. PLoS One. 2008, 3: e3636. 10.1371/journal.pone.0003636.

  31.

    Dalmasso C, Carpentier W, Meyer L, Rouzioux C, Goujard C, Chaix ML, Lambotte O, Avettand-Fenoel V, Le Clerc S, de Senneville LD, et al: Distinct genetic loci control plasma HIV-RNA and cellular HIV-DNA levels in HIV-1 infection: the ANRS Genome Wide Association 01 study. PLoS One. 2008, 3: e3907. 10.1371/journal.pone.0003907.

  32.

    Limou S, Le Clerc S, Coulonges C, Carpentier W, Dina C, Delaneau O, Labib T, Taing L, Sladek R, Deveau C, et al: Genomewide association study of an AIDS-nonprogression cohort emphasizes the role played by HLA genes (ANRS Genomewide Association Study 02). J Infect Dis. 2009, 199: 419-426. 10.1086/596067.

  33.

    Fellay J, Ge D, Shianna KV, Colombo S, Ledergerber B, Cirulli ET, Urban TJ, Zhang K, Gumbs CE, Smith JP, et al: Common genetic variation and the control of HIV-1 in humans. PLoS Genet. 2009, 5: e1000791. 10.1371/journal.pgen.1000791.

  34.

    Le Clerc S, Limou S, Coulonges C, Carpentier W, Dina C, Taing L, Delaneau O, Labib T, Sladek R, Deveau C, et al: Genomewide association study of a rapid progression cohort identifies new susceptibility alleles for AIDS (ANRS Genomewide Association Study 03). J Infect Dis. 2009, 200: 1194-1201. 10.1086/605892.

  35.

    Pereyra F, Jia X, McLaren PJ, Telenti A, de Bakker PI, Walker BD, Ripke S, Brumme CJ, Pulit SL, Carrington M, et al: The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science. 2010, 330: 1551-1557.

  36.

    Joubert BR, Lange EM, Franceschini N, Mwapasa V, North KE, Meshnick SR: A whole genome association study of mother-to-child transmission of HIV in Malawi. Genome Med. 2010, 2: 17. 10.1186/gm138.

  37.

    Herbeck JT, Gottlieb GS, Winkler CA, Nelson GW, An P, Maust BS, Wong KG, Troyer JL, Goedert JJ, Kessing BD, et al: Multistage genomewide association study identifies a locus at 1q41 associated with rate of HIV-1 disease progression to clinical AIDS. J Infect Dis. 2010, 201: 618-626. 10.1086/649842.

  38.

    Pelak K, Goldstein DB, Walley NM, Fellay J, Ge D, Shianna KV, Gumbs C, Gao X, Maia JM, Cronin KD, et al: Host Determinants of HIV-1 Control in African Americans. J Infect Dis. 2010, 201: 1141-1149. 10.1086/651382.

  39.

    Limou S, Coulonges C, Herbeck JT, van Manen D, An P, Le Clerc S, Delaneau O, Diop G, Taing L, Montes M, et al: Multiple-cohort genetic association study reveals CXCR6 as a new chemokine receptor involved in long-term nonprogression to AIDS. J Infect Dis. 2010, 202: 908-915. 10.1086/655782.

  40.

    Petrovski S, Fellay J, Shianna KV, Carpenetti N, Kumwenda J, Kamanga G, Kamwendo DD, Letvin NL, McMichael AJ, Haynes BF, et al: Common human genetic variants and HIV-1 susceptibility: a genome-wide survey in a homogeneous African population. AIDS. 2011, 25: 513-518. 10.1097/QAD.0b013e328343817b.

  41.

    Troyer JL, Nelson GW, Lautenberger JA, Chinn L, McIntosh C, Johnson RC, Sezgin E, Kessing B, Malasky M, Hendrickson SL, et al: Genome-wide association study implicates PARD3B-based AIDS restriction. J Infect Dis. 2011, 203: 1491-1502. 10.1093/infdis/jir046.

  42.

    Bol SM, Moerland PD, Limou S, van Remmerden Y, Coulonges C, van Manen D, Herbeck JT, Fellay J, Sieberer M, Sietzema JG, et al: Genome-wide association study identifies single nucleotide polymorphism in DYRK1A associated with replication of HIV-1 in monocyte-derived macrophages. PLoS One. 2011, 6: e17190. 10.1371/journal.pone.0017190.

  43.

    van Manen D, Delaneau O, Kootstra NA, Boeser-Nunnink BD, Limou S, Bol SM, Burger JA, Zwinderman AH, Moerland PD, van 't Slot R, et al: Genome-wide association scan in HIV-1-infected individuals identifying variants influencing disease course. PLoS One. 2011, 6: e22208. 10.1371/journal.pone.0022208.

  44.

    Chantarangsu S, Mushiroda T, Mahasirimongkol S, Kiertiburanakul S, Sungkanuparph S, Manosuthi W, Tantisiriwat W, Charoenyingwattana A, Sura T, Takahashi A, et al: Genome-wide association study identifies variations in 6p21.3 associated with nevirapine-induced rash. Clin Infect Dis. 2011, 53: 341-348. 10.1093/cid/cir403.

  45.

    Lingappa JR, Petrovski S, Kahle E, Fellay J, Shianna K, McElrath MJ, Thomas KK, Baeten JM, Celum C, Wald A, et al: Genomewide Association Study for Determinants of HIV-1 Acquisition and Viral Set Point in HIV-1 Serodiscordant Couples with Quantified Virus Exposure. PLoS One. 2011, 6: e28632. 10.1371/journal.pone.0028632.

  46.

    Wang L, Zhu J, Shan S, Qin Y, Kong Y, Liu J, Wang Y, Xie Y: Repression of interferon-gamma expression in T cells by Prospero-related homeobox protein. Cell Res. 2008, 18: 911-920. 10.1038/cr.2008.275.

  47.

    Abraham S, Sawaya BE, Safak M, Batuman O, Khalili K, Amini S: Regulation of MCP-1 gene transcription by Smads and HIV-1 Tat in human glial cells. Virology. 2003, 309: 196-202. 10.1016/S0042-6822(03)00112-0.

  48.

    van Manen D, Kootstra NA, Boeser-Nunnink B, Handulle MA, Van 't Wout AB, Schuitemaker H: Association of HLA-C and HCP5 gene regions with the clinical course of HIV-1 infection. AIDS. 2009, 23: 19-28. 10.1097/QAD.0b013e32831db247.

  49.

    Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA: Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009, 106: 9362-9367. 10.1073/pnas.0903103106.

  50.

    Campbell MC, Tishkoff SA: African genetic diversity: implications for human demographic history, modern human origins, and complex disease mapping. Annu Rev Genomics Hum Genet. 2008, 9: 403-433. 10.1146/annurev.genom.9.081307.164258.

  51.

    Shrestha S, Aissani B, Song W, Wilson CM, Kaslow RA, Tang J: Host genetics and HIV-1 viral load set-point in African-Americans. AIDS. 2009, 23: 673-677. 10.1097/QAD.0b013e328325d414.

  52.

    Martin AM, Nolan D, Gaudieri S, Phillips E, Mallal S: Pharmacogenetics of antiretroviral therapy: genetic variation of response and toxicity. Pharmacogenomics. 2004, 5: 643-655. 10.1517/14622416.5.6.643.

  53.

    Berns JS, Kasbekar N: Highly active antiretroviral therapy and the kidney: an update on antiretroviral medications for nephrologists. Clin J Am Soc Nephrol. 2006, 1: 117-129.

  54.

    Chantarangsu S, Mushiroda T, Mahasirimongkol S, Kiertiburanakul S, Sungkanuparph S, Manosuthi W, Tantisiriwat W, Charoenyingwattana A, Sura T, Chantratita W, et al: HLA-B*3505 allele is a strong predictor for nevirapine-induced skin adverse drug reactions in HIV-infected Thai patients. Pharmacogenet Genomics. 2009, 19: 139-146. 10.1097/FPC.0b013e32831d0faf.

  55.

    McLaren P, Zagury JF, Fellay J, The Intl HIV Acquisition Consortium: Poster 295: Joining forces in HIV host genomics: The International HIV Acquisition Consortium. CROI. 2012.

  56.

    Fellay J, Shianna KV, Telenti A, Goldstein DB: Host genetics and HIV-1: the final phase? PLoS Pathog. 2010, 6: e1001033. 10.1371/journal.ppat.1001033.

  57.

    Manolio TA: Genomewide association studies and assessment of the risk of disease. N Engl J Med. 2010, 363: 166-176. 10.1056/NEJMra0905980.



We acknowledge funding from the Netherlands Organization for Scientific Research (TOP, registration number 9120.6046) and the Landsteiner Foundation Blood Research (registration number 0526).

Author information



Corresponding author

Correspondence to Hanneke Schuitemaker.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

DM, AW and HS wrote the manuscript. All authors read and approved the final manuscript.

About this article

Cite this article

van Manen, D., van ‘t Wout, A.B. & Schuitemaker, H. Genome-wide association studies on HIV susceptibility, pathogenesis and pharmacogenomics. Retrovirology 9, 70 (2012).

  • Genome-wide association studies
  • Single-nucleotide polymorphisms
  • Host genetics
  • HIV susceptibility
  • HIV pathogenesis