Skip to main content


We’d like to understand how you use our websites in order to improve them. Register your interest.

Tracing the HIV-1 subtype B mobility in Europe: a phylogeographic approach



The prevalence and the origin of HIV-1 subtype B, the most prevalent circulating clade among the long-term residents in Europe, have been studied extensively. However the spatial diffusion of the epidemic from the perspective of the virus has not previously been traced.


In the current study we inferred the migration history of HIV-1 subtype B by way of a phylogeography of viral sequences sampled from 16 European countries and Israel. Migration events were inferred from viral phylogenies by character reconstruction using parsimony. With regard to the spatial dispersal of the HIV subtype B sequences across viral phylogenies, in most of the countries in Europe the epidemic was introduced by multiple sources and subsequently spread within local networks. Poland provides an exception where most of the infections were the result of a single point introduction. According to the significant migratory pathways, we show that there are considerable differences across Europe. Specifically, Greece, Portugal, Serbia and Spain, provide sources shedding HIV-1; Austria, Belgium and Luxembourg, on the other hand, are migratory targets, while for Denmark, Germany, Italy, Israel, Norway, the Netherlands, Sweden, Switzerland and the UK we inferred significant bidirectional migration. For Poland no significant migratory pathways were inferred.


Subtype B phylogeographies provide a new insight about the geographical distribution of viral lineages, as well as the significant pathways of virus dispersal across Europe, suggesting that intervention strategies should also address tourists, travellers and migrants.


Pandemic HIV-1 group M infection originated in Africa from the simian immunodeficiency virus (SIVcpz) infecting chimpanzees [16]. The subtype B epidemic in the United States and elsewhere, was the result of a single point introduction -migration – of the virus from Haiti around the late sixties [7, 8]. The introduction of HIV-1 into Europe occurred mainly through homosexual contacts or needle sharing in or from the USA [913], or through heterosexual contacts with individuals from Central Africa [14, 15]. At the beginning of the HIV-1 epidemic (the early 1980's) the prevalence of HIV-1 infection was higher among men having sex with other men (MSM) than among heterosexuals. For this reason and also because subtype B was identified at a high prevalence among MSM in the USA, it was the predominant clade in Europe. The prevalence of non-B subtypes in Europe has been increasing over the last years [1631]. However, the AIDS epidemic among the long-term residents is still dominated by viruses assigned to subtype B [32, 33].

RNA viruses, such as the HIV-1, provide measurably evolving populations characterized by very high nucleotide substitution rate [34, 35]. Phylogenies can be used for molecular epidemiology studies and notably they contain information about temporal and spatial dynamics of the virus [36]. The latter is the geographic pattern of viral lineages sampled from different localities, also termed as phylogeography, tracking the migration of the virus. For several viral infections, the dispersal of the parasite and its host cannot be easily tracked, therefore suggesting that phylogenies may be a better way to monitor migratory pathways of the virus [37, 38]. This methodology has been recently applied to phylogeographic studies of influenza A (H5N1) [37] and HCV [39] epidemics showing the pathways of viral dispersal.

Thus, phylogenies are the 'state of the art' in characterizing viral genealogy and evolution and also serve as tools to track migration for organisms for which there is no other way to monitor their dispersal [38]. Although several phylogenetic studies have analyzed HIV-1 clades by geographic region in Europe, none has inferred the history of virus's migration through its phylogeny. In the present study, we inferred the migration history of HIV-1 virus among 17 countries in Europe, by way of a phylogeography of subtype B sequences.


Migration events were inferred through virus phylogenies by using the Slatkin and Maddison's method [40] (illustrated in Figure 1). Trees were built by maximum likelihood (ML) methodology and countries from which sequences were sampled were assigned to the tips of the 103 ML bootstrap trees. Inclusion of a large number of phylogenies takes into account phylogenetic uncertainty, because migration events are estimated over a set of trees rather than a single one.

Figure 1

This tree contains 8 sequences sampled from 2 countries (A and B). Tips (HIV-1 sequences) were labelled according to its sampling country. A. If there are no epidemiological links between the two populations A and B, viral sequences will consist of two monophyletic groups, therefore representing distinct epidemics. B. In case that an individual sampled within population B acquired the infection in geographic area A, one branch sampled from population B would cluster within the monophyletic clade of the population A. The migration pattern for each country was estimated by counting "state" (county label) changes at each internal node of the tree by the criterion of parsimony. For each country we counted "exporting" (From) and "importing" (To) migration events. Specifically, as shown in Fig. 1b, a state change (A-B) is counted as an exporting migration event for country A and as importing for B. In our study migration events correspond to mobility of HIV-1 strains or infections and, therefore, inferred exporting or importing migration events are proportional to country-wise mobility of HIV-1 subtype B strains.

Phylogenetic analyses

Phylogenies of subtype B sequences from 16 countries in Europe and Israel (Table 1) showed no considerable grouping of sequences by country, however in the case of Poland most of the sequences (65, 72%) formed a single monophyletic clade (Figure 2). Similarly a fraction of sequences from Austria (16, 18%), Luxembourg (13, 14%) and Portugal (20, 22%) fell within single clusters, however the number of viral lineages spreading within local transmission networks was much lower in these areas than in Poland. Notably, in Poland individuals infected locally were mainly IDUs (39/65, 60%). Bayesian phylogenetic methods were used to further confirm the monophyletic nature of the B sequences from Poland, Austria, Luxembourg and Portugal. The final analysis was performed including a few sequences of the different monophyletic clusters identified in the ML trees and 1–2 from the other countries as references. Sequences again appeared as monophyletic in this analysis, with high posterior probability support (>0.8; data not shown), further supporting our previous results.

Table 1 Proportion of transmission risk groups among the study population.
Figure 2

Parts of the phylogenetic tree inferred for subtype B sequences sampled across Europe. Monophyletic groups of sequences sampled from A. Austria (purple), B. Portugal (cyan), C. Luxembourg (orange) and D. Poland (green). E. Part of the tree showing the geographical dispersal of HIV-1 subtype B sequences. Branches are shown in different colours by country of origin as described in the legend. Branches are not drawn to scale.

ML phylogenies suggest that sequences from the rest of Europe show distinct grouping patterns. Specifically a number of sequences for each locality cluster within short monophyletic clades (approximately consisting of 2–6 sequences), or others show no grouping according to their geographic origin (Figure 2E). These findings suggest that except in the case of Poland and also to a lesser extend for Austria, Portugal, Luxembourg, where a considerable percentage of infections were the result of single migration and subsequent spread among the local population, for the rest of countries there is a high level of mixing across Europe.

For patients recruited in the prospective study, information on the most likely origin of the HIV infection was collected through a questionnaire. Among them, 572 sequences were used in the current analysis. Interestingly, among those for whom this information was available (456 patients), 90.4% claimed that they acquired the subtype B.

Statistical Phylogeography

To test the significance of specific pathways of location changes (migration events) between countries, we estimated the expected number of changes, under the null hypothesis of complete geographic mixing, for each pair of countries (Tables S1 and S2 in Additional file 1), as described previously [37, 39]. The total number of location changes between countries (migration events) for all trees was significantly lower than expected by chance under the null hypothesis of panmixis confirming that, although there is a high level of HIV dispersal between countries, there is still geographic subdivision among the subtype B lineages analyzed. Moreover, the results of this test showed major differences across Europe (Additional files 2 and 3). In particular, for Austria, Luxembourg and Poland no significant exporting migration was observed, while for the latter importing migration was also not significant; therefore classifying Poland as the country with the lowest HIV migration – or, in other words, with the most isolated HIV epidemic among the countries analysed (Figure 3). For Austria, and Luxembourg, on the other hand, there was evidence that some of the subtype B infections were the result of migration from Italy and Portugal, Switzerland, respectively; while similarly to Poland no significant outgoing migration was observed. According to the ML trees, only a few sequences from Israel and Greece fell within the Polish monophyletic cluster, suggesting limited migration to the latter countries (Figure 2D).

Figure 3

Significant HIV migratory pathways across Europe. Arrowheads indicate the targets of migration shown in different colours and styles by country of origin.

Germany, Greece, Italy, Norway, the Netherlands, Portugal, Spain, Serbia, Switzerland, and the UK appeared as source of subtype B mobility (high levels of exporting migration; "From") to other countries (Additional files 2 and 3). In case that significant migration was detected from a country to more than 2 others, the former was designated as "exporter". Notably, Greece's migratory targets were dispersed to 7 countries, while for both Spain and the Netherlands; they were to 5 and 6 countries, respectively (Figure 3). High levels of HIV migration – with regard to the highest difference between the observed and the expected migration events under panmixis – were detected from Italy to Austria and Switzerland, from Portugal to Luxembourg and also from the Netherlands to Germany (Table S2 in Additional file 1). On the other hand, Belgium, Denmark, Sweden and Israel showed only limited export of HIV-1 subtype B (Additional files 2 and 3).

Major migratory targets of HIV-1 subtype B (importing migration; "To") were Austria, Belgium, Germany, Italy, Luxembourg, Norway, the Netherlands, Sweden, Spain, Switzerland, and the UK (a similar criterion as for the "From" migration was used to assign countries) (Additional files 4 and 5), while limited migration was observed into Serbia and Israel (Supplementary information Figure 1c, d in Additional files 4 and 5) (in case that significant migration was detected from a country to more than 2 others, the former was designated as "exporter"). Notably, except from Poland, significant importing migration was detected for all countries across Europe (Figure 3).

Based on these findings, evidence for directional HIV dispersion was detected where Spain, Greece, Portugal and Serbia acted as sources of migration events ("exporters") (Figure 3); Austria, Belgium, and Luxembourg (Luxembourg and Austria were classified within the "importers" due to the high migration (>7) inferred from Portugal towards Luxembourg), provided migratory targets ("importers") (Figure 3), while significant bidirectional HIV migration was found for Denmark, Germany, Italy, Israel, Norway, the Netherlands, Sweden, Switzerland and the UK (Figure 3). Israel and Sweden were classified among localities with bidirectional migration because in both countries significant bidirectional mobility was detected. In contrast, for Poland, no significantly importing or exporting migration was found that is in accordance with the high percentage of sequences grouping according to the sampling location.

To further confirm our findings all steps of the analyses (phylogenetic analysis with ML bootstrapping, inference of migration events and statistical phylogeography) were repeated in a 2nd run. Notably, migration events inferred on 103 newly inferred ML bootstrap trees were almost identical to the previous (R2 = 0.98, p < 0.001; data not shown). Moreover, statistical phylogeography revealed that out of 46 and 50 significantly high migration events inferred in the two rounds of analyses, 43 were identical, thus suggesting that the major migratory pathways were reproducible.


Our results based on a phylogeographic study of a large number of sequences sampled from 16 countries in Europe and Israel provided important clues about HIV-1 subtype B spatial diffusion across Europe. Notably according to the findings of phylogenetic analyses, viral lineages sampled from all countries except Poland, Austria, Luxembourg and Portugal, showed low levels of grouping according to the geographic origin. For most countries, we identified small networks of local transmission, but to a different extent in each country, along with sequences showing no particular geographic clustering. Such a pattern suggests that the subtype B epidemic in most countries was introduced by several founders, some of them causing subsequent local dispersal, while others lead to dead end infections. We should note that under the conditions of our study, we cannot estimate the percentage of infections occurring within local transmission networks, since we don't have sufficient covering per country.

Poland's epidemic dispersal is quite different. Based on the high number of viral lineages coalescing to a common origin within the country, we suggest that the epidemic is the result of a few migrations of the virus successfully spreading within the local population. This pattern is consistent with a main viral dispersal through IDU networks associated with extensive local epidemics. Monophyletic HIV epidemics have been described among IDUs for other European countries, as well, including also non-B subtypes strains [12, 13, 26, 27, 4245].

For Austria, Poland and Luxembourg we identified more extensive local transmission networks than for the other European countries. Similarly HIV local networks have been described for Canada, Greece and the UK [4650].

According to the epidemiological data, most of the subtype B infections newly diagnosed during 2002–2004, occurred locally. The geographic distribution by means of the viral evolutionary history, the phylogeography, on the other hand, revealed high levels of viral dispersal. Both observations are not necessarily in contradiction. Rather, they suggest that most of the migration identified through phylogeography may date from earlier in the transmission chain, and that the pre-existing complexity of the epidemic (multiple sources of introduction from diverse localities) is the main reason for the continuous extensive geographical dispersal across the viral phylogeny. Particularly if there are multiple founders, subsequent infections will be dispersed, across the viral phylogeny, according to the geographic origin of the founders' source. This is in accordance with previous findings about multiple introductions of the subtype B infection through sexual intercourses or IDU across Europe [13, 47, 49, 51, 52].

In addition to epidemic dispersal patterns, our study provided important findings about HIV-1 subtype B major sources and targets for migratory events, as well as localities with bidirectional viral dispersion.

In particular, Greece, Portugal, and Spain attract many travellers and tourists, especially from Central Europe, thus suggesting that HIV dispersal from Southern to Central Europe may, at least in part, occur by travellers infected during their stay in Southern Europe

For countries classified among the HIV migratory targets (Austria, Belgium and Luxembourg) the epidemic was mainly imported due to the high HIV mobility to these countries. According to the epidemiological information, the highest rate of imported infections from other European countries occurs in Luxembourg. Moreover, the fact that 13% of the population of Luxembourg is of Portuguese origin provides a plausible explanation for the migratory pathway from Portugal Another significant pathway was tracked from Italy to Austria, in accordance with the high influx from Italy during recent years Denmark provided migratory target from another Scandinavian country (Sweden) but also from Spain. This is in accordance with epidemiological findings that a percentage of HIV subtype B infections in Denmark originated from Sweden and Spain.

Additionally we identified several countries showing bidirectional migration. Notably, for the Netherlands 6 significant pathways were detected from and to the same localities. The Netherlands is among the countries in Europe with the most diverse geographical origin among newly diagnosed patients, confirmed by the high percentage of non-Dutch individuals among the newly HIV-infected patients during 2003–2004 [32, 53]. Moreover, because of its policies, the Netherlands attracts foreign drug users and male homosexuals, two populations known to be at higher risk for HIV infection [51].

Migratory pathways inferred through viral phylogenies cannot be directly validated by other sources of information (epidemiological figures, mobility and immigration information, tourism, etc), because these data are not stratified by subtype. Moreover, due to the high mobility of population within Europe and the complexity of the epidemic spread, information about the locus of infection for an individual doesn't necessarily match with the geographic origin of the source. On the other hand, phylogenetic analysis of viral sequences provides a realistic approach for the reconstruction of HIV transmission chains or networks [36, 46, 47, 49, 5456], therefore suggesting that statistical phylogeography is appropriate for inferring the spatial dispersal of a viral epidemic.

Given the high complexity of the epidemic, dense sampling is needed in order to accurately reconstruct the spatial characteristics of the subtype B infections in Europe. This provides one of the limitations of this study; on the other hand however the analysis of our dataset, which is the largest available at the time of analysis, provides for a first time a description of the geographic distribution of viral lineages as well as the significant migrations of HIV subtype B across Europe, by means of viral phylogenies. Dense sampling for each locality would be ideal for such purposes; however limited availability of sequences for several countries, as well as computation time provide as the major limitations for such a study.

We paid special attention to representativeness of our data. The prospective SPREAD collection strategy (data from 2002–2004) was specifically designed to avoid such a bias [53], while the retrospectively collected CATCH data (1996–2002) were sampled as part of national surveillance studies designed to investigate the transmission of drug resistance or as part of the standard clinical practice of baseline sequencing for all newly diagnosed cases in each participating center [57]. For most countries where national data were available, the data were a rather good representation of the national epidemic.

In conclusion, HIV-1 subtype B phylogeographies provide a new insight for the first time into the pathways of spatial diffusion and virus migration across Europe. HIV-1 subtype B was each time introduced from multiple sources and subsequently spread locally, but the pattern is not uniform across Europe. The countries grouped into sources (Greece, Portugal, Serbia and Spain) and sinks (Austria, Belgium and Luxembourg) of virus migration, as well as countries with significant bidirectional migration (Denmark, Germany, Italy, Israel, Norway, the Netherlands, Sweden, Switzerland and the UK). The only exception was Poland where a significant number of sequences fell within a monophyletic cluster. These results suggest that mobility of the virus matches mobility of the host, such that in order to reduce further spread of the epidemic, prevention measures should not only be directed towards national populations, but also towards migrants, travellers and tourists who are the major sources and targets of HIV dispersal.


HIV-1 sequences

Protease (PR) and partial reverse transcriptase (RT) sequences were sampled from HIV-1 seropositive individuals who had never received antiretroviral drugs (ARV) as described previously [53, 57]. Specifically, partial PR/RT sequences were sampled from 17 countries in Europe including Israel. Sequences were collected from two studies, the Combined Analysis of Resistance Transmission over Time of Chronically and Acute Infected HIV Patients; (CATCH), in a retrospective setting [57] and a prospective study named after Strategy to Control SPREAD of HIV Drug Resistance (SPREAD) [53]. In the CATCH analysis all sequences were collected during 1996–2002 from geographically distinct centres across the participating countries, except for Belgium and the Netherlands, where HIV-1 sequences were sampled from a single geographic area. In the prospective setting (SPREAD), samples were collected during 2002–2004 according to two different approaches in order to ensure representative sampling [53]. Notably although data from the period 1996–2002 were retrospectively analyzed, they were collected as part of national surveillance studies designed to investigate the transmission of drug resistance or of the standard clinical practice of baseline sequencing for all newly diagnosed cases in each participating center [57]. In the prospective setting a standardized sampling strategy was designed in order to ensure representative sampling in all countries [53]. For the purpose of this study we included only those classified as subtype B. All individuals were sampled at a single time point. The subtyping process was performed by phylogenetic analysis [53, 57]. The prevalence of the transmission risk groups among the study population is shown in Table 1.

Phylogenetic analyses

Sampling strategy

For the estimation of country-wise clustering (migration), first we need to infer the phylogenies of the sequences under study. One of the issues to be addressed was how many sequences needed to be included for each country. The dataset size needs to be large enough as: 1) to include most of the available information from each country and 2) to estimate rare migration events. On the other hand, we had to restrict the number of sequences to keep the computation time needed for phylogenetic inference reasonable, while maintaining an informative number of sequences required for the calculation of migration events. For this reason, we performed a preliminary analysis of migration for 4 countries including 10, 20 25 or 90 sequences per country. For each dataset, we tested whether the distribution of the total number of migration events across the set of all credible trees differed significantly from a distribution of randomly generated trees (phylogenetic inference was performed by ML method). The results of this preliminary analysis showed that with 25 sequences per country, the largest number of countries reached significantly different migration levels than compared to the distribution for a random set of trees (P < 0.01). However the larger the number of sequences included per country the higher the signal for clustering with regard the total number of changes across inferred versus random set of trees.

Consequently, we included in the analyses the largest number of sequences (90) available per country, expect from Belgium, Greece, the Netherlands, Israel, Norway and Serbia for which a smaller number of sequences was available, however only for the last three countries the number of sequences included was << 90. As a result of choosing approximately equal number of strains per country, irrespective of the prevalence or the total number of infected individuals across Europe, we calculated the relative mobility per infected individual. Therefore, the numbers in the migration matrices are directly comparable reflecting actual differences in mobility between countries. For example, we estimated higher migration from the UK to Spain (5.34), than from Germany to Italy (3.23) (Table S2 in Additional file 1).

Phylogenetic analyses for the estimation of the migration process were performed in a single dataset consisting of 1337 sequences analyzed in two independent runs (Table 1).

Alignment and phylogenetic tree reconstruction

The alignment of the subtype B partial RT sequences sampled from 1337 individuals was performed using CLUSTAL W version 1.74 [58] and manually edited according to the encoded reading frame. In order to avoid any bias due to convergent evolution at antiretroviral drug resistance mutations on the phylogenetic analysis, we excluded all sites associated with major resistance in PR (30, 32, 33, 46, 47, 48, 50, 54, 76, 82, 84, 88, and 90) and RT (41, 62, 65, 67, 69, 70, 74, 75, 77, 100, 103, 106,108, 115, 116, 151, 181, 184, 188, 190, 210, 215, 219, 225, and 236) leaving 687 nt.

Phylogenetic trees were inferred by maximum likelihood method under the general time-reversible GTR model of nucleotide substitution including a Γ distributed rates heterogeneity among sites as implemented in RAxML [59]. Bootstrapping was performed on the maximum likelihood trees (1000 replicates) to assess the reliability of the obtained topologies.

Inference of migration events

All bootstrap generated trees (103) were used for the estimation of the HIV-1 migration events by using the cladistic approach first described by Slatkin and Maddison [40], as implemented in MacClade [60]. Specifically, all the nodes of the inferred trees were assigned with a character according to the geographic origin (e.g. 0, 1, 2, 3 for Austria, Belgium, Denmark, France, etc). The algorithm reconstructs "ancestral" states that in our case correspond to countries, at each internal node by the criterion of parsimony [40]. Parsimony selects the reconstruction that minimizes the total number of steps on the tree [41].

When two branches from 2 different locations (e.g. 0 and 1) join with each other, and thus more than one character can be reconstructed at the node, then the ancestor state at the internal node is assigned to be the union of the two characters [0, 1] that is assigned a migration event. If this number between two groups of sequences remains low, the possibility for migration events between these particular groups also remains low.

Specifically, the migration events between HIV-1 sequences sampled in different locations were estimated for each dataset according to the following method: 1) for nodes with more than one equally parsimonious reconstructions (e.g. 0, 1 or 0), implicit examination of all most parsimonious reconstructions (MPRs) was used in case of a big number of MPRs [61, 62], while explicit examination was used in case of a small number of MPR, as implemented in MacClade. As a result, for a particular type of character change, e.g. [0,1] MacClade reports a minimum, a maximum and a average number of [0,1] changes estimated over all possible MPRs. We estimated the average number of migration events for each tree used in the analyses. 2) Polytomies that correspond to nodes with more than two descendant nodes were interpreted as regions of uncertain evolution (soft polytomies) as implemented in MacClade.

Inference of migration matrices

For each dataset a 17 × 17 migration matrix was estimated between HIV-1 sequences sampled in different European countries. Each migration event was calculated as the median of the distribution estimated from all trees (103) used in the analysis. In the matrix, all 'from' events and 'to' events are pooled per country.

Statistical phylogeography

To further estimate which migration events were significantly different from the expected number of changes under the null hypothesis of full geographic mixing of HIV-1 sequences, we estimated if the distribution for each of the migration events estimated over 103 bootstrap trees was statistically different from the distribution estimated from the same set of trees (103) after reshuffling taxa at the tips. This analysis was performed using Mesquite [63]. Equality of medians between observed and expected migration events was assessed by means of the Kruskal-Wallis one-way analysis of variance and the level of significance was adjusted according to Bonferroni correction for multiple comparisons.

The differences between the observed and the expected values indicate the levels of HIV-1 country-dependent structure in the dataset, and thus also of the relative mobility of the virus between countries. This strategy allowed estimating significant differences also when an unequal number of strains were included per country.

Notably in order to assess the validity of our results, the whole process of phylogenetic analysis, inference of migration events and statistical phylogeography was repeated twice.


  1. 1.

    Peeters M, Honore C, Huet T, Bedjabaga L, Ossari S, Bussi P, Cooper RW, Delaporte E: Isolation and partial characterization of an HIV-related virus occurring naturally in chimpanzees in Gabon. Aids. 1989, 3: 625-630. 10.1097/00002030-198910000-00001.

  2. 2.

    Peeters M, Fransen K, Delaporte E, Van den Haesevelde M, Gershy-Damet GM, Kestens L, van der Groen G, Piot P: Isolation and characterization of a new chimpanzee lentivirus (simian immunodeficiency virus isolate cpz-ant) from a wild-captured chimpanzee. Aids. 1992, 6: 447-451.

  3. 3.

    Gao F, Bailes E, Robertson DL, Chen Y, Rodenburg CM, Michael SF, Cummins LB, Arthur LO, Peeters M, Shaw GM, et al: Origin of HIV-1 in the chimpanzee Pan troglodytes troglodytes. Nature. 1999, 397: 436-441. 10.1038/17130.

  4. 4.

    Hahn BH, Shaw GM, De Cock KM, Sharp PM: AIDS as a zoonosis: scientific and public health implications. Science. 2000, 287: 607-614. 10.1126/science.287.5453.607.

  5. 5.

    Sharp PM, Bailes E, Gao F, Beer BE, Hirsch VM, Hahn BH: Origins and evolution of AIDS viruses: estimating the time-scale. Biochem Soc Trans. 2000, 28: 275-282.

  6. 6.

    Heeney JL, Rutjens E, Verschoor EJ, Niphuis H, ten Haaft P, Rouse S, McClure H, Balla-Jhagjhoorsingh S, Bogers W, Salas M, et al: Transmission of simian immunodeficiency virus SIVcpz and the evolution of infection in the presence and absence of concurrent human immunodeficiency virus type 1 infection in chimpanzees. J Virol. 2006, 80: 7208-7218. 10.1128/JVI.00382-06.

  7. 7.

    Gilbert MT, Rambaut A, Wlasiuk G, Spira TJ, Pitchenik AE, Worobey M: The emergence of HIV/AIDS in the Americas and beyond. Proc Natl Acad Sci USA. 2007, 104: 18566-18570. 10.1073/pnas.0705329104.

  8. 8.

    Robbins KE, Lemey P, Pybus OG, Jaffe HW, Youngpairoj AS, Brown TM, Salemi M, Vandamme AM, Kalish ML: U.S. Human immunodeficiency virus type 1 epidemic: date of origin, population history, and characterization of early strains. J Virol. 2003, 77: 6359-6366. 10.1128/JVI.77.11.6359-6366.2003.

  9. 9.

    Brunet JB, Bouvet E, Massari V: Epidemiological aspects of acquired immune deficiency syndrome in France. Ann N Y Acad Sci. 1984, 437: 334-339. 10.1111/j.1749-6632.1984.tb37153.x.

  10. 10.

    Glauser MP, Francioli P: Clinical and epidemiological survey of acquired immune deficiency syndrome in Europe. Eur J Clin Microbiol. 1984, 3: 55-58. 10.1007/BF02032823.

  11. 11.

    Melbye M, Biggar RJ, Ebbesen P, Sarngadharan MG, Weiss SH, Gallo RC, Blattner WA: Seroepidemiology of HTLV-III antibody in Danish homosexual men: prevalence, transmission, and disease outcome. Br Med J (Clin Res Ed). 1984, 289: 573-575. 10.1136/bmj.289.6445.573.

  12. 12.

    Casado C, Urtasun I, Saragosti S, Chaix ML, de Rossi A, Cattelan AM, Dietrich U, Lopez-Galindez C: Different distribution of HIV type 1 genetic variants in European patients with distinct risk practices. AIDS Res Hum Retroviruses. 2000, 16: 299-304. 10.1089/088922200309403.

  13. 13.

    Lukashov VV, Kuiken CL, Vlahov D, Coutinho RA, Goudsmit J: Evidence for HIV type 1 strains of U.S. intravenous drug users as founders of AIDS epidemic among intravenous drug users in northern Europe. AIDS Res Hum Retroviruses. 1996, 12: 1179-1183. 10.1089/aid.1996.12.1179.

  14. 14.

    Clumeck N, Sonnet J, Taelman H, Mascart-Lemone F, De Bruyere M, Vandeperre P, Dasnoy J, Marcelis L, Lamy M, Jonas C, et al: Acquired immunodeficiency syndrome in African patients. N Engl J Med. 1984, 310: 492-497.

  15. 15.

    Vittecoq D, May T, Roue RT, Stern M, Mayaud C, Chavanet P, Borsa F, Jeantils P, Armengaud M, Modai J, et al: Acquired immunodeficiency syndrome after travelling in Africa: an epidemiological study in seventeen Caucasian patients. Lancet. 1987, 1: 612-615. 10.1016/S0140-6736(87)90245-5.

  16. 16.

    Chaix ML, Descamps D, Harzic M, Schneider V, Deveau C, Tamalet C, Pellegrin I, Izopet J, Ruffault A, Masquelier B, et al: Stable prevalence of genotypic drug resistance mutations but increase in non-B virus among patients with primary HIV-1 infection in France. Aids. 2003, 17: 2635-2643. 10.1097/00002030-200312050-00011.

  17. 17.

    Descamps D, Chaix ML, Andre P, Brodard V, Cottalorda J, Deveau C, Harzic M, Ingrand D, Izopet J, Kohli E, et al: French national sentinel survey of antiretroviral drug resistance in patients with HIV-1 primary infection and in antiretroviral-naive chronically infected patients in 2001–2002. J Acquir Immune Defic Syndr. 2005, 38: 545-552. 10.1097/01.qai.0000155201.51232.2e.

  18. 18.

    Deroo S, Robert I, Fontaine E, Lambert C, Plesseria JM, Arendt V, Staub T, Hemmer R, Schneider F, Schmit JC: HIV-1 subtypes in Luxembourg, 1983–2000. Aids. 2002, 16: 2461-2467. 10.1097/00002030-200212060-00012.

  19. 19.

    Machuca R, Bogh M, Salminen M, Gerstoft J, Kvinesdal B, Pedersen C, Obel N, Nielsen H, Nielsen C: HIV-1 subtypes in Denmark. Scand J Infect Dis. 2001, 33: 697-701. 10.1080/00365540110026863.

  20. 20.

    Sonnerborg A, Durdevic S, Giesecke J, Sallberg M: Dynamics of the HIV-1 subtype distribution in the Swedish HIV-1 epidemic during the period 1980 to 1993. AIDS Res Hum Retroviruses. 1997, 13: 343-345. 10.1089/aid.1997.13.343.

  21. 21.

    Snoeck J, Van Laethem K, Hermans P, Van Wijngaerden E, Derdelinckx I, Schrooten Y, Vijver van de DA, De Wit S, Clumeck N, Vandamme AM: Rising prevalence of HIV-1 non-B subtypes in Belgium: 1983–2001. J Acquir Immune Defic Syndr. 2004, 35: 279-285. 10.1097/00126334-200403010-00009.

  22. 22.

    Lospitao E, Alvarez A, Soriano V, Holguin A: HIV-1 subtypes in Spain: a retrospective analysis from 1995 to 2003. HIV Med. 2005, 6: 313-320. 10.1111/j.1468-1293.2005.00313.x.

  23. 23.

    Boni J, Pyra H, Gebhardt M, Perrin L, Burgisser P, Matter L, Fierz W, Erb P, Piffaretti JC, Minder E, et al: High frequency of non-B subtypes in newly diagnosed HIV-1 infections in Switzerland. J Acquir Immune Defic Syndr. 1999, 22: 174-179.

  24. 24.

    Op de Coul EL, Coutinho RA, van der Schoot A, van Doornum GJ, Lukashov VV, Goudsmit J, Cornelissen M: The impact of immigration on env HIV-1 subtype distribution among heterosexuals in the Netherlands: influx of subtype B and non-B strains. Aids. 2001, 15: 2277-2286. 10.1097/00002030-200111230-00009.

  25. 25.

    Paraskevis D, Magiorkinis E, Magiorkinis G, Sypsa V, Paparizos V, Lazanas M, Gargalianos P, Antoniadou A, Panos G, Chrysos G, et al: Increasing prevalence of HIV-1 subtype A in Greece: estimating epidemic history and origin. J Infect Dis. 2007, 196: 1167-1176. 10.1086/521677.

  26. 26.

    Bobkov A, Kazennova E, Selimova L, Bobkova M, Khanina T, Ladnaya N, Kravchenko A, Pokrovsky V, Cheingsong-Popov R, Weber J: A sudden epidemic of HIV type 1 among injecting drug users in the former Soviet Union: identification of subtype A, subtype B, and novel gagA/envB recombinants. AIDS Res Hum Retroviruses. 1998, 14: 669-676. 10.1089/aid.1998.14.669.

  27. 27.

    Holmes EC, Zhang LQ, Robertson P, Cleland A, Harvey E, Simmonds P, Leigh Brown AJ: The molecular epidemiology of human immunodeficiency virus type 1 in Edinburgh. J Infect Dis. 1995, 171: 45-53.

  28. 28.

    Brown AJ, Lobidel D, Wade CM, Rebus S, Phillips AN, Brettle RP, France AJ, Leen CS, McMenamin J, McMillan A, et al: The molecular epidemiology of human immunodeficiency virus type 1 in six cities in Britain and Ireland. Virology. 1997, 235: 166-177. 10.1006/viro.1997.8656.

  29. 29.

    Thomson MM, Najera R: Travel and the introduction of human immunodeficiency virus type 1 non-B subtype genetic forms into Western countries. Clin Infect Dis. 2001, 32: 1732-1737. 10.1086/320764.

  30. 30.

    Liitsola K, Holmstrom P, Laukkanen T, Brummer-Korvenkontio H, Leinikki P, Salminen MO: Analysis of HIV-1 genetic subtypes in Finland reveals good correlation between molecular and epidemiological data. Scand J Infect Dis. 2000, 32: 475-480. 10.1080/003655400458721.

  31. 31.

    Palma AC, Araujo F, Duque V, Borges F, Paixao MT, Camacho R: Molecular epidemiology and prevalence of drug resistance-associated mutations in newly diagnosed HIV-1 patients in Portugal. Infect Genet Evol. 2007, 7: 391-398. 10.1016/j.meegid.2007.01.009.

  32. 32.

    Paraskevis D, Wensing AMJ, Vercauteren J, Vijver van de DA, Albert J, Asjo B, on behalf of the SPREAD-programme, et al: Prevalence of HIV-1 subtypes among newly HIV-1 diagnosed individuals during 2002–2003 in Europe: Evidence for a continuous introduction of non-B subtypes. 1st International Workshop on HIV Transmission; Toronto, Canada. 2006, 31-Abstract No 34

  33. 33.

    Abecasis A, Wensing A, Paraskevis D, Vercauteuren D, Vijver Van de D, Albert J, et al: obotS-p Demographic determinants of HIV-1 subtype in Europe. Sixth European HIV Drug Resistance Workshop; Budapest, Hungary. 2008

  34. 34.

    Drummond AJ, Pybus OG, Rambaut A, Forsberg R, Rodrigo A: Measurably evolving populations. TRENDS in Ecology and Evolution. 2003, 18: 481-488. 10.1016/S0169-5347(03)00216-7.

  35. 35.

    Coffin JM: HIV population dynamics in vivo: implications for genetic variation, pathogenesis, and therapy. Science. 1995, 267: 483-489. 10.1126/science.7824947.

  36. 36.

    Leitner T, Escanilla D, Franzen C, Uhlen M, Albert J: Accurate reconstruction of a known HIV-1 transmission history by phylogenetic tree analysis. Proc Natl Acad Sci USA. 1996, 93: 10864-10869. 10.1073/pnas.93.20.10864.

  37. 37.

    Wallace RG, Hodac H, Lathrop RH, Fitch WM: A statistical phylogeography of influenza A H5N1. Proc Natl Acad Sci USA. 2007, 104: 4473-4478. 10.1073/pnas.0700435104.

  38. 38.

    Holmes EC: Evolutionary history and phylogeography of human viruses. Annu Rev Microbiol. 2008, 62: 307-328. 10.1146/annurev.micro.62.081307.162912.

  39. 39.

    Nakano T, Lu L, Liu P, Pybus OG: Viral gene sequences reveal the variable history of hepatitis C virus infection among countries. J Infect Dis. 2004, 190: 1098-1108. 10.1086/422606.

  40. 40.

    Slatkin M, Maddison WP: A cladistic measure of gene flow inferred from the phylogenies of alleles. Genetics. 1989, 123: 603-613.

  41. 41.

    Fitch WM: Toward defining the course of evolution: Minimal change for a specific tree topology. Syst Zool. 1971, 20: 406-416. 10.2307/2412116.

  42. 42.

    Lukashov VV, Karamov EV, Eremin VF, Titov LP, Goudsmit J: Extreme founder effect in an HIV type 1 subtype A epidemic among drug users in Svetlogorsk, Belarus. AIDS Res Hum Retroviruses. 1998, 14: 1299-1303. 10.1089/aid.1998.14.1299.

  43. 43.

    Liitsola K, Tashkinova I, Laukkanen T, Korovina G, Smolskaja T, Momot O, Mashkilleyson N, Chaplinskas S, Brummer-Korvenkontio H, Vanhatalo J, et al: HIV-1 genetic subtype A/B recombinant strain causing an explosive epidemic in injecting drug users in Kaliningrad. AIDS. 1998, 12: 1907-1919. 10.1097/00002030-199814000-00023.

  44. 44.

    Nabatov AA, Kravchenko ON, Lyulchuk MG, Shcherbinskaya AM, Lukashov VV: Simultaneous introduction of HIV type 1 subtype A and B viruses into injecting drug users in southern Ukraine at the beginning of the epidemic in the former Soviet Union. AIDS Res Hum Retroviruses. 2002, 18: 891-895. 10.1089/08892220260190380.

  45. 45.

    Bobkov A, Cheingsong-Popov R, Selimova L, Ladnaya N, Kazennova E, Kravchenko A, Fedotov E, Saukhat S, Zverev S, Pokrovsky V, Weber J: An HIV type 1 epidemic among injecting drug users in the former Soviet Union caused by a homogeneous subtype A strain. AIDS Res Hum Retroviruses. 1997, 13: 1195-1201. 10.1089/aid.1997.13.1195.

  46. 46.

    Brenner BG, Roger M, Routy JP, Moisi D, Ntemgwa M, Matte C, Baril JG, Thomas R, Rouleau D, Bruneau J, et al: High rates of forward transmission events after acute/early HIV-1 infection. J Infect Dis. 2007, 195: 951-959. 10.1086/512088.

  47. 47.

    Lewis F, Hughes GJ, Rambaut A, Pozniak A, Leigh Brown AJ: Episodic sexual transmission of HIV revealed by molecular phylodynamics. PLoS Med. 2008, 5: e50-10.1371/journal.pmed.0050050.

  48. 48.

    Gifford RJ, de Oliveira T, Rambaut A, Pybus OG, Dunn D, Vandamme AM, Kellam P, Pillay D: Phylogenetic surveillance of viral genetic diversity and the evolving molecular epidemiology of human immunodeficiency virus type 1. J Virol. 2007, 81: 13050-13056. 10.1128/JVI.00889-07.

  49. 49.

    Hue S, Pillay D, Clewley JP, Pybus OG: Genetic analysis reveals the complex structure of HIV-1 transmission within defined risk groups. Proc Natl Acad Sci USA. 2005, 102: 4425-4429. 10.1073/pnas.0407534102.

  50. 50.

    Paraskevis D, Zavitsanou M, Magiorkinis E, Papa A, Magiorkinis G, Beloukas A, Sypsa V, Nikolopoulos G, Malisiovas N, Hatzakis A: Molecular typing of the subtype B transmission networks: a nationwide study to estimate the contact patterns of the HIV epidemic in Greece. 16th Conference on Retroviruses and Opportunistic Infections. Montreal, Canada. 2009

  51. 51.

    Op de Coul EL, Prins M, Cornelissen M, Schoot van der A, Boufassa F, Brettle RP, Hernandez-Aguado L, Schiffer V, McMenamin J, Rezza G, et al: Using phylogenetic analysis to trace HIV-1 migration among western European injecting drug users seroconverting from 1984 to 1997. AIDS. 2001, 15: 257-266. 10.1097/00002030-200101260-00017.

  52. 52.

    Skar H, Sylvan S, Hansson HB, Gustavsson O, Boman H, Albert J, Leitner T: Multiple HIV-1 introductions into the Swedish intravenous drug user population. Infect Genet Evol. 2008, 8: 545-552. 10.1016/j.meegid.2008.03.004.

  53. 53.

    Wensing AMJ, Vercauteren J, Vijver van de DA, Albert J, Asjo B, Balotta C, Camacho R, Coughlan S, Grossman Z, Horban A, et al: Transmission of drug-resistant HIV-1 in Europe remains limited to single classes. Aids. 2008, 22: 625-635. 10.1097/QAD.0b013e3282f5e062.

  54. 54.

    Hue S, Clewley JP, Cane PA, Pillay D: Investigation of HIV-1 transmission events by phylogenetic methods: requirement for scientific rigour. Aids. 2005, 19: 449-450.

  55. 55.

    Lemey P, Derdelinckx I, Rambaut A, Van Laethem K, Dumont S, Vermeulen S, Van Wijngaerden E, Vandamme AM: Molecular footprint of drug-selective pressure in a human immunodeficiency virus transmission chain. J Virol. 2005, 79: 11981-11989. 10.1128/JVI.79.18.11981-11989.2005.

  56. 56.

    Paraskevis D, Magiorkinis E, Magiorkinis G, Kiosses VG, Lemey P, Vandamme AM, Rambaut A, Hatzakis A: Phylogenetic reconstruction of a known HIV-1 CRF04_cpx transmission network using maximum likelihood and Bayesian methods. J Mol Evol. 2004, 59: 709-717. 10.1007/s00239-004-2651-6.

  57. 57.

    Wensing AM, Vijver DA, Angarano G, Asjo B, Balotta C, Boeri E, Camacho R, Chaix ML, Costagliola D, De Luca A, et al: Prevalence of drug-resistant HIV-1 variants in untreated individuals in Europe: implications for clinical management. J Infect Dis. 2005, 192: 958-966. 10.1086/432916.

  58. 58.

    Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.

  59. 59.

    Stamatakis A, Hoover P, Rougemont J: A rapid bootstrap algorithm for the RAxML Web servers. Syst Biol. 2008, 57: 758-771. 10.1080/10635150802429642.

  60. 60.

    Maddison WP, Maddison DR: MacClade: analysis of phylogeny and character evolution. 2003, Sunderland, Mass: Sinaeur Associates, 4

  61. 61.

    Sankoff D, Cedergren RJ: Simultaneous comparison of three or more sequences related by a tree. Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. Edited by: Sankoff D, Kruskal JB. 1983, Reading, Mass: Addison-Wesley, 253-263.

  62. 62.

    Rinsma I, Hendy M, Penny D: Minimally colored trees. Math Biosci. 1990, 98: 201-210. 10.1016/0025-5564(90)90124-H.

  63. 63.

    Maddison WP, Maddison DR: Mesquite: a modular system for evolutionary analysis. Version 2.6. 2009

Download references


We thank the patients and doctors throughout Europe, for their consent and support for the study. The study was supported in part the European Commission (QLK2-CT-2001-01344) by the Hellenic Scientific Society for the Study of AIDS and STDs, by the Belgian AIDS Reference Laboratory fund and the Belgian Fonds voor Wetenschappelijk Onderzoek (F.W.O. nr G.0611.09). We wish to acknowledge Maria Detsika for editing the text.

Author information




Corresponding author

Correspondence to Dimitrios Paraskevis.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

DP designed the study performed the analysis and prepared the manuscript, OP, GM and AH designed part of the analysis, AMJW and DAV collected the data and coordinated CATCH and SPREAD studies, JA, GA, BÅ, CB, EB, RC, MLC, SC, DC, ADL, CDM, ID, ZG, OH, IMH, AH, KK, CK, TL, CL, EMR, IM, LM, CN, ELMO, VO, VO, LP, EPS, LR, MS, JCS, RS, VS, JS, MS, DS, KVL, MV, SY, and MZ provided their data (protease and partial reverse transcriptase HIV-1 sequences together with epidemiological data). CAB coordinated CATCH and SPREAD-studies and AMV designed the study. All authors contributed to writing the paper

Electronic supplementary material

Additional file 1: Tables S1 and S2. Table S1 – Number of calculated migration events (medians) between countries. Table S2 – Differences of the medians between observed and the expected migration events. Cells in bold and underlined bold denote significantly higher and lower migration numbers, respectively. (DOC 164 KB)

Additional file 2: Figure S1 (part A). Significant HIV exporting (A and B) and importing (C and D) migration events between different countries as estimated by statistical phylogeography study For all countries, 90 sequences were included per analysis, except for Belgium (BEL), Greece (GRC) and the Netherlands (NLD) for which 86, 73 and 84 sequences were included. For Israel (ISR), Norway (NOR) and Serbia (YUG) <<< 90 sequences were available, respectively. This lower number of sequences explains why the significantly high migration count for these countries is lower than for the other countries. Country code as in table 1. (GIF 49 KB)

Additional file 3: Figure S1 (part B). Significant HIV exporting (A and B) and importing (C and D) migration events between different countries as estimated by statistical phylogeography study For all countries, 90 sequences were included per analysis, except for Belgium (BEL), Greece (GRC) and the Netherlands (NLD) for which 86, 73 and 84 sequences were included. For Israel (ISR), Norway (NOR) and Serbia (YUG) <<< 90 sequences were available, respectively. This lower number of sequences explains why the significantly high migration count for these countries is lower than for the other countries. Country code as in table 1. (GIF 45 KB)

Additional file 4: Figure S1 (part C). Significant HIV exporting (A and B) and importing (C and D) migration events between different countries as estimated by statistical phylogeography study For all countries, 90 sequences were included per analysis, except for Belgium (BEL), Greece (GRC) and the Netherlands (NLD) for which 86, 73 and 84 sequences were included. For Israel (ISR), Norway (NOR) and Serbia (YUG) <<< 90 sequences were available, respectively. This lower number of sequences explains why the significantly high migration count for these countries is lower than for the other countries. Country code as in table 1. (GIF 36 KB)

Additional file 5: Figure S1 (part D). Significant HIV exporting (A and B) and importing (C and D) migration events between different countries as estimated by statistical phylogeography study For all countries, 90 sequences were included per analysis, except for Belgium (BEL), Greece (GRC) and the Netherlands (NLD) for which 86, 73 and 84 sequences were included. For Israel (ISR), Norway (NOR) and Serbia (YUG) <<< 90 sequences were available, respectively. This lower number of sequences explains why the significantly high migration count for these countries is lower than for the other countries. Country code as in table 1. (GIF 33 KB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Paraskevis, D., Pybus, O., Magiorkinis, G. et al. Tracing the HIV-1 subtype B mobility in Europe: a phylogeographic approach. Retrovirology 6, 49 (2009).

Download citation


  • Migration Event
  • Viral Lineage
  • Monophyletic Cluster
  • Bootstrap Tree
  • Migration Matrice