- Open Access
A new genotype of bovine leukemia virus in South America identified by NGS-based whole genome sequencing and molecular evolutionary genetic analysis
Retrovirology volume 13, Article number: 4 (2016)
Bovine leukemia virus (BLV) is a member of retroviridae family, together with human T cell leukemia virus types 1 and 2 (HTLV-1 and -2) belonging to the genes deltaretrovirus, and infects cattle worldwide. Previous studies have classified the env sequences of BLV provirus from different geographic locations into eight genetic groups. To investigate the genetic variability of BLV in South America, we performed phylogenetic analyses of whole genome and partial env gp51 sequences of BLV strains isolated from Peru, Paraguay and Bolivia, for which no the molecular characteristics of BLV have previously been published, and discovered a novel BLV genotype, genotype-9, in Bolivia.
In Peru and Paraguay, 42.3 % (139/328) and over 50 % (76/139) of samples, respectively, were BLV positive. In Bolivia, the BLV infection rate was up to 30 % (156/507) at the individual level. In Argentina, 325/420 samples were BLV positive, with a BLV prevalence of 77.4 % at the individual level and up to 90.9 % at herd level. By contrast, relatively few BLV positive samples were detected in Chile, with a maximum of 29.1 % BLV infection at the individual level. We performed phylogenetic analyses using two different approaches, maximum likelihood (ML) tree and Bayesian inference, using 35 distinct partial env gp51 sequences from BLV strains isolated from Peru, Paraguay, and Bolivia, and 74 known BLV strains, representing eight different BLV genotypes from various geographical locations worldwide. The results indicated that Peruvian and Paraguayan BLV strains were grouped into genotypes-1, -2, and -6, while those from Bolivia were clustered into genotypes-1, -2, and -6, and a new genotype, genotype-9. Interestingly, these results were confirmed using ML phylogenetic analysis of whole genome sequences obtained by next generation sequencing of 25 BLV strains, assigned to four different genotypes (genotypes-1, -2, -6, and -9) from Peru, Paraguay, and Bolivia. Comparative analyses of complete genome sequences clearly showed some specific substitutions, in both structural and non-structural BLV genes, distinguishing the novel genotype-9 from known genotypes.
Our results demonstrate widespread BLV infection in South American cattle and the existence of a new BLV genotype-9 in Bolivia. We conclude that at least seven BLV genotypes (genotypes-1, -2, -4, -5, -6, -7, and -9) are circulating in South America.
Bovine leukemia virus (BLV) is a member of retroviridae family belonging to the genes deltaretrovirus, being considered a model of pathogens for human T-cell leukemia virus types 1 (HTLV-1) , and is the etiological agent of enzootic bovine leukosis (EBL), the most common neoplastic disease of cattle [2, 3]. Some cattle infected with BLV suffer from lymphomas and/or B-lymphocyte proliferation (persistent lymphocytosis), but the majority of BLV-infected cattle are healthy carriers of the virus [2, 3].
The complete genome of BLV consists of 8714 nucleotides, including the structural and enzymatic gag, pro, pol, and env essential genes and two identical long terminal repeats (LTRs). The BLV gag gene is translated as the precursor, Pr70 Gag, and processed into three mature proteins: the matrix protein, p15 (MA), the most abundant capsid protein, p24 (CA), and the nucleocapsid protein, p12 (NC) [4, 5]. The BLV pro and pol genes encode proteases (Pro) p14 and p80, harboring reverse transcriptase (RT) and integrase (IN) activities, respectively [3, 4]. The env gene encodes a mature surface glycoprotein (gp51) and a transmembrane protein (gp30) , and is involved in viral infectivity [6–8]. In addition, the BLV genome contains a pX region, located between the env sequence and the 3′ LTR [2, 3]. At least four proteins, including the regulatory proteins Tax and Rex, and the accessory proteins R3 and G4, are encoded by this genomic region. The Tax protein has been extensively studied and is believed to play a critical role in BLV induced leukemogenesis . Rex is responsible for nuclear export of viral RNA and promotes cytoplasmic accumulation and translation of viral messenger RNA (mRNA) in BLV-infected cells . The R3 and G4 proteins contribute to the maintenance of high viral load [11, 12] and the G4 protein is particularly relevant to leukemogenesis, since it can immortalize primary embryonic fibroblasts . The R3 protein contributes to the maintenance of infectivity , and is located in the nucleus and cellular membranes . In addition to the above, BLV RNA polymerase III (pol III)-encoded viral microRNAs are strongly expressed in preleukemic and malignant cells, in which structural and regulatory gene expression is repressed, suggesting a possible key role in tumor onset and progression [14, 15].
The Env gp51 glycoprotein plays an essential role in the viral life cycle [7, 8]. gp51 is required for cell entry and the target of neutralizing antibodies [8, 16]. The N-terminal half of BLV gp51 contains three conformational epitopes, F, G and H , and plays an important role in viral infectivity and syncytium formation [7, 18, 19], while the C-terminal half of BLV gp51 contains the linear epitopes A, B, D, and E [16, 17]. Therefore, the gp51 region has been widely used for BLV genotyping studies and recent phylogenetic studies of this region from viral strains isolated worldwide demonstrate that BLV can be classified into at least eight genotypes [20–33].
Cattle were introduced to the American continent by the Spanish conquerors from 1493. Within a few years, these cattle had spread all over South America and their population size had increased to several million . More than 300 years, Creole cattle were the only cattle bred in America, and during the end of 19th and the beginning of the 20th centuries, animals from British breeds (Shorthorn, Hereford, Angus, and Holstein) and indicus breed (Nelore, Brahman, and Gir) were introduced in temperate/cold and subtropical/tropical regions, respectively, in order to improve beef and dairy production through crossbreeding with local populations or directly by replacement. BLV infects cattle worldwide, imposing a severe economic impact on the dairy cattle industry, and EBL has been recognized in South America (Brazil) since 1943 . World Organization for Animal Health data (OIE 1999 and 2009) has confirmed the presence of BLV in most parts of South America. In Brazil, infection rates between 17.1 and 60.8 % were detected, with individual prevalence rates varying considerably among states, and recent data indicate a sharp increase in the prevalence of BLV [20, 23, 35–38]. Early phylogenetic analysis of the BLV env gene demonstrated that Brazilian BLV isolates were clustered into genotypes-1, -2, -5, -6, and -7 [20, 23]. In Columbia, BLV prevalence levels up to 83.3 % have been reported, with average prevalence differing significantly between cattle breeds and according to the viral detection method used [39, 40]. Previous studies have indicated the presence of BLV in Argentina [26, 31, 41, 42], with individual and herd prevalence levels of 32.8 and 84 %, respectively . The presence of BLV genotypes-1, -2, -4, and -6 were confirmed in Argentina [26, 31, 32]. BLV infection has also been reported in Peru, Chile, and Uruguay [24, 33, 44]. Restriction fragment length polymorphism (RFLP) analysis of Chilean BLV strains demonstrated the presence of genotypes-1 and -4 ; however, a study of cattle in Uruguay and Chile by Moratorio et al. , aligning partial env gp51 sequences with data available from other South American BLV strains, found that Chilean BLV strains were assigned into genotypes-4 and -7, but not -1, while Uruguayan BLV strains were of genotype-1. Thus, it appears that at least six genotypes of BLV strains are circulating in South America. However, there are currently no studies of the molecular characteristics of BLV in Bolivia, Paraguay, or Peru (OIE 2009).
Here, we investigated the distribution of and molecularly characterized BLV strains in Peru, Paraguay, Bolivia, Argentina, and Chile, together with other South American BLV strains as follows: (1) first, the spread of BLV infection was investigated by amplification of BLV LTRs by nested polymerase chain reaction (PCR) from blood samples obtained from a total of 2204 cattle in Peru, Paraguay, Bolivia, Argentina, and Chile; (2) second, the genetic variability of BLV strains circulating in Peru, Paraguay, and Bolivia was examined by DNA sequencing and 35 distinct strains collected from these countries were classified by genotype using phylogenetic analyses based on partial env (gp51) sequences, comparing these strains and isolates from other geographical locations worldwide; (3) third, whole genome sequences of 25 BLV strains, assigned to four different genotypes, from Peru, Paraguay and Bolivia were obtained by next generation sequencing (NGS) and compared with eight full-length BLV genome sequences, generated by Sanger sequencing, available in the NCBI database.
Collectively, our comprehensive studies provide strong evidence for the existence of a novel BLV genotype, which we designate genotype-9, in Bolivia.
Investigation of the spread of BLV infection in South America
To investigate the spread of BLV infection in South America, a total of 2204 blood samples were collected from cattle in different regions of five South American countries, including Peru, Paraguay, Bolivia, Argentina, and Chile, as shown in Fig. 1. The samples were screened for BLV infection by nested PCR to amplify BLV LTRs (Table 1; Fig. 1). In Peru, samples from 139 cattle out of 328 (42.3 %) were positive for the BLV provirus, with prevalence rates ranging from 0 to 58.6 % at the farm level. In Paraguay, 76 cattle samples out of 139 (54.5 %) were BLV positive and infected cattle breeds were Holstein, Brown Swiss, or Holstein × Brown Swiss. All breeds showed a high prevalence of BLV infection in Paraguay, with a range of 28.6–71.4 % in the Holstein breed and 36.4 % in Brown Swiss. Of 507 samples collected from eight farms in different parts of Bolivia, 156 (30.7 %) were positive for BLV provirus. Infected cattle breeds in Bolivia were Holstein, Gir and Yacumeño, but not Brown Swiss, Brahman and Montana. Interestingly, BLV infection levels differed from farm to farm, and from breed to breed. For example, Holstein cattle showed the highest BLV prevalence, with up to 100 % at the individual level, while Yacumeño showed moderate levels of BLV infection ranging from 4.5 to 24.3 %. Compared to the above-mentioned countries, samples collected from Argentina (n = 420) demonstrated extremely high levels of BLV prevalence (77.4 %), with up to 90.9 % at the herd level. In Chile, 810 samples were collected from multiple cattle breeds at 19 farms over a relatively wide geographical area (Table 1). Of the samples screened, 236 (29.1 % prevalence) were BLV provirus positive. It is difficult to compare BLV prevalence levels between breeds in Chile, as a wide range of breeds were BLV positive; however, remarkable differences were observed between farms, breeds, and locations, indicating variability of BLV infection in this country. Our study result is in agreement with previous study [45, 46] that on average dairy cattle breeds, such as Holstein and Brown Swiss, has higher level of BLV infection than that of beef cattle, namely Yacumeño and Nelore. However, there is no statistically significant correlation of BLV prevalence among Breeds, because individually breeds are different from farm to farm and also from country to county.
Identification of a novel BLV genotype, genotype-9, by phylogenetic analysis of partial env gp51 sequences
Recent phylogenetic studies on the env gene of BLV strains isolated worldwide demonstrate that these viruses can be classified into eight genotypes [20–33]. However, no studies of the molecular characteristics of BLV in Bolivia, Paraguay, and Peru have previously been published. Therefore, to gain insight into the degree of genetic variability of BLV strains in South America, partial sequences of env gp51 from 131 field strains, representing 30 % of BLV positive samples collected from each country, were amplified. After direct sequencing of these amplicons, because of strains from the same farm having 100 % identity to each other, 475 bp nucleotide sequences of 35 distinct strains among 131, corresponding to nucleotide positions 5090–5564 of the full-length BLV genome (BLV cell line FLK-BLV strain pBLV913, accession number EF600696) , were aligned with 74 corresponding sequences from known BLV strains, representing eight different BLV genotypes from additional South American countries (Argentina, Brazil, Chile, and Uruguay) and other parts of the world [27–29, 32]. Phylogenetic trees were then constructed using Maximum likelihood (ML; Fig. 2) and Bayesian Inference (BI; Fig. 3) approaches. The ML and BI trees showed congruent topologies, supported by moderate to high bootstrap values and high posterior probabilities. As shown in Figs. 2 and 3, in contrast to previous studies where only eight BLV strains were detected, both methods of phylogenetic tree construction identified nine sequence clusters, designated genotypes 1-9. All of the BLV strains we collected from Peru were assigned to genotypes-1, -2, and -6. Likewise, the majority of Paraguayan BLV strains clustered into genotypes-1 and -6, with a small number in genotype-2, together with Argentine BLV strains. Interestingly, the BLV strains collected from Bolivia clustered not only into genotypes-1, -2, and -6, together with our Peruvian and Paraguayan strains, but also into a unique clade, which was distinct from the eight previously known BLV genotypes. This result demonstrates that some of our Bolivian BLV strains did not cluster with any others and were highly divergent from the eight known genotypes. This novel genotype displayed a high bootstrap supporting value (92 %) in the ML tree (Fig. 2) and a posterior probability of 100 % using the BI approach (Fig. 3), indicating the presence of new genotype, genotype-9. Furthermore, the genotyping result of reference sequences from Argentine, Brazil, Chile and Uruguay in this study is in agreement with previous studies [20, 23, 24, 26, 31, 32]. Argentine BLV strains were mainly assigned into genotypes-1 and -2, with relatively few samples grouped into genotypes-4 and -6. Reference sequences of Brazilian BLV strains demonstrated a wider variety of BLV genotypes, consisting of genotypes-1, -2, -5, -6, and -7. Known Chilean BLV strains were limited to genotypes-4 and -7. In addition, strains from Uruguay were assigned to genotype-1 only. Thus, our results demonstrate that BLV strains assigned to at least seven genotypes (genotypes-1, -2, -4, -5, -6, and -7, plus the novel genotype-9) are circulating in South America.
Whole genome sequencing and sequence comparison of strains with genotypes-1, -2, and -6, and a novel genotype-9
To compare the divergence of the BLV whole genome, overlapping genomic fragments covering the complete BLV genomic sequence were amplified from strains grouped into the novel genotype-9 and those in other genotypes, namely genotype-1, -2 and -6, as determined by dual phylogenetic analyses of partial env gp51 sequences. Twenty-five samples, including one from Peru, seven from Paraguay, and 17 from Bolivia, were amplified by PCR (Additional file 1: Figure S1) and the amplicons were pooled to construct a DNA library for multiplex sequencing on the MiSeq system (Illumina). Short-read sequences were assembled using three BLV reference sequences (Accession numbers; EF600696, FJ914764, and AF033818) and the average depth of genome coverage was between 143 and 1558 (Additional file 2: Table S1). Moreover, results of analyses of whole genome sequences were 100 % concordant with those of partial env sequences produced by Sanger sequencing. Comparative analyses indicated that whole genome sequences obtained from the 25 samples showed 98.29–100 % homology with each other, while they were 95.63–99.42 % homologous to the FLK-BLV strain pBLV913 sequence.
We next calculated the average substitution rate of paired sequences for each gene between the 25 novel complete BLV provirus genomes and those of eight previously reported BLV whole genomes. The numbers of nucleotide and amino acid substitutions per site were estimated using the Jukes and Cantor, and p-distance models, respectively. Table 2 indicates the degree of variation in each of the structural genes for a total of 33 BLV strains, including gag (p15, p12, and p24), pro, pol, and env (gp30 and gp51), non-structural genes, including the tax, rex, R3, and G4, and LTRs. The average nucleotide substitution rate for the whole BLV genome was 0.023 per site. Comparisons between each gene revealed that G4, rex, and the LTR genes had the lowest average nucleotide substitution rates. By contrast, the genes R3, p24, and gp30 revealed the highest average rate of nucleotide substitution. Comparison of amino acid substitution rates between BLV proteins revealed that tax and R3 were the most polymorphic, whereas p24 was significantly conserved.
Confirmation of the existence of a novel BLV genotype, genotype-9, by phylogenetic analysis of whole genome sequences
Twenty-five novel and eight previously reported BLV whole genome sequences, which were assigned to genotypes-1, -2 and -4, were used to construct a ML phylogenetic tree (Fig. 4). Since the whole genome sequences of genotypes-3, -5, -6 and -8 are not available, we sequenced, for the first time, the whole genome sequence of genotypes-6 and included in phylogenetic analysis. The tree clearly demonstrates stratification of BLV genotypes, including genotypes-1, -2, -4, and -6, and the novel genotype-9, into separate clades (Bootstrap values 100 % for every clade). Thus, this analysis provides evidence for the existence of a novel genotype-9, since the results were identical to those obtained using partial env gp51 sequences. However, in this tree based on BLV whole genome sequences, genotype-9 sequences from Portachuelo and Montero were grouped in separate branches. This result indicates at least two different strains within the novel genotype-9 in Bolivia.
Variations in structural and non-structural genes in 16 distinct novel BLV whole genome sequences from Peru, Paraguay, and Bolivia
To compare the new BLV genotype-9 with previously reported South American genotypes-1, -2, and -6, whole nucleotide or predicted amino acid sequences, including gag (p15, p12, and p24), pro, pol, env (gp30 and gp51), non-structural genes, including tax, rex, R3, G4, LTRs and miRNAs of 16 distinct strains, selected from 25 unique strains were aligned, using the FLK-BLV strain pBLV913 as a reference sequence (Figs. 5, 6, 7, 8, 9, 10).
BLV transcription is initiated at the U3-R junction of the 5′-LTR by the Tax protein. Transactivation requires the presence of three 21 bp enhancer elements (Tax-responsive elements, TxRE) located in the U3 region of the 5′-LTR. Each TxRE contains a cyclic AMP-responsive element (CRE) and an E-box sequence, which overlaps the CRE motif. The U3 region also contains a PU.1/Spi-B binding site and a glucocorticoid responsive element (GRE). In addition, BLV expression is regulated by 5′-LTR sequences downstream of the transcription initiation site (DAS) at the 3′end of the R region and an interferon regulatory factor (IRF) binding site in the U5 region. Comparison of the 16 LTR sequences of four South American genotypes, genotypes-1, -2, -6 and -9, with that of FLK-BLV strain pBLV913 revealed that these entire regions, including the CRE motifs, E Boxes, CAT Boxes, polyadenylation sites (PAS), and IRF binding sites, were well conserved overall (Fig. 5). Interestingly, three nucleotide changes were observed downstream of the DAS, at residues 374, 399, and 400 in sequences of genotype-9 strains and strains of other genotypes. There are also some nucleotide changes outside of the functional motifs (Fig. 5).
Gag is a polyprotein precursor made up of p15 MA, p24 CA, and p12 NC proteins (Fig. 6a). All of these polyprotein regions were highly conserved among the aligned sequences. For the p15 MA protein, sequences of three strains assigned to BLV genotype-1 were identical to that of pBLV913, whereas BLV strains clustered into genotypes-2, -6, and -9 had amino acid substitutions at residues 48, 61, 63, 69, 88 and 90 (Fig. 6a). The p24 CA protein region was also well conserved overall, with substitutions detected at residues 318 (V318 M/I/T), and 323 (V323I) (Fig. 6a). The p12 NC protein is constituted of a region rich in basic amino acid residues and zinc binding domains involved in RNA packaging, both of which are very well conserved. A study by Wang et al. concluded that substitutions affecting either the basic amino acid residues or the zinc finger domains might lead to a reduction in RNA packaging . In our study, all basic residues and Zn finger domains were conserved in the BLV strains analyzed (Fig. 6a), but a substitution from Proline to Serine at amino acid residue 340 (P340S) was detected in the p12 NC protein region of South American genotype-1 strains. This substitution has previously been reported in genotype-1 BLV strains LS1, LS2, and SL3 . In addition, another substitution at residue 365 (A365T) was detected in strains of genotypes-2, -6, and -9. However, the functional consequences of these substitutions are unknown.
The BLV Pro protein is an aspartic protease with a function in gag processing and thus virion maturation . As shown in Fig. 6b, although substitutions at residues 13, 81, 91, 134, 148 and 149 were detected among the Pro protein sequences of South American strains, this region was generally well conserved and no new genotype-specific substitutions were detected.
The deduced BLV RT and IN amino acid sequences encoded by the pol gene of all of 16 South American BLV strains were also highly conserved (Fig. 7). The polymerase (Pol) amino acid sequences of the 16 samples were similar to the reference sequence; however, interesting and important findings included substitutions of glutamic acid to aspartic acid (E166D) and aspartic acid to glycine (D447G) in the RT region, and from histidine to tyrosine (H644Y) and alanine to threonine (A826T) in the IN region, observed only in all sequences of strains assigned to the new BLV genotype-9 (Fig. 7). In addition, an asparagine to serine substitution (A792S) was detected only in the IN protein sequence of all 12 samples collected from Portachuelo, which were grouped into genotype-9, but this substitution was not detected in other genotype-9 strains, indicating that particular amino acid changes could be limited to certain geographic regions (Fig. 7). In addition, nine substitutions at residues 209, 231, 268, 350, 399, 437, 695, 814, and 839 were found in the Pol protein region of three sequences from Bolivian strains clustered into genotype-6.
The sequences encoding the Env leader peptide and gp30 protein were highly conserved among all of our 16 South American BLV strains, as were those of gp51 proteins (Fig. 8). Comparisons of conformational (F, G, and H) and linear (A, B, D, and E) epitopes demonstrated that A, B, E, F, and H were conserved, while the G-epitope at residues 48, 74 and 82, and D-epitope at residues 254 and 267 showed divergence (Fig. 8). The K74R substitution has previously been described in env gp51 partial sequence deposited in the GenBank database . The S82F substitution was present in sequences from strains grouped to genotypes-2, -6, and -9. Interestingly, the S82L substitution was detected only in all 12 sequences of Portachuelo-derived strains grouped in genotype-9. Of the neutralizing domains, the first and third were conserved, while the second demonstrated some genotype-specific substitutions (Fig. 8). Genotype-1 samples collected from Paraguay showed substitutions of aspartic acid to asparagine at residue 134 (D134N) and of phenylalanine to serine at residue 146 (F146S), which have previously been observed in some env gp51 partial sequences isolated from Uruguay and Brazil . In addition, BLV strains assigned into genotypes-2 and -6 also showed specific substitutions, N141D and I144T, respectively, in the second neutralizing domain. Interestingly, one significant substitution was observed at position 133 of the gp51 protein, with alanine (A) substituted for valine (V) in all BLV genotype-9 strains, in agreement with results obtained by direct sequencing of the partial gp51 env region. Furthermore,both gp51 partial sequences and full genome sequences showed the conservation of all of the eight N-linked glycosylation sites. In the gp30 protein, the fusion peptide and GD 21 were highly conserved, while cytoplasmic domain of the transmembrane protein showed divergence with some substitutions, including Q470E, L479F, T480A, H500R, and V504T, commonly detected in South American BLV strains of various genotypes (Fig. 8).
The functional domains of the BLV Tax protein, which regulates BLV expression, include a putative zinc finger motif (residues 30–53) [2, 3], a leucine-rich activation domain (residues 157–197) , two phosphorylation sites (residues 106 and 293) , and a multi functional domain (residues 240–265) [2, 52–57]. These domains were assigned in the four South American genotypes, -1, -2, -6, and -9, and pBLV943, and compared between the 16 distinct strains. In the deduced amino acid alignments of the three functional domains of Tax, substitutions at residue 40 (Q40R) in the zinc finger domain, residue 164 (S164P), residue 169 (L169I) and residue 183 (R183 K) in the leucine-rich activation domain, and residue 257 (C257 N) in the multi functional domain were detected in South American BLV strains (Fig. 9a). By contrast, the two phosphorylation sites were conserved in all South American BLV strains included in this study. The most important findings in the alignment of the Tax protein were substitutions at residue 108 (F108L), detected only in sequences of all of the new South American genotype-9 BLV strains, and a substitution at residue 100 (P100S), observed only in samples collected from Portachuelo, indicating genotype-9 and/or Portachuelo region specific mutations in Tax.
Another BLV regulatory protein, Rex, is associated with nuclear pores and harbors a nuclear localization signal (NLS) and a nuclear export signal (NES) required for RNA-binding and nuclear localization (Fig. 9b). Alignment of deduced amino acid sequences from the 16 South American BLV strains showed that the NLS domain was well conserved, while one substitution (A80S) was detected in the NES of the majority of samples. In addition, other mutations, including M39T, T72P, and L146S, were commonly detected in BLV strains with genotypes-2, -6, and -9, and were also present in sequences deposited in the GenBank database. Five substitutions at residues 12, 40, 53, 58, and 70 were detected only in BLV strains of genotype-6. Of interest, one substitution at residue 113 (A113E) of the Rex protein was observed only in all genotype-9 strain sequences.
The amino acid sequence of the accessory protein, G4, includes an amino terminal stretch of hydrophobic residues (amino acids 1–24) followed by a potential proteolytic cleavage site, a myb-like motif (MYB; amino acids 39–44) and an arginine-rich region (ARR; amino acids 58–72) located in the middle of the protein  (Fig. 9d). The MYB and ARR of G4 were well conserved; however, a substitution from phenylalanine to serine at residue 22 was detected in one of the two cleavage sites in three South American BLV strains of genotypes-2 and -6. The functional effect of this F22S substitution is unknown. Other point mutations at residues 7, 25, 26, 27, 35, 45, 55, and 71 were found in some South American BLV strains.
By contrast, as shown in Fig. 9c, the alignment of the deduced amino acid sequences of another accessory protein, R3, demonstrated that a substitution at residue 26 (N26H) was detected only in the sequences of all genotype-9 BLV strains. In addition, four substitutions, Q12E, K27E, H35Y and L40S, were observed only in three South American BLV strains clustered into genotype-6, indicating genotype-specific mutations in the R3 protein.
BLV-encoded micro RNAs (miRNAs) might possibly play potential roles in the regulation of gene expression and in tumor development [14, 15]. In this study, we also obtained sequences of ten unique BLV miRNAs derived from five predicted miRNA precursor hairpins, BLV-premiR-B1 to -B5 (Fig. 10), corresponding to those included in previous studies [14, 15]. All of the ten miRNA sequences were well conserved, with BLV-miRNA-B2-5p, BLV-miRNA-B3-5p, and BLV-miRNA-B3-3p conserved in all four South American BLV genotypes included in the analysis. In addition, the seed sequences of the predominant arms of seven of the ten miRNAs were identical across all examined BLV strains. However, all genotype-9 BLV strains had an A to G nucleotide change in the seed sequence of BLV-miRNA-B5-3p. In addition, nucleotide substitutions of A to G and C to T in BLV-miRNA-B1-3p, G to A in BLV-miRNA-B2-3p, A to T in BLV-miRNA-B4-3p, and T to C in BLV-miRNA-B5-5p were detected only in BLV genotype-6 strains.
BLV genotype-9 specific mutations
Sixteen distinct novel BLV complete provirus genome sequences from Peru, Paraguay, and Bolivia were aligned with eight previously reported BLV whole genome sequences, including FLK-BLV pBLV913 , strains LS1-LS3 from Uruguay , strains Arg41and Arg38 from Argentina [59, 60], and two strains from Japan  and the USA. As summarized in Fig. 11, ten unique amino acid substitutions were observed in genotype-9 BLV strains as follows: (1) Two substitutions, E166D and D447G, in the Pol (RT) region; two substitutions, H644Y and A826T, in Pol (IN) region; and one substitution at residue 792 of the Pol (IN) region, restricted to only sequences of all 12 samples collected from Portachuelo, but not in other genotype-9 strains. (2) In the Env (gp51) protein, one significant substitution, A133 V, was observed in all BLV genotype-9 strains and not detected in eight known whole BLV genome sequences. Interestingly, another substitution in the Env (gp51) protein, S82L, was detected only in all of the 12 Portachuelo strain sequences of genotype-9, but not in other genotype-9 sequences. (3) In the regulatory proteins Tax and Rex, substitutions at residue 108 (F108L) of Tax and residue 113 (A113E) of Rex were detected only in genotype-9 strains, but not in other strains or the eight known BLV genome sequences. In addition, a substitution at residue 100 (P100S) of the Tax protein was observed only in samples collected from Portachuelo. (4) Likewise, a substitution at residue 26 (N26H) in the R3 accessory protein was detected only in the sequences of all genotype-9 BLV strains, but not in other strains or known BLV genome sequences. These ten substitutions were first detected in the sequences of the new genotype-9 strains identified in this study and the impact of these substitutions on the function of the proteins will require further study. In addition, 15, 7, and 22 unique amino acid substitutions were observed in BLV strains of genotypes-1, -2, and -6, respectively (Fig. 11).
We draw four major conclusions from the results of this study of BLV in South American cattle. First, we identified a novel BLV genotype, genotype-9, in samples from two Bolivian provinces, Montero and Portachuelo, by phylogenetic analyses of partial env gp51 and whole genome sequences. Recent phylogenetic studies using BLV env gene sequences from strains isolated worldwide have classified the virus into eight genotypes [20–33]. Thus, the detection of genotype-9 is a novel finding of this study. Second, ML phylogenetic analysis of BLV whole genome sequences showed that genotype-9 sequences derived from Portachuelo were distinct from those collected in Montero. Importantly, this result was obtained from BLV whole genome sequence data, but was not apparent from analyses of partial env gp51 sequences. Third, current research provides the new information of the full genome sequences of a novel genotype-9 and genotype-6 (partially known env sequences). Fourth, our data provides updated information about the spread of BLV infection in South American countries and demonstrates a high rate of BLV infection in five countries: Peru, Paraguay, Bolivia, Argentina, and Chile. This information will be important for the control of BLV infection in South America, and the data regarding the prevalence of BLV in Bolivia and Paraguay is unique to this study.
The most interesting data in this study are the phylogenetic results of the existence of genotype-9 confirmed by the ML method using 25 novel and eight previously reported BLV whole genome sequences (Fig. 4), which was in agreement with the phylogenetic analyses of 109 partial env gp51 sequences obtained by two independent methods (Figs. 2, 3). Notably, the bootstrap value of the genotype-9 cluster was remarkably increased in the ML phylogenetic tree constructed from complete genome sequences (bootstrap value = 100 %), compared to that constructed using partial env sequences (bootstrap value = 92 %). Therefore, the NGS analysis provided strong confirmatory evidence that genotype-9 is separate from the other eight genotypes.
As shown in Fig. 11, our comparison of BLV amino acid sequences showed that 10 unique amino acid substitutions were observed in the Pol, Env, Tax, Rex, and R3 regions of genotype-9 BLV strains. Importantly, two of these substitutions, including N792L in the Pol region and P100S in Tax, were observed only in samples collected from Portachuelo assigned to genotype-9, indicating the presence of at least two distinct genotype-9 strains in Bolivia. This comparison also revealed that 15, 7, and 22 unique amino acid substitutions are encoded by BLV genotypes-1, -2, and -6, respectively. This result was confirmed by ML phylogenetic analysis with BLV whole genome sequences, which also divided Bolivian genotype-9 sequences into at least two strains.
Phylogenetic analyses of the env gp51 gene also suggested the possibility that genotype-6 may be divided into three sub-genotypes, including clades for (1) the FJ808582/Argentina strain, (2) the DQ059415/Brazil and AY185360/Brazil strains, and (3) our newly-identified sequences, including Paraguay-91. By whole genome analysis, Paraguay-62, which was assigned to clade (2) in the gp51 env analysis, was clearly divided from Paraguay-91, with a bootstrap value of 100 %. Our previous work has also suggested that genotype-6 may be divided into three subgroups: G-6a, G-6b and G-6c . Therefore, these two branches, including Paraguay-62, and Paraguay-89 and -91 may be designated as separate genotypes when additional whole genome sequences of genotype-6 are accumulated and analyzed in future.
This study revealed a widespread distribution and remarkably high levels of BLV in five South American countries, including Peru, Chile, Argentina, Paraguay, and Bolivia. In Peru, 139 of 328 cattle samples were positive for BLV provirus, with a prevalence rate of up to 58.6 %. Of Peruvian samples, 58.6 % of those from Holstein cattle in Lima were BLV positive, whereas 40.0 % of samples from Pucallpa gave positive results. Our results were consistent with those of Ch , who demonstrated that BLV infection is widespread in Peru; however, the percentages of BLV positive samples found in the Peruvian areas of Lima and Pucallpa were higher in our study than those reported by Ch (up to 31.0 %) . In Argentina, we found that 325 of 420 samples were BLV positive, with an overall prevalence of 77.4 % at individual level and up to 90.9 % at the herd level. These results are consistent with previous reports demonstrating that BLV is widespread in Argentina [26, 31, 41, 42, 61]. Argentine samples were collected from farms in Buenos Aires, where a high individual BLV prevalence was reported by Trono et al.  and our results were broadly consistent with those of Trono et al.; however, we detected a higher BLV prevalence at the herd level, since Trono et al. reported infection levels of 84 % . In addition, at the individual level, the percentage (77.4 %) of BLV positive Argentine samples in our study was slightly higher than that reported by Monti et al. (70 %) , indicating a possible increased prevalence of BLV in Argentina. Compared to Peru and Argentine, relatively few BLV positive samples were identified from Chile. However, from the distribution of BLV presented in Table 1, we can see that more breeds and a wider geographical area were involved in BLV infection in Chile, indicating a tendency towards widespread infection, in agreement with a previous study of BLV prevalence in this country . By contrast, over 50 % (76/139) of screened samples from Paraguay were BLV positive and all infected breeds in Paraguay demonstrated a high level of BLV prevalence, ranging from 28.6 to 71.4 % in the Holstein breed, with 36.4 % in Brown Swiss cattle. In Bolivia, 156 of 507 samples were BLV positive and the BLV infection level differed from farm to farm. There are no previous data detailing the prevalence of BLV in Paraguay and Bolivia, and our result is the first report confirming the presence of BLV in these countries.
Based on sequencing results from the BLV provirus, the predicted amino acid sequences of the partial and full gp51 env gene from Peruvian, Paraguayan, and Bolivian BLV strains were highly conserved. Amino acid substitutions in the gp51 protein were mostly located in the second neutralization domain, which is consistent with previous studies [32, 33]. In this study, only two distinct genotype-1 BLV strains from Paraguay had amino acid changes at positions 134 (D134N) and 146 (F146S) of the second neutralizing domain. These two substitutions were also observed in Brazilian and Uruguayan BLV genotype-1 strain isolates , indicating that these amino acid substitutions exist mainly in South American BLV strains, and suggesting a common origin for these virus strains. Interestingly, Moratorio et al.  confirmed that D134N changes the net charge of a loop of the Env protein, indicating a potential impact of this substitution on virus-host interactions. BLV strains grouped into genotype-2 showed a common substitution, N141D. Interestingly, the V191I substitution in the B epitope, observed in only one distinct BLV strain from Peru in this study, has been reported in some Argentine BLV isolates . Most importantly and interestingly, an A133V substitution in the second neutralization domain was observed in only genotype-9 BLV strains. Previous data have demonstrated amino acid changes of alanine to aspartic acid (A to D) or threonine (A to T)  at this residue, but there are no previous reports of alanine to valine substitutions, indicating that A133V is a novel finding of this study.
At present, there are eight full-length BLV genome sequences available in the GenBank database. The major limitations of Sanger sequencing compared to Next Generation methods are increased time and cost, and the limited amount of data generated by sequencing runs. The rapid development of NGS (DNA and RNA-seq) has accelerated sequencing analysis of new genomes and transcriptomes of complex organisms, allowing the identification of new virus species integrated within complete or draft genome sequences. Commonly, virus genomes are very compact and the sequences have multiple functions, such as protein encoding (sometimes for multiple proteins), transcription, regulation of host genes, and virus genome packaging. Therefore, identification of complete viral genome sequences is becoming increasingly important. In this study, we identified both BLV partial env and complete viral genome sequences from the same samples. Our data indicate that the BLV env sequence is a good target for classification of BLV strains, and analysis of env sequences was helpful for selection of informative strains to include in the complete genome sequence analysis.
In this study, we sequenced complete genomes of 25 BLV strains, including 17 novel genotype-9 strains. Our results revealed that BLV genomes contain a number of unique genotype specific substitutions not only in the env region, but also in the LTR, Gag, Pro, Pol, Tax, Rex, R3, G4, and miRNA encoding regions. The Information about substitutions in viral genomes is important for investigating viral spread worldwide. Therefore, full understanding of the biological functions of these genotype specific substitutions is now essential. By contrast, we also found that BLV genome sequences of strains from different geographic origins, especially the important sites on the regulation of viral replication of BLV, are relatively stable and highly conserved as follows. Three copies of CRE sequences were imperfectly conserved in all strains of each genotype. Merezaki et al. (2001) found that introducing a perfect CRE sequences into TxRE increases the BLV LTR promoter activity, suggesting that imperfect conservation of CREs repress viral expression with escape from the host immune response . Furthermore, deep sequencing results of the current study were consistent with previous result  that all of the eight N-linked glycosylation sites are conserved very well. The absence of mutation of N-linked glycosylation sites in naturally infected cattle is explained by the hypothesis as being favorable for the virus to coexist with its host for efficient replication and transmission . Moreover, the two phosphorylation sites on Tax protein, and NES and NLS regions in Rex protein were also conserved well. Thus, the fact that limited sequence variations are compatible with the development of a vaccine [19, 64, 65].
The present study provides the first evidence of the prevalence of BLV infection among cattle in Paraguay, Peru, and Bolivia, and also confirms the widespread distribution of BLV infection in Argentina and Chile. Our BLV genotyping studies demonstrate that Peruvian and Paraguayan BLV strains are of genotypes-1, -2, and -6. Of interest, our findings indicate that Bolivian BLV strains are clustered into genotypes-1, -2, -6, and -9. The existence of the novel genotype-9 was confirmed by comprehensive phylogenetic analyses of both partial gp51 env and full BLV genome sequences.
Experimental samples and extraction of proviral DNA
Blood samples were obtained from a total of 2204 cattle, including 328 from three different farms in Peru, 139 from five farms in Paraguay, 507 from eight farms in Bolivia, 420 from three farms in Argentina, and 810 from 19 farms in Chile (Fig. 1; Table 1). Farms located in main-cattle raising area were chosen for sampling in each country. All animals were handled by veterinarians from RIKEN, Universidad Austral de Chile and LAVET, in strict accordance with good animal practice following the Universidad Austral de Chile Institutional guidelines. This study was approved by the Committee on the Ethics of Animals for Research at the National University of LA PLATA (Certificate date May 26th, 2014) and by the Committee on the Ethics of Animals for Research at Universidad Austral de Chile (Certificate No. 153-2014).
Proviral DNA was extracted from 40 μl of whole blood spotted onto Whatman FTA elute cards (GE Healthcare Japan Corp., Tokyo, Japan), according to the manufacturer’s instructions. The extracted DNA was stored at −20 °C until required for PCR. Cattle were classified as BLV-infected where genomic integration of the BLV provirus was detected.
Detection of BLV provirus by nested PCR
Each sample was screened for the presence of BLV provirus by the detection of BLV LTR regions using nested PCR, as described previously [30, 66]. Briefly, the first PCR amplification was performed with primers BLTR256F (5′-GAGCTCTCTTGCTCCCGAGAC-3′) and BLTR453R (5′-GAAACAAACGCGGGTGCAAGCCAG-3′). As an internal control, the BoLA-DRA gene was amplified using primers BDRA488F (5′-ACAACACCCCAAACACCAAT-3′) and BDRA1145R (5′-AGGAAGGGGAGGTAGTGGAA-3′). Five picomoles of each primer, 2 μl of 10× rTaq PCR Buffer (Toyobo, Osaka, Japan), 2 μl of 25 mM MgCl2, 2 μl of 2 mM dNTP mix, 0.1 μl of 5U/μl rTaq, and 10.4 μl of nuclease-free water were added to each sample, which was amplified in a final volume of 20 μl for 45 cycles of 94 °C for 30 s, 58 °C for 30 s, and 72 °C for 30 s, and then a final extension step of 4 min. Two μl of Exo-SAP IT (USB Corp., Cleveland, OH, USA) was applied to the initial PCR products and incubated at 37 °C for 15 min, then at 80 °C for 15 min. The initial PCR amplicons were subsequently applied to the second PCR with primers BLTR306F (5′-GTAAGGCAAACCACGGTTT-3′) and BLTR408R (5′-AGGAGGCAAAGGAGAGAGT-3′). Five picomoles of each primer, 2 μl of 10× rTaq PCR Buffer, 2 μl of 25 mM MgCl2, 2 μl of 2 mM dNTP mix, 0.1 μl of 5U/μl rTaq, and 11.9 μl of nuclease-free water were added to 1 μl of each purified PCR product. Sterilized water was used as a PCR negative control.
PCR amplification and sequencing of BLV env gene fragments
One hundred and thirty-one BLV field strain samples, consisting of 41, 34, and 56 from Peru, Paraguay, and Bolivia, respectively, representing 30 % of BLV positive samples collected from each country, were randomly chosen for amplification of the BLV env gene. The partial BLV env gene was amplified by nested PCR. PCR amplification was performed using PrimeSTAR GXL DNA Polymerase (Takara Bio Inc., Otsu, Japan) and the following primers: external forward (5′-ATGCCYAAAGAACGACGG-3′) and external reverse (5′-CGACGGGACTAGGTCTGACCC-3′) described previously , and Env5032 (5′-TCTGTGCCAAGTCTCCCAGATA-3′) and Env5608r (5′-AACAACAACCTCTGGGAAGGGT-3′) for second-round PCR . The reaction mixture contained 13.5 μl (initial PCR) and 14.5 μl (second PCR) of distilled water, 5 μl of 5× PS GXL Buffer, 2 μl of 2.5 mM dNTP mix, 0.5 μl of PrimeSTAR GXL, and 1 μl of each primer (10 μM). Conditions for PCR amplification were as follows: 98 °C for 2 min, followed by 30 cycles of denaturation at 98 °C for 15 s, annealing at 60 °C for 20 s, and extension at 68 °C for 60 s. The external primers resulted in amplification of a 913 bp DNA fragment, and internal primers amplified a 597 bp fragment of the gp51 region of the env gene.
Positive second-round PCR products were purified using Exo-SAP IT (USB Corp., Cleveland, OH, USA) and sequenced on an ABI3730xl DNA Analyzer using an ABI PRISM Big Dye Terminator v 3.1 Ready Reaction Cycle Sequencing Kit (Applied Biosystems, Foster City, CA, USA). Sequences included a 475 bp sequence of the env gene, corresponding to nucleotide positions 5090–5564 of the BLV cell line FLK-BLV subclone pBLV913 complete genome (GenBank accession number EF600696) . Editing, alignment, and identification of nucleotide sequences were performed using MEGA 5.1 software .
Phylogenetic analysis of env nucleotide sequences
The 131 BLV env partial sequences from Peru, Paraguay, and Bolivia were successfully amplified. Because of strains showing homology to each other, 35 distinctive sequences among the 131 were aligned with 74 BLV env sequences from GenBank, including sequences from other South American countries and also of BLV isolates from other parts of the world, representative of the eight known BLV genotypes, using MEGA 5.1 software . For robust and accurate phylogenetic analysis of the BLV env gp51 partial sequences, phylogenetic trees were constructed using two different algorithms.
First, ML trees were constructed using MEGA 5.1  and Kimura 2-parameter model plus gamma distribution (K2+G) was chosen as the best model for nucleotide substitution . For the ML analysis, the reliability of the phylogenetic relationships was evaluated using nonparametric bootstrap analysis with 1000 replicates.
Next, to confirm the data obtained by ML analysis, BI was performed using MrBayes v.3.2.5  with the evolutionary model set to lset nst = 6, rates = equal [corresponding to the general-time-reversible model (GTL)]. In BI analysis, two runs with four Markov chains were carried out simultaneously for 100,000,000 generations, and the trees were sampled every 100 generations. The first 25 % of the BI trees were discarded as “burn-in”. A consensus tree was constructed from the output file produced in the BI analysis using FigTree v. 1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/).
PCR amplification of the whole BLV provirus
To gain insight into the genetic differences between the distinct BLV genotypes, PCR amplification of overlapping genomic fragments (Additional file 1: Figure S1) covering the complete BLV genome of 25 South American samples was achieved using the PrimeSTAR GXL DNA Polymerase (Takara Bio Inc., Otsu, Japan) and specific primers designed for this study (Life Technologies Japan Ltd, Tokyo, Japan) (Additional file 1: Figure S1). The 25 μl final reaction mixture contained 5 μl of 5× PrimerSTAR GXL Buffer, 2 μl of 2.5 mM dNTP mix, 1 μl of each primer at a concentration of 10 pmol, 3 μl of template and 0.5 μl of PrimerSTAR GXL DNA Polymerase. The cycles for the PCR amplification were as follows: 98 °C for 2 min, followed by 33 cycles of denaturation at 98 °C for 15 s, annealing at 60 °C for 20–30 s, and extension at 68 °C for 1–6 min (1 min per kilobase), followed by a final extension at 72 °C for 4 min. Each amplicon was quantified using the Qubit dsDNA BR Assay kit (Life Technologies Ltd, Oregon, OR, USA) and four different BLV provirus genome PCR amplicons from each individual were pooled together at an equimolar ratio to a final concentration of 30 ng/μl as the starting material for whole genome sequencing library preparation.
BLV whole genome library preparation
DNA libraries of the above-mentioned pooled samples were prepared by a transposase-mediated library preparation method, using the SureSelect QXT Library prep for Illumina Multiplexed Sequencing Kit (Agilent Technologies, Santa Clara, CA, USA). Briefly, 30 ng of each pooled sample was treated with 1 μl of the SureSelect QXT enzyme mix ILM and 8.5 μl of QXT buffer, and incubated for 10 min at 45 °C, followed by addition of 16 μl of QXT stop solution, enabling enzymatic fragmentation and addition of adaptors to the ends of fragments in a single reaction. The adaptor-tagged library fragments were purified using Agencourt AMPure XP beads (Beckman Coulter, Inc., Brea, CA, USA) with the NGS Magna Stand Ch YS-Model (NIPPON Genetics Co., Ltd., Tokyo, Japan) according to the manual. Next, each purified adaptor-tagged library was PCR amplified with 1 μl of each of the indexing primers, P5 and P7, added to the index in a reaction mixture containing 5 μl of 5× Herculase II reaction buffer, 0.25 μl of 100 mM dNTP mix, 1.25 μl of dimethyl sulfoxide (DMSO), 0.5 μl of Herculase II Fusion DNA Polymerase, and 6 μl of water to a final volume of 25 μl. Each sample was dual indexed. The amplified libraries were purified with AMPure XP beads (Beckman Coulter Inc., Brea, CA, USA). The quality and quantity of amplified libraries were assessed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Waldbronn, Germany) and DNA 1000 Assay. The final dual indexed libraries were pooled into one at equimolar concentrations. The library was subjected to multiplex sequencing on the MiSeq sequencer (Illumina).
NGS sequence data analysis
Sequence data analysis followed a framework described previously . Briefly, Fastq files were generated as row sequences by the Illumina MiSeq, with 600 cycles of paired-end read, and validated by evaluation of the distribution of quality scores. Validated fastq files from each viral genome were aligned with the Burrows-Wheeler Aligner tool (BWA v. 0.7.8-r455)  against a reference sequence and the resulting Alignment Map (SAM) format output was suitable for analyses using SAMTools . Analysis of sequence quality, depth of coverage, short-read alignment, and variant identification were performed using SAMTools . The indexed file was visualized using the Integrative Genomics Viewer (IGV) . The BLV cell line FLK-BLV subclone pBLV913 (GenBank accession number: EF600696) , BLV strain Arg41 (GenBank accession number: FJ914764) , and BLV (GenBank accession number: AF033818) complete genomes were used as reference sequences. Full genome consensus sequences of each individual BLV strain were saved for further analyses, including alignment using MAFFT v7.123b (http://mafft.cbrc.jp/alignment/software/) and deduction of protein sequences by in silico translation of nucleotide to amino acid sequences using MEGA 5.1 .
BLV genome sequences (8728 bp) were aligned using MAFFT software, and the nucleotide and amino acid substitution per site was calculated using the Jukes and Cantor , and p-distance models , respectively. A ML tree for full BLV genome sequences was constructed using MEGA 6.06 software , and 1000 replications were used to calculate bootstrap values.
Rodriguez SM, Florins A, Gillet N, de Brogniez A, Sanchez-Alcaraz MT, Boxus M, Boulanger F, Gutierrez G, Trono K, Alvarez I, et al. Preventive and therapeutic strategies for bovine leukemia virus: lessons for HTLV. Viruses. 2011;3:1210–48.
Aida Y, Murakami H, Takahashi M, Takeshima SN. Mechanisms of pathogenesis induced by bovine leukemia virus as a model for human T-cell leukemia virus. Front Microbiol. 2013;4:328.
Gillet N, Florins A, Boxus M, Burteau C, Nigro A, Vandermeers F, Balon H, Bouzar AB, Defoiche J, Burny A, et al. Mechanisms of leukemogenesis induced by bovine leukemia virus: prospects for novel anti-retroviral therapies in human. Retrovirology. 2007;4:18.
Sagata N, Yasunaga T, Ohishi K, Tsuzuku-Kawamura J, Onuma M, Ikawa Y. Comparison of the entire genomes of bovine leukemia virus and human T-cell leukemia virus and characterization of their unidentified open reading frames. EMBO J. 1984;3:3231–7.
Hamard-Peron E, Muriaux D. Retroviral matrix and lipids, the intimate interaction. Retrovirology. 2011;8:15.
Inabe K, Nishizawa M, Tajima S, Ikuta K, Aida Y. The YXXL sequences of a transmembrane protein of bovine leukemia virus are required for viral entry and incorporation of viral envelope protein into virions. J Virol. 1999;73:1293–301.
Callebaut I, Voneche V, Mager A, Fumiere O, Krchnak V, Merza M, Zavada J, Mammerickx M, Burny A, Portetelle D. Mapping of B-neutralizing and T-helper cell epitopes on the bovine leukemia virus external glycoprotein gp51. J Virol. 1993;67:5321–7.
Johnston ER, Radke K. The SU and TM envelope protein subunits of bovine leukemia virus are linked by disulfide bonds, both in cells and in virions. J Virol. 2000;74:2930–5.
Willems L, Heremans H, Chen G, Portetelle D, Billiau A, Burny A, Kettmann R. Cooperation between bovine leukaemia virus transactivator protein and Ha-ras oncogene product in cellular transformation. EMBO J. 1990;9:1577–81.
Felber BK, Derse D, Athanassopoulos A, Campbell M, Pavlakis GN. Cross-activation of the Rex proteins of HTLV-I and BLV and of the Rev protein of HIV-1 and nonreciprocal interactions with their RNA responsive elements. New Biol. 1989;1:318–28.
Willems L, Kerkhofs P, Dequiedt F, Portetelle D, Mammerickx M, Burny A, Kettmann R. Attenuation of bovine leukemia virus by deletion of R3 and G4 open reading frames. Proc Natl Acad Sci USA. 1994;91:11532–6.
Florins A, Gillet N, Boxus M, Kerkhofs P, Kettmann R, Willems L. Even attenuated bovine leukemia virus proviruses can be pathogenic in sheep. J Virol. 2007;81:10195–200.
Lefebvre L, Vanderplasschen A, Ciminale V, Heremans H, Dangoisse O, Jauniaux JC, Toussaint JF, Zelnik V, Burny A, Kettmann R, Willems L. Oncoviral bovine leukemia virus G4 and human T-cell leukemia virus type 1 p13(II) accessory proteins interact with farnesyl pyrophosphate synthetase. J Virol. 2002;76:1400–14.
Kincaid RP, Burke JM, Sullivan CS. RNA virus microRNA that mimics a B-cell oncomiR. Proc Natl Acad Sci USA. 2012;109:3077–82.
Rosewick N, Momont M, Durkin K, Takeda H, Caiment F, Cleuter Y, Vernin C, Mortreux F, Wattel E, Burny A, et al. Deep sequencing reveals abundant noncanonical retroviral microRNAs in B-cell leukemia/lymphoma. Proc Natl Acad Sci USA. 2013;110:2306–11.
Mamoun RZ, Morisson M, Rebeyrotte N, Busetta B, Couez D, Kettmann R, Hospital M, Guillemain B. Sequence variability of bovine leukemia virus env gene and its relevance to the structure and antigenicity of the glycoproteins. J Virol. 1990;64:4180–8.
Bruck C, Mathot S, Portetelle D, Berte C, Franssen JD, Herion P, Burny A. Monoclonal antibodies define eight independent antigenic regions on the bovine leukemia virus (BLV) envelope glycoprotein gp51. Virology. 1982;122:342–52.
Portetelle D, Couez D, Bruck C, Kettmann R, Mammerickx M, Van der Maaten M, Brasseur R, Burny A. Antigenic variants of bovine leukemia virus (BLV) are defined by amino acid substitutions in the NH2 part of the envelope glycoprotein gp51. Virology. 1989;169:27–33.
LanLan B, Otsuki H, Sato H, Kohara J, Isogai E, Takeshima S-N, Aida Y. Identification and characterization of common B cell epitope in bovine leukemia virus via high-throughput peptide screening system in infected cattle. Retrovirology. 2015;12(1):106.
Camargos MF, Stancek D, Rocha MA, Lessa LM, Reis JK, Leite RC. Partial sequencing of env gene of bovine leukaemia virus from Brazilian samples and phylogenetic analysis. J Vet Med B Infect Dis Vet Public Health. 2002;49:325–31.
Coulston J, Naif H, Brandon R, Kumar S, Khan S, Daniel RC, Lavin MF. Molecular cloning and sequencing of an Australian isolate of proviral bovine leukaemia virus DNA: comparison with other isolates. J Gen Virol. 1990;71(Pt 8):1737–46.
Beier D, Blankenstein P, Marquardt O, Kuzmak J. Identification of different BLV provirus isolates by PCR, RFLPA and DNA sequencing. Berl Munch Tierarztl Wochenschr. 2001;114:252–6.
Camargos MF, Pereda A, Stancek D, Rocha MA, dos Reis JK, Greiser-Wilke I, Leite RC. Molecular characterization of the env gene from Brazilian field isolates of Bovine leukemia virus. Virus Genes. 2007;34:343–50.
Felmer R, Munoz G, Zuniga J, Recabal M. Molecular analysis of a 444 bp fragment of the bovine leukaemia virus gp51 env gene reveals a high frequency of non-silent point mutations and suggests the presence of two subgroups of BLV in Chile. Vet Microbiol. 2005;108:39–47.
Molteni E, Agresti A, Meneveri R, Marozzi A, Malcovati M, Bonizzi L, Poli G, Ginelli E. Molecular characterization of a variant of proviral bovine leukaemia virus (BLV). Zentralbl Veterinarmed B. 1996;43:201–11.
Monti G, Schrijver R, Beier D. Genetic diversity and spread of Bovine leukaemia virus isolates in Argentine dairy cattle. Arch Virol. 2005;150:443–58.
Balic D, Lojkic I, Periskic M, Bedekovic T, Jungic A, Lemo N, Roic B, Cac Z, Barbic L, Madic J. Identification of a new genotype of bovine leukemia virus. Arch Virol. 2012;157:1281–90.
Rola-Luszczak M, Pluta A, Olech M, Donnik I, Petropavlovskiy M, Gerilovych A, Vinogradova I, Choudhury B, Kuzmak J. The molecular characterization of bovine leukaemia virus isolates from Eastern Europe and Siberia and its impact on phylogeny. PLoS One. 2013;8:e58705.
Matsumura K, Inoue E, Osawa Y, Okazaki K. Molecular epidemiology of bovine leukemia virus associated with enzootic bovine leukosis in Japan. Virus Res. 2011;155:343–8.
Polat M, Ohno A, Takeshima SN, Kim J, Kikuya M, Matsumoto Y, Mingala CN, Onuma M, Aida Y. Detection and molecular characterization of bovine leukemia virus in Philippine cattle. Arch Virol. 2015;160:285–96.
Licursi M, Inoshima Y, Wu D, Yokoyama T, Gonzalez ET, Sentsui H. Provirus variants of bovine leukemia virus in naturally infected cattle from Argentina and Japan. Vet Microbiol. 2003;96:17–23.
Rodriguez SM, Golemba MD, Campos RH, Trono K, Jones LR. Bovine leukemia virus can be classified into seven genotypes: evidence for the existence of two novel clades. J Gen Virol. 2009;90:2788–97.
Moratorio G, Obal G, Dubra A, Correa A, Bianchi S, Buschiazzo A, Cristina J, Pritsch O. Phylogenetic analysis of bovine leukemia viruses isolated in South America reveals diversification in seven distinct genotypes. Arch Virol. 2010;155:481–9.
Liron JP, Peral-Garcia P, Giovambattista G. Genetic characterization of Argentine and Bolivian Creole cattle breeds assessed through microsatellites. J Hered. 2006;97:331–9.
D’Angelino JL, Garcia M, Birgel EH. Epidemiological study of enzootic bovine leukosis in Brazil. Trop Anim Health Prod. 1998;30:13–5.
Samara SI, Lima EG, Nascimento AA. Monitoring of enzootic bovine leukosis in dairy cattle from the Pitangueiras region in São Paulo, Brazil. Braz J Vet Res Animal Sci. 1997;34:349–51.
Castro RS, Leite RC, Abreu JJ, Lage AP, Ferraz IB, Lobato ZI, Balsamao SL. Prevalence of antibodies to selected viruses in bovine embryo donors and recipients from Brazil, and its implications in international embryo trade. Trop Anim Health Prod. 1992;24:173–6.
Molnár É, Molnár L, Dias H, Silva A, Vale W. Occurrence of enzootic bovine leukosis in the State of Pará, Brazil. Pesquisa Veterinária Brasileira. 1999;19:7–11.
Hernández-Herrera D, Posso-Terranova A, Benavides J, Muñoz-Flórez J, Giovambattista G, Álvarez-Franco L. Bovine leukosis virus detection in Creole Colombian breeds using nested-PCR. Acta Agronómica. 2011;60:311–7.
Alfonso R, Almansa JE, Barrera JC. Serological prevalence and evaluation of the risk factors of bovine enzootic leukosis in the Bogota savannah and the Ubate and Chiquinquira Valleys, Colombia. Rev Sci Tech. 1998;17:723–32.
Juliarena MA, Lendez PA, Gutierrez SE, Forletti A, Rensetti DE, Ceriani MC. Partial molecular characterization of different proviral strains of bovine leukemia virus. Arch Virol. 2013;158:63–70.
Monti GE, Frankena K, Engel B, Buist W, Tarabla HD, de Jong MC. Evaluation of a new antibody-based enzyme-linked immunosorbent assay for the detection of bovine leukemia virus infection in dairy cattle. J Vet Diagn Invest. 2005;17:451–7.
Trono KG, Perez-Filgueira DM, Duffy S, Borca MV, Carrillo C. Seroprevalence of bovine leukemia virus in dairy cattle in Argentina: comparison of sensitivity and specificity of different detection methods. Vet Microbiol. 2001;83:235–48.
Ch AH. Bovine leukaemia virus infection in Peru. Trop Anim Health Prod. 1983;15:61.
Kobayashi S, Hidano A, Tsutsui T, Yamamoto T, Hayama Y, Nishida T, Muroga N, Konishi M, Kameyama K, Murakami K. Analysis of risk factors associated with bovine leukemia virus seropositivity within dairy and beef breeding farms in Japan: a nationwide survey. Res Vet Sci. 2014;96:47–53.
Ohno A, Takeshima SN, Matsumoto Y, Aida Y. Risk factors associated with increased bovine leukemia virus proviral load in infected cattle in Japan from 2012 to 2014. Virus Res. 2015;210:283–90.
Rovnak J, Boyd AL, Casey JW, Gonda MA, Jensen WA, Cockerell GL. Pathogenicity of molecularly cloned bovine leukemia virus. J Virol. 1993;67:7096–105.
Wang H, Norris KM, Mansky LM. Involvement of the matrix and nucleocapsid domains of the bovine leukemia virus Gag polyprotein precursor in viral RNA packaging. J Virol. 2003;77:9431–8.
Moratorio G, Fischer S, Bianchi S, Tome L, Rama G, Obal G, Carrion F, Pritsch O, Cristina J. A detailed molecular analysis of complete bovine leukemia virus genomes isolated from B-cell lymphosarcomas. Vet Res. 2013;44:19.
Willems L, Kettmann R, Burny A. The amino acid (157–197) peptide segment of bovine leukemia virus p34tax encompass a leucine-rich globally neutral activation domain. Oncogene. 1991;6:159–63.
Willems L, Grimonpont C, Kerkhofs P, Capiau C, Gheysen D, Conrath K, Roussef R, Mamoun R, Portetelle D, Burny A, et al. Phosphorylation of bovine leukemia virus Tax protein is required for in vitro transformation but not for transactivation. Oncogene. 1998;16:2165–76.
Tajima S, Aida Y. The region between amino acids 245 and 265 of the bovine leukemia virus (BLV) tax protein restricts transactivation not only via the BLV enhancer but also via other retrovirus enhancers. J Virol. 2000;74:10939–49.
Tajima S, Aida Y. Mutant tax protein from bovine leukemia virus with enhanced ability to activate the expression of c-fos. J Virol. 2002;76:2557–62.
Tajima S, Takahashi M, Takeshima SN, Konnai S, Yin SA, Watarai S, Tanaka Y, Onuma M, Okada K, Aida Y. A mutant form of the tax protein of bovine leukemia virus (BLV), with enhanced transactivation activity, increases expression and propagation of BLV in vitro but not in vivo. J Virol. 2003;77:1894–903.
Takahashi M, Tajima S, Okada K, Davis WC, Aida Y. Involvement of bovine leukemia virus in induction and inhibition of apoptosis. Microbes Infect. 2005;7:19–28.
Arainga M, Takeda E, Aida Y. Identification of bovine leukemia virus tax function associated with host cell transcription, signaling, stress response and immune response pathway by microarray-based gene expression analysis. BMC Genom. 2012;13:121.
Takahashi M, Tajima S, Takeshima SN, Konnai S, Yin SA, Okada K, Davis WC, Aida Y. Ex vivo survival of peripheral blood mononuclear cells in sheep induced by bovine leukemia virus (BLV) mainly occurs in CD5-B cells that express BLV. Microbes Infect. 2004;6:584–95.
Alexandersen S, Carpenter S, Christensen J, Storgaard T, Viuff B, Wannemuehler Y, Belousov J, Roth JA. Identification of alternatively spliced mRNAs encoding potential new regulatory proteins in cattle infected with bovine leukemia virus. J Virol. 1993;67:39–52.
Dube S, Abbott L, Dube DK, Dolcini G, Gutierrez S, Ceriani C, Juliarena M, Ferrer J, Perzova R, Poiesz BJ. The complete genomic sequence of an in vivo low replicating BLV strain. Virol J. 2009;6:120.
Dube S, Dolcini G, Abbott L, Mehta S, Dube D, Gutierrez S, Ceriani C, Esteban E, Ferrer J, Poiesz B. The complete genomic sequence of a BLV strain from a Holstein cow from Argentina. Virology. 2000;277:379–86.
Gutierrez G, Alvarez I, Politzki R, Lomonaco M, Dus Santos MJ, Rondelli F, Fondevila N, Trono K. Natural progression of Bovine Leukemia virus infection in Argentinean dairy cattle. Vet Microbiol. 2011;151:255–63.
Merezak C, Pierreux C, Adam E, Lemaigre F, Rousseau GG, Calomme C, Van Lint C, Christophe D, Kerkhofs P, Burny A, et al. Suboptimal enhancer sequences are required for efficient bovine leukemia virus propagation in vivo: implications for viral latency. J Virol. 2001;75:6977–88.
de Brogniez A, Bouzar AB, Jacques JR, Cosse JP, Gillet N, Callebaut I, Reichert M, Willems L. Mutation of a single envelope N-linked glycosylation site enhances the pathogenicity of Bovine Leukemia virus. J Virol. 2015;89:8945–56.
Bai L, Takeshima SN, Isogai E, Kohara J, Aida Y. Novel CD8 cytotoxic T cell epitopes in bovine leukemia virus with cattle. Vaccine. 2015;33:7194–202.
Gutierrez G, Rodriguez SM, de Brogniez A, Gillet N, Golime R, Burny A, Jaworski JP, Alvarez I, Vagnoni L, Trono K, Willems L. Vaccination against delta-retroviruses: the bovine leukemia virus paradigm. Viruses. 2014;6:2416–27.
Tajima S, Ikawa Y, Aida Y. Complete bovine leukemia virus (BLV) provirus is conserved in BLV-infected cattle throughout the course of B-cell lymphosarcoma development. J Virol. 1998;72:7569–76.
Asfaw Y, Tsuduku S, Konishi M, Murakami K, Tsuboi T, Wu D, Sentsui H. Distribution and superinfection of bovine leukemia virus genotypes in Japan. Arch Virol. 2005;150:493–505.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.
Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16:111–20.
Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–42.
Rabbani B, Mahdieh N, Hosomichi K, Nakaoka H, Inoue I. Next-generation sequencing: impact of exome sequencing in characterizing Mendelian disorders. J Hum Genet. 2012;57:621–32.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6.
Jukes TH, Cantor CR. Evolution of protein molecules. In: Munro HN, editor. Mammalian protein metabolism. New York: Academic Press; 1969, pp. 21–132.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.
MP and ST participated in all experiments, analyzed data, and drafted the manuscript. ST also participated in sample collection. KH participated in the NGS experiment. JK, TM, KY, MA, TM, and YM performed the nested PCR. VBD, CJP, and ETG performed BLV detection. VBD, MK, MO, and GG coordinated sampling and participated in sample collection. YA conceived the study, participated in experiments and sample collection, participated in experimental design, coordinated experiments, and drafted the manuscript. All authors read and approved the final manuscript.
We thank the veterinarians and our collaborators for kindly assisting with sampling from many farms in Chile, Peru, Paraguay, Bolivia and Argentina. We are grateful to the Support Unit at the Bio-material Analysis, RIKEN BSI Research Resources Center, for help with sequence analysis. This work was supported by Grants-in-Aid for Scientific Research (A and C) from the Japan Society for the Promotion of Science (JSPS) and by a Grant from Integration Research for Agriculture and Interdisciplinary Fields in Japan.
The authors declare that they have no competing interests.
Meripet Polat and Shin-nosuke Takeshima authors contributed equally to this work
About this article
Cite this article
Polat, M., Takeshima, S., Hosomichi, K. et al. A new genotype of bovine leukemia virus in South America identified by NGS-based whole genome sequencing and molecular evolutionary genetic analysis. Retrovirology 13, 4 (2016) doi:10.1186/s12977-016-0239-z
- Bovine leukemia virus (BLV)
- Next generation sequencing
- BLV whole genome analysis
- gp51 env sequencing
- Phylogenetic analysis
- South America