Skip to main content

HTLV-1 contains a high CG dinucleotide content and is susceptible to the host antiviral protein ZAP



Human T cell leukaemia virus type 1 (HTLV-1) is a retrovirus associated with human diseases such as adult T-cell leukaemia/lymphoma and HTLV-1 associated myelopathy/tropical spastic paraparesis. In contrast to another human retrovirus, human immunodeficiency virus type 1 (HIV-1), HTLV-1 persists in the host not via vigorous virus production but mainly via proliferation and/or long-term survival in the form of silent proviruses in infected host cells. As a result, HTLV-1-infected cells rarely produce virus particles in vivo even without anti-retroviral treatment. That should be an advantage for the virus to escape from the host immune surveillance by minimizing the expression of viral antigens in host cells. However, why HIV-1 and HTLV-1 behave so differently during natural infection is not fully understood.


We performed cap analysis of gene expression (CAGE) using total RNAs and nascent, chromatin-associated, RNAs in the nucleus and found that HTLV-1 RNAs were processed post-transcriptionally in infected cells. RNA processing was evident for the sense viral transcripts but not the anti-sense ones. We also found a higher proportion of CG di-nucleotides in proviral sequences of HTLV-1-infected cells, when compared to the HIV-1 genomic sequence. It has been reported recently that CG dinucleotide content of viral sequence is associated with susceptibility to the antiviral ZC3HAV1 (ZAP), suggesting the involvement of this protein in the regulation of HTLV-1 transcripts. To analyse the effect of ZAP on HTLV-1 transcripts, we over-expressed it in HTLV-1-infected cells. We found there was a dose-dependent reduction in virus production with ZAP expression. We further knocked down endogenous ZAP with two independent targeting siRNAs and observed a significant increase in virus production in the culture supernatant. Other delta-type retroviruses such as simian T-cell leukaemia virus and bovine leukaemia virus, also contain high CG-dinucleotide contents in their viral genomes, suggesting that ZAP-mediated suppression of viral transcripts might be a common feature of delta-type retroviruses, which cause minimal viremia in their natural hosts.


The post-transcriptional regulatory mechanism involving ZAP might allow HTLV-1 to maintain a delicate balance required for prolonged survival in infected individuals.


The human T cell leukaemia virus type 1 (HTLV-1) was the first retrovirus that was associated with a human disease [1,2,3]. Specifically, it causes adult T-cell leukaemia/lymphoma (ATL) and several inflammatory diseases such as HTLV-1-associated myelopathy/tropical spastic paraparesis [4,5,6,7]. Further, this virus infects approximately 20 million people worldwide, and mainly those living in endemic areas including Southwestern Japan, the Caribbean, and sub-Saharan Africa [8]. As a retrovirus, HTLV-1 integrates into the genome of infected cells in the form of a provirus. The plus and minus strands of this provirus encode several viral proteins, such as Tax and HBZ [9]. Another retrovirus, human immunodeficiency virus type I (HIV-1), show vigorous viral replication without anti-retroviral drugs, but HTLV-1 persists in infected individuals without virus in the plasma even in the absence of anti-retroviral drugs. It has also been reported that some HIV-1-infected clones expand clonally like HTLV-1-infected cells [10, 11]. Most of them carry defective proviruses [12], whereas HTLV-1-infected clones carrying full-length provirus seems to expand without producing viral particles, suggesting latency-prone phenotype of HTLV-1-infected cells. Most of HTLV-1-infected cells are transcriptionally silenced in vivo, but they quickly increase a few hours after culture ex vivo [13, 14]. In contrast, the minus-strand transcript HBZ is present in the majority of infected cells, at low levels [15, 16]. This pattern of viral gene expression is regulated by a complex mechanism involving cellular, viral, and metabolic factors [17, 18]. For example, at the post-transcriptional level, the plus-strand-encoded Rex viral protein plays an important role in orchestrating the nuclear export of viral mRNAs [19]. In addition, it has been reported that HBZ mRNA is retained in the nucleus [20]. These indicates there would be unidentified regulatory mechanisms for proviral transcriptional regulation.

Microbial infections are detected by the host through multiple mechanisms. Viruses can be recognized by pattern recognition receptors such as RIG-I, which trigger an intracellular signalling cascade activating the expression of inflammatory mediators to eliminate infected cells and pathogens [21]. In addition to these transmembrane and cytoplasmic receptors, viral infections can also be inhibited by the action of cellular restriction factors at different stages in their life cycle [22]. Among these restriction factors, the ZC3HAV1 (ZAP) protein has been reported to exert antiviral activity against a broad range of viral families including alphaviruses, filoviruses, Hepatitis B virus, influenza A virus, and retroviruses such HIV-1 [23]. It still remains largely unknown how these antiviral mechanisms control persistence of HTLV-1 infection in the host [24]. In this study, we sought to determine whether HTLV-1 viral transcripts could be detected by ZAP, targeting them for degradation or processing.


The cap analysis of gene expression (CAGE) profile of an HTLV-1-infected cell line suggests the processing or degradation of viral RNAs

The expression of HTLV-1 transcripts occurs in bursts or intermittently in infected cells [25, 26], indicating that the regulation of HTLV-1 transcripts at the transcriptional or post-transcriptional levels is more complex than our current understanding. To gain insight into the pattern of proviral transcript regulation in more detail, we conducted CAGE [27], which detects 5′ capped RNAs and is therefore useful to identify 5′ end of RNAs and transcription start sites (TSSs), as well as to quantify coding and non-coding RNAs with a 5′-cap structure. We analysed the TBX-4B cell line, isolated from the peripheral blood of an HTLV-1-infected individual, which contains one copy of the integrated provirus in chromosome 22; moreover, this provirus is highly transcribed (Fig. 1a) [28]. The level of transcription in the sense direction in these cells was much higher than that in the antisense direction (Additional file 1: Fig. S1). In general, CAGE signals tended to accumulate near the TSS, as observed for the host genes PNPLA3 and SAMM50 (Fig. 1a). Unexpectedly, the CAGE signal was not only detectable in the LTRs, which serve as promoters, but was broadly spread all along the provirus (Fig. 1a). These data suggested two possibilities. First, there might be cryptic proviral transcription from the region within the 5′ and 3′ LTRs. Second, HTLV-1 RNAs might be post-transcriptionally degraded, resulting in a broad CAGE signal when cleaved RNAs are re-capped. Previous studies reported that some CAGE tags align to not only TSSs or enhancer regions but also other genomic regions such as exonic regions, indicating that transcribed RNAs are processed and recapped and thereby detectable by CAGE [29, 30]. To distinguish these two possibilities, we performed a modified version of CAGE, called native elongating transcript-CAGE (NET-CAGE), where nascent RNAs that are not yet affected by post-transcriptional processing are purified from chromatin and used as an input for CAGE [31]. We compared the ratio of signals in the internal region of the provirus between CAGE and NET-CAGE, and found that with NET-CAGE this was much lower than that with CAGE in the sense direction (Fig. 1b). This tendency was not observed in the antisense direction (Fig. 1c). We calculated the proportion of CAGE signals in the internal region of the provirus within the total number aligning to the whole provirus and plotted the results for the plus and minus strands separately (Fig. 1d). We observed a larger number of peaks in CAGE than in NET-CAGE for plus-strand-aligning reads (Fig. 1d, top). This difference was not observed for the minus strand-aligning reads (Fig. 1d, bottom). It has been reported that broad CAGE signals occur around the TSSs of human genes with high levels of transcription [29]. We searched the whole human genome for a high-density CAGE signal in TBX-4B cells. We performed a sliding window analysis to investigate the number of CAGE peaks present in respective 10-kb windows, which is approximately the size of the provirus. We discovered that the integrated HTLV-1 provirus was the most significantly enriched region for multiple CAGE signals within the human genome, as shown by the red circles in Fig. 1e. This tendency was not observed with NET-CAGE (Fig. 1f), suggesting that 5′-capped RNAs originating from the internal regions of the provirus are rare in the nucleus but present in the cytoplasm. These findings indicate that the broad CAGE signal observed in the internal proviral region (Fig. 1a) are generated at the post-transcriptional level.

Fig. 1
figure 1

Evidence for RNA processing in an HTLV-1-infected cell line. a One representative cap-associated gene expression (CAGE) profile for TBX-4B cells is visualized by integrate genomic viewer (IGV). Each gray line represents each sequencing read we detected in the analysis. The region around the integrated provirus is shown as red bar. b Quantification of the CAGE and nascent-elongating-transcript CAGE (NET-CAGE) reads in the sense orientation within the proviral region, between the LTRs. c The same is shown for reads in the anti-sense orientation. d CAGE and NET-CAGE profiles in the proviral region between the LTRs (top: sense signal; bottom: anti-sense signal). Only the first nucleotide of the 5′ side of the transcripts is shown (transcription start site; TSS). The signal was calculated as a percentage of total reads that aligned to the provirus. e Signal in a sliding window of 10 kb is shown for CAGE. f NET-CAGE datasets of TBX-4B cells. Red circles correspond to virus-aligned reads, whereas black circles correspond to human genome-aligned reads

The proviral sequences of delta retroviruses contain higher proportions of CG di-nucleotides

Several cellular proteins target viral RNA, relying on different mechanisms to detect them and mount an anti-viral immune response. ZAP is a cellular restriction factor that shows antiviral activity against a wide range of RNA viruses [23, 32,33,34,35]. In addition to a complex secondary structure [36, 37], a high content of CG di-nucleotides seems to be a key factor for the recognition of targets by ZAP [38]. The evolution of RNA viruses of vertebrates, such as HIV-1, has been reported to lead to viral genomes with a low proportion of CGs to escape from the anti-viral host mechanism [38, 39]. We analysed the proportions of various dinucleotides in the human genome and found that the CG dinucleotide was under-represented compared to other dinucleotides, consistent with previous reports [39, 40] (Fig. 2a). We next performed the same analysis for several retroviruses and found that CG suppression in HIV-1 and HIV-2 was similar to that of the human genome (Fig. 2b), but was not as apparent in HTLV-1 and HTLV-2. The same tendency was observed for other delta retroviruses and previously-reported ZAP-target viruses (Fig. 2c, d). We then analysed the CG di-nucleotide content and distribution along the provirus based on three HIV-1 and three HTLV-1 sequences (Fig. 2e, f). A sequence of approximately the same length as HIV-1 and HTLV-1, 10 kb and 9 kb, respectively, with a random distribution of CG di-nucleotides, was used for comparison. On one hand, there was a clear depletion of CG dinucleotides in HIV-1 genomes (Fig. 2e). On the other hand, the depletion of the CG di-nucleotides was not as evident in HTLV-1 sequences compared to that in the random control (Fig. 2f). These data demonstrated that HTLV-1 maintains a high CG dinucleotide content despite long-term co-existence with humans [41].

Fig. 2
figure 2

Delta retroviruses have a higher a CG content. a ρ statistic of each dinucleotide within the human genome, shown independently for some chromosomes. b The same analysis is shown for HIV-1, HIV-2 (lentiviruses), HTLV-1, HTLV-2 (delta retroviruses), JEV, and Ebola (previously reported to be ZAP targets). c CG dinucleotide content is shown for several retroviruses grouped by families. d ρ statistic of each dinucleotide within retroviruses and viruses reported to be ZAP targets. Proportion of CG-dinucleotides in a 200-nucleotide window in viral (HIV-1: e, HTLV-1: f) and a random sequence. Below each graph, individual proviral sequences are shown, wherein black lines represent a CG di-nucleotide

HTLV-1 viral transcripts contain a higher proportion of CG di-nucleotides and could be targeted by ZAP

The HTLV-1 genome encodes several viral RNAs both in sense and antisense orientations [42]. We next analysed the CG dinucleotide content of HIV-1 and HTLV-1 transcripts together with the those of the host cell. The CG dinucleotide content per transcript length for all HTLV-1 transcripts was higher than the average value for human transcripts. In contrast, that of all HIV-1 transcripts was lower than the average value for human transcripts (Fig. 3a, b). CG di-nucleotides were found to be distributed broadly in the HTLV-1 transcripts tax and HBZ compared to distribution in HIV-1 transcripts such as tat and nef (Fig. 3c). Transcripts of simian T-cell leukaemia virus type 1 (STLV-1) and bovine leukaemia virus (BLV) also showed a higher CG content compared to those of their hosts (Fig. 3d, e, respectively).

Fig. 3
figure 3

HTLV-1 transcripts show a higher content of CG di-nucleotides. a The CG dinucleotide content of HIV-1 (red dots), HTLV-1 (blue dots), and human (grey dots) transcripts is shown in correlation with the transcripts’ lengths. HIV-1 mRNAs used in this analysis are shown in Additional file 1: Table S4 [56]. b The average number of CG di-nucleotides per kb of transcript is shown for the same organisms. Numbers in parentheses indicate the total number of transcripts considered for the calculation. c Individual CG dinucleotides are shown for viral transcripts. Blue: HTLV-1, red: HIV-1. d Proportion of CG-dinucleotides in relation to transcript length for STLV-1 (NCBI GenBank: JX987040.1) and monkey (Macaca mulatta) genomes. To visualize the difference between average value and STLV-1 Tax, expanded version of the graph is shown. e Proportion of CG-dinucleotides in relation to transcript length for BLV and bovine (Bos taurus) genomes

ZAP exerts an inhibitory effect on the production of HTLV-1

Next, we made a hypothesis that HTLV-1 transcripts are targeted by ZAP because of their high GC content. To test this, we analysed the effect of ZAP expression on the HTLV-1 transcript tax. We transfected HeLa cells with Tax and ZAP expression vectors and found that ZAP expression decreased Tax RNA level in a dose-dependent manner (Fig. 4a). The effect was modest but statistically significant. These data demonstrated that HTLV-1 viral transcripts contain a high proportion of CG di-nucleotides, and could be targeted by ZAP. To evaluate the role of ZAP in HTLV-1 production, we over-expressed ZAP using a ZAP-expression vector to transfect JEX22 cells, a cell line latently infected with HTLV-1. This cell line expresses viral transcripts upon stimulation with PMA/ionomycin. The over-expression of ZAP decreased the production of HTLV-1 p19 protein in the culture supernatant in a dose-dependent manner (Fig. 4b). To investigate the role of ZAP under physiological conditions, we next knocked down endogenous expression using siRNAs in JEX22 cells. ZAP knock-down using two different siRNAs significantly reduced the level of some sensei viral RNAs, such as tax, gag, and pol region (Fig. 4c), as well as virus production in the culture supernatant (Fig. 4d). The better ZAP suppression by ZAP2 siRNA resulted in the enhanced production of p19 compared to that with ZAP1 siRNA. These results indicate that ZAP expression negatively regulates virus production in HTLV-1-infected cells. We also performed ZAP over-expression and knock-down experiments using the HIV-1-infected cell line J1.1. There were no significant changes in p24 expression with either ZAP over-expression or knock-down (Fig. 4e, f, respectively). These data demonstrate that HTLV-1 is susceptible to the host cellular defence mechanism mediated by ZAP, possibly due to the high CG content of the viral genome sequence.

Fig. 4
figure 4

The involvement of ZAP in the regulation of HTLV-1 production. a HeLa cells were transfected with a Tax expression vector in the presence of increasing amounts of a ZAP expression vector. Changes in the level of tax transcripts (top) were measured by RT-qPCR, in addition to those of hZAP (bottom). *p < 0.0001. b JEX22 cells were transfected with increasing amounts of a human ZAP expression vector and cultured for 24 h. Stimulation with PMA/ionomycin was performed for 4 h before collecting the supernatant and cells for analysis. c, d JEX22 cells were transfected with the indicated siRNAs and cultured for 24 h. They were then stimulated, as indicated previously in this figure, before analysis. HTLV-1 RNAs (c) or p19 protein (d) was was measured (top panel) upon knock-down of endogenous ZAP (bottom panel). e J1.1 cells were transfected with increasing amounts of a human ZAP expression vector and cultured for 24 h. Stimulation with TNFα was performed for 4 h before collecting the supernatant and the cells for analysis. f J1.1 cells were transfected with the indicated siRNAs and cultured for 24 h. They were then stimulated as indicated before analysis. No significant changes were observed in p24 protein levels in the supernatant (top panel) upon knock-down of endogenous ZAP (bottom panel). *p < 0.05; **p ≤ 0.001; ***p < 0.0001


During HTLV-1 natural infection, there is no detectable viremia in the peripheral blood of infected individuals, even in the absence of anti-retroviral drugs. This is in sharp contrast to that with another human retrovirus, HIV-1, in which viral latency is rare in the absence of anti-retroviral drugs. It seems that HIV-1 and HTLV-1 evolved quite differently in terms of strategies to achieve persistent infection in the host. Both HIV-1 and HTLV-1 target CD4+ T cells. However, HIV-1 induces apoptosis in infected cells through vigorous virus production, which suppresses the anti-viral immune response and enables the virus to achieve persistent infection in the host. In contrast, HTLV-1 rarely produces viral particles, but rather promotes the proliferation and survival of infected cells to maintain a viral reservoir in the host. These different viral strategies to achieve persistent infection are associated with viral pathogenesis. HIV-1 induces acquired immunodeficiency syndrome by depleting infected CD4+ T cells, whereas HTLV-1 leads to the development of leukaemia in HTLV-1-infected cells in some infected individuals after a long latency, as a consequence of the enhanced proliferation and/or survival of infected CD4+ T cells. Several reasons have been proposed as to explain the enhanced latency of HTLV-1, such as different activity of the viral LTR promoters and the presence of antisense transcripts in HTLV-1 [42,43,44].

In this study, we identified another mechanism that could explain why HTLV-1 is prone to latency in vivo. The anti-viral systems of the host cell are negative regulators of viruses. However, HTLV-1 seems to take advantage of its high susceptibility to the host antiviral system, specifically ZAP-mediated viral RNA processing, to minimize viral antigen expression and thereby maintain latent infection. ZAP-mediated RNA processing is also a type of restriction factor for these viruses. Based on our findings including an abnormal CAGE pattern (Fig. 1) and high CG-dinucleotide content in the HTLV-1 genome (Fig. 2), we suggest that ZAP might regulate HTLV-1 transcripts at the post-transcriptional level. CAGE is a type of RNA-seq that facilitates the identification of the 5′ end of an RNA molecule by capturing its 5′ cap structure. In principle, we can detect both coding and non-coding 5′-capped RNA. Our CAGE and NET-CAGE results indicated that the broad CAGE signals in the internal proviral region result from RNAs present in the cytoplasm but not in the nucleus. This suggests the possibility that HTLV-1 RNA is processed at least partially mediated by ZAP, can be re-capped in the cytoplasm and thereby detected only by CAGE.

It has been reported that CAGE signals can cross exon–exon junctions, and therefore, these must have arisen from at least partially processed mRNAs [29]. Our data also indicate that HTLV-1 transcripts are processed at least partially by ZAP and can be recapped by an unknown mechanism and thereby detected by CAGE.

It has been reported recently that HTLV-1 transcription in the sense orientation is only intermittently active in an ATL cell line and primary infected T-cell clones [25, 26]. Further, there is a strong burst of proviral sense transcription but the expression is transient and is terminated spontaneously. Glucose metabolism and oxygen availability play a role in the reactivation of proviral expression from latency [45]; however, how such HTLV-1 transcriptional bursts terminate remains elusive. Since ZAP expression is induced by viral infection via an IRF-3-dependent pathway [46], the HTLV-1 burst might trigger ZAP expression. Then induced ZAP might target HTLV-1 transcripts as a negative feedback mechanism. It has been reported that promoter-associated small RNAs generated from the TSS of the c-Myc gene suppress c-Myc messenger RNA abundance [29]. Thus, processed viral RNA might not be just a consequence of RNA degradation but also could have a regulatory function to control HTLV-1 production at the post-transcriptional and translational level.

The abundance of antisense transcript CAGE signals was not significantly different between conventional CAGE and NET-CAGE (Fig. 1c), although both sense and antisense transcripts were found to contain a high CG-dinucleotide content. The CG-dinucleotide content not only determines ZAP susceptibility, but other factors such as the secondary structure of RNA are also involved [36, 37]. The abundance of transcripts might also play a role in susceptibility to ZAP; therefore, the low level of antisense transcripts in TBX-4B cells makes them less sensitive to ZAP. It has been reported that antisense proviral transcription is constitutively active, whereas sense transcription is frequently suppressed or expressed only intermittently. Previous studies also reported that transcription is regulated by genetic and epigenetic mechanisms [47,48,49,50]; however, the findings of the current study indicate that there is another regulatory mechanism controlling proviral transcription at the post-transcriptional level.


HTLV-1 has evolved to maintain latency via multiple mechanisms. We show in this study that post-transcriptional RNA processing via antiviral ZAP is an additional strategy through which HTLV-1 achieves persistent infection in the host.


Cell lines

TBX-4B [28], an HTLV-1-infected clone derived from PBMCs of a HAM/TSP patient, was kindly provided by Dr. Charles Bangham (Imperial College London). These cells were cultured in RPMI supplemented with 20% FBS (SIGMA), 200 U/ml human recombinant IL-2 (Wako), 100 U/ml penicillin (Nacalai Tesque), and 100 μg/ml streptomycin (Nacalai Tesque). TBX-4B contains one copy of the integrated provirus in chromosome 22. JEX22 cells are latently infected with HTLV-1, and were kindly provided by Dr. Jun-ichi Fujisawa (Kansai Medical University). These cells contain two copies of integrated proviruses, in chromosomes 4 and 16. HeLa, a human adenocarcinoma cell line, was also used for transfection experiments. J1.1cells were obtained through the AIDS Research and Reference Reagent Program, Division of AIDS, NIAID, NIH from Dr. Thomas Folks [51]. These cells were handled in a bio-containment level 3 room. Except for TBX-4B cells, which were cultured as detailed, all other cells were cultured in RPMI supplemented with 10% FBS, 100 U/ml penicillin (Nacalai Tesque), and 100 μg/ml streptomycin (Nacalai Tesque).

Proviral DNA sequence analysis

The proviral sequences of several retroviruses were obtained from PubMed. Accession numbers are provided in Additional file 1: Table S1. The number CG di-nucleotides was counted using Geneious (Biomatters Ltd.) software, which was also used to generate the schematic representation of their distribution within the proviral sequences. The number of CG di-nucleotides for each virus was plotted in graphs generated with GraphPad software. For comparison, a 9040 nucleotide-long random sequence was generated with the following website: ( The rho statistic (ρ) was computed for each dinucleotide pair using the R library seqinr. In brief, ρ measures how over- or under-represented a particular DNA nucleotide is and for a DNA dinucleotide, ρ is defined as:

$$\rho \left( {xy} \right) = \frac{{f\left( {xy} \right)}}{f\left( x \right) \times f\left( y \right)}$$

where f is the frequency of the nucleotide x/y/xy. ρ is expected to be equal to 1 when dinucleotide (xy) is formed by chance. If ρ is more than 1, the dinucleotide is much more common than expected, i.e. over-represented and vice versa.

Bioinformatic analysis of human and viral transcripts

The correlation between CG di-nucleotide content and the length of the transcript was calculated and plotted in graphs for human, HIV-1, and HTLV-1 genes using the ggplot2 package in R. Datasets are provided in Additional file 1: Table S2.


Total and nascent RNAs were harvested from TBX-4B cells as previously described [31]. CAGE libraries were generated using CAGE library preparation kit (KK DNAFORM) following the manufacturer’s instructions. Briefly, first strand cDNA was synthesized from 5 μg of total RNA using random primers. The cap at the 5′end of the RNAs was biotinylated to facilitate the subsequent cap-trapping step. Remaining RNA fragments were digested with RNaseONE enzyme. Approximately 10 ng of each cDNA was used for linker ligation and library preparation. CAGE libraries were quantified by qPCR and size distribution was evaluated by TapeStation (Agilent Technologies) before sequencing in a NextSeq device (Illumina) as described previously [52]. NET-CAGE was performed as described previously [31]. We added the step to separate nuclear RNA and cytoplasmic RNA before we performed CAGE protocol.

Data analysis for CAGE and NET-CAGE

Fastq files obtained from the sequencers were quality-checked and adaptor sequences were trimmed. Alignment to the human genome (hg19) and the HTLV-1 genome (Genbank, AB513134.1) was performed using the BWA-MEM algorithm with default parameters [53, 54]. TSSs were counted after obtaining the position of the first nucleotide of each read from the sequencing data.

Knock-down and over-expression of ZAP

The knock-down of endogenous ZAP expression in JEX22 cells was carried out using two different siRNAs targeting the following sequences: GGUAAAACCUGGACGGACU (siZAP1) and GUGUAAGGGUUGUCCGCUU (siZAP2) [34]. siRNAs were transfected by electroporation into 2 × 106 cells (NepaGene). After overnight culture, cells were stimulated for 4 h with PMA (50 ng/ml) and ionomycin (1 μM). The culture supernatant was collected to determine p19 presence by ELISA (RETROtek) in accordance with the manufacturer’s instructions and RNA was extracted using a RNeasy (Qiagen). To analyze the efficiency of knock-down of ZAP, 500–1000 ng of total RNA was used to synthesize cDNA, and ZAP expression was determined by RT-qPCR. HTLV-1 transcripts were also quantified by RT-qPCR. Results were calculated using the delta–delta CT method, normalising to 18SrRNA expression and comparing to ZAP expression in cells transfected with control siRNA (GAPDH). The sequences of the primers used are listed in Additional file 1: Table S3.

The same procedure was followed for ZAP knock-down in J1.1 cells, an HIV-1-infected cell line. For this, stimulation was performed with TNFα (10 ng/ml), and virus production was determined based on the presence of p24 antigen in the culture supernatant by ELISA (RETROtek) following the manufacturer’s instructions.

For ZAP over-expression experiments, JEX22 or J1.1 cells were electroporated with increasing amounts of a ZAP expression vector (Addgene number 45907). The following day, cells were stimulated with PMA/ionomycin or TNFα for 4 h. Culture supernatants were collected to measure the presence of viral antigens by ELISA, as described previously herein. RNA was extracted from the transfected cells to confirm the effective over-expression of ZAP.

Over-expression of Tax and ZAP

HeLa cells (1.5 × 106 cells/2 ml) were seeded in a 6-well plate, and the following day they were transfected with a Tax expression vector, pCG-Tax [55], in the presence of increasing amounts of a ZAP expression vector. Twenty-four hours later, RNA was extracted, and Tax transcripts were semi-quantified by RT-qPCR, in addition to ZAP, based the delta–delta CT method as reported previously [49].

Availability of data and materials

The fastq files that were obtained in this study have been deposited in the DDBJ, DNA Data Bank of Japan (accession no. DRA009348). Data on the findings reported here are available from the corresponding author upon request.


  1. Poiesz BJ, Ruscetti FW, Gazdar AF, Bunn PA, Minna JD, Gallo RC. Detection and isolation of type C retrovirus particles from fresh and cultured lymphocytes of a patient with cutaneous T-cell lymphoma. Proc Natl Acad Sci USA. 1980;77:7415–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Hinuma Y, Nagata K, Hanaoka M, Nakai M, Matsumoto T, Kinoshita KI, Shirakawa S, Miyoshi I. Adult T-cell leukemia: antigen in an ATL cell line and detection of antibodies to the antigen in human sera. Proc Natl Acad Sci USA. 1981;78:6476–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Futsch N, Mahieux R, Dutartre H. HTLV-1, the other pathogenic yet neglected human retrovirus: from transmission to therapeutic treatment. Viruses. 2017;10:1.

    Article  PubMed Central  CAS  Google Scholar 

  4. Uchiyama T, Yodoi J, Sagawa K, Takatsuki K. Adult T-cell leukemia: clinical and hematologic features of 16 cases. Blood. 1977;50:481–92.

    Article  CAS  PubMed  Google Scholar 

  5. Gessain A, Barin F, Vernant JC, Gout O, Maurs L, Calender A, de The G. Antibodies to human T-lymphotropic virus type-I in patients with tropical spastic paraparesis. Lancet. 1985;2:407–10.

    Article  CAS  PubMed  Google Scholar 

  6. Osame M, Usuku K, Izumo S, Ijichi N, Amitani H, Igata A, Matsumoto M, Tara M. HTLV-I associated myelopathy, a new clinical entity. Lancet. 1986;1:1031–2.

    Article  CAS  PubMed  Google Scholar 

  7. Watanabe T. HTLV-1-associated diseases. Int J Hematol. 1997;66:257–78.

    Article  CAS  PubMed  Google Scholar 

  8. Gessain A, Cassar O. Epidemiological aspects and world distribution of HTLV-1 infection. Front Microbiol. 2012;3:388.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Journo C, Douceron E, Mahieux R. HTLV gene regulation: because size matters, transcription is not enough. Future Microbiol. 2009;4:425–40.

    Article  CAS  PubMed  Google Scholar 

  10. Maldarelli F, Wu X, Su L, Simonetti FR, Shao W, Hill S, Spindler J, Ferris AL, Mellors JW, Kearney MF, et al. HIV latency. Specific HIV integration sites are linked to clonal expansion and persistence of infected cells. Science. 2014;345:179–83.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Wagner TA, McLaughlin S, Garg K, Cheung CY, Larsen BB, Styrchak S, Huang HC, Edlefsen PT, Mullins JI, Frenkel LM. HIV latency. Proliferation of cells with HIV integrated into cancer genes contributes to persistent infection. Science. 2014;345:570–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Cohn LB, Silva IT, Oliveira TY, Rosales RA, Parrish EH, Learn GH, Hahn BH, Czartoski JL, McElrath MJ, Lehmann C, et al. HIV-1 integration landscape during latent and active infection. Cell. 2015;160:420–32.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Hanon E, Stinchcombe JC, Saito M, Asquith BE, Taylor GP, Tanaka Y, Weber JN, Griffiths GM, Bangham CR. Fratricide among CD8(+) T lymphocytes naturally infected with human T cell lymphotropic virus type I. Immunity. 2000;13:657–64.

    Article  CAS  PubMed  Google Scholar 

  14. Bangham CRM, Miura M, Kulkarni A, Matsuoka M. Regulation of Latency in the human T cell leukemia virus, HTLV-1. Annu Rev Virol. 2019;6:365–85.

    Article  CAS  PubMed  Google Scholar 

  15. Satou Y, Yasunaga J, Yoshida M, Matsuoka M. HTLV-I basic leucine zipper factor gene mRNA supports proliferation of adult T cell leukemia cells. Proc Natl Acad Sci USA. 2006;103:720–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Usui T, Yanagihara K, Tsukasaki K, Murata K, Hasegawa H, Yamada Y, Kamihira S. Characteristic expression of HTLV-1 basic zipper factor (HBZ) transcripts in HTLV-1 provirus-positive cells. Retrovirology. 2008;5:34.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  17. Kulkarni A, Bangham CRM. HTLV-1: regulating the balance between proviral latency and reactivation. Front Microbiol. 2018;9:449.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Kulkarni A, Taylor GP, Klose RJ, Schofield CJ, Bangham CR. Histone H2A monoubiquitylation and p38-MAPKs regulate immediate-early gene-like reactivation of latent retrovirus HTLV-1. JCI Insight. 2018;3:20.

    Article  Google Scholar 

  19. Nakano K, Watanabe T. HTLV-1 Rex: the courier of viral messages making use of the host vehicle. Front Microbiol. 2012;3:330.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Rende F, Cavallari I, Corradin A, Silic-Benussi M, Toulza F, Toffolo GM, Tanaka Y, Jacobson S, Taylor GP, D’Agostino DM, et al. Kinetics and intracellular compartmentalization of HTLV-1 gene expression: nuclear retention of HBZ mRNAs. Blood. 2011;117:4855–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Wu J, Chen ZJ. Innate immune sensing and signaling of cytosolic nucleic acids. Annu Rev Immunol. 2014;32:461–88.

    Article  CAS  PubMed  Google Scholar 

  22. Chemudupati M, Kenney AD, Bonifati S, Zani A, McMichael TM, Wu L, Yount JS. From APOBEC to ZAP: diverse mechanisms used by cellular restriction factors to inhibit virus infections. Biochim Biophys Acta Mol Cell Res. 2019;1866:382–94.

    Article  CAS  PubMed  Google Scholar 

  23. Gao G, Guo X, Goff SP. Inhibition of retroviral RNA production by ZAP, a CCCH-type zinc finger protein. Science. 2002;297:1703–6.

    Article  CAS  PubMed  Google Scholar 

  24. Journo C, Mahieux R. HTLV-1 and innate immunity. Viruses. 2011;3:1374–94.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Billman MR, Rueda D, Bangham CRM. Single-cell heterogeneity and cell-cycle-related viral gene bursts in the human leukaemia virus HTLV-1. Wellcome Open Res. 2017;2:87.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Mahgoub M, Yasunaga JI, Iwami S, Nakaoka S, Koizumi Y, Shimura K, Matsuoka M. Sporadic on/off switching of HTLV-1 Tax expression is crucial to maintain the whole population of virus-induced leukemic cells. Proc Natl Acad Sci USA. 2018;115:E1269–78.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Shiraki T, Kondo S, Katayama S, Waki K, Kasukawa T, Kawaji H, Kodzius R, Watahiki A, Nakamura M, Arakawa T, et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc Natl Acad Sci USA. 2003;100:15776–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Cook LB, Melamed A, Niederer H, Valganon M, Laydon D, Foroni L, Taylor GP, Matsuoka M, Bangham CR. The role of HTLV-1 clonality, proviral structure, and genomic integration site in adult T-cell leukemia/lymphoma. Blood. 2014;123:3925–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Affymetrix ETP. Cold Spring Harbor Laboratory ETP: post-transcriptional processing generates a diversity of 5′-modified long and short RNAs. Nature. 2009;457:1028–32.

    Article  CAS  Google Scholar 

  30. Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T, Mortazavi A, Tanzer A, Lagarde J, Lin W, Schlesinger F, et al. Landscape of transcription in human cells. Nature. 2012;489:101–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Hirabayashi S, Bhagat S, Matsuki Y, Takegami Y, Uehata T, Kanemaru A, Itoh M, Shirakawa K, Takaori-Kondo A, Takeuchi O, et al. NET-CAGE characterizes the dynamics and topology of human transcribed cis-regulatory elements. Nat Genet. 2019;51:1369–79.

    Article  CAS  PubMed  Google Scholar 

  32. Bick MJ, Carroll JW, Gao G, Goff SP, Rice CM, MacDonald MR. Expression of the zinc-finger antiviral protein inhibits alphavirus replication. J Virol. 2003;77:11555–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Muller S, Moller P, Bick MJ, Wurr S, Becker S, Gunther S, Kummerer BM. Inhibition of filovirus replication by the zinc finger antiviral protein. J Virol. 2007;81:2391–400.

    Article  PubMed  CAS  Google Scholar 

  34. Zhu Y, Chen G, Lv F, Wang X, Ji X, Xu Y, Sun J, Wu L, Zheng YT, Gao G. Zinc-finger antiviral protein inhibits HIV-1 infection by selectively targeting multiply spliced viral mRNAs for degradation. Proc Natl Acad Sci USA. 2011;108:15834–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Mao R, Nie H, Cai D, Zhang J, Liu H, Yan R, Cuconati A, Block TM, Guo JT, Guo H. Inhibition of hepatitis B virus replication by the host zinc finger antiviral protein. PLoS Pathog. 2013;9:e1003494.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Huang Z, Wang X, Gao G. Analyses of SELEX-derived ZAP-binding RNA aptamers suggest that the binding specificity is determined by both structure and sequence of the RNA. Protein Cell. 2010;1:752–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Chen S, Xu Y, Zhang K, Wang X, Sun J, Gao G, Liu Y. Structure of N-terminal domain of ZAP indicates how a zinc-finger protein recognizes complex RNA. Nat Struct Mol Biol. 2012;19:430–5.

    Article  CAS  PubMed  Google Scholar 

  38. Takata MA, Goncalves-Carneiro D, Zang TM, Soll SJ, York A, Blanco-Melo D, Bieniasz PD. CG dinucleotide suppression enables antiviral defence targeting non-self RNA. Nature. 2017;550:124–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Greenbaum BD, Levine AJ, Bhanot G, Rabadan R. Patterns of evolution and host gene mimicry in influenza and other RNA viruses. PLoS Pathog. 2008;4:e1000079.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  40. Karlin S, Mrazek J. Compositional differences within and between eukaryotic genomes. Proc Natl Acad Sci USA. 1997;94:10227–32.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Verdonck K, Gonzalez E, Van Dooren S, Vandamme AM, Vanham G, Gotuzzo E. Human T-lymphotropic virus 1: recent knowledge about an ancient infection. Lancet Infect Dis. 2007;7:266–81.

    Article  CAS  PubMed  Google Scholar 

  42. Gaudray G, Gachon F, Basbous J, Biard-Piechaczyk M, Devaux C, Mesnard JM. The complementary strand of the human T-cell leukemia virus type 1 RNA genome encodes a bZIP transcription factor that down-regulates viral transcription. J Virol. 2002;76:12813–22.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Grassmann R, Aboud M, Jeang KT. Molecular mechanisms of cellular transformation by HTLV-1 Tax. Oncogene. 2005;24:5976–85.

    Article  CAS  PubMed  Google Scholar 

  44. Yoshida M, Satou Y, Yasunaga J, Fujisawa J, Matsuoka M. Transcriptional control of spliced and unspliced human T-cell leukemia virus type 1 bZIP factor (HBZ) gene. J Virol. 2008;82:9359–68.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Kulkarni A, Mateus M, Thinnes CC, McCullagh JS, Schofield CJ, Taylor GP, Bangham CRM. Glucose metabolism and oxygen availability govern reactivation of the latent human retrovirus HTLV-1. Cell Chem Biol. 2017;24(1377–1387):e1373.

    Google Scholar 

  46. Wang N, Dong Q, Li J, Jangra RK, Fan M, Brasier AR, Lemon SM, Pfeffer LM, Li K. Viral induction of the zinc finger antiviral protein is IRF3-dependent but NF-kappaB-independent. J Biol Chem. 2010;285:6080–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Takeda S, Maeda M, Morikawa S, Taniguchi Y, Yasunaga J, Nosaka K, Tanaka Y, Matsuoka M. Genetic and epigenetic inactivation of tax gene in adult T-cell leukemia cells. Int J Cancer. 2004;109:559–67.

    Article  CAS  PubMed  Google Scholar 

  48. Taniguchi Y, Nosaka K, Yasunaga J, Maeda M, Mueller N, Okayama A, Matsuoka M. Silencing of human T-cell leukemia virus type I gene transcription by epigenetic mechanisms. Retrovirology. 2005;2:64.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  49. Satou Y, Miyazato P, Ishihara K, Yaguchi H, Melamed A, Miura M, Fukuda A, Nosaka K, Watanabe T, Rowan AG, et al. The retrovirus HTLV-1 inserts an ectopic CTCF-binding site into the human genome. Proc Natl Acad Sci USA. 2016;113:3054–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Melamed A, Yaguchi H, Miura M, Witkover A, Fitzgerald TW, Birney E, Bangham CR. The human leukemia virus HTLV-1 alters the structure and transcription of host chromatin in cis. Elife. 2018;7:e36245.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Perez VL, Rowe T, Justement JS, Butera ST, June CH, Folks TM. An HIV-1-infected T cell clone defective in IL-2 production and Ca2+ mobilization after CD3 stimulation. J Immunol. 1991;147:3145–8.

    CAS  PubMed  Google Scholar 

  52. Iwase SC, Miyazato P, Katsuya H, Islam S, Yang BTJ, Ito J, Matsuo M, Takeuchi H, Ishida T, Matsuda K, et al. HIV-1 DNA-capture-seq is a useful tool for the comprehensive characterization of HIV-1 provirus. Sci Rep. 2019;9:12326.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  53. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Miyazato P, Katsuya H, Fukuda A, Uchiyama Y, Matsuo M, Tokunaga M, Hino S, Nakao M, Satou Y. Application of targeted enrichment to next-generation sequencing of retroviruses integrated into the host human genome. Sci Rep. 2016;6:28324.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Fujisawa J, Seiki M, Toita M, Miyatake S, Arai K, Yoshida M. Cell-line specific activation of SV40 transcriptional enhancer by p40tax of HTLV-1. Jpn J Cancer Res. 1988;79:800–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Ocwieja KE, Sherrill-Mix S, Mukherjee R, Custers-Allen R, David P, Brown M, Wang S, Link DR, Olson J, Travers K, et al. Dynamic regulation of HIV-1 mRNA populations analyzed by single-molecule enrichment and long-read sequencing. Nucleic Acids Res. 2012;40:10345–55.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We thank Michi Miura for the R program to perform the quality check of Index reads and Jocelyn Y P N Turpin for advice and helpful discussion. We also thank Charles R.M. Bangham for providing the TBX-4B cell line and for his advice and helpful discussion. Finally, we thank Jun-ichi Fujisawa and Takaharu Ueno for establishing and providing the JEX22 cell line.


This work was supported by the JSPS KAKENHI (JP17K09016 and JP18KK0230 to YS; JP16KK0206, JP18K16122 to HK; JP18K08437 and JP18KK0452 to PM), the AMED (JP17fm0208012, JP19fk0410023 to YS), the Grant for Joint Research Project of the Institute of Medical Science, the University of Tokyo to YS, the grant from Kumamoto University Excellent Research Projects to YS, JST MIRAI to YS, and Kumamoto University Fellowship for Excellent Graduate Students to MM.

The funders had no role in the design of the study, collection of data, its interpretation, or the discussion to submit the work for publication.

Author information

Authors and Affiliations



YS conceived the study; PM, MM, BJYT, HK, MT, SI, JI, and YM performed methodology and formal analysis; MM, BJYT, JI and YS were involved in data curation; PM and YS wrote the original draft; PM, MM, BJYT, HK, MT, SI, JI, YM, and YS were involved in writing/review and editing; YS supervised the project; PM, HK, and YS were involved in project administration and Funding acquisition. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yorifumi Satou.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Fig. S1.

CAGE result of TBX-4B cells.

Additional file 2: Table S1.

Genome accession numbers. Table S2. Dataset accession numbers. Table S3. Primers and oligonucleotides used in this study. Table S4. HIV-1 sequences we analyzed in this study.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Miyazato, P., Matsuo, M., Tan, B.J.Y. et al. HTLV-1 contains a high CG dinucleotide content and is susceptible to the host antiviral protein ZAP. Retrovirology 16, 38 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: