Skip to main content

Intragenic transcriptional cis-activation of the human immunodeficiency virus 1 does not result in allele-specific inhibition of the endogenous gene



The human immunodeficiency virus type 1 (HIV-1) favors integration in active genes of host chromatin. It is believed that transcriptional interference of the viral promoter over the endogenous gene or vice versa might occur with implications in HIV-1 post-integrative transcriptional latency.


In this work a cell line has been transduced with a HIV-based vector and selected for Tat-inducible expression. These cells were found to carry a single silent integration in sense orientation within the second intron of the HMBOX1 gene. The HIV-1 Tat transactivator induced the viral LTR and repressed HMBOX1 expression independently of vector integration. Instead, single-cell quantitative in situ hybridization revealed that allele-specific transcription of HMBOX1 carrying the integrated provirus was not affected by the transactivation of the viral LTR in cis.


A major observation of the work is that the HIV-1 genome has inserted in genes that are also repressed by Tat and this could be an advantage for the virus during transcriptional reactivation. In addition, it has also been observed that transcription of the provirus and of the endogenous gene in which it is integrated may coexist at the same time in the same genomic location.


Retroviruses, such as human immunodeficiency virus type 1 (HIV-1) require reverse transcription and integration into host chromatin to establish a provirus as an obligatory replication step. The choice of the integration site is a crucial intermediate of the virus life cycle. The chromatin context determines the efficiency of viral transcription and is involved in the establishment of post-integrative latency that is the major obstacle to HIV-1 eradication with current antiviral therapies [13]. In addition, insertion of a provirus in the human genome can cause several adverse effects [4]. For example, insertion of the retrovirus close to a proto-oncogene may induce transformation of the cell. Gene therapy approaches suffer most from these effects and recently it has been demonstrated that the activation of an oncogene caused transformation in several children treated with a therapeutic retroviral vector [5]. In principle, insertion of an ectopic transcription unit within a gene may also result either in disruption of exonic sequences, introduction of alternative splicing or transcriptional interference. Clearly, these negative effects would increase in importance relative to the increasing unbalance of the endogenous gene expression between alleles.

Integration site selection by retroviruses is not sequence-specific but also not random. HIV-1 favors integration within active transcription units [68]. Additional features are the requirement of host factors such as the lens epithelium-derived growth factor LEDGF/p75 for efficient targeting of active transcription units [9] and a DNA substrate wrapped around nucleosomes. Indeed, integration of HIV-1 is linked to nucleosomal markers of active transcription (H3/H4 acetylation, H3K4 methylation) and negatively correlated with inhibitory modifications (H3K27 trimethylation and DNA CpG methylation) [10]. Subtle differences in the integration site choice exist among retroviruses. Murine leukemia virus (MLV) integrates within highly active promoters at ± 5 kb from the transcription start sites [7, 11]. HIV-1 instead, although also favoring active genes, does not show a preference for promoter-proximal integration. Rather, the virus inserts throughout the transcriptional unit with a bias towards intronic sequences: this is the likely result of the greater size of introns compared to exons within a gene [6].

A crucial aspect of HIV-1 pathogenesis is the control of provirus transcription. In particular the ability of the virus to maintain a reservoir of transcriptionally silent proviruses in resting memory T cells for long periods of time. Multiple mechanisms have been postulated to concur in these processes. Host factors, for example, may be limiting the activity of the Tat transactivator. Tat interacts with a cis-acting RNA element (trans-activation-responsive region; TAR) present at the 5' end of each viral transcript [12]. Through this interaction, the protein activates HIV-1 transcription by promoting the assembly of transcriptionally active complexes at the LTR through multiple protein-RNA and protein-protein interactions [13]. Tat interacts with the core RNA polymerase II [14, 15], the TATA-binding protein associated factor (TAFII) [16], TFIIH [17], cyclin-dependent protein kinase 7 [18], SP1 [19], nuclear factor of activated T cells (NFAT) [20], several histone acetyltransferases [2123] and cyclin T1 [24]. On the other hand, the chromatin context at the site of integration should determine whether the provirus is transcriptionally active, poised for activation or inactive [25]. Early studies showed that latency involved integration into regions of heterochromatin [26, 27]. More recent systematic genome-wide analysis of the chromosomal features negatively associated to HIV-1 transcription revealed that low levels of LTR-driven expression correlated with integration in gene deserts and in centromeric heterochromatin, but also in highly expressed cellular genes [28]. Furthermore, HIV-1 has been found in intronic regions of actively transcribed genes in resting memory CD4+ cells derived from patient on highly active antiretroviral treatment [29]. The paradox of HIV-1 integration in active genes while being transcriptionally silent requires molecular investigation of the phenomenon. Unfortunately most cellular models of HIV-1 post-integrative latency harbor the provirus outside of transcribed genes [3]. In this work a cell-line that carries a single repressed provirus integrated within the active transcription unit of the HMBOX1 gene has been generated. Tat-mediated induction of provirus transcription resulted in the inhibition of HMBOX1 expression. However, this effect could be ascribed to Tat expression and not to activation of the viral LTR. Indeed, a subset of activated cells showed bi-allelic expression of HMBOX1 together with expression of the provirus within one of the alleles. These results are discussed in light both of HIV-1 pathogenesis and of the potential use of lentiviral vectors for gene therapy applications.


Generation and characterization of a cell line carrying a stably integrated lentiviral vector

The HIV-Intro-MS2 × 24-ECFPskl-IRES-TK lentiviral vector (for simplicity: HIV-Intro) has been engineered to contain the elements required for RNA production: the 5' LTR, the major splice donor (SD1), the packaging signal Ψ, the RRE, the splice acceptor A7 and the 3' LTR that drives 3'-end formation (Figure 1). The construct carries also an array of 24 repeats of the MS2 phage coat protein within the intron, to increase specific detection of nascent mRNA, a reporter of gene expression (ECFP) fused to the peroxisome localization signal Ser-Lys-Leu (skl) and the selectable marker thymidine kinase (TK) of herpes simplex type 1.

Figure 1
figure 1

Genomic organization of the HMBOX1 gene and of the HIV-intro construct in HOS_A4 cells. Position of the RT-PCR primers are indicated by black arrows. Positions of the FISH probes are indicated by red bars.

In order to characterize this construct extensively before transduction, HeLa cells were transfected with plasmid HIV-Intro together with a plasmid expressing a monomeric DsRed-tagged Tat Figure 2A, top panels). As expected from previous studies showing transcribed nascent RNA by MS2-tagging [30, 31], bright yellow spots appeared within the nucleus. Each spot corresponds to several plasmids clustered together that express viral RNA [32]. As expected, Tat was found at transcription sites because it binds the 5'-end of each transcript. The reporter of gene expression ECFPskl was found in the cytoplasm. When a plasmid expressing a DsRed-tagged Rev was co-transfected together with Tat (without tag), the unspliced RNA was found in the cytoplasm, consistent with its Rev-mediated export (Figure 2A, bottom panels). These results are mirrored by the behavior in RT-PCR using a set of primers that distinguish pre-mRNA from spliced RNA. As shown in figure 2B, basal transcription is up-regulated by Tat with a higher proportion of spliced over unspliced RNA. Co-transfection of a plasmid encoding pEYFP-MS2nls does not affect the splicing reaction, ruling out perturbation of the system by such a strong RNA binding protein. Expression of Rev instead increased the proportion of unspliced RNA, consistent with its role in RRE-containing RNA stabilization and export.

Figure 2
figure 2

A) HeLa cells were cotransfected with pHIV-Intro, pEYFP-MS2nls and either mDsRed tagged Tat (top) or Tat and mDsRed tagged Rev (bottom). Yellow spots in the nucleus correspond to nascent RNA from transfected plasmids. Cyan spots in the cytoplasm correspond to ECFPskl localized to peroxisomes. B) RT-PCR on HeLa cells transfected as indicated. Three primers were used, their position is shown in Figure 1. Resulting bands correspond to the unspliced and spliced HIV-Intro RNAs. Bottom panels: β-actin loading control (M = molecular weight marker). C) Effect of RNAPII inhibitors on HIV-Intro transcription in transfected HeLa cells transfected as indicated in Figure 2A, top panels. Nuclei showing transcription spots were scored 1 hour (gray bars) an 6 hours (black bars) after treatment with Actinomicin D (10 μg/ml), α-Amanitin (10 μg/ml) or Flavopiridol (500 μM).

A key question that arose while doing these experiments was the real nature of these yellow spots in the nucleus (Figure 2A, top panels). To confirm that these where sites of HIV-Intro transcription we incubated the cells with inhibitors such as Actinomicin D, α-Amanitin or Flavopiridol. As shown in Figure 2C, a rapid decrease of the number of transcription spots was observed with all three inhibitors. Hence, RNA-dependent accumulation of RNA at these sites was dependent on RNAPII activity.

Next a strategy was designed to express the HIV-Intro construct from a single chromatinized location in a Tat-inducible way. Osteosarcoma HOS_143b cells, that are negative for thymidine kinase activity (TK-), were transduced with the HIV-Intro vector pseudotyped with the VSV-G envelope. To select for clones that carry an inducible integrated provirus, cells that constitutively expressed high levels of HSV-TK were selected against by treatment with 50 μg/ml ganciclovir. Surviving cells, that were either non-transduced, or transduced but with a low level of TK expression, were treated with GST-Tat and briefly selected for inducible HSV-TK expression in hypoxanthine, aminopterin and thymidine (HAT) medium. Clonal populations were obtained by limiting dilutions and colonies were visually scored for low basal level of ECFP expression in the cytoplasm and to be highly inducible by GST-Tat by fluorescence microscopy. The HOS_A4 cell clone showed a robust and homogenous induction of ECFPskl in the cytoplasm upon treatment with GST-Tat (Figure 3A). These cells were transfected with plasmids encoding EYFP-MS2nls and Tat-mDsRed. As shown in figure 3C, HOS_A4 showed ECFPskl in the cytoplasm and presented one single bright yellow spot in the nucleus compatible with a single site of HIV-Intro transcription that co-localized with Tat-mDsRed. Immuonfluorescence with antisera against Cyclin T1, the P-TEFb component recruited directly by Tat on the viral RNA, or RNAPII demonstrated enrichment of such factors at these sites (Figure 3D).

Figure 3
figure 3

A) Generation of HOS_A4 cells by transduction with HIV-Intro and selection as described in the text. Tat induction induced the expression of ECFPskl in the cytoplasm. Top panels: phase contrast. Bottom panels: ECFP channel. B) Southern blot analysis of HOS_A4 cells shows the presence of a single integration event. Genomic DNA was digested with XhoI or SpeI and hybridized with a probe encompassing ECFP. C) Effect of Tat-mDsRed on HOS_A4 cells. Co-localization of Tat and HIV-Intro RNA is shown on the single transcription spot present in HOS_A4 cells. Correct gene expression is demonstrated by the ECPFskl signal in the cytoplasm. D) Co-localization of RNAPII and Cyclin T1 on HOS_A4 transcription spots. Cells were transfected with pEYFP-MS2nls and Tat, fixed and Cyclin T1 (top panels) or RNAPII (bottom panels) detected by immunofluorescence as described in [30].

These results are compatible with one integration event of the HIV vector in HOS_A4 cells. Indeed, analysis by southern blotting (Figure 3B) and cloning of the integration sites by inverse PCR revealed that the provirus lay within the HMBOX1 (homeobox containing 1) cellular gene.

Allele-specific expression of HMBOX1following HIV-Intro-MS2 × 24-ECFPskl-IRES-TK transactivation

Human HMBOX1 is composed of 11 exons, spanning about 160 kb within chromosome 8 p21.1 (Figure 1). HMBOX1 is believed to encode for a transcription factor involved in the transcriptional regulation of key eukaryotic developmental processes. HMBOX1 is widely expressed in pancreas and the expression of this gene can also be detected in pallium, hippocampus and hypothalamus [33]. In HOS_A4 cells the HIV-Intro lentiviral vector integrated within the second giant intron of HMBOX1 (Figure 1). In order to assess expression of HMBOX1 in HOS cells RT-PCR was performed with primers specific for Exon3 or encompassing Exon2/Exon3 junction (Figure 4A). Both parental HOS 143b and HOS_A4 expressed HMBOX1 at similar levels. However, this assay was neither quantitative nor specific for the HMBOX1 allele carrying the integrated vector. A similar approach was also employed for the HIV-Intro transcript. As shown in Figure 4B, a basal level of HIV-Intro expression was detected in HOS_A4 that could be up regulated by Tat transfection.

Figure 4
figure 4

A) RT-PCR analysis of HMBOX1 expression in parental HOS_143b and clone HOS_A4 using primers for HMBOX1 Exon 3 (79 bp, top panel) and Exon 2/Exon 3 splicing (86 bp, middle panel). Bottom panel: β-actin control (230 bp). B) RT-PCR analysis of HIV-Intro expression (280 bp top panel) in parental HOS_143b and clone HOS_A4 using primers HIV_SPLICED and HIV_NUC (Table 1). Bottom panel: β-actin control (M = molecular weight marker; T = hours after Tat induction). C-H) Quantitative RT PCR for HIV-Intro, HMBOX1 and GAPDH expression using the indicated primers shown in Figure 1 and Table 1. Each histogram is the mean of three experiments normalized for β-actin expression and corrected for primer efficiency (E = 10-1/slope).

In order to detect allele-specific transcription in the HMBOX1 locus carrying the integrated provirus a quantitative RT-PCR was developed according to the protocol of Han and collaborators [29]. As shown in the diagram of Figure 1, RT-PCR primers were designed to detect also HMBOX1 transcripts containing HIV sequences upstream of the viral transcription start site. RNA from HOS_A4 cells was reverse-transcribed and the resulting cDNA was amplified with two primers that share the HIV_RT primer. HIV_UP5 amplifies only HIV-Intro sequences produced as a result of transcription of HMBOX1 reading through the HIV-Intro genome that is inserted into the gene. Because the forward primer is located upstream of the transcription start site and the reverse primer is located downstream of the LTR, only RNA species initiating upstream of the HIV-1 transcription start site could be amplified. HIV_START instead is able to amplify any HIV-Intro transcript that has initiated at the viral start site. To prevent amplification from HIV-1 DNA, isolated RNA was treated with DNase before RT-PCR. In addition, control reactions from which RT was omitted were included in each experiment and were invariably negative. A positive control of HIV transactivation involved a set of primers for the HIV-1 spliced RNA product (primers HIV_nuc and HIV_spliced). PCR amplification was conducted in the presence of the dye CyberGreen for relative quantification of PCR products. Transfection of Tat induced HIV-1 transcripts several folds with both primer sets detecting HIV-1 transcripts (Figure 4C and Figure 4D). This result is perfectly in line with the well-known Tat-transactivation of the viral LTR and with RT PCR data shown in Figure 4B[34]. Allele-specific detection of HMBOX1 RNA instead showed a marked decrease in response to Tat (Figure 4E). This result would be explained by negative interference with the expression of HMBOX1 due to activation of a strong promoter embedded within the gene. However, when the analysis was conducted on two primer sets specific for the HMBOX1 gene both in HOS_A4 and the parental HOS_143b we realized that expression of HMBOX1 was affected by the presence of Tat per se and not by the activation of the viral LTR (Figure 4F and Figure 4G). This effect was not a general effect on transcription since the GAPDH gene was not affected (Figure 4H).

Single-cell analysis of HMBOX1and HIV-1 expression

Ensemble-averaged analysis such as RT-PCR that relies on the evaluation of a number of cells does not allow distinction between expression of each HMBOX1 allele. In fact, although we analyzed allele-specific expression of the allele carrying the provirus, still we don't know whether HMBOX1 expression was balanced between alleles or not.

In order to evaluate the simultaneous expression of both HMBOX1 alleles and of the integrated provirus, transcripts were detected by quantitative fluorescent in situ RNA hybridization (FISH). The amount of RNA on the transcription spot is determined by the rate of transcription and the rate of RNA processing. At steady state it could be derived from the intensity of the fluorescence signal compared with the intensity of the signal from a known reference as described previously [30]. For this purpose for each probe and each acquisition we prepared a calibration curve spotting different amounts of probe on a coverslip in a constant volume. The probes were acquired and deconvoluted using the same conditions used for the samples (see Methods). The z-projection sum of all planes was averaged and this value represent the signal emitted by each amount of probe. Therefore, the number of probes for each voxel (a volume pixel in a three-dimensional image) could be calculated for each point of the calibration curve. In the case of HIV-Intro transcripts, the number of RNAs on the transcription spot in the presence of Tat was calculated to be 17 ± 4.

HMBOX1 expression was low and could not be detected by a single probe. Therefore a mixture of eight oligonucleotides, distributed in the first and fourth exon and in the second intron before and after the integration (Figure 1), were designed to detect the nascent unprocessed RNA transcripts of HMBOX1 (Table 1). As shown in figure 5A, in parental HOS-143b two spots of equal intensity were clearly visible in 37.2% of nuclei indicating that the HMBOX1 gene is expressed from both alleles, but only in a fraction of the asynchronous population of cells. This is not surprising since there is ample variation of the number of alleles/nucleus detected by this method, depending on how robust is gene expression and how efficient is the processing of the RNA; both contribute to the level of RNA at the site of transcription at steady state [35]. It is however unlikely that detection was lowered by scarce accessibility of the probe to the RNA since positive cells showed invariably two alleles of equal intensity, where in the case of technical problems there would be a higher proportion of single-allele expressing cells. The same was observed in derivative HOS_A4 cells where 36.8% of nuclei showed biallelic expression of the HMBOX1 gene (Figure 5C). The number of nascent RNAs present on the transcription site at steady-state was calculated to be significantly similar in both cell lines: 4.13 ± 1.02 for HOS_143b and 4.11 ± 0.91 for HOS_A4 (p value = 0.18) (Figure 6B). Most importantly, the ratio between the intensity of the signal of the two loci was invariably close to 1 in both cell lines demonstrating bi-allelic expression of the HMBOX1 gene with comparable levels (Figure 6C).

Figure 5
figure 5

A) FISH for HMBOX1 RNA on parental HOS_143b cells. Top panel: large field image (bar = 10 μm). Bottom panels: single cell from the figure above (inset). Two distinct hybridization signals per nucleus demonstrate bi-allelic expression of the HMBOX1 gene. B) Same as A after transfection of Tat-EGFP. Also the signal for Tat-EGFP is shown (middle panels). Bottom panel: merge of HMBOX1 hybridization and Tat-EGFP expression. C) FISH for HIV-Intro RNA on HOS_A4 cells. Top panel: large field image (bar = 10 μm). Bottom panels: single cell from the figure above (inset). Absence of the hybridization signal with the HIV probe is due to silencing of the gene without Tat (left panels). Two distinct hybridization signals per nucleus demonstrate bi-allelic expression of the HMBOX1 gene (middle panels).

Figure 6
figure 6

A) The number of cells expressing HMBOX1 in both loci before and after Tat-EGFP transfection is shown. B) The number of HMBOX1 RNAs on the transcription spots before and after Tat-EGFP transfection is shown. C) The ratio between the intensity of the hybridization signals on the transcription spots of the HMBOX1 alleles is shown. D) The intensity of the MS2 hybridization signal on HIV-Intro transcription spots is shown both on a HMBOX1 active and inactive background.

Table 1 Primers for RT-PCR and probes for RNA FISH used in this work.

Next we investigated the effect of Tat transfection. Consistently with what has been observed with RT PCR analysis, the number of Tat-transfected nuclei showing expression of the two HMBOX1 loci decreased (Figure 6A). This difference was significant in both cell lines (p value = 1.34 × 10-23 for HOS_143b and p value = 2.63 × 10-4 for HOS_A4). Interestingly, in those cells where both spots were detected, the number of RNAs and the ratio of the HMBOX1 alleles were not affected (Figure 6B and Figure 6C). HIV transcripts instead were present in most (93%) of Tat-EGFP transfected cells, consistent with transactivation of the viral LTR in the clonal population (Figure 7). Interestingly, most cells that express the vector do not show expression of HMBOX1 in both alleles, indicating that the effect on the expression of HMBOX1 was dependent on the expression of Tat and not on the transactivation of the provirus. Even more strikingly, in 11% of Tat-EGFP transfected nuclei both HMBOX1 alleles and the proviral transcript were active (Figure 7). In this subset of cells the ratio between HMBOX1 alleles was also close to 1 and the intensity of the proviral signal comparable to that of cells where there was no HMBOX1 expression. In fact, as shown in figure 6D, transcription of HIV-1 was not significantly affected by allele-specific HMBOX1 transcription (p value = 0.91). Hence, there are conditions where intragenic transcription of HIV-1 can occur in the presence of transcription of the host gene.

Figure 7
figure 7

FISH for HMBOX1 and HIV-Intro RNA on HOS_A4 cells after transfection of Tat-EGFP. Merged images are shown in color both for the large fields and for the zoomed figures (bar = 10 μm).


Integration of HIV-1 in host chromatin is a crucial event for viral pathogenesis. Chromatin control of provirus gene expression has been postulated to be a major determinant of post-integration latency that is the cause of failure to eradicate HIV-1 infection by current antiretroviral regimens [13]. In addition, development of lentiviral vectors for gene therapy requires that endogenous genes shouldn't be affected by the integration event. However, recent evidence suggests that down-modulation of HIV-1 expression occurs also within active genes, in the absence of a repressive chromatin context. A cell line harboring a Tat-inducible HIV-1 vector integrated within the HMBOX1 endogenous gene has been engineered in this work. This allowed the detailed investigation of the reciprocal influence of HIV-1 and HMBOX1 expression both with or without Tat induction.

As a result of the double selection procedure, HOS_A4 showed a basal level of HIV-1 RNA by RT PCR that could be assigned to HMBOX1 read-through transcription across the silent HIV-1 provirus since neither RNA was evident in FISH (Figure 5C) nor ECFPskl could be detected in the cytoplasm (Figure 3A). Tat overexpression, while increasing HIV-1 expression as expected, also reduced the level of expression of HMBOX1 in both alleles. Besides its essential role in trans-activating HIV-1 transcription, Tat is known to regulate key host cell functions, primarily at the level of transcription. For example, Tat down-regulates MHC class II by preventing the interaction of cyclin T1 with the class II transactivator CIITA [36]. It is conceivable that Tat, being able to interact with a variety of host factors required for HIV transactivation [13, 37], at the same time pulls these factors away from specific host genes, altering transcription from these promoters. Genome-wide expression profiling indeed revealed that Tat overexpression resulted in down-modulation of many cellular genes, possibly through targeting of general factors such as the SWI/SNF chromatin remodeling complex and the p300 acetyltransferase [38, 39]. Hence, HMBOX1 adds to the list of genes being down modulated by Tat overexpression. It is possible that during the establishment of the HOS_A4 cell line there has been a positive selection of integration loci where Tat induced repression of transcription. If expression of the endogenous gene interferes in cis with the expression of the provirus, the net result of Tat induction would have been of increased LTR-driven expression due also in part to the decrease of HMBOX1 expression. However, the number of HIV-1 nascent RNAs in Tat-transfected HOS_A4 cells that were negative for HMBOX1 expression was similar to the number of those where instead HMBOX1 was active (Figure 6D).

Another important finding of this work was that in some cells expression of the endogenous HMBOX1 gene and of the provirus coexisted at the same transcription site. This finding could not be anticipated since it was believed that transcriptional interference should have occurred. Transcriptional interference is the suppressive influence of one transcriptional process directly and in cis, on a second transcriptional process [40, 41]. Several combinations of the disposition of the two transcription units produce different effects [42]. For example, two promoters firing in opposite orientation would end up in collision of the two converging transcription elongation complexes. In HOS_A4 cells instead, the two promoters elongate in the same direction allowing a possible transcriptional interference through occlusion of the downstream promoter. In such model transcription from the upstream HMBOX1 promoter should transiently preclude the occupation by RNAPII and/or associated transcription factors of the downstream LTR promoter. Alternatively, the LTR could pose a roadblock to the progress of the transcription-elongation complex firing from the HMBOX1 promoter resulting in its inhibition of expression from one allele.

The observation that in the absence of Tat transcription both HMBOX1 alleles were equally expressed while expression from the viral LTR remained undetectable might indicate that transcription elongation of RNAPII across the HMBOX1 gene occluded the viral LTR. The situation changed dramatically in the presence of Tat. Transcription from the viral LTR is switched on as shown in RT PCR as well as demonstrated by the appearance of the viral transcript in RNA FISH. As discussed previously, Tat repressed HMBOX1 expression while allowing provirus transcription. However, in a subset of cells, expression of both genes coexisted. Recent data also analyzed HIV proviral gene expression from within a cellular gene [43]. Work by Peterlin's group showed transcriptional interference occurring from elongating polymerase firing from the host gene [44], whereas work from Silicano's laboratory showed that elongating polymerase from the host gene could enhance HIV transcription when orientated in the same direction [45]. It is difficult to compare directly these data with those presented here since different cell lines and host cell genes were studied. However the findings presented in this work show an alternative situation that will require further analysis to understand the molecular basis of the phenomenon.


In this work HIV-1 gene expression was studied from within the endogenous gene HMBOX1. Transcriptional Tat-transactivation of the viral LTR resulted in up-regulation of HIV-1 transcription while it repressed HMBOX1 gene expression in both alleles, independent of vector integration. Hence, it could be proposed that HIV-1 genome insertion in genes repressed by Tat could be an advantage for the virus allowing its transactivation from a low-transcribing endogenous gene. It has also been observed that both HIV-1 and HMBOX1 gene expression may occur at the same genomic location in the same cell, allowing speculation on the lack mutual interference between transcription units.


Plasmids and cells

Plasmid pHIV-Intro was derived from the plasmid pEV731 [27] by cloning 24 MS2 repeats into the NotI restriction site. A cassette encoding for ECFP with the peroxisome localization signal Ser-Lys-Leu (skl) was inserted between the ClaI and XhoI sites, the IRES from EMCV and the thymidine kinase from HSV-1 were cloned in the XhoI site [46]. To obtain cells transduced by the HIV-Intro lentiviral vector expressing the HSV-TK gene we exploited a protocol for negative and positive selection of TK- cells. HEK-293T cells were transfected with the vector plasmid pHIV-Intro together with the packaging plasmids as previously described [26]. The supernatant was filtered and used to transduce human bone osteosarcoma TK- cells (HOS 143b, ECACC n. 91112502). Next day cells were treated with ganciclovir at 50 μg/ml. Surviving cells were expanded and then treated with 2.5 μg/ml of GST-Tat to induce LTR expression as described previously [34, 47]. The following day cells were incubated in hypoxanthine, aminopterin and thymidine (HAT) medium and then cloned by colony picking and expansion in complete DMEM medium. Colonies were visually scored for low basal level of ECFP expression and to be highly inducible by GST-Tat by fluorescence microscopy. The number of integrations was assessed by southern blotting with a probe for ECFP and the cloning of the integration sites was obtained by inverse PCR essentially as described [26]. Briefly, genomic DNA was digested with BamHI (cleavage site within HIV-Exo-MS2 × 24), and the resulting products were circularized with DNA ligase. The product of two nested PCRs performed with primers pointing outwards from the vector was cloned and sequenced.

Plasmids expressing Tat and MS2 have been described previously [30, 48]. Tat and Rev were fused to pDsRedMonomer-N1 (Clontech) by PCR.

Allele-specific RT-PCR

Total RNA was extracted as described by the kit (Qiagen, RNA easy). Reverse transcription was performed with M-MLV RT (Invitrogen) using random primers. Amplification of the cDNA was conducted in the presence of CyberGreen™ (Applied Biosystems) and monitored on AbiPrism 7000 (Applied Biosystems). Specific primers are shown in Table 1.

Quantitative RNA fluorescent in situ hybridization (RNA FISH)

In situ hybridization was performed essentially as previously described [30]. Cells were fixed with 4% PFA pH7.4 and permeabilized overnight in 70% ethanol. Formamide concentration was 40% for HMBOX1 probes and 20% for MS2. The amino-allyl thymidine modified oligonucleotide probes (Table 1) have been synthesized by J-M Escudier (Platforme de synthèse d'Oligonucléotides modifiés de l'Interface Chimie Biologie de l'ITAV, Toulouse, France). Probes were labeled with Cy3 or Cy5 (Cyn monoreactive dye, Amersham). For quantitative measurements stacks of 21 planes were acquired at bin = 2 with steps of 0.5 μm in the z-axis using a wide-field Leica DMRI inverted microscope (63× objective, NA 1.3) controlled by Metamorph (Universal Imaging). Digital images were collected using a CoolSnap K CCD camera (Roper scientific). The three-dimensional deconvolution and reconstruction was performed with the ImageJ plug-in "Iterative Deconvolve 3D". The total light intensity at the transcription site was calculated and divided for the number of planes and the number of molecule were computed from a calibration curve of the probes in solution [30].


  1. Han Y, Wind-Rotolo M, Yang HC, Siliciano JD, Siliciano RF: Experimental approaches to the study of HIV-1 latency. Nat Rev Microbiol. 2007, 5: 95-106. 10.1038/nrmicro1580.

    Article  CAS  PubMed  Google Scholar 

  2. Bisgrove D, Lewinski M, Bushman F, Verdin E: Molecular mechanisms of HIV-1 proviral latency. Expert Rev Anti Infect Ther. 2005, 3: 805-814. 10.1586/14787210.3.5.805.

    Article  CAS  PubMed  Google Scholar 

  3. Marcello A: Latency: the hidden HIV-1 challenge. Retrovirology. 2006, 3: 7-10.1186/1742-4690-3-7.

    Article  PubMed Central  PubMed  Google Scholar 

  4. Daniel R, Smith JA: Integration site selection by retroviral vectors: molecular mechanism and clinical consequences. Hum Gene Ther. 2008, 19: 557-568. 10.1089/hum.2007.148.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  5. Hacein-Bey-Abina S, Von Kalle C, Schmidt M, McCormack MP, Wulffraat N, Leboulch P, Lim A, Osborne CS, Pawliuk R, Morillon E, Sorensen R, Forster A, Fraser P, Cohen JI, de Saint Basile G, Alexander I, Wintergerst U, Frebourg T, Aurias A, Stoppa-Lyonnet D, Romana S, Radford-Weiss I, Gross F, Valensi F, Delabesse E, Macintyre E, Sigaux F, Soulier J, Leiva LE, Wissler M, Prinz C, Rabbitts TH, Le Deist F, Fischer A, Cavazzana-Calvo M: LMO2-associated clonal T cell proliferation in two patients after gene therapy for SCID-X1. Science. 2003, 302: 415-419. 10.1126/science.1088547.

    Article  CAS  PubMed  Google Scholar 

  6. Schroder AR, Shinn P, Chen H, Berry C, Ecker JR, Bushman F: HIV-1 integration in the human genome favors active genes and local hotspots. Cell. 2002, 110: 521-529. 10.1016/S0092-8674(02)00864-4.

    Article  CAS  PubMed  Google Scholar 

  7. Wu X, Li Y, Crise B, Burgess SM: Transcription start regions in the human genome are favored targets for MLV integration. Science. 2003, 300: 1749-1751. 10.1126/science.1083413.

    Article  CAS  PubMed  Google Scholar 

  8. Mitchell RS, Beitzel BF, Schroder AR, Shinn P, Chen H, Berry CC, Ecker JR, Bushman FD: Retroviral DNA integration: ASLV, HIV, and MLV show distinct target site preferences. PLoS Biol. 2004, 2: E234-10.1371/journal.pbio.0020234.

    Article  PubMed Central  PubMed  Google Scholar 

  9. Ciuffi A, Llano M, Poeschla E, Hoffmann C, Leipzig J, Shinn P, Ecker JR, Bushman F: A role for LEDGF/p75 in targeting HIV DNA integration. Nat Med. 2005, 11: 1287-1289. 10.1038/nm1329.

    Article  CAS  PubMed  Google Scholar 

  10. Wang GP, Ciuffi A, Leipzig J, Berry CC, Bushman FD: HIV integration site selection: analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome Res. 2007, 17: 1186-1194. 10.1101/gr.6286907.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  11. De Palma M, Montini E, Santoni de Sio FR, Benedicenti F, Gentile A, Medico E, Naldini L: Promoter trapping reveals significant differences in integration site selection between MLV and HIV vectors in primary hematopoietic cells. Blood. 2005, 105: 2307-2315. 10.1182/blood-2004-03-0798.

    Article  CAS  PubMed  Google Scholar 

  12. Berkhout B, Silverman RH, Jeang KT: Tat trans-activates the human immunodeficiency virus through a nascent RNA target. Cell. 1989, 59: 273-282. 10.1016/0092-8674(89)90289-4.

    Article  CAS  PubMed  Google Scholar 

  13. Marcello A, Zoppe M, Giacca M: Multiple modes of transcriptional regulation by the HIV-1 Tat transactivator. IUBMB Life. 2001, 51: 175-181. 10.1080/152165401753544241.

    Article  CAS  PubMed  Google Scholar 

  14. Cujec TP, Cho H, Maldonado E, Meyer J, Reinberg D, Peterlin BM: The human immunodeficiency virus transactivator Tat interacts with the RNA polymerase II holoenzyme. Mol Cell Biol. 1997, 17: 1817-1823.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  15. Mavankal G, Ignatius Ou SH, Oliver H, Sigman D, Gaynor RB: Human immunodeficiency virus type 1 and 2 Tat proteins specifically interact with RNA polymerase II. Proc Natl Acad Sci USA. 1996, 93: 2089-2094. 10.1073/pnas.93.5.2089.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  16. Chiang CM, Roeder RG: Cloning of an intrinsic human TFIID subunit that interacts with multiple transcriptional activators. Science. 1995, 267: 531-536. 10.1126/science.7824954.

    Article  CAS  PubMed  Google Scholar 

  17. Blau J, Xiao H, McCracken S, O'Hare P, Greenblatt J, Bentley D: Three functional classes of transcriptional activation domain. Mol Cell Biol. 1996, 16: 2044-2055.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  18. Cujec TP, Okamoto H, Fujinaga K, Meyer J, Chamberlin H, Morgan DO, Peterlin BM: The HIV transactivator TAT binds to the CDK-activating kinase and activates the phosphorylation of the carboxy-terminal domain of RNA polymerase II. Genes Dev. 1997, 11: 2645-2657. 10.1101/gad.11.20.2645.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  19. Jeang KT, Chun R, Lin NH, Gatignol A, Glabe CG, Fan H: In vitro and in vivo binding of human immunodeficiency virus type 1 Tat protein and Sp1 transcription factor. J Virol. 1993, 67: 6224-6233.

    PubMed Central  CAS  PubMed  Google Scholar 

  20. Macian F, Rao A: Reciprocal modulatory interaction between human immunodeficiency virus type 1 Tat and transcription factor NFAT1. Mol Cell Biol. 1999, 19: 3645-3653.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  21. Marzio G, Tyagi M, Gutierrez MI, Giacca M: HIV-1 tat transactivator recruits p300 and CREB-binding protein histone acetyltransferases to the viral promoter. Proc Natl Acad Sci USA. 1998, 95: 13519-13524. 10.1073/pnas.95.23.13519.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  22. Benkirane M, Chun RF, Xiao H, Ogryzko VV, Howard BH, Nakatani Y, Jeang KT: Activation of integrated provirus requires histone acetyltransferase. p300 and P/CAF are coactivators for HIV-1 Tat. J Biol Chem. 1998, 273: 24898-24905. 10.1074/jbc.273.38.24898.

    Article  CAS  PubMed  Google Scholar 

  23. Hottiger MO, Nabel GJ: Interaction of human immunodeficiency virus type 1 Tat with the transcriptional coactivators p300 and CREB binding protein. J Virol. 1998, 72: 8252-8256.

    PubMed Central  CAS  PubMed  Google Scholar 

  24. Wei P, Garber ME, Fang SM, Fischer WH, Jones KA: A novel CDK9-associated C-type cyclin interacts directly with HIV-1 Tat and mediates its high-affinity, loop-specific binding to TAR RNA. Cell. 1998, 92: 451-462. 10.1016/S0092-8674(00)80939-3.

    Article  CAS  PubMed  Google Scholar 

  25. Marcello A, Lusic M, Pegoraro G, Pellegrini V, Beltram F, Giacca M: Nuclear organization and the control of HIV-1 transcription. Gene. 2004, 326: 1-11. 10.1016/j.gene.2003.10.018.

    Article  CAS  PubMed  Google Scholar 

  26. Jordan A, Bisgrove D, Verdin E: HIV reproducibly establishes a latent infection after acute infection of T cells in vitro. Embo J. 2003, 22: 1868-1877. 10.1093/emboj/cdg188.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  27. Jordan A, Defechereux P, Verdin E: The site of HIV-1 integration in the human genome determines basal transcriptional activity and response to Tat transactivation. Embo J. 2001, 20: 1726-1738. 10.1093/emboj/20.7.1726.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  28. Lewinski MK, Bisgrove D, Shinn P, Chen H, Hoffmann C, Hannenhalli S, Verdin E, Berry CC, Ecker JR, Bushman FD: Genome-wide analysis of chromosomal features repressing human immunodeficiency virus transcription. J Virol. 2005, 79: 6610-6619. 10.1128/JVI.79.11.6610-6619.2005.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  29. Han Y, Lassen K, Monie D, Sedaghat AR, Shimoji S, Liu X, Pierson TC, Margolick JB, Siliciano RF, Siliciano JD: Resting CD4+ T cells from human immunodeficiency virus type 1 (HIV-1)-infected individuals carry integrated HIV-1 genomes within actively transcribed host genes. J Virol. 2004, 78: 6122-6133. 10.1128/JVI.78.12.6122-6133.2004.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  30. Boireau S, Maiuri P, Basyuk E, de la Mata M, Knezevich A, Pradet-Balade B, Backer V, Kornblihtt A, Marcello A, Bertrand E: The transcriptional cycle of HIV-1 in real-time and live cells. J Cell Biol. 2007, 179: 291-304. 10.1083/jcb.200706018.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  31. Darzacq X, Shav-Tal Y, de Turris V, Brody Y, Shenoy SM, Phair RD, Singer RH: In vivo dynamics of RNA polymerase II transcription. Nat Struct Mol Biol. 2007, 14: 796-806. 10.1038/nsmb1280.

    Article  CAS  PubMed  Google Scholar 

  32. Binnie A, Castelo-Branco P, Monks J, Proudfoot NJ: Homologous gene sequences mediate transcription-domain formation. J Cell Sci. 2006, 119: 3876-3887. 10.1242/jcs.03050.

    Article  CAS  PubMed  Google Scholar 

  33. Chen S, Saiyin H, Zeng X, Xi J, Liu X, Li X, Yu L: Isolation and functional analysis of human HMBOX1, a homeobox containing protein with transcriptional repressor activity. Cytogenet Genome Res. 2006, 114: 131-136. 10.1159/000093328.

    Article  CAS  PubMed  Google Scholar 

  34. Lusic M, Marcello A, Cereseto A, Giacca M: Regulation of HIV-1 gene expression by histone acetylation and factor recruitment at the LTR promoter. Embo J. 2003, 22: 6550-6561. 10.1093/emboj/cdg631.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  35. Levsky JM, Shenoy SM, Pezo RC, Singer RH: Single-cell gene expression profiling. Science. 2002, 297: 836-840. 10.1126/science.1072241.

    Article  CAS  PubMed  Google Scholar 

  36. Kanazawa S, Okamoto T, Peterlin BM: Tat competes with CIITA for the binding to P-TEFb and blocks the expression of MHC class II genes in HIV infection. Immunity. 2000, 12: 61-70. 10.1016/S1074-7613(00)80159-4.

    Article  CAS  PubMed  Google Scholar 

  37. Jeang KT, Xiao H, Rich EA: Multifaceted activities of the HIV-1 transactivator of transcription, Tat. J Biol Chem. 1999, 274: 28837-28840. 10.1074/jbc.274.41.28837.

    Article  CAS  PubMed  Google Scholar 

  38. de la Fuente C, Santiago F, Deng L, Eadie C, Zilberman I, Kehn K, Maddukuri A, Baylor S, Wu K, Lee CG, Pumfery A, Kashanchi F: Gene expression profile of HIV-1 Tat expressing cells: a close interplay between proliferative and differentiation signals. BMC Biochem. 2002, 3: 14-10.1186/1471-2091-3-14.

    Article  PubMed Central  PubMed  Google Scholar 

  39. Gibellini D, Re MC, La Placa M, Zauli G: Differentially expressed genes in HIV-1 tat-expressing CD4(+) T-cell line. Virus Res. 2002, 90: 337-345. 10.1016/S0168-1702(02)00253-8.

    Article  CAS  PubMed  Google Scholar 

  40. Mazo A, Hodgson JW, Petruk S, Sedkov Y, Brock HW: Transcriptional interference: an unexpected layer of complexity in gene regulation. J Cell Sci. 2007, 120: 2755-2761. 10.1242/jcs.007633.

    Article  CAS  PubMed  Google Scholar 

  41. Shearwin KE, Callen BP, Egan JB: Transcriptional interference – a crash course. Trends Genet. 2005, 21: 339-345. 10.1016/j.tig.2005.04.009.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  42. Eszterhas SK, Bouhassira EE, Martin DI, Fiering S: Transcriptional interference by independently regulated genes occurs in any relative arrangement of the genes and is influenced by chromosomal integration position. Mol Cell Biol. 2002, 22: 469-479. 10.1128/MCB.22.2.469-479.2002.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  43. Perkins KJ, Proudfoot NJ: An ungracious host for an unwelcome guest. Cell Host Microbe. 2008, 4: 89-91. 10.1016/j.chom.2008.07.010.

    Article  CAS  PubMed  Google Scholar 

  44. Lenasi T, Contreras X, Peterlin BM: Transcriptional interference antagonizes proviral gene expression to promote HIV latency. Cell Host Microbe. 2008, 4: 123-133. 10.1016/j.chom.2008.05.016.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  45. Han Y, Lin YB, An W, Xu J, Yang HC, O'Connell K, Dordai D, Boeke JD, Siliciano JD, Siliciano RF: Orientation-dependent regulation of integrated HIV-1 expression by host gene transcriptional readthrough. Cell Host Microbe. 2008, 4: 134-146. 10.1016/j.chom.2008.06.008.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  46. Marcello A, Giaretta I: Inducible expression of herpes simplex virus thymidine kinase from a bicistronic HIV1 vector. Res Virol. 1998, 149: 419-431. 10.1016/S0923-2516(99)80010-7.

    Article  CAS  PubMed  Google Scholar 

  47. du Chéné I, Basyuk E, Lin YL, Triboulet R, Knezevich A, Chable-Bessia C, Mettling C, Baillat V, Reynes J, Corbeau P, Bertrand E, Marcello A, Emiliani S, Kiernan R, Benkirane M: Suv39H1 and HP1gamma are responsible for chromatin-mediated HIV-1 transcriptional silencing and post-integration latency. Embo J. 2007, 26: 424-435. 10.1038/sj.emboj.7601517.

    Article  PubMed  Google Scholar 

  48. Marcello A, Cinelli RA, Ferrari A, Signorelli A, Tyagi M, Pellegrini V, Beltram F, Giacca M: Visualization of in vivo direct interaction between HIV-1 TAT and human cyclin T1 in specific subcellular compartments by fluorescence resonance energy transfer. J Biol Chem. 2001, 276: 39220-39225. 10.1074/jbc.M104830200.

    Article  CAS  PubMed  Google Scholar 

Download references


We thank Edouard Bertrand (IGMM, Montpellier, France) for the gift of reagents and useful suggestions and Gianluca Pegoraro (NIH, Bethesda, US) for critically reading the manuscript. This work was supported in part by a HFSP Young Investigators Grant, by the Italian FIRB program of the "Ministero dell'Istruzione, Universita' e Ricerca" of Italy and by the AIDS Program of the "Istituto Superiore di Sanità" of Italy.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Alessandro Marcello.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

ADM and CB carried out the RT-PCR, ADM and PM carried out the in situ hybridization and quantitative analysis, AK, CV and AM prepared and characterized the cell line, AM contributed to the experimental design and coordination of the study, data analysis, as well as to writing the manuscript. All authors have read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

De Marco, A., Biancotto, C., Knezevich, A. et al. Intragenic transcriptional cis-activation of the human immunodeficiency virus 1 does not result in allele-specific inhibition of the endogenous gene. Retrovirology 5, 98 (2008).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: