- Open Access
Microarray analysis reveals global modulation of endogenous retroelement transcription by microbes
Retrovirologyvolume 11, Article number: 59 (2014)
A substantial proportion of both the mouse and human genomes comprise of endogenous retroelements (REs), which include endogenous retroviruses. Over evolutionary time, REs accumulate inactivating mutations or deletions and thus lose the ability to replicate. Additionally, REs can be transcriptionally repressed by dedicated mechanisms of the host. Nevertheless, many of them still possess and express intact open reading frames, and their transcriptional activity has been associated with many physiological and pathological processes of the host. However, this association remains tenuous due to incomplete understanding of the mechanism by which RE transcription is regulated. Here, we use a bioinformatics tool to examine RE transcriptional activity, measured by microarrays, in murine and human immune cells responding to microbial stimulation.
Immune cell activation by microbial signals in vitro caused extensive changes in the transcription not only of the host genes involved in the immune response, but also of numerous REs. Modulated REs were frequently found near or embedded within similarly-modulated host genes. Focusing on probes reporting single-integration, intergenic REs, revealed extensive transcriptional responsiveness of these elements to microbial signals. Microbial stimulation modulated RE expression in a cell-intrinsic manner. In line with these results, the transcriptional activity of numerous REs followed characteristics in different tissues according to exposure to environmental microbes and was further heavily altered during viral infection or imbalances with intestinal microbiota, both in mice and humans.
Together, these results highlight the utility of improved methodologies in assessing RE transcription profiles in both archived and new microarray data sets. More importantly, application of this methodology suggests that immune activation, as a result of infection with pathogens or dysbiosis with commensal microbes, causes global modulation of RE transcription. RE responsiveness to external stimuli should, therefore, be considered in any association between RE transcription and disease.
While the existence of repetitive genetic elements has been recognized since the 1950s, the scale of their contribution to overall genome size was only fully realized through the sequencing of the human and mouse genomes [1, 2]. In total, repetitive elements comprise around 40% of both genomes, representing millions of years of accumulation. Over 90% of these sequences are retroelements (REs), replicating through a mechanism of reverse transcription. This group comprises long and short interspersed nuclear elements (LINEs and SINEs), and long-terminal repeat (LTR)-retroelements. The latter include endogenous retroviruses (ERVs) and mammalian apparent LTR-retrotransposons (MaLRs) that together comprise around 9% of both genomes [1, 2].
Originally identified as leukemogenic agents in mice, both exogenous and endogenous retroviruses have been extensively studied for potential contributions to cancer and disease in many species . Many ERVs were integrated and fixed in the germ-line prior to many speciation events. During this time, they have suffered significant mutation, recombination, and deletion, and no infectious ERVs are currently recognized in the human genome . The potential influence of ERVs polymorphic in the human population  is unknown, however, and ERVs and other REs are increasingly implicated in distinct physiological and pathological processes of the host [4, 6].
Dependent on their relative distance and orientation, REs have been suggested to act as transcriptional promoters and enhancers, canonical and alternative transcription initiation and termination points, splice donor and acceptor sites  and polyadenylation signals . Further, there is increasing evidence that REs may be crucial components of the long intergenic non-coding RNA (lincRNA) regulatory system . Over 80% of lincRNAs have been found to contain REs, which were enriched around the transcription start site of the transcript, suggesting a role in expression regulation .
Through co-option by the host, REs, and ERVs in particular, can have more direct effects. The fusogenic and immunomodulatory roles of certain ERV envelope sequences have been acquired as ‘syncytins’ separately in a variety of placental mammals . Knock-out and knock-down studies have shown the crucial significance of these genes [11, 12]. More counterintuitively, endogenous retroviral sequences have also been co-opted to play roles in retroviral defense, as genes such as Fv1 and Fv4[13–16].
Despite the lack of infectious ERVs in the human genome, ERV-encoded envelope glycoprotein antigens have been suggested as putative autoantigens in human autoimmune conditions and viral-like particles have been observed in a variety of human diseases [17, 18]. Complicating the establishment of causality, however, viral-like particles have also been noted in breast milk and tissues from healthy individuals, and can be induced from transformed cells from healthy donors . Thus, while the potential impact of REs in infection and disease is a large area of current study, research is complicated by the scarcity of data describing their natural spatial and temporal patterns of transcription, and responsiveness to ubiquitous stimuli, including elements of diet . An improved understanding of these areas is increasingly important given the recent identification of REs as potential vaccination targets in both cancer and human immunodeficiency virus-1 (HIV-1) infection .
Using mice with distinct immunodeficiencies, we have previously reported the spontaneous emergence and establishment of replication-competent murine leukemia viruses (MLVs) through recombination between replication-defective ERVs . The appearance of infectious MLVs in immunodeficient mice was influenced by their exposure to environmental factors, most notably commensal microbes. It is possible that microbial stimulation induces the necessary expression of precursor ERVs, the first step in the recombination process, or the subsequent steps allowing the spread of these recombinant MLVs within and between animals. Although certain endogenous MLVs are known to be responsive to stimulation by microbial products, such as Toll-like receptor (TLR) agonists, ERV transcription is thought to be suppressed primarily by epigenetic silencing . Whether the induction of ERVs by microbial stimulation is common or isolated remains unknown. To address this question, we have employed a microarray-based method that allows the determination of ERV expression more broadly. Using this method, we describe extensive patterns of ERV modulation by commensal or pathogenic microbes in both murine and human tissues.
Results and discussion
RE-reporting probes frequently follow the expression of their neighboring gene
Studies of RE transcription have to date relied primarily on PCR-based methods [23, 24], which has rendered techniques limited in scope to either expression analysis of individual loci or, conversely, to determination of generic, ‘family-wide’, expression patterns. Expressed sequence tag (EST) analysis  and customized spotted and, more recently, in situ synthesized microarrays [26, 27] have also been used to determine RE expression. However, such methodologies require specialized expertise or equipment, preventing their application in the majority of exploratory settings. Nevertheless, work with microarrays and related Northern-based approaches has so far revealed the potential for human ERV (HERV) induction by a variety of methods, including UV irradiation  and cytokine exposure .
While it has been known for some time that microarray platforms from various commercial manufacturers contain probe sequences corresponding to repetitive genetic elements, the major focus in the literature has been on the removal of such probes from analysis pipelines [30, 31]. Recently, reversal of this methodology, allowing the compilation of such probes, has been shown to facilitate determination of the genome-wide expression patterns of large numbers of diverse REs . Previous work by Reichmann et al. detailed a methodology designed to identify probes reporting RE expression. This methodology was updated in this study to utilize the latest version of the mouse and human genome sequences and extended to a larger set of microarray platforms. Marginally increased numbers of probes were identified, likely due to differences in the RepeatMasker  and RepBase  libraries used and the masking sensitivity. High levels of overall correspondence in identified probes were achieved with the previous study  (data not shown). Whilst cross-hybridization of microarray probes may potentially affect the assessment of expression of members of high-copy repeat families, large percentages (70-95%) of identified RE-reporting probes were mapped uniquely at a ≥ 95% identity level and thus likely reported the expression of single elements. Where probes were uniquely matched to the genome in this way, the distances to the nearest 3′ and 5′ genes, as well as their identities, were also recorded.Using the Affymetrix Mouse Genome 430v2 (MG430v2) platform, where a probeset was noted as containing RE-reporting probes, a median of 3 probes from the group were identified (Figure 1A). Only 12% of probesets identified consisted of a majority (>75%) of RE-reporting probes, however, and over 20% of probesets contained only a single RE-reporting probe (Figure 1A). Further, 68% of RE-reporting probes identified were within or immediately adjacent to annotated protein-coding genes (Figure 1B), raising the confounding factor that many REs reported may be co-regulated with neighboring genes, are included in canonical genic transcripts, or represented in mRNAs corresponding to alternative isoforms or splice variants (Figure 1C). This confounding factor broadly impacts analyses made with virtually any methodology used to date, excepting in instances where elements are successfully, specifically and uniquely targeted.
To assess the potential impact of such co-regulation, three independent experiments using MG430v2, originally designed to determine tissue-specific expression patterns, were analyzed for significantly regulated RE-reporting probes. While obvious clustering of tissues was observed (data not shown), the most highly expressed RE-reporting probes were members of probesets reporting the expression of known tissue specific genes, including Tnnt2 (troponin T2, cardiac) within heart tissue , Ldb3 (LIM domain binding 3) within skeletal muscle , and Ighv14-2 (immunoglobulin heavy variable 14–2) within the spleen . Further supporting this observation, in a separate global analysis we found that when probesets contained a single RE-reporting probe, the behavior of the RE-reporting probe did not differ from that of the remainder of probes in the probeset across 9 tissues analyzed, in the vast majority of probesets (>86%) (p > 0.05, Holm-Bonferroni t test). To further investigate the extent of linkage between RE-reporting probe expression and that of a neighboring gene, correlation was assessed for heart tissue samples, which previously showed the greatest independence in RE-reporting probe expression. Varying significant (p < 0.0001) positive correlations were observed for LTR elements, LINEs and SINEs, suggesting expression patterns of neighboring genes explain ~30% of observed RE expression levels (Figure 2A).
While the differential regulation of RE-reporting probes in this manner may still have relevance, and indeed the transcriptional capacity of the RE may influence that of the gene, the independent regulation of REs within the genome cannot be easily assessed using this approach. To improve upon this, the published methodology was redesigned to increase stringency. Only RE-reporting probes from probesets that could be uniquely placed on the genome in a position intergenic to known protein-coding genes, and where >75% of probes were specific for a RE integration were retained. Numbers of probes passing this filtering are shown in Table 1.
Tissue-specific RE expression patterns were again assessed using this filtering (Additional file 1: Figure S1). While considerably fewer RE-reporting probes were identified as differentially regulated, samples clustered according to tissue and, secondarily, by experiment (Additional file 1: Figure S1). Although all three groups exhibited robust tissue specificity, LTR elements represented the majority of REs that differed between tissues, followed by LINEs and then by SINEs (Additional file 1: Figure S1). This order reflected the representation of LTR, LINE and SINE elements on the microarray platforms, which favored LTR elements, whereas LINEs and, to a greater degree, SINEs were underrepresented (Table 1), likely due to their more repetitive nature in comparison with LTR elements.
The correlation between RE and neighboring gene expression was again assessed, with weaker positive correlations (p < 0.0219) being observed as the result of the enhanced filtering of RE-reporting probes (Figure 2B). In this analysis, LINEs displayed marginally higher degree of co-regulation with their nearest gene than either LTR elements or SINEs (Figure 2B). Thus, in addition to differences in their representation on the microarray platforms, LTR, LINE and SINE expression may involve divergent transcriptional mechanisms and linkage with neighboring genes. For these reasons, the remaining analyses focus solely on investigation of LTR elements, which were separated into the three classes recognized according to sequence similarity , with MaLRs included in class III.
Assessment of RE expression in environmentally-exposed surfaces
Previous work had outlined a potential role for husbandry conditions and the presence of commensal microbiota in influencing rates and probability of endogenous MLV recombination and subsequent emergence of infectious virus in variously immunodeficient mice on the commonly-used C57BL/6 (B6) genetic background . To investigate this link further, a MG430v2 microarray dataset reporting expression patterns for environmental surfaces (lung, small and large intestine, and epidermis) was analyzed for RE expression (Figure 3A). Interestingly, all small and large intestine tissue samples showed elevated MLV expression. Expression in the intestinal tract was secondarily confirmed using an Affymetrix Mouse Gene 1.0 ST (MoGene1.0) dataset, which additionally showed in both the small intestine and lung high levels of mouse mammary tumor virus (MMTV) expression (Figure 3B), an ERV type not well represented in MG430v2. High levels of MMTV expression were confirmed in large intestine tissue samples by qRT-PCR (Figure 3C) using a methodology previously described , further supporting a potential link to microbial exposure in the control of ERV expression and validating the microarray data.
ERV expression in the gut is dependent on both microbiota and genotype
Microbial products are recognized by pattern recognition receptors, such as TLRs, and previous work has shown the widespread and diverse impacts of various TLR agonists on ERV expression in both murine and human cells . Subsequent to agonist recognition, TLR signaling converges through a limited number of downstream pathways, including, for many TLRs, a route including the Myd88 adapter molecule.
To further investigate the dependence of ERV expression on the presence of a microbiota and on signaling from microbial products, the developed microarray methodology was applied to a MoGene1.0 array comparing a range of gut tissues from both wild-type and Myd88−/− mice housed in both specific pathogen-free (SPF) and germ-free (GF) conditions (Figure 4A).
This analysis confirmed that, within wild-type mice, expression of certain RE families was dependent on the presence of the gut microbiota (Figure 4A). MLV expression, including that of the sole endogenous ecotropic MLV (eMLV) of B6 mice, Emv2, appeared entirely reliant on the presence of the microbiota. RLTR44-int (ERVK), MT2B (ERVL), and MMTV expression was also noticeably increased in SPF mice, albeit in tissue-specific manners (Figure 4A). A similar comparison within Myd88−/− mice, while also showing largely decreased expression in GF housing conditions, also revealed the retention of some tissue-specific ERV regulation patterns. This included limited MLV expression within individual mice across multiple tissues (Figure 4A). A proportion of probes showed an opposing expression pattern, being elevated in tissues from GF mice, but represented various classes of REs, and no grouping was noted.
Comparison within SPF mice shows a marked effect of genotype, with significantly (p < 10−7) reduced MLV expression across all tissues sampled in the absence of Myd88 (Figure 4A). This finding suggested a role for Myd88 in the sensing of microbial stimuli that induced MLV expression specifically in SPF mice.
Together, these data supported a role for the microbiota and microbial signaling in elevating basal expression of both MLVs and MMTVs in the gut. We had previously linked the probability of recombinational rescue of Emv2 to husbandry conditions, with no infectious virus being detectible in immunodeficient strains offered acidified water or maintained in entirely GF conditions. Interestingly, Myd88-/- mice were an exception to this rule, maintaining some positivity when maintained with acidified water sources in various facilities . GF Myd88-/- mice were not available at the time to assess whether this viral rescue was, in fact, independent of the microbiota. To further investigate this question, therefore, wild-type and Myd88−/−Ticam1−/- mice housed in GF conditions were compared with wild-type and Myd88-/- controls maintained in SPF facilities (Figure 4B). No evidence of emergent virus was seen in GF Myd88−/−Ticam1−/- mice.
Therefore, both the basal expression of MLVs and MMTVs in the gut, as well as the ultimate restoration of Emv2 infectivity and the emergence of infectious recombinant MLVs rely on the gut microbiota in all strains tested.
Microbial stimulation activates MLVs in a cell-autonomous manner
A recombinational rescue of Emv2, as previously noted in certain immunodeficient strains, would require transcription of not only the Emv2 provirus, but concurrent and sufficient expression of a number of suitable recombination partners. These requirements, followed by the stochastic process of successful recombination, may act as a rate-limiting step in the production of infectious exogenous MLVs.
Xmv43 (Bxv1), the expression of which is lipopolysaccharide (LPS)-inducible , was previously highlighted as a significant recombination partner in the rescue of Emv2. The potential for stimulation with LPS or other TLR agonists to produce simultaneous expression of both proviruses was, therefore, examined in bone marrow dendritic cells (BMDCs) (Figure 5A). Expression levels were also compared to treatment with the halogenated thymidine analogue bromodeoxyuridine (BrdU), a treatment known to induce Emv2 expression . Treatment with both LPS, a TLR4 agonist, and polyinosinic-polycytidylic acid (poly(I:C)), a TLR3 agonist, significantly induced expression of both proviruses in culture, although no treatment with a TLR agonist matched the induction of Emv2 seen upon BrdU treatment (Figure 5A). Treatment with Pam3CSK4, a TLR1/2 agonist, significantly induced Xmv43 expression but caused a non-significant reduction in Emv2 expression.
These data confirmed the possibility for TLR stimulation to cause the simultaneous expression of two viable recombination partners, but did not confirm that this occurred within the same cell. This requirement was investigated using co-culture of BMDCs produced from 129 mice, lacking Xmv43, and either wild-type or Tlr4−/− B6 mice, retaining Xmv43 but varying in their potential to respond to LPS stimulation (Figure 5B). Addition of LPS to co-cultures with Tlr4−/− BMDCs gave only a small level of Xmv43 induction, suggesting a minimal autocrine effect resulting from the stimulation of LPS-responsive 129 BMDCs. Significantly higher Xmv43 induction was seen upon stimulation of co-cultures containing LPS-responsive wild-type B6 BMDCs (Figure 5B), however, suggesting that the majority of expression occurs in a cell-intrinsic manner.
REs are significantly regulated on infection in both mice and humans
Recognition of pathogen-associated molecular patterns by pattern recognition receptors, such as TLRs, while perhaps a ubiquitous feature of the presence of commensals, is also more obviously associated with the detection of infection. Such signaling is crucial to the formation of appropriate defensive responses, and, alongside other pathways, can establish sustained differences in gene expression and protein production .
To investigate the potential impact of viral infection on RE expression, microarray data examining influenza A infection in two strains of mice was analyzed. B6 and DBA2 mice, respectively resistant and susceptible to infection with influenza A, show differing immune responses , and, likewise, RE expression also varied (Figure 6A and B). Interestingly, B6 and DBA2 mice have different complements of all classes of endogenous MLV loci [43, 44], and display divergent expression patterns of MLV expression upon infection with influenza A. MLV induction within DBA2 mice was transient, appearing at day 2 post-infection before returning to baseline, whereas induction in B6 was sustained from day 2 post-infection for the duration of the experiment (Figure 6C). This difference likely not only reflects distinct programs of cellular gene expression, but also the particular responsiveness of individual proviral integrations.
While various factors may impact RE expression in mice, the complement, age, and degeneracy of REs and ERVs differs markedly between the mouse and human genomes. To allow comparisons to human datasets, the developed microarray methodology was extended to a variety of human microarray platforms (Table 2). HERV-K elements, subdivided into the HML-1 to −11 subgroups, contain the most recently endogenized proviruses within the human genome. Certain HERV-K(HML-2) proviruses remain polymorphic within the human population  and are suggested to be expressed in various situations, including upon HIV-1 infection [45–47]. The potential diagnostic or therapeutic relevance of HML-2 proviruses is a large area of current study, and, consequently, whilst the sequence similarity of these elements complicates the interpretation of expression measures (highly similar elements likely contribute, at least partially, but by an unquantified amount, to the expression observed for specific probes) the activity of HERVK-int, LTR5A, LTR5B, and LTR5_Hs elements was investigated where possible.
Previous work has identified the potential regulation of HERV-W family proviruses by influenza A . To further translate the impact of influenza infection on the expression of murine REs to a human system, a comparative analysis of a human microarray dataset was made. This revealed a smaller effect of influenza infection (Figure 7A). Many fewer REs were significantly regulated, with similar numbers induced and repressed. The relatively small number of regulated elements found, whilst likely a factor of the size of the microarray platform used, may also be due to sampling peripheral blood, which might not reflect the full extent of disease activity in the target organ (lung).To investigate RE activity directly in an affected organ during viral infection, we applied the developed method on data from lymph node biopsies isolated from HIV-1-infected or uninfected individuals. Analysis of patients with acute HIV-1 infection or AIDS in comparison with healthy controls revealed a much larger number of significantly regulated elements (Figure 7B). Again, samples could be clustered effectively according to RE expression (Figure 7B).
Lastly, we examined if, similarly with their murine counterparts, expression of human REs and ERVs is influenced by exposure to microbial stimulation not only following infection, but also as a result of imbalanced homeostasis with gut microbes. Increasing volumes of research focus not only on the gut microbiome, but also on enteric fungal and viral constituents and the establishment and maintenance of gut immune homeostasis . Fungal and viral patterns may also cause TLR stimulation, but are also recognized by a number of external pathways, which may act cooperatively or independently of TLRs. Dectin-1, for example, is suggested to allow the recognition of β-glucans, major constituents of the fungal cell wall . To capture the complexity of such interactions, we compared human RE transcriptional profiles in gut biopsies from healthy individuals and ulcerative colitis patients. This analysis revealed extensive regulation, both induction and suppression, of a large number of REs in diseased tissue samples (Figure 7C).The potential regulation of HML-2 elements was investigated in all three cases, but low numbers of reporting probes prevent detailed analysis. A single HML-2-specific transcript reported by a LTR5A probe was upregulated in influenza A infection (Figure 7A). Transcripts reported by two probes (LTR5B and LTRBA/B) were modulated in acute HIV-1 infection and subsequent progression to AIDS (Figure 7B). Both of these were, however, reduced in abundance in infected individuals compared with uninfected controls (Figure 7B). In contrast, transcripts reported by three HML-2 specific probes (2 LTR5B and a LTR5_Hs) were significantly increased in ulcerative colitis samples in comparison with biopsies from healthy individuals (Figure 7C).
Thus, the analysis of tissues from individuals with viral infection or dysbiosis with intestinal microbiota demonstrated extensive modulation of RE activity, including members of the HML-2 family. However, due to the complex cellular composition of these tissues, combined with changes in this composition during infection or inflammation, these data did not allow determination of whether RE transcriptional changes were the result of genuine modulation in a specific cell-type or a side-effect of changing cellular composition of complex tissues. For example, the apparent decrease or increase of HML-2 activity in HIV-1 infection or ulcerative colitis samples, respectively, may simply represent the relative presence of lymphocytes or other hematopoietic cells in the tissue. Therefore, cell-intrinsic modulation of RE activity would require investigation of single cell types.
Human RE transcriptional modulation by microbial stimulation is cell-intrinsic
To address this issue of cell composition in inflamed or healthy tissues, we analyzed the transcriptional activity of REs in specific human cell types either isolated ex vivo from human viral infection or exposed to microbial stimuli in vitro. The activity of several human REs was found altered in purified CD11c+ myeloid DCs isolated from peripheral blood mononuclear cells (PBMCs) of HIV-infected or uninfected individuals (Figure 8A). HML-2 transcripts reported by two of the three HML-2-specific probes that were found modulated in this comparison were downregulated in HIV-1 infection, whereas the third was upregulated (Figure 8A).
In a separate experiment, human DCs experimentally treated in vitro with HIV-1-based viruses and with Simian immunodeficiency virus (SIV) viral-like particles, a treatment that allows DC infection, exhibited an altered RE expression profile in comparison with all other treatment groups (Figure 8B), but no HML-2-specific probe was significantly regulated, potentially due to the omission of unstimulated control samples.
Lastly, human DCs in vitro infected with Leishmania major also considerably altered their RE expression profile, with numerous elements, including several HML-2 elements, significantly induced (Figure 8C). Induction of some REs appeared very rapid (2–4 hours), whereas other REs required prolonged stimulation (24 hours) (Figure 8C). Thus, direct microbial stimulation or infection of purified human immune cells causes extensive modulation of RE activity.
Commercial microarray platforms contain thousands of RE-reporting probes, which can be used to assess RE transcriptional activity in a wealth of available data sets. However, these RE-reporting probes frequently correspond to REs that are near or within hosts genes and appear co-regulated with their nearest gene. Such co-regulation may be due to the capacity of REs to influence gene expression patterns within distinct cell types and to contribute to establishing the cell identity. It may also be partly due to the efforts of microarray manufacturers to focus on host gene transcription. Indeed, different microarray platforms detect certain RE families with variable coverage, and, therefore, the representation of REs in any one platform is incomplete. We further refined the microarray-based method to filter for RE-reporting probes identified as intergenic and as belonging to probesets where the majority of constituent probes report RE expression, to show global modulation of RE transcription at the level of individual cells or entire organs in both humans and mice exposed to microbial stimulation. As RE representation in this analysis is not complete, it is likely that the effect of microbial exposure on RE activity is even more extensive.
It is becoming clear that gene expression patterns are not fixed within cell types. Several cell types will respond to cues from other cells or the environment, and this is particularly true for immune cells responding to, for example an infection. Transcriptional reprogramming of immune cells also involves REs. In addition to immune cells tasked with sensing microbes, organs that are constantly exposed to the environment will express REs according to their microbial exposure. By being responsive to external stimuli, REs may not only participate in establishing the cell identity during development, but also help rewire gene expression networks to new patterns, ones that underlie the cellular response to these external stimuli.
Identification of probes reporting retroelement expression
The GRCm38.72 and GRCh37.72 releases of the mouse and human genomes were downloaded with accompanying gene annotation files and local BLAST + databases were constructed using BLAST 2.2.28+. RepeatMasker 4.0.3 (configured with TRF 4.04  and RMBLASTn 2.2.28+ alongside the 20120418 RepBase library ) was used to mask both genomes using the ‘-s’ (sensitive) parameter. Microarray probe sequences and unique identification numbers were obtained either from annotation databases supplied for use with the ‘oligo’  microarray analysis Bioconductor  package or from the manufacturer’s website.
A Python (http://www.python.org) script was produced to run and query BLASTn of the downloaded probes against the relevant genome using the ‘-task blastn-short’ parameter. The number of times an individual probe could be localized to the genome with ≥95% identity was recorded, along with the location of the highest scoring hit. A further Python script was used to parse these data to identify probes falling entirely within regions masked by RepeatMasker and to identify those in the correct orientation to report sense expression of the particular element. For technologies hybridizing antisense cRNA (e.g. Affymetrix 3′ microarrays), probes are sense to the retroelement, whereas for technologies hybridizing sense cDNA (e.g. Affymetrix Gene microarrays), probes are required to be antisense to the retroelement. The nearest genes chromosomally 5′ and 3′, as well as their locations, were recorded from the gene annotation files and, together, this information was compiled to form an annotation file for probes identified as reporting retroelement expression. Where probes were originally identified as reporting expression from multiple genomic loci, annotation information requiring a specific genomic context was omitted. This probe list was filtered using an additional script for probes derived from probesets where >75% of probes report retroelement expression, and where the probe was identified as >1 kb from the nearest protein coding gene. Annotation files are supplied as Additional files 2 and 3.
Analysis of Affymetrix microarray data
Raw CEL files corresponding to accessions E-GEOD-97 , E-GEOD-1986, E-GEOD-6269 , E-GEOD-9954 , E-GEOD-10246 , E-GEOD-16363 , E-GEOD-17438 , E-GEOD-22589 , E-GEOD-24940 , E-GEOD-38713 , E-GEOD-42058 , E-GEOD-42088, and E-MTAB-835  were downloaded from ArrayExpress (http://www.ebi.ac.uk/arrayexpress). Pseudo-images of the array chips were visually inspected for spatial artifacts and arrays that passed this inspection were analyzed at the probe level with a custom R script utilizing routines provided within ‘oligo’. Perfect-match (PM) probe expression data for the entire dataset were RMA background corrected and quantile normalized before log2 transformation and export. Downstream analysis, probe annotation, batch-effect correction (where appropriate), and heatmap production was thereafter performed with Qlucore Omics Explorer (Qlucore, Lund, Sweden). To reduce the size of heatmaps and to decrease artificial clustering resulting from multiple probes from the same probeset, probes identified as significant were collapsed into their respective probesets using facilities build into Qlucore Omics Explorer.
Other figure production and statistical analysis was performed with SigmaPlot v12 (Systat Software Inc, San Jose, CA, USA).
Calculation of the one-step Tukey’s biweight w-estimator for probeset expression followed the algorithms defined by Affymetrix . For a number,N, of probe expression values, x, where denotes the median of x, and S denotes the median absolute deviation of x, the w-estimator is calculated as , where and , given the fixed values c = 5 and ϵ = 0.0001.
Inbred B6 and 129 wild-type strains, as well as B6-backcrossed MyD88-deficient B6.129P2-Myd88tm1Aki (Myd88−/−) and TLR4-deficient B6.129P2-Tlr4tm1Aki (Tlr4−/−) mice have been described [66, 67]. Mice were bred in individually ventilated cages (IVCs) before being transferred to SPF facilities at the NIMR, and maintained on UV-irradiated, filtered neutral pH water. B6 and B6.129P2-Myd88tm1AkiTicam1tm1Aki (Myd88−/−Ticam1−/−) mice, additionally deficient for toll-like receptor adaptor molecule 1 (TICAM-1) , were also maintained in germ-free facilities at the Unit for Laboratory Animal Medicine, University of Michigan, MI, USA (UMICH) and kept on autoclaved distilled water. Animal experiments were approved by the ethical committee of the NIMR, and conducted according to local guidelines and UK Home Office regulations under the Animals Scientific Procedures Act 1986 (ASPA) and the authority of Project License PPL 70/7643.
For the production of BMDCs, bone marrow was flushed from the femurs and tibiae of culled mice and incubated in IMDM supplemented with 5% FCS (Sigma-Aldrich, St Louis, MO, USA) and 10% GM-CSF for 7 days at 37°C and 5% CO2. Adherent DCs could typically be obtained after this time at a purity of 50-70%. TLR agonists were introduced for 48 hours at 1 μg/ml for LPS (from Salmonella minnesota R595, Axxora, CA, USA), 10 μg/ml for poly(I:C) (Sigma-Aldrich) and 0.25 μg/ml for Pam3CSK4 (Axxora). BrdU (Sigma-Aldrich) was introduced at 20 μg/ml.
qRT-PCR and microarray analyses
Prior to cDNA preparation, all samples were stored in RNAlater (Qiagen, Hilden, Germany) at −20°C. Where tissues were processed, samples were disrupted using a TissueLyser LT (Qiagen). RNA was extracted from samples using RNeasy spin columns (Qiagen) and extracted nucleic acids were subjected to DNaseI (Qiagen) treatment in solution and a further column cleanup. RNA for qRT-PCR was reverse transcribed using the Applied Biosystems (Carlsbad, CA, USA) high capacity reverse transcription kit with an added RNase-inhibitor (Promega Biosciences, Madison, WI, USA) and cDNA was cleaned using QIAquick spin columns (Qiagen). All elutions were conducted with nuclease-free water (Qiagen).
Purified cDNA was used as template for the amplification of target gene transcripts with SYBR Green PCR master mix (Applied Biosystems) using the ABI Prism SDS 7000 and 7900HT machines (Applied Biosystems). Target gene expression was determined relative to Hprt using the ΔCT method using previously-described primer sets and methodology . In plots showing expression, a hashed line indicating the theoretical detection limit is shown. Fold change values are calculated against an unstimulated control, represented by the hashed line, which is standardized to 1.
GRY and BM are post-doctoral Career Development Fellows in GK’s laboratory. GK is a program leader at MRC National Institute for Medical Research, UK, and Professor of Retrovirology at Imperial College London, UK.
Bone marrow dendritic cell
Expressed sequence tag
Human immunodeficiency virus 1
Long intergenic non-coding RNA
Long interspersed nuclear element
Murine leukemia virus
Mouse mammary tumor virus
Peripheral blood mononuclear cell
Short interspersed nuclear element
Simian immunodeficiency virus
Mouse Genome Sequencing Consortium: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420: 520-562.
International Human Genome Sequencing Consortium: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921.
Kassiotis G: Endogenous retroviruses and the development of cancer. J Immunol. 2014, 192: 1343-1349.
Stoye JP: Studies of endogenous retroviruses reveal a continuing evolutionary saga. Nat Rev Microbiol. 2012, 10: 395-406.
Subramanian RP, Wildschutte JH, Russo C, Coffin JM: Identification, characterization, and comparative genomic distribution of the HERV-K (HML-2) group of human endogenous retroviruses. Retrovirology. 2011, 8: 90-
Young GR, Stoye JP, Kassiotis G: Are human endogenous retroviruses pathogenic? An approach to testing the hypothesis. Bioessays. 2013, 35: 794-803.
Cohen CJ, Lock WM, Mager DL: Endogenous retroviral LTRs as promoters for human genes: a critical assessment. Gene. 2009, 448: 105-114.
Mager DL, Hunter DG, Schertzer M, Freeman JD: Endogenous retroviruses provide the primary polyadenylation signal for two new human genes (HHLA2 and HHLA3). Genomics. 1999, 59: 255-263.
Li L, Feng T, Lian Y, Zhang G, Garen A, Song X: Role of human noncoding RNAs in the control of tumorigenesis. Proc Natl Acad Sci U S A. 2009, 106: 12956-12961.
Lavialle C, Cornelis G, Dupressoir A, Esnault C, Heidmann O: Paleovirology of “syncytins”, retroviral env genes exapted for a role in placentation. Philos Trans R Soc B Biol Sci. 2013, 386: 1471-2970.
Dunlap KA, Palmarini M, Varela M, Burghardt RC, Hayashi K, Farmer JL, Spencer TE: Endogenous retroviruses regulate periimplantation placental growth and differentiation. Proc Natl Acad Sci U S A. 2006, 103: 14390-14395.
Dupressoir A, Vernochet C, Bawa O, Harper F, Pierron G, Opolon P, Heidmann T: Syncytin-A knockout mice demonstrate the critical role in placentation of a fusogenic, endogenous retrovirus-derived, envelope gene. Proc Natl Acad Sci U S A. 2009, 106: 12127-12132.
Best S, Le Tissier P, Towers G, Stoye JP: Positional cloning of the mouse retrovirus restriction gene Fv1. Nature. 1996, 382: 826-829.
Yan Y, Buckler-White A, Wollenberg K, Kozak CA: Origin, antiviral function and evidence for positive selection of the gammaretrovirus restriction gene Fv1 in the genus Mus. Proc Natl Acad Sci U S A. 2009, 106: 3259-3263.
Ikeda H, Laigret F, Martin MA, Repaske ROY: Characterization of a molecularly cloned retroviral sequence associated with Fv-4 resistance. J Virol. 1985, 55: 768-777.
Goff SP: Retrovirus restriction factors. Mol Cell. 2004, 16: 849-859.
Downey RF, Sullivan FJ, Wang-johanning F, Ambs S, Giles FJ, Glynn SA: Human endogenous retrovirus K and cancer: innocent bystander or tumorigenic accomplice?. Front Oncol. 2014, doi:10.1002/ijc.29003
Magiorkinis G: “There and back again”: revisiting the pathophysiological roles of human endogenous retroviruses in the post-genomic era. Philos Trans R Soc B Biol Sci. 2013, 368: 20120504-
Waterland RA, Jirtle RL: Transposable elements: targets for early nutritional effects on epigenetic gene regulation. Mol Cell Biol. 2003, 23: 5293-5300.
Sacha JB, Kim I, Chen L, Jakir H, Goodwin DA, Simmons HA, Daniel I, Von Pelchrzim F, Gifford RJ, Nimityongskul FA, Newman LP, Lappin PB, Hammond D, Piaskowski SM, Reed JS, Kerry A, Tharmanathan T, Zhang N, Rieger M, Fernandes C, Ii JPG, Gebhard DH, Shoieb A, Pierce BG, Trajkovic D, Rakasz E, Rong S, Mccluskie M, Christy C, Merson JR, et al: Vaccination with cancer- and HIV infection-associated endogenous retrotransposable elements is safe and immunogenic. J Immunol. 2012, 189: 1467-1479.
Young GR, Eksmond U, Salcedo R, Alexopoulou L, Stoye JP, Kassiotis G: Resurrection of endogenous retroviruses in antibody-deficient mice. Nature. 2012, 491: 774-778.
Slotkin RK, Martienssen R: Transposable elements and the epigenetic regulation of the genome. Nat Rev Genet. 2007, 8: 272-285.
Okahara G, Matsubara S, Oda T, Sugimoto J, Jinno Y, Kanaya F: Expression analyses of human endogenous retroviruses (HERVs): tissue-specific and developmental stage-dependent expression of HERVs. Genomics. 2004, 84: 982-990.
Muradrasoli S, Forsman A, Hu L, Blikstad V, Blomberg J: Development of real-time PCRs for detection and quantitation of human MMTV-like (HML) sequences HML expression in human tissues. J Virol Methods. 2006, 136: 83-92.
Oja M, Peltonen J, Blomberg J, Kaski S: Methods for estimating human endogenous retrovirus activities from EST databases. BMC Bioinformatics. 2007, 8 Suppl 2: S11-
Seifarth W, Spiess B, Zeilfelder U, Speth C, Hehlmann R, Leib-Mösch C: Assessment of retroviral activity using a universal retrovirus chip. J Virol Methods. 2003, 112: 79-91.
Pérot P, Mugnier N, Montgiraud C, Gimenez J, Jaillard M, Bonnaud B, Mallet F: Microarray-based sketches of the HERV transcriptome landscape. PLoS One. 2012, 7: e40194-
Hohenadl C, Germaier H, Walchner M, Hagenhofer M, Herrmann M, Sturzl M, Kind P, Hehlmann R, Erfle V, Leib-Mösch C: Transcriptional activation of endogenous retroviral sequences in human epidermal keratinocytes by UVB irradiation. J Invest Dermatol. 1999, 113: 587-594.
Katsumata K, Ikeda H, Sato M, Ishizu A, Kawarada Y, Kato H, Wakisaka A, Koike T, Yoshiki T: Cytokine regulation of env gene expression of human endogenous retrovirus-R in human vascular endothelial cells. Clin Immunol. 1999, 93: 75-80.
Dai M, Wang P, Boyd AD, Kostov G, Athey B, Jones EG, Bunney WE, Myers RM, Speed TP, Akil H, Watson SJ, Meng F: Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 2005, 33: e175-
De Leeuw WC, Rauwerda H, Jonker MJ, Breit TM: Salvaging Affymetrix probes after probe-level re-annotation. BMC Res Notes. 2008, 1: 66-
Reichmann J, Crichton JH, Madej MJ, Taggart M, Gautier P, Garcia-Perez JL, Meehan RR, Adams IR: Microarray Analysis of LTR retrotransposon silencing identifies Hdac1 as a regulator of retrotransposon expression in mouse embryonic stem cells. PLoS Comput Biol. 2012, 8: e1002486-
Smit AFA, Hubley R, Green P: RepeatMasker Open-3.0. 1996–2004, [http://www.repeatmasker.org]
Kohany O, Gentles AJ, Hankus L, Jurka J: Annotation, submission and screening of repetitive elements in repbase: RepbaseSubmitter and censor. BMC Bioinformatics. 2006, 7: 474-
Chen Z, Friedrich GA, Soriano P: Transcriptional enhancer factor 1 disruption by a retroviral gene trap leads to heart defects and embryonic lethality in mice. Genes Dev. 1994, 8: 2293-2301.
Faulkner G, Pallavicini A, Formentin E, Comelli A, Ievolella C, Trevisan S, Bortoletto G, Scannapieco P, Salamon M, Mouly V, Valle G, Lanfranchi G: ZASP: a new Z-band alternatively spliced PDZ-motif protein. J Cell Biol. 1999, 146: 465-475.
Taylor BA, Bailey DW, Cherry M, Riblet R, Weigert M: Genes for immunoglobulin heavy chain and serum prealbumin protein are linked in mouse. Nature. 1975, 256: 644-646.
Stocking C, Kozak CA: Murine endogenous retroviruses. Cell Mol Life Sci. 2008, 65: 3383-3398.
Stoye JP, Moroni C: Endogenous retrovirus expression in stimulated murine lymphocytes. Identification of a new locus controlling mitogen induction of a defective virus. J Exp Med. 1983, 157: 1660-1674.
Kozak CA, Rowe WP: Genetic mapping of ecotropic murine leukemia virus-inducing loci in six inbred strains. J Exp Med. 1982, 155: 524-534.
Cláudio N, Dalet A, Gatti E, Pierre P: Mapping the crossroads of immune activation and cellular stress response pathways. EMBO J. 2013, 32: 1214-1224.
Alberts R, Srivastava B, Wu H, Viegas N, Geffers R, Klawonn F, Novoselova N, Do Valle TZ, Panthier J-J, Schughart K: Gene expression changes in the host response between resistant and susceptible inbred mouse strains after influenza A infection. Microbes Infect. 2010, 12: 309-318.
Stoye JP, Coffin JM: Polymorphism of murine endogenous proviruses revealed by using virus class-specific oligonucleotide probes. J Virol. 1988, 62: 168-175.
Jenkins NA, Copeland NG, Taylor BA, Lee BK: Organization, distribution, and stability of endogenous ecotropic murine leukemia virus DNA sequences in chromosomes of Mus musculus. J Virol. 1982, 43: 26-36.
Brinzevich D, Young GR, Sebra R, Ayllon J, Maio SM, Deikus G, Chen BK, Fernandez-Sesma A, Simon V, Mulder LCF: HIV-1 Interacts with Human Endogenous Retrovirus K (HML-2) Envelopes Derived from Human Primary Lymphocytes. J Virol. 2014, 88: 6213-6223.
Gonzalez-Hernandez MJ, Cavalcoli JD, Sartor MA, Contreras-Galindo R, Meng F, Dai M, Dube D, Saha AK, Gitlin SD, Omenn GS, Kaplan MH, Markovitz DM: Regulation of the HERV-K (HML-2) transcriptome by the HIV-1 Tat protein. J Virol. 2014, doi:10.1128/JVI.00556–14
Contreras-Galindo R, Kaplan MH, Contreras-Galindo AC, Gonzalez-Hernandez MJ, Ferlenghi I, Giusti F, Lorenzo E, Gitlin SD, Dosik MH, Yamamura Y, Markovitz DM: Characterization of human endogenous retroviral elements in the blood of HIV-1-infected individuals. J Virol. 2012, 86: 262-276.
Nellåker C, Yao Y, Jones-Brando L, Mallet F, Yolken RH, Karlsson H: Transactivation of elements in the human endogenous retrovirus W family by viral infection. Retrovirology. 2006, 3: 44-
Mavrommatis B, Young GR, Kassiotis G: Counterpoise between the microbiome, host immune activation and pathology. Curr Opin Immunol. 2013, 25: 456-462.
Iliev ID, Funari VA, Taylor KD, Nguyen Q, Reyes CN, Strom SP, Brown J, Becker CA, Fleshner PR, Dubinsky M, Rotter JI, Wang HL, Mcgovern DPB, Brown GD, Underhill DM: Interactions between commensal fungi and the C-type lectin receptor dectin-1 influence colitis. Science. 2012, 336: 1314-1317.
Benson G: Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999, 27: 573-580.
Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J: Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005, 110: 462-467.
Carvalho BS, Irizarry RA: A framework for oligonucleotide microarray preprocessing. Bioinformatics. 2010, 26: 2363-2367.
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JYH, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5: R80-
Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, Wiltshire T, Orth AP, Vega RG, Sapinoso LM, Moqrich A, Patapoutian A, Hampton GM, Schultz PG, Hogenesch JB: Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci U S A. 2002, 99: 4465-4470.
Ramilo O, Allman W, Chung W, Mejias A, Ardura M, Glaser C, Wittkowski KM, Piqueras B, Banchereau J, Palucka AK, Chaussabel D: Gene expression patterns in blood leukocytes discriminate patients with acute infections. Blood. 2007, 109: 2066-2077.
Thorrez L, Van Deun K, Tranchevent L-C, Van Lommel L, Engelen K, Marchal K, Moreau Y, Van Mechelen I, Schuit F: Using ribosomal protein genes as reference: a tale of caution. PLoS One. 2008, 3: e1854-
Lattin JE, Schroder K, Su AI, Walker JR, Zhang J, Wiltshire T, Saijo K, Glass CK, Hume DA, Kellie S, Sweet MJ: Expression analysis of G protein-coupled receptors in mouse macrophages. Immunome Res. 2008, 4: 5-
Li Q, Smith AJ, Schacker TW, Carlis JV, Duan L, Reilly CS, Haase AT: Microarray analysis of lymphatic tissue reveals stage-specific, gene expression signatures in HIV-1 infection. J Immunol. 2009, 183: 1975-1982.
Larsson E, Tremaroli V, Lee YS, Koren O, Nookaew I, Fricker A, Nielsen J, Ley RE, Bäckhed F: Analysis of gut microbial regulation of host gene expression along the length of the gut and regulation of gut microbial ecology through MyD88. Gut. 2012, 61: 1124-1131.
Manel N, Hogstad B, Wang Y, Levy DE, Unutmaz D, Littman DR: A cryptic sensor for HIV-1 activates antiviral innate immunity in dendritic cells. Nature. 2010, 467: 214-217.
Thorrez L, Laudadio I, Van Deun K, Quintens R, Hendrickx N, Granvik M, Lemaire K, Schraenen A, Van Lommel L, Lehnert S, Aguayo-Mazzucato C, Cheng-Xue R, Gilon P, Van Mechelen I, Bonner-Weir S, Lemaigre F, Schuit F: Tissue-specific disallowance of housekeeping genes: the other face of cell differentiation. Genome Res. 2011, 21: 95-105.
Planell N, Lozano JJ, Mora-Buch R, Masamunt MC, Jimeno M, Ordás I, Esteller M, Ricart E, Piqué JM, Panés J, Salas A: Transcriptional analysis of the intestinal mucosa of patients with ulcerative colitis in remission reveals lasting epithelial cell alterations. Gut. 2013, 62: 967-976.
Nagy LH, Grishina I, Macal M, Hirao LA, Hu WK, Sankaran-Walters S, Gaulke CA, Pollard R, Brown J, Suni M, Baumler AJ, Ghanekar S, Marco ML, Dandekar S: Chronic HIV infection enhances the responsiveness of antigen presenting cells to commensal Lactobacillus. PLoS One. 2013, 8: e72789-
Statistical Algorithms Description Document. [http://www.affymetrix.com/support/technical/whitepapers/sadd_whitepaper.pdf]
Adachi O, Kawai T, Takeda K, Matsumoto M, Tsutsui H, Sakagami M, Nakanishi K, Akira S: Targeted disruption of the MyD88 gene results in loss of IL-1- and IL-18-mediated function. Immunity. 1998, 9: 143-150.
Hoshino K, Takeuchi O, Kawai T, Sanjo H, Ogawa T, Takeda Y, Takeda K, Akira S: Toll-like receptor 4 (TLR4)-deficient mice are hyporesponsive to lipopolysaccharide: evidence for TLR4 as the Lps gene product. J Immunol. 1999, 162: 3749-3752.
Yamamoto M, Sato S, Hemmi H, Hoshino K, Kaisho T, Sanjo H, Takeuchi O, Sugiyama M, Okabe M, Takeda K, Akira S: Role of adaptor TRIF in the MyD88-independent toll-like receptor signaling pathway. Science. 2003, 301: 640-643.
We thank Jonathan Stoye for invaluable discussion and comments. This work was supported by the UK Medical Research Council (U117581330) and the Wellcome Trust (102898/Z/13/Z).
The authors declare that they have no competing interests.
GRY, BM and GK designed the study. GRY carried out computational analysis and BM carried out experiments and analyzed data. GRY and GK prepared the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figure S1: Tissue-specific RE expression patterns. Hierarchally-clustered heatmap of RE-reporting probes significantly regulated between B6 tissues for three independent experiments using the Mouse Genome 430 v2 microarray platform (p < 0.001 by ANOVA comparing tissues and eliminating experiment). Data are obtained from E-GEOD-1986, −9954, and −10246, which are identified with numbers. (PDF 6 MB)
Additional file 2: Archive of mouse annotation files. Probe annotation files (csv format), as defined in the Methods, for the following Affymetrix platforms: mg_u74a, mg_u74a_v2, mg_b74b, mg_u74b_v2, mg_b74c, mg_u74c_v2, moe_430a, moe_430b, ht_mg_430a, mogene_1_0_st, mogene_2_0, mouse430_v2, mouse430a_v2. Shortened platform names correspond to identifiers used within the ‘oligo’ Bioconductor R package. Column identifiers are pid – probe id, probeset – Affymetrix probeset, plen – probe length, sid – target chromosome, sstart – start position of probe on sid, send – end position of probe on sid, nident – identity within the region sstart to send, numhits – number of hits recorded by BLASTn, repeat – RepBase-defined repeat, repclass – RepBase-defined repeat class, rstart – start position of repetitive element, rend – end position of repetitive element, 5id – symbol of nearest 5′ protein coding gene, 5start – start position of 5id, 5stop – end position of 5id, 3id – symbol of nearest 3′ protein coding gene, 3start – start position of 3id, 3stop – end position of 3id. (ZIP 846 KB)
Additional file 3: Archive of human annotation files. Probe annotation files (csv format), as defined in the Methods, for the following Affymetrix platforms: hg_u95a, hg_u95a_v2, hg_u95b, hg_u95c, hg_u95e, hu6800, hg_u133a, hg_u133a_v2, hg_u133b, hg_u133_plus_v2, hg_u219, hg_focus, hugene_1_0_st, and hugene_2_0_st. Shortened platform names correspond to identifiers used within the ‘oligo’ Bioconductor R package. Column identifiers are pid – probe id, probeset – Affymetrix probeset, plen – probe length, sid – target chromosome, sstart – start position of probe on sid, send – end position of probe on sid, nident – identity within the region sstart to send, numhits – number of hits recorded by BLASTn, repeat – RepBase-defined repeat, repclass – RepBase-defined repeat class, rstart – start position of repetitive element, rend – end position of repetitive element, 5id – symbol of nearest 5′ protein coding gene, 5start – start position of 5id, 5stop – end position of 5id, 3id – symbol of nearest 3′ protein coding gene, 3start – start position of 3id, 3stop – end position of 3id. (ZIP 3 MB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.