- Open Access
Plasma proteome analysis in HTLV-1-associated myelopathy/tropical spastic paraparesis
Retrovirologyvolume 8, Article number: 81 (2011)
Human T lymphotropic virus Type 1 (HTLV-1) causes a chronic inflammatory disease of the central nervous system known as HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM) which resembles chronic spinal forms of multiple sclerosis (MS). The pathogenesis of HAM remains uncertain. To aid in the differential diagnosis of HAM and to identify pathogenetic mechanisms, we analysed the plasma proteome in asymptomatic HTLV-1 carriers (ACs), patients with HAM, uninfected controls, and patients with MS. We used surface-enhanced laser desorption-ionization (SELDI) mass spectrometry to analyse the plasma proteome in 68 HTLV-1-infected individuals (in two non-overlapping sets, each comprising 17 patients with HAM and 17 ACs), 16 uninfected controls, and 11 patients with secondary progressive MS. Candidate biomarkers were identified by tandem Q-TOF mass spectrometry.
The concentrations of three plasma proteins - high [β2-microglobulin], high [Calgranulin B], and low [apolipoprotein A2] - were specifically associated with HAM, independently of proviral load. The plasma [β2-microglobulin] was positively correlated with disease severity.
The results indicate that monocytes are activated by contact with activated endothelium in HAM. Using β2-microglobulin and Calgranulin B alone we derive a diagnostic algorithm that correctly classified the disease status (presence or absence of HAM) in 81% of HTLV-1-infected subjects in the cohort.
Human T lymphotropic virus Type 1 (HTLV-1) persists lifelong in the host and is associated with two distinct types of disease: a range of chronic inflammatory diseases, of which the most commonly recognized is HTLV-1-associated myelopathy/tropic spastic paraparesis (HAM/TSP, abbreviated hereafter as HAM), and an aggressive T cell malignancy known as adult T cell leukaemia/lymphoma (ATLL). The cumulative lifetime risk of HAM ranges between 0.1% and 3% of infected individuals; the cumulative lifetime risk of ATLL ranges from 1% to 5%.
The strongest correlate of the risk of HTLV-1-associated inflammatory diseases such as HAM is the proviral load (PVL), i.e. the percentage of peripheral blood mononuclear cells (PBMCs) that carry the provirus [1, 2]. The PVL remains approximately constant within each infected individual, but differs among individuals by over 1000 times. However, the range of PVL overlaps extensively between patients with HAM and asymptomatic carriers: although a PVL > 1% PBMCs is strongly associated with HAM, 50% of asymptomatic carriers also have a PVL > 1%, which reduces the value of this measure in the clinical diagosis of HAM. Furthermore, other clinical manifestions of HTLV-1 are less well-defined, and it is difficult to identify in a given case whether HTLV-1 infection is co-incidental or causative. There is therefore an urgent need for additional tools to aid in the diagnosis of HTLV-1-associated disease both clinically and epidemiologically.
The mechanisms of pathogenesis of the HTLV-1-associated inflammatory diseases such as HAM remain uncertain. To date, most virological and immunological markers of HAM correlate with proviral load, but do not differ between patients with HAM and asymptomatic carriers at a given PVL. Few factors have been identified that differ systematically between asymptomatic carriers and patients with HAM at a given proviral load: the frequency of certain lymphocyte subsets (HTLV-1-specific CD4+ T cells [3, 4]; FoxP3+ CD4+ T cells ; natural killer (NK) cells  and NKT cells ); the level of expression of HTLV-1 genes in fresh PBMCs [5, 8–10]; and the pattern of integration of the HTLV-1 provirus in the host cell genome . However, none of these parameters is useful in the differential diagnosis of HAM from other causes of spastic paraparesis, and these parameters give only indirect suggestions as to the pathogenesis of the inflammatory conditions such as HAM.
The aim of the present study was to identify plasma proteins whose concentration is associated with HAM or correlated with proviral load, to help in the differential diagnosis of HAM and to provide further clues as to the mechanism of pathogenesis of the inflammatory disease. In a two-stage case-control study, we used surface-enhanced laser desorption ionization time-of-flight mass spectrometry (SELDI-TOF-MS; abbreviated here as SELDI) to identify plasma protein biomarkers that distinguished patients with HAM from both asymptomatic HTLV-1 carriers and patients with progressive multiple sclerosis, which closely resembles HAM clinically. Three biomarkers were identified by tandem mass spectrometry. We derive algorithms to estimate the utility of these biomarkers in the differential diagnosis of HAM, and discuss their possible significance in the pathogenesis of the disease.
Univariate analysis revealed 4 biomarkers of HAM
Four successive pairwise comparisons were carried out: HAM vs. AC; AC vs. U; HAM vs. U; and (HAM and AC) vs. U. The results in Table 1 show the molecular weights of the peaks identified in each comparison for the original and verification data sets respectively. The p-values returned by the Biomarker Wizard software were converted into q-values, which estimate the false discovery rate  and account for multiple testing.
In the comparison of HAM vs. AC, four peaks remained statistically significant in both the original and verification data sets after multiple testing correction: the 11.7 kDa, 11.9 kDa, 13.3 kDa and 14.7 kDa peaks. As illustrated in Figure 1, the intensities of all four peaks were typically higher in the HAM group. The intensities of all four peaks in HAM patients also differed significantly from those in uninfected controls (Table 1; Additional File 1, S4).
The plasma proteomic profile differed between HAM and multiple sclerosis
The samples from patients with HAM were compared with the samples from patients with secondary progressive MS. The results are shown in Table 2.
Several peaks differed in intensity between the HAM and MS spectra (Table 2). The first four of these peaks (10.1 kDa, 3.8 kDa, 7.7 kDa and 9.2 kDa) were detected consistently only amongst the MS spectra; in the HAM spectra, their intensities fell below the noise level. For all but 2 of the peaks (9.4 kDa and 14.0 kDa), the mean intensity was higher in samples from patients with MS.
Protein peak intensities were uncorrelated with proviral load
After correction for multiple testing, we found no peaks whose normalized log intensities remained significantly correlated with log(proviral load) in either the original or verification sets.
SELDI data enabled HAM and AC to be distinguished with 79% cross-validation success rate
We aimed to identify protein peaks that discriminated between: (1) HAM and AC; and (2) HAM and MS.
1. HAM vs. AC. Three peaks (13.3 kDa, 11.7 kDa and 17.6 kDa) gave the lowest mean cross-validation (CV) misclassification rate (20.7%). The mean CV misclassification rate for the 11.7 kDa peak alone was 22.8%, and for 11.7 kDa and 13.3 kDa together was 21.4%.
2. HAM vs. MS. The 10.1 kDa peak alone gave the lowest mean CV misclassification rate of 6.6%, and further selections did not improve predictive performance.
The separation between the subject groups provided by the two most stably selected peaks in each of these cases is shown in Figure 2.
Proviral load provided additional discriminatory information
Repeating the multivariate analysis of the HAM and AC classes including log(proviral load) as a predictor, we identified four parameters as significant discriminators: proviral load and the 13.3 kDa, 11.7 kDa and 11.9 kDa peaks. These four parameters combined gave a mean misclassification rate of 14.6%; the log(proviral load) alone provided a mean CV misclassification rate of 21.0%.
A simple classification rule using the SELDI data
We constructed a classifier to discriminate between HAM and AC using only the 11.7 kDa and 13.3 kDa peaks, since these peaks were selected consistently in all analyses. For simplicity, we categorized the observed intensity values for each of these two peaks as either 'normal' or 'abnormal'. 'Normal' intensity measurements (Figure 2a, shaded region) were defined as those within 2 standard deviations of the mean amongst the uninfected controls (intensity = 3.2 ± 1.1 for the 11.7 kDa peak; 1.2 ± 1.2 for 13.3 kDa). From a logistic regression analysis of the resulting categorical data (see Additional file 1, S5), we derived the following rule:
"If the intensity of either the 11.7 kDa or the 13.3 kDa peak (or both) is abnormal, then classify as HAM".
This rule correctly classified 55/68 = 81% of seropositive subjects in the 'combined' data set; this performance level may be overestimated, since the same data were used both to devise and assess the rule. Of the 13 misclassified individuals, 2 were ACs misclassified as HAM (i.e. false positive diagnosis of HAM), corresponding to a false positive rate of 5.9% and a false negative rate of 32.4%.
Because of their consistent significance in the statistical analysis, we wished to identify the proteins that constituted the 11.7 kDa and 13.3 kDa peaks. We also attempted identification of the 17.4 kDa and 14.7 kDa peaks (see Table 1).
Q-TOF mass spectrometry identified the 17.4 kDa protein as apolipoprotein A-II (ApoA-II). The species present was presumably the S-S linked homodimer , whose calculated mass is 17.416 kDa, since the theoretical mass of its monomer (residues 24-100) is 8.708 kDa. This identification was confirmed by adding dithiothreitol (DTT) to the eluate containing the 17.4 kDa protein and repeating the SELDI analysis (Figure 3). Details of peptide fragments detected in the Q-TOF analysis are available on request.
Q-TOF analysis also identified the 11.7 kDa and 13.3 kDa protein eluates as β2-microglobulin (calculated mass 11.731 kDa) and Calgranulin B (Protein S100-A9) (calculated mass 13.224 kDa).
Purification of the 14.7 kDa protein by anionic fractionation and 1D-SDS-PAGE failed to resolve the peaks into separate gel bands. Three consistent peaks were detected by SELDI analysis. Q-TOF-MS analysis revealed three potential identities: transthyretin (calculated mass = 13761.4 Da); serum amyloid A-4 (SAA-4) (calculated mass = 12863.28 Da; glycosylation may increase the mass in vivo); and lysozyme.
Confirmation of identity of β2-microglobulin
The plasma concentration of β2-microglobulin was measured in the subjects in the original sample set (17 patients with HAM, 17 ACs, 16 uninfected subjects) by rate nephelometry. The area under the 11.7 kDa peak was significantly correlated (Spearman rank correlation, 2-tailed) with the β2-microglobulin concentration in both the uninfected individuals (p = 0.023) and the asymptomatic carriers (p = 0.006); the correlation was suggestive in the patients with HAM (p = 0.09). The statistical significance of the correlation in the three subject groups combined was p = 0.0009, by Fisher's method of combining probabilities . We conclude that the 11.7 kDa peak was formed by β2-microglobulin.
Correlation between plasma β2-microglobulin concentration and disability in HAM patients
A good objective measure of the disability caused by HAM is the time taken by the patient to walk 10 m on a flat, smooth surface. There was a significant positive correlation between the 10 m timed walk and the plasma β2-microglobulin concentration (Figure 4).
Biomarkers serve three chief purposes: to assist in the diagnosis of a clinical condition, to follow the progression of disease and response to treatment, and to give clues as to the pathogenetic mechanisms of the disease in question.
We identified two plasma proteins whose abundance in HTLV-1-infected individuals was significantly and independently associated with the presence of HAM: β2-microglobulin and Calgranulin B. Moreover, the plasma β2-microglobulin concentration was significantly correlated with the degree of disability resulting from HAM as measure by the 10 m timed walk (Figure 4). Two further peaks (11.9 kDa and 14.7 kDa) had intensities that were strongly correlated with the 11.7 kDa (β2-microglobulin) and 13.3 kDa (Calgranulin B) peaks respectively. However, inclusion of these peaks did not add significantly to the discrimination between asymptomatic HTLV-1 carriers and patients with HAM. A logistic regression model including the abundance of the two most strongly associated biomarkers, β2-microglobulin and Calgranulin B, correctly classified the clinical status of HTLV-1-infected subjects in the present cohort with 79% accuracy (i.e. CV misclassification rate of 21%). A 17.4 kDa protein peak, identified as Apolipoprotein-AII (Apo-AII), was also of particular interest as it was more abundant in asymptomatic carriers than in patients with HAM.
Two of the identified peaks (β2-microglobulin and Calgranulin B) discriminated between HAM and MS patients, with a CV misclassification rate of 25.0%. However, the best discrimination between patients with HAM and those with secondary progressive MS was provided by the protein peak at 10.1 kDa (Figure 2b): this protein awaits identification.
The serum concentration of β2-microglobulin is in clinical use as a marker of chronic renal disease, in risk stratification and as a measure of disease burden in certain haematological malignancies [15–17], and as a measure of disease activity and progression in HIV-1  and HIV-2 infection . In Alzheimer's disease, Simonsen et al.  found a lower concentration of β2-microglobulin in the cerebrospinal fluid (CSF) in patients compared with healthy age-matched controls, using SELDI techniques. In one study of multiple sclerosis, a higher concentration of β2-microglobulin was found in CSF in patients with active disease compared with healthy individuals ; however, this finding was not reproduced in subsequent studies, either in CSF [22, 23] or in the serum β2-microglobulin concentration . In patients with HAM, as in HIV-1  and HIV-2 infections , it is likely that the high β2-microglobulin concentration in serum is a result of chronic activation of large numbers of T cells.
Calgranulin B [24, 25] is also known as S100A9 and as myeloid-related protein 14 (MRP14). Together with the related protein Calgranulin A (S100A8; MRP8), Calgranulin B forms a heterodimer called Calprotectin, which plays a central part in calcium ion signalling during inflammation. Calgranulin B is expressed chiefly in neutrophils, and in monocytes after activation of protein kinase C. It is also expressed in some epithelial cells, but not in fixed tissue macrophages or in lymphocytes. Contact with activated endothelium induces monocytes to secrete Calgranulin B (by a microtubule-dependent mechanism), but contact with resting endothelium suppresses this secretion . The concentration of Calgranulin B in serum (or other fluids such as synovial fluid) correlates with disease activity in a wide range of inflammatory conditions, including rheumatoid arthritis , pauciarticular onset juvenile rheumatoid arthritis ; inflammatory bowel disease ; chronic airway inflammation ; kidney allograft rejection  and acute appendicitis .
The strong association found in this study between serum Calgranulin B abundance and HAM suggests that monocytes are activated by contact with activated endothelium in patients with this disease. However, macrophages themselves may not contribute to the tissue damage seen in HAM: macrophages are not a prominent feature of the cellular infiltrate in CNS lesions .
Apolipoprotein-AII (Apo-AII)  is a major constituent of high-density lipoproteins (HDL). Castellani et al.  found that overexpression of Apo-AII in transgenic mice converted HDL to proinflammatory particles which induced transmigration of monocytes across the arterial wall, and these authors suggested a role of Apo-AII in atherosclerosis. Further evidence of an inflammatory role of Apo-AII was found by Thompson et al. , who showed that Apo-AII enhanced the monocyte response to bacterial lipopolysaccharide. The present results show that a high peak of Apo-AII was associated with asymptomatic carrier status. Further work is warranted to investigate the possible contribution of monocytes to the pathogenesis of HAM.
Semmes et al.  reported a SELDI analysis of serum proteins in HTLV-1 infection. These authors compared the SELDI proteomic profile between patients with ATLL, patients with HAM, and uninfected control subjects. They found that two serum proteins were overexpressed in ATLL: these proteins were identified as alpha-1-antitrypsin and haptoglobin-2 respectively by tandem mass spectrometry. The authors concluded that these two proteins could serve as biomarkers to distinguish between ATLL and HAM. However, these authors did not examine the serum proteome in asymptomatic HTLV-1 carriers or patients with multiple sclerosis, and the relationship between the protein peak intensity and the proviral load of HTLV-1 was not explored.
The biomarkers identified here are potentially of clinical use, because they were specifically associated with the presence of the disease, HAM, and gave additional discriminatory information to that provided by the proviral load. These biomarkers may therefore serve both in the differential diagnosis of HAM and in following the disease activity and response to treatment in individual patients. The plasma (or serum) concentration of β2-microglobulin may be of particular clinical utility, because the assay is readily available in clinical chemistry laboratories and because the concentration correlates with clinical severity (degree of disability). The classification rule ("if the intensity of either the 11.7 kDa or the 13.3 kDa peak - or both - is abnormal, then classify as HAM") provides a useful basis for further clinical testing. In addition, it will be interesting to investigate these biomarkers in other diseases that are known or suspected to be caused by HTLV-1, such as polymyositis, arthritis and uveitis.
Subjects; plasma samples
Plasma was prepared from EDTA-anticoagulated peripheral venous blood samples donated by a total of 95 subjects, comprising 68 HTLV-1 infected patients (34 patients with HTLV-1 associated myelopathy/tropical spastic paraparesis (HAM) and 34 asymptomatic HTLV-1 carriers (ACs)) and 16 ethnically matched uninfected controls (U) attending clinic at the National Centre for Human Retrovirology, St Mary's Hospital, London. These samples form part of the Research Tissue Bank (Imperial College London), approved by the UK National Research Ethics Service (09/H0606/106). EDTA-anticoagulated peripheral venous blood samples were also donated by 11 patients with secondary progressive multiple sclerosis attending Charing Cross Hospital, London. All patients gave written informed consent. Details of participating subjects (age, sex, ethnicity, disease status) are available on request.
Samples were analysed by SELDI using the CM10 ProteinChip array (Bio-Rad, Hemel Hempstead, UK) with 50 mM sodium acetate buffer (pH4), and sinapinic acid as the energy-absorbing matrix. Data were collected at low- and high-mass ranges consecutively, using the manufacturer's protocol. The low-mass range was 1-30 kDa (highest mass collected 50 kDa) and the high-mass range 10-75 kDa (highest mass collected 100 kDa). For each subject we then combined the data in the optimal part of each mass range (1 to 10 kDa from the low-mass range and 10 to 100 kDa from the high-mass range). See Additional file 1, S1 for details.
Processing of spectral data: Biomarker Wizard
Spectra were processed using the Ciphergen ProteinChip Software (version 3.2.0). Before statistical analysis with the proprietary software package Biomarker Wizard (Bio-Rad), spectra were calibrated using four molecular weight standards and normalized by intensity (total ion current). The spectra from each experiment were analysed as either low- or high-mass data, in two groups (HAM vs AC; AC vs U; HTLV-1+ vs MS). Following the manufacturer's guidelines, criteria for protein peak detection were a signal to noise ratio (S/N) of ≥10 and presence of the peak in ≥40% of samples in at least one subject group. All spectra were inspected visually by an experienced SELDI operator; no spectra were omitted from the study.
The study comprised three stages. In stage 1 we analysed samples from 17 patients with HAM, 17 ACs and 16 Us. In stage 2 we analysed samples from a non-overlapping cohort of 17 HAM patients and 17 ACs together with repeat aliquots of the plasma samples from 14 of the 16 Us studied in the first set. Finally, samples from 11 patients with MS were analysed together with 12 samples chosen randomly from the 34 patients with HAM. These three data sets were respectively denoted the 'original', 'verification' and 'MS' sets.
To derive predictive multivariate models, we pooled data from the 'original' and 'verification' sets to create a 'combined' data set comprising samples from all 68 HTLV-1-seropositive individuals, after further normalization to minimize bias due to chip-specific effects . Further details are provided in Additional file 1, S2. As an alternative approach, we trained a predictive model using the 'original' dataset only, and then tested its performance on the 'verification' dataset. This method also yielded good results (see Additional file 1, S8). The workflow of sample processing and analysis is summarized in Additional file 1, S9.
Association with disease group
For each comparison of interest we identified significant differences (Mann-Whitney U test) in the distributions of peak intensities between the two respective subject groups.
Correlation with proviral load
We tested for a correlation (Kendall's τ rank correlation coefficient; exact p-values) between protein peak intensities and proviral load separated in samples from ACs and patients with HAM in the original and verification data sets.
Multiple testing correction
Prediction and peak selection
We used a logistic regression model to describe the probability of disease outcome. To fit the model to the data we used a maximum likelihood method incorporating the lasso penalty [39, 40]. A stability selection approach  was used to select the peaks whose selection was robust to data variability. We selected the optimum number of peaks using a cross-validation assessment of predictive performance. Further details are provided in Additional file 1, S3.
Identification of candidate biomarker proteins
To identify the candidate proteins, plasma samples with a high abundance of the desired protein (i.e. high peak height on the SELDI spectrum) were selected. HAM samples were used to identify 11.7 kDa, 13.3 kDa and 14.7 kDa proteins, whereas AC plasma was used to identify the 17.4 kDa protein. Albumin-depleted plasma was subjected to successive anionic exchange fractionation and reverse phase fractionation; and separated by 1D-SDS-PAGE. Proteins at the target molecular weight were digested with trypsin (either in gel or after passive elution) and analysed using Q-TOF-MS/MS. All samples were run in duplicate. See Additional file 1, S6 for further details.
Protein identification by tandem mass spectrometry
Gel bands were excised and digested with trypsin as previously described . Samples were analysed by high performance liquid chromatography coupled to electrospray ionization tandem mass spectrometry (HPLC ESI MS/MS using a Waters Q-TOF instrument). Proteins were identified by correlation of uninterpreted spectra to the SwissProt database (Release 2010_04) using Mascot (version 2.2: http://www.matrixscience.com). MS/MS ion searches specified up to two missed cleavages per peptide, a precursor mass tolerance of ± 100 ppm and a fragment ion mass tolerance of ± 0.5Da. Carbamidomethylation of cysteines and methionine oxidation were specified as fixed and variable modifications respectively.
Q-TOF MS/MS based peptide and protein identifications were validated using the Protein and Peptide Prophet algorithms, as implemeted in the program Scaffold version 3.01  (Proteome Software Inc., Portland, Oregon). Peptide identifications were accepted if they could be established at greater than 95.0% probability. Protein identifications were accepted if established at greater than 99.0% probability and contained at least 2 matched peptides.
Nagai M, Usuku K, Matsumoto W, Kodama D, Takenouchi N, Moritoyo T, Hashiguchi S, Ichinose M, Bangham CR, Izumo S, Osame M: Analysis of HTLV-I proviral load in 202 HAM/TSP patients and 243 asymptomatic HTLV-I carriers: high proviral load strongly predisposes to HAM/TSP. J Neurovirol. 1998, 4: 586-593. 10.3109/13550289809114225.
Bangham CRM, Meekings K, Toulza F, Nejmeddine M, Majorovits E, Asquith B, Taylor GP: The immune control of HTLV-1 infection: selection forces and dynamics. Frontiers in Bioscience. 2009, 14: 2889-2903.
Goon PK, Hanon E, Igakura T, Tanaka Y, Weber JN, Taylor GP, Bangham CR: High frequencies of Th1-type CD4(+) T cells specific to HTLV-1 Env and Tax proteins in patients with HTLV-1-associated myelopathy/tropical spastic paraparesis. Blood. 2002, 99: 3335-3341. 10.1182/blood.V99.9.3335.
Goon PK, Igakura T, Hanon E, Mosley AJ, Asquith B, Gould KG, Taylor GP, Weber JN, Bangham CR: High circulating frequencies of tumor necrosis factor alpha- and interleukin-2-secreting human T-lymphotropic virus type 1 (HTLV-1)-specific CD4+ T cells in patients with HTLV-1-associated neurological disease. J Virol. 2003, 77: 9716-9722. 10.1128/JVI.77.17.9716-9722.2003.
Toulza F, Heaps A, Tanaka Y, Taylor GP, Bangham CR: High frequency of CD4+FoxP3+ cells in HTLV-1 infection: inverse correlation with HTLV-1-specific CTL response. Blood. 2008, 111: 5047-5053. 10.1182/blood-2007-10-118539.
Yu F, Itoyama Y, Fujihara K, Goto I: Natural killer (NK) cells in HTLV-I-associated myelopathy/tropical spastic paraparesis-decrease in NK cell subset populations and activity in HTLV-I seropositive individuals. J Neuroimmunol. 1991, 33: 121-128. 10.1016/0165-5728(91)90056-D.
Saito M, Braud VM, Goon P, Hanon E, Taylor GP, Saito A, Eiraku N, Tanaka Y, Usuku K, Weber JN, et al: Low frequency of CD94/NKG2A+ T lymphocytes in patients with HTLV-1-associated myelopathy/tropical spastic paraparesis, but not in asymptomatic carriers. Blood. 2003, 102: 577-584. 10.1182/blood-2002-09-2855.
Yamano Y, Cohen CJ, Takenouchi N, Yao K, Tomaru U, Li HC, Reiter Y, Jacobson S: Increased expression of human T lymphocyte virus type I (HTLV-I) Tax11-19 peptide-human histocompatibility leukocyte antigen A*201 complexes on CD4+ CD25+ T Cells detected by peptide-specific, major histocompatibility complex-restricted antibodies in patients with HTLV-I-associated neurologic disease. J Exp Med. 2004, 199: 1367-1377. 10.1084/jem.20032042.
Yamano Y, Nagai M, Brennan M, Mora CA, Soldan SS, Tomaru U, Takenouchi N, Izumo S, Osame M, Jacobson S: Correlation of human T-cell lymphotropic virus type 1 (HTLV-1) mRNA with proviral DNA load, virus-specific CD8(+) T cells, and disease severity in HTLV-1-associated myelopathy (HAM/TSP). Blood. 2002, 99: 88-94. 10.1182/blood.V99.1.88.
Asquith B, Mosley AJ, Heaps A, Tanaka Y, Taylor GP, McLean AR, Bangham CR: Quantification of the virus-host interaction in human T lymphotropic virus I infection. Retrovirology. 2005, 2: 75-10.1186/1742-4690-2-75.
Meekings KN, Leipzig J, Bushman FD, Taylor GP, Bangham CR: HTLV-1 integration into transcriptionally active genomic regions is associated with proviral expression and with HAM/TSP. PLoS Pathog. 2008, 4: e1000027-10.1371/journal.ppat.1000027.
Storey J: A direct approach to false discovery rates. J Roy Stat Soc B. 2002, 64: 479-498. 10.1111/1467-9868.00346.
Niederkofler EE, Tubbs KA, Kiernan UA, Nedelkov D, Nelson RW: Novel mass spectrometric immunoassays for the rapid structural characterization of plasma apolipoproteins. J Lipid Res. 2003, 44: 630-639. 10.1194/jlr.D200034-JLR200.
Sokal RR, J RF: Biometry. 2004, New York: W. H. Freeman
Kay NE, Shanafelt TD: Prognostic factors in chronic lymphocytic leukemia. Curr Hematol Malig Rep. 2007, 2: 49-55. 10.1007/s11899-007-0007-1.
Swan F, Velasquez WS, Tucker S, Redman JR, Rodriguez MA, McLaughlin P, Hagemeister FB, Cabanillas F: A new serologic staging system for large-cell lymphomas based on initial beta 2-microglobulin and lactate dehydrogenase levels. J Clin Oncol. 1989, 7: 1518-1527.
Bergsagel PL: Individualizing therapy using molecular markers in multiple myeloma. Clin Lymphoma Myeloma. 2007, 7: S170-174. 10.3816/CLM.2007.s.019.
Fahey JL, Taylor JM, Detels R, Hofmann B, Melmed R, Nishanian P, Giorgi JV: The prognostic value of cellular and serologic markers in infection with human immunodeficiency virus type 1. N Engl J Med. 1990, 322: 166-172. 10.1056/NEJM199001183220305.
Leligdowicz A, Feldmann J, Jaye A, Cotten M, Dong T, McMichael A, Whittle H, Rowland-Jones S: Direct relationship between virus load and systemic immune activation in HIV-2 infection. J Infect Dis. 2010, 201: 114-122.
Simonsen AH, McGuire J, Podust VN, Davies H, Minthon L, Skoog I, Andreasen N, Wallin A, Waldemar G, Blennow K: Identification of a novel panel of cerebrospinal fluid biomarkers for Alzheimer's disease. Neurobiol Aging. 2008, 29: 961-968. 10.1016/j.neurobiolaging.2007.01.011.
Bjerrum OW, Bach FW, Zeeberg I: Increased level of cerebrospinal fluid beta 2-microglobulin is related to neurologic impairment in multiple sclerosis. Acta Neurol Scand. 1988, 78: 72-75.
Carrieri PB, Indaco A, Maiorino A, Buscaino GA, Ponticiello A, Liberti A, Perrella O: Cerebrospinal fluid beta-2-microglobulin in multiple sclerosis and AIDS dementia complex. Neurol Res. 1992, 14: 282-283.
Ott M, Demisch L, Engelhardt W, Fischer PA: Interleukin-2, soluble interleukin-2-receptor, neopterin, L-tryptophan and beta 2-microglobulin levels in CSF and serum of patients with relapsing-remitting or chronic-progressive multiple sclerosis. J Neurol. 1993, 241: 108-114. 10.1007/BF00869773.
Hessian PA, Edgeworth J, Hogg N: MRP-8 and MRP-14, two abundant Ca(2+)-binding proteins of neutrophils and monocytes. J Leukoc Biol. 1993, 53: 197-204.
Passey RJ, Xu K, Hume DA, Geczy CL: S100A8: emerging functions and regulation. J Leukoc Biol. 1999, 66: 549-556.
Frosch M, Strey A, Vogl T, Wulffraat NM, Kuis W, Sunderkotter C, Harms E, Sorg C, Roth J: Myeloid-related proteins 8 and 14 are specifically secreted during interaction of phagocytes and activated endothelium and are useful markers for monitoring disease activity in pauciarticular-onset juvenile rheumatoid arthritis. Arthritis Rheum. 2000, 43: 628-637. 10.1002/1529-0131(200003)43:3<628::AID-ANR20>3.0.CO;2-X.
Odink K, Cerletti N, Bruggen J, Clerc RG, Tarcsay L, Zwadlo G, Gerhards G, Schlegel R, Sorg C: Two calcium-binding proteins in infiltrate macrophages of rheumatoid arthritis. Nature. 1987, 330: 80-82. 10.1038/330080a0.
Rugtveit J, Brandtzaeg P, Halstensen TS, Fausa O, Scott H: Increased macrophage subset in inflammatory bowel disease: apparent recruitment from peripheral blood monocytes. Gut. 1994, 35: 669-674. 10.1136/gut.35.5.669.
Roth J, Teigelkamp S, Wilke M, Grun L, Tummler B, Sorg C: Complex pattern of the myelo-monocytic differentiation antigens MRP8 and MRP14 during chronic airway inflammation. Immunobiology. 1992, 186: 304-314.
Goebeler M, Roth J, Burwinkel F, Vollmer E, Bocker W, Sorg C: Expression and complex formation of S100-like proteins MRP8 and MRP14 by macrophages during renal allograft rejection. Transplantation. 1994, 58: 355-361.
Bealer JF, Colgin M: S100A8/A9: a potential new diagnostic aid for acute appendicitis. Acad Emerg Med. 2010, 17: 333-336. 10.1111/j.1553-2712.2010.00663.x.
Iwasaki Y: Human T cell leukemia virus type I infection and chronic myelopathy. Brain Pathol. 1993, 3: 1-10. 10.1111/j.1750-3639.1993.tb00719.x.
Martin-Campos JM, Escola-Gil JC, Ribas V, Blanco-Vaca F: Apolipoprotein A-II, genetic variation on chromosome 1q21-q24, and disease susceptibility. Curr Opin Lipidol. 2004, 15: 247-253. 10.1097/00041433-200406000-00003.
Castellani LW, Navab M, Van Lenten BJ, Hedrick CC, Hama SY, Goto AM, Fogelman AM, Lusis AJ: Overexpression of apolipoprotein AII in transgenic mice converts high density lipoproteins to proinflammatory particles. J Clin Invest. 1997, 100: 464-474. 10.1172/JCI119554.
Thompson PA, Berbee JF, Rensen PC, Kitchens RL: Apolipoprotein A-II augments monocyte responses to LPS by suppressing the inhibitory activity of LPS-binding protein. Innate Immun. 2008, 14: 365-374. 10.1177/1753425908099171.
Semmes OJ, Cazares LH, Ward MD, Qi L, Moody M, Maloney E, Morris J, Trosset MW, Hisada M, Gygi S, Jacobson S: Discrete serum protein signatures discriminate between human retrovirus-associated hematologic and neurologic disease. Leukemia. 2005, 19: 1229-1238. 10.1038/sj.leu.2403781.
Churchill GA: Using ANOVA to analyze microarray data. Biotechniques. 2004, 37: 173-175. 177
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc B. 1995, 57: 289-300.
Friedman J, Hastie T, Tibshirani R: Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010, 33: 1-22.
Tibshirani R: Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society Series B-Methodological. 1996, 58: 267-288.
Saeys Y, Abeel T, Van de Peer Y: Robust feature selection using ensemble feature selection techniques. Lecture Notes in Computer Science. 2008, Berlin/Heidelberg: Springer, 5212: 313-325. 10.1007/978-3-540-87481-2_21.
Wait R, Miller I, Eberini I, Cairoli F, Veronesi C, Battocchio M, Gemeiner M, Gianazza E: Strategies for proteomics with incompletely characterized genomes: the proteome of Bos taurus serum. Electrophoresis. 2002, 23: 3418-3427. 10.1002/1522-2683(200210)23:19<3418::AID-ELPS3418>3.0.CO;2-7.
Searle BC: Scaffold: a bioinformatic tool for validating MS/MS-based proteomic studies. Proteomics. 2010, 10: 1265-1269. 10.1002/pmic.200900437.
We thank our colleagues in the Department of Immunology for help with extraction and preparation of samples; volunteers in the clinic at St Mary's Hospital for providing the samples; Jeremy Chataway and Maria-Antonietta Demontis for help with the samples from the MS study; Joanna Sheldon (Protein Reference Unit, St Georges Hospital, London) for carrying out the rate nephelometry assay. We would also like to thank Nathan Harris for help with peptide mass fingerprinting of the samples, and Becca Asquith for comments on the manuscript.
This work was supported by the Wellcome Trust (UK) [Programme Grant number 080871 to C.R.M.B. and PhD studentship 080713 to P.D.W.K.] and the NIHR Biomedical Research Centre, Imperial College London.
The authors declare that they have no competing interests.
PK contributed to the study design, carried out the mathematical, statistical and computational analysis and wrote the paper; AW carried out the sample preparation and SELDI analysis, and the data analysis using BioMarker Wizard; AC carried out the experimental work to identify candidate biomarkers; AL contributed to mathematical and statistical analysis; RW identified the candidate biomarkers by tandem mass spectrometry; MS and SR contributed to mathematical and statistical analysis; GP contributed to study design, clinical diagnosis, and writing the paper; CB conceived the study and contributed to study design, analysis and writing the paper. All authors read and approved the final manuscript.
Paul DW Kirk, Aviva Witkover contributed equally to this work.