Skip to Main Content

Skip Nav Destination

CORRESPONDENCE| June 27, 2013

Response: platelet transcriptome and proteome—relation rather than correlation

Jörg Geiger,

Jörg Geiger

1Interdisciplinary Bank of Biomaterials and Data, Würzburg, Germany

Search for other works by this author on:

PubMed

Google Scholar

Julia M. Burkhart,

Julia M. Burkhart

2Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany

Search for other works by this author on:

PubMed

Google Scholar

Stepan Gambaryan,

Stepan Gambaryan

3Institut für Klinische Biochemie und Pathobiochemie, Universitätsklinikum Würzburg, Würzburg, Germany

Search for other works by this author on:

PubMed

Google Scholar

Ulrich Walter,

Ulrich Walter

4Center for Thrombosis and Haemostasis, University Medical Center, Johannes Gutenberg-University Mainz, Mainz, Germany

Search for other works by this author on:

PubMed

Google Scholar

Albert Sickmann,

Albert Sickmann

5Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany

6Medizinisches Proteom-Center, Ruhr-Universität, Bochum, Germany

Search for other works by this author on:

PubMed

Google Scholar

René P. Zahedi

René P. Zahedi

7Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany

Search for other works by this author on:

PubMed

Google Scholar

Blood (2013) 121 (26): 5257–5258.

https://doi.org/10.1182/blood-2013-04-493403

We have demonstrated by a detailed statistical analysis of proteome and transcriptome data of human platelets and human cell lines that protein and transcript abundance in platelets, if at all, are only weakly correlated.¹ This analysis appears to be in contradiction to previous claims made inter alia by Rowley and Weyrich,² who again advanced their opinion that transcript numbers would indeed reflect the extent of protein expression in human platelets.³ However, we do not agree that clear evidence about close transcriptome and proteome correlation is provided by previous publications, and from our perspective, the publications Rowley and Weyrich allude to^4-6 do not convey clear proof for their hypothesis. None of the publications deal with the problem of comprehensive and comparative analysis of the transcriptome and proteome of human platelets, but rather are primarily focused either on the transcriptome⁶ or on a small number of individual proteins.^4,5 In the publication by Gnatenko et al, the authors clearly state that: “the molecular analysis of the platelet transcriptome may be confounded by the constant decay of m[essenger] RNAs in the absence of new gene transcription”.⁵

We have carefully analyzed their response in order to understand the reason for the apparently contradicting view. Some of the remarks in the letter by Rowley and Weyrich are undeniably correct. For instance, in a few cases, we have missed transcripts and assumed that, although the protein being present as evident from mass spectrometry, the corresponding transcript would be absent. Indeed the transcript was present (eg, for GPIbα), but because of inconsistencies in the annotation systems, the refseq identifier could not be mapped to the correct protein. Unfortunately, this is a rather common problem with large data sets, so that some of the refseq identifiers provided by the authors² were actually deleted or superseded in the meantime and in certain cases could even not be mapped on the sequence data level. Splicing variants and multiple identifiers assigned to the same protein further complicate the alignment of proteome and transcriptome data. Using the gene names that were listed along with the refseq identifiers did not appear recommendable to us, because they cannot be expected to be unique. Inspired by the letter from Rowley et al, we once more revised the data using alternative approaches for mapping transcripts to protein identifiers. Again identifiers were mapped exclusively to stable identifiers, and pseudogenes, hypothetical proteins, and so on were omitted. However, unexpectedly, this extended strategy yielded only 24 additional transcript-protein pairs for reads per kilobase per million (RPKM) >1 (Table 1); except for GPIbα, only 5 additional transcripts contributed significantly (RPKM > 100).

Table 1

Reassignment of incorrectly assigned transcripts to proteome data

Protein	Name	qMS protein copies	RNAseq (Trizol)	RNAseq (column)
P24844	MYL9	66 201.00	1075.77	470.75
P07359	GP1BA	17 878.00	198.00	56.93
P78417	GSTO1	44 394.00	173.23	43.36
P59998	ARPC4	23 521.00	168.27	28.86
Q99952	PTPN18	674.00	109.61	43.89
P01893	HLA-H	1.00	48.55	13.68
P09496	CLTA	3072.00	17.14	4.56
Q9UP65	PLA2G4C	289.00	8.10	0.83
A6QL63	BTBD11	399.00	6.04	2.73
O95139	NDUFB6	968.00	3.69	0.39
Q63HN8	RNF213	364.00	3.57	1.14
Q5VYK3	KIAA0368	1623.00	3.36	0.35
Q96J02	ITCH	472.00	1.98	0.13
Q99460	PSMD1	2219.00	1.97	0.29
O75521	PECI	767.00	1.89	0.00
O75167	PHACTR2	388.00	1.88	0.43
O00499	BIN1	1.00	1.78	0.00
P45984	MAPK9	1213.00	1.68	0.24
Q6YHK3	CD109	1781.00	1.52	0.87
Q5XPI4	RNF123	312.00	1.48	0.23
Q9H8M7	FAM188A	340.00	1.38	0.12
Q8WYN0	ATG4A	668.00	1.01	0.00
Q92696	RABGGTA	1534.00	0.99	0.24
P63096	GNAI1	10 443.00	0.99	0.65

Protein	Name	qMS protein copies	RNAseq (Trizol)	RNAseq (column)
P24844	MYL9	66 201.00	1075.77	470.75
P07359	GP1BA	17 878.00	198.00	56.93
P78417	GSTO1	44 394.00	173.23	43.36
P59998	ARPC4	23 521.00	168.27	28.86
Q99952	PTPN18	674.00	109.61	43.89
P01893	HLA-H	1.00	48.55	13.68
P09496	CLTA	3072.00	17.14	4.56
Q9UP65	PLA2G4C	289.00	8.10	0.83
A6QL63	BTBD11	399.00	6.04	2.73
O95139	NDUFB6	968.00	3.69	0.39
Q63HN8	RNF213	364.00	3.57	1.14
Q5VYK3	KIAA0368	1623.00	3.36	0.35
Q96J02	ITCH	472.00	1.98	0.13
Q99460	PSMD1	2219.00	1.97	0.29
O75521	PECI	767.00	1.89	0.00
O75167	PHACTR2	388.00	1.88	0.43
O00499	BIN1	1.00	1.78	0.00
P45984	MAPK9	1213.00	1.68	0.24
Q6YHK3	CD109	1781.00	1.52	0.87
Q5XPI4	RNF123	312.00	1.48	0.23
Q9H8M7	FAM188A	340.00	1.38	0.12
Q8WYN0	ATG4A	668.00	1.01	0.00
Q92696	RABGGTA	1534.00	0.99	0.24
P63096	GNAI1	10 443.00	0.99	0.65

The MS copy numbers and RNAseq data were merged after reassigning the identifiers. Only the combinations for proteome and transcriptome data for RPKM > 1, which were newly found in addition to the published data¹ are shown.

MS, mass spectometry; qMS, quantified by MS.

A major challenge for studies dealing with native material, particularly when isolated from blood, is posed by the high demands on sample purity. Because the protein content of platelets is comparable to other cells, contaminations have a comparatively small and predictable effect on the quality of proteome analysis. For instance, plasma proteins cannot be entirely removed from platelet preparations because of the “sponge-like” platelet surface formed by the open canalicular system, which is virtually inaccessible to purification techniques. In contrast, RNA content in platelets is ∼4 orders of magnitude lower than in leukocytes⁷; consequently, contaminations have a much stronger impact on data quality in platelet transcriptome analysis. Platelet RNA content is governed by exogenous and endogenous conditions as well as intrinsic factors. Because, to our knowledge, platelets have no transcription machinery, the RNA found apparently might be a relic of megakaryocyte RNA from proplatelet formation, rendering it difficult to deduce which of the transcripts contribute to the actual platelet proteome. Moreover, the amount of platelet RNA is affected by aging and most probably by platelet activating mechanisms.^8,9 Apart from contamination by other cells or material, platelets may also incorporate foreign RNA, as demonstrated for tumor biomarkers,¹⁰ and may also transfer their RNA to other cells, as described recently.¹¹

Considering the constraints and technical limitations of both techniques, we decided to choose a statistical approach rather than a straightforward comparison of the data. Quantitative proteomic data reflect normal distributions for protein frequency densities, as to be expected. In contrast, the transcriptome data provided by the authors show an almost exponential distribution, indicating a strong increase in the number of transcripts with decreasing transcript frequency (Figure 1). A recent publication by the Mann group¹² provided evidence that transcriptome data may yield an almost identical frequency density distribution as proteomics data. However, in this analysis, a bimodal distribution was observed when the threshold for detection was set below 1 FPKM—the authors hypothesized that the low-frequency peak results from transcripts indeed not expressed as proteins. In our opinion, lowering the threshold for comparing the data, as proposed by Rowley et al, will thus certainly increase the coverage of the proteome, however, at the expense of validity, because the number of false-positive transcripts will concurrently increase. Because the frequency density distributions of the transcriptome data by Rowley et al and our proteome data do not share any similarities, we chose to rank each data set. In addition, we stratified the data to enable a direct comparison of high, medium, and low expression/transcription. Neither the rank correlation for the whole data set nor the correlation of the stratified data resulted in a correlation coefficient greater than 0.3. By including more low-rank data, the correlation can be improved, as Rowley et al demonstrated in their letter,³ but even then does not exceed 0.5, which would suggest a systematic rather than a purely random relation.

Figure 1. Frequency density distribution of RNAseq and quantified by mass spectometry (qMS) data from human platelets. The traces represent logarithmized original data from the publications.1,2 The lower threshold for RNAseq has been set to RPKM > 0.3 as proposed elsewhere.3

View large Download PPT

Figure 1

Frequency density distribution of RNAseq and quantified by mass spectometry (qMS) data from human platelets. The traces represent logarithmized original data from the publications.^1,2 The lower threshold for RNAseq has been set to RPKM > 0.3 as proposed elsewhere.³

With respect to the articles cited and the reasoning of Rowley et al, we presume that the present discussion may partly result from a misunderstanding of the term “correlation.” Whereas there is no doubt that the presence of a transcript may on the whole serve as an indication for the expression of the related protein and vice versa, any definite or even quantitative claim is only possible by careful, direct observation. The numerous factors affecting the kind and number of transcripts in anucleated cells such as platelets, most of which are concealed from analysis, prohibit a valid quantitative assertion. In contrast, proteomic studies are suited to provide quantitative data on protein expression as we and others could unquestionably show,¹ yet not to appraise the actual presence or absence of a particular protein. In consequence, it seems that neither of the 2 methods on its own is sufficient to meet the requirements of current and, most probably future, systems biology research on human platelets, though most probably nucleated cells may be investigated by both methods with comparable quality of results, as suggested by Nagaray et al.¹²

Authorship

Contribution: J.G. analyzed the statistical data and wrote the manuscript; J.M.B. collected and analyzed the data and edited the manuscript; S.G. provided study material and critically reviewed and edited the manuscript; U.W. and A.S. designed the study and critically reviewed and edited the manuscript; and R.P.Z. designed the study and wrote the manuscript.

Conflict-of-interest disclosure: The authors declare no competing financial interests.

Correspondence: Joerg Geiger, Interdisciplinary Bank of Biomaterials and Data, Straubmuehlweg 2a/Bldg A9, 97078 Wuerzburg, Germany; e-mail: joerg.geiger@uni-wuerzburg.de.

References

1

Burkhart

JM

,

Vaudel

M

,

Gambaryan

S

, et al. ,

The first comprehensive and quantitative analysis of human platelet protein composition allows the comparative analysis of structural and functional pathways.

,

Blood

,

2012

, vol.

120

15

(pg.

e73

-

e82

)

2

Rowley

JW

,

Oler

AJ

,

Tolley

ND

, et al. ,

Genome-wide RNA-seq analysis of human and mouse platelet transcriptomes.

,

Blood

,

2011

, vol.

118

14

(pg.

e101

-

e111

)

3

Rowley

JW

,

Weyrich

AS

.

Coordinate expression of transcripts and proteins in platelets. Blood 2013, in print

4

McRedmond

JP

,

Park

SD

,

Reilly

DF

, et al. ,

Integration of proteomics and genomics in platelets: a profile of platelet proteins and platelet-specific genes.

,

Mol Cell Proteomics

,

2004

, vol.

3

2

(pg.

133

-

144

)

5

Gnatenko

DV

,

Dunn

JJ

,

McCorkle

SR

,

Weissmann

D

,

Perrotta

PL

,

Bahou

WF

. ,

Transcript profiling of human platelets using microarray and serial analysis of gene expression.

,

Blood

,

2003

, vol.

101

6

(pg.

2285

-

2293

)

6

Colombo

G

,

Gertow

K

,

Marenzi

G

, et al. ,

Gene expression profiling reveals multiple differences in platelets from patients with stable angina or non-ST elevation acute coronary syndrome.

,

Thromb Res

,

2011

, vol.

128

2

(pg.

161

-

168

)

7

Amisten

S

. ,

A rapid and efficient platelet purification protocol for platelet gene expression studies.

,

Methods Mol Biol

,

2012

, vol.

788

(pg.

155

-

172

)

8

Harrison

P

,

Goodall

AH

. ,

“Message in the platelet”—more than just vestigial mRNA!

,

Platelets

,

2008

, vol.

19

6

(pg.

395

-

404

)

9

Bray

PF

,

McKenzie

SE

,

Edelstein

LC

, et al. ,

The complex transcriptional landscape of the anucleate human platelet.

,

BMC Genomics

,

2013

, vol.

14

1

pg.

1

10

Nilsson

RJ

,

Balaj

L

,

Hulleman

E

, et al. ,

Blood platelets contain tumor-derived RNA biomarkers.

,

Blood

,

2011

, vol.

118

13

(pg.

3680

-

3683

)

11

Risitano

A

,

Beaulieu

LM

,

Vitseva

O

,

Freedman

JE

. ,

Platelets and platelet-like particles mediate intercellular RNA transfer.

,

Blood

,

2012

, vol.

119

26

(pg.

6288

-

6295

)

12

Nagaraj

N

,

Wisniewski

JR

,

Geiger

T

, et al. ,

Deep proteome and transcriptome mapping of a human cancer cell line.

,

Mol Syst Biol

,

2011

, vol.

7

pg.

548

© 2013 by The American Society of Hematology

2013

Sign in via your Institution