Document detail
ID

oai:pubmedcentral.nih.gov:1036...

Topic
Article
Author
Sweatt, Andrew J. Griffiths, Cameron D. Paudel, B. Bishal Janes, Kevin A.
Langue
en
Editor

Cold Spring Harbor Laboratory

Category

biorxiv

Year

2023

listing date

8/2/2023

Keywords
models cancer abundance cell biological luminal protein
Metrics

Abstract

Protein copy numbers constrain systems-level properties of regulatory networks, but absolute proteomic data remain scarce compared to transcriptomics obtained by RNA sequencing.

We addressed this persistent gap by relating mRNA to protein statistically using best-available data from quantitative proteomics–transcriptomics for 4366 genes in 369 cell lines.

The approach starts with a central estimate of protein copy number and hierarchically appends mRNA-protein and mRNA-mRNA dependencies to define an optimal gene-specific model that links mRNAs to protein.

For dozens of independent cell lines and primary prostate samples, these protein inferences from mRNA outmatch stringent null models, a count-based protein-abundance repository, and empirical protein-to-mRNA ratios.

The optimal mRNA-to-protein relationships capture biological processes along with hundreds of known protein-protein interaction complexes, suggesting mechanistic relationships are embedded.

We use the method to estimate viral-receptor abundances of CD55–CXADR from human heart transcriptomes and build 1489 systems-biology models of coxsackievirus B3 infection susceptibility.

When applied to 796 RNA sequencing profiles of breast cancer from The Cancer Genome Atlas, inferred copy-number estimates collectively reclassify 26% of Luminal A and 29% of Luminal B tumors.

Protein-based reassignments strongly involve a pharmacologic target for luminal breast cancer (CDK4) and an α-catenin that is often undetectable at the mRNA level (CTTNA2).

Thus, by adopting a gene-centered perspective of mRNA-protein covariation across different biological contexts, we achieve accuracies comparable to the technical reproducibility limits of contemporary proteomics.

The collection of gene-specific models is assembled as a web tool for users seeking mRNA-guided predictions of absolute protein abundance ( http://janeslab.shinyapps.io/Pinferna ).

SIGNIFICANCE STATEMENT: It is easier to quantify mRNA in cells than it is to quantify protein, but proteins are what execute most cellular functions.

Even though protein is synthesized from mRNA in cells, relating a cellular quantity of mRNA to a quantity of protein is challenging.

Here, we bring together quantitative measures of mRNA and protein for 4366 genes in 369 different cultured cell types to build equations that predict protein abundance from the abundance of mRNAs expressed.

These equations capture facets of biological regulation and work better than existing alternatives that rely on consensus values or ratios.

Since mRNA measurements are more widespread than protein, this study makes new analyses possible by protein estimation from mRNA.

Sweatt, Andrew J.,Griffiths, Cameron D.,Paudel, B. Bishal,Janes, Kevin A., 2023, Proteome-wide copy-number estimation from transcriptomics , Cold Spring Harbor Laboratory

Document

Open Open

Share

Source

Articles recommended by ES/IODE AI

A Novel MR Imaging Sequence of 3D-ZOOMit Real Inversion-Recovery Imaging Improves Endolymphatic Hydrops Detection in Patients with Ménière Disease
ménière disease p < detection imaging sequences 3d-zoomit 3d endolymphatic real tse reconstruction ir inversion-recovery hydrops ratio
Successful omental flap coverage repair of a rectovaginal fistula after low anterior resection: a case report
rectovaginal fistula rectal cancer low anterior resection omental flap muscle flap rectal cancer pod initial repair rvf flap omental lar coverage