TY - JOUR
T1 - Functional annotation of proteome encoded by human chromosome 22
AU - Pinto, Sneha M.
AU - Manda, Srikanth S.
AU - Kim, Min Sik
AU - Taylor, Kyonese
AU - Selvan, Lakshmi Dhevi Nagarajha
AU - Balakrishnan, Lavanya
AU - Subbannayya, Tejaswini
AU - Yan, Fangfei
AU - Prasad, T. S.Keshava
AU - Gowda, Harsha
AU - Lee, Charles
AU - Hancock, William S.
AU - Pandey, Akhilesh
PY - 2014/6/6
Y1 - 2014/6/6
N2 - As part of the chromosome-centric human proteome project (C-HPP) initiative, we report our progress on the annotation of chromosome 22. Chromosome 22, spanning 51 million base pairs, was the first chromosome to be sequenced. Gene dosage alterations on this chromosome have been shown to be associated with a number of congenital anomalies. In addition, several rare but aggressive tumors have been associated with this chromosome. A number of important gene families including immunoglobulin lambda locus, Crystallin beta family, and APOBEC gene family are located on this chromosome. On the basis of proteomic profiling of 30 histologically normal tissues and cells using high-resolution mass spectrometry, we show protein evidence of 367 genes on chromosome 22. Importantly, this includes 47 proteins, which are currently annotated as "missing" proteins. We also confirmed the translation start sites of 120 chromosome 22-encoded proteins. Employing a comprehensive proteogenomics analysis pipeline, we provide evidence of novel coding regions on this chromosome which include upstream ORFs and novel exons in addition to correcting existing gene structures. We describe tissue-wise expression of the proteins and the distribution of gene families on this chromosome. These data have been deposited to ProteomeXchange with the identifier PXD000561.
AB - As part of the chromosome-centric human proteome project (C-HPP) initiative, we report our progress on the annotation of chromosome 22. Chromosome 22, spanning 51 million base pairs, was the first chromosome to be sequenced. Gene dosage alterations on this chromosome have been shown to be associated with a number of congenital anomalies. In addition, several rare but aggressive tumors have been associated with this chromosome. A number of important gene families including immunoglobulin lambda locus, Crystallin beta family, and APOBEC gene family are located on this chromosome. On the basis of proteomic profiling of 30 histologically normal tissues and cells using high-resolution mass spectrometry, we show protein evidence of 367 genes on chromosome 22. Importantly, this includes 47 proteins, which are currently annotated as "missing" proteins. We also confirmed the translation start sites of 120 chromosome 22-encoded proteins. Employing a comprehensive proteogenomics analysis pipeline, we provide evidence of novel coding regions on this chromosome which include upstream ORFs and novel exons in addition to correcting existing gene structures. We describe tissue-wise expression of the proteins and the distribution of gene families on this chromosome. These data have been deposited to ProteomeXchange with the identifier PXD000561.
KW - "missing" proteins
KW - genome annotation
KW - human proteome
KW - uORF
UR - http://www.scopus.com/inward/record.url?scp=84902108690&partnerID=8YFLogxK
U2 - 10.1021/pr401169d
DO - 10.1021/pr401169d
M3 - Article
C2 - 24669763
AN - SCOPUS:84902108690
SN - 1535-3893
VL - 13
SP - 2749
EP - 2760
JO - Journal of Proteome Research
JF - Journal of Proteome Research
IS - 6
ER -