上海交通大学学报(英文版) ›› 2014, Vol. 19 ›› Issue (6): 663-668.doi: 10.1007/s12204-014-1563-x

• • 上一篇    下一篇

Genome-Wide Identification of Coding Small Open Reading Frames: The Unknown Transcriptome

LI Hong-meia (李红梅), HU Chuan-shenga (胡传圣), BAI Lingb* (白玲)   

  1. (a. Key Laboratory of Systems Biomedicine (Ministry of Education); b. School of Biomedical Engineering, Shanghai Jiaotong University, Shanghai 200240, China)
  • 出版日期:2014-12-31 发布日期:2014-12-08
  • 通讯作者: BAI Ling (白玲) E-mail: lbai@sjtu.edu.cn

Genome-Wide Identification of Coding Small Open Reading Frames: The Unknown Transcriptome

LI Hong-meia (李红梅), HU Chuan-shenga (胡传圣), BAI Lingb* (白玲)   

  1. (a. Key Laboratory of Systems Biomedicine (Ministry of Education); b. School of Biomedical Engineering, Shanghai Jiaotong University, Shanghai 200240, China)
  • Online:2014-12-31 Published:2014-12-08
  • Contact: BAI Ling (白玲) E-mail: lbai@sjtu.edu.cn

摘要: The identification of the complete repertoire of functional peptides in a cell is ultimately essential for a systems-wide understanding of its behavior. There have indeed been a plethora of studies purportedly designed to this end. However, these studies in fact routinely overlook a potentially significant portion of their data that might encode for peptides that are smaller than 100 amino acids. This is largely owing to technical reasons associated with the difficulty of distinguishing, with statistical significance, a coding sequence of this length from a non-coding sequence. Recently, a growing number of studies have shown that there are indeed many small open reading frame (sORF) encoded peptides that play important roles in a wide range of different biological processes. As such, there is now significant interest in methodologies that can be used to identify this drastically neglected portion of the cellular proteome. In this review, we introduce the presently known annotated sORFs and describe the new strategies that have been used to determine the coding sORFs, genome-wide.

关键词: small peptides, mass spectroscopy, ribosome footprinting, bioinformatics

Abstract: The identification of the complete repertoire of functional peptides in a cell is ultimately essential for a systems-wide understanding of its behavior. There have indeed been a plethora of studies purportedly designed to this end. However, these studies in fact routinely overlook a potentially significant portion of their data that might encode for peptides that are smaller than 100 amino acids. This is largely owing to technical reasons associated with the difficulty of distinguishing, with statistical significance, a coding sequence of this length from a non-coding sequence. Recently, a growing number of studies have shown that there are indeed many small open reading frame (sORF) encoded peptides that play important roles in a wide range of different biological processes. As such, there is now significant interest in methodologies that can be used to identify this drastically neglected portion of the cellular proteome. In this review, we introduce the presently known annotated sORFs and describe the new strategies that have been used to determine the coding sORFs, genome-wide.

Key words: small peptides, mass spectroscopy, ribosome footprinting, bioinformatics

中图分类号: