Genome Research 2003-10-01

The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment.

Hilary F Clark, Austin L Gurney, Evangeline Abaya, Kevin Baker, Daryl Baldwin, Jennifer Brush, Jian Chen, Bernard Chow, Clarissa Chui, Craig Crowley, Bridget Currell, Bethanne Deuel, Patrick Dowd, Dan Eaton, Jessica Foster, Christopher Grimaldi, Qimin Gu, Philip E Hass, Sherry Heldens, Arthur Huang, Hok Seon Kim, Laura Klimowski, Yisheng Jin, Stephanie Johnson, James Lee, Lhney Lewis, Dongzhou Liao, Melanie Mark, Edward Robbie, Celina Sanchez, Jill Schoenfeld, Somasekar Seshagiri, Laura Simmons, Jennifer Singh, Victoria Smith, Jeremy Stinson, Alicia Vagts, Richard Vandlen, Colin Watanabe, David Wieand, Kathryn Woods, Ming-Hong Xie, Daniel Yansura, Sothy Yi, Guoying Yu, Jean Yuan, Min Zhang, Zemin Zhang, Audrey Goddard, William I Wood, Paul Godowski, Alane Gray

Index: Genome Res. 13(10) , 2265-70, (2003)

Full Text: HTML

Abstract

A large-scale effort, termed the Secreted Protein Discovery Initiative (SPDI), was undertaken to identify novel secreted and transmembrane proteins. In the first of several approaches, a biological signal sequence trap in yeast cells was utilized to identify cDNA clones encoding putative secreted proteins. A second strategy utilized various algorithms that recognize features such as the hydrophobic properties of signal sequences to identify putative proteins encoded by expressed sequence tags (ESTs) from human cDNA libraries. A third approach surveyed ESTs for protein sequence similarity to a set of known receptors and their ligands with the BLAST algorithm. Finally, both signal-sequence prediction algorithms and BLAST were used to identify single exons of potential genes from within human genomic sequence. The isolation of full-length cDNA clones for each of these candidate genes resulted in the identification of >1000 novel proteins. A total of 256 of these cDNAs are still novel, including variants and novel genes, per the most recent GenBank release version. The success of this large-scale effort was assessed by a bioinformatics analysis of the proteins through predictions of protein domains, subcellular localizations, and possible functional roles. The SPDI collection should facilitate efforts to better understand intercellular communication, may lead to new understandings of human diseases, and provides potential opportunities for the development of therapeutics.

Structure	Name/CAS No.	Molecular Formula	Articles
	L-Amino acid oxidase CAS:9000-89-9	C₃₁H₂₇NO₄
	4-Bromobiphenyl CAS:9001-62-1	C₁₁H₉N₃O_2.Na+
	SPHINGOMYELINASE CAS:9031-54-3
	Acyl coenzyme A synthetase CAS:9013-18-7	C₂₆H₂₂N₄O₇S₂
	3-Hydroxybutyrate dehydrogenase CAS:9028-38-0	C₃₁H₃₈ClN₅O
	Phospholipase D CAS:9001-87-0	C₉H₁₄N₄O₃
	sPLA2 (Type III) CAS:9001-84-7
	Lipoprotein Lipase fromPseudomonas CAS:9004-02-8
	Native Sweet Potato Non-Prostatic Acid Phosphatase CAS:9001-77-8	C₆H₁₀O₂

The status, quality, and expansion of the NIH full-length cD...
2004-10-01
[Genome Res. 14 , 2121-7, (2004)]

Mechanisms of action of escapin, a bactericidal agent in the...
2012-04-01
[Antimicrob. Agents Chemother. 56(4) , 1725-34, (2012)]

Advances in non-snake venom L-amino acid oxidase.
2012-05-01
[Appl. Biochem. Biotechnol. 167(1) , 1-13, (2012)]

Phylogenetic analysis supports horizontal gene transfer of L...
2012-07-01
[Infect. Genet. Evol. 12(5) , 1005-9, (2012)]

L-amino acid oxidase-induced apoptosis in filamentous Botryt...
2012-01-01
[Anal. Biochem. 420(1) , 93-5, (2012)]

The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment.

Abstract

Related Compounds