openSputnik—a database to ESTablish comparative plant genomics using unsaturated sequence collections
AUTOR(ES)
Rudd, Stephen
FONTE
Oxford University Press
RESUMO
The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=539994Documentos Relacionados
- Sputnik: a database platform for comparative plant genomics
- CORG: a database for COmparative Regulatory Genomics
- MolliGen, a database dedicated to the comparative genomics of Mollicutes
- Using genetic diversity information to establish core collections of Stylosanthes capitata and Stylosanthes macrocephala
- PlantsP: a functional genomics database for plant phosphorylation