Identification of protein coding regions in the human genome by quadratic discriminant analysis
AUTOR(ES)
Zhang, M. Q.
FONTE
The National Academy of Sciences of the USA
RESUMO
A new method for predicting internal coding exons in genomic DNA sequences has been developed. This method is based on a prediction algorithm that uses the quadratic discriminant function for multivariate statistical pattern recognition. Substantial improvements have been made (with only 9 discriminant variables) when compared with existing methods: hexon [Solovyev, V. V., Salamov, A. A. & Lawrence, C. B. (1994) Nucleic Acids Res. 22, 5156–5163] (based on linear discriminant analysis) and grail2 [Uberbacher, E. C. & Mural, R. J. (1991) Proc. Natl. Acad. Sci. USA 88, 11261–11265] (based on neural networks). A computer program called mzef is freely available to the genome community and allows users to adjust prior probability and to output alternative overlapping exons.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=19553Documentos Relacionados
- Principal Component Analysis with Linear and Quadratic Discriminant Analysis for Identification of Cancer Samples Based on Mass Spectrometry
- Self-identification of protein-coding regions in microbial genomes
- Splice site prediction with quadratic discriminant analysis using diversity measure
- Identification and Analysis of Over 2000 Ribosomal Protein Pseudogenes in the Human Genome
- A Quadratic Discriminant of Zygosity from Fingerprints