Statistical analysis of nucleotide sequences.
AUTOR(ES)
Stückle, E E
RESUMO
In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k not equal to 0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=332623Documentos Relacionados
- Methods and algorithms for statistical analysis of protein sequences.
- 'ZSTATS'--a statistical analysis for potential Z-DNA sequences.
- Circular DNA of human immunodeficiency virus: analysis of circle junction nucleotide sequences.
- On the statistical assessment of similarities in DNA sequences.
- A model for nucleotide sequences.