Nucleotide, dinucleotide and trinucleotide frequencies explain patterns observed in chaos game representations of DNA sequences.
AUTOR(ES)
Goldman, N
RESUMO
The chaos game representation (CGR) is a scatter plot derived from a DNA sequence, with each point of the plot corresponding to one base of the sequence. If the DNA sequence were a random collection of bases, the CGR would be a uniformly filled square; conversely, any patterns visible in the CGR represent some pattern (information) in the DNA sequence. In this paper, patterns previously observed in a variety of DNA sequences are explained solely in terms of nucleotide, dinucleotide and trinucleotide frequencies.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=309551Documentos Relacionados
- Conservation patterns in angiosperm rDNA ITS2 sequences.
- WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences.
- Statistical analysis of nucleotide sequences.
- Symmetry observations in long nucleotide sequences.
- Circular DNA of human immunodeficiency virus: analysis of circle junction nucleotide sequences.