Atypical regions in large genomic DNA sequences.
AUTOR(ES)
Scherer, S
RESUMO
Large genomic DNA sequences contain regions with distinctive patterns of sequence organization. We describe a method using logarithms of probabilities based on seventh-order Markov chains to rapidly identify genomic sequences that do not resemble models of genome organization built from compilations of octanucleotide usage. Data bases have been constructed from Escherichia coli and Saccharomyces cerevisiae DNA sequences of > 1000 nt and human sequences of > 10,000 nt. Atypical genes and clusters of genes have been located in bacteriophage, yeast, and primate DNA sequences. We consider criteria for statistical significance of the results, offer possible explanations for the observed variation in genome organization, and give additional applications of these methods in DNA sequence analysis.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=44353Documentos Relacionados
- Recognition of protein coding regions in DNA sequences.
- Correlation approach to identify coding regions in DNA sequences.
- In Silico Prediction of Scaffold/Matrix Attachment Regions in Large Genomic Sequences
- Comparisons of eukaryotic genomic sequences.
- Construction of small-insert genomic DNA libraries highly enriched for microsatellite repeat sequences.