A Random Sequencing Approach for the Analysis of the Trypanosoma cruzi Genome: General Structure, Large Gene and Repetitive DNA Families, and Gene Discovery
AUTOR(ES)
Agüero, Fernán
FONTE
Cold Spring Harbor Laboratory Press
RESUMO
A random sequence survey of the genome of Trypanosoma cruzi, the agent of Chagas disease, was performed and 11,459 genomic sequences were obtained, resulting in ∼4.3 Mb of readable sequences or ∼10% of the parasite haploid genome. The estimated total GC content was 50.9%, with a high representation of A and T di- and trinucleotide repeats. Out of the estimated 5000 parasite genes, 947 putative new genes were identified. Another 1723 sequences corresponded to genes detected previously in T. cruzi through expression sequence tag analysis. 7735 sequences had no matches in the database, but the presence of open reading frames that passed Fickett's test suggests that some might contain coding DNA. The survey was highly redundant, with ∼35% of the sequences included in a few large sequence families. Some of them code for protein families present in dozens of copies, including proteins essential for parasite survival and retrotransposons. Other sequence families include repetitive DNA present in thousands of copies per haploid genome. Some families in the latter group are new, parasite-specific, repetitive DNAs. These results suggest that T. cruzi could constitute an interesting model to analyze gene and genome evolution due to its plasticity in terms of sequence amplification and divergence. Additional information can be found at http://www.iib.unsam.edu.ar/tcruzi.gss.html.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=313047Documentos Relacionados
- A new approach for potential drug target discovery through in silico metabolic pathway analysis using Trypanosoma cruzi genome information
- Gene Discovery through Expressed Sequence Tag Sequencing in Trypanosoma cruzi
- Integration of Cot Analysis, DNA Cloning, and High-Throughput Sequencing Facilitates Genome Characterization and Gene Discovery
- Efficient large-scale sequencing of the Escherichia coli genome: implementation of a transposon- and PCR-based strategy for the analysis of ordered lambda phage clones.
- A large-scale, gene-driven mutagenesis approach for the functional analysis of the mouse genome