Fast analysis of DNA and protein sequence on Apple IIe: restriction sites search, alignment of short sequence and dot matrix analysis.
AUTOR(ES)
Marck, C
RESUMO
A fast restriction sites search algorithm using a quadruplet look-ahead feature has been written in 6502 assembly language code. The search time, tested on the sequence of pBR322, is 4.1 s/kilobase using a restriction site library including 112 specificities corresponding to a total site length of over 700 bases. The search for a short sequence (less than 36 bases) within a longer one (up to 9999 bases) with a given number of mismatches or gaps allowed has also been written in assembly language. Typical run time for the search of a 12 base sequence with 1, 2 or 3 gaps allowed are 6.2, 9.4 or 13.6 s/kilobase, respectively. The dot matrix analysis needs 7.5 minutes per square kilobase when using a stringency of 15 matched bases out of 25. A 7/21 matrix of two 500 amino acid proteins is obtained in 3 minutes. These three routines are included in DPSA, a general package of programs allowing manipulation and analysis of DNA and protein sequences.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=339445Documentos Relacionados
- 'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers.
- Comparative statistics for DNA and protein sequences: single sequence analysis.
- Comparative statistics for DNA and protein sequences: multiple sequence analysis.
- Search, encounter rates, and the evolution of anisogamy.
- Multiple DNA and protein sequence alignment based on segment-to-segment comparison.