Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
AUTOR(ES)
Rodrigues, Thiago de Souza, Pacífico, Lucila Grossi Gonçalves, Teixeira, Santuza Maria Ribeiro, Oliveira, Sérgio Costa, Braga, Antônio de Pádua
FONTE
Genetics and Molecular Biology
DATA DE PUBLICAÇÃO
2004
RESUMO
A new scheme for representing proteins of different lengths in number of amino acids that can be presented to a fixed number of inputs Artificial Neural Networks (ANNs) speel-out classification is described. K-Means's clustering of the new vectors with subsequent classification was then possible with the dimension reduction technique Principal Component Analysis applied previously. The new representation scheme was applied to a set of 112 antigens sequences from several parasitic helminths, selected in the National Center for Biotechnology Information and classified into fourth different groups. This bioinformatic tool permitted the establishment of a good correlation with domains that are already well characterized, regardless of the differences between the sequences that were confirmed by the PFAM database. Additionally, sequences were grouped according to their similarity, confirmed by hierarchical clustering using ClustalW.
Documentos Relacionados
- Automation in accession classification of Brazilian Capsicum germplasm through artificial neural networks
- Artificial neural networks in the classification and identification of soybean cultivars by planting region
- Noise injection for training artificial neural networks: A comparison with weight decay and early stopping
- Prediction of Protein Functional Domains from Sequences Using Artificial Neural Networks
- The use of Wavelet packet transform and artificial neural networks in analysis and classification of dysphonic voices