CLASSIFICAÇÃO E SEGMENTAÇÃO DE ÁUDIO A PARTIR DE FATORES DE ESCALA MPEG / CLASSIFICATION AND SEGMENTATION OF MPEG AUDIO BASED ON SCALE FACTORS
AUTOR(ES)
FERNANDO RIMOLA DA CRUZ MANO
DATA DE PUBLICAÇÃO
2007
RESUMO
With the growth of production and storing of digital media, audio segmentation and classification are becoming increasingly important. This work is based on characteristics of the MPEG standard, considered to be the standard for digital media storage and retrieval, to propose efficient algorithms to perform these tasks. While there are many studies based on video analysis, the audio information is still not widely used in an efficient way. The suggested algorithms for both tasks are based only on the scale factors present on layer 2 MPEG audio. That allows them to read the smallest amount of information possible, significantly diminishing the amount of data manipulated during the analysis and making their performance excellent in terms of processing time. The algorithm proposed for audio classification divides audio in four possible types: silent, speech, music and applause. The segmentation algorithm finds significant changes on the audio signal that represent clues of audio segments and scene changes. Tests were made with a wide range of types of video, and both algorithms show good results.
ASSUNTO(S)
classificacao audio analysis classification fatores de escala scale factors analise do audio segmentation mpeg mpeg segmentacao
ACESSO AO ARTIGO
Documentos Relacionados
- Segmentação de imagens por classificação de cores: uma abordagem neural.
- Transformações multi-escala para a segmentação de imagens de impressões digitais
- Segmentation and classification of digital images of cutaneous ulcers through artificial neural networks
- Segmentação de voz baseada na análise fractal e na transformada wavelet.
- LIVER SEGMENTATION AND VISUALIZATION FROM COMPUTER TOMOGRAPHY IMAGES