Hostname: page-component-8448b6f56d-42gr6 Total loading time: 0 Render date: 2024-04-15T15:36:29.101Z Has data issue: false hasContentIssue false

MARSYAS: a framework for audio analysis

Published online by Cambridge University Press:  16 November 2000

George Tzanetakis
Affiliation:
Department of Computer Science, 35 Olden Street, and Department of Music, Princeton University, Princeton, NJ 08544, USA. E-mail: gtzan@cs.princeton.edu, prc@cs.princeton.edu Fax: + 1609-258-1771
Perry Cook
Affiliation:
Department of Computer Science, 35 Olden Street, and Department of Music, Princeton University, Princeton, NJ 08544, USA. E-mail: gtzan@cs.princeton.edu, prc@cs.princeton.edu Fax: + 1609-258-1771

Abstract

Existing audio tools handle the increasing amount of computer audio data inadequately. The typical tape-recorder paradigm for audio interfaces is inflexible and time consuming, especially for large data sets. On the other hand, completely automatic audio analysis and annotation is impossible using current techniques. Alternative solutions are semi-automatic user interfaces that let users interact with sound in flexible ways based on content. This approach offers significant advantages over manual browsing, annotation and retrieval. Furthermore, it can be implemented using existing techniques for audio content analysis in restricted domains. This paper describes MARSYAS, a framework for experimenting, evaluating and integrating such techniques. As a test for the architecture, some recently proposed techniques have been implemented and tested. In addition, a new method for temporal segmentation based on audio texture is described. This method is combined with audio analysis techniques and used for hierarchical browsing, classification and annotation of audio files.

Type
Research Article
Copyright
© 1999 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)