Por favor, use este identificador para citar o enlazar a este item: http://hdl.handle.net/10261/133865
COMPARTIR / EXPORTAR:
logo share SHARE logo core CORE BASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE

Invitar a revisión por pares abierta
Título

Tonal representations for music retrieval: from version identification to query-by-humming

AutorSalamon, Justin; Serra, Joan CSIC ORCID; Gómez, Emilia
Palabras claveBass line
Music retrieval
Version identification
Query by humming
Music similarity
Cover song detection
Harmony
Melody extraction
Fecha de publicación2013
EditorSpringer Nature
CitaciónInternational Journal of Multimedia Information Retrieval 2 (1): 45- 58 (2013)
ResumenIn this study we compare the use of different music representations for retrieving alternative performances of the same musical piece, a task commonly referred to as version identification. Given the audio signal of a song, we compute descriptors representing its melody, bass line and harmonic progression using state-of-the-art algorithms. These descriptors are then employed to retrieve different versions of the same musical piece using a dynamic programming algorithm based on nonlinear time series analysis. First, we evaluate the accuracy obtained using individual descriptors, and then we examine whether performance can be improved by combining these music representations (i.e. descriptor fusion). Our results show that whilst harmony is the most reliable music representation for version identification, the melody and bass line representations also carry useful information for this task. Furthermore, we show that by combining these tonal representations we can increase version detection accuracy. Finally, we demonstrate how the proposed version identification method can be adapted for the task of query-by-humming. We propose a melody-based retrieval approach, and demonstrate how melody representations extracted from recordings of a cappella singing can be successfully used to retrieve the original song from a collection of polyphonic audio. The current limitations of the proposed approach are discussed in the context of version identification and query-by-humming, and possible solutions and future research directions are proposed.
URIhttp://hdl.handle.net/10261/133865
DOI10.1007/s13735-012-0026-0
Identificadoresdoi: 10.1007/s13735-012-0026-0
issn: 2192-6611
Aparece en las colecciones: (IIIA) Artículos




Ficheros en este ítem:
Fichero Descripción Tamaño Formato
IJMIR2(1)_45-58.pdf684,95 kBAdobe PDFVista previa
Visualizar/Abrir
Mostrar el registro completo

CORE Recommender

SCOPUSTM   
Citations

53
checked on 24-mar-2024

Page view(s)

166
checked on 23-abr-2024

Download(s)

308
checked on 23-abr-2024

Google ScholarTM

Check

Altmetric

Altmetric


NOTA: Los ítems de Digital.CSIC están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.