Bidirectional Alignment of Glottal Pulse Length Sequences for the Evaluation of Pitch Detection Algorithms
Date
2019-10-28
Authors
Ferrer-Riesgo, Carlos A.
Rodriguez-Guillen, Reinier
Nöth, Elmar
Abstract
This paper describes a problem in a previously reported Dynamic Time Warping (DTW) alignment procedure that compares reference and detected glottal pulse length sequences for the evaluation of Pitch Detection Algorithms (PDAs) in pathological voices. Because the existing method aligns only the detected sequence to the reference sequence, it tends to overestimate the failures of the PDA. A solution is presented that performs a bidirectional alignment, reducing the differences that remain in the final comparison. The proposal is evaluated on both synthetic and real voice signals by running three well-known PDAs; the magnitude of the error reduction is reported, along with comments on the factors that may influence it. The alignment variant introduced in this paper allows a fairer comparison of PDA performance.
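The abstract does not spell out the alignment itself, so the following is only a minimal sketch of textbook DTW applied to two pulse length sequences, to illustrate what "aligning" a detected sequence to a reference means. The function name `dtw_path`, the absolute-difference local cost, and the sample values are illustrative assumptions, not taken from the paper, and the sketch does not reproduce the bidirectional procedure the authors propose.

```python
def dtw_path(ref, det):
    """Textbook DTW between two non-empty sequences of pulse lengths.

    Returns (total_cost, path), where path is a list of 0-based
    (ref_index, det_index) pairs. Illustrative sketch only; this is
    not the procedure from the paper.
    """
    n, m = len(ref), len(det)
    inf = float("inf")
    # cost[i][j]: minimal accumulated cost aligning ref[:i] with det[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(ref[i - 1] - det[j - 1])  # assumed local cost
            cost[i][j] = d + min(cost[i - 1][j - 1],  # match
                                 cost[i - 1][j],      # det pulse reused
                                 cost[i][j - 1])      # ref pulse reused
    # Backtrack from the end to recover the warping path.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
        path.append((i - 1, j - 1))
    path.reverse()
    return cost[n][m], path


# Hypothetical pulse lengths in ms; the detector reports one extra pulse.
reference = [8.0, 8.1, 7.9, 8.0]
detected = [8.0, 8.1, 8.1, 7.9, 8.0]
total, path = dtw_path(reference, detected)
print(total, path)
```

In this toy case the warping path pairs the second reference pulse with two detected pulses. How such many-to-one pairings are scored, and in which direction, is the kind of choice that the alignment procedure discussed in the abstract has to make.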
Keywords
Dynamic Time Warping - Pitch Detection Algorithms - Jitter - Alignment
Citation
FERRER, Carlos A.; GUILLÉN, Reinier Rodríguez; NÖTH, Elmar. Bidirectional Alignment of Glottal Pulse Length Sequences for the Evaluation of Pitch Detection Algorithms. In Iberoamerican Congress on Pattern Recognition. Springer, Cham, 2019. p. 707-716.