Mobile Music Modeling, Analysis and Recognition
Venue
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012)
Publication Year
2012
Authors
Pavel Golik, Boulos Harb, Ananya Misra, Michael Riley, Alex Rudnick, Eugene Weinstein
BibTeX
Abstract
We present an analysis of music modeling and recognition techniques in the context
of mobile music matching, substantially improving on the techniques presented in
[Mohri et al., 2010]. We accomplish this by adapting the features specifically to
this task, and by introducing new modeling techniques that enable using a corpus of
noisy and channel-distorted data to improve mobile music recognition quality. We
report the results of an extensive empirical investigation of the system's
robustness under realistic channel effects and distortions. We show an improvement
of recognition accuracy by explicit duration modeling of music phonemes and by
integrating the expected noise environment into the training process. Finally, we
propose the use of frame-to-phoneme alignment for high-level structure analysis of
polyphonic music.
