Fast Recognition of Remixed Music Audio
Source:
Honolulu, Hawaii (2007)
URL:
http://cobweb.ecn.purdue.edu/~malcolm/yahoo/Casey2007(FastRecognitionRemixedAudioICASSP).pdf
Abstract:
We present an efficient algorithm for automatically detecting
remixes of pop songs in large commercial collections. Remixes
are closely related as commercial products but they are not closely
related in their audio spectral content because of the nature of the
remixing process. Therefore spectral modelling approaches to audio
similarity fail to recognize them. We propose a new approach– that
chops songs into small chunks called audio shingles– to recognize
remixed songs. We model the distribution of pair-wise distances between shingles by two independent processes– one corresponding to
remix content and the other correspoding to non-remix content in a
database. A nearest neighbour algorithm groups songs if they share
shingles drawn from the remix process. Our results show 1) log-chromagram shingles separate remixed from non-remixed content
with 75%-75% precision-recall performance, cepstral coefficient features do not separate the two distributions adequately 2) increasing
the observations from the remix distribution increases the separability. Efficient implementation follows from the separability of the
distributions using locality sensitive hashing (LSH) which speeds up
automatic grouping of remixes by between one to two orders of magnitude in a 2018-song test set.