Electronic International Standard Serial Number (EISSN)
1744-5027
abstract
Advances in chord recognition research using machine learning are hampered by two factors: the scarcity of annotated training data, and the limited complexity of the features and models used. Both problems are intertwined, as with few training examples, increasing the complexity of the model would inevitably lead to overfitting. In this paper we develop a way to address the first problem by exploiting chord annotations from online chord databases. We show how such chord annotations, despite being noisy and lacking exact chord onset times, can be put to use both during the recognition and training stage. We note that the ability to exploit this large untapped resource may enable researchers to also address the second problem: with more training data, one may be able to use more complex models without running the same high risk of overfitting.