Skip to main content


  • vocal imitation of a sound for fine-grained search

    Fine-grained Vocal Imitation Set

    Bongjun Kim, Bryan Pardo

    This dataset includes 763 crowd-sourced vocal imitations of 108 sound events. The sound event recordings were taken from a subset of Vocal Imitation Set.

  • OtoMechanic logo


    Max Morrison and Bryan Pardo

    OtoMobile dataset is a collection of recordings of failing car components, created by the Interactive Audio Lab at Northwestern University.

  • Slakh compared to other datasets.


    Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux

    The Synthesized Lakh (Slakh) Dataset contains 2100 automatically mixed tracks and accompanying MIDI files synthesized using a professional-grade sampling engine.

  • vocal imitation of a sound


    Bongjun Kim, Mark Cartwright, Fatemeh Pishdadian, Bryan Pardo

    VimSketch Dataset combines two publicly available datasets, created by the Interactive Audio Lab for the task of Query by Vocal Imitation (QBV).