ISED
Bongjun Kim and Bryan Pardo
Web demo
This work is supported National Science Foundation award 1617107
Interactive Sound Event Detector (I-SED) is a human-in-the-loop interface for sound event annotation that helps users label sound events of interest within a lengthy recording quickly. The annotation is performed by a collaboration between a user and a machine.
The system lets you reduce the time required to search for a set of sound events of interest in an audio recording when there are too few labeled examples (e.g., one) of the sound class to train a state-of-the-art machine audio labeling system.
Demo video
Related publications
[pdf] B. Kim and B. Pardo, “Sound Event Detection using point-labeled data,” in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA, 2019.
[pdf] Bongjun Kim and Bryan Pardo, “A Human-in-the-loop System for Sound Event Detection and Annotation,” ACM Transaction on Interactive Intelligent System (TiiS), Vol. 8, Issue 2, Article 13, July 2018.
[pdf] Bongjun Kim and Bryan Pardo, “I-SED: an Interactive Sound Event Detector,” ACM International Conference on Intelligent User Interface (IUI), Limassol, Cyprus, Mar. 2017