Power Pooling: An Adaptive Pooling Function for Weakly Labelled Sound Event Detection

Facebook NLP Research

Abstract

Access to large corpora with strongly labelled sound events is expensive and difficult in engineering applications. Many researches turn to address the problem of how to detect both the types and the timestamps of sound events with weak labels that only specify the types. This task can be treated as a multiple instance learning (MIL) problem, and a key to it in the sound event detection (SED) task is the design of a pooling function. The linear softmax pooling function achieves state-of-the-art

 

 

To finish reading, please visit source site