Easy to use Audio Tagging in PyTorch
audio-tagging
Audio Classification, Tagging & Sound Event Detection in PyTorch
Progress:
- [x] Fine-tune on audio classification
- [ ] Fine-tune on audio tagging
- [ ] Fine-tune on sound event detection
- [x] Add tagging metrics
- [ ] Add Tutorial
- [x] Add Augmentation Notebook
- [ ] Add more schedulers
- [ ] Add FSDKaggle2019 dataset
- [ ] Add MTT dataset
- [ ] Add DESED
Model Zoo
AudioSet Pretrained Models
Model | Task | mAP (%) |
Sample Rate (kHz) |
Window Length | Num Mels | Fmax | Weights |
---|---|---|---|---|---|---|---|
CNN14 | Tagging | 43.1 | 32 | 1024 | 64 | 14k | download |
CNN14_16k | Tagging | 43.8 | 16 | 512 | 64 | 8k | download |
CNN14_DecisionLevelMax | SED | 38.5 | 32 | 1024 | 64 | 14k | download |
Note: These models will be used as a pretrained model in the fine-tuning