May 6, 2021 Natural Language Processing

Identify the emotion of multiple speakers in an Audio Segment

MevonAI-Speech-Emotion-Recognition

Identify the emotion of multiple speakers in a Audio Segment

Getting Started

Follow the Below Instructions for setting the project up on your local Machine.

Installation

Create a python virtual environment

sudo apt install python3-venv
mkdir mevonAI
cd mevonAI
python3 -m venv mevon-env
source mevon-env/bin/activate

Clone the repo

git clone https://github.com/SuyashMore/MevonAI-Speech-Emotion-Recognition.git

Install Dependencies

cd MevonAI-Speech-Emotion-Recognition/
cd src/
sudo chmod +x setup.sh
./setup.sh

Running the Application

Add audio files in .wav format for analysis in src/input/ folder
Run Speech Emotion Recognition using

python3 speechEmotionRecognition.py

By Default , the application will use the Pretrained Model Available in “src/model/”
Diarized files

To finish reading, please visit source site