Audio-based emotion recognition has many applications in human-computer interaction, mental health assessment, and customer service analytics. This paper presents a machine-learning-based, on-device system that recognizes seven emotions (i.e., anger, disgust, fear, happiness, neutrality, sadness, and surprise) from audio on low-cost embedded devices. We show how the speaker's emotional state influences acoustic features such as intensity and shimmer. Classifying emotions from audio remains challenging, however, because the same emotion can sound markedly different across speakers. Our extensive evaluation of lightweight machine learning models yields an overall F1-score of 61.2% with a response time below 50 ms and a memory footprint of 256 KB on modern embedded devices.
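To make the acoustic features named above concrete, the sketch below shows one way intensity and shimmer could be extracted using Praat via the open-source parselmouth package. This is an illustration, not the paper's actual pipeline; the pitch floor/ceiling (75/600 Hz) and the shimmer arguments are standard Praat defaults assumed here.

```python
# Illustrative sketch only -- not the paper's implementation.
# Extracts two of the acoustic features mentioned in the abstract
# (mean intensity and local shimmer) with Praat via parselmouth.
import parselmouth
from parselmouth.praat import call

def intensity_and_shimmer(wav_path: str) -> tuple[float, float]:
    snd = parselmouth.Sound(wav_path)

    # Mean intensity (dB) over the whole utterance, energy-averaged.
    intensity_db = call(snd.to_intensity(), "Get mean", 0, 0, "energy")

    # Local shimmer: average cycle-to-cycle amplitude variability.
    # Pitch floor/ceiling (75/600 Hz) and the remaining arguments are
    # the standard Praat defaults, assumed here for illustration.
    points = call(snd, "To PointProcess (periodic, cc)", 75, 600)
    shimmer = call([snd, points], "Get shimmer (local)",
                   0, 0, 0.0001, 0.02, 1.3, 1.6)
    return intensity_db, shimmer
```

Features like these would then feed a lightweight classifier; on an embedded target, the extraction itself would typically be reimplemented in C rather than Python.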