A Neuromorphic Approach to Emotion Detection from Speech with Spiking Neural Networks

International Journal of Innovative Research in Computer and Communication Engineering

ISSN Approved Journal | Impact factor: 8.771 | ESTD: 2013 | Follows UGC CARE Journal Norms and Guidelines

| Monthly, Peer-Reviewed, Refereed, Scholarly, Multidisciplinary and Open Access Journal | High Impact Factor 8.771 (Calculated by Google Scholar and Semantic Scholar | AI-Powered Research Tool | Indexing in all Major Database & Metadata, Citation Generator | Digital Object Identifier (DOI) |

TITLE	A Neuromorphic Approach to Emotion Detection from Speech with Spiking Neural Networks
ABSTRACT	Speech Emotion Recognition refers to the study of how we can recognize different emotional states through spoken words. Neuromorphic Computing relates to artificial neural networks designed to mimic biological neural circuits or function like biological brains. Spiking Neural Networks (SNNs) relate to artificial neural networks with discrete time representations of activity (i.e., the "spike" of an action potential). Spiking Convolutional Neural Network (SCNN) relates to SNNs which utilize convolutional layers to analyze spatially localized input patterns. Leaky Integrate-and-Fire (LIF) Neuron Model relates to a biologically inspired neural network model that uses LIF neurons. Mel-Frequency Cepstral Coefficients (MFCC) relates to a type of audio feature extraction that represents frequency components of audio in mel-frequency space. Audio Feature Extraction relates to extracting specific types of features from audio to be used as input to machine learning algorithms. Spike-Based Learning relates to how neural networks learn from data when they use spikes to communicate. Temporal Signal Processing relates to signal processing techniques that take advantage of temporal relationships in signals. Event-Driven Computation relates to computing approaches where computation occurs only when events occur (e.g., spikes). Human–Computer Interaction relates to the design and engineering of interfaces that allow humans to interact with computers. Real-Time Emotion Detection relates to detecting emotional states in real-time from people's speech. Brain-Inspired Artificial Intelligence relates to AI systems that are designed to behave similarly to biological brains.
AUTHOR	K.SANDYA RANI, HASANSABUGARI AYAS, MADHAVARAM JOSEPH BABU, KANCHAM SUNIL REDDY, AMBALDAGE CHANDRA VIGNESH Assistant Professor, Department of Artificial Intelligence, G. Pullaiah College of Engineering and Technology, Kurnool, Andhra Pradesh, India Student, Department of Artificial Intelligence, G. Pullaiah College of Engineering and Technology,Kurnool, Andhra Pradesh, India
VOLUME	181
DOI	DOI: 10.15680/IJIRCCE.2026.1402043
PDF	pdf/43_A Neuromorphic Approach to Emotion Detection from Speech with Spiking Neural Networks.pdf
KEYWORDS
References	[1] J. K. Eshwarappa and G. Orchard, “spiking Neural Network Torch (snntorch): a software library for training deep Spiking Neural Networks on commodity hardware,” arXiv preprint arXiv:2110.00840, 2021. [2] A. Paszke et al., “pytorch: an imperative-style, high-level deep learning library,” Advances in Neural Information Processing Systems (NeurIPS), Vol. 32, 2019. [3] S.R. Burkert, “Speech Emotion Recognition using Spiking Neural Networks and Leaky Integrate-and-Fire (LIF) Neurons,” [Your Institution/Project Repository], 2025. (You can insert your actual project title and name here.) [4] L. van der Maaten and G. Hinton, “Visualizing Data Using t-SNE,” Journal of Machine Learning Research, Vol. 9, 2579 – 2605, 2008. (If you’re using t-SNE to visualize your emotion clusters.) [5] Y. Gao, Y. Sandamirskaya, and G. Indiveri, “A Spiking Neural Network Model of Auditory Feature Extraction and Analysis,” Frontiers in Neuroscience, Vol. 13, 424, 2019. [6] M. Zhang et al., “A Survey of Neuromorphic Computing and Spiking Neural Networks: Advancing Toward Biological Efficiency,” IEEE Press, 2022. (To support your use of LIF neurons to achieve efficiency.) [7] J. Wu, Y. Chua, M. Zhang, G. Li, H. Li, and K.C. Tan, “Deep Spiking Convolutional Neural Network for Single-Channel Speech Separation,” Frontiers in Neuroscience, Vol. 12, 945, 2019. [8] F. Pedregosa et al., “scikit-learn: machine learning in python,” Journal of Machine Learning Research, Vol. 12, 2825 – 2830, 2011. (Standard reference for your confusion matrix and accuracy metrics.) [9] L.N. Smith, “a disciplined approach to neural network hyper-parameters: part 1 — learning rate, batch size, momentum, and weight decay,” arXiv preprint arXiv:1803.09820, 2018. (Relevant for the training loop and optimizer settings we covered.)

About Us

The primary objective of IJIRCCE is to serve as an international scholarly platform that enables researchers, innovators, students, and research scholars to disseminate their research findings and technological advancements to a global academic audience.

About Us

GET IN TOUCH

Useful Links

ARTICLES

About Us

GET IN TOUCH

Useful Links