IECE Transactions on Intelligent Systematics | Volume 2, Issue 1: 27-37, 2024 | DOI:10.62762/TIS.2024.649374
Abstract
Speaker identification systems have gained significant attention due to their potential applications in security and personalized systems. This study evaluates the performance of various time-domain and frequency-domain physical features for text-independent speaker identification. Specifically, four key features (pitch, intensity, spectral flux, and spectral slope) were examined along with their statistical variations (minimum, maximum, and average values). These features were fused with log power spectral features and used to train a Convolutional Neural Network (CNN). The goal was to identify the most effective feature combinations for improving speaker identification accuracy. The experimenta...
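As a rough illustration of the frame-level features named in the abstract, the sketch below computes a log power spectrum, intensity, spectral flux, and spectral slope with NumPy, then reduces each feature to its minimum, maximum, and average. It is not the authors' implementation. The function names, the 16 kHz sampling rate, the 400-sample frame, and the 160-sample hop are all assumptions made for the example. Pitch is left out because it needs a dedicated estimator, and the way the study fuses these statistics with the log power spectral features is not described in the abstract, so no fusion step is shown.

```python
import numpy as np


def frame_signal(x, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def spectral_features(x, sr=16000, frame_len=400, hop=160):
    """Return (log_power, intensity, flux, slope) for a mono signal.

    All parameter defaults are illustrative assumptions, not values
    taken from the paper.
    """
    frames = frame_signal(x, frame_len, hop) * np.hanning(frame_len)
    mag = np.abs(np.fft.rfft(frames, axis=1))        # (n_frames, n_bins)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)   # (n_bins,)

    # Log power spectrum per frame; the small constant avoids log(0).
    log_power = np.log(mag ** 2 + 1e-10)

    # Intensity: mean-square energy of each windowed frame, in dB.
    intensity = 10.0 * np.log10(np.mean(frames ** 2, axis=1) + 1e-10)

    # Spectral flux: Euclidean distance between consecutive magnitude spectra.
    flux = np.sqrt(np.sum(np.diff(mag, axis=0) ** 2, axis=1))

    # Spectral slope: least-squares slope of magnitude against frequency.
    f_centered = freqs - freqs.mean()
    mag_centered = mag - mag.mean(axis=1, keepdims=True)
    slope = mag_centered @ f_centered / np.sum(f_centered ** 2)

    return log_power, intensity, flux, slope


def summarize(v):
    """Minimum, maximum, and average of a per-frame feature track."""
    return np.array([v.min(), v.max(), v.mean()])
```

Calling `summarize` on each of the intensity, flux, and slope tracks gives three values per feature, matching the minimum, maximum, and average variations the abstract mentions.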