Speech Noise Clustering

Abstract

An academical effort was invested in finding the intersection between machine learning algorithms and digital signal processing (DSP) tasks. This research tried to utilize clustering algorithms, in particular spectral clustering and Independent Component Analysis, to reduce noise from speech centric audio recordings. The results were compared with a noise-reduced outcome, which was generated using existing post-production techniques, such as equalizing and frequency bandpasses, that were implemented using Python. The noise-reduced audio outcome using machine learning algorithms was not as good as the audio outcome using post-production techniques, but led to some interesting conclusions and ideas for further research.

read complete paper

🔊 hear results using post-production techniques (Python)

🔊 hear results using spectral clustering algorithms (python)

ingredients

Machine learning package - Scikit-learn
DSP package - LibROSA
Data managements - Pandas
Plots - Matplotlib + Seaborn

project GitHub repository