Skip to content

pyvoicebox

A complete Python port of the VOICEBOX Speech and Audio Processing toolbox, originally written in MATLAB by Mike Brookes at Imperial College London.

280+ functions, fully typed, validated against the original MATLAB source via GNU Octave with 500+ automated tests.

What is VOICEBOX?

VOICEBOX is a comprehensive MATLAB toolkit for speech and audio signal processing maintained since the 1990s. It covers areas that most Python audio libraries don't touch:

See how it compares to librosa and openSMILE.

Quick install

pip install pyvoicebox-sap                # core (numpy, scipy, soundfile)
pip install "pyvoicebox-sap[plot]"        # with matplotlib for plotting functions

See the Getting Started guide for examples and usage details.

Notebooks

Interactive Jupyter notebooks are available in the notebooks/ directory:

Notebook Description Colab
Visualize Speech Waveform, spectrogram, MFCCs, and pitch tracking Open in Colab
Clean Up Noisy Speech Add noise, run MMSE enhancement, measure SNR improvement Open in Colab
Inside the Vocal Tract LPC spectral envelopes, coefficient conversions, bandwidth expansion Open in Colab
Who Said That? Speaker identification with GMMs Open in Colab
Emotion Recognition TEO vs MFCC features on EmoDB with Random Forest Open in Colab