pyvoicebox

$ pip install pyvoicebox-sap ⎘

280+ speech and audio processing functions, fully typed, ported from MATLAB to Python.

A complete port of the VOICEBOX Speech and Audio Processing toolbox by Mike Brookes, Imperial College London.

Get Started → API Reference

Capabilities

Everything you need to analyze, enhance, and transform speech signals

LPC Analysis

60+ functions for linear predictive coding — every conversion between AR, reflection coefficients, line spectra, cepstra, and more.

Speech Enhancement

Spectral subtraction, MMSE estimators, noise estimation, dereverberation, and voice activity detection.

Psychoacoustic Metrics

PESQ/MOS, Speech Intelligibility Index, STOI, phon/sone loudness, active speech level, and segmental SNR.

Pitch Detection

PEFAC, RAPT, and DYPSA glottal closure detection — battle-tested algorithms from decades of speech research.

Gaussian Mixtures

Complete GMM toolkit: EM fitting, scoring, merging, marginals, conditionals, Bhattacharyya and KL divergence.

Rotations & Geometry

Euler angles, rotation matrices, quaternions, polygon/polyhedron geometry, and spherical harmonics.

Developer Experience

Faithful MATLAB port with a Pythonic API

Every function preserves its original VOICEBOX name and behavior, rigorously validated against the MATLAB source via GNU Octave. Drop-in replacements using NumPy arrays.

from pyvoicebox import *
import numpy as np

# Frame audio into overlapping windows
signal = np.random.randn(16000)
frames = v_enframe(signal, 400, 160)

# Mel-scaled filterbank
m, _, _ = v_melbankm(26, 512, 16000)

# LPC analysis → cepstral coefficients
ar, e = v_lpcauto(frames, 12)
cc = v_lpcar2cc(ar, 12)

addpath('voicebox');

% Frame audio into overlapping windows
signal = randn(1, 16000);
frames = v_enframe(signal, 400, 160);

% Mel-scaled filterbank
[m, ~, ~] = v_melbankm(26, 512, 16000);

% LPC analysis → cepstral coefficients
[ar, e] = v_lpcauto(frames, 12);
cc = v_lpcar2cc(ar, 12);

Try it yourself

Interactive notebooks

Jupyter notebooks with real speech data.

Ready to get started?

280+ functions. Fully typed. 500+ tests against the original MATLAB source.

Getting Started → View on GitHub

pyvoicebox

Everything you need to analyze, enhance, and transform speech signals

LPC Analysis

Speech Enhancement

Psychoacoustic Metrics

Pitch Detection

Gaussian Mixtures

Rotations & Geometry

Faithful MATLAB port with a Pythonic API

Interactive notebooks

Visualize Speech

Clean Up Noisy Speech

Inside the Vocal Tract

Who Said That?

Emotion Recognition

Ready to get started?