SPX Format Guide

Available Conversions

SPX to AAC

Convert Speex to AAC for Apple device compatibility and modern lossy compression

SPX to AC3

Convert Speex to AC3 (Dolby Digital) for surround sound and home theater systems

SPX to AIFF

Convert Speex to AIFF for professional audio editing and Mac-based workflows

SPX to ALAC

Convert Speex to Apple Lossless for lossless playback on Apple devices

SPX to AMR

Convert Speex to AMR for mobile voice messaging and telephony applications

SPX to CAF

Convert Speex to Core Audio Format for Apple platform audio integration

SPX to DTS

Convert Speex to DTS for cinema and high-fidelity surround sound systems

SPX to FLAC

Convert Speex to FLAC lossless format for archival and high-fidelity storage

SPX to M4A

Convert Speex to M4A for iTunes and Apple ecosystem compatibility

SPX to MKA

Convert Speex to Matroska Audio container for flexible multi-track storage

SPX to MP2

Convert Speex to MP2 (MPEG Audio Layer II) for broadcasting and legacy systems

SPX to MP3

Convert Speex to MP3 for universal playback on any device and platform

SPX to OGG

Convert Speex to OGG Vorbis for open-source general-purpose audio

SPX to OPUS

Convert Speex to its successor Opus for superior speech and music encoding

SPX to TTA

Convert Speex to True Audio for lossless archival with fast decoding

SPX to WAV

Convert Speex to uncompressed WAV for editing and professional production

SPX to WMA

Convert Speex to WMA for Windows ecosystem and Windows Media Player compatibility

SPX to WV

Convert Speex to WavPack for flexible lossless and hybrid compression modes

Convert to SPX

AAC to SPX

Convert AAC to Speex for optimized speech encoding and VoIP applications

AC3 to SPX

Convert Dolby Digital AC3 to Speex for speech-optimized compression

AIFF to SPX

Convert AIFF to Speex for lightweight speech audio with low bandwidth requirements

ALAC to SPX

Convert Apple Lossless to Speex for speech-focused compression and VoIP use

AMR to SPX

Convert AMR to Speex for open-source speech coding with better quality options

CAF to SPX

Convert Core Audio Format to Speex for speech-optimized open-source compression

DTS to SPX

Convert DTS surround audio to Speex for speech extraction and voice encoding

FLAC to SPX

Convert FLAC to Speex for compact speech encoding from lossless source

M4A to SPX

Convert M4A to Speex for open-source speech compression and VoIP integration

MKA to SPX

Convert Matroska Audio to Speex for speech-optimized lightweight encoding

MP2 to SPX

Convert MP2 broadcast audio to Speex for speech-focused compression

MP3 to SPX

Convert MP3 to Speex for optimized speech encoding at very low bitrates

OGG to SPX

Convert OGG Vorbis to Speex for speech-specific encoding within the Ogg ecosystem

Opus to SPX

Convert Opus to its predecessor Speex for legacy VoIP system compatibility

TTA to SPX

Convert True Audio to Speex for speech-optimized compression from lossless source

WAV to SPX

Convert uncompressed WAV to Speex for efficient speech compression and VoIP use

WMA to SPX

Convert WMA to Speex for open-source speech coding from Windows Media format

WV to SPX

Convert WavPack to Speex for speech-optimized encoding from lossless source

About SPX Format

SPX (Speex) is a free, open-source audio codec specifically designed for speech compression. Developed by Jean-Marc Valin under the Xiph.org Foundation, Speex was created to provide a patent-free alternative to proprietary speech codecs used in Voice over IP (VoIP) and other telecommunications applications. The .spx file extension identifies Speex audio files, which are typically encapsulated in the Ogg container format (.ogg or .spx). Speex is optimized for speech signals rather than general-purpose audio, achieving excellent quality at bitrates from 2 to 44 kbps. The codec supports narrowband (8 kHz), wideband (16 kHz), and ultra-wideband (32 kHz) sampling rates, covering the full range of speech telephony requirements. While Speex has been officially superseded by the Opus codec (also from Xiph.org), it remains widely deployed in existing VoIP systems and legacy applications.

History of Speex

Speex was created by Jean-Marc Valin in 2002 while he was a graduate student at the University of Sherbrooke in Quebec, Canada. Valin developed Speex as part of the Xiph.org Foundation's mission to create free, open multimedia standards, complementing the existing Vorbis (music) and Theora (video) codecs. The initial release focused on narrowband speech at 8 kHz, targeting VoIP applications that were growing rapidly in the early 2000s. Wideband and ultra-wideband modes were added in subsequent releases, improving speech quality significantly. Speex quickly gained adoption in open-source VoIP software including Asterisk PBX, Ooh323c, Opal, and Ooh323c SIP stacks. The codec was integrated into numerous commercial and open-source applications, including Skype (early versions), Mumble, TeamSpeak, and various gaming voice chat systems. In 2007, Valin began work on a new codec that would eventually become Opus, designed to handle both speech and music efficiently. In 2012, the IETF standardized Opus as RFC 6716, and Xiph.org officially declared Speex obsolete in favor of Opus. Despite this, Speex continues to be used in many existing systems and remains a part of the Xiph.org codec family.

Key Features and Uses

Speex employs Code-Excited Linear Prediction (CELP) encoding, which models the human vocal tract to achieve efficient speech compression. The codec operates at variable bitrates from 2.15 kbps (narrowband, quality 0) to 44.2 kbps (ultra-wideband, quality 10), with most practical usage between 8 and 24 kbps. Speex supports both Variable Bit Rate (VBR) and Constant Bit Rate (CBR) encoding, with VBR providing better quality-to-size ratios for non-real-time applications. The codec includes built-in acoustic echo cancellation, noise suppression, and automatic gain control features through the SpeexDSP library, making it a complete speech processing toolkit. Speex supports Voice Activity Detection (VAD) and Discontinuous Transmission (DTX), which reduce bandwidth usage during silence periods. The codec's frame size is 20 ms, providing a good balance between latency and compression efficiency for real-time communications. Speex also supports intensity stereo encoding for wideband and ultra-wideband modes.

Common Applications

Speex has been widely deployed in VoIP applications, where its speech-optimized compression and low latency make it ideal for real-time voice communication. The codec is used in open-source PBX systems like Asterisk and FreeSWITCH for internal and external voice calls. Gaming voice chat applications including Mumble and early versions of TeamSpeak used Speex for in-game communication. Speex found significant use in embedded systems and IoT devices where speech compression was needed with minimal computational resources. The codec is used in various recording applications for dictation, voice memos, and interview recordings where speech quality is prioritized over music fidelity. Speex is deployed in conference calling systems and webinar platforms that need efficient narrowband or wideband speech encoding. The SpeexDSP library's echo cancellation and noise suppression features are used independently of the codec in many audio processing applications. While new projects generally adopt Opus instead, Speex remains in active use in legacy VoIP infrastructure, embedded systems with existing Speex implementations, and applications where the simpler Speex decoder is preferred over the more complex Opus decoder.

Advantages and Disadvantages

Advantages

  • Speech-Optimized: Designed specifically for human voice with CELP encoding
  • Very Low Bitrates: Usable speech quality from 2 kbps, excellent at 8-15 kbps
  • Patent-Free: Completely open and free under BSD license
  • Built-in DSP: Echo cancellation, noise suppression, and gain control included
  • Low Latency: 20 ms frame size suitable for real-time communications
  • Voice Activity Detection: VAD and DTX reduce bandwidth during silence
  • Multiple Sampling Rates: Narrowband, wideband, and ultra-wideband modes
  • Lightweight Decoder: Low CPU requirements for embedded and mobile systems
  • Widely Deployed: Extensive existing VoIP and gaming infrastructure

Disadvantages

  • Officially Obsolete: Xiph.org recommends Opus as the successor codec
  • Speech Only: Poor quality for music, sound effects, and non-speech audio
  • Inferior to Opus: Opus provides better quality at all bitrates for speech
  • Limited Sample Rate: Maximum 32 kHz, insufficient for high-fidelity audio
  • No Browser Support: Web browsers do not support Speex natively
  • Declining Adoption: New projects overwhelmingly choose Opus instead
  • No Lossless Mode: Always lossy compression, unsuitable for archival
  • Mono/Stereo Only: No multichannel surround sound support