Convert SHN to SPX

Drag and drop files here or click to select.
Max file size 100mb.
Uploading progress:

SHN vs SPX Format Comparison

Aspect SHN (Source Format) SPX (Target Format)
Format Overview
SHN
Shorten Audio Format

Shorten is a lossless audio compression format created by Tony Robinson at SoftSound in 1993. It was one of the earliest practical lossless audio codecs and became the de facto standard for trading live concert recordings online during the late 1990s and early 2000s, particularly among fans of Grateful Dead, Phish, and other jam bands. Though largely superseded by FLAC, SHN files remain common in legacy music archives.

Lossless Legacy
SPX
Speex Audio Codec

Speex is an open-source, patent-free audio codec designed specifically for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation and first released in 2003. Using Code-Excited Linear Prediction (CELP), Speex achieves excellent speech quality at very low bitrates (2-44 kbps) with built-in features like acoustic echo cancellation, noise suppression, and voice activity detection. While officially superseded by Opus in 2012, Speex remains in use in legacy VoIP and embedded systems.

Lossy Legacy
Technical Specifications
Sample Rates: 8 kHz – 96 kHz
Bit Depth: 8, 16-bit integer
Channels: Mono, Stereo
Codec: Shorten (predictive coding + Huffman)
Container: Raw Shorten stream (.shn)
Sample Rates: 8 kHz (narrowband), 16 kHz (wideband), 32 kHz (ultra-wideband)
Bit Rates: 2.15–44.2 kbps (VBR/CBR)
Channels: Mono, Stereo
Codec: CELP (Code-Excited Linear Prediction)
Container: Ogg Speex (.spx)
Audio Encoding

Shorten uses linear prediction to model audio samples and encodes residuals with Huffman coding, achieving lossless compression ratios of roughly 2:1:

# Decode SHN to WAV (intermediate)
ffmpeg -i input.shn output.wav

# Direct SHN to Speex conversion
ffmpeg -i input.shn -codec:a libspeex \
  -ar 32000 -b:a 44k output.spx

Speex uses CELP analysis-by-synthesis with an adaptive codebook, optimized for modeling human vocal tract characteristics:

# Encode speech at wideband quality
ffmpeg -i input.wav -codec:a libspeex \
  -ar 16000 -b:a 24k output.spx

# Ultra-wideband for best speech quality
ffmpeg -i input.wav -codec:a libspeex \
  -ar 32000 -b:a 44k output.spx
Audio Features
  • Metadata: No native tag support (relies on external .txt files)
  • Album Art: Not supported
  • Gapless Playback: Supported natively — important for live recordings
  • Streaming: Not designed for streaming
  • Seeking: Limited — requires seek tables or full decode
  • Chapters: Not supported
  • Metadata: Vorbis Comment tags in Ogg container
  • Album Art: Possible via Ogg metadata (rarely used)
  • Gapless Playback: Supported through Ogg container framing
  • Streaming: Designed for real-time VoIP streaming
  • Seeking: Ogg page-based seeking
  • Chapters: Not commonly used
Advantages
  • Bit-perfect lossless compression preserving every audio detail
  • Historical standard for live concert recording archives
  • Simple codec with fast decoding speed
  • Gapless playback ideal for continuous live performances
  • Widely recognized in tape-trading and bootleg communities
  • Extremely compact files — speech at 8-24 kbps
  • Open-source and completely royalty-free
  • Built-in acoustic echo cancellation and noise suppression
  • Voice activity detection for silence compression
  • Very low CPU requirements — runs on embedded hardware
  • Optimized specifically for human voice reproduction
Disadvantages
  • Obsolete — superseded by FLAC with better compression
  • Limited software support in modern players
  • No metadata or tagging capability
  • Larger files than FLAC for equivalent lossless content
  • Poor seeking performance without seek tables
  • Poor music quality — designed exclusively for speech
  • Officially obsoleted by Opus codec since 2012
  • Maximum 32 kHz sample rate limits audio bandwidth
  • Limited modern software and hardware support
  • Not suitable for preserving musical instrument fidelity
Common Uses
  • Live concert recording archives (Grateful Dead, Phish)
  • Legacy lossless music collections from 1990s–2000s
  • Tape-trading community distributions
  • Archival of audience recordings and soundboard tapes
  • Source files for re-encoding to modern formats
  • Legacy VoIP and conferencing systems
  • Embedded voice recording on constrained hardware
  • Audiobook and spoken word compression
  • Voice memo and dictation storage
  • Low-bandwidth voice communication links
Best For
  • Preserving original live concert recordings bit-perfectly
  • Maintaining legacy archive compatibility
  • Source material for transcoding to any target format
  • Collections where historical provenance matters
  • Extracting spoken content (stage banter, announcements)
  • Legacy systems requiring Speex input format
  • Ultra-compact voice-only archives
  • Embedded devices with Speex-only decoder support
Version History
Introduced: 1993 (Tony Robinson, SoftSound)
Current Version: Shorten 3.x
Status: Legacy, no active development
Evolution: Shorten (1993) → largely replaced by FLAC (2001)
Introduced: 2003 (Jean-Marc Valin, Xiph.Org)
Current Version: Speex 1.2.1 (2019)
Status: Maintenance mode — Opus recommended instead
Evolution: Speex 1.0 (2003) → 1.2 (2008) → obsoleted by Opus (2012)
Software Support
Media Players: foobar2000, VLC, Winamp (plugin)
Decoders: FFmpeg, shorten CLI tool
Mobile: Not natively supported
Web Browsers: Not supported
Archives: etree.org, archive.org, bt.etree.org
Media Players: VLC, foobar2000, mpv
Encoders: speexenc, FFmpeg (libspeex)
Mobile: Limited (VLC on Android/iOS)
Web Browsers: Not natively supported
VoIP: Legacy Asterisk, FreeSWITCH, Ogg streaming

Why Convert SHN to SPX?

Converting SHN to Speex serves a specialized purpose: extracting and compressing spoken content from concert recordings at extremely low bitrates. Speex is a speech-optimized codec, not a general music format, so this conversion is appropriate primarily for stage banter, audience announcements, spoken introductions, and other voice-centric segments from live recordings. For these specific use cases, Speex produces remarkably compact files with clear vocal reproduction.

Concert recordings often contain valuable spoken content beyond the music itself. Band members introduce songs, make dedications, interact with the audience, and share stories between musical passages. Historians and researchers studying live music culture may want to extract and archive these spoken segments efficiently. Speex at 16-24 kbps produces intelligible, clear speech in files that are 95% smaller than the original lossless SHN data.

Speex also remains relevant for legacy VoIP and embedded systems that specifically require the Speex codec. Some older conferencing platforms, Asterisk PBX installations, and industrial voice recording systems accept only Speex input. If you need to integrate concert audio into one of these legacy systems — for example, creating a phone-based concert information line or embedding audio in an older VoIP application — SHN-to-Speex conversion is necessary.

It is important to understand that Speex will significantly degrade the musical content of concert recordings. The CELP codec is modeled on the human vocal tract and does not faithfully reproduce instruments, harmonics, or the spatial ambience of a live venue. For general music listening, use FLAC, MP3, AAC, OGG, or Opus instead. Reserve Speex conversion for cases where speech intelligibility is the goal and music fidelity is not required.

Key Benefits of Converting SHN to SPX:

  • Ultra-Compact: Speech at 8-24 kbps — files are 95%+ smaller than SHN
  • Speech Optimized: CELP coding models human voice for clear vocal reproduction
  • Noise Suppression: Built-in processing reduces background noise from recordings
  • Open Source: Patent-free, royalty-free, part of Xiph.Org ecosystem
  • Legacy VoIP: Compatible with Asterisk, FreeSWITCH, and older conferencing
  • Low CPU: Decodes on minimal hardware including embedded systems
  • Voice Detection: Built-in VAD compresses silence efficiently

Practical Examples

Example 1: Stage Banter Archive

Scenario: A researcher studying Jerry Garcia's between-song conversations needs to extract and archive spoken segments from hundreds of SHN concert recordings in the most compact format possible.

Source: Extracted banter clips from GD SHN recordings
Conversion: SHN → SPX (wideband, 24 kbps)
Result: Hours of speech in minimal storage

Banter archive:
  Original SHN clips: 15 GB (500 banter segments)
  Speex output: 450 MB (same 500 segments)
  → 97% reduction while preserving speech clarity
  + Voice activity detection skips applause gaps
  + Searchable by show date and venue in Ogg tags
  + Jerry's voice clearly intelligible at 24 kbps

Example 2: Concert Information Phone System

Scenario: A venue operates a legacy Asterisk-based phone system that plays back concert information and audio clips to callers, requiring Speex format for its VoIP codec.

Source: SHN concert audio clips and announcements
Conversion: SHN → SPX (narrowband, 16 kbps)
Result: Phone-quality audio for Asterisk PBX

VoIP integration:
+ Speex is native to Asterisk conferencing system
+ 8 kHz narrowband matches telephone audio quality
+ 16 kbps fits within VoIP bandwidth allocation
+ Callers hear concert clips and show information
+ Minimal server storage for audio prompts library

Example 3: Spoken Word Podcast from Concert Archives

Scenario: A podcast producer creates episodes about live music history using spoken segments extracted from SHN recordings, needing lightweight audio for supplementary web content.

Source: Selected spoken clips from SHN concert archives
Conversion: SHN → SPX (ultra-wideband, 44 kbps)
Result: Web-optimized speech clips

Podcast supplement use:
+ Ultra-compact files for web page embedding
+ 44 kbps UWB provides good speech clarity
+ Spoken introductions and band commentary preserved
+ Minimal bandwidth for mobile web visitors
+ Used alongside higher-quality music clips in other formats

Frequently Asked Questions (FAQ)

Q: What is Speex and why is it speech-specific?

A: Speex uses Code-Excited Linear Prediction (CELP), a technique that models audio as if it were produced by the human vocal tract. This makes it extremely efficient for speech — capturing voice at very low bitrates with good intelligibility. However, this same approach means it poorly represents instruments, harmonics, and complex audio scenes. Speex was created by Jean-Marc Valin (who later co-created Opus) specifically for voice communication, not music.

Q: Will my concert music sound good in Speex?

A: No. Speex significantly degrades musical content because its CELP model is optimized for voice frequencies and characteristics. Instruments will sound muffled, distorted, or artificial. Audience ambience and room acoustics will be lost. Use Speex only for spoken content (stage banter, announcements, spoken introductions). For music from concert recordings, use FLAC, MP3, AAC, OGG Vorbis, or Opus instead.

Q: Is Speex still relevant or should I use Opus?

A: The Xiph.Org Foundation officially recommends Opus over Speex for all new applications. Opus was designed as Speex's successor and outperforms it in every dimension — speech quality, music quality, latency, and bitrate efficiency. The only reasons to use Speex are compatibility with legacy systems that specifically require it (older VoIP platforms, embedded devices with Speex-only decoders) or when working with existing Speex infrastructure.

Q: What sample rate and bitrate should I use?

A: For speech extraction from concerts, wideband (16 kHz, 24 kbps) provides the best balance of quality and size — voice sounds natural and clear. Ultra-wideband (32 kHz, 44 kbps) adds minimal quality improvement for speech. Narrowband (8 kHz, 16 kbps) sounds like a telephone call but produces the smallest files. Choose based on your quality requirements and storage constraints.

Q: Can I play SPX files on my computer or phone?

A: VLC plays Speex files on all platforms — Windows, macOS, Linux, Android, and iOS. foobar2000 and mpv also support Speex decoding. However, native operating system media players generally do not support .spx files. Web browser support is also absent. If playback convenience matters, consider Opus instead, which has much broader native support while still being efficient for speech content.

Q: How does Speex noise suppression help with concert recordings?

A: Speex includes built-in preprocessing features like noise suppression and automatic gain control. When converting spoken segments from concert recordings, these features can help reduce background crowd noise, venue reverb, and instrument bleed, making the speech clearer. However, this processing also removes ambient concert atmosphere. For pure speech clarity from noisy recordings, this is a benefit; for preserving the concert experience, it is a limitation.

Q: How small are Speex files compared to other formats?

A: Extremely small. At wideband 24 kbps, one minute of speech takes about 180 KB in Speex. Compare this to: SHN (roughly 10 MB/min), FLAC (roughly 5 MB/min), MP3 320k (roughly 2.4 MB/min), Opus 128k (roughly 960 KB/min). Speex produces files 55 times smaller than SHN for speech content. This extreme compression is the main reason to choose Speex — when storage and bandwidth are severely constrained.

Q: How long does SHN to Speex conversion take?

A: Very fast — Speex encoding is lightweight by design, originally intended for real-time voice communication. A 70-minute recording converts in 1-3 minutes, roughly 30-50 times faster than real-time. The process involves SHN decoding (fast), resampling to Speex's supported rates (trivial), and Speex encoding (very fast). Batch processing of many segments completes quickly.