You recorded a product walkthrough with narration and now need just the audio for a podcast episode. Or you've got a meeting recording and want to extract the discussion as an MP3 for someone who prefers listening on the go.
Extracting audio from video is simpler than most people think.
Browser-Based Extraction
Zumie Video to MP3 Converter — Upload any video, extract the audio track as MP3. Runs entirely in your browser using the Mediabunny media toolkit. No server upload, your video stays on your device.
This is the fastest option for one-off conversions — no software to install, no CLI to learn.
CLI Extraction with FFmpeg
For batch processing or automation, FFmpeg is unbeatable:
Extract Without Re-encoding (Fastest)
If your video already has AAC audio (most MP4s do), you can copy the audio stream directly:
# Extract audio without re-encoding — nearly instant
ffmpeg -i recording.mp4 -vn -acodec copy output.m4a
The -vn flag drops the video stream. -acodec copy copies the audio as-is.
Convert to MP3
If you specifically need MP3 format:
# Convert to MP3 at 192kbps (good quality)
ffmpeg -i recording.mp4 -vn -acodec libmp3lame -ab 192k output.mp3
Audio Quality Options
| Bitrate | Quality | Use Case | File Size (per minute) |
|---|---|---|---|
| 320kbps | Excellent | Music, archival | ~2.4MB |
| 192kbps | Very good | Narration, podcasts | ~1.4MB |
| 128kbps | Good | Voice recordings | ~960KB |
| 64kbps | Acceptable | Speech only | ~480KB |
For screen recording narration, 128-192kbps is more than enough. Human speech doesn't benefit much from higher bitrates.
Common Use Cases
Turn Walkthroughs into Podcast Content
You've recorded a detailed product demo. The visual component is great for YouTube, but the narration also works as a standalone audio piece. Extract the audio, do minimal editing (trim intro/outro), and publish it as a podcast episode.
Archive Meeting Recordings as Audio
Video recordings of Zoom calls are massive files. The visual component (people's faces + screenshare) is rarely worth the storage. Extract the audio for a 90% reduction in file size while keeping everything important.
Create Transcriptions
Most transcription services (Whisper, Otter.ai, Rev) accept audio files and process them faster than video. Extract the audio first for quicker, cheaper transcription.
Accessibility
Some users prefer audio-only content. Providing an MP3 version of your video tutorials makes them accessible to people who are visually impaired or who prefer listening while multitasking.
Audio Formats Compared
| Format | Compression | Quality | Compatibility |
|---|---|---|---|
| MP3 | Lossy | Good | Universal |
| AAC (.m4a) | Lossy | Better than MP3 | Very wide |
| OGG Vorbis | Lossy | Similar to AAC | Good (not iOS Safari) |
| WAV | None | Perfect | Universal (huge files) |
| FLAC | Lossless | Perfect | Wide |
For most purposes, MP3 is the right choice. It's universally supported, well-compressed, and the quality is more than sufficient for speech content.
Tips for Better Audio Extraction
Check the Source Quality First
Extracting audio can't improve what's already there. If the original recording has poor audio (echo, background noise, low volume), the extracted audio will have the same issues.
Normalize Volume
If you're extracting audio from multiple recordings and combining them, normalize the volume so levels are consistent:
ffmpeg -i input.mp3 -filter:a loudnorm output.mp3
Trim Before Extracting
If you only need audio from a specific portion of the video, trim first to reduce processing time:
ffmpeg -ss 00:01:00 -to 00:05:30 -i recording.mp4 -vn -ab 192k output.mp3
Better Recordings from the Start
The best audio extraction starts with a clean recording. Zumie captures your screen with clear audio and efficient encoding, so when you do need to extract the audio track, you're starting with a quality source. Plus, the automatic zoom and backgrounds mean your video looks great too — no reason to settle for audio-only.