The audio format ecosystem looks intimidating from the outside and turns out, on closer inspection, to be governed by a small number of decisions. The format that is right for you depends on what you are doing with the audio, who needs to hear it, what equipment they will use, and how long you need the file to remain useful. The same person picks WAV for studio capture, FLAC for the home library, MP3 for the car stereo, and Opus for a voice memo, and each choice is correct in its context.

This article maps the formats to the people who use them. A music collector ripping CDs has different needs than a sound designer mixing for film, and both differ from a journalist recording an interview on a phone. The technical facts about each format are useful only when paired with the question of who is making the choice and why. The recommendations below are calibrated to the realistic constraints of each user category, including budget, ecosystem lock-in, and the patience for technical configuration.

The Three Categories That Matter

Audio formats divide cleanly into three categories: uncompressed, lossless compressed, and lossy compressed. The rest of the format zoo sits inside these three.

Uncompressed formats store raw PCM samples at a fixed sample rate and bit depth. WAV, AIFF, and BWF are the common containers. File sizes are large (about 10 MB per minute for stereo CD-quality, 30 MB per minute for 24-bit / 96 kHz), but the format is simple, fast to read, and zero-overhead for editing. Use uncompressed for studio capture, editing, and any work where every operation runs directly on samples.

Lossless compressed formats reduce file size by roughly half without altering a single sample. FLAC, ALAC, WavPack, and Monkey's Audio fall into this group. The compression is reversible: decompressing a FLAC produces a bit-identical PCM stream to the original. Use lossless compressed for archiving music, distributing high-quality downloads, and any case where storage cost matters but quality must not be compromised.

Lossy compressed formats discard psychoacoustically inaudible information to achieve dramatically smaller files. MP3, AAC, Opus, Vorbis, and Musepack are the common formats. At sufficient bitrates (typically 192 kbps or more), the loss is inaudible on consumer playback equipment. At low bitrates, the loss becomes obvious. Use lossy compressed for distribution to listeners, storage on space-constrained devices, and streaming.

CategoryFormatsTypical Size (1 hour stereo)When To Use
UncompressedWAV, AIFF, BWF600 MB at 16/44.1, 1 GB at 24/48Studio capture, editing, mastering
Lossless compressedFLAC, ALAC, WavPack300-400 MBArchiving, distribution to audiophiles
Lossy compressed (high)AAC 256, MP3 320, Opus 128120-150 MBPremium streaming, downloads
Lossy compressed (mid)AAC 128, MP3 192, Opus 9660-90 MBGeneral consumer listening
Lossy compressed (low)AAC 64, MP3 96, Opus 4825-45 MBPodcasts, voice content
Lossy compressed (very low)Opus 16-328-16 MBVoice messaging, low-bandwidth voice
"The right amount of fidelity is the amount the listener can hear under their actual listening conditions. Anything more is engineering vanity." Susan Rogers, This Is What It Sounds Like

Choose By User Type

Different user types have different correct answers. The technical facts are the same; the priorities differ.

The casual listener wants easy-to-play files that work everywhere and do not eat phone storage. The right answer is AAC at 256 kbps in M4A or MP3 at 320 kbps. Either sounds excellent on consumer earbuds and phone speakers. Apple Music and Spotify both deliver in proprietary formats that effectively land here. There is no audible reason for this user to seek lossless.

The music collector and audiophile wants the same file to sound right on a $5,000 home audio system and on a phone in the gym. The right answer is FLAC for the library and on-the-fly transcoding to MP3 or AAC for the phone. Foobar2000, Roon, and Apple's own iTunes/Music app all support this workflow. Storage is cheap; archive in lossless and let software handle delivery.

The studio engineer wants files that survive every editing operation without compounding loss. The right answer is uncompressed WAV at 24-bit / 48 kHz minimum, kept in WAV throughout production, and converted to delivery formats only at the final stage. Multitrack sessions live in WAV; stems are exported as WAV; the final mix is mastered as WAV; lossy delivery is generated last from the WAV master.

The podcaster wants files small enough to download on phone data plans and universally compatible with podcast apps. The right answer is MP3 at 96 kbps mono. Some publishers add an Opus alternative for listeners on modern apps where the same quality fits in half the bytes.

The voice memo user wants quick capture with minimal storage and easy transcription. The right answer is Opus at 32 kbps in OGG. Modern voice apps use this internally; the file is small enough to send through messaging apps and clean enough for voice-to-text services to transcribe accurately.

The radio broadcaster wants files that conform to broadcast loudness standards and integrate with playout systems. The right answer is BWF (Broadcast Wave Format) at 48 kHz / 24-bit, mastered to EBU R 128 (-23 LUFS integrated). The BWF metadata supports timecode and originator identification required by broadcast workflow.

The film and television sound designer wants the same WAV the studio engineer wants, plus support for multichannel layouts (5.1, 7.1, Dolby Atmos beds). The right answer is WAV with channel layout metadata, often delivered as broadcast wave files inside MXF wrappers for studio post-production pipelines.

The game audio designer wants files that the game engine can stream efficiently and that compress well for shipping inside the game executable. The right answers vary by engine: Unity prefers OGG Vorbis, Unreal Engine prefers OGG Vorbis or PCM WAV, and console-specific engines use proprietary formats but accept WAV at import.

"Engineers think the format is the question. The question is the room and the listener. The format is downstream of those." Bob Ludwig, mastering engineer

Format-by-Format Field Notes

A short tour of the formats users actually encounter.

WAV is the workhorse of audio production. Microsoft's RIFF container with PCM payload. Universal compatibility across platforms. Files larger than 4 GB require the RF64 extension because the 32-bit size field overflows. Recommended for capture, editing, and mastering.

AIFF is Apple's equivalent of WAV. Functionally similar; less common outside the Apple ecosystem.

BWF (Broadcast Wave Format) is WAV with extended professional metadata. Timecode, originator, description, and reference fields. Required by professional video and broadcast workflows. The European Broadcasting Union maintains the standard.

FLAC is the dominant lossless compressed format. Open source. Files about half the size of WAV at zero quality loss. Supported by every modern player. Recommended for music archives.

ALAC is Apple's lossless format. Slightly larger files than FLAC. Best on Apple platforms; works elsewhere since Apple open-sourced the codec in 2011. Choose between ALAC and FLAC based on the playback ecosystem.

MP3 is the universal lossy format. Patent-free since 2017. At 192 kbps and above, sounds nearly indistinguishable from CD quality on consumer equipment. The right choice for maximum compatibility.

AAC is the technical successor to MP3. Sounds noticeably better at the same bitrate. Used by Apple Music, YouTube, and most modern streaming services. Container is M4A.

Opus is the modern lossy codec. Outperforms AAC and MP3 across the bitrate range. Excellent for voice and music. Container is OGG or WebM. Recommended for new projects where compatibility allows.

Vorbis is the open-source predecessor to Opus. Still used in game audio engines and some legacy distribution. New projects should prefer Opus.

WavPack is a lossless compressed format with optional hybrid mode that produces a small lossy file plus a correction file that combines back to lossless. Niche but elegant.

Monkey's Audio (APE) is a lossless compressed format common in Eastern European music distribution. Similar compression to FLAC. Less universally supported.

DSD (Direct Stream Digital) is the format of SACD. 1-bit samples at 2.8 MHz or higher. Niche audiophile format. Most playback chains convert DSD to PCM internally.

MQA (Master Quality Authenticated) is a controversial lossy-with-marketing format pitched as approximating studio masters. Independent testing has questioned its claims. Avoid for new projects; the cleaner choice is FLAC at the actual master resolution.

Bitrate Sweet Spots

The right bitrate depends on the codec and the content. A useful set of reference points for typical listening.

CodecSpoken Word MonoMusic Stereo (Casual)Music Stereo (Critical)
MP396 kbps192 kbps320 kbps (V0)
AAC64 kbps128 kbps256 kbps
Opus32 kbps96 kbps160 kbps
Vorbis64 kbps160 kbps256 kbps
FLACn/an/aLossless (700-1100 kbps typical)
The "casual" column is what most listeners will hear on phone earbuds in everyday environments. The "critical" column is what audiophile listeners want for the home system and what engineers should deliver as the high-quality option.

Going below the casual column produces noticeable artifacts on attentive listening. Going above the critical column produces files that are larger without audible benefit on any common playback setup.

Sample Rate and Bit Depth Choices

For most consumer playback, 44.1 kHz / 16-bit is the right delivery rate. CD compatibility, no upsampling required by Bluetooth headphones, no audible information lost above the 22.05 kHz Nyquist limit.

For studio and mastering work, 48 kHz / 24-bit is the floor and 96 kHz / 24-bit is common. 192 kHz exists but rarely produces audible benefit; the principal advantage is processing headroom for pitch shifting, time stretching, and other operations that benefit from a higher sample rate intermediate.

DSD and SACD operate at non-PCM sample rates (2.8224 MHz, 5.6448 MHz). These are conversion-dense formats; most playback chains internally convert to PCM at 88.2 or 176.4 kHz before D/A conversion.

Sample RateUse Case
8-22.05 kHzTelephone, voice messaging, low-bandwidth voice
32 kHzFM broadcast, some video systems
44.1 kHzCD audio, consumer streaming
48 kHzVideo, broadcast, professional working rate
88.2 kHzHigh-resolution audio, mastering
96 kHzHigh-resolution audio, mastering
176.4 kHzAudiophile distribution, niche use
192 kHzAudiophile distribution, professional mastering
2.8224 MHzDSD64 (SACD)
5.6448 MHzDSD128

Compatibility Reality in 2026

The 2025 OS releases shifted the compatibility picture noticeably. Windows 11 added native FLAC, ALAC, and Opus support in Media Player. macOS has supported all major formats natively for years. iOS and Android handle FLAC, ALAC, MP3, AAC, Opus, and Vorbis without third-party software. Hardware media players and car audio systems lag the OS releases; many car head units released in 2020 or earlier still struggle with FLAC and ignore Opus entirely.

For files that need to play on unknown hardware, MP3 remains the safest choice. For files that need to play on any modern phone or computer released in the last five years, AAC and FLAC are also safe. For files intended for software-defined playback inside an app, Opus is the technical winner.

The expert-written certification preparation content at Pass4Sure on credential portability has the same character as audio format compatibility: the format that opens the most doors is the one accepted by the broadest set of systems, even when better technical options exist. The note-keeping pipelines explored at When Notes Fly emphasize the same principle for personal information capture: standardize on formats that any tool can read.

"Compatibility is not a technical property. It is a social agreement about what counts as a normal file. The agreement changes slowly even when the technology does not." Tim Bray, on the standardization of file formats

A Decision Tree That Works

A practical decision flow for picking an audio format.

If you are recording original audio, use WAV at 48 kHz / 24-bit. If your tool supports BWF, use BWF for the metadata.

If you are archiving music for a personal library, use FLAC. Convert from FLAC to lossy on demand for portable devices.

If you are distributing audio publicly, pick by audience. For maximum compatibility use MP3 at 192-320 kbps. For modern apps and bandwidth-sensitive delivery use Opus at 96-128 kbps. For Apple ecosystem distribution use AAC at 256 kbps in M4A.

If you are publishing a podcast, use MP3 at 96 kbps mono with optional Opus alternative.

If you are recording voice memos for personal use or transcription, use Opus at 32 kbps in OGG.

If you are working in professional video or broadcast, use BWF at the project's working sample rate (48 kHz typical) and 24-bit.

If you are receiving audio from a source you do not control, accept whatever they send and convert to your working format on import.

Common Misconceptions

A short list of misconceptions that show up regularly in format discussions.

"Higher bitrate is always better." True up to a threshold, false beyond it. MP3 above 320 kbps and AAC above 256 kbps produce no audible improvement on consumer playback; the file is just larger.

"24-bit / 96 kHz sounds better than 16-bit / 44.1 kHz." For finished consumer playback, no audible difference exists when the 16-bit master was made with proper dither. The 24/96 advantage appears only in mastering, where headroom for processing matters.

"FLAC sounds better than WAV." FLAC and WAV both deliver bit-exact PCM samples. The decoded audio is identical. FLAC just stores the same data in less space.

"MP3 quality has improved since the 2000s." The MP3 format specification has not changed; encoder implementations have. Modern LAME at V0 sounds noticeably better than turn-of-the-century encoders at the same nominal bitrate, because LAME's psychoacoustic model has been refined extensively.

"Opus is just for voice." Opus is a general-purpose codec that handles music as well as voice. The --application audio flag tunes it for music; the --application voip flag tunes it for speech. Both modes outperform legacy codecs at the same bitrate.

"Lossy compression damages audio permanently." It removes information that, at sufficient bitrates, is psychoacoustically inaudible. The file is "damaged" in the sense that bit-exact reconstruction is impossible; whether the listener can hear the damage is a separate question with a content-dependent answer.

For related guidance, see understanding mp3 vs flac which audio format to choose and audio formats explained choose right format project.

References

  1. International Telecommunication Union. ITU-R BS.1770-5 Algorithms to measure audio programme loudness. https://www.itu.int/rec/R-REC-BS.1770
  1. European Broadcasting Union. EBU R 128 Loudness normalisation and permitted maximum level of audio signals. https://tech.ebu.ch/publications/r128
  1. Xiph.Org Foundation. FLAC Format Specification. https://xiph.org/flac/format.html
  1. Internet Engineering Task Force. Definition of the Opus Audio Codec. RFC 6716. https://www.rfc-editor.org/rfc/rfc6716
  1. ISO/IEC 14496-3:2019 Information technology, Coding of audio-visual objects, Part 3: Audio. https://www.iso.org/standard/76383.html
  1. ISO/IEC 11172-3:1993 MPEG-1 Audio. https://www.iso.org/standard/22411.html
  1. Microsoft. WAVEFORMATEXTENSIBLE structure. https://learn.microsoft.com/en-us/windows-hardware/drivers/audio/extensible-wave-format-descriptors
  1. EBU Tech 3285. Specification of the Broadcast Wave Format (BWF). https://tech.ebu.ch/docs/tech/tech3285.pdf