Why Opus is everywhere for voice messages
Opus was designed for two things: efficient voice compression at low bitrates (down to 6 kbps for usable speech) and low encoding/decoding latency for real-time use. WhatsApp, Telegram, and Discord all picked it because it does both better than any alternative — a 1-minute voice note is typically 30–80 KB at WhatsApp's default bitrate, vs ~500 KB as MP3 or ~5 MB as WAV. That size difference is what makes voice messages work over patchy mobile data; the same property is why Opus voice messages have largely replaced MP3 in messaging apps.
Practical workflows for voice-message transcription
Read instead of listen: the most obvious case. Save the WhatsApp voice note (long-press → Save), drop into MDisBetter, read the transcript in 30 seconds vs listening to a 2-minute monologue. Search across voice messages: if you save voice notes from a key contact (sales calls, client briefings, reporter source recordings), batching them through transcription gives you a searchable text archive of conversations that would otherwise be opaque audio. Quote extraction: for journalists or researchers, voice-message-as-source becomes citable text.
Note on Opus vs OGG
The file extension can vary — Opus audio commonly ships as .opus (bare Opus stream), .ogg (Opus inside an OGG container, common for WhatsApp/Telegram exports), or even .oga. MDisBetter accepts all three; the transcription pipeline detects the actual codec from the file header rather than the extension. See also OGG to Markdown for the broader OGG container.