Conference call vs in-person meeting — what differs
Conference calls add three transcription challenges over in-person meetings. Audio quality variance: participants on cell vs broadband vs hotel WiFi, with vastly different microphone quality, all in one mix. Speaker overlap on lag: network delays cause people to talk over each other accidentally; the transcript needs to handle that gracefully. Many participants: 5–15 people on a call is normal, vs the typical 2–4 in an in-person meeting. The transcription pipeline handles all three with positional speaker labels and is robust to typical conference-call audio quality.
What you get back
The Markdown output structures a conference call into sections at topic shifts (the agenda items as they're actually discussed), positional speaker labels at every turn (rename to actual names with a one-time find-replace), timestamps at major junctures, and an action items section at the end summarising decisions and assignments where the conversation made them explicit. For a 60-minute call with 8 participants, the output is typically 30–50 KB of structured Markdown that's scannable in 5 minutes — vs the call itself which would take 60 minutes to re-listen and 4+ hours to manually transcribe.
Recording practicalities
Most conference call platforms (Zoom, Teams, Google Meet, Webex, RingCentral) have built-in recording — the host clicks Record, the file is saved locally or to the platform's cloud after the call. For phone-bridge conference calls without built-in recording, a quality recorder app on a phone placed near the speakerphone works well; the audio quality is lower than direct platform recording, but transcription handles it. Either way, the resulting MP3/M4A/WebM is the input MDisBetter takes.