Upload your audio files: interviews, depositions, earnings calls, clinical recordings. Get clean, punctuated transcripts with speaker detection. Process one file or an entire archive. Supports MP3, WAV, M4A, FLAC, OGG, AAC, and WMA.
No credit card required · Try it free
Good afternoon, everyone. Welcome to our fourth quarter earnings call.
— Revenue grew 18% year-over-year, driven primarily by enterprise expansion in EMEA and APAC.
— Could you elaborate on the margin improvement? What’s driving the 340 basis point increase?
Drop your audio files — one or a hundred. Fluen auto-detects the language and starts transcribing. Supports MP3, WAV, M4A, FLAC, OGG, AAC, and WMA. Batch processing runs in the background.
Your transcript appears in our built-in editor with full audio playback sync. Punctuation, speaker changes, and paragraph breaks are already in place. Apply your custom glossary for domain-specific terms.
Download as plain text, SRT, or WebVTT. Or connect via REST API to automate your transcription pipeline — ideal for recurring content libraries.
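For readers who post-process exports: SRT and WebVTT cues differ mainly in the millisecond separator (a comma in SRT, a period in WebVTT). The sketch below formats a time offset either way. `cue_timestamp` is a hypothetical helper written for illustration, not part of Fluen's export or API.

```python
def cue_timestamp(seconds: float, fmt: str = "srt") -> str:
    """Format a time offset as HH:MM:SS,mmm (SRT) or HH:MM:SS.mmm (WebVTT).

    Hypothetical helper for illustration only; any fmt other than "srt"
    is treated as WebVTT.
    """
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    sep = "," if fmt == "srt" else "."
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{sep}{ms:03d}"
```

A cue line joins two such timestamps with ` --> `, and a WebVTT file additionally begins with a `WEBVTT` header line.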
Fluen’s Custom Glossary ensures domain-specific terms appear exactly as they should. No more correcting “hippo” back to “HIPAA” or “a mortization” back to “amortization.”
The deponent testified that the force majeure clause was invoked following the breach of fiduciary duty allegation. Counsel moved for a motion in limine to exclude the hearsay evidence obtained during the voir dire proceedings.
— Were the interrogatories served within the discovery deadline?
Yes, all interrogatories and requests for admission were filed pursuant to Rule 36 of the Federal Rules of Civil Procedure.
Patient presents with bilateral pulmonary embolism confirmed via CT pulmonary angiography. Initial troponin I was elevated at 0.42 ng/mL. We initiated the heparin protocol and ordered serial D-dimer measurements.
— Is there concern for right ventricular dysfunction?
Echo showed mild RV dilation with preserved TAPSE. We’ll reassess with repeat echocardiography at 48 hours per the ESC guidelines.
Looking at the EBITDA margin expansion, we achieved 340 basis points of improvement year-over-year. The free cash flow conversion was 92%, well above our guidance range of 85 to 90 percent.
— Can you speak to the amortization of the goodwill from the Q2 acquisition?
The purchase price allocation is still being finalized, but preliminary intangible asset amortization is running at approximately $12 million per quarter on a straight-line basis.
We migrated the inference pipeline to Kubernetes with autoscaling based on GPU utilization thresholds. The latency P99 dropped from 1,200 milliseconds to 340 milliseconds after switching to gRPC over REST.
— What about the model quantization impact on accuracy?
INT8 quantization showed less than 0.3% WER degradation compared to the FP16 baseline, while reducing VRAM usage by 40 percent.
| | Manual Service | Basic AI Tool | Free Converter | Fluen AI |
|---|---|---|---|---|
| Accuracy | ~99% (human) | 85–92% | 75–85% | Up to 99.2% |
| Turnaround | 24–72 hours | Minutes | Minutes | Minutes (production-ready) |
| Jargon Handling | Specialist needed | No glossary | No glossary | Custom glossary per project |
| Speaker Detection | Manual tagging | Basic | None | Automatic speaker changes |
| Batch Processing | Per-file pricing | Limited | One file at a time | Unlimited queue |
| Editing Tools | Email revisions | Basic editor | Copy-paste | Built-in editor with playback sync |
| API Access | None | Varies | None | Full REST API |
| Cost | $1–3/min | $0.10–0.30/min | Free (limited) | From $0.19/min |
We route your audio to the best speech engine for your content type, language, and recording quality. No single-engine lock-in.
Detects when a new person starts speaking in interviews, depositions, and multi-person recordings. A dash marks each speaker turn.
Upload and queue entire audio archives. Fluen processes your files in the background — ideal for thousands of hours of recordings.
Define how legal terms, medical jargon, brand names, and technical vocabulary should appear in your transcripts.
Review your transcript synced to the audio. Edit text inline, navigate by timestamp, and fix any rare errors before export.
Automate your transcription pipeline. Submit files, poll for status, and retrieve results programmatically. Full documentation included.
Interviews, field recordings, press briefings, podcasts
Depositions, hearings, compliance calls, witness statements
Focus groups, patient interviews, clinical dictation, academic lectures
Earnings calls, board meetings, training recordings, podcast episodes
Fluen handles video transcription with the same accuracy, speaker detection, and batch processing. Same workflow, same quality.
Explore Video Transcription
Translate your content into 50+ languages with context-aware AI. Natural, properly segmented results ready for global distribution.
Explore Translation
Fluen achieves up to 99.2% accuracy by routing your audio to the best speech-to-text engine for your content type, language, and recording quality. Our multi-engine approach means you’re not locked into a single AI. Transcripts include proper punctuation, capitalization, speaker detection, and clean paragraph breaks.
Fluen supports all major audio formats including MP3, WAV, M4A, FLAC, OGG, AAC, and WMA. You can also upload video files (MP4, MOV, MKV, AVI) and Fluen will extract and transcribe the audio track automatically.
Yes. Fluen supports batch processing — upload as many files as you need and they’ll be queued and transcribed in the background. You’ll be notified as each transcript is ready for review. This is ideal for teams processing interview libraries, meeting archives, or podcast back-catalogs.
The Custom Glossary lets you define how domain-specific terms should appear in your transcripts. Add legal terminology, medical jargon, brand names, financial acronyms, or any specialized vocabulary. Fluen will recognize and render these terms correctly, eliminating the most common source of transcription errors in professional content.
Yes. Fluen detects when a new person starts speaking and marks the change with a dash at the beginning of the line. This makes it easy to follow multi-person conversations in depositions, interviews, meetings, and panel discussions.
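If you process exported plain text downstream, the dash convention makes turns easy to separate. The following is a minimal sketch, assuming that a line starting with a dash opens a new turn and that any other non-empty line continues the current one. `split_turns` is a hypothetical helper, not a Fluen function, and real exports may need different handling.

```python
DASHES = ("\u2014", "\u2013", "-")  # em dash, en dash, hyphen


def split_turns(transcript: str) -> list[str]:
    """Split a transcript into speaker turns.

    Assumption: a line beginning with a dash opens a new turn; any other
    non-empty line continues the current turn (or opens the first one).
    """
    turns: list[str] = []
    for raw in transcript.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(DASHES):
            turns.append(line[1:].strip())  # drop the dash marker
        elif turns:
            turns[-1] = f"{turns[-1]} {line}"
        else:
            turns.append(line)
    return turns
```

Applied to the earnings-call sample on this page, this would yield three turns: the greeting, the revenue statement, and the margin question.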
Yes. Fluen provides a full REST API that lets you submit audio files, poll for processing status, and retrieve completed transcripts programmatically. This is ideal for teams with recurring content pipelines — podcast networks, legal departments, corporate communications teams, and any workflow that benefits from automation. View API docs →
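The submit, poll, retrieve pattern described above might look roughly like this on the client side. This is a sketch of the polling step only, under stated assumptions: the status strings `"completed"` and `"failed"` are placeholders, and `get_status` stands in for whatever call your code makes to the status endpoint. Neither is taken from Fluen's documentation, so check the API docs for the real endpoint names and values.

```python
import time
from typing import Callable


def wait_for_transcript(
    get_status: Callable[[], str],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_status() until it reports a terminal state.

    Assumption: "completed" and "failed" are hypothetical status values;
    substitute the ones the real API returns.
    """
    waited = 0.0
    while True:
        status = get_status()
        if status in ("completed", "failed"):
            return status
        if waited >= timeout_s:
            raise TimeoutError(f"still '{status}' after {waited:.0f}s")
        sleep(interval_s)  # wait before asking again
        waited += interval_s
```

Passing the status check in as a callable keeps the loop independent of any HTTP library, and the injectable `sleep` lets you test it without real delays.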
Most files are processed in minutes. A 60-minute audio recording typically takes 3–5 minutes to transcribe. Processing time depends on file length, recording quality, and the AI engine selected. Batch files are processed in parallel, so a large queue finishes sooner than it would if files were transcribed one at a time.
Yes. You can sign up and process your first 3 files completely free — no credit card required. This gives you access to the AI transcription engines, custom glossary, and the built-in editor so you can evaluate the quality before committing to a plan. See pricing →
Upload your first file free. No credit card, no commitment, no waiting.
Or compare plans