Related tools
Why use speech to text?
Dictate notes, draft messages hands-free, or capture rough transcripts without installing separate dictation software.
Benefits
- Hands-free: speak instead of typing.
- Speed: many people talk faster than they type.
- Accessibility: alternative input path.
- Lightweight: runs from a normal web page.
- Export: TXT or JSON for reuse.
How it works
Your microphone feeds the browser’s speech recognizer; words stream into the transcript area.
Recognition flow
- Audio: captured while listening.
- Interim: tentative text shown in gray italics.
- Final: committed text appended to the main transcript.
- Language: BCP-47 tag on the recognition instance.
- Stop: ends the recognition session.
Typical workflow
- Language: select first.
- Start: begin speaking after the prompt.
- Review: watch interim vs final.
- Stop: when finished.
- Copy: or export.
Facts
Quality varies by device; Chromium-class browsers are the most reliable for this API.
Key points
- Continuous + interim modes are both enabled in code.
- Changing language while listening stops the session first.
- Word count uses simple whitespace splitting.
- JSON export uses English key names for compatibility.
- Safari/Firefox support may differ from Chrome/Edge.
Best practices
Reduce errors before relying on the transcript.
Tips
- Use a quiet room and decent microphone.
- Match the dropdown to how you actually speak.
- Proofread names, numbers, and technical terms.
- Restart if recognition stops responding.
Ideal use cases
- Notes: quick capture.
- Drafts: first pass dictation.
- Accessibility: when typing is hard.
When not to rely on it
- Certified legal or medical transcription without review.
- Guaranteed offline-only if your browser uses cloud recognition.
Limitations
Browser-dependent API; microphone required; not all languages work equally everywhere.
Speech recognition runs in your browser with microphone permission and no upload to our servers; transcripts update live as you speak.
Frequently asked questions
How does this work?
The page uses the browser SpeechRecognition API to turn speech into text. Behavior depends on Chrome/Edge vs other browsers.
Microphone permission?
Required while listening. The page does not upload audio to our servers; recognition may still be processed by the browser vendor per your browser settings.
How accurate is it?
Depends on accent, noise, mic quality, and language match. Review important transcripts manually.
Noisy room?
Background noise hurts accuracy; a quiet space and headset mic help.
Which languages?
The dropdown lists common BCP-47 tags; actual support still depends on your browser and OS.
Long sessions?
Continuous mode is on; stop and restart if the engine stalls.
Is audio stored?
We do not store recordings on our site. Check your browser documentation for whether recognition is on-device or cloud-backed.