macOS Speech to Text: What It Does, What It Does Not, and When to Use Something Better
macOS has had built-in dictation since Mountain Lion. It works for basic tasks. Most Mac users have no idea it exists, or they tried it once and moved on. This page is for people who want to understand what macOS speech to text actually does, where it falls short, and how to decide whether the built-in tool is sufficient or whether a dedicated local app makes more sense for their workflow.
This is not a sales page. The honest answer is that Apple's built-in dictation is fine for casual use. For professionals with accuracy requirements, file transcription needs, or serious privacy concerns, it is not.
TL;DR
- macOS Dictation is free, works in any text field, and has had an on-device mode since macOS Ventura. It's good for casual dictation of non-sensitive content.
- It cannot import audio files, has no transcription history, no professional vocabulary, and no export tools. It is live dictation only.
- Default mode sends audio to Apple's servers. Enhanced/on-device mode stays local but is not a compliance instrument for regulated professionals.
- When accuracy, file transcription, privacy compliance, or professional vocabulary matter, a dedicated on-device app like VoicePrivate is the appropriate next step.
How macOS Speech to Text Works
macOS Dictation is built into the operating system and requires no third-party software. Here is how to enable and use it:
- Go to System Settings > Keyboard > Dictation and toggle it on
- Activate dictation with a double-press of the Fn key (or Globe key on newer keyboards) or the microphone icon in the menu bar
- Dictation works inline in any text field across any app system-wide, including Mail, Messages, Pages, Word, and browser text inputs
- It supports basic punctuation commands ("period," "comma," "new paragraph") and simple emoji insertion
There are two processing modes. The default mode routes your audio to Apple's servers for processing and returns text to your device. Enhanced Dictation, available since macOS Ventura (13.0), processes audio entirely on-device using Apple's local speech recognition model. On Apple Silicon Macs, the Enhanced mode is the default for supported languages.
Both modes produce text that appears inline where your cursor is. There is no separate transcription window, no history, no output file.
What macOS Dictation Does Well
Apple's built-in dictation is genuinely good at a few things:
- Free and always available. Every Mac running macOS Ventura or later has on-device dictation. No download, no account, no cost.
- System-wide integration. It works in every text field on the system. You do not need to open a separate app or configure anything per-application.
- No internet required (on Apple Silicon, macOS Ventura+). Once Enhanced Dictation is enabled, processing stays local. You can dictate offline without issue.
- Low friction for simple use cases. If you want to narrate a quick email, a note, or a message, macOS Dictation gets the job done without switching to another tool.
- Solid general accuracy for clear speech. For everyday vocabulary spoken at a normal pace, the recognition quality is adequate for most casual use.
What macOS Dictation Cannot Do
This is where the built-in tool runs out of road. These are not minor limitations.
- No file import. macOS Dictation only processes live microphone input. You cannot drop in a recorded meeting, an interview, a field recording, or any audio file. If you need to transcribe existing audio, the built-in tool does not apply.
- No transcription history. There is no log of what you have dictated. Nothing is saved, searchable, or retrievable after the fact. Each session is ephemeral.
- No professional vocabulary. Apple's model is a general-purpose recognizer. Medical terminology, legal citations, financial instruments, and specialized technical jargon all suffer. The model has no domain tuning and no way to add custom terms.
- No model or accuracy controls. You cannot adjust for speed versus accuracy, switch between model sizes, or tune behavior for your microphone setup or speaking style.
- No export. Text lands in the active text field. There is no way to export a transcript to a file, a structured format, or anywhere else. Copy and paste is your only option.
- Live input only. There is no batch processing, no queue, no ability to handle multiple files. It is strictly real-time dictation into the active cursor position.
If your use case involves any of those capabilities, you are looking for something other than what macOS Dictation provides.
Privacy: What Apple Documents vs. What Matters for Professionals
The privacy story depends entirely on which mode you are running.
Default mode (cloud-assisted): Audio is routed to Apple's servers for processing. Apple's privacy policy governs what happens to it. This mode should not be used for sensitive conversations, privileged client communications, protected health information, or any content subject to confidentiality obligations.
Enhanced / on-device mode (macOS Ventura+ on Apple Silicon): Audio is processed locally. It does not leave your device during transcription. Apple's improvement opt-in (sharing audio samples with Apple to improve Siri and dictation) is on by default and should be reviewed separately if on-device privacy is a concern.
Here is the practical limit of the on-device mode for regulated professionals: Apple's privacy policy is not a compliance instrument. Attorneys handling privileged client matters, physicians working with protected health information, and financial advisors subject to information barrier requirements need something more than a consumer operating system feature with a favorable privacy posture. The architecture may be sound, but no independent audit, no signed compliance agreement, and no organizational control mechanism exists around macOS Dictation as a professional tool.
For a deeper look at what on-device privacy actually means for Mac users, see the Mac voice-to-text privacy guide and our coverage of Apple Dictation privacy risks.
When macOS Dictation Is Good Enough
There is no point recommending a paid tool when the free one does the job. macOS Dictation is the right choice when:
- You are dictating personal notes, casual memos, or low-stakes writing
- The content is not sensitive and does not involve privileged or confidential information
- You only need live dictation, not file transcription or transcription history
- Occasional recognition errors are acceptable and easy to manually correct
- General vocabulary covers everything you are dictating
For a lot of everyday Mac users, that is most of their dictation. Use the built-in tool. It is free and it works.
When You Need a Dedicated Mac Speech-to-Text App
The calculus changes when your requirements move beyond casual dictation.
- You work in a regulated environment. Healthcare providers handling PHI, attorneys with attorney-client privilege obligations, financial advisors with information barrier requirements, and insurance professionals working with non-public policyholder data all need more than a consumer OS feature. Architecture matters; consumer tools are not designed for professional compliance contexts.
- You need to transcribe recorded audio. Meetings, interviews, field recordings, depositions, patient intakes, earnings calls: none of these can be processed by macOS Dictation. You need file import, and macOS Dictation does not have it.
- Accuracy on specialized vocabulary matters. If you regularly dictate medical terminology, legal language, financial instruments, or technical jargon, a general-purpose model will produce recognition errors that slow you down. Domain-specific vocabulary tuning is the solution.
- You need a record of what was transcribed. If searchable transcription history is part of your workflow, you need a dedicated tool. macOS Dictation keeps nothing.
- You want structured export. Plain text to clipboard is not a workflow. SRT for captioning, JSON for downstream processing, Markdown for structured notes: these require a dedicated transcription tool.
macOS Dictation vs. VoicePrivate: Feature Comparison
| Feature | macOS Dictation | VoicePrivate |
|---|---|---|
| Price | Free (built-in) | Free tier available; $9.99/mo or $84/yr |
| Audio file import | No | Yes |
| Live dictation | Yes | Yes |
| On-device processing | Partial (Enhanced mode, macOS Ventura+ on Apple Silicon) | Always, 100% local |
| Professional vocabulary | No | Yes, edition-specific (Healthcare, Legal, Finance, Insurance) |
| Custom vocabulary | No | Yes |
| Transcription history | No | Yes, encrypted local database |
| Export formats | None (clipboard only) | TXT, JSON, Markdown, SRT, WebVTT |
| Speaker diarization | No | Yes (paid plans) |
| Cloud dependency | Default mode requires internet | None |
| Account required | No (Apple ID not required for dictation) | No |
| Compliance posture | Consumer OS feature | Architecture-based local guarantee, no data transmitted |
For a detailed side-by-side analysis, see the full comparison: VoicePrivate vs. macOS Dictation.
Frequently Asked Questions
Is macOS speech to text accurate enough for professional use?
For general vocabulary and casual content, yes. For domain-specific terminology in healthcare, law, finance, or insurance, general-purpose models produce more errors on technical terms than most professionals can tolerate. Dedicated apps with domain vocabulary tuning close that gap.
Does macOS Dictation send my audio to Apple?
In the default configuration, yes. Enhanced Dictation (on-device mode on Apple Silicon Macs running macOS Ventura or later) processes audio locally. Check your System Settings > Privacy & Security > Analytics & Improvements to review whether you are sharing audio samples with Apple for model improvement.
Can I use macOS speech to text to transcribe a recorded meeting?
No. macOS Dictation only accepts live microphone input. You cannot import an audio or video file for transcription. For that, you need a dedicated transcription app like VoicePrivate.
Is macOS Dictation HIPAA compliant?
Apple does not sign Business Associate Agreements for consumer OS features including macOS Dictation. It is not an appropriate tool for processing Protected Health Information in a covered entity context. Consult your compliance team for specific guidance.
How is VoicePrivate different from macOS Dictation?
VoicePrivate adds file import, transcription history, export formats (SRT, JSON, Markdown, WebVTT), professional vocabulary for Healthcare, Legal, Finance, and Insurance, speaker diarization, and guaranteed on-device processing with no cloud dependency on any plan. It is built for professional workflows, not casual dictation. The free tier covers core transcription with no account required.
Does VoicePrivate work offline?
After a one-time model download on first launch, VoicePrivate requires no internet connection. All transcription runs locally on your Mac. This works on a plane, in a facility with a locked-down network, or anywhere without internet access.
Getting Started with VoicePrivate
VoicePrivate runs on macOS 12 and later, with Apple Silicon optimization for M-series chips. The free tier gives you core transcription features with no account required and no credit card needed. Download and test on-device processing before deciding whether a paid plan makes sense for your workflow.
Paid plans unlock speaker diarization, longer file support, additional export formats, and access to specialty editions with domain-specific vocabulary for Healthcare, Legal, Finance, and Insurance.
Related Resources
- VoicePrivate vs. macOS Dictation: Full Comparison
- Offline Speech-to-Text for Mac
- Mac Voice-to-Text Privacy Guide
- Voice to Text for Mac
- Apple Dictation Privacy Risks
Key Takeaways
- macOS Dictation is free, on-device (Enhanced mode, Apple Silicon), and adequate for casual dictation. Use it if it fits your needs.
- It does not handle file import, transcription history, professional vocabulary, export formats, or speaker diarization. If those matter, it is the wrong tool.
- Default mode sends audio to Apple's servers. Enhanced mode stays local but is a consumer feature, not a compliance instrument.
- VoicePrivate fills the gap for Mac users who need professional-grade transcription, guaranteed on-device processing, or domain-specific accuracy without cloud dependency.
- The free tier covers core features, no account required. Paid plans add diarization, longer files, structured export, and specialty editions.