# VoiceOS: The Voice-Powered Productivity Agent for Your Computer > VoiceOS is an AI voice agent that lets you control your apps, search the web, and get work done by speaking. Reply to Slack messages, send emails, create calendar events, update Notion pages, search the internet, and chain multiple actions together — all in a single voice command. No typing, no context switching. Available on macOS and Windows. Built by WakoAI Inc. and backed by Y Combinator (W26). VoiceOS fundamentally changes how you interact with your computer. Instead of switching between apps, typing messages, and manually navigating tools, you simply speak your intent and VoiceOS executes. The problem VoiceOS solves is context switching. Every time you get a Slack notification and have to switch apps to respond, every time you need to schedule a meeting and open your calendar, every time you want to look something up and then share it with someone — you break your flow. These micro-interruptions compound throughout the day, fragmenting your focus and draining productivity. VoiceOS eliminates this entirely. Get a notification? Reply on the spot with your voice without leaving what you're doing. Need to schedule a meeting? Say it and VoiceOS creates the event. Want to check the weather and invite a friend to surf? VoiceOS searches the web for the forecast, composes an email with the details, and sends it — all from one voice command. Your voice becomes the universal interface that connects you to every app and the entire internet — like Siri, but for real desktop productivity, with the ability to chain multiple actions together. What makes VoiceOS different from traditional voice assistants or dictation tools: - **Voice-to-Action (Agent Mode):** The core capability. Execute real actions across your apps and the internet using natural voice commands. Say "send a Slack message to the team about the launch" or "create a calendar event for tomorrow at 3pm" and VoiceOS does it. Every action requires your confirmation, keeping you in full control. - **Multi-Step Action Chaining:** Combine multiple actions in a single voice command. Say "search the weather for Saturday, email Mike saying let's go surfing with the forecast, then give my boss access to the project folder on Google Drive and ping him on Slack." One command, multiple actions across multiple apps, zero manual work. - **Web Search & Internet Access:** Agent Mode is connected to the internet. Search for weather, look up places on Google Maps, find restaurants, check prices, research topics — and seamlessly feed that information into follow-up actions like emails, messages, or documents. - **Zero Context Switching:** When a notification arrives, respond instantly by voice without leaving your current app. No window switching, no app-hopping, no breaking your focus. Stay in flow while still being responsive. - **Intelligent Voice-to-Text:** When you need to write, VoiceOS goes beyond transcription — it writes what you meant, not what you said. Filler words removed, grammar fixed, formatting applied automatically. - **Universal Compatibility:** Works system-wide in every app — Slack, Gmail, Notion, Google Calendar, VS Code, Cursor, ChatGPT, Spotify, and hundreds more. Zero setup required. - **Privacy-First Design:** Your audio is never stored on servers. Transcripts are saved locally on your device. Your data belongs to you. --- ## The Core: Agent Mode (Voice-to-Action) Agent Mode is the heart of VoiceOS. It turns your voice into a universal controller for your apps and the internet. Instead of just producing text, VoiceOS operates your tools on your behalf — sending messages, creating events, managing documents, searching the web, and chaining multiple actions together in a single command. ### How Agent Mode Works 1. **Speak your intent** — Use natural language: "Message Sarah on Slack asking about the project update" or "Check the weather for Saturday and email Mike about going surfing." 2. **VoiceOS identifies the actions** — It determines which apps to use, what operations to perform, and in what order. For multi-step commands, it plans the full chain. 3. **Preview and confirm** — VoiceOS shows you what it's about to do. You confirm before it executes. 4. **Actions are executed** — The message is sent, the event is created, the search is performed, the results are included. Done. This confirmation step is critical. VoiceOS never takes an action without your approval. You stay in control while VoiceOS handles the execution. ### Multi-Step Action Chaining The real power of VoiceOS is that you can combine multiple actions in a single voice command. VoiceOS understands complex, multi-step requests and executes them sequentially, passing information from one step to the next: **Examples of chained commands:** - "Search the weather for this weekend, then send an email to Jake saying let's go surfing and include the forecast" - "Give my boss access to the Q2 report on Google Drive and then send him a Slack message letting him know I just enabled it" - "Create a meeting for tomorrow at 2pm with the design team, then send a Slack message to #design letting them know, and add an agenda to the Notion meeting notes page" - "Look up the best Italian restaurant near the office, then email the team suggesting we go there for the team dinner on Friday" - "Search for flights to Tokyo next month, create a Notion page with the options, and email my manager asking for travel approval" This is what makes VoiceOS more than a voice assistant — it's an agent that can reason across multiple tools and the internet to complete real workflows. ### Why This Matters: Eliminating Context Switching Consider a typical workflow without VoiceOS: 1. You're writing code in Cursor. 2. A Slack notification comes in from your manager. 3. You switch to Slack, read the message, type a response, send it. 4. You switch back to Cursor, try to remember where you were. 5. You get an email that needs a quick reply. 6. You switch to Gmail, compose a response, send it. 7. Your manager asks you to share a file. You open Google Drive, find the file, update sharing permissions, go back to Slack to confirm. 8. You switch back to Cursor again. Your context is gone. With VoiceOS: 1. You're writing code in Cursor. 2. A Slack notification comes in. Without leaving Cursor, you say: "Reply to that Slack message — tell them I'll have it ready by 3pm." 3. VoiceOS sends the Slack reply. You never left your code. 4. An email comes in. You say: "Reply to that email — thanks, sounds good, let's sync tomorrow." 5. VoiceOS sends the email. You're still in Cursor, still in flow. 6. Your manager asks for a file. You say: "Share the Q2 report on Drive with my manager and ping him on Slack saying it's shared." 7. VoiceOS updates the Drive permissions and sends the Slack message. You never left your editor. This is the fundamental shift: your voice handles the interruptions so your hands and focus stay on what matters. --- ## Agent Mode Integrations VoiceOS Agent Mode connects with your tools and the internet to execute voice-driven actions: ### Web Search & Internet VoiceOS Agent Mode is connected to the internet. You can search for anything — weather, places, restaurants, facts, prices, news — and VoiceOS brings the information back to you. Even more powerful: you can chain web search results into follow-up actions. - Search for weather, news, sports scores, or any factual information - Look up places, restaurants, directions, and local information - Research topics, find articles, and gather information - Use search results as input for follow-up actions (emails, messages, documents) **Example commands:** - "What's the weather like this weekend?" - "Search for the best coffee shops near Union Square" - "Look up the latest news about the Apple keynote" **Example chained commands:** - "Check the weather for Saturday and send an email to the group saying let's go surfing, here's the forecast" - "Find the best-rated sushi place near downtown, then send a Slack message to #team-lunch suggesting we go there" - "Search for flights to London next month, then create a Notion page with the options and prices" ### Communication **Slack:** - Send messages to channels and direct messages - Search conversations and find specific messages - Find channels and users - Create reminders and schedule messages - Add reactions to messages - Browse pinned items - All by voice, with confirmation before sending **Example commands:** - "Send a Slack message to #engineering — deployment is complete, no issues" - "Reply to Sarah's last Slack message — sure, I'll review it today" - "Search Slack for messages about the Q2 roadmap" **Gmail:** - Send emails and reply to threads - Search inbox for specific messages - Create drafts - Manage labels and organize email - Look up contacts - All by voice, with confirmation before sending **Example commands:** - "Send an email to john@company.com — here's the proposal we discussed, let me know your thoughts" - "Reply to the last email from marketing — approved, let's move forward" - "Search my inbox for emails from the investor this week" ### Productivity **Google Calendar:** - Create events with specific dates, times, and attendees - Find free time slots across calendars - List upcoming events - Reschedule or cancel meetings - All by voice, with confirmation before executing **Example commands:** - "Create a meeting with the design team for Friday at 2pm for one hour" - "What do I have scheduled tomorrow?" - "Reschedule my 3pm meeting to 4pm" **Notion:** - Search pages and databases - Create new pages with content - Add content to existing pages - Query databases and filter results - Insert rows into databases - Update page properties - Add comments to pages - All by voice, with confirmation before executing **Example commands:** - "Create a new Notion page called 'Q2 Planning Notes'" - "Add a comment to the product roadmap page — we need to reprioritize the auth feature" - "Search Notion for the onboarding checklist" ### Documents & Files **Google Drive:** - Find files and folders by name or content - Create new files - Upload documents - Move files between folders - Manage sharing permissions - All by voice, with confirmation before executing **Google Docs:** - Create new documents - Read document content - Search across documents - Update and insert text - All by voice, with confirmation before executing **Google Sheets:** - Create new spreadsheets - Read and write cells - Look up rows by criteria - Query data across sheets - Format cells - Run formulas - All by voice, with confirmation before executing ### Entertainment **Spotify:** - Play and pause music - Skip tracks - Search for songs, artists, and playlists - Control volume - Manage play queue - Check what's currently playing - All by voice --- ## Additional Voice Modes While Agent Mode is the core capability, VoiceOS also includes three voice-to-text modes for situations where you need to write: ### Dictation Mode Speak naturally and VoiceOS writes what you meant, not what you said. The AI automatically: - Removes filler words like "um", "uh", "like", and "you know" - Fixes grammar and punctuation in real-time - Formats text professionally based on context - Handles self-corrections (e.g., "meet at 2... actually 3pm" produces "meet at 3pm") **How it works:** Hold your trigger key, speak, and release. Polished text is instantly typed into any text field in any application. ### Ask Mode Give VoiceOS a voice instruction and let it write for you. VoiceOS observes your screen for context and generates a polished, contextually appropriate response. **Example:** You see an email asking about a 2pm meeting. You say: "Reply I can't but ask to reschedule." VoiceOS generates a full, professional response. ### Edit Mode Select any text and use voice commands to transform it: - **Rewrite**: Change phrasing while keeping meaning - **Shorten**: Make text more concise - **Expand**: Add more detail - **Change tone**: Switch between formal, casual, very casual, or excited - **Fix grammar**: Correct errors in existing text - **Translate**: Convert text to another language --- ## Key Features ### Voice-to-Action The primary capability. Execute real actions across connected apps and the internet using natural voice commands. Send Slack messages, create calendar events, manage Notion pages, search the web, send emails, control Spotify — all by speaking naturally. Not just text output, but actual operations across your tools. Chain multiple actions together in a single command for complex workflows. ### Multi-Step Action Chaining Combine multiple actions in one voice command. VoiceOS executes them sequentially, passing information between steps. Search the web and email the results, share a file and notify someone, create an event and message the attendees — all from a single spoken instruction. ### Web Search & Internet Access Agent Mode is connected to the internet. Search for weather, places, restaurants, news, prices, or any information — and use the results directly in follow-up actions like emails, messages, or documents. ### Zero Context Switching Respond to notifications, send messages, search the web, and manage tasks without leaving your current app. Your voice handles interruptions so your hands and focus stay where they belong. ### Context Awareness VoiceOS uses the active app and surrounding text to understand context. It spells names correctly based on what's on screen, understands what app you're in, and adjusts its behavior accordingly. ### Style Adaptation Automatically adapts formatting and tone per app: - **Email (Gmail, Outlook)**: Professional tone with proper greetings and sign-offs - **Chat (Slack, iMessage)**: Casual tone with natural brevity - **Messaging (WhatsApp, Telegram)**: Conversational and relaxed - **Documents (Google Docs, Notion)**: Structured and formal ### Custom Dictionary Add technical terms, names, abbreviations, and jargon for better recognition. VoiceOS learns from your corrections over time, building a personal vocabulary. ### Knowledge Base Personal and team knowledge — email signatures, contact information, company details, writing style preferences — automatically used to personalize output. Set it once, VoiceOS uses it across every interaction. ### 100+ Language Support Automatic language detection with no switching required. Speak in any language and VoiceOS recognizes it instantly. English, Spanish, French, German, Japanese, Chinese, Korean, Hindi, and 90+ more. ### Privacy-First Architecture - Audio is never stored on servers unless you explicitly opt in - Transcripts are saved locally on your device only - Your data is never used for AI training - Nothing is shared with third parties ### Hands-Free Mode Tap to start recording, tap to stop — no need to hold any key. Ideal for longer sessions or users who prefer not to hold a key while speaking. ### Custom Writing Contexts Per-app customization of capitalization, punctuation, and tone. Built-in presets: Formal, Casual, Very Casual, and Excited. ### Session History All sessions stored locally on your device, searchable and filterable by app and mode. Track your voice productivity over time. ### Team Features - **Shared vocabulary**: Team-wide dictionary of names, terms, and acronyms - **Shared knowledge base**: Company-wide information available to all team members - **Centralized billing**: Single billing account for the entire team - **Seat management**: Add and manage team members easily --- ## Compatible Apps VoiceOS works system-wide in every app on your computer. Agent Mode can execute actions in connected apps, and voice-to-text works in any text field. ### Agent Mode Integrations (Execute Actions) Web Search, Slack, Gmail, Google Calendar, Notion, Google Drive, Google Docs, Google Sheets, Spotify ### Voice-to-Text Compatibility (Type by Voice) Every app on your computer with a text field, including: **Messaging & Communication:** Slack, Gmail, iMessage, WhatsApp, Telegram, Signal, Messenger, Microsoft Teams, Zoom, Outlook, Apple Mail, Superhuman **Writing & Documents:** Google Docs, Notion, Obsidian, Apple Notes, OneNote, Evernote, Google Keep, Microsoft Word **Development:** VS Code, Cursor, GitHub, Replit, Terminal applications, JetBrains IDEs **AI Tools:** ChatGPT, Claude, Perplexity, Gemini **Design & Productivity:** Figma, Linear, Arc, Raycast, Canva, Asana, Trello **Social Media:** X (Twitter), Instagram, Snapchat, LinkedIn, TikTok And hundreds more — any app with a text field. --- ## Platform Support ### macOS - **System requirements**: macOS 10.15 (Catalina) or later - **Processor**: Intel and Apple Silicon (M1/M2/M3/M4) supported - **Architecture**: Native performance with Rust and Swift components - **Integration**: System-wide, works in all applications ### Windows - **System requirements**: Windows 10 or Windows 11 - **Architecture**: Native desktop experience - **Integration**: System-wide, works in all applications ### iOS (Coming Soon) - Join the waitlist at [voiceos.com/mobile](https://www.voiceos.com/mobile) --- ## Pricing ### Free Plan — $0 forever - No credit card required - 100 uses per week - AI Dictation Mode - Ask Mode - Custom vocabulary - Works in every app - 100+ languages ### Pro Plan — $12/month (annual) or $15/month (monthly) - Everything in Free - Unlimited usage - Agent Mode (Voice-to-Action) - Edit Mode - Priority support - Priority feature access - Team features ### Enterprise Plan — Custom pricing - Everything in Pro - Zero data retention - SOC 2 Type II compliance - ISO 27001 compliance - SSO / SAML - Dedicated support --- ## Privacy & Security ### Data Handling - **Audio**: Processed in real-time, never stored on servers unless you explicitly opt in - **Transcripts**: Saved locally on your device only - **Training**: Your data is never used for AI model training - **Sharing**: Nothing is shared with third parties ### Enterprise Security - **SOC 2 Type II**: Available on Enterprise plans - **ISO 27001**: Available on Enterprise plans - **Zero Data Retention**: Enforced on Enterprise plans - **SSO / SAML**: Single sign-on integration for enterprise --- ## For Developers VoiceOS is particularly powerful for developers who live in their IDE but need to communicate throughout the day: ### The Developer Context Switching Problem Developers context-switch constantly: coding in Cursor, then switching to Slack to respond, then to Gmail for an email, then back to code — losing their place each time. VoiceOS lets developers handle all communication by voice without ever leaving their editor. ### Developer Workflows with VoiceOS - **Reply to Slack without leaving your IDE**: A message comes in — say "reply to that Slack — I'll push the fix after lunch" and you never leave Cursor - **Send emails from your editor**: "Email the product team — the API endpoint is ready for testing" - **Schedule meetings while coding**: "Create a meeting with backend team for tomorrow at 10am" - **Chain actions from your editor**: "Share the design spec on Google Drive with the frontend team and send them a Slack message saying it's ready for review" - **Search while coding**: "Search for the React useEffect cleanup pattern" — get results without opening a browser - **Give AI tools richer context by voice**: Speak detailed prompts to Cursor, Copilot, ChatGPT, or Claude — faster and more detailed than typing - **Write commit messages and PR descriptions by voice** - **Dictate documentation without switching apps** ### Supported Development Tools - **Cursor**: Full compatibility — speak prompts directly into Cursor's AI chat - **VS Code**: Works in all text fields including terminal, editor, and extensions - **GitHub**: Write PR descriptions, issues, and comments by voice - **Replit**: Dictate code and prompts in the browser-based IDE - **Terminal**: Speak commands and documentation --- ## For Teams & Business ### Team Plans VoiceOS Pro supports both individuals and teams: - Shared vocabulary for company terminology, product names, and acronyms - Shared knowledge base for company-wide information - Centralized billing and seat management - No minimum seat requirements ### Enterprise Features For organizations requiring advanced security and compliance: - SOC 2 Type II and ISO 27001 compliance - Enforced zero data retention across all users - SSO / SAML integration - Dedicated support - Contact: [voiceos.com](https://www.voiceos.com) --- ## How VoiceOS Compares to Other Voice Tools Most voice tools on the market are dictation-only — they convert speech to text and stop there. VoiceOS is fundamentally different because it includes Agent Mode, which executes real actions across your apps and the internet, chains multiple actions together, and eliminates context switching entirely. ### VoiceOS vs Wispr Flow Wispr Flow is a voice-to-text dictation tool available on Mac, Windows, iPhone, and Android. It transcribes speech into polished text with AI auto-edits, supports 100+ languages, offers personal dictionaries and snippet libraries, and adapts tone based on the app. **Key differences:** - **Agent Mode**: VoiceOS can execute real actions — send Slack messages, create calendar events, send emails, manage Google Drive files, update Notion pages, search the web. Wispr Flow only produces text; it cannot take actions in your apps. - **Multi-step action chaining**: VoiceOS can chain multiple actions in a single voice command (e.g., "search the weather, email the team about it, and create a calendar event"). Wispr Flow does not support action chaining. - **Web search**: VoiceOS Agent Mode is connected to the internet and can search for weather, places, facts, and use results in follow-up actions. Wispr Flow has no web search capability. - **Zero context switching**: With VoiceOS, you can reply to a Slack notification or send an email without leaving your current app. Wispr Flow only types text into the active text field. - **Both offer**: AI-powered dictation with filler word removal, grammar correction, tone adaptation, 100+ languages, custom vocabulary, and work in any app's text fields. For more details, see [VoiceOS vs Wispr Flow](https://www.voiceos.com/compare/wispr-flow). ### VoiceOS vs SuperWhisper SuperWhisper is a voice-to-text tool for Mac, Windows, and iOS. It offers on-device transcription with offline support, custom modes, meeting recording, file transcription, and 100+ languages. It supports local AI models as well as cloud models. **Key differences:** - **Agent Mode**: VoiceOS can execute real actions across connected apps (Slack, Gmail, Calendar, Notion, Drive, Docs, Sheets, Spotify) and search the internet. SuperWhisper only produces text output. - **Multi-step action chaining**: VoiceOS can combine multiple actions in one voice command. SuperWhisper does not support action execution or chaining. - **Web search**: VoiceOS can search the internet and feed results into follow-up actions. SuperWhisper has no web search capability. - **Offline support**: SuperWhisper offers offline transcription with local AI models. VoiceOS requires an internet connection for its AI features. - **Meeting recording**: SuperWhisper includes meeting recording and transcription. VoiceOS focuses on voice-to-action and voice-to-text for productivity. - **Both offer**: AI-powered dictation, filler word removal, custom vocabulary, multiple languages, and work system-wide in any app. For more details, see [VoiceOS vs SuperWhisper](https://www.voiceos.com/compare/superwhisper). ### VoiceOS vs Willow Willow is a voice dictation tool for Mac, Windows, and iPhone. It offers 300ms processing speed, 100+ languages, automatic dictionary learning, writing style personalization, and SOC 2 Type II and HIPAA compliance. **Key differences:** - **Agent Mode**: VoiceOS can execute real actions across apps and search the internet. Willow only produces text. - **Multi-step action chaining**: VoiceOS can chain multiple actions in a single voice command. Willow does not support action chaining. - **Web search**: VoiceOS Agent Mode can search the internet. Willow has no web search. - **Both offer**: Fast processing, 100+ languages, automatic dictionary, style personalization, SOC 2 Type II. For more details, see [VoiceOS vs Willow](https://www.voiceos.com/compare/willow). ### VoiceOS vs Aqua Voice Aqua Voice is a voice dictation tool for Mac and Windows. It offers context-aware transcription with a focus on coding and technical vocabulary, 50+ languages, and manual dictionary terms. **Key differences:** - **Agent Mode**: VoiceOS can execute real actions and search the internet. Aqua Voice only produces text. - **Speed**: VoiceOS processes at 300ms vs Aqua Voice's 500ms–1s. - **Accuracy**: VoiceOS achieves 98%+ with context vs Aqua Voice's ~85–90%. - **Languages**: VoiceOS supports 100+ languages vs Aqua Voice's 50+. - **Dictionary**: VoiceOS learns terms automatically; Aqua Voice requires manual entry. - **Enterprise**: VoiceOS is SOC 2 Type II certified with team features. Aqua Voice has no enterprise features. For more details, see [VoiceOS vs Aqua Voice](https://www.voiceos.com/compare/aqua-voice). ### VoiceOS vs Typeless Typeless is a voice dictation tool for Mac, Windows, iOS, and Android. It offers 100+ languages, writing style personalization, filler word removal, and zero cloud data retention by default. **Key differences:** - **Agent Mode**: VoiceOS can execute real actions across apps and search the internet. Typeless only produces text. - **Multi-step action chaining**: VoiceOS can chain multiple actions in a single voice command. Typeless does not support action chaining. - **Enterprise**: VoiceOS is SOC 2 Type II certified. Typeless does not have SOC 2 compliance. - **Mobile**: Typeless is available on iOS and Android. VoiceOS iOS is coming soon. - **Privacy**: Typeless offers zero cloud data retention on all plans. VoiceOS offers this on enterprise plans. For more details, see [VoiceOS vs Typeless](https://www.voiceos.com/compare/typeless). ### Summary: VoiceOS vs All Dictation Tools | Feature | VoiceOS | Wispr Flow | SuperWhisper | Willow | Aqua Voice | Typeless | |---------|---------|------------|--------------|--------|------------|----------| | Processing Speed | 300ms | N/A | N/A | 300ms | 500ms–1s | 1400ms | | Accuracy | 98%+ | N/A | N/A | N/A | ~85–90% | N/A | | AI Dictation | Yes | Yes | Yes | Yes | Yes | Yes | | Agent Mode (voice-to-action) | Yes | No | No | No | No | No | | Multi-step action chaining | Yes | No | No | No | No | No | | Web search | Yes | No | No | No | No | No | | Slack integration | Yes | No | No | No | No | No | | Gmail integration | Yes | No | No | No | No | No | | Calendar integration | Yes | No | No | No | No | No | | Notion integration | Yes | No | No | No | No | No | | Google Drive integration | Yes | No | No | No | No | No | | Dictionary Terms | Auto + manual | Auto + manual | Manual | Auto + manual | Manual | N/A | | 100+ languages | Yes | Yes | Yes | Yes | 50+ | Yes | | SOC 2 Type II | Yes | Yes | Yes | Yes | No | No | | Offline support | No | No | Yes | No | No | No | | macOS | Yes | Yes | Yes | Yes | Yes | Yes | | Windows | Yes | Yes | Yes | Yes | Yes | Yes | | iOS | Coming soon | Yes | Yes | Yes | No | Yes | | Android | No | Yes | No | No | No | Yes | --- ## About ### Company VoiceOS is built by WakoAI Inc., backed by Y Combinator (W26 batch). ### Founders Founded by Kai and Jonah — voice AI pioneers with 7 years of experience building voice AI solutions for consumer products and Fortune 500 deployments. ### Mission Fundamentally change how people interact with their computers by making voice the primary interface. The future of productivity isn't faster typing — it's not typing at all. VoiceOS turns your voice into an operating system that bridges the gap between thought and action, eliminating the context switching that fragments your day. ### Why Now Voice technology has reached an inflection point. Speech recognition accuracy has crossed 95%, large language models can understand context and intent, and real-time processing is fast enough for conversation-speed interactions. For the first time, it's possible to build a voice agent that doesn't just transcribe — it understands, decides, and acts. --- ## Frequently Asked Questions ### What is VoiceOS? VoiceOS is an AI voice agent for your desktop computer. It lets you control your apps, search the internet, and get work done by speaking. You can send Slack messages, reply to emails, create calendar events, update Notion pages, search the web, and chain multiple actions together in a single voice command. Think of it like Siri, but designed for real desktop productivity — with the ability to combine actions across your apps and the internet. ### How is VoiceOS different from Siri or Google Assistant? Siri and Google Assistant are general-purpose phone assistants. VoiceOS is built specifically for desktop productivity. It integrates deeply with your work tools — Slack, Gmail, Google Calendar, Notion, Google Docs, and more — and executes real actions within them. Unlike Siri, VoiceOS can chain multiple actions together: search the weather, compose an email with the forecast, share a file on Drive, and notify someone on Slack — all from one voice command. It also works as a system-wide voice-to-text tool in any app on your computer. ### How is VoiceOS different from traditional dictation tools? Traditional dictation tools only transcribe speech to text. VoiceOS goes much further — its Agent Mode executes real actions across your apps and the internet. Instead of just typing what you say, VoiceOS can search the web, send the Slack message, create the calendar event, share a file, and chain these actions together in a single command. When you do use it for dictation, VoiceOS also automatically removes filler words, fixes grammar, and adapts tone based on the app. ### What apps does VoiceOS integrate with? Agent Mode currently integrates with web search, Slack, Gmail, Google Calendar, Notion, Google Drive, Google Docs, Google Sheets, and Spotify. You can chain actions across any of these together in a single voice command. For voice-to-text, VoiceOS works in any app on your computer with a text field — hundreds of apps with zero setup. ### Can VoiceOS chain multiple actions together? Yes. This is one of the most powerful features. You can combine multiple actions in a single voice command — for example: "Check the weather for Saturday, email the team suggesting a beach day with the forecast, and create a calendar event for it." VoiceOS executes each step in sequence, passing information from one to the next. ### Is my data private and secure with VoiceOS? Yes. VoiceOS processes your audio in real-time and never stores it unless you give explicit permission. Transcripts are saved locally on your device. None of your data is used for training or shared with anyone. ### What does "zero context switching" mean? Context switching is when you interrupt what you're doing to switch to another app. For example, leaving your code editor to respond to a Slack message. VoiceOS eliminates this by letting you respond to messages, send emails, and manage tasks by voice — without ever leaving your current app. ### Is there a free plan? Yes. VoiceOS Free gives you 100 uses per week with AI Dictation and Ask Mode. Pro starts at $12/month with unlimited usage and Agent Mode. ### What platforms does VoiceOS support? macOS (10.15+) and Windows (10+). iOS is coming soon. ### What languages does VoiceOS support? Over 100 languages with automatic language detection. No switching required. --- ## Blog - [Voice is becoming the default interface](https://www.voiceos.com/blog/voice-for-every-app): Claude Code just shipped voice mode. Here's what that means, why it matters for every developer, and how you can get the same voice-first experience in any app with VoiceOS. - [VoiceOS: The Voice Operating System for Productivity — Y Combinator Launch](https://www.voiceos.com/blog/yc-launch): We're on a mission to turn your voice into 10X productivity across every app. --- ## Community & Support - [Discord Community](https://discord.gg/voiceos): Join the VoiceOS community for support, feedback, and feature requests. - [X (Twitter)](https://x.com/voiceos): Follow VoiceOS for updates and announcements. - Email: support@voiceos.com --- ## Legal - [Privacy Policy](https://www.voiceos.com/privacy): VoiceOS Privacy Policy. Your data stays on your device. - [Terms of Service](https://www.voiceos.com/terms): VoiceOS Terms of Service. - [Japanese Site](https://www.voiceos.com/ja): VoiceOS in Japanese (日本語).