№ 04Learn With Darin · Field Guide

ChatGPT: a practitioner’s field guide.

The shape of the consumer ChatGPT app in 2026: desktop, mobile, and Atlas browser. What each plan buys you, where the model picker quietly demotes you, and the workflows that survive a year of use.

Updated May 2026 ~25 min read Covers Free through Enterprise
Part 01

The shape of ChatGPT in 2026

ChatGPT in 2026 looks very different from ChatGPT in 2024. There are now six surfaces, six pricing tiers, and a model picker whose options change based on which one you're paying for. Most of the confusion people have isn't about how to use the product. It's about which version of the product they're using.

The surfaces:

  • Web at chatgpt.com, the canonical surface. Every feature lands here first.
  • Mac desktop app: native Apple-Silicon-only client (macOS 14+), with menu bar, global hotkey, screen sharing, and Companion Mode.
  • Windows desktop app: wrapped web app with a global hotkey and a few native affordances.
  • iOS app: native iPhone/iPad client with Lock Screen widgets, Siri Shortcuts, and Action Button support.
  • Android app: native client with Quick Settings tile, widgets, and standard share-intent integration.
  • ChatGPT Atlas: OpenAI's standalone web browser (Mac at launch, October 2025; Windows in 2026) with ChatGPT as a constant companion, Operator-style agent mode, and "browser memories."

Behind every surface is the same model lineup, but which models the picker offers depends on your plan. As of May 2026 the lineup most users see is:

  • GPT-5.3 Instant: the everyday model. Free users are pinned here.
  • GPT-5.5: the smarter day-to-day model on Plus and above.
  • GPT-5 Pro: the long-context, deeper-reasoning model on Pro and above (1M-token context on the $200 tier).
  • GPT-4.5o and o3-mini, still in the picker for tasks that benefit from older characteristics, gradually being deprecated.
Note OpenAI announced in March 2026 that ChatGPT Atlas, the ChatGPT desktop app, and Codex would be combined into a single desktop application. That work is in flight; as of May 2026 the three are still separate installs but expect that to change.

The most important thing to internalize: the same conversation can quietly drop to a smaller model when you exceed soft caps. ChatGPT does not loudly tell you about the model change. The same chat keeps going, the answers just get worse. The web app surfaces a small badge near the model picker; the desktop and mobile apps often hide it. Watch for it.

Part 02

The plans (and the message caps that bite)

There are six consumer plans in 2026. The pricing table is messy because OpenAI raised, lowered, and reshuffled tiers throughout late 2025 and early 2026.

Plan Cost Default model Cap Notable
Free $0 GPT-5.3 Instant ~10 messages / 5 hours Web search, basic image generation, voice (standard mode), Atlas browser access.
Go $8/mo GPT-5.3 Instant ~80 / 5 hours Adds raised limits and image upload; introduced as a low-cost on-ramp in late 2025.
Plus $20/mo GPT-5.5 400–2000 / 5 hours Adds GPT-5.5, Deep Research (10/mo), Sora (video), Agent Mode, Custom GPTs, advanced voice.
Pro $100 $100/mo GPT-5.5 + GPT-5 Pro 5× Plus Launched April 9, 2026 as the middle tier. 5× Plus headroom and priority on new features.
Pro $200 $200/mo GPT-5 Pro 20× Plus Adds 1M-token context, near-unlimited usage, early access. The "I bill it as a tool" tier.
Business $20/seat annual
$25 monthly
GPT-5.5 Plus-equivalent Org admin, shared workspaces, billing centralization. Repriced down from $25 in April 2026.
Enterprise Sales GPT-5 Pro Pro-equivalent + custom SSO, SCIM, audit logs, custom retention, dedicated capacity. The compliance tier.

The 5-hour rolling window matters: it's not "per day," it's "any 5 contiguous hours." A heavy 1pm session leaves you tighter at 5pm but back to full at 7pm. Voice consumes this budget faster than text, sometimes by 3-5×, depending on how much Advanced Voice Mode is in use.

Warn The Deep Research 10/month cap on Plus is a hard ceiling, not a soft one. If you batch long-form research tasks, you'll burn through this in a week. Pro $100 raises it; Pro $200 effectively unbounds it for most people.
Part 03

Mac vs Windows

The desktop apps look similar. They're not similar. The Mac app is a real native client; the Windows app is a more polished wrapper over the web. That difference shows up everywhere.

The fundamental asymmetry

The Mac app is Apple Silicon only (macOS 14+). It uses Apple frameworks, supports trackpad haptics, integrates cleanly with the system menu bar, and behaves like a real first-class macOS application. The Windows app is functional but more clearly a wrapped web view. It works, but it doesn't feel native in the way Mac users expect.

The single most-used feature of either app is the global hotkey:

macOS

  • Default global hotkey: +Space.
  • Customizable in Settings → Shortcut.
  • Conflicts with Spotlight only if you've remapped Spotlight off ⌘+Space. Most don't, so Option+Space is clean.
  • Menu-bar icon for one-click access; closing the main window keeps it alive.
  • Trackpad haptics on Magic Trackpad.

Windows

  • Default global hotkey: Alt+Space.
  • Conflicts with the legacy "system menu" command on focused windows; change it on day one.
  • System tray icon; closing the window minimizes to tray.
  • Some users report the hotkey gets unbound after Windows updates; rebinding in Settings fixes it.
  • Available via Microsoft Store or direct download from chatgpt.com.

Companion Mode

Both apps support Companion Mode, a small floating window that pins ChatGPT next to whatever you're working on. It's the headline feature of the desktop apps, and the most-cited reason people install them.

Practical notes:

  • On Mac, the companion window is a real always-on-top window. On Windows, it's "always-on-top" with caveats: some full-screen apps will steal focus and hide it.
  • Screen sharing from the companion window works on both apps. Mac uses ScreenCaptureKit (the modern API); Windows uses GDI capture, which is slower and chokes on hardware-accelerated video.
  • Companion Mode is not the same as Operator / Agent Mode. The companion is a window; Operator is a separate flow that takes actions in your browser. The naming is confusingly similar.

Voice mode on desktop

Both apps support voice mode, and both apps offer Advanced Voice Mode on paid plans. The Mac app gets a slightly nicer experience: the voice indicator integrates with the menu bar, and the audio routes to your system default cleanly. On Windows, voice mode respects the default communication device, which may differ from your default playback. Exactly the same trap as the Claude app.

Tip On Mac, the desktop app's voice mode is genuinely better than the mobile app's for sit-down, headphones-in conversations because the audio path is shorter and the latency is lower. Use the desktop for "deep voice" sessions; use the phone for ambient ones.

File handling

Drag-and-drop works on both, with the same caveats Claude has on Windows: OneDrive on-demand files need to be hydrated first; antivirus may flag the desktop app's clipboard-paste behavior. Both apps cap individual uploads at ~512MB and per-message totals at ~2GB on paid tiers; Free is much lower.

One Mac-only nicety: quick look-style PDF preview shows up after upload, so you can confirm the right file went in. Windows shows just a filename.

Part 04

iOS vs Android

Mobile is where ChatGPT's per-platform divergence is sharpest, because Apple and Google give app developers fundamentally different primitives for OS-level integration.

The basics

From inside a chat the two apps are nearly identical. Same model lineup, same Custom GPTs, same Memory, same image and document upload, same Canvas (when available on your tier). The fork shows up in how you get to ChatGPT from elsewhere on the device.

Voice mode

Both platforms have Standard Voice Mode (free) and Advanced Voice Mode (paid). The technical difference matters:

  • Standard Voice: speech-to-text, then GPT-5.3 Instant, then text-to-speech. Slower, less expressive, but available to all users.
  • Advanced Voice: directly processes audio. No text round-trip. Picks up tone, hesitation, accent. Can be interrupted naturally.

If you've never used Advanced Voice, the experience is meaningfully different from any other AI voice product. It's the closest "talking to a person" comes today, and most casual ChatGPT users haven't tried it because Standard Voice is the default.

Where the platforms diverge

iOS

  • Lock Screen widgets with one-tap voice: the fastest path to Advanced Voice on any platform.
  • Action Button (iPhone 15 Pro and up): assign ChatGPT as your Action Button target. Squeeze, talk.
  • Siri Shortcuts: "Hey Siri, ask ChatGPT…" Handoff is functional but jankier than native Siri because Apple gates intents.
  • StandBy support: while charging on its side, the phone can show a "tap to talk" affordance.
  • Standard share sheet integration; selected text to "ChatGPT" target works in any app.
  • Siri can be set as the system fallback for queries it can't answer, and that fallback can route to ChatGPT (Apple Intelligence integration).

Android

  • Quick Settings tile: the killer Android feature, one tap from any app, even from inside another full-screen app. iOS has no equivalent surface.
  • Home Screen widget with New Chat / Voice / Camera buttons (same triple Claude offers).
  • Long-press launcher menu shortcuts: New Chat, Voice, Last Conversation.
  • Standard share intent. Share to ChatGPT works from any app.
  • No lock-screen widget (Android's lock screen is more constrained than iOS 16+).
  • Cannot be set as the system-default assistant, but Bixby/Google Assistant can hand off via intent.

Net: if your interaction pattern is "phone is locked, I want to talk to ChatGPT now," iOS wins by a lot. Lock Screen widget + Action Button + StandBy gives multiple zero-step paths. If your pattern is "I'm always inside another app and want a quick interjection," Android wins via Quick Settings.

Camera, vision, and live video

Both apps support live video on Advanced Voice: the camera streams to the model and you can have a real-time conversation about what it sees. This was OpenAI's flagship demo for advanced voice and it's now broadly available on paid tiers. Both platforms work; iOS gets slightly better motion stabilization on iPhone 15 Pro and newer due to camera-side compute.

Background limits

  • iOS handles voice mode in the background reliably. Locking the screen with Advanced Voice running keeps the conversation alive.
  • Android's aggressive battery management is the same trap as the Claude app: Samsung One UI's "Sleeping apps" and Xiaomi's MIUI background killers will sever long voice sessions. Allowlist ChatGPT in battery optimization settings.
Part 05

The Atlas browser

ChatGPT Atlas launched on Mac in October 2025 and broadened to Windows in 2026. It's a Chromium-based browser with ChatGPT built in as a sidecar. Strategically: OpenAI is making a play for the browser to become a primary surface for ChatGPT, similar to what Arc tried before its sunset.

What's distinctive:

  • Companion sidebar: every webpage opens with ChatGPT pinned next to it. Click a search result and you get the page + a chat that already knows the page's contents.
  • Browser memories: ChatGPT remembers context from sites you visit. The privacy model is opt-in per-site; you can review and delete memories from the settings.
  • Agent Mode (Operator): ChatGPT can take actions in the browser on your behalf, like booking restaurants, filling forms, comparing prices, completing checkouts. This is integrated more deeply into Atlas than into the standalone ChatGPT apps.
  • Same account, same conversations: chats from Atlas show up in your ChatGPT history alongside chats from the desktop app and mobile.
Warn The "browser memories" feature is the single most underestimated privacy surface in the OpenAI ecosystem. By default, ChatGPT can recall context from sites you visited days or weeks ago in conversations that have nothing to do with those sites. Read the prompt at first launch carefully, or at minimum review Settings → Memories → Site memories after a few days.

Atlas is free for everyone, including Free-tier ChatGPT users. The agent mode and most advanced features require Plus or above.

One more note for the medium term: OpenAI announced plans to merge Atlas, the ChatGPT desktop app, and Codex into a single desktop application (March 2026 announcement). That hasn't shipped yet but is on the public roadmap. If you're on the fence about which desktop install to commit to, this convergence reduces the stakes. They'll all become facets of the same install eventually.

Part 06

Capability matrix

The features that exist in your account, as of May 2026:

Capability Web Mac Win iOS Android Atlas
Text chat
File upload
Image input (vision)
Image generation
Sora video (Plus+)
Custom GPTs (Plus+)
Canvaspartialpartial
Memory
Projects
Tasks (scheduled)
Standard Voice
Advanced Voice (Plus+)
Live video on voice
Deep Research (Plus+)
Agent Mode / Operator (Plus+)partialpartialpartial
Code Interpreter
Companion Mode
Screen sharing
Global hotkey
Lock-screen widget
Quick Settings tile
Action Button
Siri Shortcuts

"Partial" on Canvas mobile means: you can view and edit a Canvas document, but the keyboard-driven editing flow is meaningfully degraded vs desktop. Don't write a long doc on mobile if you can help it.

"Partial" on Agent Mode for the standalone apps means: agent mode runs on web but is most natural in Atlas, where the browser context is the agent's working surface.

Part 07

Limits, surfaced honestly

The model fallback dance

The most opaque and most painful behavior in ChatGPT in 2026 is silent model fallback. When you exceed soft caps:

  1. You start on GPT-5.5 (or GPT-5 Pro on Pro tiers).
  2. The same conversation continues, but the badge near the model picker quietly switches.
  3. Subsequent responses come from a smaller model (GPT-5.3 Instant on Plus, or GPT-4.5o on Free).
  4. The conversation feels worse but doesn't fail.

The problem: the badge is small and the apps don't always show it. If a conversation suddenly feels worse, check the model badge. It probably changed without telling you. Starting a new chat usually resets you to your preferred model, but only if the cap window has cycled.

Context window

200K on Plus and Pro $100. 1M on Pro $200. Enterprise gets even longer in negotiated configurations. ChatGPT degrades around context-fill the same way Claude does: earlier turns get summarized aggressively, then forgotten. The 1M context on Pro $200 is the killer feature for analysts and lawyers who feed it large document corpora.

Attachment ceilings

  • ~10 files per message; soft total of 512MB on paid plans.
  • PDFs: ~100 pages on most plans; longer on Pro $200.
  • Images: 20MB each, max 10 per turn. Mobile compresses uploads, so for screenshots of tiny text, prefer desktop.
  • Code Interpreter file uploads are scoped to the session and disappear when the session ends. Don't rely on persistence.

Deep Research and Sora caps

Deep Research is hard-capped: 10 runs/month on Plus, 50 on Pro $100, effectively unlimited on Pro $200. Sora video generation has its own monthly cap that varies by plan (and length and resolution). Both can be surprisingly easy to deplete in a busy week. Track your usage.

Memory is not a database

Memory is limited to ~10K tokens of accumulated summaries. New "facts" displace old ones in roughly chronological order. Don't treat it as durable storage. If you need persistence, use Projects with explicit knowledge files.

Part 08

Best practices, by surface

i.

Pin your model. Don't trust auto-routing.

ChatGPT's model picker often "auto" routes based on the prompt. For anything important, pin GPT-5.5 or GPT-5 Pro explicitly. The auto-router optimizes for plan-cost, which sometimes means it gives you the smaller model for prompts that would benefit from the bigger one.

ii.

Custom GPTs for shape; Projects for state.

A Custom GPT is a reusable persona: you publish it once and reach for it when you need that role. A Project is a reusable workspace, carrying documents, memory, and conversation history. Mixing them up wastes both. Use a Custom GPT when the instructions are the constant; use a Project when the state is the constant.

iii.

Treat Memory as scoped, not global.

Same as Claude's memory: turn off global Memory in Settings → Personalization, and let Projects' own state do the remembering. Memory's surprises (a recipe-context recalled in a code review, etc.) outweigh the convenience for any user beyond the most casual.

iv.

Advanced Voice is for dialogue, not dictation.

If you want it to type what you say, use the standard mic icon, not Advanced Voice. Advanced Voice's strength is back-and-forth: interruptions, pauses, exploratory thinking. It's a worse transcription tool than the standard one because it's actively conversing with you while you talk.

v.

Use Atlas when the task is "the web."

If your work is reading, comparing, summarizing, or acting across web pages (research, shopping, multi-tab analysis), Atlas is the right surface. The companion sidebar with built-in page context is a real productivity uplift. For everything else, stick to the desktop app or web.

vi.

Batch your Deep Research prompts.

Plus's 10/month Deep Research cap goes fast if you fire off whims. Batch a week's worth of "I should look into X" thoughts into a Saturday session and triage which ones are worth Deep Research vs a normal chat.

vii.

On mobile, set up the one-touch voice path you'll actually use.

iOS users: assign the Action Button to ChatGPT Voice, or pin a Lock Screen widget. Android users: drop the Quick Settings tile into your pull-down. Whichever you pick, do it once and rely on it. The friction of "open app, tap mic" disappears the moment one-touch is one swipe away.

viii.

Branch instead of restart.

Same advice as for Claude: edit the message at the point you want to fork, rather than starting over. ChatGPT's branching UI is in the message hover menu on web, and a long-press on mobile. Most users don't know it exists. The conversation tree is more valuable than a new chat almost every time.

Part 09

A note on privacy posture

OpenAI's defaults shifted in 2025 and again in early 2026. The current state:

  • By default, on Free / Go / Plus, OpenAI uses your conversations to train future models unless you opt out in Settings → Data Controls → "Improve the model for everyone."
  • On Pro, Business, and Enterprise, training opt-out is the default. Pro is the cheapest tier with this default.
  • Atlas's browser memories are a separate consent surface; they're stored even if you've opted out of training, and need to be managed independently.
  • Voice mode does store transcripts, not audio.
  • Custom GPTs published publicly are accessible by their author (even if conversations with them are private to you) for usage analytics.
  • Memory is visible to you in Settings → Memory; you can delete individual entries.

The one control most people miss: "Temporary Chat", available on every surface. Temporary Chats don't persist, don't update Memory, and don't (per OpenAI) feed training. Use them when you're sharing something you don't want logged.

Part 10

Troubleshooting playbook

The failure modes you'll trip on, in roughly the order they show up.

"My answers got worse"

Check the model badge near the picker. The single most common cause of "ChatGPT got dumber" is silent model fallback. If the badge shows a smaller model than you expect, you've burned your Plus / Pro headroom for this 5-hour window — start a new chat once the window cycles.

"Where did my chat history go"

Likely culprits:

  • You're signed into the wrong account (personal vs Business vs Edu); check the avatar.
  • You're in a Project's filtered view rather than All Chats.
  • You opted in to Temporary Chat and the chat genuinely doesn't persist.
  • Sync lag between mobile and desktop; give it 30 seconds, force-refresh.
"Voice mode keeps cutting out" (mobile)
  • iOS: check Background App Refresh is on for ChatGPT in Settings.
  • Android: allowlist ChatGPT in battery optimization.
  • Both: weak network on cellular degrades Advanced Voice fastest. Standard Voice tolerates it better; if your signal is bad, fall back.
"Hotkey stopped working" (Windows)

Common after Windows updates. Open the app's Settings → Shortcut and rebind. If rebinding fails, restart the app from the tray (close fully, not just minimize).

"Atlas is using too much memory"

It's a Chromium-based browser. Chromium uses RAM. If you're complaining about Atlas memory, the right comparison is Chrome, not the standalone app. Atlas is no worse than Chrome with similar tab counts. Limit tabs the same way you'd limit them in any browser.

"My Custom GPT 'forgot' something"

Custom GPTs don't have memory of past conversations by design. Each chat starts fresh against the GPT's system prompt and knowledge. If you need cross-conversation memory, use a Project (not a Custom GPT). The two get conflated; they're different mechanisms.

"Deep Research returned in 30 seconds and looks shallow"

Deep Research has a fast-path it sometimes takes for prompts it deems easy. If you wanted depth and got speed, restate the prompt with explicit depth cues: "search across at least 10 sources, compare them, identify disagreements." That re-routes to the slow path (which can take 5-15 minutes). And it counts the same against your monthly cap either way.

"Operator (Atlas) failed at checkout"

Operator is conservative around payment flows. It will pause for human confirmation at the point money changes hands; if you walked away during that pause, the session expired. This is by design. It's the same reason Claude's Computer Use refuses certain kinds of actions. Don't fight it; just be present at checkout.

Part 11

Closing thought

ChatGPT in 2026 is no longer a chatbot. It's a portfolio of products that all happen to share a model and an account. Atlas is a browser. Operator is an agent. Custom GPTs are a marketplace. Projects are a workspace. Tasks are a scheduler. Memory is a profile. Voice mode is a phone call. The chat box at chatgpt.com is one entrypoint among many.

Treat ChatGPT as five products, not one. Pick the surface that fits the task, not the surface you opened it from yesterday. — TWD

The trap is using Plus from the same web tab for everything because that's how you started. Each surface rewards a different working pattern; learning which is which is the difference between paying $20/month for a chat window and paying $20/month for an actual productivity layer.

If anything in here is out of date by the time you read it, OpenAI's release notes are the source of truth. The pace of change is high; pricing, models, and feature flags shift more or less monthly.