Add browser automation docs page (#622)

* Add browser automation docs page (#594)

Comprehensive reference for all cmux browser subcommands: navigation,
waiting, DOM interaction, inspection, JS eval, state management,
tabs, dialogs, frames, and downloads. Includes common patterns section.

* Remove unnecessary callout from browser automation docs
Lawrence Chen 2026-02-27 00:42:50 -08:00 committed by GitHub
parent e14c5a383f
commit d3cf9335d2
2 changed files with 287 additions and 0 deletions

@@ -4,6 +4,7 @@ export const navItems = [
{ title: "Configuration", href: "/docs/configuration" },
{ title: "Keyboard Shortcuts", href: "/docs/keyboard-shortcuts" },
{ title: "API Reference", href: "/docs/api" },
{ title: "Browser Automation", href: "/docs/browser-automation" },
{ title: "Notifications", href: "/docs/notifications" },
{ title: "Changelog", href: "/docs/changelog" },
];

@@ -0,0 +1,286 @@
import type { Metadata } from "next";
import { CodeBlock } from "../../components/code-block";
import { Callout } from "../../components/callout";
export const metadata: Metadata = {
title: "Browser Automation",
description:
"cmux browser command reference for navigation, DOM interaction, waiting, inspection, JavaScript evaluation, tabs, dialogs, frames, downloads, and browser state.",
};
export default function BrowserAutomationPage() {
return (
<>
<h1>Browser Automation</h1>
<p>
The <code>cmux browser</code> command group provides browser automation
against cmux browser surfaces. Use it to navigate, interact with DOM
elements, inspect page state, evaluate JavaScript, and manage browser
session data.
</p>
<h2>Command Index</h2>
<table>
<thead>
<tr>
<th>Category</th>
<th>Subcommands</th>
</tr>
</thead>
<tbody>
<tr>
<td>Navigation and targeting</td>
<td>
<code>identify</code>, <code>open</code>, <code>open-split</code>,{" "}
<code>navigate</code>, <code>back</code>, <code>forward</code>,{" "}
<code>reload</code>, <code>url</code>, <code>focus-webview</code>,{" "}
<code>is-webview-focused</code>
</td>
</tr>
<tr>
<td>Waiting</td>
<td>
<code>wait</code>
</td>
</tr>
<tr>
<td>DOM interaction</td>
<td>
<code>click</code>, <code>dblclick</code>, <code>hover</code>,{" "}
<code>focus</code>, <code>check</code>, <code>uncheck</code>,{" "}
<code>scroll-into-view</code>, <code>type</code>, <code>fill</code>,{" "}
<code>press</code>, <code>keydown</code>, <code>keyup</code>,{" "}
<code>select</code>, <code>scroll</code>
</td>
</tr>
<tr>
<td>Inspection</td>
<td>
<code>snapshot</code>, <code>screenshot</code>, <code>get</code>,{" "}
<code>is</code>, <code>find</code>, <code>highlight</code>
</td>
</tr>
<tr>
<td>JavaScript and injection</td>
<td>
<code>eval</code>, <code>addinitscript</code>, <code>addscript</code>,{" "}
<code>addstyle</code>
</td>
</tr>
<tr>
<td>Frames, dialogs, downloads</td>
<td>
<code>frame</code>, <code>dialog</code>, <code>download</code>
</td>
</tr>
<tr>
<td>State and session data</td>
<td>
<code>cookies</code>, <code>storage</code>, <code>state</code>
</td>
</tr>
<tr>
<td>Tabs and logs</td>
<td>
<code>tab</code>, <code>console</code>, <code>errors</code>
</td>
</tr>
</tbody>
</table>
<h2>Targeting a browser surface</h2>
<p>
Most subcommands require a target surface. You can pass it positionally
or with <code>--surface</code>.
</p>
<CodeBlock lang="bash">{`# Open a new browser split
cmux browser open https://example.com
# Discover focused IDs and browser metadata
cmux browser identify
cmux browser identify --surface surface:2
# Positional and --surface targeting are equivalent
cmux browser surface:2 url
cmux browser --surface surface:2 url`}</CodeBlock>
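<p>
Scripts that issue many commands against one surface can keep the handle
in a single place. The following is a minimal sketch, assuming a
POSIX-compatible shell; <code>SURFACE</code> and the <code>b</code> helper
are hypothetical names chosen for illustration and are not part of cmux.
</p>
<CodeBlock lang="bash">{`# Hypothetical helper, not provided by cmux: prepend the target surface
SURFACE="surface:2"
b() { cmux browser "$SURFACE" "$@"; }
# Equivalent to: cmux browser surface:2 url
b url
b get title`}</CodeBlock>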
<h2>Navigation</h2>
<CodeBlock lang="bash">{`cmux browser open https://example.com
cmux browser open-split https://news.ycombinator.com
cmux browser surface:2 navigate https://example.org/docs --snapshot-after
cmux browser surface:2 back
cmux browser surface:2 forward
cmux browser surface:2 reload --snapshot-after
cmux browser surface:2 url
cmux browser surface:2 focus-webview
cmux browser surface:2 is-webview-focused`}</CodeBlock>
<h2>Waiting</h2>
<p>
Use <code>wait</code> to block until a selector matches, text appears,
the URL contains a fragment, the page reaches a load state, or a
JavaScript condition is satisfied.
</p>
<CodeBlock lang="bash">{`cmux browser surface:2 wait --load-state complete --timeout-ms 15000
cmux browser surface:2 wait --selector "#checkout" --timeout-ms 10000
cmux browser surface:2 wait --text "Order confirmed"
cmux browser surface:2 wait --url-contains "/dashboard"
cmux browser surface:2 wait --function "window.__appReady === true"`}</CodeBlock>
<h2>DOM Interaction</h2>
<p>
Mutating actions support <code>--snapshot-after</code> for fast
verification in scripts.
</p>
<CodeBlock lang="bash">{`cmux browser surface:2 click "button[type='submit']" --snapshot-after
cmux browser surface:2 dblclick ".item-row"
cmux browser surface:2 hover "#menu"
cmux browser surface:2 focus "#email"
cmux browser surface:2 check "#terms"
cmux browser surface:2 uncheck "#newsletter"
cmux browser surface:2 scroll-into-view "#pricing"
cmux browser surface:2 type "#search" "cmux"
cmux browser surface:2 fill "#email" --text "ops@example.com"
cmux browser surface:2 fill "#email" --text ""
cmux browser surface:2 press Enter
cmux browser surface:2 keydown Shift
cmux browser surface:2 keyup Shift
cmux browser surface:2 select "#region" "us-east"
cmux browser surface:2 scroll --dy 800 --snapshot-after
cmux browser surface:2 scroll --selector "#log-view" --dx 0 --dy 400`}</CodeBlock>
<h2>Inspection</h2>
<p>
Use structured getters for scripts and snapshots/screenshots for human
review.
</p>
<CodeBlock lang="bash">{`cmux browser surface:2 snapshot --interactive --compact
cmux browser surface:2 snapshot --selector "main" --max-depth 5
cmux browser surface:2 screenshot --out /tmp/cmux-page.png
cmux browser surface:2 get title
cmux browser surface:2 get url
cmux browser surface:2 get text "h1"
cmux browser surface:2 get html "main"
cmux browser surface:2 get value "#email"
cmux browser surface:2 get attr "a.primary" --attr href
cmux browser surface:2 get count ".row"
cmux browser surface:2 get box "#checkout"
cmux browser surface:2 get styles "#total" --property color
cmux browser surface:2 is visible "#checkout"
cmux browser surface:2 is enabled "button[type='submit']"
cmux browser surface:2 is checked "#terms"
cmux browser surface:2 find role button --name "Continue"
cmux browser surface:2 find text "Order confirmed"
cmux browser surface:2 find label "Email"
cmux browser surface:2 find placeholder "Search"
cmux browser surface:2 find alt "Product image"
cmux browser surface:2 find title "Open settings"
cmux browser surface:2 find testid "save-btn"
cmux browser surface:2 find first ".row"
cmux browser surface:2 find last ".row"
cmux browser surface:2 find nth 2 ".row"
cmux browser surface:2 highlight "#checkout"`}</CodeBlock>
<h2>JavaScript Eval and Injection</h2>
<CodeBlock lang="bash">{`cmux browser surface:2 eval "document.title"
cmux browser surface:2 eval --script "window.location.href"
cmux browser surface:2 addinitscript "window.__cmuxReady = true;"
cmux browser surface:2 addscript "document.querySelector('#name')?.focus()"
cmux browser surface:2 addstyle "#debug-banner { display: none !important; }"`}</CodeBlock>
<h2>State</h2>
<p>
Session data commands cover cookies, local/session storage, and full
browser state snapshots.
</p>
<CodeBlock lang="bash">{`cmux browser surface:2 cookies get
cmux browser surface:2 cookies get --name session_id
cmux browser surface:2 cookies set session_id abc123 --domain example.com --path /
cmux browser surface:2 cookies clear --name session_id
cmux browser surface:2 cookies clear --all
cmux browser surface:2 storage local set theme dark
cmux browser surface:2 storage local get theme
cmux browser surface:2 storage local clear
cmux browser surface:2 storage session set flow onboarding
cmux browser surface:2 storage session get flow
cmux browser surface:2 state save /tmp/cmux-browser-state.json
cmux browser surface:2 state load /tmp/cmux-browser-state.json`}</CodeBlock>
<h2>Tabs</h2>
<p>
Browser tab operations map to browser surfaces in the active browser tab
group.
</p>
<CodeBlock lang="bash">{`cmux browser surface:2 tab list
cmux browser surface:2 tab new https://example.com/pricing
# Switch by index or by target surface
cmux browser surface:2 tab switch 1
cmux browser surface:2 tab switch surface:7
# Close current tab or a specific target
cmux browser surface:2 tab close
cmux browser surface:2 tab close surface:7`}</CodeBlock>
<h2>Console and Errors</h2>
<CodeBlock lang="bash">{`cmux browser surface:2 console list
cmux browser surface:2 console clear
cmux browser surface:2 errors list
cmux browser surface:2 errors clear`}</CodeBlock>
<h2>Dialogs</h2>
<CodeBlock lang="bash">{`cmux browser surface:2 dialog accept
cmux browser surface:2 dialog accept "Confirmed by automation"
cmux browser surface:2 dialog dismiss`}</CodeBlock>
<h2>Frames</h2>
<CodeBlock lang="bash">{`# Enter an iframe context
cmux browser surface:2 frame "iframe[name='checkout']"
cmux browser surface:2 click "#pay-now"
# Return to the top-level document
cmux browser surface:2 frame main`}</CodeBlock>
<h2>Downloads</h2>
<CodeBlock lang="bash">{`cmux browser surface:2 click "a#download-report"
cmux browser surface:2 download --path /tmp/report.csv --timeout-ms 30000`}</CodeBlock>
<h2>Common Patterns</h2>
<h3>Navigate, wait, inspect</h3>
<CodeBlock lang="bash">{`cmux browser open https://example.com/login
cmux browser surface:2 wait --load-state complete --timeout-ms 15000
cmux browser surface:2 snapshot --interactive --compact
cmux browser surface:2 get title`}</CodeBlock>
<h3>Fill a form and verify success text</h3>
<CodeBlock lang="bash">{`cmux browser surface:2 fill "#email" --text "ops@example.com"
cmux browser surface:2 fill "#password" --text "$PASSWORD"
cmux browser surface:2 click "button[type='submit']" --snapshot-after
cmux browser surface:2 wait --text "Welcome"
cmux browser surface:2 is visible "#dashboard"`}</CodeBlock>
<h3>Capture debug artifacts on failure</h3>
<CodeBlock lang="bash">{`cmux browser surface:2 console list
cmux browser surface:2 errors list
cmux browser surface:2 screenshot --out /tmp/cmux-failure.png
cmux browser surface:2 snapshot --interactive --compact`}</CodeBlock>
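<p>
The capture commands above can be wired to a failing step. This is a
sketch only: it assumes that <code>cmux browser</code> exits with a
non-zero status when a <code>wait</code> times out, which this page does
not confirm, so check the behavior of your cmux version before relying on
it.
</p>
<CodeBlock lang="bash">{`# Assumption: a timed-out wait returns a non-zero exit status
if ! cmux browser surface:2 wait --selector "#dashboard" --timeout-ms 10000; then
  # Collect logs first, then visual state, while the page is unchanged
  cmux browser surface:2 console list
  cmux browser surface:2 errors list
  cmux browser surface:2 screenshot --out /tmp/cmux-failure.png
  cmux browser surface:2 snapshot --interactive --compact
  exit 1
fi`}</CodeBlock>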
<h3>Persist and restore browser session</h3>
<CodeBlock lang="bash">{`cmux browser surface:2 state save /tmp/session.json
# ...later...
cmux browser surface:2 state load /tmp/session.json
cmux browser surface:2 reload`}</CodeBlock>
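<p>
When the saved file may not exist yet, for example on a first run, the
restore step can be guarded with a plain shell file test. A minimal
sketch, where <code>STATE</code> is a hypothetical variable name rather
than a cmux setting:
</p>
<CodeBlock lang="bash">{`# STATE is an illustrative shell variable, not a cmux option
STATE=/tmp/session.json
if [ -f "$STATE" ]; then
  cmux browser surface:2 state load "$STATE"
  cmux browser surface:2 reload
else
  echo "no saved state at $STATE, starting fresh" >&2
fi`}</CodeBlock>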
</>
);
}