Test Browser
Browser Test Skill
Section titled “Browser Test Skill”Run end-to-end browser tests on pages affected by a PR or branch changes using the agent-browser CLI.
Use agent-browser Only For Browser Automation
Section titled “Use agent-browser Only For Browser Automation”This workflow uses the agent-browser CLI exclusively. Do not use any alternative browser automation system, browser MCP integration, or built-in browser-control tool. If the platform offers multiple ways to control a browser, always choose agent-browser.
Use agent-browser for: opening pages, clicking elements, filling forms, taking screenshots, and scraping rendered content.
Platform-specific hints:
- In OpenCode, do not use Chrome MCP tools (
mcp__claude-in-chrome__*). - In Codex, do not substitute unrelated browsing tools.
Prerequisites
Section titled “Prerequisites”- Local development server running (e.g.,
bin/dev,rails server,npm run dev) agent-browserCLI installed (see Setup below)- Git repository with changes to test
command -v agent-browser >/dev/null 2>&1 && echo "Installed" || echo "NOT INSTALLED"Install if needed:
npm install -g agent-browseragent-browser installSee the agent-browser skill for detailed usage.
Workflow
Section titled “Workflow”1. Verify Installation
Section titled “1. Verify Installation”Before starting, verify agent-browser is available:
command -v agent-browser >/dev/null 2>&1 && echo "Ready" || (echo "Installing..." && npm install -g agent-browser && agent-browser install)If installation fails, inform the user and stop.
2. Ask Browser Mode
Section titled “2. Ask Browser Mode”Ask the user whether to run headed or headless (using the platform’s question tool — e.g., question in OpenCode, request_user_input in Codex, ask_user in Gemini — or present options and wait for a reply):
Do you want to watch the browser tests run?
1. Headed (watch) - Opens visible browser window so you can see tests run2. Headless (faster) - Runs in background, faster but invisibleStore the choice and use the --headed flag when the user selects option 1.
3. Determine Test Scope
Section titled “3. Determine Test Scope”If PR number provided:
gh pr view [number] --json files -q '.files[].path'If ‘current’ or empty:
git diff --name-only main...HEADIf branch name provided:
git diff --name-only main...[branch]4. Map Files to Routes
Section titled “4. Map Files to Routes”Map changed files to testable routes:
| File Pattern | Route(s) |
|---|---|
app/views/users/* | /users, /users/:id, /users/new |
app/controllers/settings_controller.rb | /settings |
app/javascript/controllers/*_controller.js | Pages using that Stimulus controller |
app/components/*_component.rb | Pages rendering that component |
app/views/layouts/* | All pages (test homepage at minimum) |
app/assets/stylesheets/* | Visual regression on key pages |
app/helpers/*_helper.rb | Pages using that helper |
src/app/* (Next.js) | Corresponding routes |
src/components/* | Pages using those components |
Build a list of URLs to test based on the mapping.
5. Detect Dev Server Port
Section titled “5. Detect Dev Server Port”Determine the dev server port using this priority:
- Explicit argument — if the user passed
--port 5000, use that directly - Project instructions — check
AGENTS.md,AGENTS.md, or other instruction files for port references - package.json — check dev/start scripts for
--portflags - Environment files — check
.env,.env.local,.env.developmentforPORT= - Default — fall back to
3000
PORT="${EXPLICIT_PORT:-}"if [ -z "$PORT" ]; then PORT=$(grep -Eio '(port\s*[:=]\s*|localhost:)([0-9]{4,5})' AGENTS.md 2>/dev/null | grep -Eo '[0-9]{4,5}' | head -1) if [ -z "$PORT" ]; then PORT=$(grep -Eio '(port\s*[:=]\s*|localhost:)([0-9]{4,5})' AGENTS.md 2>/dev/null | grep -Eo '[0-9]{4,5}' | head -1) fifiif [ -z "$PORT" ]; then PORT=$(grep -Eo '\-\-port[= ]+[0-9]{4,5}' package.json 2>/dev/null | grep -Eo '[0-9]{4,5}' | head -1)fiif [ -z "$PORT" ]; then PORT=$(grep -h '^PORT=' .env .env.local .env.development 2>/dev/null | tail -1 | cut -d= -f2)fiPORT="${PORT:-3000}"echo "Using dev server port: $PORT"6. Verify Server is Running
Section titled “6. Verify Server is Running”agent-browser open http://localhost:${PORT}agent-browser snapshot -iIf the server is not running, inform the user:
Server not running on port ${PORT}
Please start your development server:- Rails: `bin/dev` or `rails server`- Node/Next.js: `npm run dev`- Custom port: run this skill again with `--port <your-port>`
Then re-run this skill.7. Test Each Affected Page
Section titled “7. Test Each Affected Page”For each affected route:
Navigate and capture snapshot:
agent-browser open "http://localhost:${PORT}/[route]"agent-browser snapshot -iFor headed mode:
agent-browser --headed open "http://localhost:${PORT}/[route]"agent-browser --headed snapshot -iVerify key elements:
- Use
agent-browser snapshot -ito get interactive elements with refs - Page title/heading present
- Primary content rendered
- No error messages visible
- Forms have expected fields
Test critical interactions:
agent-browser click @e1agent-browser snapshot -iTake screenshots:
agent-browser screenshot page-name.pngagent-browser screenshot --full page-name-full.png8. Human Verification (When Required)
Section titled “8. Human Verification (When Required)”Pause for human input when testing touches flows that require external interaction:
| Flow Type | What to Ask |
|---|---|
| OAuth | ”Please sign in with [provider] and confirm it works” |
| ”Check your inbox for the test email and confirm receipt” | |
| Payments | ”Complete a test purchase in sandbox mode” |
| SMS | ”Verify you received the SMS code” |
| External APIs | ”Confirm the [service] integration is working” |
Ask the user (using the platform’s question tool, or present numbered options and wait):
Human Verification Needed
This test touches [flow type]. Please:1. [Action to take]2. [What to verify]
Did it work correctly?1. Yes - continue testing2. No - describe the issue9. Handle Failures
Section titled “9. Handle Failures”When a test fails:
-
Document the failure:
- Screenshot the error state:
agent-browser screenshot error.png - Note the exact reproduction steps
- Screenshot the error state:
-
Ask the user how to proceed:
Test Failed: [route]Issue: [description]Console errors: [if any]How to proceed?1. Fix now - I'll help debug and fix2. Create todo - Add a todo for later (using the todo-create skill)3. Skip - Continue testing other pages -
If “Fix now”: investigate, propose a fix, apply, re-run the failing test
-
If “Create todo”: load the
todo-createskill and create a todo with priority p1 and descriptionbrowser-test-{description}, continue -
If “Skip”: log as skipped, continue
10. Test Summary
Section titled “10. Test Summary”After all tests complete, present a summary:
## Browser Test Results
**Test Scope:** PR #[number] / [branch name]**Server:** http://localhost:${PORT}
### Pages Tested: [count]
| Route | Status | Notes ||-------|--------|-------|| `/users` | Pass | || `/settings` | Pass | || `/dashboard` | Fail | Console error: [msg] || `/checkout` | Skip | Requires payment credentials |
### Console Errors: [count]- [List any errors found]
### Human Verifications: [count]- OAuth flow: Confirmed- Email delivery: Confirmed
### Failures: [count]- `/dashboard` - [issue description]
### Created Todos: [count]- `005-pending-p1-browser-test-dashboard-error.md`
### Result: [PASS / FAIL / PARTIAL]Quick Usage Examples
Section titled “Quick Usage Examples”# Test current branch changes (auto-detects port)/test-browser
# Test specific PR/test-browser 847
# Test specific branch/test-browser feature/new-dashboard
# Test on a specific port/test-browser --port 5000agent-browser CLI Reference
Section titled “agent-browser CLI Reference”# Navigationagent-browser open <url> # Navigate to URLagent-browser back # Go backagent-browser close # Close browser
# Snapshots (get element refs)agent-browser snapshot -i # Interactive elements with refs (@e1, @e2, etc.)agent-browser snapshot -i --json # JSON output
# Interactions (use refs from snapshot)agent-browser click @e1 # Click elementagent-browser fill @e1 "text" # Fill inputagent-browser type @e1 "text" # Type without clearingagent-browser press Enter # Press key
# Screenshotsagent-browser screenshot out.png # Viewport screenshotagent-browser screenshot --full out.png # Full page screenshot
# Headed mode (visible browser)agent-browser --headed open <url> # Open with visible browseragent-browser --headed click @e1 # Click in visible browser
# Waitagent-browser wait @e1 # Wait for elementagent-browser wait 2000 # Wait milliseconds