Browser
Enable browser automation to let the agent browse the web, fill forms, click buttons, and interact with your logged-in sessions.
Two Browser Modes
Electron Mode (Default)
Runs in a hidden window. No setup needed. Works for JavaScript rendering, screenshots, and basic interactions.
Chrome Mode (CDP)
Connects to your actual Chrome browser with all your logged-in sessions (Gmail, GitHub, etc.). Also enables multi-tab management.
Quick Setup (Chrome Mode)
Method 1: Built-in Launcher (Recommended)
- Open Settings → Browser section
- Turn on the Enable Browser toggle
- Select your browser from the dropdown (Chrome, Edge, Brave, Arc, Chromium)
- Click Launch Browser
- Wait for status to show “Connected”
Note: The app will automatically launch your browser with remote debugging enabled. No manual commands needed.
Method 2: Manual Launch
- Launch Chrome with remote debugging:
/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222
- Enable Browser in settings
- Click Test Connection to verify
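The launch command above targets Chrome on macOS. For the other Chromium-based browsers the app supports, the same flag applies to a different executable. The following is a minimal sketch, assuming default macOS install locations; `browser_binary` and `launch_debug` are hypothetical helper names and are not part of the app.

```shell
# Map a browser name to its executable, assuming default macOS install paths.
# browser_binary and launch_debug are hypothetical helper names.
browser_binary() {
  case "$1" in
    chrome)   echo "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome" ;;
    edge)     echo "/Applications/Microsoft Edge.app/Contents/MacOS/Microsoft Edge" ;;
    brave)    echo "/Applications/Brave Browser.app/Contents/MacOS/Brave Browser" ;;
    arc)      echo "/Applications/Arc.app/Contents/MacOS/Arc" ;;
    chromium) echo "/Applications/Chromium.app/Contents/MacOS/Chromium" ;;
    *)        echo "unknown browser: $1" >&2; return 1 ;;
  esac
}

# Start the chosen browser with remote debugging on the given port (default 9222).
launch_debug() {
  bin="$(browser_binary "$1")" || return 1
  "$bin" --remote-debugging-port="${2:-9222}"
}
```

For example, `launch_debug brave` starts Brave on port 9222, and `launch_debug edge 9333` starts Edge on a custom port. If the browser is installed elsewhere, adjust the path accordingly.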
Tip: Create a shell alias for quick manual launching:
alias chrome-debug="/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222"
Settings
| Enable Browser | Toggle browser automation on/off |
| Use My Browser | Prefer Chrome mode (CDP) over Electron mode |
| Browser Selector | Choose which browser to launch (Chrome, Edge, Brave, Arc, Chromium) |
| Launch Button | One-click launch with remote debugging enabled |
| Test Connection | Verify CDP connection is working |
| CDP URL | Chrome DevTools Protocol URL. Default: http://localhost:9222 |
| Port | Custom port for CDP connection. Default: 9222 |
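To illustrate what the CDP URL points at: a browser started with remote debugging serves a small HTTP discovery endpoint, `/json/version`, whose response includes a `webSocketDebuggerUrl` field. The sketch below extracts that field. `cdp_ws_url` is a hypothetical helper name, and whether the app itself resolves the URL this way is an assumption.

```shell
# cdp_ws_url is a hypothetical helper: it reads a /json/version response on
# stdin and prints the value of its webSocketDebuggerUrl field.
cdp_ws_url() {
  sed -n 's/.*"webSocketDebuggerUrl"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}
```

With the default settings, usage would look like `curl --silent http://localhost:9222/json/version | cdp_ws_url`. If you change the Port setting, the port in the URL has to change with it.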
Capabilities
Basic Actions (Both Modes)
- Navigate to URLs
- Click buttons and links (by CSS selector)
- Type into form fields
- Take screenshots
- Execute JavaScript
- Extract page content (text, HTML, links, tables)
- Scroll pages and elements
- Hover over elements
File Operations (Both Modes)
- Download files (with custom save path)
- Upload files to form inputs
Chrome Mode Only (CDP)
- Access to your logged-in sessions (Gmail, GitHub, etc.)
- List all open tabs
- Open new tabs
- Switch between tabs
- Close tabs
- Multi-tab workflows
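The tab operations listed above correspond to HTTP endpoints that Chromium-based browsers expose on the debugging port. The following sketch calls them with `curl`, assuming the default CDP URL. The function names are hypothetical, and the app's own tab tools may work differently (for example, over the CDP WebSocket).

```shell
# Should match the CDP URL setting; the default is assumed here.
CDP_BASE="http://localhost:9222"

# Build the endpoint URL for a tab action (list, new, activate, close).
# All function names in this block are hypothetical helpers.
cdp_tab_url() {
  case "$1" in
    list)     echo "${CDP_BASE}/json/list" ;;
    new)      echo "${CDP_BASE}/json/new?$2" ;;
    activate) echo "${CDP_BASE}/json/activate/$2" ;;
    close)    echo "${CDP_BASE}/json/close/$2" ;;
    *)        echo "unknown action: $1" >&2; return 1 ;;
  esac
}

# List all open tabs as JSON (each entry carries an "id" used below).
cdp_list_tabs()  { curl --silent "$(cdp_tab_url list)"; }
# Open a new tab at the given URL; recent Chrome versions require PUT here.
cdp_new_tab()    { curl --silent --request PUT "$(cdp_tab_url new "$1")"; }
# Bring the tab with the given id to the front.
cdp_switch_tab() { curl --silent "$(cdp_tab_url activate "$1")"; }
# Close the tab with the given id.
cdp_close_tab()  { curl --silent "$(cdp_tab_url close "$1")"; }
```

A typical sequence is `cdp_list_tabs` to find a tab's `id`, then `cdp_switch_tab <id>` or `cdp_close_tab <id>`. None of these endpoints exist in Electron mode, which is why tab management is limited to Chrome mode.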
Troubleshooting
Browser already running
Close all instances of the browser first, then use the Launch button. The app needs to start the browser itself with the remote debugging flag; an instance that is already running will not pick up the flag.
Connection failed
Make sure the browser was launched with --remote-debugging-port=9222 (or the custom port set under Port), then use Test Connection to verify.
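When Test Connection fails, a command-line check can help narrow down the cause. This is a rough sketch that interprets `curl` exit codes against the browser's `/json/version` endpoint; `diagnose_cdp` is a hypothetical helper name, and the messages are only suggestions.

```shell
# diagnose_cdp is a hypothetical helper. It probes /json/version, Chrome's
# HTTP discovery endpoint for remote debugging, and explains the outcome.
diagnose_cdp() {
  port="${1:-9222}"
  status=0
  curl --silent --fail --max-time 2 --output /dev/null \
    "http://localhost:${port}/json/version" || status=$?
  case "$status" in
    0)  echo "CDP is answering on port ${port}" ;;
    7)  echo "Nothing is listening on port ${port}: relaunch the browser with the flag" ;;
    22) echo "Port ${port} answered with an HTTP error, so it is probably not a CDP endpoint" ;;
    28) echo "Port ${port} did not respond in time" ;;
    *)  echo "curl failed with exit code ${status}" ;;
  esac
  return "$status"
}
```

Run `diagnose_cdp` for the default port or `diagnose_cdp 9333` for a custom one. Exit code 7 is the usual result when the browser was started without the remote debugging flag.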
Screen locked
For Chrome mode, your screen must be unlocked and active. The browser automation needs an active session to respond.