Browser

Enable browser automation to let the agent browse the web, fill forms, click buttons, and interact with your logged-in sessions.

Two Browser Modes

Electron Mode (Default)

Runs in a hidden window. No setup needed. Works for JavaScript rendering, screenshots, and basic interactions.

Chrome Mode (CDP)

Connects to your actual Chrome browser with all your logged-in sessions (Gmail, GitHub, etc.). Also enables multi-tab management.

Quick Setup (Chrome Mode)

Method 1: Built-in Launcher (Recommended)

  1. Open Settings → Browser section
  2. Enable Enable Browser toggle
  3. Select your browser from the dropdown (Chrome, Edge, Brave, Arc, Chromium)
  4. Click Launch Browser
  5. Wait for status to show “Connected”

Note: The app will automatically launch your browser with remote debugging enabled. No manual commands needed.

Method 2: Manual Launch

  1. Launch Chrome with remote debugging:
    /Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222
  2. Enable Browser in settings
  3. Click Test Connection to verify

Tip: Create a shell alias for quick manual launching:

alias chrome-debug="/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222"

Settings

Enable BrowserToggle browser automation on/off
Use My BrowserPrefer Chrome mode (CDP) over Electron mode
Browser SelectorChoose which browser to launch (Chrome, Edge, Brave, Arc, Chromium)
Launch ButtonOne-click launch with remote debugging enabled
Test ConnectionVerify CDP connection is working
CDP URLChrome DevTools Protocol URL
Default: http://localhost:9222
PortCustom port for CDP connection
Default: 9222

Capabilities

Basic Actions (Both Modes)

  • Navigate to URLs
  • Click buttons and links (by CSS selector)
  • Type into form fields
  • Take screenshots
  • Execute JavaScript
  • Extract page content (text, HTML, links, tables)
  • Scroll pages and elements
  • Hover over elements

File Operations (Both Modes)

  • Download files (with custom save path)
  • Upload files to form inputs

Chrome Mode Only (CDP)

  • Access to your logged-in sessions (Gmail, GitHub, etc.)
  • List all open tabs
  • Open new tabs
  • Switch between tabs
  • Close tabs
  • Multi-tab workflows

Troubleshooting

Browser already running

Close all instances of the browser first, then use the Launch button. The app needs to start Chrome with the remote debugging flag.

Connection failed

Make sure the browser was launched with --remote-debugging-port=9222. Use Test Connection to verify.

Screen locked

For Chrome mode, your screen must be unlocked and active. The browser automation needs an active session to respond.