BrowserWing Executor provides comprehensive browser automation capabilities through HTTP APIs. You can control browser navigation, interact with page elements, extract data, and analyze page structure.
API Base URL: http://localhost:8080/api/v1/executor
Authentication: Use X-BrowserWing-Key: <api-key> header or Authorization: Bearer <token>
IMPORTANT: Always call this endpoint first to see all available commands and their parameters.
curl -X GET 'http://localhost:8080/api/v1/executor/help'
Response: Returns complete list of all commands with parameters, examples, and usage guidelines.
Query specific command:
curl -X GET 'http://localhost:8080/api/v1/executor/help?command=extract'
CRITICAL: Always call this after navigation to understand page structure and get element RefIDs.
curl -X GET 'http://localhost:8080/api/v1/executor/snapshot'
Response Example:
{
"success": true,
"snapshot_text": "Clickable Elements:\n @e1 Login (role: button)\n @e2 Sign Up (role: link)\n\nInput Elements:\n @e3 Email (role: textbox) [placeholder: your@email.com]\n @e4 Password (role: textbox)"
}
Use Cases:
curl -X POST 'http://localhost:8080/api/v1/executor/navigate' \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com"}'
curl -X POST 'http://localhost:8080/api/v1/executor/click' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e1"}'
Identifier formats:
@e1, @e2 (from snapshot)#button-id, .class-name
//button[@type='submit']
Login (text content)curl -X POST 'http://localhost:8080/api/v1/executor/type' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "user@example.com"}'
curl -X POST 'http://localhost:8080/api/v1/executor/extract' \
-H 'Content-Type: application/json' \
-d '{
"selector": ".product-item",
"fields": ["text", "href"],
"multiple": true
}'
curl -X POST 'http://localhost:8080/api/v1/executor/wait' \
-H 'Content-Type: application/json' \
-d '{"identifier": ".loading", "state": "hidden", "timeout": 10}'
curl -X POST 'http://localhost:8080/api/v1/executor/batch' \
-H 'Content-Type: application/json' \
-d '{
"operations": [
{"type": "navigate", "params": {"url": "https://example.com"}, "stop_on_error": true},
{"type": "click", "params": {"identifier": "@e1"}, "stop_on_error": true},
{"type": "type", "params": {"identifier": "@e3", "text": "query"}, "stop_on_error": true}
]
}'
Step-by-step workflow:
Discover commands: Call GET /help to see all available operations and their parameters (do this first if unsure).
Navigate: Use POST /navigate to open the target webpage.
Analyze page: Call GET /snapshot to understand page structure and get element RefIDs.
Interact: Use element RefIDs (like @e1, @e2) or CSS selectors to:
POST /click
POST /type
POST /select
POST /wait
Extract data: Use POST /extract to get information from the page.
Present results: Format and show extracted data to the user.
User Request: "Search for 'laptop' on example.com and get the first 5 results"
Your Actions:
curl -X POST 'http://localhost:8080/api/v1/executor/navigate' \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com/search"}'
curl -X GET 'http://localhost:8080/api/v1/executor/snapshot'
Response shows: @e3 Search (role: textbox) [placeholder: Search...]
curl -X POST 'http://localhost:8080/api/v1/executor/type' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "laptop"}'
curl -X POST 'http://localhost:8080/api/v1/executor/press-key' \
-H 'Content-Type: application/json' \
-d '{"key": "Enter"}'
curl -X POST 'http://localhost:8080/api/v1/executor/wait' \
-H 'Content-Type: application/json' \
-d '{"identifier": ".search-results", "state": "visible", "timeout": 10}'
curl -X POST 'http://localhost:8080/api/v1/executor/extract' \
-H 'Content-Type: application/json' \
-d '{
"selector": ".result-item",
"fields": ["text", "href"],
"multiple": true
}'
Found 15 results for 'laptop':
1. Gaming Laptop - $1299 (https://...)
2. Business Laptop - $899 (https://...)
...
POST /navigate - Navigate to URLPOST /go-back - Go back in historyPOST /go-forward - Go forward in historyPOST /reload - Reload current pagePOST /click - Click element (supports: RefID @e1, CSS selector, XPath, text content)POST /type - Type text into input (supports: RefID @e3, CSS selector, XPath)POST /select - Select dropdown optionPOST /hover - Hover over elementPOST /wait - Wait for element state (visible, hidden, enabled)POST /press-key - Press keyboard key (Enter, Tab, Ctrl+S, etc.)POST /extract - Extract data from elements (supports multiple elements, custom fields)POST /get-text - Get element text contentPOST /get-value - Get input element valueGET /page-info - Get page URL and titleGET /page-text - Get all page textGET /page-content - Get full HTMLGET /snapshot - Get accessibility snapshot (⭐ ALWAYS call after navigation)GET /clickable-elements - Get all clickable elementsGET /input-elements - Get all input elementsPOST /screenshot - Take page screenshot (base64 encoded)POST /evaluate - Execute JavaScript codePOST /batch - Execute multiple operations in sequencePOST /scroll-to-bottom - Scroll to page bottomPOST /resize - Resize browser windowPOST /tabs - Manage browser tabs (list, new, switch, close)POST /fill-form - Intelligently fill multiple form fields at onceGET /console-messages - Get browser console messages (logs, warnings, errors)GET /network-requests - Get network requests made by the pagePOST /handle-dialog - Configure JavaScript dialog (alert, confirm, prompt) handlingPOST /file-upload - Upload files to input elementsPOST /drag - Drag and drop elementsPOST /close-page - Close the current page/tabYou can identify elements using:
RefID (Recommended): @e1, @e2, @e3
/snapshot endpoint"identifier": "@e1"
CSS Selector: #id, .class, button[type="submit"]
"identifier": "#login-button"
XPath: //button[@id='login'], //a[contains(text(), 'Submit')]
"identifier": "//button[@id='login']"
Text Content: Login, Sign Up, Submit
"identifier": "Login"
ARIA Label: Elements with aria-label attribute
Before starting:
GET /help if you're unsure about available commands or their parametersDuring automation:
/snapshot after navigation to get page structure and RefIDs@e1) over CSS selectors for reliability and stability/wait for dynamic content that loads asynchronously/batch for multiple sequential operations to improve efficiencyError handling:
/snapshot again to refresh page structureData extraction:
fields parameter to specify what to extract: ["text", "href", "src"]
multiple: true to extract from multiple elementsScenario: User wants to login to a website
User: "Please log in to example.com with username 'john' and password 'secret123'"
Your Actions:
Step 1: Navigate to login page
POST http://localhost:8080/api/v1/executor/navigate
{"url": "https://example.com/login"}
Step 2: Get page structure
GET http://localhost:8080/api/v1/executor/snapshot
Response:
Clickable Elements:
@e1 Login (role: button)
Input Elements:
@e2 Username (role: textbox)
@e3 Password (role: textbox)
Step 3: Enter username
POST http://localhost:8080/api/v1/executor/type
{"identifier": "@e2", "text": "john"}
Step 4: Enter password
POST http://localhost:8080/api/v1/executor/type
{"identifier": "@e3", "text": "secret123"}
Step 5: Click login button
POST http://localhost:8080/api/v1/executor/click
{"identifier": "@e1"}
Step 6: Wait for login success (optional)
POST http://localhost:8080/api/v1/executor/wait
{"identifier": ".welcome-message", "state": "visible", "timeout": 10}
Step 7: Inform user
"Successfully logged in to example.com!"
Scenario: Fill out a form with multiple fields
Instead of making 5 separate API calls, use one batch operation:
curl -X POST 'http://localhost:8080/api/v1/executor/batch' \
-H 'Content-Type: application/json' \
-d '{
"operations": [
{
"type": "navigate",
"params": {"url": "https://example.com/form"},
"stop_on_error": true
},
{
"type": "type",
"params": {"identifier": "#name", "text": "John Doe"},
"stop_on_error": true
},
{
"type": "type",
"params": {"identifier": "#email", "text": "john@example.com"},
"stop_on_error": true
},
{
"type": "select",
"params": {"identifier": "#country", "value": "United States"},
"stop_on_error": true
},
{
"type": "click",
"params": {"identifier": "#submit"},
"stop_on_error": true
}
]
}'
/help or /help?command=<name> to learn about commands/snapshot after navigation to understand the page/wait before interacting with elements that load asynchronously/batch for multiple sequential operations/type for each field: @e1, @e2, etc./select for dropdowns/wait
/extract with CSS selector and multiple: true
["text", "href", "src"]
@e2
@e3
@e1
wait_visible: true (default) for reliable element interactionlocalhost:8080 with actual API host addressX-BrowserWing-Key header or JWT tokenElement not found:
/snapshot to see available elementsTimeout errors:
/wait with appropriate state before interactionExtraction returns empty:
/wait first)# Discover commands
GET localhost:8080/api/v1/executor/help
# Navigate
POST localhost:8080/api/v1/executor/navigate {"url": "..."}
# Get page structure
GET localhost:8080/api/v1/executor/snapshot
# Click element
POST localhost:8080/api/v1/executor/click {"identifier": "@e1"}
# Type text
POST localhost:8080/api/v1/executor/type {"identifier": "@e3", "text": "..."}
# Extract data
POST localhost:8080/api/v1/executor/extract {"selector": "...", "fields": [...], "multiple": true}
All operations return:
{
"success": true,
"message": "Operation description",
"timestamp": "2026-01-15T10:30:00Z",
"data": {
// Operation-specific data
}
}
Error response:
{
"error": "error.operationFailed",
"detail": "Detailed error message"
}