browser-automation — community browser-automation, marketplace, community, ide skills, Claude Code, Cursor, Windsurf

v1.0.0
GitHub

About this Skill

Ideal for Automation Agents requiring enterprise-grade browser control through the WebDriver protocol. Security-audited skills for Claude, Codex & Claude Code. One-click install, quality verified.

aiskillstore aiskillstore
[0]
[0]
Updated: 2/20/2026

Agent Capability Analysis

The browser-automation skill by aiskillstore is an open-source community AI agent skill for Claude Code and other IDE workflows, helping agents execute tasks with better context, repeatability, and domain-specific guidance.

Ideal Agent Persona

Ideal for Automation Agents requiring enterprise-grade browser control through the WebDriver protocol.

Core Value

Empowers agents to automate web browsers with performance monitoring, multi-session support, and health management using the rust-browser-mcp server and its 45+ MCP tools, including core automation tools like navigation and element interaction.

Capabilities Granted for browser-automation

Automating web application testing with navigation and element interaction tools
Generating performance reports through multi-session support and health management
Debugging web browser issues with core automation tools like click and send_keys

! Prerequisites & Limits

  • Requires rust-browser-mcp server setup
  • WebDriver protocol compatibility needed
  • Limited to enterprise-grade browser control
Labs Demo

Browser Sandbox Environment

⚡️ Ready to unleash?

Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.

Boot Container Sandbox

browser-automation

Install browser-automation, an AI agent skill for AI agent workflows and automation. Works with Claude Code, Cursor, and Windsurf with one-command setup.

SKILL.md
Readonly

Browser Automation Skill

This skill provides guidance for using the rust-browser-mcp server to automate web browsers through the WebDriver protocol. It enables enterprise-grade browser control with performance monitoring, multi-session support, and health management.

Overview

The rust-browser-mcp server provides 45+ MCP tools for browser automation:

Core Automation Tools (25)

  • Navigation: navigate, back, forward, refresh
  • Element Interaction: click, send_keys, hover, find_element, find_elements
  • Information Extraction: get_title, get_text, get_attribute, get_property, get_page_source
  • Advanced: fill_and_submit_form, login_form, scroll_to_element, wait_for_element
  • JavaScript: execute_script
  • Visual: screenshot, resize_window, get_current_url, get_page_load_status

Performance Monitoring Tools (5)

  • get_performance_metrics - Page load times, resource timing, navigation data
  • monitor_memory_usage - Heap monitoring, memory leak detection
  • get_console_logs - Error detection, log filtering
  • run_performance_test - Automated performance analysis
  • monitor_resource_usage - Network, FPS, CPU tracking

Driver Management Tools (7)

  • start_driver, stop_driver, stop_all_drivers
  • list_managed_drivers
  • get_healthy_endpoints, refresh_driver_health
  • force_cleanup_orphaned_processes

Recipe System (4)

  • create_recipe - Create reusable automation workflows
  • execute_recipe - Run a saved recipe
  • list_recipes - List all available recipes
  • delete_recipe - Remove a recipe

Setup Instructions

Prerequisites

Ensure you have at least one WebDriver installed:

  • Chrome: ChromeDriver (must match Chrome version)
  • Firefox: GeckoDriver
  • Edge: MSEdgeDriver

Configuration for Claude Desktop

Add to your claude_desktop_config.json:

json
1{ 2 "mcpServers": { 3 "browser": { 4 "command": "/path/to/rust-browser-mcp", 5 "args": ["--transport", "stdio", "--browser", "chrome"] 6 } 7 } 8}

Environment Variables

VariableDefaultDescription
WEBDRIVER_ENDPOINTautoWebDriver URL or "auto" for auto-discovery
WEBDRIVER_HEADLESStrueRun browsers in headless mode
WEBDRIVER_PREFERRED_DRIVER-Preferred browser: chrome, firefox, edge
WEBDRIVER_CONCURRENT_DRIVERSfirefox,chromeBrowsers to start concurrently
WEBDRIVER_POOL_ENABLEDtrueEnable connection pooling
WEBDRIVER_POOL_MAX_CONNECTIONS3Max connections per driver type

Usage Patterns

Basic Navigation

1. Use `navigate` with URL to load a page
2. Use `wait_for_element` to ensure page loads
3. Use `get_title` or `get_text` to verify content

Form Filling

1. Navigate to the form page
2. Use `find_element` with CSS selector to locate fields
3. Use `send_keys` to input values
4. Use `click` on submit button, or use `fill_and_submit_form` for convenience

Web Scraping

1. Navigate to target page
2. Use `find_elements` to get multiple matching elements
3. Use `get_text` or `get_attribute` to extract data
4. Use `execute_script` for complex DOM traversal

Performance Testing

1. Navigate to page under test
2. Use `run_performance_test` for automated analysis
3. Use `get_performance_metrics` for detailed timing data
4. Use `monitor_memory_usage` to detect leaks
5. Use `get_console_logs` to capture errors

Multi-Step Workflows with Recipes

1. Define a recipe with `create_recipe` including steps array
2. Each step specifies: action (tool name), arguments, optional retry logic
3. Execute with `execute_recipe` and parameters
4. Recipes support conditions and browser-specific variants

Session Management

Browser-Specific Sessions

Use session IDs prefixed with browser name for explicit browser control:

  • chrome_session1 - Uses Chrome
  • firefox_work - Uses Firefox
  • edge_testing - Uses Edge

Multi-Session Support

You can run multiple browser sessions concurrently by using different session IDs:

Session: chrome_user1 -> Opens first Chrome tab
Session: chrome_user2 -> Opens second Chrome tab
Session: firefox_admin -> Opens Firefox for different workflow

Best Practices

Error Handling

  1. Always use wait_for_element before interacting with dynamic content
  2. Check get_page_load_status for slow-loading pages
  3. Use get_console_logs to debug JavaScript errors

Performance

  1. Enable connection pooling (default) for better resource usage
  2. Reuse session IDs when possible
  3. Use headless mode for faster execution

Security

  1. Never store credentials in recipes
  2. Use environment variables for sensitive data
  3. Clear sessions after authentication workflows

Troubleshooting

Driver Not Starting

  • Verify WebDriver is installed and in PATH
  • Check browser version matches driver version
  • Use list_managed_drivers to see status

Element Not Found

  • Use browser DevTools to verify selector
  • Wait for page load with wait_for_element
  • Try different selector strategies (CSS, XPath)

Performance Issues

  • Check monitor_memory_usage for leaks
  • Use get_console_logs for JavaScript errors
  • Consider reducing concurrent sessions

Reference Files

See companion files for detailed information:

  • reference/tools.md - Complete tool documentation
  • reference/recipes.md - Recipe system guide
  • examples/ - Example automation scripts

FAQ & Installation Steps

These questions and steps mirror the structured data on this page for better search understanding.

? Frequently Asked Questions

What is browser-automation?

Ideal for Automation Agents requiring enterprise-grade browser control through the WebDriver protocol. Security-audited skills for Claude, Codex & Claude Code. One-click install, quality verified.

How do I install browser-automation?

Run the command: npx killer-skills add aiskillstore/marketplace/browser-automation. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for browser-automation?

Key use cases include: Automating web application testing with navigation and element interaction tools, Generating performance reports through multi-session support and health management, Debugging web browser issues with core automation tools like click and send_keys.

Which IDEs are compatible with browser-automation?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for browser-automation?

Requires rust-browser-mcp server setup. WebDriver protocol compatibility needed. Limited to enterprise-grade browser control.

How To Install

  1. 1. Open your terminal

    Open the terminal or command line in your project directory.

  2. 2. Run the install command

    Run: npx killer-skills add aiskillstore/marketplace/browser-automation. The CLI will automatically detect your IDE or AI agent and configure the skill.

  3. 3. Start using the skill

    The skill is now active. Your AI agent can use browser-automation immediately in the current project.

Related Skills

Looking for an alternative to browser-automation or another community skill for your workflow? Explore these related open-source skills.

View All

widget-generator

Logo of f
f

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

149.6k
0
AI

flags

Logo of vercel
vercel

flags is a Next.js feature management skill that enables developers to efficiently add or modify framework feature flags, streamlining React application development.

138.4k
0
Browser

zustand

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
AI

data-fetching

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
AI