Browser Use
An open-source Python library that gives AI agents the ability to control a real web browser — navigating pages, filling forms, clicking buttons, and extracting data.
Works with Claude, Gemini, OpenAI, and local models. Self-host for free or use Browser Use Cloud for managed stealth browsers and parallel execution.
What This Skill Does
Core Capabilities
- Web Automation: AI agents that browse the web like a human — click, type, scroll, navigate
- Multi-LLM Support: Works with Claude, GPT-4, Gemini, and local models via a unified interface
- Form Handling: Fill and submit forms, handle auth flows, multi-step checkout
- Research Tasks: Gather information across multiple sites in a single agent run
- Cloud Option: Browser Use Cloud provides stealth browsers and parallel execution at scale
- Custom Tools: Extend agents with your own tool definitions
Installation
# Initialize a new project
uvx browser-use init
# Or add to existing project
uv add browser-use
Quick Start
from browser_use import Agent
from langchain_anthropic import ChatAnthropic
agent = Agent(
task="Find the best laptop deals under $1000 and summarize them",
llm=ChatAnthropic(model="claude-sonnet-4-5"),
)
await agent.run()
Use Cases
- E-commerce research: Compare prices across multiple retailers
- Lead generation: Gather contact information from directories
- Form automation: Fill and submit repetitive web forms
- Content monitoring: Track changes on websites over time
- QA testing: Automate end-to-end user flow testing
- Data extraction: Scrape structured data from any website
Fine-Tuned Model
Browser Use offers a proprietary fine-tuned model optimized for browser automation:
browser-use/bu-30b-a3b-preview— 3–5x faster than general-purpose models with state-of-the-art accuracy on browser tasks- Available via Browser Use Cloud
Browser Use Cloud
For production workloads, Browser Use Cloud provides:
- Stealth browsers that bypass bot detection
- Parallel execution across multiple browser instances
- Managed infrastructure — no setup required
- Pricing: $0.20/M input tokens, $2.00/M output tokens
Persistent Sessions
Use the CLI for browser sessions that persist across agent runs:
browser-use --task "Log into my dashboard and check today's metrics"
Browser Use is MIT licensed. Commercial use is fully permitted for the self-hosted library.