
AI agents control real browsers via token-efficient ASCII wireframes
Visit WebsitePros
Cons
Free
No reviews yet. Be the first to review Agent Browser!
Agent Browser provides AI agents with ASCII wireframes of web pages. These wireframes present interactive elements with numbered references, allowing the AI to interact with the page using a minimal number of tokens.
Agent Browser can be integrated in two main ways: by adding it as a server to MCP clients like Cursor or Claude Desktop, or by utilizing the Vercel AI SDK. Both methods allow AI agents to control a real browser through the provided tools.
AI agents can perform a comprehensive set of browser actions including launching, navigating, getting wireframes, clicking, typing, filling forms, double-clicking, hovering, pressing keys, selecting elements, checking/unchecking boxes, scrolling, taking screenshots, and closing the browser.
Yes, Agent Browser provides an interactive command-line interface (CLI) for manual testing. Users can launch it directly using `npx @agent-browser-io/browser` or `agent-browser-cli` after installation.
Agent Browser leverages a PlaywrightBrowserBackend for its underlying browser operations. This backend enables the AI agent to control a real browser and perform actions like navigation and interaction.
The AI agent receives an ASCII wireframe representation of the web page. This wireframe visually depicts the page's layout and numbers all interactive elements, allowing the agent to precisely identify and reference them for interaction.
Source: agent-browser.io