Stop feeding your agent raw HTML. BrowserControl gives LLMs visual grounding through Set of Marks (SoM): marks overlaid on the page screenshot let the model click exactly what it sees instead of hallucinating selectors. With built-in developer tools, session recording, and persistent logins, it turns web automation from a brittle guessing game into a reliable, human-like capability. Give your agent eyes, not just code.
Server Config
{
  "mcpServers": {
    "browsercontrol": {
      "command": "python",
      "args": [
        "-m",
        "browsercontrol"
      ]
    }
  }
}
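
If your MCP client already has a config file with other servers in it, pasting the block above over it would drop them. The sketch below shows one way to merge the `browsercontrol` entry into an existing file instead. The helper name `add_server` and the file name `mcp_config.json` are assumptions made for illustration; use whatever path your client actually reads.

```python
import json
from pathlib import Path

# The server entry from the config above.
BROWSERCONTROL_ENTRY = {
    "command": "python",
    "args": ["-m", "browsercontrol"],
}


def add_server(config_path: Path, name: str, entry: dict) -> dict:
    """Merge one server entry into a client config file, keeping existing entries.

    Hypothetical helper, not part of BrowserControl.
    """
    config = {}
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    # Create the "mcpServers" section if it is missing, then add or overwrite the entry.
    config.setdefault("mcpServers", {})[name] = entry
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config


if __name__ == "__main__":
    # Assumed path: substitute the config file your MCP client reads.
    add_server(Path("mcp_config.json"), "browsercontrol", BROWSERCONTROL_ENTRY)
```

The entry tells the client to start the server with `python -m browsercontrol`, so the `python` on the client's PATH needs to be the interpreter where the `browsercontrol` module is installed.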