Let an AI agent drive your whole browser over Hypha RPC. Tabs, navigation, DOM, screenshots, click/type by index, React, and arbitrary JavaScript on any page.
MV3 · Chrome / Edge 116+ · load unpacked
Open, close, switch, and navigate tabs across windows: list_tabs, open_tab, activate_tab, navigate, history.
Smart indexed DOM, click/type by index, select, scroll, screenshots, query DOM, get HTML, and a React component tree.
execute_script runs arbitrary JavaScript even on strict-CSP pages, via the Chrome debugger (CDP Page.setBypassCSP).
The agent accumulates Markdown Agent Skills bound to each site origin, so it gets smarter and cheaper on every site over time.
Pin the tab the agent works on; keep browsing in other tabs. The pin survives the extension going to sleep.
Hand the agent one URL. GET /get_skill_md returns the full tool reference with curl examples.
Open chrome://extensions and enable Developer mode (top-right).
Click Load unpacked and select the extension folder (the one containing manifest.json).
Click Connect browser in the side panel, copy the Service URL, then:
curl "$SERVICE_URL/get_skill_md?_mode=last" # full API + usage
curl "$SERVICE_URL/list_tabs?_mode=last"
curl -X POST "$SERVICE_URL/open_tab" -H "Content-Type: application/json" -d '{"url":"https://example.com"}'
curl "$SERVICE_URL/get_browser_state?_mode=last" # current target tab
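If you call these endpoints repeatedly, a few small shell helpers can cut down on the typing. The sketch below is illustrative only: the function names tool_url, call_get, and call_post are hypothetical and not part of the extension. It assumes, following the examples above, that GET calls take ?_mode=last and that POST bodies are JSON. Check the output of get_skill_md for the authoritative parameters of each tool.

```shell
# Hypothetical helpers around the Service URL; only the endpoint names
# and the `_mode=last` query parameter come from the examples above.

# Build the URL for a tool call: <service>/<tool>?_mode=last
tool_url() {
  local base="${SERVICE_URL:?set SERVICE_URL to the copied Service URL}"
  base="${base%/}"                      # drop one trailing slash, if present
  printf '%s/%s?_mode=last\n' "$base" "$1"
}

# GET a tool that needs no arguments (e.g. list_tabs, get_browser_state).
call_get() {
  curl -fsS "$(tool_url "$1")"
}

# POST a JSON body to a tool (e.g. open_tab). Assumes the service accepts
# a JSON body, as in the open_tab example.
call_post() {
  local base="${SERVICE_URL:?set SERVICE_URL to the copied Service URL}"
  curl -fsS -X POST "${base%/}/$1" \
    -H 'Content-Type: application/json' \
    -d "$2"
}

# Usage (requires a connected browser):
#   call_get list_tabs
#   call_post open_tab '{"url":"https://example.com"}'
#   call_get get_browser_state
```

The -f flag makes curl exit non-zero on HTTP errors, so a script built on these helpers stops at the first failed call rather than passing an error page on to the next step.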