Pixel‑accurate capture
Screens, windows, menu bar. Optional Retina 2× scaling. Fast enough for feedback loops.
CLI: peekaboo image, peekaboo see
Two ways: native app/CLI via Homebrew, or MCP server via npm. Both expose the same toolset.
peekaboo "Open Safari, go to github.com, and search for Peekaboo"
Tools you can trust: deterministic outputs, typed JSON, and composable automation.
Screens, windows, menu bar. Optional Retina 2× scaling. Fast enough for feedback loops.
CLI: peekaboo image, peekaboo see
Resolve UI elements and act on them: click, type, scroll, drag, hotkeys, menus, Spaces.
CLI: peekaboo click, peekaboo type, peekaboo menu
Natural language that chains tools — plus an MCP server so Claude Desktop/Cursor can drive it.
CLI: peekaboo agent, peekaboo mcp
A tight loop: capture → interpret → act. Repeat until the task is done.
Capture a screen/window and get a structured UI map with stable IDs.
Pick a target: a button label, a menu path, a specific window, a Space.
Click, type, scroll, drag, or run a full agent plan — with receipts you can log.
Start here, then dive deep.