{"id":"manushi4-screenhand","name":"Screenhand","af_score":62.2,"security_score":24.8,"reliability_score":28.8,"what_it_does":"ScreenHand is an open-source MCP server (stdio transport) that gives AI agents local control over macOS and Windows desktop UIs (via Accessibility APIs) and optionally browser automation (Chrome DevTools Protocol). It exposes a large set of tools for UI inspection, interaction (click/type/keys/scroll/menus/drag), perception/OCR fallbacks, job orchestration, and per-app “app mastery map” learning.","best_when":"You want a local-first AI agent that can directly operate native apps and (optionally) Chrome with low latency, and you can grant the required local OS permissions.","avoid_when":"You cannot safely grant accessibility/automation permissions or you need a networked SaaS API with built-in policy enforcement/audit trails.","last_evaluated":"2026-03-30T15:33:28.540280+00:00","has_mcp":true,"has_api":false,"auth_methods":["None explicitly described for MCP/stdio server usage"],"has_free_tier":false,"known_gotchas":["Requires macOS Accessibility permission for the terminal app to allow UI control.","Browser automation requires launching Chrome with --remote-debugging-port=9222 and a running instance with remote debugging enabled.","Tool calls can have side effects (click/type/JS execution); agents should use confirmation/guardrails for destructive actions.","Cross-app control assumes UI state stability; dynamic layouts may still require fallback strategies (Accessibility→CDP→OCR→coordinates) and careful recovery."],"error_quality":0.0}