{"id":"wjz-p-gemini-skill","name":"gemini-skill","homepage":null,"repo_url":"https://github.com/WJZ-P/gemini-skill","category":"ai-ml","subcategories":[],"tags":["mcp","browser-automation","cdp","puppeteer","gemini","image-generation","nodejs","stealth"],"what_it_does":"Provides an MCP server that automates Gemini (web) interactions by driving a real browser via CDP/DevTools. It spawns/controls a background browser daemon, exposes MCP tools for chat, image generation, image upload/extraction/download, session navigation, and includes a watermark-removal step for downloaded images.","use_cases":["Agent-driven Gemini chat sessions and model switching","Prompt-to-image generation with full-size downloads","Reference-image uploads and image extraction from conversations","Browser-automation-backed MCP integration for AI agents (e.g., OpenClaw-capable MCP clients)","Automated handling of Gemini conversation history navigation"],"not_for":["Production-grade, fully compliant automation where UI scraping/stealth behavior is disallowed","Use cases requiring a stable, vendor-supported public API for Gemini","Multi-tenant or high-concurrency workloads needing parallel browser instances","Systems requiring strong auditability and deterministic behavior across time"],"best_when":"You need an MCP-compatible way to let an AI agent drive Gemini web for image/chat automation, and you can accept browser automation dependencies (login state, browser updates, UI changes).","avoid_when":"You require rigorous security/compliance guarantees around automated access (e.g., stealth/anti-bot bypass) or you cannot store/manage persistent browser user data for Google login.","alternatives":["Use official Gemini APIs (where available) for chat/image generation instead of UI automation","If MCP is required, wrap any official API behind an MCP server","Use other browser automation MCP skills targeting different LLM/image providers"],"af_score":51.0,"security_score":47.8,"reliability_score":31.2,"package_type":"mcp_server","discovery_source":["github"],"priority":"high","status":"evaluated","version_evaluated":null,"last_evaluated":"2026-03-30T13:27:58.778663+00:00","interface":{"has_rest_api":true,"has_graphql":false,"has_grpc":false,"has_mcp_server":true,"mcp_server_url":null,"has_sdk":false,"sdk_languages":[],"openapi_spec_url":null,"webhooks":false},"auth":{"methods":["Manual Google account login in a persistent browser profile (user data directory)"],"oauth":false,"scopes":false,"notes":"Authentication is effectively delegated to Gemini web login inside the automated browser. There is no documented API-key/OAuth flow for the MCP server itself."},"pricing":{"model":null,"free_tier_exists":false,"free_tier_limits":null,"paid_tiers":[],"requires_credit_card":false,"estimated_workload_costs":null,"notes":"README does not provide pricing; it appears to be self-hosted software relying on Gemini web access."},"requirements":{"requires_signup":true,"requires_credit_card":false,"domain_verification":false,"data_residency":[],"compliance":[],"min_contract":null},"agent_readiness":{"af_score":51.0,"security_score":47.8,"reliability_score":31.2,"mcp_server_quality":80.0,"documentation_accuracy":65.0,"error_message_quality":0.0,"error_message_notes":null,"auth_complexity":35.0,"rate_limit_clarity":5.0,"tls_enforcement":85.0,"auth_strength":30.0,"scope_granularity":10.0,"dependency_hygiene":55.0,"secret_handling":65.0,"security_notes":"Security posture is tied to controlling a real browser session with persistent Google login state (userDataDir). The project documentation does not describe MCP server authentication/authorization, input validation, or CSRF-like protections; anyone who can reach the daemon/MCP process may be able to trigger actions. It also uses stealth/anti-bot techniques, which may create compliance risk depending on your environment. The README mentions watermark removal and downloading images; handling of files/paths and logging of sensitive data is not specified.","uptime_documented":0.0,"version_stability":45.0,"breaking_changes_history":45.0,"error_recovery":35.0,"idempotency_support":"false","idempotency_notes":"Tools include chat/image generation actions that are likely non-idempotent (each call can produce new UI actions and new outputs). No explicit idempotency keys or safe-retry guidance documented.","pagination_style":"none","retry_guidance_documented":false,"known_agent_gotchas":["Requires an interactive/manual Google login on first run; subsequent operations depend on persistent user data directory.","Single CDP port per browser instance—running multiple instances can conflict unless ports/user profiles are isolated.","Image generation may take 60–120 seconds; agent timeouts should be set appropriately (README suggests >=180000ms).","Reliability may degrade if Gemini UI changes; tool execution depends on DOM selectors.","Daemon lifetime is governed by TTL (default 30 minutes) and will release/exit when idle."]}}