kwin-mcp

kwin-mcp is an MCP (Model Context Protocol) server for Linux KDE Plasma 6 on Wayland that enables AI agents to start isolated virtual KWin sessions (or connect to live sessions) and automate desktop GUI interactions. It provides MCP tools for session management, UI observation via AT-SPI2 accessibility trees and screenshots, and action injection via KWin’s Emulated Input Server (EIS)/libei, covering mouse, keyboard, touch, clipboard, window management, and some advanced D-Bus/diagnostic utilities.

Evaluated Mar 30, 2026 (66d ago)

Homepage ↗ Repo ↗ Automation mcp model-context-protocol desktop-automation gui-automation wayland kde kwin at-spi2 accessibility libei e2e-testing headless-testing python

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Local automation tool with powerful input injection capabilities. README does not document authentication/authorization controls for MCP access; security largely relies on local process execution permissions and session isolation choices. It claims isolation via dbus-run-session + virtual KWin and optional home isolation (isolate_home), which is a positive security model, but there is no explicit discussion of permission boundaries, audit logging, or safe handling of sensitive UI/clipboard data. Dependencies listed include mcp, PyGObject, dbus-python, Pillow; no vulnerability/CVE posture is provided.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

You control the host environment and want an agent to run GUI automation inside isolated KWin Wayland sessions on KDE Plasma 6, using accessibility trees for robust element targeting.

Avoid When

You need a hardened, auth-gated service exposed to untrusted networks/clients, or you cannot run the required KDE/Wayland/EIS/AT-SPI2 components.

Use Cases

• End-to-end GUI testing for Wayland apps on KDE Plasma in isolated virtual sessions
• AI-driven desktop automation using structured accessibility trees
• Live desktop collaboration/automation by attaching to an existing KWin session (including containerized ones)
• Headless-ish CI/CD style desktop testing using virtual KWin compositors
• Kiosk/embedded device UI automation on KDE Plasma/Wayland

Not For

• Secure remote multi-tenant deployments without additional network/isolation controls
• Environments that require strong user-consent prompts for input injection
• Non-KDE/Wayland systems (or non-Plasma 6 Wayland setups) where KWin EIS/libei/AT-SPI2 integration may not apply
• Use as a general-purpose web/API service over HTTPS (it is a local MCP stdio server)

Interface

REST API

GraphQL

gRPC

MCP Server

Yes

SDK

Webhooks

Authentication

Methods: Local stdio MCP transport (client launches/execs the server via command/args)

OAuth: No Scopes: No

The README emphasizes 'zero authorization prompts' by using private KWin/EIS D-Bus interfaces, but it does not describe any authentication/authorization mechanism for controlling the MCP server itself. In practice, auth likely depends on who can execute/use the local process and access the user session/DBus.

Pricing

Free tier: No

Requires CC: No

Open-source MIT package; no pricing model indicated.

Agent Metadata

Pagination

none

Idempotent

False

Retry Guidance

Not documented

Known Gotchas

⚠ Input injection targets are best derived from accessibility_tree/find_ui_elements; coordinate-based actions may be brittle across window layouts/resolutions.
⚠ Session lifecycle matters: session_stop behavior differs between virtual (terminates KWin/apps) and live (disconnects only).
⚠ Clipboard tools require enable_clipboard=true in session_start (and wl-clipboard installed).
⚠ keyboard_type_unicode may depend on wtype or clipboard fallback utilities being installed.

Alternatives

SikuliX (vision-based, less robust for accessibility-first agents) Playwright (web-focused, not general Wayland desktop GUI) Robot Framework + Sikuli/remote desktop approaches VNC/remote desktop automation tools (less integrated with Wayland accessibility trees) Other MCP servers for UI automation (if available for your desktop stack)

Full Evaluation Report

Comprehensive deep-dive: security analysis, reliability audit, agent experience review, cost modeling, competitive positioning, and improvement roadmap for kwin-mcp.

AI-powered analysis · PDF + markdown · Delivered within 30 minutes

$99

Package Brief

Quick verdict, integration guide, cost projections, gotchas with workarounds, and alternatives comparison.

Delivered within 10 minutes

Score Monitoring

Get alerted when this package's AF, security, or reliability scores change significantly. Stay ahead of regressions.

Continuous monitoring

$3/mo

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-03-30.