freshcrate
Home > MCP Servers > mcp-server-macos-use

mcp-server-macos-use

AI agent that controls computer with OS-level tools, MCP compatible, works with any model

Description

AI agent that controls computer with OS-level tools, MCP compatible, works with any model

README

mcp-server-macos-use

Official website: https://macos-use.dev/

Model Context Protocol (MCP) server in Swift. It allows controlling macOS applications by leveraging the accessibility APIs, primarily through the MacosUseSDK.

You can use it in Claude Desktop or other compatible MCP-client.

The server listens for MCP commands over standard input/output (stdio) and exposes several tools to interact with applications.

macos-use-sdk.mp4
macos-use-demo.mp4

Available Tools

The server exposes the following tools via the CallTool MCP method:

  1. macos-use_open_application_and_traverse

    • Description: Opens or activates a specified application and then traverses its accessibility tree.
    • Parameters:
      • identifier (String, Required): The application's name, bundle ID, or file path.
  2. macos-use_click_and_traverse

    • Description: Simulates a mouse click at specific coordinates within the window of the target application (identified by PID) and then traverses its accessibility tree.
    • Parameters:
      • pid (Number, Required): The Process ID (PID) of the target application.
      • x (Number, Required): The X-coordinate for the click (relative to the window/screen, depending on SDK behavior).
      • y (Number, Required): The Y-coordinate for the click.
  3. macos-use_type_and_traverse

    • Description: Simulates typing text into the target application (identified by PID) and then traverses its accessibility tree.
    • Parameters:
      • pid (Number, Required): The Process ID (PID) of the target application.
      • text (String, Required): The text to be typed.
  4. macos-use_press_key_and_traverse

    • Description: Simulates pressing a specific keyboard key (e.g., 'Enter', 'Tab', 'a', 'B') with optional modifier keys held down, targeting the application specified by PID, and then traverses its accessibility tree.
    • Parameters:
      • pid (Number, Required): The Process ID (PID) of the target application.
      • keyName (String, Required): The name of the key (e.g., Return, Escape, ArrowUp, Delete, a, B). Case-sensitive for letters if no modifiers are active.
      • modifierFlags (Array, Optional): An array of modifier keys to hold during the press. Valid values: CapsLock (or Caps), Shift, Control (or Ctrl), Option (or Opt, Alt), Command (or Cmd), Function (or Fn), NumericPad (or Numpad), Help.
  5. macos-use_refresh_traversal

    • Description: Only performs the accessibility tree traversal for the specified application (identified by PID). Useful for getting the current UI state without performing an action.
    • Parameters:
      • pid (Number, Required): The Process ID (PID) of the application to traverse.

Common Optional Parameters (for CallTool)

These can potentially be passed in the arguments object for any tool call to override default MacosUseSDK behavior (refer to ActionOptions in the code):

  • traverseBefore (Boolean, Optional): Traverse accessibility tree before the primary action.
  • traverseAfter (Boolean, Optional): Traverse accessibility tree after the primary action (usually defaults to true).
  • showDiff (Boolean, Optional): Include a diff between traversals (if applicable).
  • onlyVisibleElements (Boolean, Optional): Limit traversal to visible elements.
  • showAnimation (Boolean, Optional): Show visual feedback animation for actions.
  • animationDuration (Number, Optional): Duration of the feedback animation.
  • delayAfterAction (Number, Optional): Add a delay after performing the action.

Dependencies

  • MacosUseSDK (Assumed local or external Swift package providing macOS control functionality)

Building and Running

# Example build command (adjust as needed, use 'debug' for development)
swift build -c debug # Or 'release' for production

# Running the server (it communicates via stdin/stdout)
./.build/debug/mcp-server-macos-use

Integrating with Clients (Example: Claude Desktop)

Once built, you need to tell your client application where to find the server executable. For example, to configure Claude Desktop, you might add the following to its configuration:

{
    "mcpServers": {
        "mcp-server-macos-use": {
            "command": "/path/to/your/project/mcp-server-macos-use/.build/debug/mcp-server-macos-use"
        }
    }
}

Replace /path/to/your/project/ with the actual absolute path to your mcp-server-macos-use directory.

Help

Reach out to matt@mediar.ai Discord: m13v_

Plans

Happy to tailor the server for your needs, feel free to open an issue or reach out

Release History

VersionChangesUrgencyDate
v0.1.15Tag v0.1.15Low3/18/2026
v0.1.0Initial release with demo videoLow3/10/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

stigixStigix is a lab platform to validate SD‑WAN and SASE. ​ It unifies SaaS traffic generation, security tests, convergence probes, and voice/IoT/VyOS simulations in one web UIv1.2.2-patch.81
kibiRepo-local, per-git-branch, queryable knowledge base for LLM Agents.kibi-opencode@0.8.0
claw-pilotMulti-agent orchestration runtime with task board, flow engine, budget control, MCP integration and real-time dashboard. Self-hosted on Linux/macOS.v0.81.1
trace-mcpMCP server for Claude Code and Codex. One tool call replaces ~42 minutes of agent explorationv1.28.0
@baseplate-dev/plugin-aiAI agent integration plugin for Baseplate β€” generates AGENTS.md, CLAUDE.md, .mcp.json, and .agents/ configuration files0.6.8