Skip to content

MCP server — connect any AI client

OculiX ships an MCP server (oculixmcp) that exposes the visual-automation engine — click, type, find, OCR, screenshot — to any Model Context Protocol client. Every action is signed (Ed25519) and written to a hash-chained audit journal, so the server is built for controlled, auditable environments rather than free-form consumer use.

The runnable artifact is oculix-mcp-server.jar. Two transports:

Terminal window
# Default transport — used by Claude Desktop, Cursor, VS Code, Gemini CLI…
java -jar oculix-mcp-server.jar run

Environment variables:

VariableEffect
OCULIX_MCP_TOKENPre-shared client token gating initialize (recommended for any non-loopback bind)
OCULIX_MCP_MODEopen or confidential — confidential mode lands sensitive output in a vault dir
OCULIX_MCP_VAULTPath of the confidential landing directory
OCULIX_MCP_TRUST_TLS_TERMINATIONAcknowledge upstream TLS for non-loopback binds (otherwise plain HTTP on a public interface is refused)

Audit / key management: verify, rotate-key, rotate-session-key, recover.

Almost every MCP client uses one of four config shapes. Pick your client below.

File: ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) · %APPDATA%\Claude\claude_desktop_config.json (Windows)

{
"mcpServers": {
"oculix": {
"command": "java",
"args": ["-jar", "/absolute/path/oculix-mcp-server.jar", "run"]
}
}
}

Restart Claude Desktop. The tools appear behind the MCP indicator.

A note on consumer cloud assistants (ChatGPT, etc.)

Section titled “A note on consumer cloud assistants (ChatGPT, etc.)”

Ten tools in the default (open) mode:

ToolDoes
oculix_find_imageLocate an image on screen
oculix_click_imageFind an image, then click it
oculix_click_at_pointClick at explicit coordinates
oculix_exists_imageTest whether an image is present (no failure thrown)
oculix_wait_for_imageWait for an image to appear
oculix_type_textType a string
oculix_key_comboSend a key combination (e.g. Ctrl+S)
oculix_find_textLocate text on screen via OCR
oculix_screenshotCapture the screen
oculix_read_text_in_regionOCR a region and return the text

Every call is gated (ActionGate), signed with Ed25519, and appended to a SHA-256-chained JSONL journal you can replay and verify with oculix-mcp-server.jar verify.