OpenClaw Nodes & Companion App

01 What Is OpenClaw?

The self-hosted AI agent platform that surpassed React on GitHub.

OpenClaw is a free, open-source, self-hosted AI agent platform created by Peter Steinberger (@steipete). It runs on your own machine and connects to messaging platforms — WhatsApp, Telegram, Discord, Slack, Signal, iMessage, Google Chat, and 15+ others — as its primary user interface. Its tagline: "The AI that actually does things."

Unlike SaaS AI products, OpenClaw is infrastructure you own. Users bring their own API keys (Anthropic Claude, OpenAI, Google Gemini, DeepSeek, local models). The software is free; users pay only for LLM API usage.

345K+

GitHub Stars

68K+

Forks

24.5K

Commits

1,243

Contributors

Brief History

Date	Event
Nov 2025	Weekend hack called "WhatsApp Relay" by @steipete
Jan 25, 2026	Public launch as "Clawdbot" — 9,000 stars on day one, 60,000 in 72 hours
Jan 27, 2026	Renamed to "Moltbot" after Anthropic trademark complaint
Jan 30, 2026	Renamed to "OpenClaw" (final name)
Feb 14, 2026	Steinberger announces joining OpenAI; project transitions to foundation governance
Mar 2026	Surpasses React as most-starred project on GitHub. Sponsored by OpenAI, NVIDIA, Vercel

02 What Are Nodes?

The Gateway runs the brain. Nodes run the hands.

Nodes are companion devices — iOS phones, Android phones, macOS menu bar apps, or headless Linux/Windows machines — that connect to the OpenClaw Gateway via WebSocket with role: "node". They function as peripherals, not gateways.

"The Gateway runs the brain. Nodes run the hands."
— OpenClaw Documentation

This is the key insight: nodes give an AI agent physical-world capabilities. Your phone's camera becomes the agent's eyes. Your device's GPS becomes the agent's sense of location. Your computer's shell becomes the agent's ability to execute commands. The AI doesn't just chat — it reaches into the real world through distributed peripheral devices.

Node Types

💻

macOS Companion App

Native Swift menu bar app. Manages Gateway lifecycle, owns TCC permissions, exposes Canvas, Camera, Screen Recording, and shell execution. Supports Voice Wake ("Hey OpenClaw") and Talk Mode.

📱

iOS & Android Apps

Full-featured mobile nodes with camera, Canvas (WebView), GPS, screen recording, SMS (Android), contacts, calendar, notifications, photos, and motion/pedometer sensors.

🖥

Headless Nodes

Linux, Windows, or macOS machines with no GUI. Run as persistent services via openclaw node install. Perfect for build servers, Raspberry Pis, or remote infrastructure.

03 Architecture Deep Dive

Hub-and-spoke topology with a single source of truth.

Hub-and-Spoke Model

The architecture centers on a single long-lived Gateway daemon (WebSocket server at 127.0.0.1:18789). This is the single source of truth for sessions, routing, and channel connections. Everything connects to this one port: channel adapters, CLI tools, web UI, companion apps, and nodes.

                     ┌─────────────────────────────────┐
                     │     OPENCLAW GATEWAY :18789      │
                     │                                  │
                     │  ┌──────────┐  ┌──────────────┐ │
                     │  │ Sessions │  │ Agent Runtime │ │
                     │  └──────────┘  └──────────────┘ │
                     │  ┌──────────┐  ┌──────────────┐ │
                     │  │ Routing  │  │   Memory DB   │ │
                     │  └──────────┘  └──────────────┘ │
                     └───────────┬─────────────────────┘
                                 │ WebSocket
               ┌─────────────┬──┴──┬──────────────┬──────────┐
               ▼             ▼     ▼              ▼          ▼
        ┌────────────┐ ┌──────┐ ┌──────┐  ┌────────────┐ ┌──────┐
        │  Channels  │ │ CLI  │ │ Web  │  │    Nodes   │ │ More │
        │            │ │      │ │  UI  │  │            │ │Nodes │
        │ WhatsApp   │ └──────┘ └──────┘  │ iPhone     │ │      │
        │ Telegram   │                    │ Mac App    │ │ RPi  │
        │ Discord    │                    │ Android    │ │ VPS  │
        │ Slack ...  │                    │ Linux Box  │ │ ...  │
        └────────────┘                    └────────────┘ └──────┘

         Messages IN                      Capabilities OUT
       (where users talk)              (eyes, hands, sensors)

Wire Protocol

All communication uses WebSocket text frames with JSON payloads. The protocol follows a strict handshake:

Node opens WebSocket connection to Gateway
First frame must be a connect message with role: "node" and declared capabilities
Gateway creates a pairing request
User approves via CLI: openclaw devices approve <requestId>
Node marked as "paired" — its command surface is now available to the agent
Agent invokes capabilities via node.invoke RPC calls

Three-Layer Authorization

🔒

Layer 1: Device Pairing

Connectivity gate. Nodes must be explicitly paired and approved by the user before they can communicate with the Gateway.

🛡

Layer 2: Command Policy

Gateway-level RPC allowance. Controls which commands a node is permitted to expose and the agent is permitted to invoke.

🔍

Layer 3: Exec Approvals

Per-node shell command execution gate. Approvals stored in ~/.openclaw/exec-approvals.json. Human-in-the-loop for dangerous operations.

Node Command Surface

Nodes expose capabilities through namespaced commands via the node.invoke protocol:

Namespace	Examples	Platforms
`canvas.*`	present, navigate, eval, snapshot, hide	All
`camera.*`	snap (photo), clip (video up to 60s)	All
`screen.*`	record (MP4, configurable FPS)	All
`location.*`	get (GPS with accuracy, timestamp)	All
`system.*`	run (shell), notify, which	macOS, Headless
`sms.*`	send, search	Android
`contacts.*`	search, add	Android, iOS
`calendar.*`	events, add	Android, iOS
`photos.*`	latest	Android, iOS
`notifications.*`	list, actions	Android
`motion.*`	activity, pedometer	Android, iOS
`device.*`	status, info, permissions, health	Android

Latency Budget

The team defines an explicit performance envelope for every request:

Access Control     ~10ms
Session Load       ~50ms
Prompt Assembly    ~100ms
First Token (LLM)  200-500ms
Tool Execution     100ms (bash) to 1-3s (browser automation)
─────────────────────────────
Total              ~500ms - 4s typical end-to-end

04 Design Rationale

Why it was built this way — the key architectural decisions and their motivations.

1. Nodes as Peripherals, Not Gateways

This is the most fundamental design decision. By separating the interface layer (where messages come from — Telegram, WhatsApp, etc.) from the capability layer (where actions happen — cameras, shells, sensors), OpenClaw achieves a clean separation of concerns. One persistent assistant is accessible through any messaging app, with conversation state managed centrally. Your phone is a sensor array, not a brain.

2. Single Gateway Per Host

Only one Gateway runs per machine. This prevents protocol conflicts (e.g., WhatsApp's single-device requirement), centralizes state management, and eliminates distributed consensus complexity. The trade-off — no horizontal scaling — is acceptable because this is personal infrastructure, not enterprise SaaS.

3. Loopback-Default Networking

The Gateway binds to 127.0.0.1 by default. Zero network exposure out of the box. Remote access requires explicit opt-in through SSH tunnels or Tailscale Serve/Funnel. This is security through architecture, not configuration.

# Remote access requires explicit setup
ssh -N -L 18790:127.0.0.1:18789 user@gateway-host
openclaw node run --host 127.0.0.1 --port 18790 --display-name "Remote Build Node"

4. Human-in-the-Loop Pairing

Device-based pairing on top of token auth prevents credential reuse and enables device revocation. Unlike API key auth alone, a compromised token can't impersonate a different device. Every new device requires explicit human approval.

5. Event-Driven, Not Poll-Based

Clients subscribe to event streams (agent, presence, health, tick) rather than polling. This reduces latency, eliminates wasted bandwidth, and scales naturally with the number of connected nodes.

6. Files Over Database

Session state is stored as persistent files rather than in a database. This makes storage git-friendly, supports easy backups, enables branching (experiment with conversation forks), and provides full transparency. You can cat your assistant's memory.

7. Canvas Isolation

The Canvas (WebView) runs in a separate process from the Gateway. A crashed Canvas can't take down the core assistant. Different security boundaries prevent malicious web content from accessing agent internals.

05 Security Landscape

With great power comes great attack surface.

OpenClaw's power — giving AI agents access to cameras, file systems, shell execution, and messaging — creates an inherently large attack surface. The project has faced significant security challenges:

⚠

Cisco Research

Found 9 security issues (2 critical, 5 high severity) including active data exfiltration via malicious third-party skills and direct prompt injection bypasses.

🐛

CVE-2026-25253

CVSS 8.8 — Incorrect Resource Transfer vulnerability. Discovered by Mav Levin, patched in version 2026.1.29.

💀

ClawHavoc

824+ confirmed malicious skills on ClawHub (out of 2,857 total). A supply-chain attack that compromised the skill marketplace ecosystem.

🌍

Public Exposure

21,000+ publicly exposed instances detected by Censys in January 2026, up from ~1,000 in less than a week. Many without proper authentication.

Built-in Mitigations

The OpenClaw team has responded with multiple layers of defense:

✅ Docker sandboxing for skill execution
✅ Exec approvals and allowlists per node
✅ Device pairing with human-in-the-loop approval
✅ Loopback-default networking (no public exposure by default)
✅ Environment variable stripping (dangerous keys like DYLD_*, LD_* removed)
✅ openclaw doctor diagnostic tool
✅ DM pairing and allowlists for messaging channels

"If you can't understand how to run a command line, this is far too dangerous."
— OpenClaw maintainer "Shadow"

06 Real-World Usage

How people are actually using nodes in production.

📧

Email & Inbox

AI triages email, processes spam, drafts responses, and manages inbox zero — all triggered via Telegram commands.

🏠

Smart Home

Controls HomePods, Alexa, and Homey hubs through macOS system commands. "Hey OpenClaw, dim the lights and play jazz."

📅

Calendar & Meetings

Reads calendars via mobile nodes, generates meeting prep briefings, optimizes scheduling across time zones.

💰

Financial Automation

Grocery ordering, insurance claim filing, invoice generation — with Canvas for visual review of documents before submission.

🏋

Health Tracking

Pulls data from Garmin and WHOOP via phone nodes. Summarizes activity, sleep, and recovery metrics on demand.

🤖

Multi-Agent Orchestration

Power users run 10+ coordinated agents with dedicated nodes for code review, CI/CD, monitoring, and deployment.

"It's running my company." — @therno

"First time I've felt like I am living in the future since ChatGPT." — @davemorin

"It genuinely feels like early AGI." — @tobi_bsf

07 New Product Ideas

Seven ventures inspired by the "distributed AI peripherals" paradigm.

The OpenClaw nodes architecture reveals a powerful insight: AI becomes dramatically more useful when it can sense and act in the physical world through distributed peripheral devices. Here are seven product ideas that extend this concept into new markets.

Enterprise

1. NodeMesh — Enterprise Physical AI Fabric

"Deploy AI hands across your entire organization."

An enterprise-grade version of OpenClaw nodes designed for organizations, not individuals. Centrally managed fleets of nodes across offices, factories, and remote sites — with RBAC, audit logging, SOC2 compliance, and MDM integration.

Key Differentiators

Fleet management dashboard. Node health monitoring across sites. Role-based capability exposure (accounting gets calendar.*, not system.run). Compliance audit trail for every node.invoke call. Integration with Okta/Azure AD for identity. Centralized exec-approval policies pushed from IT, not per-device.

Revenue Model

Per-node-per-month SaaS pricing ($5-20/node/month). Enterprise tier with dedicated support, SLAs, and on-prem Gateway hosting. Professional services for deployment.

Why Now

OpenClaw proved the peripheral-node model works. Enterprises want AI agents but can't deploy MIT-licensed software with no support, no compliance, and 824 malicious skills on ClawHub. The gap between "it works on my Mac" and "it works for 500 employees" is exactly where enterprise products thrive.

Hardware

2. SenseNode — Purpose-Built AI Peripheral Hardware

"A $49 device that gives any AI agent eyes, ears, and hands."

A small, dedicated hardware device (think Raspberry Pi form factor) pre-loaded with OpenClaw node software and equipped with a camera, microphone, speaker, GPIO pins, and optional sensors (temperature, humidity, air quality, motion). Plug it in, pair it to your Gateway, and your AI can see, hear, and control things in that room.

Key Differentiators

Zero-config setup: power on, scan QR code, paired. Always-on (no phone battery drain). Mounts anywhere: wall, desk, door frame. GPIO for controlling relays, motors, lights. Optional PoE for single-cable deployment. Tamper-evident enclosure for security-conscious deployments.

Use Cases

Home: Baby monitor that answers "is the baby sleeping?" via Telegram. Pet feeder triggered by AI schedule. Package detection at front door.
Business: Meeting room occupancy. Inventory monitoring (point camera at shelf, ask "how many boxes left?"). Equipment status monitoring via camera + GPIO sensors.

Revenue Model

Hardware margin ($49 device, ~60% margin at scale). Optional cloud Gateway hosting subscription ($5/month). Sensor expansion packs ($15-25).

Platform

3. NodeMarket — Shared AI Capability Marketplace

"Rent out your node's capabilities. Buy access to nodes you don't have."

A peer-to-peer marketplace where node operators can offer their device capabilities to others. Don't have an Android phone? Rent SMS-sending capability from someone who does. Need a node in Tokyo for location-specific tasks? Connect to one. Think Airbnb, but for AI peripheral access.

Key Differentiators

Reputation system for node operators. Encrypted capability proxying (the provider never sees your data, only relays the command). Geographic node discovery ("I need a camera in Berlin"). Time-sliced access (rent a node for 10 minutes, not a month). Escrow-based payments.

Example Scenarios

A researcher rents access to 50 nodes globally to capture synchronized weather photos. A small business rents an Android SMS node for appointment reminders instead of buying Twilio. A traveler rents a node at their destination to check real-time conditions via camera.

Revenue Model

15% marketplace commission on all transactions. Premium listings for commercial node operators. Enterprise API access for programmatic node discovery and provisioning.

Developer Tools

4. NodeKit — Universal AI Peripheral SDK

"Turn any device into an AI-controllable peripheral in 10 lines of code."

An open-source SDK that implements the node.invoke protocol for any platform: Arduino, ESP32, Raspberry Pi, browser extensions, desktop widgets, smart TVs, cars, and industrial PLCs. If it has a network connection, it can become an AI node.

Key Differentiators

Language-agnostic wire protocol implementation (C, Python, Rust, JavaScript, Go). Auto-discovery via mDNS. Capability declaration DSL. Built-in security (TLS, device attestation). Simulator mode for development without physical hardware.

// ESP32 example — make a temperature sensor an AI node
#include <nodekit.h>

NodeKit node("temp-sensor-lab", GATEWAY_HOST, 18789);

node.capability("sensors.temperature", []() {
    return json({ "celsius": readDHT22(), "timestamp": millis() });
});

node.capability("relay.toggle", [](json params) {
    digitalWrite(RELAY_PIN, params["state"] ? HIGH : LOW);
    return json({ "ok": true });
});

node.connect();

Revenue Model

Open-source core (MIT). Commercial license for automotive/medical/industrial use with certified builds and long-term support ($500-5000/year). Managed fleet orchestration SaaS. Hardware partner program (certified "NodeKit Ready" devices).

Consumer

5. GuardianNode — AI-Powered Elderly Care Network

"Your parent's home becomes an intelligent care network."

A consumer product that deploys multiple SenseNode-style devices throughout an elderly person's home. The AI agent monitors daily routines via motion sensors and cameras, detects anomalies (didn't take medication, unusual inactivity, fall detection), and communicates with family caregivers via their preferred messaging app.

Key Differentiators

Non-wearable — no compliance burden on the elderly person. Privacy-first — raw camera feeds never leave the home; only AI-generated summaries are shared. Natural language interaction — the elderly person talks to their home ("What's for lunch today?") via a speaker node. Medication tracking via camera on pill organizer. Family dashboard showing activity patterns and alerts.

Revenue Model

Hardware kit ($199 for 3-node starter pack). Monthly monitoring subscription ($29/month). Premium tier with 24/7 human escalation ($79/month). Insurance partnerships for subsidized deployment.

Security

6. AuditClaw — AI Agent Security & Compliance Platform

"Know exactly what your AI agents are doing, in real time."

A security-focused platform that sits between OpenClaw Gateways and nodes, providing real-time monitoring, anomaly detection, and compliance enforcement for AI agent actions. Born from the hard lessons of ClawHavoc and CVE-2026-25253.

Key Differentiators

Real-time node.invoke traffic analysis. Behavioral anomaly detection (agent suddenly requesting camera access at 3 AM?). Automated skill scanning (catch malicious skills before ClawHub). Policy-as-code enforcement (define what agents can and cannot do in YAML). SOC2/HIPAA compliance reporting. Kill switch — instantly revoke all node access across an organization.

Why This Is Critical

21,000+ exposed instances. 824+ malicious skills. CVE-8.8. The Chinese government banned it from state agencies. As AI agents gain physical-world capabilities, the security stakes become existential. Every system.run is a potential breach. Every camera.snap is a potential privacy violation. Organizations deploying AI agents need a security layer purpose-built for this paradigm.

Revenue Model

Free tier for individual developers (up to 3 nodes). Team tier $49/month (up to 20 nodes). Enterprise tier: custom pricing with dedicated threat analysts and incident response.

Research / Frontier

7. HiveMinds — Multi-Agent Swarm Intelligence Platform

"When one agent isn't enough, deploy a swarm."

A platform that orchestrates multiple AI agents, each with their own Gateway and node fleet, working together on complex goals. Inspired by the multi-agent orchestration use cases (10+ coordinated agents) already emerging in the OpenClaw community. HiveMinds provides the coordination layer: shared memory, task delegation, conflict resolution, and collective learning.

Architecture

Each agent runs its own Gateway (preserving the single-Gateway-per-host principle). A Hive Controller sits above, routing high-level goals to specialized agents. Agent A handles code review (headless node with IDE access). Agent B manages customer communication (mobile nodes with messaging). Agent C monitors infrastructure (nodes on servers). They share a vector memory layer and coordinate via structured message passing.

Key Differentiators

Visual swarm designer (drag-and-drop agent topologies). Inter-agent communication protocol with priority and conflict resolution. Shared memory with access control (Agent A can read Agent B's findings but not modify them). Emergent behavior monitoring (detect when agents collectively pursue unintended goals). Cost optimization (route sub-tasks to the cheapest capable agent).

Revenue Model

Open-source Hive Controller. Managed cloud orchestration ($99-499/month based on agent count). Enterprise consulting for custom swarm architectures. Training and certification programs.

08 Competitive Landscape

How OpenClaw compares and where the gaps remain.

Project	Strength	Weakness vs OpenClaw
Claude Code	Superior for pure coding tasks	No physical-world node system
NanoClaw	Security-first design	Smaller community, fewer integrations
Nanobot	4,000 lines of Python simplicity	No companion apps or node ecosystem
Knolli.ai	Enterprise no-code workflows	Closed-source, no self-hosting
memU	Superior local knowledge graph	Memory-focused; no peripheral access

The key insight: no competitor has replicated OpenClaw's node ecosystem. The technical challenge of building reliable WebSocket peripheral connections across iOS, Android, macOS, and headless Linux — with proper security, pairing, and capability negotiation — is a moat that's wider than it appears.

09 Conclusion

The bigger picture.

OpenClaw's node architecture represents a paradigm shift in how we think about AI agents. The insight — separating the "brain" (Gateway) from the "hands" (nodes) — is elegant and powerful. It transforms every connected device into an extension of AI capability, creating a personal AI that doesn't just chat but perceives and acts in the physical world.

The design decisions are principled: hub-and-spoke topology for simplicity, loopback-default for security, files over databases for transparency, human-in-the-loop pairing for trust. These aren't just engineering choices; they're philosophical statements about how AI should integrate into human life — as a tool you control, running on infrastructure you own, with capabilities you explicitly approve.

The security challenges are real and sobering. But they're the growing pains of a genuinely new category. The product ideas above — from enterprise fleet management to elderly care networks to multi-agent swarms — all build on the same core insight: AI becomes transformatively useful when it can reach beyond the chat window into the real world.

The Gateway runs the brain. Nodes run the hands. And the future belongs to whoever builds the best hands.

OpenClaw Documentation

Main Docs: docs.openclaw.ai • Nodes: docs.openclaw.ai/nodes • Architecture: docs.openclaw.ai/architecture

OpenClaw Nodes &The Companion App

01 What Is OpenClaw?

Brief History

02 What Are Nodes?

Node Types

macOS Companion App

iOS & Android Apps

Headless Nodes

03 Architecture Deep Dive

Hub-and-Spoke Model

Wire Protocol

Three-Layer Authorization

Layer 1: Device Pairing

Layer 2: Command Policy

Layer 3: Exec Approvals

Node Command Surface

Latency Budget

04 Design Rationale

1. Nodes as Peripherals, Not Gateways

2. Single Gateway Per Host

3. Loopback-Default Networking

4. Human-in-the-Loop Pairing

5. Event-Driven, Not Poll-Based

6. Files Over Database

7. Canvas Isolation

05 Security Landscape

Cisco Research

CVE-2026-25253

ClawHavoc

Public Exposure

Built-in Mitigations

06 Real-World Usage

Email & Inbox

Smart Home

Calendar & Meetings

Financial Automation

Health Tracking

Multi-Agent Orchestration

07 New Product Ideas

1. NodeMesh — Enterprise Physical AI Fabric

Key Differentiators

Revenue Model

Why Now

2. SenseNode — Purpose-Built AI Peripheral Hardware

Key Differentiators

Use Cases

Revenue Model

3. NodeMarket — Shared AI Capability Marketplace

Key Differentiators

Example Scenarios

Revenue Model

4. NodeKit — Universal AI Peripheral SDK

Key Differentiators

Revenue Model

5. GuardianNode — AI-Powered Elderly Care Network

Key Differentiators

Revenue Model

6. AuditClaw — AI Agent Security & Compliance Platform

Key Differentiators

Why This Is Critical

Revenue Model

7. HiveMinds — Multi-Agent Swarm Intelligence Platform

Architecture

Key Differentiators

Revenue Model

08 Competitive Landscape

09 Conclusion

OpenClaw Nodes &
The Companion App