2025-12-09 17:51:05 +00:00
|
|
|
|
---
|
2026-01-05 20:10:56 +01:00
|
|
|
|
summary: "WebSocket gateway architecture, components, and client flows"
|
2025-12-09 17:51:05 +00:00
|
|
|
|
read_when:
|
|
|
|
|
|
- Working on gateway protocol, clients, or transports
|
|
|
|
|
|
---
|
2026-01-08 23:06:56 +01:00
|
|
|
|
# Gateway architecture
|
2025-12-09 14:41:41 +01:00
|
|
|
|
|
2026-01-19 08:54:21 +00:00
|
|
|
|
Last updated: 2026-01-19
|
2025-12-09 14:41:41 +01:00
|
|
|
|
|
|
|
|
|
|
## Overview
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|
|
|
|
|
|
- A single long‑lived **Gateway** owns all messaging surfaces (WhatsApp via
|
|
|
|
|
|
Baileys, Telegram via grammY, Slack, Discord, Signal, iMessage, WebChat).
|
2026-01-12 04:44:14 +00:00
|
|
|
|
- Control-plane clients (macOS app, CLI, web UI, automations) connect to the
|
|
|
|
|
|
Gateway over **WebSocket** on the configured bind host (default
|
2026-01-08 23:06:56 +01:00
|
|
|
|
`127.0.0.1:18789`).
|
2026-01-19 08:54:21 +00:00
|
|
|
|
- **Nodes** (macOS/iOS/Android/headless) also connect over **WebSocket**, but
|
|
|
|
|
|
declare `role: node` with explicit caps/commands.
|
2026-01-08 23:06:56 +01:00
|
|
|
|
- One Gateway per host; it is the only place that opens a WhatsApp session.
|
|
|
|
|
|
- A **canvas host** (default `18793`) serves agent‑editable HTML and A2UI.
|
2026-01-05 21:30:19 +01:00
|
|
|
|
|
2025-12-09 14:41:41 +01:00
|
|
|
|
## Components and flows
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|
|
|
|
|
|
### Gateway (daemon)
|
|
|
|
|
|
- Maintains provider connections.
|
|
|
|
|
|
- Exposes a typed WS API (requests, responses, server‑push events).
|
|
|
|
|
|
- Validates inbound frames against JSON Schema.
|
|
|
|
|
|
- Emits events like `agent`, `chat`, `presence`, `health`, `heartbeat`, `cron`.
|
|
|
|
|
|
|
|
|
|
|
|
### Clients (mac app / CLI / web admin)
|
|
|
|
|
|
- One WS connection per client.
|
|
|
|
|
|
- Send requests (`health`, `status`, `send`, `agent`, `system-presence`).
|
|
|
|
|
|
- Subscribe to events (`tick`, `agent`, `presence`, `shutdown`).
|
|
|
|
|
|
|
2026-01-19 08:54:21 +00:00
|
|
|
|
### Nodes (macOS / iOS / Android / headless)
|
|
|
|
|
|
- Connect to the **same WS server** with `role: node`.
|
2026-01-08 23:06:56 +01:00
|
|
|
|
- Pair with the Gateway to receive a token.
|
|
|
|
|
|
- Expose commands like `canvas.*`, `camera.*`, `screen.record`, `location.get`.
|
|
|
|
|
|
|
2026-01-12 04:44:14 +00:00
|
|
|
|
Protocol details:
|
|
|
|
|
|
- [Gateway protocol](/gateway/protocol)
|
|
|
|
|
|
|
2026-01-08 23:06:56 +01:00
|
|
|
|
### WebChat
|
|
|
|
|
|
- Static UI that uses the Gateway WS API for chat history and sends.
|
|
|
|
|
|
- In remote setups, connects through the same SSH/Tailscale tunnel as other
|
|
|
|
|
|
clients.
|
2025-12-09 14:41:41 +01:00
|
|
|
|
|
|
|
|
|
|
## Connection lifecycle (single client)
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|
2025-12-09 14:41:41 +01:00
|
|
|
|
```
|
|
|
|
|
|
Client Gateway
|
|
|
|
|
|
| |
|
2025-12-12 23:29:57 +00:00
|
|
|
|
|---- req:connect -------->|
|
|
|
|
|
|
|<------ res (ok) ---------| (or res error + close)
|
2026-01-08 23:06:56 +01:00
|
|
|
|
| (payload=hello-ok carries snapshot: presence + health)
|
2025-12-09 14:41:41 +01:00
|
|
|
|
| |
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|<------ event:presence ---|
|
|
|
|
|
|
|<------ event:tick -------|
|
2025-12-09 14:41:41 +01:00
|
|
|
|
| |
|
|
|
|
|
|
|------- req:agent ------->|
|
|
|
|
|
|
|<------ res:agent --------| (ack: {runId,status:"accepted"})
|
|
|
|
|
|
|<------ event:agent ------| (streaming)
|
|
|
|
|
|
|<------ res:agent --------| (final: {runId,status,summary})
|
|
|
|
|
|
| |
|
|
|
|
|
|
```
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|
2025-12-09 14:41:41 +01:00
|
|
|
|
## Wire protocol (summary)
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|
2025-12-09 14:41:41 +01:00
|
|
|
|
- Transport: WebSocket, text frames with JSON payloads.
|
2026-01-08 23:06:56 +01:00
|
|
|
|
- First frame **must** be `connect`.
|
|
|
|
|
|
- After handshake:
|
|
|
|
|
|
- Requests: `{type:"req", id, method, params}` → `{type:"res", id, ok, payload|error}`
|
|
|
|
|
|
- Events: `{type:"event", event, payload, seq?, stateVersion?}`
|
|
|
|
|
|
- If `CLAWDBOT_GATEWAY_TOKEN` (or `--token`) is set, `connect.params.auth.token`
|
|
|
|
|
|
must match or the socket closes.
|
|
|
|
|
|
- Idempotency keys are required for side‑effecting methods (`send`, `agent`) to
|
|
|
|
|
|
safely retry; the server keeps a short‑lived dedupe cache.
|
2026-01-19 08:54:21 +00:00
|
|
|
|
- Nodes must include `role: "node"` plus caps/commands/permissions in `connect`.
|
2025-12-09 14:41:41 +01:00
|
|
|
|
|
2026-01-08 23:06:56 +01:00
|
|
|
|
## Protocol typing and codegen
|
|
|
|
|
|
|
|
|
|
|
|
- TypeBox schemas define the protocol.
|
|
|
|
|
|
- JSON Schema is generated from those schemas.
|
|
|
|
|
|
- Swift models are generated from the JSON Schema.
|
2025-12-09 14:41:41 +01:00
|
|
|
|
|
|
|
|
|
|
## Remote access
|
2026-01-08 23:06:56 +01:00
|
|
|
|
|
|
|
|
|
|
- Preferred: Tailscale or VPN.
|
|
|
|
|
|
- Alternative: SSH tunnel
|
|
|
|
|
|
```bash
|
|
|
|
|
|
ssh -N -L 18789:127.0.0.1:18789 user@host
|
|
|
|
|
|
```
|
|
|
|
|
|
- The same handshake + auth token apply over the tunnel.
|
2026-01-19 08:54:21 +00:00
|
|
|
|
- TLS + optional pinning can be enabled for WS in remote setups.
|
2025-12-09 14:41:41 +01:00
|
|
|
|
|
|
|
|
|
|
## Operations snapshot
|
|
|
|
|
|
|
2026-01-08 23:06:56 +01:00
|
|
|
|
- Start: `clawdbot gateway` (foreground, logs to stdout).
|
|
|
|
|
|
- Health: `health` over WS (also included in `hello-ok`).
|
|
|
|
|
|
- Supervision: launchd/systemd for auto‑restart.
|
|
|
|
|
|
|
|
|
|
|
|
## Invariants
|
|
|
|
|
|
|
|
|
|
|
|
- Exactly one Gateway controls a single Baileys session per host.
|
|
|
|
|
|
- Handshake is mandatory; any non‑JSON or non‑connect first frame is a hard close.
|
|
|
|
|
|
- Events are not replayed; clients must refresh on gaps.
|