OmniAgent¶
Your AI representative across communication channels.
OmniAgent is a personal AI assistant that routes messages across multiple communication platforms, processes them via an AI agent, and responds on your behalf.
Features¶
- 💬 Multi-Channel Support - Telegram, Discord, Slack, WhatsApp, and more
- 🤖 AI-Powered Responses - Powered by omnillm (Claude, GPT, Gemini, etc.)
- 🎤 Voice Notes - Transcribe incoming voice, respond with synthesized speech via OmniVoice
- 🧩 Skills System - Extensible skills compatible with OpenClaw/ClawHub
- 🔒 Secure Sandboxing - WASM and Docker isolation for tool execution
- 🌐 Browser Automation - Built-in browser control via Rod
- 🔌 WebSocket Gateway - Real-time control plane for device connections
- 📊 Observability - Integrated tracing via omniobserve
Quick Start¶
Installation¶
WhatsApp + OpenAI¶
The fastest way to get started:
# Set your OpenAI API key
export OPENAI_API_KEY="sk-..."
# Run with WhatsApp enabled
OMNIAGENT_AGENT_PROVIDER=openai \
OMNIAGENT_AGENT_MODEL=gpt-4o \
WHATSAPP_ENABLED=true \
omniagent gateway run
A QR code will appear in your terminal. Scan it with WhatsApp to connect.
Architecture¶
+-------------------------------------------------------------+
| Messaging Channels |
| Telegram | Discord | Slack | WhatsApp | ... |
+---------------------------+---------------------------------+
|
+---------------------------v---------------------------------+
| Gateway (WebSocket Control Plane) |
| ws://127.0.0.1:18789 |
+---------------------------+---------------------------------+
|
+---------------------------v---------------------------------+
| Agent Runtime |
| +------------------+ +------------------+ |
| | Skills | | Sandbox | |
| | (SKILL.md) | | (WASM/Docker) | |
| +------------------+ +------------------+ |
| - omnillm (LLM providers) |
| - omnivoice (STT/TTS) |
| - omniobserve (tracing) |
| - Tools (browser, shell, http) |
+-------------------------------------------------------------+
Dependencies¶
| Package | Purpose |
|---|---|
| omnichat | Unified messaging (WhatsApp, Telegram, Discord) |
| omnillm | Multi-provider LLM abstraction |
| omnivoice | Voice STT/TTS interfaces |
| omniobserve | LLM observability |
| wazero | WASM runtime for sandboxing |
| moby | Docker SDK for container isolation |
| Rod | Browser automation |
License¶
MIT License - see LICENSE for details.