30+ languages · 3 TTS engines · Self-hosted

Give your apps
a voice

AI-powered voice messaging platform. Synthesize speech, clone voices, and translate across languages, then deliver via iOS, Telegram, or the API.

Explore API · Download App

48+ Voices

30+ Languages

47+ API Endpoints

3 TTS Engines

Capabilities

Everything voice,
in one platform

From synthesis to cloning to cross-language translation — a complete voice infrastructure that runs on CPU.

Text-to-Speech

Three-tier TTS: Kokoro for speed, Piper for extended languages, XTTS v2 for voice cloning. All CPU-friendly, fully self-hosted.

Speech-to-Text

Self-hosted Whisper with faster-whisper. Accurate transcription with automatic language detection, running on CPU with int8 quantization.
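A minimal sketch of CPU transcription with the open-source faster-whisper package, assuming it is installed. The model size (`small`) and the helper names `join_segments` and `transcribe` are illustrative assumptions, not details of this platform.

```python
from typing import Iterable, Tuple


def join_segments(segments: Iterable) -> str:
    """Concatenate the .text of each transcription segment into one string."""
    return " ".join(seg.text.strip() for seg in segments).strip()


def transcribe(path: str, model_size: str = "small") -> Tuple[str, str]:
    """Transcribe an audio file on CPU with int8 quantization.

    Returns (detected_language, transcript). The model size is an assumption.
    """
    # Imported lazily so join_segments stays usable without the package.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # No language argument is passed, so the language is detected automatically.
    segments, info = model.transcribe(path)
    return info.language, join_segments(segments)
```

Calling `transcribe("message.wav")` would return the detected language code together with the full transcript.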

Voice Cloning

Clone any voice from as little as 5 seconds of audio via XTTS v2. Fully local, no external API. Use cloned voices across all messaging and synthesis endpoints.

Voice Translation

Full STT → LLM → TTS pipeline. Translate spoken audio or text across 30+ languages with automatic voice selection per target language.
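The three-stage pipeline can be sketched as a chain of pluggable callables. This is an illustration under assumptions: the `VoiceTranslator` class, its field names, and the decision to skip translation when the source and target languages match are hypothetical, not the platform's actual implementation.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class VoiceTranslator:
    """Chains speech-to-text, text translation, and text-to-speech."""

    stt: Callable[[bytes], Tuple[str, str]]    # audio -> (source_lang, text)
    translate: Callable[[str, str, str], str]  # (text, source, target) -> text
    tts: Callable[[str, str], bytes]           # (text, target_lang) -> audio

    def run(self, audio: bytes, target_lang: str) -> bytes:
        source_lang, text = self.stt(audio)
        # Skip the LLM call when the audio is already in the target language.
        if source_lang != target_lang:
            text = self.translate(text, source_lang, target_lang)
        return self.tts(text, target_lang)
```

Each stage can be swapped independently, for example a Whisper wrapper for `stt` and an engine-specific synthesizer for `tts`.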

Voice Messaging

Send synthesized voice messages between users. Cross-channel delivery via iOS push notifications and Telegram. Scheduled messages supported.

Webhooks

HMAC-SHA256 signed webhook delivery for voice events. Auto-deactivation after failures. Full CRUD management via API or MCP.
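A receiver would typically recompute the signature over the raw request body and compare it in constant time. The sketch below assumes a hex-encoded HMAC-SHA256 digest of the exact body bytes; the header that carries the signature and the encoding are not specified here, so treat both as assumptions.

```python
import hashlib
import hmac


def sign_payload(secret: bytes, body: bytes) -> str:
    """Return the hex HMAC-SHA256 of the raw body (encoding is an assumption)."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify_signature(secret: bytes, body: bytes, received: str) -> bool:
    """Check a received signature against one computed locally."""
    expected = sign_payload(secret, body)
    # compare_digest avoids leaking the mismatch position through timing.
    return hmac.compare_digest(expected.encode(), received.encode())
```

Verification should run on the bytes as received, before any JSON parsing, since re-serializing the payload can change the digest.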

TTS Engines

Three tiers of synthesis

Automatic routing picks the right engine for the language and quality level. All standard engines run entirely on CPU.
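Routing of this kind can be sketched as a simple lookup. The language sets below are small hypothetical subsets chosen for illustration, and `pick_engine` is an assumed name; the platform's real tables and fallback rules may differ.

```python
# Hypothetical subsets for illustration; the real language lists are not given here.
KOKORO_LANGS = {"en", "es", "fr", "hi", "it", "ja", "pt", "zh"}
PIPER_LANGS = {"de", "nl", "pl", "ru", "uk"}


def pick_engine(lang: str, cloned_voice: bool = False) -> str:
    """Choose an engine name for a language tag such as "en" or "en-GB"."""
    base = lang.lower().split("-")[0]
    if cloned_voice:
        # Cloned voices go to XTTS v2; its own language check is omitted here.
        return "xtts_v2"
    if base in KOKORO_LANGS:
        return "kokoro"
    if base in PIPER_LANGS:
        return "piper"
    raise ValueError(f"no engine available for language {lang!r}")
```

The default engine is tried first, so the extended engine is only reached for languages the default does not cover.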

Standard

Kokoro

82M-parameter model, Apache 2.0 licensed. The default engine: fast, natural, and completely self-hosted.

Languages: 9
Voices: 30+
Speed: 3–11× realtime
Cost: Free (self-hosted)

Extended

Piper

ONNX-based engine. Auto-fallback for 17 additional languages not covered by Kokoro. Models downloaded lazily.

Languages: 17 additional
Model size: ~65 MB each
Loading: Lazy (on first use)
Cost: Free (self-hosted)

Cloned

XTTS v2

Self-hosted voice cloning. 17 languages, zero-shot from samples as short as 5 seconds. Fully local, no external API.

Languages: 17
Voice cloning: 5–15 s sample
Synthesis: Async (20–60 s on CPU)
Cost: Free (self-hosted)

Platforms

Reach users everywhere

Native iOS app, full-featured Telegram bot, and a REST API for building custom integrations.

iOS App

Native Swift app with push notification sounds. Receive voice messages as custom notification audio. Friends, inbox, and voice selection built in.

App Store

Telegram Bot

Full-featured bot: /voice, /send, /translate, /clonevoice, /ai mode, friend management. Auto-provisions users. Redis rate limiting.

Open in Telegram

REST API

47+ endpoints across 14 categories. JWT, API keys, and magic codes for auth. Interactive Swagger docs. OpenAPI 3.0 spec included.

API Documentation

For Developers

Integrate in minutes

Magic codes for one-liner access. MCP server for AI agents. Full Swagger docs for everything else.

Magic Codes

One-line API calls with no JWT or auth headers needed. Generate an MV-XXXXXX code and start calling endpoints immediately.

    # Synthesize speech in one line
    curl -X POST https://api.magnavoices.com/v1/quick/tts \
      -H "Content-Type: application/json" \
      -d '{"code":"MV-XXXXXX","text":"Hello!","format":"mp3"}' \
      -o voice.mp3
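The same call can be made from Python with only the standard library. This sketch reuses the URL and JSON fields from the curl example; the helper names are hypothetical, and it assumes the response body is the raw audio, as the `-o voice.mp3` flag suggests.

```python
import json
import urllib.request

QUICK_TTS_URL = "https://api.magnavoices.com/v1/quick/tts"


def build_quick_tts_request(code: str, text: str, fmt: str = "mp3") -> urllib.request.Request:
    """Build the POST request; the magic code travels in the JSON body."""
    body = json.dumps({"code": code, "text": text, "format": fmt}).encode("utf-8")
    return urllib.request.Request(
        QUICK_TTS_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize_to_file(code: str, text: str, out_path: str) -> None:
    """Send the request and write the response bytes (assumed audio) to disk."""
    request = build_quick_tts_request(code, text)
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        out.write(response.read())
```

For example, `synthesize_to_file("MV-XXXXXX", "Hello!", "voice.mp3")` mirrors the one-liner above.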

MCP Server

Model Context Protocol server exposing 7 tools — synthesize, transcribe, translate, send_voice, list_voices, clone_voice, manage_webhooks. Works with Claude Code, Cursor, and any MCP client.

    # Add to Claude Code config
    "magna-vox": {
      "command": "node",
      "args": ["mcp-server/index.js"],
      "env": { "MAGNA_VOX_TTS_URL": "..." }
    }

Swagger UI

Interactive API documentation with try-it-out for all 47+ endpoints. Authenticate with JWT, API key, or magic code and test directly in your browser.

Open Swagger Docs

OpenClaw Skill

Ready-made skill for the OpenClaw AI assistant. Exposes synthesize, transcribe, send_voice, list_voices, and clone_voice as AI-callable actions.

    # Install the skill
    cp -r openclaw-skill/magna-vox \
      ~/.openclaw/skills/magna-vox/

Authentication

Three ways to authenticate

Choose the right auth level for your use case — from quick scripts to production integrations.

Quick

Magic Codes

6-character codes for scripts and AI agents. Configurable TTL from 1 hour to 7 days. No auth headers required.

Format: MV-XXXXXX
TTL: 1h – 7d
Best for: Scripts, bots
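Issuing a code with a bounded lifetime might look like the sketch below. The character set (uppercase letters and digits) and the function name are assumptions; only the `MV-` prefix, the six-character length, and the 1 hour to 7 day range come from the description above.

```python
import secrets
import string
from datetime import datetime, timedelta, timezone
from typing import Optional, Tuple

MIN_TTL = timedelta(hours=1)
MAX_TTL = timedelta(days=7)
ALPHABET = string.ascii_uppercase + string.digits  # assumed character set


def issue_magic_code(ttl: timedelta, now: Optional[datetime] = None) -> Tuple[str, datetime]:
    """Return (code, expires_at), rejecting TTLs outside the allowed range."""
    if not MIN_TTL <= ttl <= MAX_TTL:
        raise ValueError("TTL must be between 1 hour and 7 days")
    now = now or datetime.now(timezone.utc)
    # secrets.choice draws from a cryptographically secure random source.
    code = "MV-" + "".join(secrets.choice(ALPHABET) for _ in range(6))
    return code, now + ttl
```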

Persistent

API Keys

Long-lived keys for production integrations. SHA-256 hashed storage. Pass via X-API-Key or Authorization header.

Format: mvk_live_...
Storage: SHA-256 hashed
Best for: Production apps
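Hashed key storage generally means the server keeps only a digest and compares digests at request time. A minimal sketch, assuming an unsalted SHA-256 hex digest and a random URL-safe suffix after the `mvk_live_` prefix; the function names and the suffix length are hypothetical.

```python
import hashlib
import hmac
import secrets
from typing import Tuple

KEY_PREFIX = "mvk_live_"


def hash_key(key: str) -> str:
    """Hex SHA-256 digest of the key; only this value would be stored."""
    return hashlib.sha256(key.encode("utf-8")).hexdigest()


def generate_api_key() -> Tuple[str, str]:
    """Return (plaintext_key, stored_hash). The plaintext is shown only once."""
    key = KEY_PREFIX + secrets.token_urlsafe(32)  # suffix length is an assumption
    return key, hash_key(key)


def verify_api_key(presented: str, stored_hash: str) -> bool:
    """Hash the presented key and compare digests in constant time."""
    return hmac.compare_digest(hash_key(presented).encode(), stored_hash.encode())
```

Because the plaintext key is never stored, a leaked database exposes only digests, and a lost key has to be rotated rather than recovered.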

Full Access

JWT

180-day tokens from Apple or Google OAuth. Full user context, all endpoints. Used by the iOS app.

TTL: 180 days
OAuth: Apple / Google
Best for: Mobile apps
