freshcrate
Skin:/
Home > MCP Servers > vexa

vexa

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

README

Vexa Logo

Vexa

Open-source meeting bot API & transcription API

meeting bots • real-time transcription • interactive bots • MCP server • self-hosted

Google Meet Google Meet     •     Microsoft Teams Microsoft Teams     •     Zoom Zoom

StarsLicenseDiscord What's newQuickstartAPIDocsRoadmapDiscord


Vexa is an open-source, self-hostable meeting bot API and meeting transcription API for Google Meet, Microsoft Teams, and Zoom. Alternative to Recall.ai, Otter.ai, and Fireflies.ai — self-host so meeting data never leaves your infrastructure, or use vexa.ai hosted.


Data sovereignty — self-host so meeting data never leaves your infrastructure

Cost — replace $20/seat SaaS with your own infrastructure

Embed in your product — multi-tenant meeting bot API with scoped tokens

AI agents — MCP server with 17 tools


Capabilities

Meeting bot API Send a bot to any meeting: auto-join, record, speak, chat, share screen. Open-source alternative to Recall.ai.
Meeting transcription API Real-time transcripts via REST API and WebSocket. Self-hosted alternative to Otter.ai and Fireflies.ai.
Real-time transcription Sub-second per-speaker transcripts during the call. 100+ languages via Whisper. WebSocket streaming.
Interactive bots Make bots speak, send/read chat, share screen content, and set avatar in live meetings.
Browser bots CDP + Playwright browser automation with persistent authenticated sessions via S3.
MCP server 17 meeting tools for Claude, Cursor, Windsurf. AI agents join calls, read transcripts, speak in meetings.
Multi-tenant Users, scoped API tokens, isolated containers. Deploy once, serve your team.
Dashboard Open-source Next.js web UI — meetings, transcripts, agent chat, browser sessions. Ready to use out of the box.
Self-hostable Run on your infra. Meeting data never leaves your infrastructure.

Every feature is a separate service. Pick what you need, skip what you don't. Self-host everything or use vexa.ai hosted.


Why Self-Host Meeting Transcription?

For regulated industries — banks, financial services, healthcare — meeting data can't leave your infrastructure. Self-hosting Vexa means zero external data transmission and full audit trail on your own infrastructure.

For cost-conscious teams — replace per-seat SaaS pricing. A team paying $17/seat/mo for meeting transcription can self-host Vexa and drop that to infrastructure cost.

For developers — embed a meeting bot API in your product. Multi-tenant, scoped API tokens, no per-user infrastructure.

Build meeting assistants like Otter.ai, Fireflies.ai, or Fathom — or build a meeting bot API like Recall.ai — self-hosted on your infrastructure.

  • Vexa (self-host) — your infra cost. Data never leaves your infrastructure. Meeting bot API, real-time transcription, interactive bots, MCP server. Open source, Apache 2.0. Google Meet, Teams, Zoom*.
  • Recall.ai — $0.50/hr. No self-hosting. Meeting bot API, real-time transcription. No MCP, limited interactive bots. Closed source. Meet, Teams, Zoom, Webex.
  • Otter.ai — $17-20/seat/mo. No self-hosting. No API. Limited real-time transcription. Closed source. Meet, Teams, Zoom.

* Zoom support is experimental

Or use vexa.ai hosted — get an API key and start sending bots immediately, no infrastructure needed.

Built for Data Sovereignty

Meeting data never leaves your infrastructure. Self-host for complete control. Modular architecture scales from edge devices to millions of users.

1. Hosted service At vexa.ai — get an API key and start sending bots. No infrastructure needed. Ready to integrate


2. Self-host with Vexa transcription Run Vexa yourself, use vexa.ai for transcription — ready to go, no GPU needed. Control with minimal DevOps — see deploy/ for setup guides.


3. Fully self-host Run everything including your own GPU transcription service. Meeting data never leaves your infrastructure — see deploy/ for setup guides.

What's new

v0.10 — full architecture refactor

  • Services refactored — runtime-api as infrastructure layer (container orchestration), meeting-api as data layer, agent-api as high-level intelligence layer. Clean separation of concerns.
  • Real-time pipeline moved into bots — transcription pipeline now runs inside bot containers, eliminating external dependencies
  • Agent API (experimental) — ephemeral containers for AI agents. See services/agent-api/.
  • Helm/K8s — production Kubernetes deployment with built images and global.imageTag support

v0.9

  • Zoom (experimental) — initial Zoom Meeting SDK support
  • Interactive Bots API — speak, chat, screen share, avatar controls during live meetings
  • MCP server — 17 tools for AI agents
  • Recordings — S3-compatible storage

See full release notes: https://github.com/Vexa-ai/vexa/releases


Quickstart

Self-host with Docker

On a fresh Linux machine (Ubuntu 24.04):

apt-get update && apt-get install -y make git curl
curl -fsSL https://get.docker.com | sh
git clone https://github.com/Vexa-ai/vexa.git && cd vexa

Then choose:

Command What you get Best for
make lite Single container, all services Quick evaluation, small teams
make all Full stack, each service separate Development, production

Both prompt for a transcription token on first run. Get one at vexa.ai/account, or self-host transcription with a GPU.

Guides: Vexa Lite | Docker Compose | Helm (K8s)

Hosted (no deployment needed)

Get your API key at vexa.ai/account and start sending bots immediately.

Meeting API — Send Bots, Get Transcripts

Send a bot, get real-time transcripts with per-speaker audio and interactive controls (speak, chat, share screen).

# Send a bot to Google Meet
curl -X POST "$API_BASE/bots" \
  -H "Content-Type: application/json" \
  -H "X-API-Key: <API_KEY>" \
  -d '{"platform": "google_meet", "native_meeting_id": "abc-defg-hij"}'

# Get transcripts
curl -H "X-API-Key: <API_KEY>" \
  "$API_BASE/transcripts/google_meet/abc-defg-hij"

Works with Google Meet, Microsoft Teams, and Zoom. Set API_BASE to https://api.cloud.vexa.ai (hosted) or http://localhost:8056 (self-hosted).

For real-time WebSocket streaming, see the WebSocket guide. For full REST details, see the User API Guide.


Browser Bots — Persistent Browser Containers for Agents

Remote browser containers with CDP + Playwright access and persistent session storage via S3. Agents get a real browser that stays logged in across restarts — Google, Microsoft, or any web session.

  • CDP + Playwright — full browser automation via Chrome DevTools Protocol
  • Persistent sessions — authenticated browser state saved to S3, restored on next spin-up
  • VNC access — humans can observe and control the browser in real time alongside agents
  • On-demand containers — spin up in seconds, auto-reclaim when idle

See features/browser-session/ and features/remote-browser/ for details.


MCP Server — Meeting Tools for AI Agents

17 tools that let AI agents join meetings, read transcripts, speak, chat, and share screen. Works with Claude, Cursor, Windsurf, and any MCP-compatible client.

Your AI agent can join a meeting, listen to the conversation, and participate — all through MCP tool calls. See services/mcp/ for setup and tool reference.


Modular — Pick What You Need

Vexa is a toolkit, not a monolith. Every feature works independently. Use one or all — they compose when you need them to.

You're building... Features you need Skip the rest
Self-hosted Otter replacement transcription + multi-platform + webhooks agent runtime, scheduler, MCP
Meeting data pipeline transcription + webhooks + post-meeting speaking-bot, chat, agent runtime
AI meeting assistant product transcription + MCP + speaking-bot + chat remote-browser, scheduler
Meeting bot API (like Recall.ai) multi-platform + transcription + token-scoping agent runtime, workspaces

You don't pay complexity tax for features you don't use. Each service is a separate container. Don't need agents? Don't run agent-api. Don't need TTS? Don't run tts-service. Services communicate via REST and Redis, not tight coupling.


Roadmap

For the up-to-date roadmap and priorities, see GitHub Issues and Milestones. Issues are grouped by milestones to show what's coming next, in what order, and what's currently highest priority.

For discussion/support, join our Discord.

Architecture & Feature Status

Each service and feature has its own README with architecture, DoD table, and evidence-based confidence scores.


Contributing

We use GitHub Issues as our main feedback channel — triaged within 72 hours. Look for good-first-issue to get started. Join Discord to discuss ideas and get assigned.

License

Links

WebsiteDocsDiscordLinkedInX (@grankin_d)Meet Founder

Related: vexa-lite-deployVexa Dashboard

Release History

VersionChangesUrgencyDate
v0.10.6.3# Vexa 0.10.6.3 7-pack stitched release replayed from `v0.10.6` lineage, stitched into one candidate, validated end-to-end on three production-shape lanes (Compose VM, Lite VM, LKE Helm), and shipped after pack-by-pack sign-off. ## What's new - **Pack 1 — Recording Playback Trust** (#364): one canonical master recording artifact; dashboard playback path with same-origin raw audio proxy; finalizer self-heal recovers JSONB from storage when bot exits before chunk-write; trust UX (neutral "RecorHigh5/28/2026
v0.10.6.2## v0.10.6.2 `v0.10.6.2` is the corrective patch for the published `v0.10.6.1` release. It is **v0.10.6.1 plus a small regression-fix layer**, not a new feature release. `v0.10.6.1` remains the validated base release. This patch fixes the deployment/browser regressions discovered after publication. ### Fixed since v0.10.6.1 - Dashboard `/api/config` now returns browser-reachable API/WebSocket URLs as the runtime SSOT instead of leaking internal DNS or localhost. - Dashboard browser/VNC view High5/20/2026
0.10.6**Major release** — Pack U structural fix for the audio recording pipeline + Pack M chunk-accumulation memory leak fix + Pack H helm chart hardening. ## Highlights - **Pack M (chunk-leak fix)** — closes the v0.10.5.2 24-min recording crash class. Splice-on-upload + cap=10 across all 3 platforms. - **Pack U.5 (server-side master finalizer)** — recordings built server-side from chunks already in MinIO. Crash-safe for the first time: SIGKILL'd / OOM'd / evicted bots produce playable masters. - **High5/3/2026
v0.10.5.2**Surgical patch on v0.10.5.** One-line revert closing the cross-platform `Execution context destroyed` mid-meeting crash. ## What's fixed - **`#291` (umbrella) + `#284` + `#281`** — `transceiver.direction` mutation at site 2 of `services/vexa-bot/core/src/services/screen-content.ts:1218-1228` removed. v0.10.5 reverted the same pattern at site 1 in [`8ab7f49`](https://github.com/Vexa-ai/vexa/commit/8ab7f49); this release reverts the duplicate site 2 inside the live `RTCPeerConnection` track-evHigh5/1/2026
v0.10.4# Vexa v0.10.4 — Zoom Web bot **Release date:** 2026-04-27 **Cycle:** `260426-zoom` **Highlight:** Zoom support via the official Zoom **Web Client** (no proprietary SDK required). --- ## What's new ### 🎯 Zoom Web is the bot's default join path Spawn a Zoom bot with the same API as Google Meet / MS Teams — **no Zoom SDK credentials, no special configuration**: ```bash curl -X POST $GATEWAY/bots \ -H "X-API-Key: $TOKEN" -H "Content-Type: application/json" \ -d '{"platform":"zoom","nativHigh4/26/2026
vexa-0.10.3Vexa - self-hosted real-time meeting transcription platformHigh4/22/2026
v0.10.3# v0.10.3 — Post-incident stabilization **Release tag**: `0.10.0-260421-2337` · **Compose**: `docker pull vexaai/meeting-api:0.10.0-260421-2337` · **Helm**: chart `vexa-0.1.0` values `global.imageTag=latest` Seven issue packs addressing GitHub reports + the 2026-04-20 incident-doc findings. Full release protocol run (groom → plan → develop → deploy → validate → triage → human → ship) with registry-gated regression guards. ## Highlights | Area | Change | |---|---| | 🛡️ **Chart secrets** (#22High4/21/2026
v0.10.2Small tuning patch on top of v0.10.1. ## Changes - **`perf(transcription-service)`** — two tuning knobs for the GPU transcription stack: - `VAD_MIN_SILENCE_DURATION_MS=80` on both worker pools. Halves the min-silence window the Silero VAD uses to split chunks (default ~160ms). More responsive segmentation for conversational audio; small CTranslate2 invocation bump, fine on the GPU tier. - nginx `client_max_body_size 50M → 500M`. The 50M cap was rejecting longer single-file uploads (~10–15 High4/19/2026
v0.10.1Security patch release on top of v0.10 — closes two reporter-filed CVEs and lands 6 upstream-hardening packs. ## CVEs closed | CVE | Severity | Advisory | Reporter | |-----|:--------:|----------|----------| | [CVE-2026-25058](https://github.com/Vexa-ai/vexa/security/advisories/GHSA-w73r-2449-qwgh) | HIGH (7.5) | Unauthenticated internal transcript endpoint exposed by default | Ariel Silver ([@SilverPlate3](https://github.com/SilverPlate3)) | | [CVE-2026-25883](https://github.com/Vexa-ai/vexa/sHigh4/19/2026
v0.10# v0.10 — Vexa goes cloud-native **The biggest refactor ever.** v0.9 was a bot that joined meetings. v0.10 is a **cloud-native agent platform** where meeting bots are one workload type among many. From single-container all-in-one → first-class Kubernetes, scalable horizontally, with a general-purpose agent runtime on top. 723 commits · 1014 files changed · +142k / −44k lines. --- ## The new architecture ### Two-layer model: runtime + applications v0.10 introduces a clean two-layer split thHigh4/19/2026
v0.9# v0.9 Release since v0.6 — 272 files changed, ~54k lines added. ## New platforms - **Zoom SDK integration** — native Zoom Meeting SDK bot with real-time transcription, speaker diarization, and C++ audio bridge (`services/vexa-bot/core/src/platforms/zoom/`) - **MS Teams full URL support** — enterprise deep links, v2 fragment format (`#/meet/<id>`), ZoomGov URLs, Meet nickname URLs ## Interactive bots New bot capabilities beyond passive transcription: - **Speak** — TTS playback into meetingsHigh4/8/2026
v0.6# Vexa v0.6.0 — Microsoft Teams + WebSockets **Release date:** 4 Oct 2025 🎉 What’s new in v0.6 * **Microsoft Teams support** (alongside Google Meet) * **WebSocket transcript streaming** for efficient sub-second delivery * Numerous **reliability and joining improvements** from real-world usage of our hosted service Vexa drops a bot into your online meeting and streams transcripts to your apps in real time. --- ## Platforms * Google Meet * Microsoft Teams ## Transport Low10/4/2025
v0.5What’s New in v0.5 MCP Server Integration – You can now send bots and get real-time assistance with Claude, Cursor, or any MCP-capable agent. Nomad Orchestrator Support – Native support for Nomad to manage and scale workloads. Simplified Deployment – Updated self-hosting instructions with a single make all command. Bug Fixes – Stability and reliability enhancements across the stack. Performance Improvements – Various optimizations for faster and smoother operation. Google Meet Low9/4/2025
v0.4.1Release v0.4.1Low6/6/2025
v0.4📣 Open-source Google Meet real-time transcription API, now with speaker identification! Big milestone for the community—our most requested feature is now live. Build products like meeting notetakers, call analyzers, or AI workflows in minutes using Vexa. 🔥 What’s New Speaker identification for Google Meet Now your transcripts come labeled with speakers, making them instantly usable for CRMs, meeting notes, AI agents, and analytics. Improved stability and bot management More roLow6/5/2025
v0.3.2Numerous bug fixes and resilience upgradesLow5/28/2025
v0.3.1Real‑time Google Meet transcription, now one‑command on any laptop.Low5/22/2025
v0.3Vexa Major Release - **Public API Now Available with Real-Time Translation from Google Meet** The Vexa API is now publicly available at vexa.ai with self-service access - **get your API key in 3 clicks and have everything running in under 5 minutes.** Key features in this release: - **Instant API Access**: Self-service API keys available directly from vexa.ai - **Google Meet Bot Integration**: Programmatically send bots to join and transcribe meetings - **Real-Time Transcription**: AcceLow4/24/2025
v0.2Vexa Release v0.2 - Google Meet Bot & Real-time Transcription API You can now programmatically send a Vexa bot into a Google Meet call using simple API. Once joined, the bot delivers real-time transcription of the meeting, accessible through the same API. This provides a seamless way to integrate automated transcription directly into your workflows. This release establishes the foundation of our scalable microservices architecture, ensuring reliability and paving the way for futuLow4/9/2025
v0.1.0We’re excited to announce the v0.1.0 alpha release of Vexa, a **secure real-time meeting transcription and knowledge management platform designed for corporate in-house deployment**. With enterprise-grade security at its core, Vexa enables organizations to transcribe, capture, and manage critical meeting insights while maintaining complete control over their data in private or on-premises environments.Low3/10/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

AI-Skills🤖 Enhance AI capabilities with modular Skills that provide expert knowledge, workflows, and integrations for any project.main@2026-06-07
telegram-mcp🤖 Manage multiple Telegram accounts effortlessly with AI-driven tools for bulk messaging, scheduling, and more in one easy-to-use platform.main@2026-06-07
Enterprise-Multi-AI-Agent-Systems-🤖 Build and deploy scalable Multi-AI Agent systems with LangGraph and Groq LLMs to enhance intelligence across enterprise applications.main@2026-06-07
AIDomesticCoreAIJ🛠️ Build a robust AI Kernel for stable, auditable, and sovereign AI systems, ensuring secure execution and compliance across various domains.main@2026-06-07
argus-mcp🔍 Enhance code quality with Argus MCP, an AI-driven code review server using a Zero-Trust model for safe and efficient development.main@2026-06-07

More in MCP Servers

claude-plugins-officialOfficial, Anthropic-managed directory of high quality Claude Code Plugins.
langchain4jLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme
hyperframesWrite HTML. Render video. Built for agents.
claude-code-guideClaude Code Guide - Setup, Commands, workflows, agents, skills & tips-n-tricks go from beginner to power user!