
vexa

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.


README


Vexa

Open-source meeting bot API & transcription API

meeting bots • real-time transcription • interactive bots • MCP server • self-hosted

Google Meet     •     Microsoft Teams     •     Zoom



Vexa is an open-source, self-hostable meeting bot API and meeting transcription API for Google Meet, Microsoft Teams, and Zoom. It is an alternative to Recall.ai, Otter.ai, and Fireflies.ai: self-host so meeting data never leaves your infrastructure, or use the hosted service at vexa.ai.


  • Data sovereignty — self-host so meeting data never leaves your infrastructure
  • Cost — replace $20/seat SaaS with your own infrastructure
  • Embed in your product — multi-tenant meeting bot API with scoped tokens
  • AI agents — MCP server with 17 tools


Capabilities

  • Meeting bot API — send a bot to any meeting: auto-join, record, speak, chat, share screen. Open-source alternative to Recall.ai.
  • Meeting transcription API — real-time transcripts via REST API and WebSocket. Self-hosted alternative to Otter.ai and Fireflies.ai.
  • Real-time transcription — sub-second per-speaker transcripts during the call. 100+ languages via Whisper. WebSocket streaming.
  • Interactive bots — make bots speak, send/read chat, share screen content, and set avatar in live meetings.
  • Browser bots — CDP + Playwright browser automation with persistent authenticated sessions via S3.
  • MCP server — 17 meeting tools for Claude, Cursor, Windsurf. AI agents join calls, read transcripts, speak in meetings.
  • Multi-tenant — users, scoped API tokens, isolated containers. Deploy once, serve your team.
  • Dashboard — open-source Next.js web UI for meetings, transcripts, agent chat, and browser sessions. Ready to use out of the box.
  • Self-hostable — run on your own infrastructure; meeting data never leaves it.

Every feature is a separate service. Pick what you need, skip what you don't. Self-host everything or use vexa.ai hosted.


Why Self-Host Meeting Transcription?

For regulated industries — banks, financial services, healthcare — meeting data can't leave your infrastructure. Self-hosting Vexa means zero external data transmission and full audit trail on your own infrastructure.

For cost-conscious teams — replace per-seat SaaS pricing. A team paying $17/seat/mo for meeting transcription can self-host Vexa and drop that to infrastructure cost.

For developers — embed a meeting bot API in your product. Multi-tenant, scoped API tokens, no per-user infrastructure.

Build meeting assistants like Otter.ai, Fireflies.ai, or Fathom — or build a meeting bot API like Recall.ai — self-hosted on your infrastructure.

  • Vexa (self-host) — your infra cost. Data never leaves your infrastructure. Meeting bot API, real-time transcription, interactive bots, MCP server. Open source, Apache 2.0. Google Meet, Teams, Zoom*.
  • Recall.ai — $0.50/hr. No self-hosting. Meeting bot API, real-time transcription. No MCP, limited interactive bots. Closed source. Meet, Teams, Zoom, Webex.
  • Otter.ai — $17-20/seat/mo. No self-hosting. No API. Limited real-time transcription. Closed source. Meet, Teams, Zoom.

* Zoom support is experimental

Or use vexa.ai hosted — get an API key and start sending bots immediately, no infrastructure needed.

Built for Data Sovereignty

Meeting data never leaves your infrastructure. Self-host for complete control. Modular architecture scales from edge devices to millions of users.

1. Hosted service — at vexa.ai: get an API key and start sending bots. No infrastructure needed; ready to integrate.


2. Self-host with Vexa transcription — run Vexa yourself and use vexa.ai for transcription. Ready to go with no GPU and minimal DevOps; see deploy/ for setup guides.


3. Fully self-host — run everything, including your own GPU transcription service. Meeting data never leaves your infrastructure; see deploy/ for setup guides.

What's new

v0.10 — full architecture refactor

  • Services refactored — runtime-api as infrastructure layer (container orchestration), meeting-api as data layer, agent-api as high-level intelligence layer. Clean separation of concerns.
  • Real-time pipeline moved into bots — transcription pipeline now runs inside bot containers, eliminating external dependencies
  • Agent API (experimental) — ephemeral containers for AI agents. See services/agent-api/.
  • Helm/K8s — production Kubernetes deployment with built images and global.imageTag support

v0.9

  • Zoom (experimental) — initial Zoom Meeting SDK support
  • Interactive Bots API — speak, chat, screen share, avatar controls during live meetings
  • MCP server — 17 tools for AI agents
  • Recordings — S3-compatible storage

See full release notes: https://github.com/Vexa-ai/vexa/releases


Quickstart

Self-host with Docker

On a fresh Linux machine (Ubuntu 24.04):

apt-get update && apt-get install -y make git curl
curl -fsSL https://get.docker.com | sh
git clone https://github.com/Vexa-ai/vexa.git && cd vexa

Then choose:

Command   | What you get                      | Best for
make lite | Single container, all services    | Quick evaluation, small teams
make all  | Full stack, each service separate | Development, production

Both prompt for a transcription token on first run. Get one at staging.vexa.ai/dashboard/transcription, or self-host transcription with a GPU.

Guides: Vexa Lite | Docker Compose | Helm (K8s)

Hosted (no deployment needed)

Get your API key at vexa.ai/dashboard/api-keys and start sending bots immediately.

Meeting API — Send Bots, Get Transcripts

Send a bot, get real-time transcripts with per-speaker audio and interactive controls (speak, chat, share screen).

# Send a bot to Google Meet
curl -X POST "$API_BASE/bots" \
  -H "Content-Type: application/json" \
  -H "X-API-Key: <API_KEY>" \
  -d '{"platform": "google_meet", "native_meeting_id": "abc-defg-hij"}'

# Get transcripts
curl -H "X-API-Key: <API_KEY>" \
  "$API_BASE/transcripts/google_meet/abc-defg-hij"

Works with Google Meet, Microsoft Teams, and Zoom. Set API_BASE to https://api.cloud.vexa.ai (hosted) or http://localhost:8056 (self-hosted).

For real-time WebSocket streaming, see the WebSocket guide. For full REST details, see the User API Guide.
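The two curl calls above can also be sketched in Python. This is a hedged example that assumes only the endpoints and headers shown above; it builds the requests with the `requests` library without sending them, so you can inspect them and then dispatch against a live deployment:

```python
import requests

API_BASE = "http://localhost:8056"  # or https://api.cloud.vexa.ai for hosted
API_KEY = "YOUR_API_KEY"            # placeholder — use your real key

def send_bot(platform: str, native_meeting_id: str) -> requests.PreparedRequest:
    """Build the POST /bots request that sends a bot into a meeting."""
    return requests.Request(
        "POST",
        f"{API_BASE}/bots",
        headers={"Content-Type": "application/json", "X-API-Key": API_KEY},
        json={"platform": platform, "native_meeting_id": native_meeting_id},
    ).prepare()

def get_transcripts(platform: str, native_meeting_id: str) -> requests.PreparedRequest:
    """Build the GET /transcripts/{platform}/{meeting_id} request."""
    return requests.Request(
        "GET",
        f"{API_BASE}/transcripts/{platform}/{native_meeting_id}",
        headers={"X-API-Key": API_KEY},
    ).prepare()

# To actually execute against a deployment:
#   resp = requests.Session().send(send_bot("google_meet", "abc-defg-hij"))
req = send_bot("google_meet", "abc-defg-hij")
print(req.method, req.url)
```

Separating request construction from dispatch keeps the sketch testable offline; `requests.Session().send(...)` performs the real call once `API_BASE` and `API_KEY` point at your deployment.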


Browser Bots — Persistent Browser Containers for Agents

Remote browser containers with CDP + Playwright access and persistent session storage via S3. Agents get a real browser that stays logged in across restarts — Google, Microsoft, or any web session.

  • CDP + Playwright — full browser automation via Chrome DevTools Protocol
  • Persistent sessions — authenticated browser state saved to S3, restored on next spin-up
  • VNC access — humans can observe and control the browser in real time alongside agents
  • On-demand containers — spin up in seconds, auto-reclaim when idle

See features/browser-session/ and features/remote-browser/ for details.
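As a rough sketch of how an agent could attach to one of these containers: Playwright's `connect_over_cdp` is a real Playwright API, but the endpoint URL and everything else here are placeholders — your deployment supplies the actual CDP endpoint:

```python
def attach_over_cdp(cdp_url: str):
    """Attach Playwright to an already-running remote browser via CDP.

    The import is lazy so this sketch loads even without Playwright installed
    (pip install playwright).
    """
    from playwright.sync_api import sync_playwright

    p = sync_playwright().start()
    browser = p.chromium.connect_over_cdp(cdp_url)
    # Reuse the existing (possibly authenticated) context if one is present.
    context = browser.contexts[0] if browser.contexts else browser.new_context()
    return p, browser, context

# Example with a placeholder endpoint:
#   p, browser, ctx = attach_over_cdp("http://localhost:9222")
#   page = ctx.pages[0] if ctx.pages else ctx.new_page()
```

Attaching over CDP (rather than launching a fresh browser) is what lets the agent inherit the persistent, logged-in session the container restored from S3.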


MCP Server — Meeting Tools for AI Agents

17 tools that let AI agents join meetings, read transcripts, speak, chat, and share screen. Works with Claude, Cursor, Windsurf, and any MCP-compatible client.

Your AI agent can join a meeting, listen to the conversation, and participate — all through MCP tool calls. See services/mcp/ for setup and tool reference.
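For an MCP client that takes a JSON configuration (Claude Desktop and similar tools use this general `mcpServers` shape), registration might look like the fragment below. This is illustrative only — the server name, command, and URL are assumptions; the actual launch command and endpoint are documented in services/mcp/:

```json
{
  "mcpServers": {
    "vexa": {
      "command": "npx",
      "args": ["-y", "mcp-remote", "http://localhost:18888/mcp"]
    }
  }
}
```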


Modular — Pick What You Need

Vexa is a toolkit, not a monolith. Every feature works independently. Use one or all — they compose when you need them to.

You're building...                | Features you need                              | Skip the rest
Self-hosted Otter replacement     | transcription + multi-platform + webhooks      | agent runtime, scheduler, MCP
Meeting data pipeline             | transcription + webhooks + post-meeting        | speaking-bot, chat, agent runtime
AI meeting assistant product      | transcription + MCP + speaking-bot + chat      | remote-browser, scheduler
Meeting bot API (like Recall.ai)  | multi-platform + transcription + token-scoping | agent runtime, workspaces

You don't pay complexity tax for features you don't use. Each service is a separate container. Don't need agents? Don't run agent-api. Don't need TTS? Don't run tts-service. Services communicate via REST and Redis, not tight coupling.


Roadmap

For the up-to-date roadmap and priorities, see GitHub Issues and Milestones. Issues are grouped by milestones to show what's coming next, in what order, and what's currently highest priority.

For discussion/support, join our Discord.

Architecture & Feature Status

Each service and feature has its own README with architecture, DoD table, and evidence-based confidence scores.


Contributing

We use GitHub Issues as our main feedback channel — triaged within 72 hours. Look for good-first-issue to get started. Join Discord to discuss ideas and get assigned.

License

Apache 2.0.

Links

Website • Docs • Discord • LinkedIn • X (@grankin_d) • Meet Founder

Related: vexa-lite-deploy • Vexa Dashboard

Release History

Version | Changes | Urgency | Date
v0.10.2 | Small tuning patch on top of v0.10.1. perf(transcription-service) — two tuning knobs for the GPU transcription stack: VAD_MIN_SILENCE_DURATION_MS=80 on both worker pools, halving the min-silence window the Silero VAD uses to split chunks (default ~160ms) for more responsive segmentation of conversational audio, at the cost of a small CTranslate2 invocation bump that is fine on the GPU tier; nginx client_max_body_size raised from 50M to 500M, since the 50M cap was rejecting longer single-file uploads (~10–15 … | High | 4/19/2026
v0.9 | Release since v0.6 — 272 files changed, ~54k lines added. New platforms: Zoom SDK integration — native Zoom Meeting SDK bot with real-time transcription, speaker diarization, and C++ audio bridge (services/vexa-bot/core/src/platforms/zoom/); MS Teams full URL support — enterprise deep links, v2 fragment format (#/meet/<id>), ZoomGov URLs, Meet nickname URLs. Interactive bots — new capabilities beyond passive transcription: Speak — TTS playback into meetings … | High | 4/8/2026

