Search results for "transcription"
Faster Whisper transcription with CTranslate2
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
The agent that grows with you
An open-source AI assistant framework with skills and agent architecture
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
Open-source framework for conversational voice AI agents
Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js β 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.
π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflows
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
π‘βοΈAI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reportingπ‘βοΈ
Unified framework for building enterprise RAG pipelines with small, specialized models
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
Agentic AI assistant on Telegram, powered by Claude Code. Runs locally with shell access, spec-driven PR reviews, layered security, persistent memory, and scheduled jobs. Your machine, your data, your
A flexible multi-interface AI agent framework for building agents with reasoning, tool use, memory, deep research, blockchain interaction, MCP, and agents-as-a-service.
Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf
A Multi-Agentic AI Assistant/Builder
Ham radio & GMRS gateway, repeater and packet radio β bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B
π Self-hosted multi-agent AI orchestrator β chat with Claude, Gemini & Copilot CLI from Telegram, WebEx, or browser. 5 runtimes, 17+ models, task scheduling, skill plugins.
The localhost AI Agent Runtime -- Chat UI, Tools, RAG, and MCP in one pip install
π€ Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.
