freshcrate

Search results for "whisper"

37 results found (Python)

faster-whisper · 1.2.1 · 🏛️ Flagship · ⭐22,327

Faster Whisper transcription with CTranslate2

ai-powered-video-analyzer · 0.0.0 · 🌿 Growing · ⭐71

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

cognithor · v0.92.3 · 🌿 Growing · ⭐115

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

voicemode · v8.6.1 · 🌳 Mature · ⭐1,103

Natural (2-way) voice conversations with Claude Code

jarvis · v1.28.0 · 🌿 Growing · ⭐300

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

awesome-cli-coding-agents · main@2026-04-18 · 🌿 Growing · ⭐244

Curated directory of terminal-native AI coding agents and the harnesses that orchestrate them. Covers open-source tools (Pi, OpenCode, Aider, Goose), platform agents (Claude Code, Codex, Gemini CLI),

pixeltable · v0.5.28 · 🌳 Mature · ⭐1,549

Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

cyllama · 0.2.11 · 🌱 Seedling · ⭐25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

vllm-mlx · v0.2.8 · 🌳 Mature · ⭐917

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

vmlx · v1.3.34 · 🌿 Growing · ⭐348

vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

obsidian-second-brain · v4.0.0 · 🌿 Growing · ⭐244

A Claude Code skill that turns your Obsidian vault into a living second brain - autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.

chak-ai · v0.3.1 · 🌿 Growing · ⭐212

A simple, yet handy, LLM gateway.

animaworks · v0.6.2 · 🌿 Growing · ⭐230

Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).

LIA-Assistant · v1.17.1 · 🌱 Seedling · ⭐17

Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js - 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.

txtai · v9.7.0 · 🏛️ Flagship · ⭐12,412

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

cognita · 0.0.0 · 🌳 Mature · ⭐4,405

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Zen-Ai-Pentest · v3.0.0 · 🌿 Growing · ⭐355

🛡⚔️ AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting 🛡⚔️

orbit · v2.6.6 · 🌿 Growing · ⭐250

One API for 20+ LLM providers, your databases, and your files - self-hosted, open-source AI gateway with RAG, voice, and guardrails.

py-gpt · v2.7.12 · 🌳 Mature · ⭐1,738

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

mcp-video · v1.2.1 · 🌱 Seedling · ⭐5

Video editing MCP server for AI agents. 83 tools, 858 tests collected, 3 interfaces. Works with Claude Code, Cursor, and any MCP client. Local, fast, free.

RAPTOR · 0.0.0 · 🌱 Seedling · ⭐14

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

aura · main@2026-04-21 · 🌿 Growing · ⭐55

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules - running loca

arcade-mcp · main@2026-04-21 · 🌿 Growing · ⭐864

The best way to create, deploy, and share MCP Servers

awesome-opensource-ai · main@2026-04-20 · 🌿 Growing · ⭐2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

kai · v1.4.0 · 🌱 Seedling · ⭐29

Agentic AI assistant on Telegram, powered by Claude Code. Runs locally with shell access, spec-driven PR reviews, layered security, persistent memory, and scheduled jobs. Your machine, your data, your

RIGEL · 0.0.0 · 🌱 Seedling · ⭐26

A Multi-Agentic AI Assistant/Builder

radio-gateway · v3.3.0 · 🌱 Seedling · ⭐5

Ham radio & GMRS gateway, repeater and packet radio - bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B

sinain-hud · overlay-v2.8.0 · 🌱 Seedling · ⭐5

Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf

apiclaw · v2.0.0 · 🌱 Seedling · ⭐7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

Somi · Mineralization · 🌱 Seedling · ⭐20

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features - fully self-hosted via Ollama.

cloneme · 0.0.0 · 💀 Dormant · ⭐38

CloneMe is an advanced AI platform that builds your digital twin - an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

Wee-Orchestrator · main@2026-04-21 · 🌱 Seedling · ⭐6

🍀 Self-hosted multi-agent AI orchestrator - chat with Claude, Gemini & Copilot CLI from Telegram, WebEx, or browser. 5 runtimes, 17+ models, task scheduling, skill plugins.

second-brain · 1.0 · 🌱 Seedling · ⭐461

Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera

openchatci · v0.42.0 · 🌱 Seedling · ⭐1

The localhost AI Agent Runtime -- Chat UI, Tools, RAG, and MCP in one pip install

enton · main@2026-04-21 · 🌱 Seedling · ⭐1

Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.

taiji · v0.2.0 · 🌱 Seedling

AI-powered self-learning OS with I Ching philosophy