freshcrate — Search

Search results for "speech"

50 results found (Python)

docling 📁2.90.0🏛️ Flagship⭐58,310

SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.

convert docling document docx html layout markdown pdf pypi · by pypi · Python

faster-whisper 📁1.2.1🏛️ Flagship⭐22,327

Faster Whisper transcription with CTranslate2

ctranslate2 inference openai pypi quantization speech transformer whisper · by Guillaume Klein · Python

elevenlabs 📁2.44.0🌳 Mature⭐2,935

No description

pypi · by pypi · Python

weasel 📁1.0.0🌿 Growing⭐93

Weasel: A small and easy workflow system

pypi · by Explosion · Python

ai-powered-video-analyzer 📁0.0.0🌿 Growing⭐71

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

ai-video-analysis audio-event-detection blip2 gui image-captioning image-captioning-ai llm llm-summarization python · by arashsajjadi · Python

onyx 📁v3.2.6🏛️ Flagship⭐27,905

Open Source AI Platform - AI Chat with advanced features that works with every LLM

ai ai-chat chatgpt chatui enterprise-search gen-ai information-retrieval llm python rag · by onyx-dot-app · Python

voicemode 📁v8.6.1🌳 Mature⭐1,103

Natural (2-way) voice conversations with Claude Code

anthropic asr claude claudecode kokoro livekit mcp mcp-server python · by mbailey · Python

npcpy 📁v1.4.21🌳 Mature⭐1,307

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

agents ai llm mcp mcp-client mcp-server ollama perplexity python · by NPC-Worldwide · Python

claude-code-plugins-plus-skills 📁v4.26.0🌳 Mature⭐1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

agent-skills ai ai-agents anthropic automation claude-code claude-code-plugins developer-tools mcp python · by jeremylongshore · Python

jarvis 📁v1.28.0🌿 Growing⭐300

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

ai assistant health machine-learning mcp nutrition privacy private python · by isair · Python

agentscope 📁v1.0.19🏛️ Flagship⭐24,189

Build and run agents you can see, understand and trust.

agent chatbot large-language-models llm llm-agent mcp multi-agent multi-modal python · by agentscope-ai · Python

Auto-claude-code-research-in-sleep 📁v0.4.4🏛️ Flagship⭐7,173

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi

ai-research ai-tools aris autonomous-agent claude claude-code claude-code-skills codex python · by wanshuiyin · Python

litellm 📁v1.83.7-stable🏛️ Flagship⭐44,168

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm python · by BerriAI · Python

agent-zero 📁v1.9🏛️ Flagship⭐17,142

Agent Zero AI framework

agent ai assistant autonomous linux python zero · by agent0ai · Python

vllm-mlx 📁v0.2.8🌳 Mature⭐917

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

anthropic apple-silicon audio-processing claude-code computer-vision image-understanding inference llm python · by waybarrios · Python

simplechat 📁v0.241.006🌿 Growing⭐129

Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features

ai-chatbot azure azure-openai collaboration document-chat document-classification modular python rag · by microsoft · Python

vmlx 📁v1.3.34🌿 Growing⭐348

vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

anthropic-api kvcache-compression kvcache-optimization kvcache-reuse llm lmstudio macbook mcp-server python · by jjang-ai · Python

chak-ai 📁v0.3.1🌿 Growing⭐212

A simple, yet handy, LLM gateway.

python · by zhixiangxue · Python

ten-framework 📁0.11.63🏛️ Flagship⭐10,435

Open-source framework for conversational voice AI agents

ai multi-modal python real-time video voice · by TEN-framework · Python

txtai 📁v9.7.0🏛️ Flagship⭐12,412

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

agents ai ai-agents embeddings information-retrieval language-model large-language-models llm python vector-database · by neuml · Python

py-gpt 📁v2.7.12🌳 Mature⭐1,738

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

ai ai-assistant artificial-intelligence autonomous-agent chatbot claude deepseek desktop-app python · by szczyglis-dev · Python

VideoGraphAI 📁0.0.0🌿 Growing⭐57

🎬 AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.

ai-tools ai-video-generation artificial-intelligence content-automation content-creation llm machine-learning open-source python · by mikeoller82 · Python

PythonClaw 📁0.0.0🌱 Seedling⭐23

OpenClaw reimagined in pure Python — autonomous AI agent with memory, RAG, skills, web dashboard, voice input, daemon, and multi-channel support.

ai ai-agent autonomous-agent chatbot deepseek framework grok llm python · by ericwang915 · Python

cyllama 📁0.2.11🌱 Seedling⭐25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

agents cython cython-wrapper llama-cpp python python3 rag stable-diffusion-cpp whisper-cpp · by shakfu · Python

orbit 📁v2.6.6🌿 Growing⭐250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

ai-assistant ai-gateway ai-safety anthropic chatbot developer-tools elasticsearch llm python · by schmitech · Python

RAPTOR 📁0.0.0🌱 Seedling⭐14

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

ai ai-automation ai-framework ai-orchestration artificial-intelligence audio-processing computer-vision content-analysis python · by DHT-AI-Studio · Python

awesome-opensource-ai 📁main@2026-04-20🌿 Growing⭐2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

agents ai artificial-intelligence awesome awesome-list generative-ai llm machine-learning python rag · by alvinreal · Python

codec 📁main@2026-04-16🌿 Growing⭐90

Open-Source Intelligent Command Layer

llm-agent llm-agent-framework local-ai local-ai-agents local-ai-development local-ai-llm mac-os mlx python · by AVADSA25 · Python

agenticSeek 📁main@2026-04-11🌿 Growing⭐26,028

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993

agentic-ai agents ai autonomous-agents deepseek-r1 llm llm-agents python voice-assistant · by Fosowl · Python

kai 📁v1.4.0🌱 Seedling⭐29

Agentic AI assistant on Telegram, powered by Claude Code. Runs locally with shell access, spec-driven PR reviews, layered security, persistent memory, and scheduled jobs. Your machine, your data, your

ai-agent ai-assistant anthropic automation claude claude-code llm python · by dcellison · Python

ai-engineering-from-scratch 📁0.0.0🌱 Seedling⭐4,649

Learn it. Build it. Ship it for others.

agents ai ai-agents ai-engineering computer-vision course deep-learning from-scratch mcp python · by rohitg00 · Python

heurist-agent-framework 📁0.0.0🌱 Seedling⭐798

A flexible multi-interface AI agent framework for building agents with reasoning, tool use, memory, deep research, blockchain interaction, MCP, and agents-as-a-service.

agentic-ai agentic-framework ai mcp python · by heurist-network · Python

Ultimate-Agent-Directory 📁0.0.0🌱 Seedling⭐51

🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil

agent agentic agentic-ai agents boilerplate boilerplate-application boilerplate-template python · by moshehbenavraham · Python

RIGEL 📁0.0.0🌱 Seedling⭐26

A Multi-Agentic AI Assistant/Builder

agentic-ai ai-assistant ai-framework chatbot dbus groq linux llm python · by Zerone-Laboratories · Python

radio-gateway 📁v3.3.0🌱 Seedling⭐5

Ham radio & GMRS gateway, repeater and packet radio — bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B

adsb aioc broadcastify gmrs ham-radio kv4p linux mcp python · by ukbodypilot · Python

Open-Sable 📁v1.7.0🌱 Seedling⭐19

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

agentic agentic-ai ai ai-assistant open-source python · by IdeoaLabs · Python

hermes-life-os 📁v1.3.0🌱 Seedling⭐35

Personal OS agent that learns who you are, detects life patterns, and grows smarter about you every day. Memory + Cron + Atropos RL

atropos autonomous-agent autonomous-agents hermes-agent life-assistant memory nous-research personal-os python · by Lethe044 · Python

apiclaw 📁v2.0.0🌱 Seedling⭐7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

ai-agents ai-tools api-platform claude llm mcp model-context-protocol python · by nordsym · Python

Somi 📁Mineralization🌱 Seedling⭐20

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.

ai-agents ai-framework arti automation cli gui homeb local python · by Somi-Project · Python

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically updates LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

llm llm-agent python · by Lyz103 · Python

cloneme 📁0.0.0💤 Dormant⭐38

CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

ai ai-assistant automation autonomous-agent chatbot conversational-ai developer-tools digital-twin python · by vibheksoni · Python

second-brain 📁1.0🌱 Seedling⭐461

Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera

python rag · by henrydaum · Python

openchatci 📁v0.42.0🌱 Seedling⭐1

The localhost AI Agent Runtime -- Chat UI, Tools, RAG, and MCP in one pip install

ag-ui agent-framework agent-runtime ai-agent ai-tools azure-openai chatbot fastapi python · by motojinc25 · Python

AttentiveSupport 📁0.0.0💤 Dormant⭐36

LLM-based robot that intervenes only when needed

autonomous-agent large-language-model python robotics · by HRI-EU · Python

awesome-lark-bots 📁main@2026-04-21🌱 Seedling⭐2

Provide open-source AI bots for Lark to automate tasks like brainstorming, project planning, content creation, and monitoring within a secure chat interface.

ai ai-agent automation brainstorming chatbot content-creation deepseek feishu python · by umarqadri345 · Python

JianYan 📁main@2026-04-21🌱 Seedling⭐2

🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.

ai-agent asr audiototext funasr github-config nvidia openai productivity python · by Jnewton-lab · Python

Government-Citizen-Services-Voice-Agent 📁main@2026-04-15🌱 Seedling⭐1

Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services

conversational-ai elevenlabs fastapi govtech langgraph python rag voice-agent · by Automaticare · Python

seedance-2-ai 📁main@2026-04-21🌱 Seedling⭐1

🎥 Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.

ai-alignment ai-video-generator aigc cinematic-ai cursor-skills deepseek-video generative-ai image-to-video prompt-engineering python · by palamas86 · Python

enton 📁main@2026-04-21🌱 Seedling⭐1

Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.

ai autonomous-agent computer-vision cuda github-config llm python pytorch · by tareq3743 · Python

pyannote-metrics 4.0.0🌱 Seedling

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

pypi · by pypi · Python