freshcrate — Search

Search results for "audio"

64 results found (Python)

restai 📁v6.1.45🌿 Growing⭐483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

blocky embeddings fastapi langchain llama llamaindex llm ollama python ragby apocasPython

RAPTOR 📁0.0.0🌱 Seedling⭐13

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

ai ai-automation ai-framework ai-orchestration artificial-intelligence audio-processing computer-vision content-analysis pythonby DHT-AI-StudioPython

ai-powered-video-analyzer 📁0.0.0🌿 Growing⭐68

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

ai-video-analysis audio-event-detection blip2 gui image-captioning image-captioning-ai llm llm-summarization pythonby arashsajjadiPython

voicemode 📁v8.6.1🌳 Mature⭐1,103

Natural (2-way) voice conversations with Claude Code

anthropic asr claude claudecode kokoro livekit mcp mcp-server pythonby mbaileyPython

story-shot-agent 📁v0.2.4🌿 Growing⭐52

剧本分镜智能体（PenShot）：剧本→分镜→片段→prompt | 基于 LangGraph+LLM，自动解析任意格式剧本，生成 Sora/Veo/Runway 等模型可用的连贯text-to-video提示词。保持角色/剧情跨片段一致，支持 MCP/REST API/函数调用 | Python库 + A2A集成。（LLM-powered screenplay-to-video-prompt a

agent-to-agent ai-filmmaking ai-video-generation character-consistency function-calling kling-ai langgraph-agent llm-agent pythonby neopenPython

npcpy 📁v1.4.21🌳 Mature⭐1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

agents ai llm mcp mcp-client mcp-server ollama perplexity pythonby NPC-WorldwidePython

claude-code-plugins-plus-skills 📁v4.26.0🌳 Mature⭐1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

agent-skills ai ai-agents anthropic automation claude-code claude-code-plugins developer-tools mcp pythonby jeremylongshorePython

jarvis 📁v1.28.0🌿 Growing⭐174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

ai assistant health machine-learning mcp nutrition privacy private pythonby isairPython

cyllama 📁0.2.11🌱 Seedling⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

agents cython cython-wrapper llama-cpp python python3 rag stable-diffusion-cpp whisper-cppby shakfuPython

litellm 📁v1.83.7-stable🌳 Mature⭐42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm pythonby BerriAIPython

solace-agent-mesh 📁1.18.40🌳 Mature⭐3,101

An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-s

a2a agentframework agentic agentic-ai agentic-framework agentic-workflow agenticai agents pythonby SolaceLabsPython

LLM-Agents-Ecosystem-Handbook 📁0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops pythonby oxbshwPython

openakita 📁v1.27.9🌳 Mature⭐1,655

An open-source AI assistant framework with skills and agent architecture

agent ai assistant automation claw clawd clawdbot openclaw pythonby openakitaPython

ten-framework 📁0.11.63🏛️ Flagship⭐10,435

Open-source framework for conversational voice AI agents

ai multi-modal python real-time video voiceby TEN-frameworkPython

llm_intents 📁1.7.1🌿 Growing⭐122

Exposes internet search tools for use by LLM-backed Assist in Home Assistant

assist hacs hacs-integration hassio hassio-integration home-assistant home-assistant-integration home-assistant-voice pythonby skye-harrisPython

cognithor 📁v0.92.2🌿 Growing⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

agent-os ai-agent anthropic autonomous-agent discord-bot document-analysis gdpr-compliant gemini pythonby Alex8791-cyberPython

antigravity-awesome-skills 📁main@2026-04-21🌱 Seedling⭐30

🌌 Explore 255+ essential skills for AI coding assistants like Claude Code and GitHub Copilot to enhance your development workflow.

agentic-skills ai-agents antigravity antigravity-ide audio autonomous-coding claude claude-code mcp pythonby cleodinPython

Zen-Ai-Pentest 📁v3.0.0🌿 Growing⭐355

🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️

ai automation compliance cybersecurity ethical-hacking framework penetration-testing pentesting pythonby SHAdd0WTAkaPython

mcp-client-for-ollama 📁v0.28.0🌿 Growing⭐599

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

agentic-ai ai command-line-tool generative-ai linux llm local-llm macos pythonby joniglPython

vllm-mlx 📁v0.2.8🌿 Growing⭐798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

anthropic apple-silicon audio-processing claude-code computer-vision image-understanding inference llm pythonby waybarriosPython

hermes-agent 📁v2026.4.16🌿 Growing⭐57,954

The agent that grows with you

ai ai-agent ai-agents anthropic chatgpt claude claude-code clawdbot pythonby NousResearchPython

simplechat 📁v0.241.006🌿 Growing⭐128

Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features

ai-chatbot azure azure-openai collaboration document-chat document-classification modular python ragby microsoftPython

VideoGraphAI 📁0.0.0🌿 Growing⭐54

🎬 AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.

ai-tools ai-video-generation artificial-intelligence content-automation content-creation llm machine-learning open-source pythonby mikeoller82Python

mcp 📁2026.04.20260414152327🌿 Growing⭐8,740

Official MCP Servers for AWS

aws mcp mcp-client mcp-clients mcp-host mcp-server mcp-servers mcp-tools pythonby awslabsPython

py-gpt 📁v2.7.12🌳 Mature⭐1,738

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

ai ai-assistant artificial-intelligence autonomous-agent chatbot claude deepseek desktop-app pythonby szczyglis-devPython

RIGEL 📁0.0.0🌱 Seedling⭐26

A Multi-Agentic AI Assistant/Builder

agentic-ai ai-assistant ai-framework chatbot dbus groq linux llm pythonby Zerone-LaboratoriesPython

sinain-hud 📁overlay-v2.8.0🌱 Seedling⭐5

Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf

agent ai audio-transcription hud macos mcp overlay privacy pythonby anthillnetPython

sdk-python 📁v1.36.0🌿 Growing⭐5,602

A model-driven approach to building AI agents in just a few lines of code.

agentic agentic-ai agents ai anthropic autonomous-agents bedrock genai pythonby strands-agentsPython

codec 📁main@2026-04-16🌿 Growing⭐89

Open-Source Intelligent Command Layer

llm-agent llm-agent-framework local-ai local-ai-agents local-ai-development local-ai-llm mac-os mlx pythonby AVADSA25Python

openai-python 📁v2.32.0🌿 Growing⭐30,457

The official Python library for the OpenAI API

openai pythonby openaiPython

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

obsidian-second-brain 📁v4.0.0🌿 Growing⭐105

A Claude Code skill that turns your Obsidian vault into a living second brain — autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.

ai-agents claude-code obsidian obsidian-plugin productivity python second-brainby eugeniughelburPython

chak-ai 📁v0.3.1🌿 Growing⭐211

A simple, yet handy, LLM gateway.

pythonby zhixiangxuePython

prompt-os 📁v1.0.0🌱 Seedling⭐6

A desktop AI agent that controls your local machine — runs commands, manages files, executes code, browses the web autonomously etc. Supports Claude, GPT, Gemini, Llama, DeepSeek, and more. .exe avail

agentic-ai ai-agent anthropic autonomous-agent browser-use customtkinter deepseek desktop-app pythonby thomastschinkelPython

LIA-Assistant 📁v1.17.1🌱 Seedling⭐17

Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js — 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.

ai ai-agent ai-assistant assistant chatbot claude claude-code clawdbot pythonby jgouviergmailPython

heurist-agent-framework 📁0.0.0🌱 Seedling⭐798

A flexible multi-interface AI agent framework for building agents with reasoning, tool use, memory, deep research, blockchain interaction, MCP, and agents-as-a-service.

agentic-ai agentic-framework ai mcp pythonby heurist-networkPython

claude-code-config 📁0.0.0🌱 Seedling⭐88

Claude Code skills, architectural principles, and alternative approaches for AI-assisted development

ai-agents claude claude-code llm machine-learning mcp prompt-engineering python skillsby AnastasiyaWPython

locallens 📁v0.0.3🌱 Seedling⭐7

Search your files by talking to them - 100% offline

edge-ai local-ai local-first python qdrant qdrant-edge vector-database voiceby mahimairajaPython

little-coder 📁v0.0.4🌱 Seedling⭐31

A coding agent optimized to smaller LLMs

ai-coding-assistant aider-polygot benchmark code-generation coding-agent coding-agents local-llm ollama pythonby itayinbarrPython

hermes-gate 📁0.0.0🌱 Seedling⭐18

🏛️ Hermes Gate — Terminal TUI for managing remote Hermes Agent sessions with auto-reconnect, detach support, and zero config

agent ai ai-agent anthropic chatgpt claude claude-code clawdbot llm-agent pythonby LehaoLinPython

mcp-video 📁v1.2.1🌱 Seedling⭐5

Video editing MCP server for AI agents. 83 tools, 858 tests collected, 3 interfaces. Works with Claude Code, Cursor, and any MCP client. Local, fast, free.

agent-tools ai-agents ai-video animation claude claude-code cursor ffmpeg mcp pythonby Pastorsimon1798Python

apiclaw 📁v2.0.0🌱 Seedling⭐7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

ai-agents ai-tools api-platform claude llm mcp model-context-protocol pythonby nordsymPython

Open-Sable 📁v1.7.0🌱 Seedling⭐18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

agentic agentic-ai ai ai-assistant open-source pythonby IdeoaLabsPython

radio-gateway 📁v3.3.0🌱 Seedling⭐5

Ham radio & GMRS gateway, repeater and packet radio — bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B

adsb aioc broadcastify gmrs ham-radio kv4p linux mcp pythonby ukbodypilotPython

cloneme 📁0.0.0💤 Dormant⭐38

CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

ai ai-assistant automation autonomous-agent chatbot conversational-ai developer-tools digital-twin pythonby vibheksoniPython

ComfyUI-AudioSR 📁main@2026-04-21🌱 Seedling⭐2

🎶 Enhance audio quality with ComfyUI-AudioSR, a versatile tool for upscaling sounds to 48kHz for better clarity and listening experience.

cemu comfy comfyui-nodes copilot cpp deepseek dit emulator llm-agent pythonby xaeksxPython

JianYan 📁main@2026-04-21🌱 Seedling⭐2

🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.

ai-agent asr audiototext funasr github-config nvidia openai productivity pythonby Jnewton-labPython

second-brain 📁1.0🌱 Seedling⭐461

Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera

python ragby henrydaumPython

dj-treta-being 📁main@2026-04-19🌱 Seedling⭐2

Install your own AI DJ Being. She searches, downloads, listens, mixes, and generates music — autonomously. 30hrs for $0.04.

ai-beings ai-dj autonomous-agent dj gemini mixxx music music-generation pythonby VeltriaAIPython

seedance-2-ai 📁main@2026-04-21🌱 Seedling⭐1

🎥 Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.

ai-alignment ai-video-generator aigc cinematic-ai cursor-skills deepseek-video generative-ai image-to-video prompt-engineering pythonby palamas86Python

pyannote-audio4.0.4🌱 Seedling

State-of-the-art speaker diarization toolkit

pypiby pypiPython

banks 📁2.4.1🌱 Seedling

A prompt programming language

pypiby pypiPython

magika 📁1.0.2🌱 Seedling

A tool to determine the content type of a file with deep learning

content detection learning machine pypi typeby pypiPython

docling 📁2.90.0🌱 Seedling

SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.

convert docling document docx html layout markdown pdf pypiby pypiPython

faster-whisper 📁1.2.1🌱 Seedling

Faster Whisper transcription with CTranslate2

ctranslate2 inference openai pypi quantization speech transformer whisperby Guillaume KleinPython

mistral-common1.11.0🌱 Seedling

Mistral-common is a library of common utilities for Mistral AI.

pypiby pypiPython

elevenlabs 📁2.44.0🌱 Seedling

No description

pypiby pypiPython

librosa 📁0.11.0🌱 Seedling

Python module for audio and music processing

pypiby Brian McFee, librosa development teamPython

torchmetrics 📁1.9.0🌱 Seedling

PyTorch native Metrics

ai deep learning machine metrics pypi pytorchby Lightning-AI et al.Python

json-repair 📁0.59.4🌱 Seedling

A package to repair broken json strings

json llm parser pypi repairby pypiPython

google-ai-generativelanguage 📁0.11.0🌱 Seedling

Google Ai Generativelanguage API client library

pypiby Google LLCPython

keras 📁3.14.0🌱 Seedling

Multi-backend Keras

pypiby pypiPython

azure-storage-blob 📁12.28.0🌱 Seedling

Microsoft Azure Blob Storage Client Library for Python

azure pypi sdkby Microsoft CorporationPython

transformers 📁5.5.4🌱 Seedling

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

deep-learning llm machine-learning nlp pypi python pytorch transformer vlmby The Hugging Face team (past and future) with the help of all our contributors (https://github.com/huPython