freshcrate

Search results for "audio"

64 results found (Python)
restai v6.1.45 🌿 Growing ⭐ 483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLMs supported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

RAPTOR 0.0.0 🌱 Seedling ⭐ 13

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

ai-powered-video-analyzer 0.0.0 🌿 Growing ⭐ 68

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

voicemode v8.6.1 🌳 Mature ⭐ 1,103

Natural (2-way) voice conversations with Claude Code

story-shot-agent v0.2.4 🌿 Growing ⭐ 52

Screenplay storyboard agent (PenShot): screenplay → storyboard → clips → prompt | Built on LangGraph + LLM; automatically parses screenplays in any format and generates coherent text-to-video prompts usable with models such as Sora/Veo/Runway. Keeps characters and plot consistent across clips; supports MCP/REST API/function calling | Python library + A2A integration. (LLM-powered screenplay-to-video-prompt a

npcpy v1.4.21 🌳 Mature ⭐ 1,287

The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

claude-code-plugins-plus-skills v4.26.0 🌳 Mature ⭐ 1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

jarvis v1.28.0 🌿 Growing ⭐ 174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

cyllama 0.2.11 🌱 Seedling ⭐ 22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

litellm v1.83.7-stable 🌳 Mature ⭐ 42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

solace-agent-mesh 1.18.40 🌳 Mature ⭐ 3,101

An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-s

LLM-Agents-Ecosystem-Handbook 0.0.0 🌳 Mature ⭐ 508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

openakita v1.27.9 🌳 Mature ⭐ 1,655

An open-source AI assistant framework with skills and agent architecture

ten-framework 0.11.63 🏛️ Flagship ⭐ 10,435

Open-source framework for conversational voice AI agents

llm_intents 1.7.1 🌿 Growing ⭐ 122

Exposes internet search tools for use by LLM-backed Assist in Home Assistant

cognithor v0.92.2 🌿 Growing ⭐ 94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

antigravity-awesome-skills main@2026-04-21 🌱 Seedling ⭐ 30

🌌 Explore 255+ essential skills for AI coding assistants like Claude Code and GitHub Copilot to enhance your development workflow.

Zen-Ai-Pentest v3.0.0 🌿 Growing ⭐ 355

🛡⚔️ AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting 🛡⚔️

mcp-client-for-ollama v0.28.0 🌿 Growing ⭐ 599

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

vllm-mlx v0.2.8 🌿 Growing ⭐ 798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

hermes-agent v2026.4.16 🌿 Growing ⭐ 57,954

The agent that grows with you

simplechat v0.241.006 🌿 Growing ⭐ 128

Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features

VideoGraphAI 0.0.0 🌿 Growing ⭐ 54

🎬 AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.

mcp 2026.04.20260414152327 🌿 Growing ⭐ 8,740

Official MCP Servers for AWS

py-gpt v2.7.12 🌳 Mature ⭐ 1,738

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

RIGEL 0.0.0 🌱 Seedling ⭐ 26

A Multi-Agentic AI Assistant/Builder

sinain-hud overlay-v2.8.0 🌱 Seedling ⭐ 5

Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf

sdk-python v1.36.0 🌿 Growing ⭐ 5,602

A model-driven approach to building AI agents in just a few lines of code.

openai-python v2.32.0 🌿 Growing ⭐ 30,457

The official Python library for the OpenAI API

llmware v0.4.6 🌿 Growing ⭐ 14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

obsidian-second-brain v4.0.0 🌿 Growing ⭐ 105

A Claude Code skill that turns your Obsidian vault into a living second brain: autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.

chak-ai v0.3.1 🌿 Growing ⭐ 211

A simple, yet handy, LLM gateway.

prompt-os v1.0.0 🌱 Seedling ⭐ 6

A desktop AI agent that controls your local machine: runs commands, manages files, executes code, browses the web autonomously etc. Supports Claude, GPT, Gemini, Llama, DeepSeek, and more. .exe avail

LIA-Assistant v1.17.1 🌱 Seedling ⭐ 17

Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js: 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.

heurist-agent-framework 0.0.0 🌱 Seedling ⭐ 798

A flexible multi-interface AI agent framework for building agents with reasoning, tool use, memory, deep research, blockchain interaction, MCP, and agents-as-a-service.

claude-code-config 0.0.0 🌱 Seedling ⭐ 88

Claude Code skills, architectural principles, and alternative approaches for AI-assisted development

locallens v0.0.3 🌱 Seedling ⭐ 7

Search your files by talking to them - 100% offline

hermes-gate 0.0.0 🌱 Seedling ⭐ 18

🏛️ Hermes Gate: Terminal TUI for managing remote Hermes Agent sessions with auto-reconnect, detach support, and zero config

mcp-video v1.2.1 🌱 Seedling ⭐ 5

Video editing MCP server for AI agents. 83 tools, 858 tests collected, 3 interfaces. Works with Claude Code, Cursor, and any MCP client. Local, fast, free.

apiclaw v2.0.0 🌱 Seedling ⭐ 7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

Open-Sable v1.7.0 🌱 Seedling ⭐ 18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

radio-gateway v3.3.0 🌱 Seedling ⭐ 5

Ham radio & GMRS gateway, repeater and packet radio: bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B

cloneme 0.0.0 💤 Dormant ⭐ 38

CloneMe is an advanced AI platform that builds your digital twin, an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

ComfyUI-AudioSR main@2026-04-21 🌱 Seedling ⭐ 2

🎶 Enhance audio quality with ComfyUI-AudioSR, a versatile tool for upscaling sounds to 48kHz for better clarity and listening experience.

JianYan main@2026-04-21 🌱 Seedling ⭐ 2

🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.

second-brain 1.0 🌱 Seedling ⭐ 461

Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera

dj-treta-being main@2026-04-19 🌱 Seedling ⭐ 2

Install your own AI DJ Being. She searches, downloads, listens, mixes, and generates music autonomously. 30hrs for $0.04.

seedance-2-ai main@2026-04-21 🌱 Seedling ⭐ 1

🎥 Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.

pyannote-audio 4.0.4 🌱 Seedling

State-of-the-art speaker diarization toolkit

banks 2.4.1 🌱 Seedling

A prompt programming language

magika 1.0.2 🌱 Seedling

A tool to determine the content type of a file with deep learning

docling 2.90.0 🌱 Seedling

SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.

mistral-common 1.11.0 🌱 Seedling

Mistral-common is a library of common utilities for Mistral AI.

elevenlabs 2.44.0 🌱 Seedling

No description

librosa 0.11.0 🌱 Seedling

Python module for audio and music processing

json-repair 0.59.4 🌱 Seedling

A package to repair broken json strings

google-ai-generativelanguage 0.11.0 🌱 Seedling

Google AI Generative Language API client library

keras 3.14.0 🌱 Seedling

Multi-backend Keras

azure-storage-blob 12.28.0 🌱 Seedling

Microsoft Azure Blob Storage Client Library for Python

transformers 5.5.4 🌱 Seedling

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.