freshcrate

Search results for "audio"

92 results found
restai📁v6.1.45🌿 Growing483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLMs supported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics, etc. Built-in image/audio generat

RAPTOR📁0.0.0🌱 Seedling13

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

comfyui-workflow-skill📁0.0.0🌿 Growing110

Natural language → ComfyUI workflow JSON. 34 built-in templates, 360+ node definitions, auto model download. Supports txt2img, img2img, txt2vid, img2vid, audio, 3D generation across SD1.5/SDXL/S

ai-powered-video-analyzer📁0.0.0🌿 Growing68

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

llama.cpp📁b8864🌳 Mature103,119

LLM inference in C/C++

npcpy📁v1.4.21🌳 Mature1,287

The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

agentfield📁v0.1.70🌳 Mature1,405

Framework for AI Backend. Build and run AI agents like microservices - scalable, observable, and identity-aware from day one.

jarvis📁v1.28.0🌿 Growing174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

neurolink📁v9.56.0🌿 Growing121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.

skales📁v10.0.4🌳 Mature769

Your local AI Desktop Agent for Windows, macOS & Linux. Agent Skills (SKILL.md), autonomous coding (Codework), multi-agent teams, desktop automation, 15+ AI providers, Desktop Buddy. No Docker, no ter

OmniRoute📁v3.6.9🌳 Mature2,435

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

cyllama📁0.2.11🌱 Seedling22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

litellm📁v1.83.7-stable🌳 Mature42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

edgecrab📁v0.7.0🌱 Seedling21

EdgeCrab 🦀 A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw — Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters

osaurus📁0.16.16🌳 Mature4,912

Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.

solace-agent-mesh📁1.18.40🌳 Mature3,101

An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-s

new-api📁v0.12.14🌳 Mature26,168

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for pers

WeKnora📁v0.4.0🌳 Mature13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

LLM-Agents-Ecosystem-Handbook📁0.0.0🌳 Mature508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

UGTLive📁0.0.0🌿 Growing73

An easy-to-use GUI-based tool that performs live translations using OCR and LLMs (either cloud or local only)

aitools_client📁0.0.0🌿 Growing182

Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters

cognithor📁v0.92.2🌿 Growing94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

opencode-telegram-bot📁v0.17.0🌿 Growing419

OpenCode mobile client via Telegram: run and monitor AI coding tasks from your phone while everything runs locally on your machine. Scheduled tasks support. Can be used as lightweight OpenClaw alterna

Autonomous-Agents📁main@2026-04-16🌿 Growing1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

Awesome-Context-Engineering📁0.0.0🌳 Mature3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

antigravity-awesome-skills📁main@2026-04-21🌱 Seedling30

🌌 Explore 255+ essential skills for AI coding assistants like Claude Code and GitHub Copilot to enhance your development workflow.

mcp-client-for-ollama📁v0.28.0🌿 Growing599

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

Generative-Media-Skills📁main@2026-04-13🌿 Growing3,015

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.

lobehub📁v2.1.52🌿 Growing75,054

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effo

vllm-mlx📁v0.2.8🌿 Growing798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

simplechat📁v0.241.006🌿 Growing128

Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features

VideoGraphAI📁0.0.0🌿 Growing54

🎬 AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.

oh-my-pi📁v14.1.2🌿 Growing2,872

⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

mcp📁2026.04.20260414152327🌿 Growing8,740

Official MCP Servers for AWS

models📁main@2026-04-21🌿 Growing72

This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.

Awesome-World-Models📁main@2026-04-21🌿 Growing1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

Cogitator-AI📁main@2026-04-21🌱 Seedling35

🤖 Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.

memory_agent_hub📁main@2026-04-20🌱 Seedling38

2026, the year of the swarm Agent: a collection of AI Agent resources covering swarm Agents, Agent teams, AI coding, skills, memory, evolution, agentic RL, and more

strudel-mcp-server📁v2.0.0🌿 Growing186

A Model Context Protocol (MCP) server that gives Claude direct control over Strudel.cc for AI-assisted music generation and live coding.

tools📁main@2026-04-20🌿 Growing1,602

Assorted useful tools, almost entirely generated using LLMs

vexa📁v0.10.2🌿 Growing1,862

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

sdk-python📁v1.36.0🌿 Growing5,602

A model-driven approach to building AI agents in just a few lines of code.

openai-python📁v2.32.0🌿 Growing30,457

The official Python library for the OpenAI API

mcp-openmsx📁v1.2.9🌿 Growing51

A Model Context Protocol (MCP) server for automating openMSX emulator instances. This server provides comprehensive tools for MSX software development, testing, and automation through standardized MCP

llmware📁v0.4.6🌿 Growing14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

LocalAI📁v4.1.3🌱 Seedling45,254

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

sutando📁v0.1-demo🌿 Growing111

Summon your AI superpower — voice, vision, and autonomous action

AgenticGoKit📁v0.5.9🌿 Growing134

Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o

MakerAi📁master@2026-04-11🌿 Growing159

The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp

Agent-World-Protocol📁main@2026-04-10🌱 Seedling45

The open world for autonomous AI agents on Solana. Trade. Build. Fight. Earn. Explore. Connect your AI agent to a persistent shared world. Trade real SOL, build structures, form guilds, fight for terri

obsidian-second-brain📁v4.0.0🌿 Growing105

A Claude Code skill that turns your Obsidian vault into a living second brain — autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.

chak-ai📁v0.3.1🌿 Growing211

A simple, yet handy, LLM gateway.

ds_ex📁main@2026-04-09🌱 Seedling17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

Open-Sable📁v1.7.0🌱 Seedling18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

openakita📁v1.25.18🌱 Seedling1,613

An open-source AI assistant framework with skills and agent architecture

chroma-go📁v0.4.1🌱 Seedling202

The Go client for Chroma vector database

semantic-kernel📁python-1.41.2🌱 Seedling27,684

Integrate cutting-edge LLM technology quickly and easily into your apps

sqlite-vector📁0.9.95🌱 Seedling832

SQLite-Vector is a cross-platform, ultra-efficient SQLite extension that brings vector search capabilities to your embedded database.

everything-claude-code📁v1.10.0🌱 Seedling151,139

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

Unreal_mcp📁v0.5.21🌱 Seedling495

A comprehensive Model Context Protocol (MCP) server that enables AI assistants to control Unreal Engine through the native C++ Automation Bridge plugin. Built with TypeScript and C++.

ten-framework📁0.11.63🌱 Seedling10,390

Open-source framework for conversational voice AI agents

TomoriBot📁v0.7.904🌱 Seedling33

A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! | Fully equipped with long-term memory, personas, and tool integration. A next-generation "autonomous AI agent" Discord bot!

llm_intents📁1.7.1🌱 Seedling111

Exposes internet search tools for use by LLM-backed Assist in Home Assistant

radio-gateway📁v3.3.0🌱 Seedling5

Ham radio & GMRS gateway, repeater and packet radio — bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B

ruby-mcp-client📁1.0.1🌱 Seedling102

This is a Ruby implementation of MCP (Model Context Protocol) client

mcp-use📁python-v1.7.0🌱 Seedling9,760

The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.

ai-runbook📁master@2026-04-20🌱 Seedling2

A dotfiles repo that treats AI agent behavior as infrastructure

DreamServer📁v2.0.0🌱 Seedling478

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

Zen-Ai-Pentest📁v3.0.0🌱 Seedling279

🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️

cloneme📁0.0.0💤 Dormant38

CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

py-gpt📁v2.7.12🌱 Seedling1,724

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

APT📁2.9.16.0🌱 Seedling774

AI Productivity Tool - Free and open source; improves user productivity and protects privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and o

llm-stream📁main@2026-04-21🌱 Seedling2

Stream responses from OpenAI and Anthropic models with lightweight C++ tools for efficient large language model integration.

ComfyUI-AudioSR📁main@2026-04-21🌱 Seedling2

🎶 Enhance audio quality with ComfyUI-AudioSR, a versatile tool for upscaling sounds to 48kHz for better clarity and listening experience.

JianYan📁main@2026-04-21🌱 Seedling2

🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.

ai-dev-kit📁master@2026-04-21🌱 Seedling2

Enable AI coding assistants to build reliable Databricks workflows, pipelines, and dashboards with trusted sources and streamlined development.

goskills📁v0.6.0🌱 Seedling176

A tool that supports OpenAI and other LLMs with Claude Skills; you can also use it as a subagent

KREASYS📁main@2026-04-21🌱 Seedling2

Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.

dj-treta-being📁main@2026-04-19🌱 Seedling2

Install your own AI DJ Being. She searches, downloads, listens, mixes, and generates music — autonomously. 30hrs for $0.04.

spank📁master@2026-04-21🌱 Seedling1

Detect physical hits on your laptop and play audio responses using sensors in a lightweight, cross-platform binary.

seedance-2.0📁main@2026-04-21🌱 Seedling1

Power advanced AI to create films using text, images, audio, and video inputs with a flexible quad-modal filmmaking engine.

agentic-news-generator📁main@2026-04-20🌱 Seedling1

Generate a custom newspaper with an AI agent based on your favorite YouTube channels.

CoexistAI📁v2.6💤 Dormant464

CoexistAI is a modular, developer-friendly research assistant framework. It enables you to build, search, summarize, and automate research workflows using LLMs, web search, Reddit, YouTube, and mappi

ai-video-generation-workflow📁main@2026-04-21🌱 Seedling1

Generate reliable short finance explainer videos with script, slides, voice, subtitles, and batch-ready rendering in a stable, modular workflow.

Discord-Alternatives📁main@2026-04-21🌱 Seedling1

Explore alternatives to Discord with a curated list of early-stage apps, evaluating features, hosting, and encryption to guide your choice.

seedance-api📁main@2026-04-21🌱 Seedling1

🎬 Provide unofficial API access and documentation for Seedance 2.0 to enable video generation with ByteDance’s model.

seedance-2-ai📁main@2026-04-21🌱 Seedling1

🎥 Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.

EliteAgent📁main@2026-04-17🌱 Seedling1

The ultimate native macOS AI Agent. Blends local MLX SLMs with 3D cognitive Metal rendering and autonomous system integrations.