freshcrate — Search

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm pythonby BerriAIPython

openshell-deepagent 📁0.0.0🌿 Growing⭐121

A general-purpose coding agent that runs inside an NVIDIA OpenShell sandbox, orchestrated by Deep Agents and powered by NVIDIA Nemotron. The agent writes and executes code in an isolated, policy-gover

pythonby langchain-aiPython

mcp-nixos 📁v2.4.0🌳 Mature⭐597

MCP-NixOS - Model Context Protocol Server for NixOS resources

ai-assistant ai-integration ai-tools anthropic claude developer-tools devops-tools fastmcp pythonby utensilsPython

cognithor 📁v0.92.3🌿 Growing⭐115

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

agent-os ai-agent anthropic autonomous-agent discord-bot document-analysis gdpr-compliant gemini pythonby Alex8791-cyberPython

phoenix 📁arize-phoenix-v14.10.0🏛️ Flagship⭐9,377

AI Observability & Evaluation

agents ai-monitoring ai-observability aiengineering anthropic datasets evals jupyter notebook langchain prompt-engineeringby Arize-aiPython

EvoScientist 📁v0.0.8🌳 Mature⭐2,796

🔬 Harness Vibe Research with Self-evolving AI Scientists

ai-agent ai4science multi-agent-system python vibe-researchby EvoScientistPython

restai 📁v6.1.45🌿 Growing⭐485

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

blocky embeddings fastapi langchain llama llamaindex llm ollama python ragby apocasPython

jarvis 📁v1.28.0🌿 Growing⭐300

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

ai assistant health machine-learning mcp nutrition privacy private pythonby isairPython

clawmetry 📁v0.12.122🌿 Growing⭐275

See your agent think. Real-time observability dashboard for OpenClaw AI agents.

ai-agent clawmetry dashboard monitoring observability openclaw opentelemetry pythonby vivekchandPython

SmarterRouter 📁2.2.5🌿 Growing⭐113

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router pythonby peva3Python

vllm 📁v0.19.1🏛️ Flagship⭐77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference pythonby vllm-projectPython

awesome-cli-coding-agents 📁main@2026-04-18🌿 Growing⭐244

Curated directory of terminal-native AI coding agents and the harnesses that orchestrate them. Covers open-source tools (Pi, OpenCode, Aider, Goose), platform agents (Claude Code, Codex, Gemini CLI),

by bradAGIPython

llamafarm 📁v0.0.31🌳 Mature⭐819

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

ai aiproject chatgpt claude edge edge-computing finetuning-llms gemma prompt-engineering pythonby llama-farmPython

hermes-agent 📁v2026.4.16🌳 Mature⭐107,978

The agent that grows with you

ai ai-agent ai-agents anthropic chatgpt claude claude-code clawdbot pythonby NousResearchPython

monocle 📁v0.7.8🌿 Growing⭐79

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

generative-ai linux-foundation llm-agent llm-inference llms observability opentelemetry oss pythonby monocle2aiPython

openakita 📁v1.27.9🌳 Mature⭐1,655

An open-source AI assistant framework with skills and agent architecture

agent ai assistant automation claw clawd clawdbot openclaw pythonby openakitaPython

open-responses-server 📁v0.4.3🌿 Growing⭐167

Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.

ai codex generative-ai mcp mcp-client openai openai-api openai-codex pythonby teabranchPython

sage 📁0.0.0🌿 Growing⭐261

Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

pythonby NVlabsPython

ai-powered-video-analyzer 📁0.0.0🌿 Growing⭐71

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

ai-video-analysis audio-event-detection blip2 gui image-captioning image-captioning-ai llm llm-summarization pythonby arashsajjadiPython

cyllama 📁0.2.11🌱 Seedling⭐25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

agents cython cython-wrapper llama-cpp python python3 rag stable-diffusion-cpp whisper-cppby shakfuPython

outlines 📁1.2.12🏛️ Flagship⭐13,705

Structured Outputs

cfg generative-ai json llms prompt-engineering python regex structured-generation symbolic-aiby dottxt-aiPython

LLM-API-Key-Proxy 📁dev/build-20260301-1-b62f6e4🌿 Growing⭐465

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

api-key gemini-api large-language-model large-language-models llm pythonby MirrowelPython

unsloth-buddy 📁main@2026-04-15🌿 Growing⭐230

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

apple-silicon claude-code dpo fine-tuning gaslamp grpo huggingface lora pythonby TYH-labsPython

orbit 📁v2.6.6🌿 Growing⭐250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

ai-assistant ai-gateway ai-safety anthropic chatbot developer-tools elasticsearch llm pythonby schmitechPython

AgenticX 📁v0.3.7🌿 Growing⭐114

AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

agent-framework agentic-workflows ai-agent ai-orchestration chatbot desktop-app electron fastapi pythonby DemonDamonPython

llmware 📁v0.4.6🌿 Growing⭐14,862

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

ros-mcp-server 📁v3.0.1🌳 Mature⭐1,176

Connect AI models like Claude & GPT with robots using MCP and ROS.

mcp mcp-server modelcontextprotocol python ros ros-mcp-server ros2 ros2-mcp-serverby robotmcpPython

tsunami 📁main@2026-04-21🌱 Seedling⭐16

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

agentic-ai ai-agent ai-coding-assistant app-builder autonomous-agent code-generation coding-agent developer-tools pythonby gobbleyourdongPython

awesome-opensource-ai 📁main@2026-04-20🌿 Growing⭐2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

agents ai artificial-intelligence awesome awesome-list generative-ai llm machine-learning python ragby alvinrealPython

GTA 📁v0.2.0🌿 Growing⭐143

[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2

llm-agent llm-evaluation pythonby open-compassPython

Dragon-Brain 📁v1.1.0🌱 Seedling⭐43

Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

ai-memory claude codex-cli cursor falkordb gemini-cli knowledge-graph llm-tools pythonby iikarusPython

vektori 📁main@2026-04-19🌿 Growing⭐111

Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.

ai-agents knowledge-graph llm long-term-memory memory ollama open-source pgvector python vector-databaseby vektori-aiPython

auto-deep-researcher-24x7 📁main@2026-04-19🌿 Growing⭐622

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

ai-agent autonomous-agent claude-code deep-learning experiment-automation gpu hyperparameter-tuning llm-agent pythonby Xiangyue-ZhangPython

OmicsClaw 📁main@2026-04-18🌿 Growing⭐124

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

bioinformatics knowledge-graph llm-agent multi-agents multi-omics python single-cell spatial-transcriptomicsby TianGzlabPython

rag-chatbot 📁main@2026-04-14🌿 Growing⭐407

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

chatbot chromadb gpu lamacpp llama3 llm python qwen3-5 ragby umbertogriffoPython

codexlens-search 📁v0.8.0🌱 Seedling⭐44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

pythonby catlog22Python

RIGEL 📁0.0.0🌱 Seedling⭐26

A Multi-Agentic AI Assistant/Builder

agentic-ai ai-assistant ai-framework chatbot dbus groq linux llm pythonby Zerone-LaboratoriesPython

Auto-Use 📁V1.0🌱 Seedling⭐24

Auto-Use Computer Use — drives your OS, browser, scours the web, writes your code. One agent, end to end.

agentic-ai ai-agents anthropic anthropic-claude autonomous-agents browser-use claude-ai computer-use llm-agent pythonby auto-usePython

vllm-cli 📁v0.2.5💤 Dormant⭐491

A command-line interface tool for serving LLM using vLLM.

llm llm-inference llm-tools python vllmby Chen-zexiPython

robots 📁v0.3.8🌱 Seedling⭐44

Control robots and physical hardware with natural language through Strands Agents.

agentic agentic-ai ai genai gr00t lerobot machine-learning pythonby strands-labsPython

free-claude-code 📁main@2026-04-21🌱 Seedling⭐11

🚀 Use Claude Code CLI for free with NVIDIA's unlimited API. This proxy converts requests to NIM format and integrates with a Telegram bot for remote control.

acp ai-agent ai-tools anthropic-ai antigravity browser-automation chat citations code-generation pythonby Andrewkeith83Python

Somi 📁Mineralization🌱 Seedling⭐20

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.

ai-agents ai-framework arti automation cli gui homeb local pythonby Somi-ProjectPython

My_AI 📁v7.2.0🌱 Seedling⭐7

Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your

ai-assistant chatbot chromadb code-generation customtkinter document-processing gui local-llm pythonby gonicolas12Python

JianYan 📁main@2026-04-21🌱 Seedling⭐2

🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.

ai-agent asr audiototext funasr github-config nvidia openai productivity pythonby Jnewton-labPython

enton 📁main@2026-04-21🌱 Seedling⭐1

Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.

ai autonomous-agent computer-vision cuda github-config llm python pytorchby tareq3743Python

cuda-toolkit13.2.1🌱 Seedling

CUDA Toolkit meta-package

cuda nvidia pypiby pypiPython

nvidia-cuda-cupti-cu1212.9.79🌱 Seedling

CUDA profiling tools runtime libs.

cuda deep learning machine nvidia pypi runtimeby Nvidia CUDA Installer TeamPython

tritonclient2.67.0🌱 Seedling

Python client library and utilities for communicating with Triton Inference Server

client grpc http inference pypi server service tensorrt tritonby NVIDIA Inc.Python