freshcrate

Search results for "nvidia"

103 results found
simsimd📁6.5.16🌳 Mature1,801

Portable mixed-precision BLAS-like vector math library for x86 and ARM

uv-dynamic-versioning📁0.14.0🌿 Growing189

Dynamic versioning based on VCS tags for uv/hatch project

cupy-cuda12x📁14.0.1🏛️ Flagship10,905

CuPy: NumPy & SciPy for GPU

faster-whisper📁1.2.1🏛️ Flagship22,327

Faster Whisper transcription with CTranslate2

xgrammar📁0.1.33🌳 Mature1,637

Efficient, Flexible and Portable Structured Generation

hydra-core📁1.3.2🏛️ Flagship10,328

A framework for elegantly configuring complex applications

keras📁3.14.0🏛️ Flagship64,025

Multi-backend Keras

sglang📁0.5.10.post1🏛️ Flagship26,220

SGLang is a fast serving framework for large language models and vision language models.

litellm📁v1.83.7-stable🏛️ Flagship44,168

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

openshell-deepagent📁0.0.0🌿 Growing121

A general-purpose coding agent that runs inside an NVIDIA OpenShell sandbox, orchestrated by Deep Agents and powered by NVIDIA Nemotron. The agent writes and executes code in an isolated, policy-gover

llama.cpp📁b8871🏛️ Flagship105,537

LLM inference in C/C++

mcp-nixos📁v2.4.0🌳 Mature597

MCP-NixOS - Model Context Protocol Server for NixOS resources

ClawRouter📁v0.12.159🏛️ Flagship6,212

The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.

cognithor📁v0.92.3🌿 Growing115

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

minutes📁v0.13.3🌳 Mature1,116

Every meeting, every idea, every voice note — searchable by your AI. Open-source, privacy-first conversation memory layer.

EvoScientist📁v0.0.8🌳 Mature2,796

🔬 Harness Vibe Research with Self-evolving AI Scientists

Joanium📁v2026.421.1🌱 Seedling23

Your smart, reliable, and friendly personal AI assistant.

gproxy📁v1.0.18🌿 Growing104

gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditin

spiceai📁v2.0.0-rc.3🌳 Mature2,880

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

restai📁v6.1.45🌿 Growing485

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLMs supported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

jarvis📁v1.28.0🌿 Growing300

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

Agenvoy📁v0.19.4🌿 Growing61

Agentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution

onnxruntime📁v1.25.0🏛️ Flagship19,924

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

haystack📁v2.28.0🏛️ Flagship24,941

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, m

mesh-llm📁v0.64.0🌳 Mature834

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

OmniRoute📁v3.6.9🌳 Mature3,250

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

clawmetry📁v0.12.122🌿 Growing275

See your agent think. Real-time observability dashboard for OpenClaw AI agents.

SmarterRouter📁2.2.5🌿 Growing113

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

NornicDB📁v1.0.42-hotfix🌳 Mature643

Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. T

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

awesome-cli-coding-agents📁main@2026-04-18🌿 Growing244

Curated directory of terminal-native AI coding agents and the harnesses that orchestrate them. Covers open-source tools (Pi, OpenCode, Aider, Goose), platform agents (Claude Code, Codex, Gemini CLI),

llamafarm📁v0.0.31🌳 Mature819

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

milvus📁v2.6.15🏛️ Flagship43,898

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

cordum📁V0.9.9.1🌿 Growing465

The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.

monocle📁v0.7.8🌿 Growing79

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

WeKnora📁v0.4.0🏛️ Flagship13,971

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

oh-my-pi📁v14.1.2🌳 Mature3,285

⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

next-plaid📁v1.2.0🌿 Growing383

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

openakita📁v1.27.9🌳 Mature1,655

An open-source AI assistant framework with skills and agent architecture

ai-gateway📁v1.0.4🌿 Growing68

One API for 25+ LLMs, OpenAI, Anthropic, Bedrock, Azure. Caching, guardrails & cost controls. Go-native LiteLLM & Kong AI Gateway alternative.

ai-forge-mcp📁0.0.0🌿 Growing51

565 AI-callable tools across 16 MCP servers. Full-pipeline AAA game asset production. Controls Blender, Substance Suite, Maya, Houdini, and Unreal Engine 5. 50 specialized AI agents. One prompt in, ga

semantic-kernel📁python-1.41.2🏛️ Flagship27,750

Integrate cutting-edge LLM technology quickly and easily into your apps

UGTLive📁0.0.0🌿 Growing75

An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)

oramacore📁v1.2.38🌿 Growing249

OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u

LocalAI📁v4.1.3🏛️ Flagship45,672

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

spacebot📁v0.4.1🌳 Mature2,119

An AI agent for teams, communities, and multi-user environments.

anything-llm📁v1.12.0🏛️ Flagship58,708

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

open-responses-server📁v0.4.3🌿 Growing167

Wraps any OpenAI API interface as Responses with MCP support so that it supports Codex. Adds any missing stateful features. Ollama and vLLM compliant.

sage📁0.0.0🌿 Growing261

Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

jan📁v0.7.9🏛️ Flagship42,053

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

houtini-lm📁v2.8.0🌿 Growing71

MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek, Groq, Cerebras.

ai-powered-video-analyzer📁0.0.0🌿 Growing71

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

DeepCamera📁v2026.3🌳 Mature2,689

Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent — watches, understand

cyllama📁0.2.11🌱 Seedling25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

LLM-API-Key-Proxy📁dev/build-20260301-1-b62f6e4🌿 Growing465

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

sandboxed.sh📁v0.10.0🌿 Growing392

Self-hosted orchestrator for AI autonomous agents. Run Claude Code & Open Code in isolated linux workspaces. Manage your skills, configs and encrypted secrets with a git repo.

unsloth-buddy📁main@2026-04-15🌿 Growing230

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

orbit📁v2.6.6🌿 Growing250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

AgenticX📁v0.3.7🌿 Growing114

AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers

llmware📁v0.4.6🌿 Growing14,862

Unified framework for building enterprise RAG pipelines with small, specialized models

ros-mcp-server📁v3.0.1🌳 Mature1,176

Connect AI models like Claude & GPT with robots using MCP and ROS.

tsunami📁main@2026-04-21🌱 Seedling16

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

awesome-prompts📁main@2026-04-21🌿 Growing7,671

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

GTA📁v0.2.0🌿 Growing143

[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2

redis-ai-resources📁main@2026-04-20🌿 Growing451

✨ A curated list of awesome community resources, integrations, and examples of Redis in the AI ecosystem.

awesome-ai-tools📁main@2026-04-19🌿 Growing390

🔴 VERY LARGE AI TOOL LIST! 🔴 Curated list of AI Tools - Updated 2026

Dragon-Brain📁v1.1.0🌱 Seedling43

Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

vektori📁main@2026-04-19🌿 Growing111

Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.

auto-deep-researcher-24x7📁main@2026-04-19🌿 Growing622

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

OmicsClaw📁main@2026-04-18🌿 Growing124

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

rag-chatbot📁main@2026-04-14🌿 Growing407

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

TomoriBot📁v0.7.904🌱 Seedling34

A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! | 長期記憶やペルソナ、ツール連携を完備。 次世代の「自律型AIエージェント」Discordボット!

codexlens-search📁v0.8.0🌱 Seedling44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

mayros📁v0.3.2🌱 Seedling10

Production-ready AI agent framework — semantic memory, multi-agent mesh, MCP server, intelligent routing, governance, and 67+ platform integrations.

crewform📁v1.8.2🌱 Seedling10

Build your AI team with Crewform. Orchestrate specialized, autonomous agents to collaborate on complex tasks and connect outputs to your stack. — AI Orchestration for Everyone

FinGPT📁v1.0.0🌱 Seedling19,689

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

kernel📁v3.97.0🌱 Seedling12

kbot — the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

Auto-Use📁V1.0🌱 Seedling24

Auto-Use Computer Use — drives your OS, browser, scours the web, writes your code. One agent, end to end.

cosmotop📁v0.14.0🌱 Seedling64

Multiplatform system monitoring tool using Cosmopolitan Libc

DreamServer📁v2.0.0🌿 Growing443

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

vllm-cli📁v0.2.5💤 Dormant491

A command-line interface tool for serving LLM using vLLM.

robots📁v0.3.8🌱 Seedling44

Control robots and physical hardware with natural language through Strands Agents.

free-claude-code📁main@2026-04-21🌱 Seedling11

🚀 Use Claude Code CLI for free with NVIDIA's unlimited API. This proxy converts requests to NIM format and integrates with a Telegram bot for remote control.

OriginDL📁v1.0.0🌱 Seedling260

Implement a Pytorch-like DL library in C++ from scratch, step by step

Somi📁Mineralization🌱 Seedling20

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.

awesome-vector-databases📁0.0.0🌱 Seedling14

A curated list of vector database solutions, libraries, and resources for AI applications - https://vectordb.works

ralphglasses📁v0.2.0🌱 Seedling3

Multi-LLM agent orchestration TUI — parallel Claude/Gemini/Codex sessions, 126 MCP tools

nikola📁v0.2.7🌱 Seedling1

Nikola — autonomous AI system based on ATPM consciousness architecture. Aria is its primary language substrate.

My_AI📁v7.2.0🌱 Seedling7

Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your

JianYan📁main@2026-04-21🌱 Seedling2

🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.

enton📁main@2026-04-21🌱 Seedling1

Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.

cuda-toolkit📁13.2.1🌱 Seedling

CUDA Toolkit meta-package

tritonclient📁2.67.0🌱 Seedling

Python client library and utilities for communicating with Triton Inference Server