Search results for "nvidia"
Portable mixed-precision BLAS-like vector math library for x86 and ARM
Dynamic versioning based on VCS tags for uv/hatch project
Faster Whisper transcription with CTranslate2
Efficient, Flexible and Portable Structured Generation
A framework for elegantly configuring complex applications
SGLang is a fast serving framework for large language models and vision language models.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
A general-purpose coding agent that runs inside an NVIDIA OpenShell sandbox, orchestrated by Deep Agents and powered by NVIDIA Nemotron. The agent writes and executes code in an isolated, policy-gover
MCP-NixOS - Model Context Protocol Server for NixOS resources
The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
eBPF-based GPU causal observability agent
Every meeting, every idea, every voice note — searchable by your AI. Open-source, privacy-first conversation memory layer.
AI Observability & Evaluation
🔬 Harness Vibe Research with Self-evolving AI Scientists
Your smart, reliable, and friendly personal AI assistant.
gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditin
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
Agentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, m
Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for
See your agent think. Real-time observability dashboard for OpenClaw AI agents.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. T
A high-throughput and memory-efficient inference and serving engine for LLMs
Curated directory of terminal-native AI coding agents and the harnesses that orchestrate them. Covers open-source tools (Pi, OpenCode, Aider, Goose), platform agents (Claude Code, Codex, Gemini CLI),
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
The agent that grows with you
The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.
Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
NextPlaid, ColGREP: Multi-vector search, from database to coding agents.
An open-source AI assistant framework with skills and agent architecture
One API for 25+ LLMs, OpenAI, Anthropic, Bedrock, Azure. Caching, guardrails & cost controls. Go-native LiteLLM & Kong AI Gateway alternative.
565 AI-callable tools across 16 MCP servers. Full-pipeline AAA game asset production. Controls Blender, Substance Suite, Maya, Houdini, and Unreal Engine 5. 50 specialized AI agents. One prompt in, ga
Integrate cutting-edge LLM technology quickly and easily into your apps
An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)
OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
An AI agent for teams, communities, and multi-user environments.
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.
Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek, Groq, Cerebras.
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll
Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent — watches, understand
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Structured Outputs
Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.
Self-hosted orchestrator for AI autonomous agents. Run Claude Code & Open Code in isolated linux workspaces. Manage your skills, configs and encrypted secrets with a git repo.
Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc
One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.
AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers
Unified framework for building enterprise RAG pipelines with small, specialized models
Connect AI models like Claude & GPT with robots using MCP and ROS.
autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.
Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
✨ A curated list of awesome community resources, integrations, and examples of Redis in the AI ecosystem.
🔴 VERY LARGE AI TOOL LIST! 🔴 Curated list of AI Tools - Updated 2026
Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.
🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.
Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
A curated list of awesome works related to high dimensional structure/vector search & database
A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! | 長期記憶やペルソナ、ツール連携を完備。 次世代の「自律型AIエージェント」Discordボット!
Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code
Production-ready AI agent framework — semantic memory, multi-agent mesh, MCP server, intelligent routing, governance, and 67+ platform integrations.
Build your AI team with Crewform. Orchestrate specialized, autonomous agents to collaborate on complex tasks and connect outputs to your stack. — AI Orchestration for Everyone
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
A Multi-Agentic AI Assistant/Builder
kbot — the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm
Auto-Use Computer Use — drives your OS, browser, scours the web, writes your code. One agent, end to end.
Multiplatform system monitoring tool using Cosmopolitan Libc
Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
A command-line interface tool for serving LLM using vLLM.
Control robots and physical hardware with natural language through Strands Agents.
🚀 Use Claude Code CLI for free with NVIDIA's unlimited API. This proxy converts requests to NIM format and integrates with a Telegram bot for remote control.
Implement a Pytorch-like DL library in C++ from scratch, step by step
Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.
A curated list of vector database solutions, libraries, and resources for AI applications - https://vectordb.works
Multi-LLM agent orchestration TUI — parallel Claude/Gemini/Codex sessions, 126 MCP tools
Nikola — autonomous AI system based on ATPM consciousness architecture. Aria is its primary language substrate.
Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your
🎤 Transform speech to text on Windows with fast, local AI processing. Enjoy seamless recording and automatic integration for effective communication.
Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.
CUDA profiling tools runtime libs.
