Search results for "gpu"
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. ππ» Integrates with 50+ LLM Providers,
Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. T
βΎοΈ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack βΎοΈ
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
ARIS βοΈ (Auto-Research-In-Sleep) β Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in β works wi
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
EdgeCrab π¦ A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw β Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)
Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
Knowledge Engine for AI Agent Memory in 6 lines of code
Semiont supports human+ai collaborative knowledge work. Use it as: a Wiki, Semantic Layer, Context Graph, Knowledge Base, Annotator, Research Tool, or Agentic Memory...
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll
JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.
Generic rag framework to apply the power of LLMs on any given dataset
Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
Dragon Brain β persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
π₯ An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l
AgenticX is a unified, production-ready multi-agent platform β Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra
The agent that grows with you
β₯ AI Coding agent for the terminal β hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
A Multi-Agentic AI Assistant/Builder
autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.
mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI β β©0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.
2026 swarm Agent εΉ΄οΌswarm Agent γAgent teamγ ai codingγskillγmemoryγevolveγagentic RL η AI Agentιε
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding β they're redefining how software changes the world.
Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.
One API for 20+ LLM providers, your databases, and your files β self-hosted, open-source AI gateway with RAG, voice, and guardrails.
METAβAGENTIC Ξ±βAGI ποΈβ¨ β Mission π― Endβtoβend: Identify π β OutβLearn π β OutβThink π§ β OutβDesign π¨ β OutβStrategise βοΈ β OutβExecute β‘
A high-throughput and memory-efficient inference and serving engine for LLMs
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic
MeiGen-AI-Design-MCP β Turn Claude Code / OpenClaw into your local Lovart. Local ComfyUI, 1,400+ prompt library, multi-direction parallel generation.
Unified framework for building enterprise RAG pipelines with small, specialized models
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
A curated list of awesome works related to high dimensional structure/vector search & database
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. π Official updates only via twitter @Martin993
NextPlaid, ColGREP: Multi-vector search, from database to coding agents.
Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.
Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
π¨ Enhance your Delphi applications with FluidVCL, a modern set of high-performance VCL components offering sleek visuals and full customization.
OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u
Swift-based vector database for on-device RAG using MLTensor and MLX Embedders
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
A comprehensive Model Context Protocol (MCP) server that enables AI assistants to control Unreal Engine through the native C++ Automation Bridge plugin. Built with TypeScript and C++.
Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) β it hunts, it reports. Inspired by the Obsidian Order.
Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).
MCP Server for Computer Use in Windows
Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal
Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).
"RAG-Anything: All-in-One RAG Framework"
High-Performance Engine for Multi-Vector Search
Lightweight semantic code search engine β 2-stage vector + FTS + RRF fusion + MCP server for Claude Code
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
DSL and compiler framework for automated finite-differences and stencil computation
A guidance language for controlling large language models.
Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features β fully self-hosted via Ollama.
A type-safe, lightweight, modern, and performant binding Java binding of Microsoft's ONNX Runtime
KawaiiGPT β Open-source LLM gateway accessing DeepSeek, Gemini, and Kimi-K2 through reverse-engineered Pollinations API with no API keys required, built-in prompt injection capabilities for security r
Local AI anywhere, for everyone β LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
Open Framework for AI Agents to play Red Alert through Reinforcement Learning
Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.
Self-hosted orchestrator for AI autonomous agents. Run Claude Code & Open Code in isolated linux workspaces. Manage your skills, configs and encrypted secrets with a git repo.
Explore curated Seedance 2.0 prompts with proven results, clear sources, and ready-to-use templates for faster content generation.
Implement a Pytorch-like DL library in C++ from scratch, step by step
An easy-to-use framework for modular RAG
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
Broken RAG For The Broken Souls
AI co-pilot for ComfyUI β 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate β built to help studen
Your personal Multi-Agent portfolio manager and financial analyst team
Deliver high-speed 4K text-to-image generation with 5-character consistency using the open-source Gemini 3.1 Flash Image model.
Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.
π Run Python on Kaggle's free GPUs directly from your terminal without the need for a browser, streamlining your data science workflow.
High-performance crystal structure modeling and DFT/MD file preparation. Native desktop app fusing a Rust/C++ physics kernel, a GPU-accelerated Metal/Vulkan renderer, and an AI-driven command bus for
π Generate JSON prompts for ultra-photorealistic images of nano bananas and related subjects, ensuring reproducible and high-quality visual outputs.
A code generator for array-based code on CPUs and GPUs
Lightweight hallucination detection framework for RAG applications
A command-line interface tool for serving LLM using vLLM.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
