freshcrate

Search results for "gpu"

99 results found
openlit · openlit-1.18.1 · 🌿 Growing · ⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

NornicDB · v1.0.42-hotfix · 🌿 Growing · ⭐401

Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. T

helix · 2.9.30 · 🌿 Growing · ⭐757

♾️ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack ♾️

llama.cpp · b8864 · 🌳 Mature · ⭐103,119

LLM inference in C/C++

pi-mono · v0.68.0 · 🌳 Mature · ⭐34,430

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

jarvis · v1.28.0 · 🌿 Growing · ⭐174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

compose-for-agents · main@2026-04-20 · 🌳 Mature · ⭐910

Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.

Auto-claude-code-research-in-sleep · v0.4.4 · 🌳 Mature · ⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) - Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in - works wi

cyllama · 0.2.11 · 🌱 Seedling · ⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

edgecrab · v0.7.0 · 🌱 Seedling · ⭐21

EdgeCrab 🦀 A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw - Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters

WeKnora · v0.4.0 · 🌳 Mature · ⭐13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agentic-memory · 0.0.0 · 🌿 Growing · ⭐162

No description

by lhl
Constrained-Text-Generation-Studio · 0.0.0 · 🌿 Growing · ⭐216

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

LRAT · 0.0.0 · 🌱 Seedling · ⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

UGTLive · 0.0.0 · 🌿 Growing · ⭐73

An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)

aitools_client · 0.0.0 · 🌿 Growing · ⭐182

Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters

cognithor · v0.92.2 · 🌿 Growing · ⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

semiont · v0.4.20 · 🌱 Seedling · ⭐44

Semiont supports human+ai collaborative knowledge work. Use it as: a Wiki, Semantic Layer, Context Graph, Knowledge Base, Annotator, Research Tool, or Agentic Memory...

ai-powered-video-analyzer · 0.0.0 · 🌿 Growing · ⭐68

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

JRVS · 0.0.0 · 🌿 Growing · ⭐236

JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.

RAGMeUp · scala-ui · 🌳 Mature · ⭐675

Generic RAG framework to apply the power of LLMs on any given dataset

MiniSearch · main@2026-04-20 · 🌿 Growing · ⭐553

Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

Dragon-Brain · v1.1.0 · 🌱 Seedling · ⭐43

Dragon Brain - persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

auto-deep-researcher-24x7 · main@2026-04-19 · 🌿 Growing · ⭐261

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

SmarterRouter · 2.2.5 · 🌿 Growing · ⭐105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

rag-chatbot · main@2026-04-14 · 🌿 Growing · ⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

shai · v0.0.9 · 🌱 Seedling · ⭐39

sandboxing shell for ai coding agents

mcp-client-for-ollama · v0.28.0 · 🌿 Growing · ⭐599

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

AgenticX · v0.3.7 · 🌿 Growing · ⭐105

AgenticX is a unified, production-ready multi-agent platform - Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

synaptic-memory · v0.16.0 · 🌱 Seedling · ⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

SocratiCode · v1.6.1 · 🌿 Growing · ⭐810

Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra

hermes-agent · v2026.4.16 · 🌿 Growing · ⭐57,954

The agent that grows with you

oh-my-pi · v14.1.2 · 🌿 Growing · ⭐2,872

⌥ AI Coding agent for the terminal - hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

RIGEL · 0.0.0 · 🌱 Seedling · ⭐26

A Multi-Agentic AI Assistant/Builder

tsunami · main@2026-04-21 · 🌱 Seedling · ⭐13

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

NeuronFS · main@2026-04-21 · 🌿 Growing · ⭐136

mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI - ₩0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.

memory_agent_hub · main@2026-04-20 · 🌱 Seedling · ⭐38

2026 is the year of the swarm agent: a collection of AI agent resources covering swarm agents, agent teams, AI coding, skills, memory, evolution, agentic RL, and more.

awesome-code-agents · main@2026-04-20 · 🌿 Growing · ⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding - they're redefining how software changes the world.

vexa · v0.10.2 · 🌿 Growing · ⭐1,862

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

orbit · v2.6.6 · 🌿 Growing · ⭐250

One API for 20+ LLM providers, your databases, and your files - self-hosted, open-source AI gateway with RAG, voice, and guardrails.

AGI-Alpha-Agent-v0 · main@2026-04-18 · 🌿 Growing · ⭐283

META‑AGENTIC α‑AGI 👁️✨ - Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚑

vllm · v0.19.1 · 🌿 Growing · ⭐76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

milvus · v2.6.15 · 🌿 Growing · ⭐43,734

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

paiml-mcp-agent-toolkit · v3.14.0 · 🌿 Growing · ⭐148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

MeiGen-AI-Design-MCP · v1.2.8 · 🌿 Growing · ⭐569

MeiGen-AI-Design-MCP - Turn Claude Code / OpenClaw into your local Lovart. Local ComfyUI, 1,400+ prompt library, multi-direction parallel generation.

llmware · v0.4.6 · 🌿 Growing · ⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

LocalAI · v4.1.3 · 🌱 Seedling · ⭐45,254

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

ai-real-estate-assistant · dev@2026-04-13 · 🌿 Growing · ⭐159

Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.

awesome-vector-database · main@2026-04-13 · 🌿 Growing · ⭐341

A curated list of awesome works related to high dimensional structure/vector search & database

deep-research-mcp · main@2026-04-13 · 🌿 Growing · ⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

vllm-mlx · v0.2.8 · 🌿 Growing · ⭐798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

agenticSeek · main@2026-04-11 · 🌿 Growing · ⭐25,891

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993

next-plaid · v1.2.0 · 🌿 Growing · ⭐331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

knowhere · v2.6.11 · 🌱 Seedling · ⭐337

Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.

smg · v1.4.1 · 🌿 Growing · ⭐156

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,

UltraRAG · v0.3.0.2 · 🌿 Growing · ⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

FluidVCL · main@2026-04-21 · 🌱 Seedling · ⭐7

🎨 Enhance your Delphi applications with FluidVCL, a modern set of high-performance VCL components offering sleek visuals and full customization.

oramacore · v1.2.38 · 🌱 Seedling · ⭐249

OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u

VecturaKit · 5.3.0 · 🌱 Seedling · ⭐280

Swift-based vector database for on-device RAG using MLTensor and MLX Embedders

AutoRAG · v0.3.22 · 🌱 Seedling · ⭐4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Unreal_mcp · v0.5.21 · 🌱 Seedling · ⭐495

A comprehensive Model Context Protocol (MCP) server that enables AI assistants to control Unreal Engine through the native C++ Automation Bridge plugin. Built with TypeScript and C++.

vikramaditya · main@2026-04-20 · 🌱 Seedling · ⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) - it hunts, it reports. Inspired by the Obsidian Order.

animaworks · v0.6.2 · 🌱 Seedling · ⭐225

Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).

Windows-MCP · v0.7.1 · 🌱 Seedling · ⭐5,075

MCP Server for Computer Use in Windows

neurostack · v0.11.1 · 🌱 Seedling · ⭐40

Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal

droid-llm-hunter · v1.0.0 · 🌱 Seedling · ⭐95

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

RAG-Anything · v1.2.10 · 🌱 Seedling · ⭐15,557

"RAG-Anything: All-in-One RAG Framework"

fast-plaid · 1.4.5 · 🌱 Seedling · ⭐239

High-Performance Engine for Multi-Vector Search

codexlens-search · v0.8.0 · 🌱 Seedling · ⭐44

Lightweight semantic code search engine - 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

jan · v0.7.9 · 🌱 Seedling · ⭐41,710

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

devito · v4.8.21 · 🌱 Seedling · ⭐689

DSL and compiler framework for automated finite-differences and stencil computation

guidance · 0.3.2 · 🌱 Seedling · ⭐21,378

A guidance language for controlling large language models.

Somi · Mineralization · 🌱 Seedling · ⭐21

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features - fully self-hosted via Ollama.

onnxruntime-java · v2.1.0 · 🌱 Seedling · ⭐29

A type-safe, lightweight, modern, and performant Java binding of Microsoft's ONNX Runtime

KawaiiGPT · KawaiiGPT · 🌱 Seedling · ⭐831

KawaiiGPT - Open-source LLM gateway accessing DeepSeek, Gemini, and Kimi-K2 through reverse-engineered Pollinations API with no API keys required, built-in prompt injection capabilities for security r

DreamServer · v2.0.0 · 🌱 Seedling · ⭐478

Local AI anywhere, for everyone - LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

OpenRA-RL · v0.4.1 · 🌱 Seedling · ⭐118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

ragtable-extract · main@2026-04-21 · 🌱 Seedling · ⭐1

Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.

sandboxed.sh · v0.10.0 · 🌱 Seedling · ⭐371

Self-hosted orchestrator for AI autonomous agents. Run Claude Code & Open Code in isolated linux workspaces. Manage your skills, configs and encrypted secrets with a git repo.

awesome-seedance-prompts · main@2026-04-21 · 🌱 Seedling · ⭐4

Explore curated Seedance 2.0 prompts with proven results, clear sources, and ready-to-use templates for faster content generation.

OriginDL · v1.0.0 · 🌱 Seedling · ⭐245

Implement a PyTorch-like DL library in C++ from scratch, step by step

PAI-RAG · v0.4.3 · 🌱 Seedling · ⭐450

An easy-to-use framework for modular RAG

ragflow · v0.24.0 · 🌱 Seedling · ⭐77,784

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

py-gpt · v2.7.12 · 🌱 Seedling · ⭐1,724

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

DOX · main@2026-04-15 · 🌱 Seedling · ⭐1

Broken RAG For The Broken Souls

Comfy-Cozy · v4.0.0 · 🌱 Seedling · ⭐3

AI co-pilot for ComfyUI - 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.

uniAI · 0.0.0 · 🌱 Seedling · ⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate - built to help studen

scorpio-analyst · v0.2.4 · 🌱 Seedling · ⭐1

Your personal Multi-Agent portfolio manager and financial analyst team

nano-banana-2-ai · main@2026-04-21 · 🌱 Seedling · ⭐2

Deliver high-speed 4K text-to-image generation with 5-character consistency using the open-source Gemini 3.1 Flash Image model.

KREASYS · main@2026-04-21 · 🌱 Seedling · ⭐2

Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.

kagglerun · master@2026-04-21 · 🌱 Seedling · ⭐1

🚀 Run Python on Kaggle's free GPUs directly from your terminal without the need for a browser, streamlining your data science workflow.

CrystalCanvas · main@2026-04-14 · 🌱 Seedling · ⭐1

High-performance crystal structure modeling and DFT/MD file preparation. Native desktop app fusing a Rust/C++ physics kernel, a GPU-accelerated Metal/Vulkan renderer, and an AI-driven command bus for

🍌 Generate JSON prompts for ultra-photorealistic images of nano bananas and related subjects, ensuring reproducible and high-quality visual outputs.

loopy · v2025.2 · 💀 Dormant · ⭐629

A code generator for array-based code on CPUs and GPUs

LettuceDetect · 0.1.8 · 💀 Dormant · ⭐545

Lightweight hallucination detection framework for RAG applications

vllm-cli · v0.2.5 · 💀 Dormant · ⭐487

A command-line interface tool for serving LLM using vLLM.

Qwen-Agent · v0.0.26 · 💀 Dormant · ⭐15,963

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

replicate-python · 1.0.7 · 💀 Dormant · ⭐900

Python client for Replicate