freshcrate — Search

Search results for "cuda"

40 results found

llama.cpp 📁b8864🌳 Mature⭐103,119

LLM inference in C/C++

jarvis 📁v1.28.0🌿 Growing⭐174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

ai assistant health machine-learning mcp nutrition privacy private pythonby isairPython

Auto-claude-code-research-in-sleep 📁v0.4.4🌳 Mature⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi

ai-research ai-tools aris autonomous-agent claude claude-code claude-code-skills codex pythonby wanshuiyinPython

cyllama 📁0.2.11🌱 Seedling⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

agents cython cython-wrapper llama-cpp python python3 rag stable-diffusion-cpp whisper-cppby shakfuPython

Constrained-Text-Generation-Studio 📁0.0.0🌿 Growing⭐216

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

pythonby HellisotherpeoplePython

LRAT 📁0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

agent agentic llm python searchby Yuqi-ZhouPython

cactus 📁0.0.0🌿 Growing⭐50

LLM Agent that leverages cheminformatics tools to provide informed responses.

cheminformatics chemistry foundation-models jupyter notebook llm llm-agent nlp scienceby pnnlJupyter Notebook

UGTLive 📁0.0.0🌿 Growing⭐73

An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)

c#by SethRobinsonC#

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

RAGMeUp 📁scala-ui🌳 Mature⭐675

Generic rag framework to apply the power of LLMs on any given dataset

javascriptby SensAI-PTJavaScript

Dragon-Brain 📁v1.1.0🌱 Seedling⭐43

Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

ai-memory claude codex-cli cursor falkordb gemini-cli knowledge-graph llm-tools pythonby iikarusPython

vllm 📁v0.19.1🌿 Growing⭐76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference pythonby vllm-projectPython

mcp-devtools 📁v0.59.53🌿 Growing⭐133

A modular MCP server that provides commonly used developer tools for AI coding agents

agentic ai cline coding devtools go llm mcp sammcjby sammcjGo

SocratiCode 📁v1.6.1🌿 Growing⭐810

Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra

ai ai-assistant ast claude code-graph codebase-analysis codebase-intelligence docker typescript vector-databaseby giancarloerraTypeScript

VideoGraphAI 📁0.0.0🌿 Growing⭐54

🎬 AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.

ai-tools ai-video-generation artificial-intelligence content-automation content-creation llm machine-learning open-source pythonby mikeoller82Python

arag 📁v0.1.0🌿 Growing⭐247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

agent agentic-ai agenticrag deepresearch evaluation graphrag llm llmagents pythonby Ayanami0730Python

RIGEL 📁0.0.0🌱 Seedling⭐26

A Multi-Agentic AI Assistant/Builder

agentic-ai ai-assistant ai-framework chatbot dbus groq linux llm pythonby Zerone-LaboratoriesPython

AReaL 📁v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rlby inclusionAIPython

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

rag-chatbot 📁main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

chatbot chromadb gpu lamacpp llama3 llm python qwen3-5 ragby umbertogriffoPython

deep-research-mcp 📁main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

pythonby pminerviniPython

next-plaid 📁v1.2.0🌿 Growing⭐331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

agentic-rag cli grep multi-vector rust vector-databaseby lightonaiRust

Open-Sable 📁v1.7.0🌱 Seedling⭐18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

agentic agentic-ai ai ai-assistant open-source pythonby IdeoaLabsPython

oramacore 📁v1.2.38🌱 Seedling⭐249

OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u

fulltext-search inference llms rust vector-database vector-searchby oramasearchRust

LocalAI 📁v4.1.3🌱 Seedling⭐45,254

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

agents ai api audio-generation decentralized distributed go image-generation libp2pby mudlerGo

everything-claude-code 📁v1.10.0🌱 Seedling⭐151,139

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

ai-agents anthropic claude claude-code developer-tools javascript llm mcp productivityby affaan-mJavaScript

spiceai 📁v1.11.5🌱 Seedling⭐2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

artificial-intelligence data data-federation developers full-text-search infrastructure llm-inference machine-learning rustby spiceaiRust

RAG-Anything 📁v1.2.10🌱 Seedling⭐15,557

"RAG-Anything: All-in-One RAG Framework"

multi-modal-rag python retrieval-augmented-generationby HKUDSPython

fast-plaid 📁1.4.5🌱 Seedling⭐239

High-Performance Engine for Multi-Vector Search

colbert colpali information-retrieval python rust vector-databaseby lightonaiPython

codexlens-search 📁v0.8.0🌱 Seedling⭐44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

pythonby catlog22Python

OriginDL 📁v1.0.0🌱 Seedling⭐245

Implement a Pytorch-like DL library in C++ from scratch, step by step

ai-framework ai-infra c++cuda deeplearning pytorch yoloby jinbooooomC++

Somi 📁Mineralization🌱 Seedling⭐21

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.

ai-agents ai-framework arti automation cli gui homeb local pythonby Somi-ProjectPython

onnxruntime-java 📁v2.1.0🌱 Seedling⭐29

A type-safe, lightweight, modern, and performant binding Java binding of Microsoft's ONNX Runtime

ai-framework deep-learning ffi foreign-function-and-memory-api java machine-learning onnx onnx-inferenceby yuzawa-sanJava

DreamServer 📁v2.0.0🌱 Seedling⭐478

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

ai-agents amd comfyui docker llama-cpp llm local-ai n8n rustby Light-Heart-LabsRust

PAI-RAG 📁v0.4.3🌱 Seedling⭐450

An easy-to-use framework for modular RAG

pythonby aigc-appsPython

tabby 📁v0.32.0🌱 Seedling⭐33,397

Self-hosted AI coding assistant

ai codegen coding-assistant coding-language developer-experience developer-tools gen-ai ide rustby TabbyMLRust

uniAI 📁0.0.0🌱 Seedling⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

ai chromadb django genai information-retrieval llm local-llm ollama python vector-databaseby git-pratap-shreyPython

enton 📁main@2026-04-21🌱 Seedling⭐1

Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.

ai autonomous-agent computer-vision cuda github-config llm python pytorchby tareq3743Python

loopy 📁v2025.2💤 Dormant⭐629

A code generator for array-based code on CPUs and GPUs

array code-generation code-generator code-optimization code-transformation cuda ispc loop-optimization pythonby inducerPython

vllm-cli 📁v0.2.5💤 Dormant⭐487

A command-line interface tool for serving LLM using vLLM.

llm llm-inference llm-tools python vllmby Chen-zexiPython