Search results for "cuda"
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
"RAG-Anything: All-in-One RAG Framework"
High-Performance Engine for Multi-Vector Search
Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
A high-throughput and memory-efficient inference and serving engine for LLMs
🎬 AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.
A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
A Multi-Agentic AI Assistant/Builder
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Unified framework for building enterprise RAG pipelines with small, specialized models
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code
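The "RRF fusion" named in this entry refers to Reciprocal Rank Fusion, a standard way to merge a vector-search ranking with a full-text-search (FTS) ranking. A minimal generic sketch of the technique (not this project's actual code; the document IDs are hypothetical):

```python
def rrf_fuse(rankings, k=60):
    """Reciprocal Rank Fusion: merge several ranked result lists.

    Each ranking is an ordered list of document IDs. A document's
    fused score is the sum of 1 / (k + rank) over every list it
    appears in (rank is 1-based). k=60 is a common default from
    the original RRF paper.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical hits from the two retrieval stages:
vector_hits = ["doc_a", "doc_b", "doc_c"]   # semantic / vector search
fts_hits = ["doc_b", "doc_d", "doc_a"]      # keyword / full-text search
fused = rrf_fuse([vector_hits, fts_hits])
# doc_b ranks first: it appears near the top of both lists
```

Because RRF uses only ranks, not raw scores, it needs no score normalization between the two retrieval backends.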
Local First AI SEO Software on Nix, FastHTML & HTMX
⚡ Lightweight offline AI agent for local models. No cloud, no API keys — just your GPU.
A coding agent optimized for smaller LLMs
Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.
A command-line interface tool for serving LLMs using vLLM.
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen
Builds an autonomous AI robot with vision, voice, and decision-making capabilities using Python, PyTorch, and CUDA technology.
A code generator for array-based code on CPUs and GPUs
FlashInfer: Kernel Library for LLM Serving
Faster Whisper transcription with CTranslate2
Fast inference engine for Transformer models
Embeddings, Retrieval, and Reranking
CUDA profiling tools runtime libs.
