freshcrate

Search results for "gpu"

Clear filters
55 results found (Python)
openlitπŸ“openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. πŸš€πŸ’» Integrates with 50+ LLM Providers,

jarvisπŸ“v1.28.0🌿 Growing⭐174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

Auto-claude-code-research-in-sleepπŸ“v0.4.4🌳 Mature⭐6,182

ARIS βš”οΈ (Auto-Research-In-Sleep) β€” Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in β€” works wi

cyllamaπŸ“0.2.11🌱 Seedling⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

Constrained-Text-Generation-StudioπŸ“0.0.0🌿 Growing⭐216

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

LRATπŸ“0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

cognithorπŸ“v0.92.2🌿 Growing⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

ai-powered-video-analyzerπŸ“0.0.0🌿 Growing⭐68

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll

JRVSπŸ“0.0.0🌿 Growing⭐236

JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.

Dragon-BrainπŸ“v1.1.0🌱 Seedling⭐43

Dragon Brain β€” persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

auto-deep-researcher-24x7πŸ“main@2026-04-19🌿 Growing⭐261

πŸ”₯ An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

SmarterRouterπŸ“2.2.5🌿 Growing⭐105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

rag-chatbotπŸ“main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

mcp-client-for-ollamaπŸ“v0.28.0🌿 Growing⭐599

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

AgenticXπŸ“v0.3.7🌿 Growing⭐105

AgenticX is a unified, production-ready multi-agent platform β€” Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

synaptic-memoryπŸ“v0.16.0🌱 Seedling⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

hermes-agentπŸ“v2026.4.16🌿 Growing⭐57,954

The agent that grows with you

RIGELπŸ“0.0.0🌱 Seedling⭐26

A Multi-Agentic AI Assistant/Builder

tsunamiπŸ“main@2026-04-21🌱 Seedling⭐13

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

awesome-code-agentsπŸ“main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding β€” they're redefining how software changes the world.

orbitπŸ“v2.6.6🌿 Growing⭐250

One API for 20+ LLM providers, your databases, and your files β€” self-hosted, open-source AI gateway with RAG, voice, and guardrails.

AGI-Alpha-Agent-v0πŸ“main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI πŸ‘οΈβœ¨ β€” Mission 🎯 End‑to‑end: Identify πŸ” β†’ Out‑Learn πŸ“š β†’ Out‑Think 🧠 β†’ Out‑Design 🎨 β†’ Out‑Strategise β™ŸοΈ β†’ Out‑Execute ⚑

vllmπŸ“v0.19.1🌿 Growing⭐76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

llmwareπŸ“v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

ai-real-estate-assistantπŸ“dev@2026-04-13🌿 Growing⭐159

Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.

deep-research-mcpπŸ“main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

vllm-mlxπŸ“v0.2.8🌿 Growing⭐798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

agenticSeekπŸ“main@2026-04-11🌿 Growing⭐25,891

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. πŸ”” Official updates only via twitter @Martin993

UltraRAGπŸ“v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

AutoRAGπŸ“v0.3.22🌱 Seedling⭐4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

vikramadityaπŸ“main@2026-04-20🌱 Seedling⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) β€” it hunts, it reports. Inspired by the Obsidian Order.

animaworksπŸ“v0.6.2🌱 Seedling⭐225

Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).

Windows-MCPπŸ“v0.7.1🌱 Seedling⭐5,075

MCP Server for Computer Use in Windows

neurostackπŸ“v0.11.1🌱 Seedling⭐40

Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal

droid-llm-hunterπŸ“v1.0.0🌱 Seedling⭐95

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

RAG-AnythingπŸ“v1.2.10🌱 Seedling⭐15,557

"RAG-Anything: All-in-One RAG Framework"

fast-plaidπŸ“1.4.5🌱 Seedling⭐239

High-Performance Engine for Multi-Vector Search

codexlens-searchπŸ“v0.8.0🌱 Seedling⭐44

Lightweight semantic code search engine β€” 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

devitoπŸ“v4.8.21🌱 Seedling⭐689

DSL and compiler framework for automated finite-differences and stencil computation

SomiπŸ“Mineralization🌱 Seedling⭐21

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features β€” fully self-hosted via Ollama.

KawaiiGPTπŸ“KawaiiGPT🌱 Seedling⭐831

KawaiiGPT β€” Open-source LLM gateway accessing DeepSeek, Gemini, and Kimi-K2 through reverse-engineered Pollinations API with no API keys required, built-in prompt injection capabilities for security r

OpenRA-RLπŸ“v0.4.1🌱 Seedling⭐118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

PAI-RAGπŸ“v0.4.3🌱 Seedling⭐450

An easy-to-use framework for modular RAG

ragflowπŸ“v0.24.0🌱 Seedling⭐77,784

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

py-gptπŸ“v2.7.12🌱 Seedling⭐1,724

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

DOXπŸ“main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls

Comfy-CozyπŸ“v4.0.0🌱 Seedling⭐3

AI co-pilot for ComfyUI β€” 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.

uniAIπŸ“0.0.0🌱 Seedling⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate β€” built to help studen

kagglerunπŸ“master@2026-04-21🌱 Seedling⭐1

πŸš€ Run Python on Kaggle's free GPUs directly from your terminal without the need for a browser, streamlining your data science workflow.

loopyπŸ“v2025.2πŸ’€ Dormant⭐629

A code generator for array-based code on CPUs and GPUs

LettuceDetectπŸ“0.1.8πŸ’€ Dormant⭐545

Lightweight hallucination detection framework for RAG applications

vllm-cliπŸ“v0.2.5πŸ’€ Dormant⭐487

A command-line interface tool for serving LLM using vLLM.

Qwen-AgentπŸ“v0.0.26πŸ’€ Dormant⭐15,963

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

replicate-pythonπŸ“1.0.7πŸ’€ Dormant⭐900

Python client for Replicate