freshcrate

Search results for "gguf"

21 results found
llama.cppπŸ“b8864🌳 Mature⭐103,119

LLM inference in C/C++

compose-for-agentsπŸ“main@2026-04-20🌳 Mature⭐910

Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.

cyllamaπŸ“0.2.11🌱 Seedling⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

TigrimosπŸ“v1.3.1🌿 Growing⭐53

A self-hosted AI workspace with chat, code execution, parallel multi-agent orchestration, and a skill marketplace. Runs on macOS and Windows. Everything executes inside a secure Ubuntu sandbox β€” no Do

ai-orchestratorπŸ“v1.0.17🌿 Growing⭐86

Portable multi-agent AI developer setup for Claude Code + Ollama. Role-based local LLM orchestration via Bash β€” plan, code, review, commit. Zero Dependency. Works with any language stack.

AGI-Alpha-Agent-v0πŸ“main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI πŸ‘οΈβœ¨ β€” Mission 🎯 End‑to‑end: Identify πŸ” β†’ Out‑Learn πŸ“š β†’ Out‑Think 🧠 β†’ Out‑Design 🎨 β†’ Out‑Strategise β™ŸοΈ β†’ Out‑Execute ⚑

vllmπŸ“v0.19.1🌿 Growing⭐76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

paiml-mcp-agent-toolkitπŸ“v3.14.0🌿 Growing⭐148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

llmwareπŸ“v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

rag-chatbotπŸ“main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

deep-research-mcpπŸ“main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

LocalAIπŸ“v4.1.3🌱 Seedling⭐45,254

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

spiceaiπŸ“v1.11.5🌱 Seedling⭐2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

DreamServerπŸ“v2.0.0🌱 Seedling⭐478

Local AI anywhere, for everyone β€” LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

webbrainπŸ“3.6.8🌱 Seedling⭐3

Open-source AI browser agent for Chrome and Firefox

local-rag-serverπŸ“main@2026-04-21🌱 Seedling⭐2

Deploy a local, multi-user RAG system to query PDF and DOCX documents using a local LLM without cloud or API dependencies.

langgraph-llama-cpp-starterπŸ“main@2026-04-21🌱 Seedling⭐1

πŸ€– Build intelligent, offline LLM agents with LangGraph and llama-cpp-python using this starter template for local, private tool-calling applications.

CoexistAIπŸ“v2.6πŸ’€ Dormant⭐464

CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate research workflows using LLMs, web search, Reddit, YouTube, and mappi

EliteAgentπŸ“main@2026-04-17🌱 Seedling⭐1

The ultimate native macOS AI Agent. Blends local MLX SLMs with 3D cognitive Metal rendering and autonomous system integrations.

superagentπŸ“node-v0.0.9πŸ’€ Dormant⭐6,515

Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers.

vllm-cliπŸ“v0.2.5πŸ’€ Dormant⭐487

A command-line interface tool for serving LLM using vLLM.