freshcrate

Search results for "llama"

125 results found
llama.cpp📁b8864🌳 Mature103,119

LLM inference in C/C++

npcpy📁v1.4.21🌳 Mature1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

opik📁2.0.6🌳 Mature18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

restai📁v6.1.45🌿 Growing483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

casibase📁v1.771.2🌳 Mature4,493

⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports Ch

compose-for-agents📁main@2026-04-20🌳 Mature910

Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.

cyllama📁0.2.11🌱 Seedling22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

aitools_client📁0.0.0🌿 Growing182

Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters

sample-agentic-frameworks-on-aws📁main@2026-04-17🌿 Growing250

Build Agentic AI solutions on AWS, using latest OSS Agentic Frameworks.

ollama📁v0.21.0🌿 Growing168,597

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

llxprt-code📁v0.10.0-nightly.260421.636d54708🌳 Mature657

An open-source multi-provider AI assisted CLI development tool. Use whatever LLM you want to code in your terminal.

neurolink📁v9.56.0🌿 Growing121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.

OmniRoute📁v3.6.9🌳 Mature2,435

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

litellm📁v1.83.7-stable🌳 Mature42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

edgecrab📁v0.7.0🌱 Seedling21

EdgeCrab 🦀 A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw — Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters

osaurus📁0.16.16🌳 Mature4,912

Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.

obsidian-local-llm-hub📁0.12.2🌱 Seedling27

All-in-one local AI hub for Obsidian — LLM chat with vault tools, MCP servers, RAG, workflow automation, encryption, and edit history. Fully private, no cloud required.

agentic-memory📁0.0.0🌿 Growing162

No description

by lhl

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

cactus📁0.0.0🌿 Growing50

LLM Agent that leverages cheminformatics tools to provide informed responses.

UGTLive📁0.0.0🌿 Growing73

An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)

ai-orchestrator📁v1.0.17🌿 Growing86

Portable multi-agent AI developer setup for Claude Code + Ollama. Role-based local LLM orchestration via Bash — plan, code, review, commit. Zero Dependency. Works with any language stack.

llamafarm📁v0.0.31🌿 Growing825

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

langfuse📁v3.169.0🌿 Growing24,578

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

promptfoo📁code-scan-action-0.1.5🌿 Growing19,943

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

openinference📁python-openinference-instrumentation-google-genai-v0.1.15🌿 Growing913

OpenTelemetry Instrumentation for AI Observability

ClawRouter📁v0.12.158🌿 Growing6,186

The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.

GhostDesk📁v7.1.0🌱 Seedling39

Give any AI agent a full desktop — it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm

cognithor📁v0.92.2🌿 Growing94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

Tigrimos📁v1.3.1🌿 Growing53

A self-hosted AI workspace with chat, code execution, parallel multi-agent orchestration, and a skill marketplace. Runs on macOS and Windows. Everything executes inside a secure Ubuntu sandbox — no Do

aiagentflow📁v1.0.2🌱 Seedling36

A local-first, CLI-driven multi-agent AI software engineering workflow orchestrator with feed specs, PRDs, and guidelines to auto-generate implementation plans and code.

llama_index📁v0.14.21🌿 Growing48,501

LlamaIndex is the leading document agent and OCR platform

PlanExe📁main@2026-04-20🌿 Growing365

Create a plan from a description in minutes

vektori📁main@2026-04-19🌿 Growing72

Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.

SmarterRouter📁2.2.5🌿 Growing105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

claude-flows📁0.0.0🌿 Growing93

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architect

claude-engram📁main@2026-04-17🌱 Seedling13

Persistent memory and session intelligence for AI coding assistants. Auto-tracks mistakes, decisions, and context via hooks. Mines your full session history for patterns, predictions, and cross-sessio

agents-flex📁v2.0.9🌿 Growing1,208

Agents-flex is A Lightweight Java AI Application Development Framework.

AutoGPT📁autogpt-platform-beta-v0.6.56🌿 Growing183,319

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

ollamafreeapi📁main@2026-04-15🌿 Growing144

OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our managed Ollama servers with: - Zero-configuration access to 50+ models - Auto load-balanced across global nodes - Free tier w

llmware📁v0.4.6🌿 Growing14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

parlant📁v3.3.1🌿 Growing17,899

The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.

rag-chatbot📁main@2026-04-14🌿 Growing402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

hermes-workspace📁v2.0.0🌿 Growing1,124

Native web workspace for Hermes Agent — chat, terminal, memory, skills, inspector.

maestro📁v1.5.0🌱 Seedling22

The Maestro App Factory: a highly-opinionated multi-agent orchestration tool for app development that emulates the workflow of high-functioning human development teams using AI agents

vllm-mlx📁v0.2.8🌿 Growing798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

MakerAi📁master@2026-04-11🌿 Growing159

The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp

This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers

llm7.io📁0.0.0🌿 Growing139

LLM7.io offers a single API gateway that connects you to a wide array of leading AI models from various providers.

oh-my-pi📁v14.1.2🌿 Growing2,872

⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

template-coding-agent📁0.0.0🌱 Seedling42

Advanced Mastra AI coding agent with secure sandbox execution, comprehensive file management, and multi-language support for Python, JavaScript, and TypeScript development workflows

toolbridge📁v2.0.0🌱 Seedling74

Enable tool/function calling for any LLM, in OpenAI and Ollama API formats, adding universal function calling to models without native support. Use local or cloud models with full agent capabilities.

wow-rag📁0.0.0🌱 Seedling231

A simple and trans-platform rag framework and tutorial

NeuronFS📁main@2026-04-21🌿 Growing136

mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI — ₩0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.

memory_agent_hub📁main@2026-04-20🌱 Seedling38

2026 swarm Agent 年,swarm Agent 、Agent team、 ai coding、skill、memory、evolve、agentic RL 等 AI Agent集合

ai-agents-from-zero📁main@2026-04-20🌿 Growing264

🚀 2026 最系统的 AI Agent 速成指南|智能体实战教程 · 完整学习路径 + 实战项目 + 面试题库 · 对标大模型应用开发工程师岗位 · 覆盖LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / 提示词 · 企业级部署与微调 · 从0到企业级落地 + 从学习到上线项目 + 面试准备一体化

orbit📁v2.6.6🌿 Growing250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

mcp-rubber-duck📁v1.19.2🌿 Growing154

An MCP server that acts as a bridge to query multiple OpenAI-compatible LLMs with MCP tool access. Just like rubber duck debugging, explain your problems to various AI "ducks" who can actually researc

AGI-Alpha-Agent-v0📁main@2026-04-18🌿 Growing283

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

vllm📁v0.19.1🌿 Growing76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

sdk-python📁v1.36.0🌿 Growing5,602

A model-driven approach to building AI agents in just a few lines of code.

inference-gateway📁v0.23.6🌱 Seedling109

An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare an

ai-real-estate-assistant📁dev@2026-04-13🌿 Growing159

Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.

deep-research-mcp📁main@2026-04-13🌿 Growing58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

AgenticGoKit📁v0.5.9🌿 Growing134

Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o

ruflo📁v3.5.80🌿 Growing31,236

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade archit

agenticSeek📁main@2026-04-11🌿 Growing25,891

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993

anything-llm📁v1.12.0🌱 Seedling57,919

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

ds_ex📁main@2026-04-09🌱 Seedling17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

jan📁v0.7.9🌱 Seedling41,710

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

Open-Sable📁v1.7.0🌱 Seedling18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

frontman📁v0.15.0🌱 Seedling261

The AI agent that lives in your framework/browser

PromptDrifter📁main@2026-04-19🌱 Seedling8

🧭 PromptDrifter – one‑command CI guardrail that catches prompt drift and fails the build when your LLM answers change.

LocalAI📁v4.1.3🌱 Seedling45,254

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

kernel📁v3.97.0🌱 Seedling12

kbot — the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

VecturaKit📁5.3.0🌱 Seedling280

Swift-based vector database for on-device RAG using MLTensor and MLX Embedders

spacebot📁v0.4.1🌱 Seedling2,066

An AI agent for teams, communities, and multi-user environments.

AutoRAG📁v0.3.22🌱 Seedling4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

lm-proxy📁v3.2.2🌱 Seedling111

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

agent2📁v0.1.0🌱 Seedling25

The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.

spiceai📁v1.11.5🌱 Seedling2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

droid-llm-hunter📁v1.0.0🌱 Seedling95

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

mnemos-mcp📁main@2026-04-21🌱 Seedling4

🧠 Transform documentation chaos into a structured memory system with Mnemos, your self-hosted, multi-context knowledge server for developers.

teleton-agent📁v0.8.6🌱 Seedling66

Teleton: Autonomous AI Agent for Telegram & TON Blockchain

RAGLight📁3.4.7🌱 Seedling656

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

YouTubeGPT📁v3.3.1🌱 Seedling14

YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.

edsl📁wasm-wheel🌱 Seedling454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

DreamServer📁v2.0.0🌱 Seedling478

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

py-gpt📁v2.7.12🌱 Seedling1,724

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

miniclaw-os📁v0.1.8🌱 Seedling37

We gave AI agents a brain. Memory, planning, continuity, and self-repair — the missing cognitive architecture layer. Runs on your Mac.

guidance📁0.3.2🌱 Seedling21,378

A guidance language for controlling large language models.

GEO-AI📁main@2026-04-21🌱 Seedling3

Optimize websites for AI search engines with a universal TypeScript engine supporting Next.js, NestJS, WordPress, and Shopify integration.

watchtower📁1.0.2🌱 Seedling51

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us

uniAI📁0.0.0🌱 Seedling1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

Phantom📁v0.8.0🌱 Seedling107

Autonomous Offensive Security Intelligence AI-powered multi-agent penetration testing

cloneme📁0.0.0💤 Dormant38

CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

webbrain📁3.6.8🌱 Seedling3

Open-source AI browser agent for Chrome and Firefox

MOP📁0.0.0🌱 Seedling1

A local LLM-based autonomous agent orchestration platform featuring async background tasks, context-isolated sub-agents, dynamic knowledge injection, and strict security approval gates (Plan Mode).

local-rag-server📁main@2026-04-21🌱 Seedling2

Deploy a local, multi-user RAG system to query PDF and DOCX documents using a local LLM without cloud or API dependencies.

bisheng📁v2.3.0🌱 Seedling11,293

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF

Flipkart-Product-Recommender-RAG📁main@2026-04-21🌱 Seedling2

🛒 Build a leading-edge e-commerce recommendation system using RAG architecture, Groq Llama 3, LangChain, and AstraDB, deployed on Kubernetes for scalability.

coordinode📁v0.4.1🌱 Seedling1

The graph-native hybrid retrieval engine for AI and GraphRAG. Graph + Vector + Full-Text in a single transactional engine.

eliza📁v1.7.2🌱 Seedling18,159

Autonomous agents for everyone

hermes-ui📁v2.0🌱 Seedling2

Glassmorphic web interface for Hermes Agent — your self-hosted AI assistant

RustClaw📁v0.5.0🌱 Seedling2

Lean Rust AI agent: 6MB binary, 7.9MB RAM. OpenClaw replacement. Telegram + Discord + GitHub auto-PR. Ollama/Anthropic support.

gptme📁v0.31.0🌱 Seedling4,266

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

AnyCam2Ros📁master@2026-04-21🌱 Seedling1

📷 Transform any camera into ROS2 image topics for seamless integration with robotic systems and effective VLA model deployment.

AnyToolCall📁main@2026-04-21🌱 Seedling1

🛠️ Simplify tool calls for any LLM with AnyToolCall, an OpenAI-compatible middleware that bypasses native constraints through prompt injection.

slack-mcp-client📁v2.8.3🌱 Seedling167

A Slack bot and MCP client acts as a bridge between Slack and Model Context Protocol (MCP) servers. Using Slack as the interface, it enables large language models (LLMs) to connect and interact with v

dotnet-data-ingestion-local-rag📁main@2026-04-21🌱 Seedling1

Enable local document ingestion and retrieval-augmented generation with a secure, .NET-based pipeline that keeps data on your machine.

asya-chat-ui📁main@2026-04-21🌱 Seedling1

Build multi-organization LLM chat platforms with model routing, tool execution, usage analytics, and OpenAI-compatible APIs.

awesome-local-ai📁main@2026-04-21🌱 Seedling1

🤖 Explore and utilize top open-source tools for running, fine-tuning, and building LLMs entirely locally, without cloud dependencies or API keys.

langgraph-llama-cpp-starter📁main@2026-04-21🌱 Seedling1

🤖 Build intelligent, offline LLM agents with LangGraph and llama-cpp-python using this starter template for local, private tool-calling applications.

Mini-o3📁main@2026-04-21🌱 Seedling1

🧠 Enhance visual search with Mini-o3, providing state-of-the-art multi-turn reasoning and easy-to-use training code for advanced AI applications.

auto-re-agent📁main@2026-04-21🌱 Seedling1

Automate binary analysis by coordinating LLM agents with Ghidra, enabling scalable and precise reverse engineering workflows.

replicate-python📁1.0.7💤 Dormant900

Python client for Replicate

llm-ls📁0.5.3⚰️ Archived865

LSP server leveraging LLMs for code completion (and more?)