freshcrate

Search results for "evaluation"

147 results found
opik · 2.0.6 · 🌳 Mature · ⭐ 18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

agenta · v0.96.7 · 🌳 Mature · ⭐ 4,011

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

WeKnora · v0.4.0 · 🌳 Mature · ⭐ 13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

LLM-Agents-Ecosystem-Handbook · 0.0.0 · 🌳 Mature · ⭐ 508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai-agents-reality-check · 0.0.0 · 🌿 Growing · ⭐ 57

Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica

openlit · openlit-1.18.1 · 🌿 Growing · ⭐ 2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

CodeGen · 0.0.0 · 🌳 Mature · ⭐ 773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

openclaw-engram · v9.3.142 · 🌿 Growing · ⭐ 54

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction, plain markdown storage, hybrid search via QMD. Gives agents persistent long-term memory across conversations.

agent-framework · python-1.1.0 · 🌳 Mature · ⭐ 9,325

A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.

PraisonAI · v4.6.25 · 🌳 Mature · ⭐ 6,900

PraisonAI 🦞: Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

OpenSandbox · docker/execd/v1.0.13 · 🌳 Mature · ⭐ 9,925

Secure, Fast, and Extensible Sandbox runtime for AI agents.

agentmemory · v0.9.1 · 🌳 Mature · ⭐ 738

Persistent memory for AI coding agents

neurolink · v9.56.0 · 🌿 Growing · ⭐ 121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.

piclaw · v1.8.3 · 🌿 Growing · ⭐ 467

I'm going to build my own OpenClaw, with blackjack... and bun!

mentisdb · 0.9.3.39 · 🌿 Growing · ⭐ 56

Memory that lasts and compounds. MentisDB gives agents durable memory so they do not just remember, they improve over time. It stores append-only thought chains plus a Git-like skills registry, lett

langchain · langchain-core==1.3.0 · 🌳 Mature · ⭐ 133,178

The agent engineering platform

RAGHub · main@2026-04-17 · 🌳 Mature · ⭐ 1,712

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

agentic-memory · 0.0.0 · 🌿 Growing · ⭐ 162

No description

by lhl
RAPTOR · 0.0.0 · 🌱 Seedling · ⭐ 13

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

tulip_agent · 0.0.0 · 🌱 Seedling · ⭐ 44

autonomous agent with access to a tool library

LRAT · 0.0.0 · 🌱 Seedling · ⭐ 34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

GEA · 0.0.0 · 🌱 Seedling · ⭐ 23

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

chinese-llm-benchmark · v5.9 · 🌿 Growing · ⭐ 5,841

ReLE Evaluation: capability evaluation of Chinese AI large models (continuously updated). Currently covers 359 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3-max, qwen3.5-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.5, ernie4.5, Min

arthur-engine · 2.1.529 · 🌿 Growing · ⭐ 75

Make AI work for Everyone - Monitoring and governing for your AI/ML

langfuse · v3.169.0 · 🌿 Growing · ⭐ 24,578

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

promptfoo · code-scan-action-0.1.5 · 🌿 Growing · ⭐ 19,943

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

cognithor · v0.92.2 · 🌿 Growing · ⭐ 94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

plano · 0.4.20 · 🌿 Growing · ⭐ 6,241

Plano is an AI-native proxy and data plane for agentic apps, with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents' core logic.

arag · v0.1.0 · 🌿 Growing · ⭐ 247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

pydantic-deepagents · 0.3.15 · 🌿 Growing · ⭐ 648

Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, and

helix · 2.9.30 · 🌿 Growing · ⭐ 757

♾️ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack ♾️

Autonomous-Agents · main@2026-04-16 · 🌿 Growing · ⭐ 1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

ollama · v0.21.0 · 🌿 Growing · ⭐ 168,597

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

career-ops · v1.5.0 · 🌿 Growing · ⭐ 30,403

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

Awesome-Context-Engineering · 0.0.0 · 🌳 Mature · ⭐ 3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

MODULAR-RAG-MCP-SERVER · 0.0.0 · 🌳 Mature · ⭐ 783

A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.

evals · v0.1.15 · 🌿 Growing · ⭐ 103

A comprehensive evaluation framework for AI agents and LLM applications.

langwatch · skills@v0.3.0 · 🌿 Growing · ⭐ 3,193

The platform for LLM evaluations and AI agent testing

AI-Infra-Guard · v4.1.4 · 🌿 Growing · ⭐ 3,428

A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.

OpenClawProBench · main@2026-04-15 · 🌿 Growing · ⭐ 340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

claw-eval · main@2026-04-15 · 🌿 Growing · ⭐ 394

Claw-Eval is an evaluation harness for evaluating LLMs as agents. All tasks verified by humans.

unsloth-buddy · main@2026-04-15 · 🌿 Growing · ⭐ 212

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

latitude-llm · claude-code-telemetry-0.0.5 · 🌿 Growing · ⭐ 3,955

Latitude is the open-source agent engineering platform

Agentic-RAG-R1 · 0.0.0 · 🌿 Growing · ⭐ 412

Agentic RAG R1 Framework via Reinforcement Learning

AgenticX · v0.3.7 · 🌿 Growing · ⭐ 105

AgenticX is a unified, production-ready multi-agent platform: Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

prism-mcp · v9.3.0 · 🌿 Growing · ⭐ 116

The Mind Palace for AI Agents: Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

agent-client · v0.13.0 · 🌿 Growing · ⭐ 90

Autonomous CLI agent integrations for the Spring AI ecosystem with Claude Code, Gemini CLI, and secure sandbox execution

Learn to build AI agents with Strands framework. Covers LLM integration via Amazon Bedrock/Anthropic, AWS service connections, tool implementation with MCP/A2A protocols, and agent evaluation using La

ISC-Bench · v0.0.5 · 🌿 Growing · ⭐ 786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

weaviate · v1.37.1 · 🌿 Growing · ⭐ 15,988

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a c

cordum · V0.9.9.1 · 🌿 Growing · ⭐ 461

The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.

panguard-ai · v1.4.19 · 🌱 Seedling · ⭐ 37

Open-source security platform for AI agents -- audits skills before install, monitors 24/7, shares threat intelligence across all users.

mission-control · v2.5.0 · 🌿 Growing · ⭐ 1,853

The world's first Autonomous Product Engine (APE): AI agents research your market, generate features, and ship code as PRs. Convoy mode, crash recovery, cost tracking, 80+ API endpoints. Self-hosted v

memind · main@2026-04-21 · 🌿 Growing · ⭐ 360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

Awesome-World-Models · main@2026-04-21 · 🌿 Growing · ⭐ 1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

karpathy-llm-wiki · main@2026-04-21 · 🌱 Seedling · ⭐ 34

The Self-Growing Karpathy LLM Wiki, grown by an AI agent yoyo from Karpathy's founding prompt

awesome-prompts · main@2026-04-21 · 🌿 Growing · ⭐ 7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

honcho · main@2026-04-21 · 🌿 Growing · ⭐ 2,030

Memory library for building stateful agents

aura · main@2026-04-21 · 🌱 Seedling · ⭐ 47

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules, running loca

deer-flow · main@2026-04-21 · 🌿 Growing · ⭐ 60,446

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

LLM-Agent-Paper-daily · main@2026-04-21 · 🌱 Seedling · ⭐ 20

Automatically updates LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

Cogitator-AI · main@2026-04-21 · 🌱 Seedling · ⭐ 35

🤖 Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.

samples · main@2026-04-20 · 🌿 Growing · ⭐ 717

Agent samples built using the Strands Agents SDK.

haystack · v2.28.0 · 🌿 Growing · ⭐ 24,806

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, m

agentscope · v1.0.19 · 🌿 Growing · ⭐ 23,421

Build and run agents you can see, understand and trust.

awesome-code-agents · main@2026-04-20 · 🌿 Growing · ⭐ 94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding, they're redefining how software changes the world.

vexa · v0.10.2 · 🌿 Growing · ⭐ 1,862

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

auto-deep-researcher-24x7 · main@2026-04-19 · 🌿 Growing · ⭐ 261

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

skills-vote · main@2026-04-19 · 🌱 Seedling · ⭐ 31

The Next-Gen Agent-Native Skill Recommendation Engine

medusa · v2026.5.5 · 🌿 Growing · ⭐ 252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

crewAI · 1.14.2 · 🌿 Growing · ⭐ 48,611

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

giskard-oss · giskard-checks/v1.0.2b1 · 🌱 Seedling · ⭐ 5,225

🐢 Open-Source Evaluation & Testing library for LLM Agents

maverick-mcp · main@2026-04-17 · 🌿 Growing · ⭐ 479

MaverickMCP - Personal Stock Analysis MCP Server

trulens · trulens-2.7.2 · 🌱 Seedling · ⭐ 3,237

Evaluation and Tracking for LLM Experiments and AI Agents

AReaL · v1.0.3 · 🌿 Growing · ⭐ 5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Awesome-Agent-Memory · main@2026-04-16 · 🌿 Growing · ⭐ 333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

beads · v1.0.2 · 🌿 Growing · ⭐ 20,577

Beads - A memory upgrade for your coding agent

paiml-mcp-agent-toolkit · v3.14.0 · 🌿 Growing · ⭐ 148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

mlflow · v3.11.1 · 🌱 Seedling · ⭐ 25,285

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin

LLM-Wiki · main@2026-04-18 · 🌱 Seedling · ⭐ 7

Autonomous knowledge base plugin for Claude Code - captures research, ideas, and decisions into an interlinked wiki with research-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou

cognitive-dissonance-dspy · main@2026-04-14 · 🌿 Growing · ⭐ 276

A multi-agent LLM system for detecting and resolving cognitive dissonance.

ai-real-estate-assistant · dev@2026-04-13 · 🌿 Growing · ⭐ 159

Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.

carapace · v0.7.0 · 🌱 Seedling · ⭐ 42

A secure, stable Rust alternative to openclaw/moltbot/clawdbot

awesome-vector-database · main@2026-04-13 · 🌿 Growing · ⭐ 341

A curated list of awesome works related to high dimensional structure/vector search & database

The Multi-Agent Custom Automation Engine Solution Accelerator is an AI-driven system that manages a group of AI agents to accomplish tasks based on user input. Powered by Microsoft Agent Framework, Az

voltagent · @voltagent/server-elysia@2.0.7 · 🌿 Growing · ⭐ 7,851

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

AutoRAG · v0.3.22 · 🌱 Seedling · ⭐ 4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

cyber-pilot · v3.7.0-beta · 🌿 Growing · ⭐ 53

Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.

Awesome-Repo-Level-Code-Generation · main@2026-04-10 · 🌿 Growing · ⭐ 274

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

tensorzero · 2026.4.0 · 🌱 Seedling · ⭐ 11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

ds_ex · main@2026-04-09 · 🌱 Seedling · ⭐ 17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

UltraRAG · v0.3.0.2 · 🌿 Growing · ⭐ 5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

sv-excel-agent · 0.0.0 · 🌱 Seedling · ⭐ 179

An Excel AI agent that uses MCP tools to let LLMs read, edit, and automate Excel spreadsheets.

mastra · @mastra/core@1.24.0 · 🌱 Seedling · ⭐ 22,899

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

b2b-sdr-agent-template · v3.6.0 · 🌱 Seedling · ⭐ 40

Open-source AI SDR template for B2B export. 10-stage sales pipeline, 10 cron jobs, 4-engine memory, multi-channel (WhatsApp+Telegram+Email). Built on OpenClaw.

trpc-agent-go · v1.8.0 · 🌱 Seedling · ⭐ 1,085

trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.

skill · v1.2.1 · 🌱 Seedling · ⭐ 978

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

kernel · v3.97.0 · 🌱 Seedling · ⭐ 12

kbot: the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

everything-claude-code · v1.10.0 · 🌱 Seedling · ⭐ 151,139

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

ai-memecoin-trading-bot · 0.0.0 · 🌱 Seedling · ⭐ 63

AI-powered meme coin trading bot for Solana and Base that automatically scans new tokens, detects honeypots, calculates win probability, executes trades. Built in Go with a multi-agent architecture, r

llm_intents · 1.7.1 · 🌱 Seedling · ⭐ 111

Exposes internet search tools for use by LLM-backed Assist in Home Assistant

Standard · 0.0.0 · 🌱 Seedling · ⭐ 18

JSON Agents - A universal JSON-native standard for describing AI agents, their capabilities, tools, runtimes, and governance in a portable, framework-agnostic format. Based on RFC 8259, JSON Schema 2

any-agent · 1.18.0 · 🌱 Seedling · ⭐ 1,141

A single interface to use and evaluate different agent frameworks

camel · v0.2.90 · 🌱 Seedling · ⭐ 16,654

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

codexmaster · prompt-generator · 🌱 Seedling · ⭐ 76

Master Codex with this Framework file system + Prompt Generator consisting of 32 markdown files that will set such strict constraints and rules for Codex that its output is nearly flawless. Files for:

membrane · v0.2.0 · 🌱 Seedling · ⭐ 75

A selective learning and memory substrate for agentic systems: typed, revisable, decayable memory with competence learning and trust-aware retrieval.

mattermost-plugin-agents · v1.14.0 · 🌱 Seedling · ⭐ 217

Mattermost Agents plugin supporting multiple LLMs

KawaiiGPT · KawaiiGPT · 🌱 Seedling · ⭐ 831

KawaiiGPT: Open-source LLM gateway accessing DeepSeek, Gemini, and Kimi-K2 through reverse-engineered Pollinations API with no API keys required, built-in prompt injection capabilities for security r

bisheng · v2.3.0 · 🌱 Seedling · ⭐ 11,293

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF

RAGElo · 0.4.0 · 🌱 Seedling · ⭐ 128

RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker

deltallm · v0.1.20-rc2 · 🌱 Seedling · ⭐ 3

Route, manage, and analyze your LLM requests across multiple providers with a unified API interface

ragas · v0.4.3 · 🌱 Seedling · ⭐ 13,329

Supercharge Your LLM Application Evaluations 🚀

LightAgent · v0.5.0 · 🌱 Seedling · ⭐ 831

LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE prot

prd-taskmaster · v3.0.0 · 🌱 Seedling · ⭐ 184

AI-powered PRD generation for Claude Code with taskmaster integration

PAI-RAG · v0.4.3 · 🌱 Seedling · ⭐ 450

An easy-to-use framework for modular RAG

py-gpt · v2.7.12 · 🌱 Seedling · ⭐ 1,724

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

sofia · main@2026-04-11 · 🌱 Seedling · ⭐ 2

Autonomous local AI assistant in Go: 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving

aictl · v0.28.0 · 🌱 Seedling · ⭐ 1

🤖 AI agent in your terminal

evo-agents · master@2026-04-19 · 🌱 Seedling · ⭐ 3

Complete Workspace Template for OpenClaw - Full agent lifecycle with unified memory system (Markdown + SQLite), self-evolution, RAG. Not for SubAgent/Skill use.

uniAI · 0.0.0 · 🌱 Seedling · ⭐ 1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate, built to help studen

heartbeat-agent-framework · 0.0.0 · 🌱 Seedling · ⭐ 1

The open-source framework that makes AI agents proactive, self-learning, and autonomous. Multi-project tracking, full logging pipeline, message discipline, and memory review system.

gptme · v0.31.0 · 🌱 Seedling · ⭐ 4,266

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

ryvos · v0.9.0 · 🌱 Seedling · ⭐ 2

Open-source autonomous AI assistant with 5-tier security, 62 tools, 14 LLM providers. Written in Rust. Single binary.

CodeRAG · main@2026-04-21 · 🌱 Seedling · ⭐ 1

Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.

harness · master@2026-04-21 · 🌱 Seedling · ⭐ 1

Define and control AI agents in markdown with full prompt transparency, persistent memory, and integrated tools via the Claude Agent SDK.

agenttel-sdk · v0.2.0-alpha · 🌱 Seedling · ⭐ 6

Agent-ready telemetry SDK: enriches OpenTelemetry across Java, Go, Python, Node.js, and browser with structured context for AI-driven observability.

multi-agent-orchestration-framework · v0.1.0 · 🌱 Seedling · ⭐ 26

Modular multi-agent orchestration framework powered by LangGraph and FastAPI.

llm-agents.nix · assets · 🌱 Seedling · ⭐ 988

Nix packages for AI coding agents and development tools. Automatically updated daily.

Government-Citizen-Services-Voice-Agent · main@2026-04-15 · 🌱 Seedling · ⭐ 1

Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services

LettuceDetect · 0.1.8 · 💤 Dormant · ⭐ 545

Lightweight hallucination detection framework for RAG applications

Neuroverseos-governance · v0.3.0 · 🌱 Seedling · ⭐ 1

Deterministic governance engine for AI agents. Enforce rules defined in .md governance files across AI systems.

TSUKUYOMI · 2.6.0 · 💤 Dormant · ⭐ 86

TSUKUYOMI is an advanced modular intelligence framework designed for the democratization of Intelligence Analysis via systematic analysis, processing, and reporting across multiple domains. Built on a

HealthFlow · datasets · 💤 Dormant · ⭐ 40

HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research

RagaAI-Catalyst · v2.2.4 · 💤 Dormant · ⭐ 16,130

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

ClosedSSPM · v0.4.1 · 🌱 Seedling · ⭐ 1

An open-source SSPM tool written in Go

FlexRAG · 0.3.0 · 💤 Dormant · ⭐ 235

FlexRAG: A RAG Framework for Information Retrieval and Generation.

Qwen-Agent · v0.0.26 · 💤 Dormant · ⭐ 15,963

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

judge0 · v1.13.1 · ⚰️ Archived · ⭐ 4,082

Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.

Promptgpt · v1.2 · ⚰️ Archived · ⭐ 119

PromptGPT is an open-source framework that enables users to automatically generate high-quality prompts with no installation, coding, or technical knowledge required. Promptgpt follows industry best

medicalAI · v1.2.9-rc · ⚰️ Archived · ⭐ 21

Medical-AI is an AI framework specifically for Medical Applications https://aibharata.github.io/medicalAI/