Search results for "observability"
AI Observability & Evaluation
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for
Markdown for the AI era
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. Integrates with 50+ LLM Providers,
Plano is an AI-native proxy and data plane for agentic apps, with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents' core logic.
the easiest way to run natural language-described workflows automatically
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
Open-source AI Gateway: use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.
Persistent memory for AI coding agents
Framework for AI Backend. Build and run AI agents like microservices - scalable, observable, and identity-aware from day one.
Build and run autonomous AI agents with OpenClaw, Hermes, multiple model providers, orchestration, delegation, memory, skills, schedules, and chat connectors.
An API server that implements the official MCP Registry API, providing standardised access to MCP servers from multiple backends, including file-based and other API-compliant registries.
AgentWard: built for all, hardened for OpenClaw.
Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
Enhanced Proxmox MCP server with advanced virtualization management and full OpenAPI integration.
AI Agent Framework, the Pydantic way
The world's fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (OpenAI, self-hosted, etc.) behind a single authentication layer - with API key generati
The agent engineering platform
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
ToolHive is an enterprise-grade platform for running and managing Model Context Protocol (MCP) servers.
RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis
Open-source sandboxes where coding agents build and deploy. Spin up isolated environments where Claude Code, Cursor, and other agents code and deploy software.
Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. YC W23
AI observability platform for production LLM and agent systems.
Custom plugins for hermes-agent โ goal management, inter-agent bridge, model selection, cost control
OpenTelemetry Instrumentation for AI Observability
The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.
ANOLISA - Agentic Nexus Operating Layer & Interface System Architecture
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
Comprehensive guide to AI agent engineering: how 30+ frameworks actually work under the hood. Context rot, compaction, system prompt assembly, SOUL.md, agent loops, memory systems, tool sprawl, MCP,
Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack
Build Agentic AI solutions on AWS, using latest OSS Agentic Frameworks.
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Uses Skills to make the AI follow each step of the spec, with the code completed 100% by AI.
Thoughtbox is an intention ledger for agents. Evaluate AI's decisions against its decision-making.
tRPC-Agent-Python provides an end-to-end foundation for agent building, orchestration, tool integration, session and long-term memory, service deployment, and observability, so you can ship reliable a
See your agent think. Real-time observability dashboard for OpenClaw AI agents.
Pan by Euraika โ a self-hosted AI workspace for Hermes Agent. Chat, skills, extensions, memory, profiles, and runtime controls.
High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included
Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.
A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: AI Gateway (LiteLLM), LLM Serving (vLLM, SGLang, Ollama
Latitude is the open-source agent engineering platform
SRE Agent - CNCF Sandbox Project
Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o
Convoke extends BMAD Method AI agents with two types of installable modules: Teams bring new agents for a domain, Skills add new capabilities to existing agents. Install them independently or combine
A modular MCP server that provides commonly used developer tools for AI coding agents
AgenticX is a unified, production-ready multi-agent platform: Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
ReLE evaluation: a Chinese-language benchmark of AI large-model capabilities (continuously updated). Currently covers 359 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3-max, qwen3.5-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.5, ernie4.5, Min
Autonomous CLI agent integrations for the Spring AI ecosystem with Claude Code, Gemini CLI, and secure sandbox execution
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla
The implementation of Model Context Protocol (MCP) server for VictoriaLogs.
Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra
The app framework built for AI coding agents. Own every line. Your AI already knows how to build on it.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.
Test, Debug, and Evaluate MCP servers, ChatGPT apps, and MCP Apps (ext-apps)
Open-source AI gateway written in Rust, with token compression for Claude Code, Codex... and any other LLM client.
AI coding agent for the terminal: hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
Official MCP Servers for AWS
A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...
Agent-native TypeScript framework for building MCP servers. Build tools, not infrastructure.
Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt engineering, prompt attack & prompt protection. Advanced prompt engineering papers.
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, m
Build and run agents you can see, understand and trust.
Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM: deploy anywhere, swap anything
META-AGENTIC α-AGI. Mission, end-to-end: Identify → Out-Learn → Out-Think → Out-Design → Out-Strategise → Out-Execute
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
The platform for LLM evaluations and AI agent testing
Endee.io โ A high-performance vector database, designed to handle up to 1B vectors on a single node, delivering significant performance gains through optimized indexing and execution. Also available i
Evaluation and Tracking for LLM Experiments and AI Agents
High-performance zero-dependency L4/L7 load balancer written in Go. Single binary with Web UI, clustering, MCP/AI integration. 8.5K RPS, 39 E2E tests.
Framework to build resilient language agents as graphs.
A modern Ruby framework designed for non-blocking I/O and simpler infrastructure
A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.
Multi-agent AI coding platform powered by Vercel Sandbox and AI Gateway
Design-first Go framework that generates API code, documentation, and clients. Define once in an elegant DSL, deploy as HTTP and gRPC services with zero drift between code and docs.
Codingbuddy orchestrates 29 specialized AI agents to deliver code quality comparable to a team of human experts through a PLAN → ACT → EVAL workflow.
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
A unified Model Context Protocol server implementation that aggregates multiple MCP servers into one.
The Mind Palace for AI Agents: Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi
Koog is a JVM (Java and Kotlin) framework for building predictable, fault-tolerant and enterprise-ready AI agents across all platforms, from backend services to Android and iOS, JVM, and even in-brow
Learn to build AI agents with Strands framework. Covers LLM integration via Amazon Bedrock/Anthropic, AWS service connections, tool implementation with MCP/A2A protocols, and agent evaluation using La
Plugin suite + bundled MCP servers for Claude Code. Full delivery lifecycle: Agile pipeline with multi-model AI review, project bootstrap, documentation generation, codebase audits, performance optimi
Frontier self improving AI intern / coworker
Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
A Model Context Protocol (MCP) server for Langfuse, enabling AI agents to query Langfuse trace data for enhanced debugging and observability
One API for 25+ LLMs, OpenAI, Anthropic, Bedrock, Azure. Caching, guardrails & cost controls. Go-native LiteLLM & Kong AI Gateway alternative.
DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework
Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a
LLM proxy to observe and debug what your AI agents are doing.
Make your OpenClaw agents better, cheaper, and faster.
A web component based AI agentic chat UI element which can be added in any website to turn it into an agentic app
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
An open-source AI assistant framework with skills and agent architecture
An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare an
Integrate cutting-edge LLM technology quickly and easily into your apps
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin
Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.
trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.
structured outputs for llms
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Model Context Protocol (MCP) server for Kubernetes and OpenShift
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
AgentScope Java: Agent-Oriented Programming for Building LLM Applications
High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all f
Ship customer-facing AI with isolation, spend controls, and provenance.
The official Java SDK for Model Context Protocol servers and clients. Maintained in collaboration with Spring AI
JSON Agents - A universal JSON-native standard for describing AI agents, their capabilities, tools, runtimes, and governance in a portable, framework-agnostic format. Based on RFC 8259, JSON Schema 2
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
TraceRoot - open-source observability and self-healing layer for AI agents. YC S25
Enable seamless Android app hooking with Vector, a Zygisk module offering a consistent API for module developers and users, built on the LSPlant framework.
A selective learning and memory substrate for agentic systems: typed, revisable, decayable memory with competence learning and trust-aware retrieval.
The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.
AI-indexed portfolio and CV site with machine-readable profile data, evidence-backed case studies, verification signals, and a live MCP endpoint for agent access.
Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features, fully self-hosted via Ollama.
Your AI-powered SWE teammate, built into your git workflow
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF
Local AI anywhere, for everyone: LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
A fully autonomous (no human-in-loop) agentic based project design and coding machine
Route, manage, and analyze your LLM requests across multiple providers with a unified API interface
Published in CNCF Landscape: An MCP server for Kubernetes.
Self-evolving AI agent framework with 5-layer safety gatekeeper. Agents observe failures, propose fixes, and safely apply them. Built on HKUDS/nanobot.
Multi-agent system for software development
Autonomous local AI assistant in Go: 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving
Supercharge Your LLM Application Evaluations
Agent-ready telemetry SDK that enriches OpenTelemetry across Java, Go, Python, Node.js, and browser with structured context for AI-driven observability.
Lightweight, modular AI agent runtime that thinks (Hrafn) and remembers (MuninnDB)
Open-source autonomous AI assistant with 5-tier security, 62 tools, 14 LLM providers. Written in Rust. Single binary.
Capture and analyze Claude Code sessions locally to track every tool call, decision, and reasoning step without external dependencies.
A Slack bot and MCP client that acts as a bridge between Slack and Model Context Protocol (MCP) servers. Using Slack as the interface, it enables large language models (LLMs) to connect and interact with v
Generate a custom newspaper with an AI agent based on your favorite YouTube channels.
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
Cloud-native, ultra-high-performance AI & API gateway, LLM API management, distribution system, and open platform, supporting all AI APIs, including but not limited to OpenAI, Azure,
Connect CLI-based AI agents like Claude and Codex to Telegram and Discord with real-time streaming and session management, no SDK needed.
BRUNELLA AGENT SYSTEM (BAS): the digital organization of the future
Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services
Run Claude Code, Gemini, Codex, or any coding agent, in a clean, isolated sandbox with sensitive data redaction and observability baked in.
A Python-based framework for building multi-agent systems with LLMs. Currently in pre-launch alpha.
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, LLM, and tool tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal
A production-ready research outreach AI agent that plans, discovers, reasons, uses tools, auto-builds cited briefings, and drafts tailored emails with tool-chaining, memory, tests, and turnkey Dock
MCP (Model Context Protocol) Servers authored and maintained by the PulseMCP team. We build reliable servers thoughtfully designed specifically for MCP Client-powered workflows.
A fast and minimal framework for building agentic systems
