Search results for "harness"
One brain, many harnesses. Portable .agent/ folder (memory + skills + protocols) that plugs into Claude Code, Cursor, Windsurf, OpenCode, OpenClaw, Hermes, or DIY Python β and keeps its knowledge when
Vibe-Skills is an all-in-one AI skills package. It seamlessly integrates expert-level capabilities and context management into a general-purpose skills packageοΌ enabling any AI agent to instantly upgr
Harness LLMs with Multi-Agent Programming
Security and best-practices scanner for AI Plugins, covering Codex, Claude, Opencode, Gemini & more. Scores trust for plugins 0-100.
ARIS βοΈ (Auto-Research-In-Sleep) β Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in β works wi
The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing
AI Agent Framework, the Pydantic way
A general-purpose coding agent that runs inside an NVIDIA OpenShell sandbox, orchestrated by Deep Agents and powered by NVIDIA Nemotron. The agent writes and executes code in an isolated, policy-gover
[GenAI Application Development Framework] π Build GenAI application quick and easy π¬ Easy to interact with GenAI agent in code using structure data and chained-calls syntax π§© Use Event-Driven Flow
A Markdown-first memory system, a standalone library for any AI agent. Inspired by OpenClaw.
Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, and
A productive AI coworker that learns, self-improves, and ships work.
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
An open-source harness for spec-driven code generation.
Nexent is a zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles β unified tools, skills, memory, and orchestration with built-in constraints, feedbac
OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.
Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.
AgenticX is a unified, production-ready multi-agent platform β Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
π¬ Harness Vibe Research with Self-evolving AI Scientists
Official MCP Servers for AWS
A coding agent optimized to smaller LLMs
Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
MaverickMCP - Personal Stock Analysis MCP Server
A multi-agent LLM system for detecting and resolving cognitive dissonance.
The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.
AI-powered bug bounty hunting from your terminal - recon, 20 vuln classes, autonomous hunting, and report generation. All inside Claude Code.
Agentica: Lightweight async-first Python framework for AI agents. θ½»ιηΊ§εΌζ₯δΌε ηAI Agentζ‘ζΆοΌζ―ζε·₯ε ·θ°η¨γRAGγε€ζΊθ½δ½εMCPγ
An Excel AI agent that uses MCP tools to let LLMs read, edit, and automate Excel spreadsheets.
One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.
LLM-powered Agent Runtime with Dynamic DAG Planning & Concurrent Execution
Claude Code skills, architectural principles, and alternative approaches for AI-assisted development
A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution β inspired by Herme
Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf
π§ PromptDrifter β oneβcommand CI guardrail that catches prompt drift and fails the build when your LLM answers change.
Autonomous AI Agent Harness β persistent memory, SWARM orchestration, event-driven triggers. The KAIROS pattern, built independently before the Claude Code leak. pip install adam-framework
Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features β fully self-hosted via Ollama.
Claude Code skills collection β CCA study guides, Twitter research, MCP review, auto-iteration tools
A local LLM-based autonomous agent orchestration platform featuring async background tasks, context-isolated sub-agents, dynamic knowledge injection, and strict security approval gates (Plan Mode).
Complete Workspace Template for OpenClaw - Full agent lifecycle with unified memory system (Markdown + SQLite), self-evolution, RAG. Not for SubAgent/Skill use.
GAN-inspired multi-agent system that autonomously builds full-stack web apps from a single prompt using Claude AI agents
π Build an enterprise-ready RAG system to enhance technical documentation querying with LangGraph and multi-step reasoning workflows.
pytest plugin for URL based testing
HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research
