Search results for "testing"
A backport of Django's built in Tasks framework
Utility to detect blocking calls in the async event loop
Property-based testing framework for Open API and GraphQL based apps
Pytest plugin for aiohttp support
Generic automation framework for acceptance testing and robotic process automation (RPA)
A complete web automation framework for end-to-end testing.
behave is behaviour-driven development, Python style
The dynamic configurator for your Python Project
Hamcrest framework for matcher objects
Python framework for fast Vector Space Modelling
Pytest plugin to randomly order tests and control random.seed.
Implements a fake file system that mocks the Python file system modules.
Dependency injection framework for Python
An extended [CommonMark](https://spec.commonmark.org/) compliant parser,
Contains the API for end users as well as helper functions and classes to build Allure adapters for Python test frameworks
Setuptools Rust extension plugin
An open source FaaS (Function as a service) framework for writing portable Python functions -- brought to you by the Google Cloud Functions team.
MinIO Python SDK for Amazon S3 Compatible Cloud Storage
A light-weight and flexible data validation and testing tool for statistical data objects.
Type hints (PEP 484) support for the Sphinx autodoc extension
asyncio rate limiter, a leaky bucket implementation
Freely available tools for computational molecular biology.
Lightweight, extensible schema and data validation tool for Pythondictionaries.
An asynchronous networking framework written in Python
Developer-friendly load testing framework
PyTorch native Metrics
Toolbox for imbalanced dataset in machine learning
Client library for the Qdrant vector search engine
Parameterized testing with any Python test framework
The AWS X-Ray SDK for Python (the SDK) enables Python developers to record and emit information from within their applications to the AWS X-Ray service.
Travel through time in your tests.
The property-based testing library for Python
tox is a generic virtualenv management and test command line tool
The backend—i.e. core services, APIs, and REST endpoints—to Jupyter web applications.
pytest xdist plugin for distributed testing, most importantly across multiple CPUs
Pytest plugin for measuring coverage.
A utility belt for advanced users of python-requests
Powerful data structures for data analysis, time series, and statistics
RAG pipeline security testing toolkit - 27 techniques across 6 kill chain phases, mapped to MITRE ATLAS
Lightning toolbox for across the our ecosystem.
🐢 Open-Source Evaluation & Testing library for LLM Agents
High-fidelity, anycloud emulators running in your laptop. For DevOps programming, testing, and simulation.
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
89 skills and 38 specialized agents that enforce proven engineering practices for AI-assisted development. TDD, systematic debugging, parallel code review, and 10-gate development cycles — as a Claude
Autonomous AI agent with persistent memory, self-learning, and earned autonomy. Cognitive partner that remembers, learns, and evolves.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
⚡ Lightweight offline AI agent for local models. No cloud, no API keys — just your GPU.
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
MaiSaka, an LLM-based intelligent agent, is a digital lifeform devoted to understanding you and interacting in the style of a real human. She does not pursue perfection, nor does she seek efficiency;
MCP Workspace Server: A secure Model Context Protocol server providing file, git, and GitHub tools for AI assistants within a sandboxed project directory.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A production-ready runtime framework for agent apps with secure tool sandboxing, Agent-as-a-Service APIs, scalable deployment, full-stack observability, and broad framework compatibility.
Official MCP Servers for AWS
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Secure, Fast, and Extensible Sandbox runtime for AI agents.
A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l
The Unofficial and Awesome Home Assistant MCP Server
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
KohakuTerrarium is a general-purpose AI agent framework and batteries-included app for building, running, and composing self-contained agents and multi-agent teams, with built-in tools, sub-agents, pe
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
It's YOUR data. Take it back. Get your Garmin Connect data into a local SQLite database and AI ready (MCP server)
Invoke py.test as distutils command with dependency resolution
MCP server that gives any LLM its own computer — managed Docker workspaces with live browser, terminal, code execution, document skills, and autonomous sub-agents. Self-hosted, open-source, pluggable
See your agent think. Real-time observability dashboard for OpenClaw AI agents.
Autonomous knowledge base plugin for Claude Code - captures reserch, ideas, and decisions into an interlinked wiki with reserch-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou
Enhanced Proxmox MCP server with advanced virtualization management and full OpenAPI integration.
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo
Vibe-Skills is an all-in-one AI skills package. It seamlessly integrates expert-level capabilities and context management into a general-purpose skills package, enabling any AI agent to instantly upgr
Airut is a system for running Claude Code tasks from email and Slack. It handles workspace provisioning, container isolation, network sandboxing, session persistence, and cleanup — a secure foundation
A comprehensive evaluation framework for AI agents and LLM applications.
An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-s
MCP server to manage Facebook and Instagram Ads (Meta Ads)
Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Infrastructure that connects LLMs to ERPNext. Frappe Assistant Core works with the Model Context Protocol (MCP) to expose ERPNext functionality to any compatible Language Model
Code, Build and Evaluate agents - excellent Model and Skills/MCP/ACP Support
A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric" manner. LLM As Function, Prompt As Code. 一个简单的恰到
The memory system your AI agent deserves. 4-stage hybrid retrieval — Vector + BM25 + Knowledge Graph + Neural Reranker — in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb
The Best AI Agent Framework for Agent Collaboration.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool
An AI Gateway, registry, and proxy that sits in front of any MCP, A2A, or REST/gRPC APIs, exposing a unified endpoint with centralized discovery, guardrails and management. Optimizes Agent & Tool call
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
🚀 The fast, Pythonic way to build MCP servers and clients.
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin
The Multi-Agent Custom Automation Engine Solution Accelerator is an AI-driven system that manages a group of AI agents to accomplish tasks based on user input. Powered by Microsoft Agent Framework, Az
Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)
简洁的测试平台日志,波形(FSDB/VCD),根因分析MCP Server. A Simple and Universal MCP Server to Debug Testbench Simulation Failures Via Log Parsing And Waveform Analysis (FSDB/VCD)
Appwrite’s MCP server. Operating your backend has never been easier.
An open-source AI assistant framework with skills and agent architecture
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.
Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Nextcloud MCP Server
An MCP Server to utilize Codelogic's rich software dependency data in your AI programming assistant.
Project-agnostic, composable AI workflow automation via pi packages and Claude Code plugins.
🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️
PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,
MCP Server for Computer Use in Windows
Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
auto-wing is a tool that uses LLM to assist automated testing
Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
AI-powered bug bounty hunting from your terminal - recon, 20 vuln classes, autonomous hunting, and report generation. All inside Claude Code.
Lad MCP Server: Autonomous code & system design review for AI coding agents (Claude Code, Cursor, Codex, etc.). Features multi-model consensus via OpenRouter and context-aware reviews via Serena.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Core library used by SDKs for IBM Cloud Services
PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai
Test equality of unordered collections in pytest
The LLM Anti-Framework
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
Enterprise-grade distributed AI agent framework | Develop → Deploy → Observe | K8s-native | Dynamic DI | Auto-failover | Multi-LLM | Python + Java + TypeScript
pytest plugin for URL based testing
Autonomous Offensive Security Intelligence AI-powered multi-agent penetration testing
Published in CNCF Landscape: A MCP server for Kubernetes.
API to interact with the python pyproject.toml based projects
Benchmark for vector databases.
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Unified framework for building enterprise RAG pipelines with small, specialized models
754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Cop
Video editing MCP server for AI agents. 83 tools, 858 tests collected, 3 interfaces. Works with Claude Code, Cursor, and any MCP client. Local, fast, free.
Open-source, contract-driven data quality validation. Shift-left enforcement at the point of write — before data enters your pipeline.
RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis
Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.
Prompt Driven Development Command Line Interface
Memory library for building stateful agents
A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules — running loca
Agent samples built using the Strands Agents SDK.
Linkedin Automation Tool: Describe your product. Define your target market. The AI finds the leads for you.
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
3-tier agentic ChatOps (n8n + GPT-4o + Claude Code) implementing all 21 patterns from "Agentic Design Patterns" — solo operator managing 137 devices
Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz
YAO = Yielding AI Outcomes. A lightweight but rigorous system for creating, evaluating, packaging, and governing reusable agent skills.
Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.
MaverickMCP - Personal Stock Analysis MCP Server
Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993
Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us
MCP server for guiding Coding Agents via end-to-end requirements to implementation plan pipeline
LLM-powered Agent Runtime with Dynamic DAG Planning & Concurrent Execution
🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil
MCP Server for Simplenote integration with Claude Desktop
The LLM Evaluation Framework
A Multi-Agentic AI Assistant/Builder
Transform Claude Code into a full development team. 11 specialized agents (Architect, Engineer, QA, Security, UX, DevOps, and more), persistent memory across sessions, and 25,000+ on-demand skills. Wo
Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.
[Community Supported] Perforce P4 MCP Server is a Model Context Protocol (MCP) server that integrates with the Perforce P4 version control system.
Ham radio & GMRS gateway, repeater and packet radio — bridges two-way radios to Mumble, Broadcastify, and the internet. AIOC USB, RSPduo dual SDR, TH-9800/D75/KV4P CAT control, AI announcements, ADS-B
Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.
MCP server for searching and retrieving Claude Agent Skills using vector search
Search your files by talking to them - 100% offline
Human-supervised AI code generation using Plan-Do-Check-Act methodology with TDD and refactoring. Works as Claude Code skill or standalone prompts.
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Claude Code plugin for Ruby, Rails, Grape, PostgreSQL, Redis, and Sidekiq development
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
🛠️ Automate penetration testing with SploitGPT, an AI agent using Kali Linux tools for efficient security assessments and minimal user input.
Your AI-powered SWE teammate, built into your git workflow
A comprehensive suite of protocols, meta-prompts, and orchestration tools designed to streamline software development workflows, project management, and team collaboration. Includes the VibeCode Proto
🦀 The first autonomous hackathon agent stop assisting and start competing (🏆 Hackathon Champion Project).
A tool that compiles messy natural language prompts into a structured intermediate representation (IR) and optionally sends them to LLMs like ChatGPT for cleaner, more reliable responses.
220+ Claude Code skills & agent plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents — engineering, marketing, product, compliance, C-level advisory.
Keyring backend for Google Auth tokens
Route, manage, and analyze your LLM requests across multiple providers with a unified API interface
Autonomous AI agent that researches viral content, generates posts, publishes them, measures engagement — and rewrites its own strategy based on what worked. Self-learning loop powered by LangGraph +
Multi-agent swing trading system — automated screening, research, and execution with backtesting and live trading
⚙️ Enable AI agents to conduct autonomous penetration testing on any Linux distribution with a persistent and robust Model Context Protocol server.
KawaiiGPT — Open-source LLM gateway accessing DeepSeek, Gemini, and Kimi-K2 through reverse-engineered Pollinations API with no API keys required, built-in prompt injection capabilities for security r
Connect any LLM to OpenClaw — production-tested middleware for Qwen3-235B and beyond
A command-line interface tool for serving LLM using vLLM.
Control robots and physical hardware with natural language through Strands Agents.
Self-evolving AI agent framework with 5-layer safety gatekeeper. Agents observe failures, propose fixes, and safely apply them. Built on HKUDS/nanobot.
MCP Server for Apache Spark History Server. The bridge between Agentic AI and Apache Spark.
AI-powered PRD generation for Claude Code with taskmaster integration
Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)
arXiv MCP Server Client 🐙 enables AI assistants to search, retrieve, analyze, and summarize arXiv papers with features like author/category browsing, trends, and citation insights.
JSON Agents - A universal JSON-native standard for describing AI agents, their capabilities, tools, runtimes, and governance in a portable, framework-agnostic format. Based on RFC 8259, JSON Schema 2
🍀 Self-hosted multi-agent AI orchestrator — chat with Claude, Gemini & Copilot CLI from Telegram, WebEx, or browser. 5 runtimes, 17+ models, task scheduling, skill plugins.
The open framework for extensible & grounded AI agent orchestration.
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
A Model Context Protocol server that provides task orchestration capabilities for AI assistants
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
Workspace Architect is a zero-friction CLI tool that provides curated collections of specialized agents, instructions, and prompts to supercharge your GitHub Copilot experience.
minimize python source code to find bugs more easily
🧭 PromptDrifter – one‑command CI guardrail that catches prompt drift and fails the build when your LLM answers change.
YAML parser and emitter for Python with support for free-threading
Python LLM-RAG deep agent using LangChain, LangGraph and LangSmith built on Quart web microframework and served using Hypercorn ASGI and WSGI web server.
Broken RAG For The Broken Souls
Self-hosted autonomous AI agent — 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
🪈 Intelligent orchestration system that coordinates multiple AI coding assistants (Claude, Codex, Gemini CLI, Copilot CLI) to collaborate on complex software development tasks via REPL or a Vue/Nuxt
🔍 Automate penetration testing with an intelligent agent that organizes security assessments, leveraging local LLMs and Kali Linux for effective exploitation.
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen
🔍 Automate research tasks with the Autonomous Research Agent, utilizing intelligent queries and parallel searches to create concise, comprehensive reports.
Install your own AI DJ Being. She searches, downloads, listens, mixes, and generates music — autonomously. 30hrs for $0.04.
AI-powered group finance assistant using MCP architecture, Gemini LLM and Streamlit.
Automate red teaming by using AI to plan attacks, run security tools, move laterally, and escalate privileges in network environments.
PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.
Generate PlantUML Diagrams as PNG/SVG with Embedded Web Viewer
🚀 Transform existing codebases into MCP services with ease using Code2MCP's intelligent automation and minimal intrusion design.
Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services
GAN-inspired multi-agent system that autonomously builds full-stack web apps from a single prompt using Claude AI agents
🚀 Maximize your C# productivity with advanced techniques in strings, LINQ, and clean code, inspired by the book "Produtivo com C#."
🤖 Build intelligent, offline LLM agents with LangGraph and llama-cpp-python using this starter template for local, private tool-calling applications.
🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock
Looker REST API
Intelligent Model Context Protocol (MCP) server for AI-assisted API development. Generate mock servers from OpenAPI specs with advanced logging, performance analytics, and server discovery. Optimized
A Python-Script Based Generative AI platform
A stateful AI agent framework powered by the Cognitive Lattice to solve complex tasks with persistent memory and reliable tool orchestration.
