Search results for "tests"
SPDX parser and tools.
Python client for the official Notion API
Django extension to allow working with 'clusters' of models as a single unit, independently of the database
Utility to detect blocking calls in the async event loop
Python lib/cli for JSON/YAML schema validation
A Lucene query parser generating ElasticSearch queries and more !
Python package that interfaces with the Internet Archive's Wayback Machine APIs. Archive pages and retrieve archived pages easily.
Property-based testing framework for Open API and GraphQL based apps
Factory boy classes for wagtail
unittest subTest() support and subtests fixture
An international phone number field for django models.
The PEX packaging toolchain.
Pytest plugin for aiohttp support
Generic automation framework for acceptance testing and robotic process automation (RPA)
Brings async, event-driven capabilities to Django.
Postgresql fixtures and fixture factories for Pytest.
Typed library that provides an ORM wrapper for tmux, a terminal multiplexer.
A Python implement of Agent Client Protocol (ACP, by Zed Industries)
A pytest plugin powered by VCR.py to record and replay HTTP traffic
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML.
behave is behaviour-driven development, Python style
A record linkage toolkit for linking and deduplication
A tool to determine the content type of a file with deep learning
Hamcrest framework for matcher objects
Python framework for fast Vector Space Modelling
Scapy: interactive packet manipulation tool
Python tool and library for decrypting and encrypting MS Office files using a password or other keys
Asyncer, async and await, focused on developer experience.
Pytest plugin to randomly order tests and control random.seed.
Dagster is an orchestration platform for the development, production, and observation of data assets.
Implements a fake file system that mocks the Python file system modules.
Dependency injection framework for Python
The build backend used by PDM that supports latest packaging standards
Contains the API for end users as well as helper functions and classes to build Allure adapters for Python test frameworks
A sphinx extension for designing beautiful, view size responsive web components.
A pytest wrapper with fixtures for Playwright to automate web browsers
Community maintained hooks for PyInstaller
Fiona reads and writes spatial data files
Setuptools Rust extension plugin
A light-weight and flexible data validation and testing tool for statistical data objects.
A modern Python package and dependency manager supporting the latest PEP standards
asyncio rate limiter, a leaky bucket implementation
Lightweight, extensible schema and data validation tool for Pythondictionaries.
A pytest plugin to report test results as JSON files
An asynchronous networking framework written in Python
Pytest Plugin to disable socket calls during tests
Extended JWT integration with Flask
Developer-friendly load testing framework
Framework for large language model evaluations
PyTorch native Metrics
Toolbox for imbalanced dataset in machine learning
Vectorized spatial vector file format I/O using GDAL/OGR
Client library for the Qdrant vector search engine
Parameterized testing with any Python test framework
A lightweight console printing and formatting toolkit
Super lightweight function registries for your library
Modern high-performance serialization utilities for Python
A refreshing functional take on deep learning, compatible with your favorite libraries
Industrial-strength Natural Language Processing (NLP) in Python
Orchestrate your dbt projects in Airflow
Travel through time in your tests.
A versatile test fixtures replacement based on thoughtbot's factory_bot for Ruby.
The property-based testing library for Python
GraphQL Framework for Python
pytest plugin to re-run tests to eliminate flaky failures
A Python module for working with the Tableau Server REST API.
tox is a generic virtualenv management and test command line tool
A set of server components for JupyterLab and JupyterLab like applications.
A high-level Python web framework that encourages rapid development and clean, pragmatic design.
An abstract syntax tree for Python with inference support.
The backend—i.e. core services, APIs, and REST endpoints—to Jupyter web applications.
Jupyter protocol implementation and client libraries
The Slack API Platform SDK for Python
pytest xdist plugin for distributed testing, most importantly across multiple CPUs
An autocompletion tool for Python that can be used for text editors.
Traitlets Python configuration system
A collection of framework independent HTTP protocol utils.
the blessed package to manage your versions by scm tags
Pytest plugin for measuring coverage.
Poetry PEP 517 Build Backend
Python plotting package
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Powerful data structures for data analysis, time series, and statistics
Most extensible Python build backend with support for C/C++ extension modules
Process executor (not only) for tests.
pytest plugin for test session metadata
Ultra-lightweight pure Python package to check if a file is binary or text.
User authentication and session management for Flask.
A Git URL parsing module (supports parsing and rewriting)
LLM-powered knowledge base from your Claude Code, Codex CLI, Copilot, Cursor & Gemini sessions. Karpathy's LLM Wiki pattern — implemented and shipped.
89 skills and 38 specialized agents that enforce proven engineering practices for AI-assisted development. TDD, systematic debugging, parallel code review, and 10-gate development cycles — as a Claude
Autonomous AI agent with persistent memory, self-learning, and earned autonomy. Cognitive partner that remembers, learns, and evolves.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
MCP-NixOS - Model Context Protocol Server for NixOS resources
Security and best-practices scanner for AI Plugins, covering Codex, Claude, Opencode, Gemini & more. Scores trust for plugins 0-100.
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
A Python-based low-modeling low-code open-source platform for smart and AI-enhanced software
A MCP (Model Context Protocol) server for interacting with dbt.
Official MCP Servers for AWS
剧本分镜智能体(PenShot):剧本→分镜→片段→prompt | 基于 LangGraph+LLM,自动解析任意格式剧本,生成 Sora/Veo/Runway 等模型可用的连贯text-to-video提示词。保持角色/剧情跨片段一致,支持 MCP/REST API/函数调用 | Python库 + A2A集成。(LLM-powered screenplay-to-video-prompt a
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
Secure, Fast, and Extensible Sandbox runtime for AI agents.
The Unofficial and Awesome Home Assistant MCP Server
Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM — local or cloud.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
Seedance 2.0 Shot Design Skills
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi
Declarative Agent Orchestration. Ship while you sleep.
It's YOUR data. Take it back. Get your Garmin Connect data into a local SQLite database and AI ready (MCP server)
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
Invoke py.test as distutils command with dependency resolution
MCP server that gives any LLM its own computer — managed Docker workspaces with live browser, terminal, code execution, document skills, and autonomous sub-agents. Self-hosted, open-source, pluggable
Security intelligence API and MCP server for AI agents. 25 tools, 35+ endpoints: CVE/EPSS/KEV, domain recon, SSL, IP reputation, threat intel, email security, code scanning. Free, no signup.
Enhanced Proxmox MCP server with advanced virtualization management and full OpenAPI integration.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo
Vibe-Skills is an all-in-one AI skills package. It seamlessly integrates expert-level capabilities and context management into a general-purpose skills package, enabling any AI agent to instantly upgr
Airut is a system for running Claude Code tasks from email and Slack. It handles workspace provisioning, container isolation, network sandboxing, session persistence, and cleanup — a secure foundation
Curated directory of terminal-native AI coding agents and the harnesses that orchestrate them. Covers open-source tools (Pi, OpenCode, Aider, Goose), platform agents (Claude Code, Codex, Gemini CLI),
An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-s
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Infrastructure that connects LLMs to ERPNext. Frappe Assistant Core works with the Model Context Protocol (MCP) to expose ERPNext functionality to any compatible Language Model
Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, and
44 plug-and-play skills for OpenClaw — self-modifying AI agent with cron scheduling, security guardrails, persistent memory, knowledge graphs, and MCP health monitoring. Your agent teaches itself new
The agent that grows with you
台灣司法院判決 + 全國法規資料庫 MCP server · Query Taiwan legal data from any MCP AI agent
A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric" manner. LLM As Function, Prompt As Code. 一个简单的恰到
The memory system your AI agent deserves. 4-stage hybrid retrieval — Vector + BM25 + Knowledge Graph + Neural Reranker — in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb
The Best AI Agent Framework for Agent Collaboration.
MCP server for Fabric Real-Time Intelligence (https://aka.ms/fabricrti) supporting tools for Eventhouse (https://aka.ms/eventhouse), Azure Data Explorer (https://aka.ms/adx, and other RTI services (co
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool
An AI Gateway, registry, and proxy that sits in front of any MCP, A2A, or REST/gRPC APIs, exposing a unified endpoint with centralized discovery, guardrails and management. Optimizes Agent & Tool call
Plan-first AI workflow plugin for Claude Code, OpenAI Codex, and Factory Droid. Zero-dep task tracking, worker subagents, Ralph autonomous mode, cross-model reviews.
简洁的测试平台日志,波形(FSDB/VCD),根因分析MCP Server. A Simple and Universal MCP Server to Debug Testbench Simulation Failures Via Log Parsing And Waveform Analysis (FSDB/VCD)
Appwrite’s MCP server. Operating your backend has never been easier.
Unify Claude Code, Codex, Cursor, and Gemini CLI with persistent context, governance, and multi-model debate. 186 MCP tools. 123 tests.
Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.
An open-source AI assistant framework with skills and agent architecture
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.
AINL helps turn AI from "a smart conversation" into "a structured worker." It is designed for teams building AI workflows that need multiple steps, state and memory, tool use, repeatable execution, v
Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a
AI-powered development framework with task management, 41 agents, 83 skills, and MCP tools for Cursor, Claude Code, Gemini, Codex & OpenCode. File-based memory that survives across sessions.
Open-source sandboxes where coding agents build and deploy. Spin up isolated environments where Claude Code, Cursor, and other agents code and deploy software.
An MCP Server to utilize Codelogic's rich software dependency data in your AI programming assistant.
PowerMem: Your AI-Powered Long-Term Memory — Accurate, Agile, Affordable. Also friendly support for the OpenClaw Memory Plugin.
See how you really use AI — X-ray your AI coding sessions locally
NEXO Brain — Shared brain for AI agents. Persistent memory, semantic RAG, natural forgetting, metacognitive guard, trust scoring, 150+ MCP tools. Works with Claude Code, Codex, Claude Desktop & any MC
A helper class for handling configuration defaults of packaged apps gracefully.
Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
Tool that just makes your open source project better using LLM agents
Very basic event publishing system
Harness LLMs with Multi-Agent Programming
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
MCP server that saves 97% of AI coding tokens — your AI reads code structurally, not file-by-file. Faster, cheaper, smarter.
Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js — 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.
🤖 MCP server for Apple Mail - Manage emails with AI using Claude Desktop. Search, send, organize mail with natural language.
Lad MCP Server: Autonomous code & system design review for AI coding agents (Claude Code, Cursor, Codex, etc.). Features multi-model consensus via OpenRouter and context-aware reviews via Serena.
AI-powered spec generation and review using multi-repo code graph intelligence for backend teams that ship to production.
Video editing MCP server for AI agents. 83 tools, 858 tests collected, 3 interfaces. Works with Claude Code, Cursor, and any MCP client. Local, fast, free.
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr
Open security scanner for AI supply chain: agents, MCP, containers, cloud, GPU, and runtime with blast-radius analysis.
PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai
OpenClaw reimagined in pure Python — autonomous AI agent with memory, RAG, skills, web dashboard, voice input, daemon, and multi-channel support.
The official Amplitude backend Python SDK for server-side instrumentation.
Production ready. AI Agent Workflow System for Claude Code
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
AI Agent Backend Platform on FastAPI — MCP server + AI orchestration + async DDD architecture. Zero-boilerplate CRUD, auto domain discovery, 14 Claude Code AI development skills.
Production-ready RAG Framework (Python/FastAPI). 1-line config swaps: 6 Vector DBs (Weaviate, Pinecone, Qdrant, ChromaDB, pgvector, MongoDB), 5 LLMs (Gemini, OpenAI, Claude, Ollama, OpenRouter). OpenA
pytest plugin for URL based testing
Buddhist Digital Text Platform — 9,200+ texts, 500+ sources, 8 UI languages, AI Q&A (RAG), knowledge graph, full-text search
Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
Published in CNCF Landscape: A MCP server for Kubernetes.
🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️
Open Framework for AI Agents to play Red Alert through Reinforcement Learning
PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,
Multi-agent memory consistency platform. We're hiring contributors—check HIRING.md
AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issues—such as security vulnerabilities, bugs, and maintainability concerns.
Benchmark for vector databases.
This package provides a DateTime data type, as known from Zope. Unless you need to communicate with Zope APIs, you're probably better off using Python's built-in datetime module.
Unified framework for building enterprise RAG pipelines with small, specialized models
AI-powered bug bounty hunting from your terminal - recon, 20 vuln classes, autonomous hunting, and report generation. All inside Claude Code.
A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
autonomous agent with access to a tool library
Security guardrails for Claude Code, MCP tools, and Claude cowork workflows. Local-first modular YARA-style guard packs for secrets, exfiltration, prompt injection, MCP abuse, and risky agent actions.
A Model Context Protocol (MCP) server for Autodesk ShotGrid/Flow Production Tracking (FPT) with comprehensive CRUD operations and data management capabilities.
Shell and coding agent on mcp clients
Creates JUnit XML test result documents that can be read by tools such as Jenkins
Open-source, contract-driven data quality validation. Shift-left enforcement at the point of write — before data enters your pipeline.
Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.
Observal is an AI agent registry with first in class observabilty and eval framework
Prompt Driven Development Command Line Interface
autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.
A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules — running loca
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.
3-tier agentic ChatOps (n8n + GPT-4o + Claude Code) implementing all 21 patterns from "Agentic Design Patterns" — solo operator managing 137 devices
Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.
Autonomous Offensive Security Intelligence AI-powered multi-agent penetration testing
YAO = Yielding AI Outcomes. A lightweight but rigorous system for creating, evaluating, packaging, and governing reusable agent skills.
META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡
Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.
MaverickMCP - Personal Stock Analysis MCP Server
Open-Source Intelligent Command Layer
OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
A lightweight AI agent framework for vertical domain applications | 面向垂域应用的轻量级 AI Agent 框架
Autonomous Web3 security audit agent for Claude Code
🦀 Agentic RAG for drug intelligence · 57 skills · 15 task categories · DTI · ADR · DDI · PGx · Repurposing · Powered by LangGraph
The Developer's Guide to AI - A Field Guide for the Working Developer
Agentic AI assistant on Telegram, powered by Claude Code. Runs locally with shell access, spec-driven PR reviews, layered security, persistent memory, and scheduled jobs. Your machine, your data, your
Zero-dependency browser automation CLI. 70+ commands, 10 test assertions, smart commands (click/fill by text — no LLM needed). MCP server for AI agents with 500x fewer tokens. Extract, observe, script
Description: Self-hosted graph-based associative memory for personal AI agents. Spreading activation, emotional weighting, zero LLM cost.
Local-first Agentic Memory Layer for MCP Agents • 25 tools • Hybrid search (FTS5 + vector + MMR) • GDPR • 100% local
AI skills that turns coding agents into UiPath experts.
LLM-powered Agent Runtime with Dynamic DAG Planning & Concurrent Execution
Claude Code skills, architectural principles, and alternative approaches for AI-assisted development
A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution — inspired by Herme
Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing
Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-
MCP Server for Simplenote integration with Claude Desktop
The LLM Evaluation Framework
📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz
This project implements a comprehensive framework for Knowledge Graph Retrieval Augmented Generation (KG-RAG). It focuses on financial data from SEC 10-Q filings and explores how knowledge graphs can
MCP server for Elgato Stream Deck control — set buttons, manage pages, wire actions
OpenBrep: 用自然语言驱动 ArchiCAD GDL 库对象的创建、修改与编译
Library for serializing and deserializing Python Objects to and from JSON and XML.
LLM proxy to observe and debug what your AI agents are doing.
'Turn functions and methods into fully controllable objects'
MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer safe
Utilities for spying on function calls in unit tests.
Auto-Use Computer Use — drives your OS, browser, scours the web, writes your code. One agent, end to end.
Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.
MCP server for searching and retrieving Claude Agent Skills using vector search
One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.
Search your files by talking to them - 100% offline
Human-supervised AI code generation using Plan-Do-Check-Act methodology with TDD and refactoring. Works as Claude Code skill or standalone prompts.
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Exit pytest test session with custom exit code in different scenarios
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Starter app for building AI SaaS (RAG, Agentic workflow) applications
Personal OS agent that learns who you are, detects life patterns, and grows smarter about you every day. Memory + Cron + Atropos RL
Claude Code plugin for Ruby, Rails, Grape, PostgreSQL, Redis, and Sidekiq development
A comprehensive MCP-based todo management system, that serves as a central nervous system for Madness Interactive, a multi-project task coordination workshop.
Your AI-powered SWE teammate, built into your git workflow
🦀 The first autonomous hackathon agent stop assisting and start competing (🏆 Hackathon Champion Project).
A tool that compiles messy natural language prompts into a structured intermediate representation (IR) and optionally sends them to LLMs like ChatGPT for cleaner, more reliable responses.
An open-source, self-hosted API that turns standard email providers (Mailgun, SES, SendGrid) into "Inbox-as-a-Service" for AI Agents.
Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.
Pytest plugin to generate json report in CTRF (Common Test Report Format)
Autonomous AI agent that researches viral content, generates posts, publishes them, measures engagement — and rewrites its own strategy based on what worked. Self-learning loop powered by LangGraph +
Continuous prompt optimization for AI applications. Collect feedback, auto-optimize with DSPy, deliver as reviewable PRs.
AI-powered multi-agent system that transforms Telegram into an intelligent automation hub — routing user intent across vision, browser, desktop, and code agents using dynamic model orchestration.
Connect any LLM to OpenClaw — production-tested middleware for Qwen3-235B and beyond
🛠️ Automate penetration testing with SploitGPT, an AI agent using Kali Linux tools for efficient security assessments and minimal user input.
Turn Claude Code from a chat assistant into an autonomous coding system
AITP Research Charter and Protocol: a charter-first protocol, contract, and adapter surface for AI-assisted theoretical physics research.
Self-evolving AI agent framework with 5-layer safety gatekeeper. Agents observe failures, propose fixes, and safely apply them. Built on HKUDS/nanobot.
Autonomous overnight codebase improvement agent for Claude Code. Run it before bed, wake up to production-ready fixes.
AI co-pilot for ComfyUI — 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
AI-powered PRD generation for Claude Code with taskmaster integration
AI patient advocacy tool for cancer treatment. Understand labs, find clinical trials, track treatment — all from your phone. Open source, used in active treatment.
Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)
Assistant IA avancé (RAG, outils, Légifrance, OCR, skills, export de fichiers, historique) conçu principalement pour un usage avec AlbertAPI (DiNum)
Django extension for creating forms that vary according to user permissions
Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.
A productive AI coworker that learns, self-improves, and ships work.
arXiv MCP Server Client 🐙 enables AI assistants to search, retrieve, analyze, and summarize arXiv papers with features like author/category browsing, trends, and citation insights.
JSON Agents - A universal JSON-native standard for describing AI agents, their capabilities, tools, runtimes, and governance in a portable, framework-agnostic format. Based on RFC 8259, JSON Schema 2
CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th
🍀 Self-hosted multi-agent AI orchestrator — chat with Claude, Gemini & Copilot CLI from Telegram, WebEx, or browser. 5 runtimes, 17+ models, task scheduling, skill plugins.
Self-hosted autonomous AI agent — 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests
The open framework for extensible & grounded AI agent orchestration.
Autonomous coding agent with web research (Recon), adversarial plan debate, 5-tier cognitive memory, multi-model routing (Gemini + DeepSeek + Ollama), 24/7 loops, and $0 local mode. Apache 2.0.
A Model Context Protocol server that provides task orchestration capabilities for AI assistants
🧭 PromptDrifter – one‑command CI guardrail that catches prompt drift and fails the build when your LLM answers change.
Broken RAG For The Broken Souls
Provide full Python API access to NotebookLM features, including advanced functions beyond the web interface, via CLI and AI agent integration.
AI-agent-friendly PyTorch research pipeline — one YAML config drives preflight, training, Optuna HPO, and real-time TUI monitoring
Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your
🪈 Intelligent orchestration system that coordinates multiple AI coding assistants (Claude, Codex, Gemini CLI, Copilot CLI) to collaborate on complex software development tasks via REPL or a Vue/Nuxt
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen
MCP server for 28 security frameworks (ISO 27001, NIST CSF 2.0, NIST 800-53, SOC 2, IEC 62443)
Practical CLI tool for maintaining open source repositories.
Autonomous multi-agent system that turns tasks into code, PRs, and self-healing workflows
Lightweight coordination server for autonomous AI coding agents — task claiming, file locks, message passing, and health monitoring over REST
AI-powered group finance assistant using MCP architecture, Gemini LLM and Streamlit.
🔍 Automate penetration testing with an intelligent agent that organizes security assessments, leveraging local LLMs and Kali Linux for effective exploitation.
Automate red teaming by using AI to plan attacks, run security tools, move laterally, and escalate privileges in network environments.
A collection of Summoner clients and agents featuring example implementations and reusable templates
Hybrid cloud-local AI Employee that runs 24/7 on a cloud VM, monitors Gmail/WhatsApp, drafts responses, and queues approvals via git-synced Obsidian vault. Human-in-the-loop safety gates for email, so
Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services
GAN-inspired multi-agent system that autonomously builds full-stack web apps from a single prompt using Claude AI agents
🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock
A Python-based framework for building multi-agent systems with LLMs. Currently in pre-launch alpha.
🔍 Enable AI-driven network security scanning with a production-ready Nmap MCP server supporting diverse tools, scan types, and timing templates.
ACR Control Plane: runtime control & governance for agentic AI (six-pillar enforcement).
Mistral-common is a library of common utilities for Mistral AI.
Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paths.
AI-powered self-learning OS with I Ching philosophy | 融合易经哲学的自学型 AI 操作系统
A stateful AI agent framework powered by the Cognitive Lattice to solve complex tasks with persistent memory and reliable tool orchestration.
Intelligent Model Context Protocol (MCP) server for AI-assisted API development. Generate mock servers from OpenAPI specs with advanced logging, performance analytics, and server discovery. Optimized
Medical-AI is a AI framework specifically for Medical Applications https://aibharata.github.io/medicalAI/
AI News Scraper & Semantic Search: A Python application that scrapes news articles, uses GenAI to generate summaries and identify topics, and provides semantic search capabilities through vector embed
SearXNG tool plugin for https://llm.datasette.io/
Demo RAG API (FastAPI, OpenAI, ChromaDB, Docker) automatically generated using the OpenAI Codex CLI tool. Highlights Codex's capability for rapid, complex application development.
