freshcrate

Search results for "validation"

92 results found (Python)
ai-agents-reality-check · 0.0.0 · 🌿 Growing · ⭐57

Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica

PraisonAI · v4.6.25 · 🌳 Mature · ⭐6,900

PraisonAI 🦞 - Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

npcpy · v1.4.21 · 🌳 Mature · ⭐1,287

The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

mcp-memory-service · v10.39.1 · 🌳 Mature · ⭐1,643

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

Auto-claude-code-research-in-sleep · v0.4.4 · 🌳 Mature · ⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) - Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in - works wi

cyllama · 0.2.11 · 🌱 Seedling · ⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

ProxmoxMCP-Plus · v0.2.1 · 🌿 Growing · ⭐124

Enhanced Proxmox MCP server with advanced virtualization management and full OpenAPI integration.

pydantic-ai · v1.84.1 · 🌳 Mature · ⭐16,274

AI Agent Framework, the Pydantic way

openclaw-superpowers · main@2026-04-17 · 🌿 Growing · ⭐50

44 plug-and-play skills for OpenClaw - self-modifying AI agent with cron scheduling, security guardrails, persistent memory, knowledge graphs, and MCP health monitoring. Your agent teaches itself new

cognithor · v0.92.2 · 🌿 Growing · ⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

meta-ads-mcp · 1.0.86 · 🌿 Growing · ⭐762

MCP server to manage Facebook and Instagram Ads (Meta Ads)

pydantic-deepagents · 0.3.15 · 🌿 Growing · ⭐648

Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, and

CodeGen · 0.0.0 · 🌳 Mature · ⭐773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

trading · main@2026-04-21 · 🌱 Seedling · ⭐27

Paper-first SPY options validation platform with broker-backed scorecards, hard risk gates, paired-trade accounting, and live dashboards.

mcp-client-for-ollama · v0.28.0 · 🌿 Growing · ⭐599

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

apple-mail-mcp · v0.4.1 · 🌱 Seedling · ⭐40

🤖 MCP server for Apple Mail - Manage emails with AI using Claude Desktop. Search, send, organize mail with natural language.

AgenticX · v0.3.7 · 🌿 Growing · ⭐105

AgenticX is a unified, production-ready multi-agent platform - Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

ISC-Bench · v0.0.5 · 🌿 Growing · ⭐786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

synaptic-memory · v0.16.0 · 🌱 Seedling · ⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

cyber-pilot · v3.7.0-beta · 🌿 Growing · ⭐53

Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.

ainativelang · v1.4.6 · 🌿 Growing · ⭐66

AINL helps turn AI from "a smart conversation" into "a structured worker." It is designed for teams building AI workflows that need multiple steps, state and memory, tool use, repeatable execution, v

fabric-rti-mcp · 0.5.3 · 🌿 Growing · ⭐107

MCP server for Fabric Real-Time Intelligence (https://aka.ms/fabricrti) supporting tools for Eventhouse (https://aka.ms/eventhouse), Azure Data Explorer (https://aka.ms/adx), and other RTI services (co

logfire · v4.32.1 · 🌿 Growing · ⭐4,161

AI observability platform for production LLM and agent systems.

mcp · 2026.04.20260414152327 · 🌿 Growing · ⭐8,740

Official MCP Servers for AWS

claude-ads · v1.5.1 · 🌿 Growing · ⭐2,207

Comprehensive paid advertising audit & optimization skill for Claude Code. 225+ checks across Google, Meta, YouTube, LinkedIn, TikTok, Microsoft & Apple Search Ads with weighted scoring, parallel agen

fastmcp · v3.2.4 · 🌿 Growing · ⭐24,460

🚀 The fast, Pythonic way to build MCP servers and clients.

fastapi-agent-blueprint · v0.4.0 · 🌱 Seedling · ⭐17

AI Agent Backend Platform on FastAPI - MCP server + AI orchestration + async DDD architecture. Zero-boilerplate CRUD, auto domain discovery, 14 Claude Code AI development skills.

agentic-fleet-hub · master@2026-04-21 · 🌿 Growing · ⭐57

Self-hosted orchestration layer for autonomous AI agent teams. Shared memory, heartbeat scheduling, vault-first secrets, and cross-model peer review - one command to deploy.

honcho · main@2026-04-21 · 🌿 Growing · ⭐2,030

Memory library for building stateful agents

aura · main@2026-04-21 · 🌱 Seedling · ⭐47

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules - running loca

LLM-Agent-Paper-daily · main@2026-04-21 · 🌱 Seedling · ⭐20

Automatically update LLM-agent papers daily using GitHub Actions (updates every 12 hours)

Project_Infinity · main@2026-04-21 · 🌱 Seedling · ⭐27

Project Infinity leverages MCP and Graph RAG to turn LLMs into a professional D&D 5e Game Master, governed by a dedicated dice server and a persistent player database for a truly consistent adventure.

awesome-code-agents · main@2026-04-20 · 🌿 Growing · ⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding - they're redefining how software changes the world.

skills-vote · main@2026-04-19 · 🌱 Seedling · ⭐31

The Next-Gen Agent-Native Skill Recommendation Engine

AGI-Alpha-Agent-v0 · main@2026-04-18 · 🌿 Growing · ⭐283

META‑AGENTIC α‑AGI 👁️✨ - Mission 🎯 End‑to‑end: Identify → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚑

security-investigator · main@2026-04-18 · 🌿 Growing · ⭐142

Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.

evals · v0.1.15 · 🌿 Growing · ⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

crewAI · 1.14.2 · 🌿 Growing · ⭐48,611

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

SciAgent-Skills · main@2026-04-17 · 🌿 Growing · ⭐93

Life sciences computational skills for scientific AI agents

opentulpa · main@2026-04-17 · 🌱 Seedling · ⭐26

Self-hosted personal AI agent that lives in your DMs. Describe any workflow: triage Gmail, pull a Giphy feed, build a Slack bot, monitor markets. It writes the code, runs it, schedules it, and saves i

maverick-mcp · main@2026-04-17 · 🌿 Growing · ⭐479

MaverickMCP - Personal Stock Analysis MCP Server

OpenClawProBench · main@2026-04-15 · 🌿 Growing · ⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

ha-mcp · v7.3.0 · 🌿 Growing · ⭐2,201

The Unofficial and Awesome Home Assistant MCP Server

google_workspace_mcp · v1.19.0 · 🌿 Growing · ⭐2,087

Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool

cognitive-dissonance-dspy · main@2026-04-14 · 🌿 Growing · ⭐276

A multi-agent LLM system for detecting and resolving cognitive dissonance.

claude-bug-bounty · v4.0.0 · 🌿 Growing · ⭐1,690

AI-powered bug bounty hunting from your terminal - recon, 20 vuln classes, autonomous hunting, and report generation. All inside Claude Code.

deep-research-mcp · main@2026-04-13 · 🌿 Growing · ⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

The Multi-Agent Custom Automation Engine Solution Accelerator is an AI-driven system that manages a group of AI agents to accomplish tasks based on user input. Powered by Microsoft Agent Framework, Az

EvoScientist · v0.0.7 · 🌿 Growing · ⭐2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

instructor · v1.15.1 · 🌱 Seedling · ⭐12,743

structured outputs for llms

datagouv-mcp · v0.2.23 · 🌿 Growing · ⭐1,216

Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.

chak-ai · v0.3.1 · 🌿 Growing · ⭐211

A simple, yet handy, LLM gateway.

mcp-gateway-registry · v1.0.18 · 🌿 Growing · ⭐576

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

kuzu-memory · v1.12.9 · 🌱 Seedling · ⭐22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

AgentQuant · 0.0.0 · 🌱 Seedling · ⭐87

Autonomous quantitative trading research platform that transforms stock lists into fully backtested strategies using AI agents, real market data, and mathematical formulations, all without requiring a

memora · v0.2.27 · 🌱 Seedling · ⭐386

Give your AI agents persistent memory.

kai · v1.4.0 · 🌱 Seedling · ⭐28

Agentic AI assistant on Telegram, powered by Claude Code. Runs locally with shell access, spec-driven PR reviews, layered security, persistent memory, and scheduled jobs. Your machine, your data, your

agent-actions · v0.1.12 · 🌱 Seedling · ⭐4

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

python-sdk · v1.27.0 · 🌱 Seedling · ⭐22,595

The official Python SDK for Model Context Protocol servers and clients

lm-proxy · v3.2.2 · 🌱 Seedling · ⭐111

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI; use as library or standalone service.

agent2 · v0.1.0 · 🌱 Seedling · ⭐25

The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.

open-responses-server · v0.4.3 · 🌱 Seedling · ⭐161

Wraps any OpenAI API interface as a Responses API with MCP support so that it works with Codex, adding any missing stateful features. Ollama and vLLM compliant.

vikramaditya · main@2026-04-20 · 🌱 Seedling · ⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) - it hunts, it reports. Inspired by the Obsidian Order.

OSA · v0.2.10 · 🌱 Seedling · ⭐139

Tool that just makes your open source project better using LLM agents

Standard · 0.0.0 · 🌱 Seedling · ⭐18

JSON Agents - A universal JSON-native standard for describing AI agents, their capabilities, tools, runtimes, and governance in a portable, framework-agnostic format. Based on RFC 8259, JSON Schema 2

Buildable · 0.0.0 · 🌱 Seedling · ⭐2

AI-powered web app builder - describe it, build it, ship it. 2-agent LangGraph system (Sonnet 4.5 + o4-mini) generates React apps from natural language with live preview and one-click deploy.

PromptDrifter · main@2026-04-19 · 🌱 Seedling · ⭐8

🧭 PromptDrifter – one‑command CI guardrail that catches prompt drift and fails the build when your LLM answers change.

KawaiiGPT · KawaiiGPT · 🌱 Seedling · ⭐831

KawaiiGPT - Open-source LLM gateway accessing DeepSeek, Gemini, and Kimi-K2 through reverse-engineered Pollinations API with no API keys required, built-in prompt injection capabilities for security r

project-codeguard · v1.3.1 · 🌱 Seedling · ⭐123

Project CodeGuard is an open-source, model-agnostic security framework that embeds secure-by-default practices into AI coding agent workflows. It provides comprehensive security rules that guide AI as

CORE · v2.2.2 · 🌱 Seedling · ⭐30

A thing that uses AI to write perfect applications. For those who want to know how: a governance runtime enforcing immutable constitutional rules on AI coding agents.

kubectl-mcp-server · v1.24.0 · 🌱 Seedling · ⭐865

Published in CNCF Landscape: An MCP server for Kubernetes.

mcp-task-orchestrator · v1.8.0 · Dormant · ⭐25

A Model Context Protocol server that provides task orchestration capabilities for AI assistants

Zen-Ai-Pentest · v3.0.0 · 🌱 Seedling · ⭐279

🛡⚔️ AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting 🛡⚔️

Geneclaw · v0.1.0 · 🌱 Seedling · ⭐34

Self-evolving AI agent framework with 5-layer safety gatekeeper. Agents observe failures, propose fixes, and safely apply them. Built on HKUDS/nanobot.

cloneme · 0.0.0 · Dormant · ⭐38

CloneMe is an advanced AI platform that builds your digital twin, an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

LLM-API-Key-Proxy · main/build-20260123-1-bf7ab7e · 🌱 Seedling · ⭐448

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

DOX · main@2026-04-15 · 🌱 Seedling · ⭐1

Broken RAG For The Broken Souls

Comfy-Cozy · v4.0.0 · 🌱 Seedling · ⭐3

AI co-pilot for ComfyUI - 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.

PromptManager · master@2026-04-12 · 🌱 Seedling · ⭐3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.

summoner-agents · v1.1.0 · 🌱 Seedling · ⭐24

A collection of Summoner clients and agents featuring example implementations and reusable templates

uk-due-diligence-mcp · v1.0.4 · 🌱 Seedling · ⭐1

UK due diligence MCP server - Companies House, corporate research, compliance checks

Grinta-Agent · main@2026-04-20 · 🌱 Seedling · ⭐1

Local-first autonomous coding agent that plans, executes, validates, and finishes software tasks end-to-end.

qa-agent · v0.2.1 · 🌱 Seedling · ⭐1

An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga

Legion · v0.1.3 · Dormant · ⭐116

A Python-based framework for building multi-agent systems with LLMs. Currently in pre-launch alpha.

vllm-cli · v0.2.5 · Dormant · ⭐487

A command-line interface tool for serving LLM using vLLM.

Agentic-AI-Pipeline · v1.0.0 · Dormant · ⭐57

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

security-controls-mcp · v1.1.0 · 🌱 Seedling

MCP server for 28 security frameworks (ISO 27001, NIST CSF 2.0, NIST 800-53, SOC 2, IEC 62443)

medicalAI · v1.2.9-rc · ⚰️ Archived · ⭐21

Medical-AI is an AI framework specifically for Medical Applications https://aibharata.github.io/medicalAI/