freshcrate

Search results for "assessment"

Clear filters
26 results found (Python)
opikπŸ“2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

LLM-Agents-Ecosystem-HandbookπŸ“0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai-legal-claudeπŸ“0.0.0🌳 Mature⭐708

AI Legal Assistant skill for Claude Code. Contract review, risk analysis, NDA generation, compliance auditing, negotiation strategy, and PDF reports β€” 14 skills, 5 parallel agents. If you want to lear

mcpπŸ“2026.04.20260414152327🌿 Growing⭐8,740

Official MCP Servers for AWS

claude-adsπŸ“v1.5.1🌿 Growing⭐2,207

Comprehensive paid advertising audit & optimization skill for Claude Code. 225+ checks across Google, Meta, YouTube, LinkedIn, TikTok, Microsoft & Apple Search Ads with weighted scoring, parallel agen

LLM-Agent-Paper-dailyπŸ“main@2026-04-21🌱 Seedling⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

garmin-givemydataπŸ“v0.1.10🌿 Growing⭐61

It's YOUR data. Take it back. Get your Garmin Connect data into a local SQLite database and AI ready (MCP server)

OmicsClawπŸ“main@2026-04-18🌿 Growing⭐116

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

security-investigatorπŸ“main@2026-04-18🌿 Growing⭐142

Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.

evalsπŸ“v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

maverick-mcpπŸ“main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

OpenClawProBenchπŸ“main@2026-04-15🌿 Growing⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

deep-research-mcpπŸ“main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

cyber-pilotπŸ“v3.7.0-beta🌿 Growing⭐53

Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.

mcp-gateway-registryπŸ“v1.0.18🌿 Growing⭐576

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

agent-actionsπŸ“v0.1.12🌱 Seedling⭐4

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

Anthropic-Cybersecurity-SkillsπŸ“v1.2.0🌱 Seedling⭐4,262

754 structured cybersecurity skills for AI agents Β· Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF Β· agentskills.io standard Β· Works with Claude Code, GitHub Cop

vikramadityaπŸ“main@2026-04-20🌱 Seedling⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) β€” it hunts, it reports. Inspired by the Obsidian Order.

synapse-aiπŸ“v1.0.0🌱 Seedling⭐1

Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM β€” local or cloud.

watchtowerπŸ“1.0.2🌱 Seedling⭐51

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us

LightAgentπŸ“v0.5.0🌱 Seedling⭐831

LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE prot

mcp-task-orchestratorπŸ“v1.8.0πŸ’€ Dormant⭐25

A Model Context Protocol server that provides task orchestration capabilities for AI assistants

Zen-Ai-PentestπŸ“v3.0.0🌱 Seedling⭐279

πŸ›‘βš”οΈAI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reportingπŸ›‘βš”οΈ

DOXπŸ“main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls