freshcrate — Search

Search results for "assessment"

26 results found (Python)

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability pythonby comet-mlPython

LLM-Agents-Ecosystem-Handbook 📁0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops pythonby oxbshwPython

ai-legal-claude 📁0.0.0🌳 Mature⭐708

AI Legal Assistant skill for Claude Code. Contract review, risk analysis, NDA generation, compliance auditing, negotiation strategy, and PDF reports — 14 skills, 5 parallel agents. If you want to lear

pythonby zubair-trabzadaPython

mcp 📁2026.04.20260414152327🌿 Growing⭐8,740

Official MCP Servers for AWS

aws mcp mcp-client mcp-clients mcp-host mcp-server mcp-servers mcp-tools pythonby awslabsPython

claude-ads 📁v1.5.1🌿 Growing⭐2,207

Comprehensive paid advertising audit & optimization skill for Claude Code. 225+ checks across Google, Meta, YouTube, LinkedIn, TikTok, Microsoft & Apple Search Ads with weighted scoring, parallel agen

ai ai-marketing claude-code claude-code-skill marketing-automation open-source pythonby AgriciDanielPython

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

llm llm-agent pythonby Lyz103Python

garmin-givemydata 📁v0.1.10🌿 Growing⭐61

It's YOUR data. Take it back. Get your Garmin Connect data into a local SQLite database and AI ready (MCP server)

pythonby nrvimPython

OmicsClaw 📁main@2026-04-18🌿 Growing⭐116

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

bioinformatics knowledge-graph llm-agent multi-agents multi-omics python single-cell spatial-transcriptomicsby TianGzlabPython

security-investigator 📁main@2026-04-18🌿 Growing⭐142

Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.

pythonby SCStelzPython

evals 📁v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

agentic agentic-ai ai evaluation machine-learning python strands-agentsby strands-agentsPython

maverick-mcp 📁main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

anthropic artificial-intelligence claude equities fastmcp finance financial-analysis fintech pythonby wshobsonPython

OpenClawProBench 📁main@2026-04-15🌿 Growing⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

agent benchmark evaluation harness leaderboard llm openclaw pythonby suyoumoPython

deep-research-mcp 📁main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

pythonby pminerviniPython

cyber-pilot 📁v3.7.0-beta🌿 Growing⭐53

Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.

agents ai architecture code-generation code-review code-validation codegen codegeneration pythonby cyberfabricPython

mcp-gateway-registry 📁v1.0.18🌿 Growing⭐576

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

a2a agentic-ai agents ans documentdb ecs ecs-fargate entra-id pythonby agentic-communityPython

agent-actions 📁v0.1.12🌱 Seedling⭐4

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

ai-agents anthropic context-engineering-framework llm orchestration prompt-engineering prompt-engineering-tool python yamlby MuizzkolapoPython

Anthropic-Cybersecurity-Skills 📁v1.2.0🌱 Seedling⭐4,262

754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Cop

ai-agents claude-code cloud-security cybersecurity devsecops ethical-hacking incident-response infosec pythonby mukul975Python

mcp-server-for-oscal 📁v0.4.0🌱 Seedling⭐33

OSCAL tools for AI agents

compliance-as-code compliance-automation continuous-compliance mcp-server oscal python security-as-code security-assurance strands-agentsby awslabsPython

vikramaditya 📁main@2026-04-20🌱 Seedling⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.

ai-security autonomous-agent bash bug-bounty penetration-testing python recon securityby venkatasPython

synapse-ai 📁v1.0.0🌱 Seedling⭐1

Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM — local or cloud.

custom-agents custom-tools directed-acyclic-graph llm-agent mcp-client mcp-server mcp-servers mcp-tools pythonby naveenraj-17Python

watchtower 📁1.0.2🌱 Seedling⭐51

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us

ai-cybersecurity automation-testing claude langgraph openrouter pentest pentesting python red-teamby fzn0xPython

LightAgent 📁v0.5.0🌱 Seedling⭐831

LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE prot

pythonby wanxingaiPython

mcp-task-orchestrator 📁v1.8.0💤 Dormant⭐25

A Model Context Protocol server that provides task orchestration capabilities for AI assistants

ai-agents ai-automation ai-framework ai-orchestration ai-tools anthropic claude claude-desktop pythonby EchoingVesperPython

Zen-Ai-Pentest 📁v3.0.0🌱 Seedling⭐279

🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️

ai automation compliance cybersecurity ethical-hacking framework penetration-testing pentesting pythonby SHAdd0WTAkaPython

VectorDBBench 📁v1.0.20🌱 Seedling⭐1,068

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordbby zilliztechPython

DOX 📁main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls

hallucination llm python rag retrieval-augmented-generation vibecodingby AmMoPyPython