freshcrate

Search results for "assessment"

59 results found
opik 2.0.6 🌳 Mature ⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

pickle-rick-claude v1.44.3 🌱 Seedling ⭐19

🥒 Pickle Rick for Claude Code - autonomous PRD-driven coding loops + relentless code review. Ralph Loop toolkit.

LLM-Agents-Ecosystem-Handbook 0.0.0 🌳 Mature ⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gateway based on OpenAI API standards

ai-legal-claude 0.0.0 🌳 Mature ⭐708

AI Legal Assistant skill for Claude Code. Contract review, risk analysis, NDA generation, compliance auditing, negotiation strategy, and PDF reports - 14 skills, 5 parallel agents. If you want to lear

sdl-mcp v0.10.7 🌿 Growing ⭐121

SDL-MCP (Symbol Delta Ledger MCP Server) is a cards-first context system for coding agents that saves tokens and improves context.

Autonomous-Agents main@2026-04-16 🌿 Growing ⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

Awesome-Context-Engineering 0.0.0 🌳 Mature ⭐3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

AILinkX main@2026-04-21 🌱 Seedling ⭐14

Connect people and manage tasks with AILinkX, your all-in-one digital life operating system built on advanced AI technology.

cass_memory_system v0.2.8 🌿 Growing ⭐319

Procedural memory for AI coding agents: transforms scattered session history into persistent, cross-agent memory so every agent learns from every other

mcp 2026.04.20260414152327 🌿 Growing ⭐8,740

Official MCP Servers for AWS

claude-ads v1.5.1 🌿 Growing ⭐2,207

Comprehensive paid advertising audit & optimization skill for Claude Code. 225+ checks across Google, Meta, YouTube, LinkedIn, TikTok, Microsoft & Apple Search Ads with weighted scoring, parallel agen

Awesome-World-Models main@2026-04-21 🌿 Growing ⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

karpathy-llm-wiki main@2026-04-21 🌱 Seedling ⭐34

The Self-Growing Karpathy LLM Wiki - grown by an AI agent yoyo from Karpathy's founding prompt

awesome-prompts main@2026-04-21 🌿 Growing ⭐7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

LLM-Agent-Paper-daily main@2026-04-21 🌱 Seedling ⭐20

Automatically Update LLM-Agent Papers Daily using GitHub Actions (Update Every 12 Hours)

garmin-givemydata v0.1.10 🌿 Growing ⭐61

It's YOUR data. Take it back. Get your Garmin Connect data into a local SQLite database and AI ready (MCP server)

OmicsClaw main@2026-04-18 🌿 Growing ⭐116

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

security-investigator main@2026-04-18 🌿 Growing ⭐142

Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.

evals v0.1.15 🌿 Growing ⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

maverick-mcp main@2026-04-17 🌿 Growing ⭐479

MaverickMCP - Personal Stock Analysis MCP Server

OpenClawProBench main@2026-04-15 🌿 Growing ⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

paiml-mcp-agent-toolkit v3.14.0 🌿 Growing ⭐148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

deep-research-mcp main@2026-04-13 🌿 Growing ⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

codingbuddy v5.6.3 🌱 Seedling ⭐31

Codingbuddy orchestrates 29 specialized AI agents to deliver code quality comparable to a team of human experts through a PLAN → ACT → EVAL workflow.

cyber-pilot v3.7.0-beta 🌿 Growing ⭐53

Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.

ds_ex main@2026-04-09 🌱 Seedling ⭐17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

mcp-gateway-registry v1.0.18 🌿 Growing ⭐576

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

Aether v1.0.17 🌱 Seedling ⭐8

Artificial Ecology For Thought and Emergent Reasoning. The Colony That Builds With You.

cortex-hub v0.7.0 🌱 Seedling ⭐48

Self-hosted AI Agent Memory + Code Intelligence Platform - one MCP endpoint for persistent memory, AST-aware code search, shared knowledge, and quality enforcement across all your AI coding agents.

agent-actions v0.1.12 🌱 Seedling ⭐4

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

Anthropic-Cybersecurity-Skills v1.2.0 🌱 Seedling ⭐4,262

754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Cop

everything-claude-code v1.10.0 🌱 Seedling ⭐151,139

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

kubernetes-mcp-server v0.0.60 🌱 Seedling ⭐1,422

Model Context Protocol (MCP) server for Kubernetes and OpenShift

vikramaditya main@2026-04-20 🌱 Seedling ⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) - it hunts, it reports. Inspired by the Obsidian Order.

mcp-scan v2.0.0 🌱 Seedling ⭐22

Security scanner for MCP server configurations. Detects secrets, CVEs, permission issues, and exfiltration vectors across 10 AI tool clients.

Pentest-Skill 0.0.0 🌱 Seedling ⭐2

Transform any LLM into an autonomous security testing agent with structured prompts for seven-phase vulnerability hunting.

agentshield v1.4.0 🌱 Seedling ⭐361

AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, ECC plugin, and GitHub App integration. 🛡️

synapse-ai v1.0.0 🌱 Seedling ⭐1

Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM, local or cloud.

ai-runbook master@2026-04-20 🌱 Seedling ⭐2

A dotfiles repo that treats AI agent behavior as infrastructure

watchtower 1.0.2 🌱 Seedling ⭐51

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us

LightAgent v0.5.0 🌱 Seedling ⭐831

LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE prot

mcp-task-orchestrator v1.8.0 💀 Dormant ⭐25

A Model Context Protocol server that provides task orchestration capabilities for AI assistants

Zen-Ai-Pentest v3.0.0 🌱 Seedling ⭐279

🛡⚔️ AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting 🛡⚔️

awesome-agent-benchmarks master@2026-04-21 🌱 Seedling ⭐3

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

polymarket-trader-mcp v1.6.7 🌱 Seedling ⭐1

The most comprehensive MCP server for Polymarket - 48 tools spanning direct trading, market discovery, smart money tracking, copy trading, backtesting, risk management, and portfolio optimization. Wor

skill-evolution main@2026-04-21 🌱 Seedling ⭐2

Enable AI agents to autonomously create, evaluate, and evolve skills across any marketplace without user intervention.

DOX main@2026-04-15 🌱 Seedling ⭐1

Broken RAG For The Broken Souls

EliteAgent main@2026-04-17 🌱 Seedling ⭐1

The ultimate native macOS AI Agent. Blends local MLX SLMs with 3D cognitive Metal rendering and autonomous system integrations.

SYNARA v1.2 🌱 Seedling ⭐1

The limbic layer. Personality, register, and internal state monitoring. ALBEDO lives here - the session instrument that governs tone, emotional signal detection, and the felt layer of every response.

TSUKUYOMI 2.6.0 💀 Dormant ⭐86

TSUKUYOMI is an advanced modular intelligence framework designed for the democratization of Intelligence Analysis via systematic analysis, processing, and reporting across multiple domains. Built on a

ai-dev-guides 0.0.0 💀 Dormant ⭐32

These guides are designed to help teams and individuals leverage AI tools like GitHub Copilot, OpenAI, and Claude to build software projects efficiently and effectively

judge0 v1.13.1 ⚰️ Archived ⭐4,082

Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.

Promptgpt v1.2 ⚰️ Archived ⭐119

PromptGPT is an open-source framework that lets users automatically generate high-quality prompts with no installation, coding, or technical knowledge required. Promptgpt follows industry best