freshcrate — Search

Search results for "assessment"

59 results found

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability python by comet-ml Python

AgentWard 📁main@2026-04-20🌱 Seedling⭐30

AgentWard – Built for all, hardened for OpenClaw.

agent-security defense-in-depth llm-agent llm-security openclaw openclaw-plugin openclaw-security prompt-injection-defense typescript by FIND-Lab TypeScript

pickle-rick-claude 📁v1.44.3🌱 Seedling⭐19

🥒 Pickle Rick for Claude Code — autonomous PRD-driven coding loops + relentless code review. Ralph Loop toolkit.

agentic ai-coding anthropic autonomous-agent claude claude-code code-review iterative-development javascript by gregorydickson JavaScript

LLM-Agents-Ecosystem-Handbook 📁0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops python by oxbshw Python

guidance-for-multi-provider-generative-ai-gateway-on-aws 📁0.0.0🌿 Growing⭐213

This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gateway based on OpenAI API standards

hcl by aws-solutions-library-samples HCL

ai-legal-claude 📁0.0.0🌳 Mature⭐708

AI Legal Assistant skill for Claude Code. Contract review, risk analysis, NDA generation, compliance auditing, negotiation strategy, and PDF reports — 14 skills, 5 parallel agents. If you want to lear

python by zubair-trabzada Python

sdl-mcp 📁v0.10.7🌿 Growing⭐121

SDL-MCP (Symbol Delta Ledger MCP Server) is a cards-first context system for coding agents that saves tokens and improves context.

agent-context agent-tools agentic-coding agentic-workflow agents ai-agents code-analysis code-context typescript by GlitterKill TypeScript

GUAN-Framework 📁0.0.0🌱 Seedling⭐25

[ARCHIVED] Migrated to the MIXIA-Framework repo

ai-framework ai-memory ai-persona claude-code codex-cli cognitive-copilot context-engineering decision-support by whoisguan

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagents by tmgthb

Awesome-Context-Engineering 📁0.0.0🌳 Mature⭐3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

agent agentic-ai agi awesome-list cognitive-science context-engineering llm rag by Meirtz

AILinkX 📁main@2026-04-21🌱 Seedling⭐14

🌐 Connect people and manage tasks with AILinkX, your all-in-one digital life operating system built on advanced AI technology.

ai-agent answers anthropic assessment auto-weixin autogen digital-life-os generative-ai by sh4rck

cass_memory_system 📁v0.2.8🌿 Growing⭐319

Procedural memory for AI coding agents: transforms scattered session history into persistent, cross-agent memory so every agent learns from every other

ai-agents bun developer-tools memory typescript by Dicklesworthstone TypeScript

mcp 📁2026.04.20260414152327🌿 Growing⭐8,740

Official MCP Servers for AWS

aws mcp mcp-client mcp-clients mcp-host mcp-server mcp-servers mcp-tools python by awslabs Python

claude-ads 📁v1.5.1🌿 Growing⭐2,207

Comprehensive paid advertising audit & optimization skill for Claude Code. 225+ checks across Google, Meta, YouTube, LinkedIn, TikTok, Microsoft & Apple Search Ads with weighted scoring, parallel agen

ai ai-marketing claude-code claude-code-skill marketing-automation open-source python by AgriciDaniel Python

Awesome-World-Models 📁main@2026-04-21🌿 Growing⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

artificial-intelligence autonomous-driving awesome deep-learning embodied-ai future-prediction video-prediction world-model by leofan90

karpathy-llm-wiki 📁main@2026-04-21🌱 Seedling⭐34

The Self-Growing Karpathy LLM Wiki — grown by an AI agent yoyo from Karpathy's founding prompt

ai-agent karpathy knowledge-base llm typescript wiki by yologdev TypeScript

awesome-prompts 📁main@2026-04-21🌿 Growing⭐7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

awesome awesome-list chatgpt gpt4 gpts gptstore papers prompt prompt-engineering by ai-boost

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

llm llm-agent python by Lyz103 Python

garmin-givemydata 📁v0.1.10🌿 Growing⭐61

It's YOUR data. Take it back. Get your Garmin Connect data into a local SQLite database and AI ready (MCP server)

python by nrvim Python

OmicsClaw 📁main@2026-04-18🌿 Growing⭐116

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

bioinformatics knowledge-graph llm-agent multi-agents multi-omics python single-cell spatial-transcriptomics by TianGzlab Python

security-investigator 📁main@2026-04-18🌿 Growing⭐142

Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.

python by SCStelz Python

evals 📁v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

agentic agentic-ai ai evaluation machine-learning python strands-agents by strands-agents Python

maverick-mcp 📁main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

anthropic artificial-intelligence claude equities fastmcp finance financial-analysis fintech python by wshobson Python

OpenClawProBench 📁main@2026-04-15🌿 Growing⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

agent benchmark evaluation harness leaderboard llm openclaw python by suyoumo Python

paiml-mcp-agent-toolkit 📁v3.14.0🌿 Growing⭐148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

agentic c deno kotlin mcp mcp-server paiml paiml-active-tool rust by paiml Rust

deep-research-mcp 📁main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

python by pminervini Python

codingbuddy 📁v5.6.3🌱 Seedling⭐31

Codingbuddy orchestrates 29 specialized AI agents to deliver code quality comparable to a team of human experts through a PLAN → ACT → EVAL workflow.

ai-agents ai-coding ai-coding-assistant ai-rules claude-code code-quality coding-assistant cursor model-context-protocol typescript by JeremyDev87 TypeScript

cyber-pilot 📁v3.7.0-beta🌿 Growing⭐53

Cyber Pilot is a traceable delivery system for requirements, design, plans, and code.

agents ai architecture code-generation code-review code-validation codegen codegeneration python by cyberfabric Python

ds_ex 📁main@2026-04-09🌱 Seedling⭐17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

ai ai-framework automated-optimization beam declarative-programming dspy elixir erlang-vm by nshkrdotcom Elixir

mcp-gateway-registry 📁v1.0.18🌿 Growing⭐576

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

a2a agentic-ai agents ans documentdb ecs ecs-fargate entra-id python by agentic-community Python

Aether 📁v1.0.17🌱 Seedling⭐8

Artificial Ecology For Thought and Emergent Reasoning. The Colony That Builds With You.

aether agentic-ai agents ants automation claude claude-code colony go prompt-engineering by calcosmic Go

cortex-hub 📁v0.7.0🌱 Seedling⭐48

Self-hosted AI Agent Memory + Code Intelligence Platform — one MCP endpoint for persistent memory, AST-aware code search, shared knowledge, and quality enforcement across all your AI coding agents.

ai-agents claude-code code-intelligence cursor developer-tools docker knowledge-base mcp typescript by lktiep TypeScript

agent-actions 📁v0.1.12🌱 Seedling⭐4

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

ai-agents anthropic context-engineering-framework llm orchestration prompt-engineering prompt-engineering-tool python yaml by Muizzkolapo Python

Anthropic-Cybersecurity-Skills 📁v1.2.0🌱 Seedling⭐4,262

754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Cop

ai-agents claude-code cloud-security cybersecurity devsecops ethical-hacking incident-response infosec python by mukul975 Python

everything-claude-code 📁v1.10.0🌱 Seedling⭐151,139

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

ai-agents anthropic claude claude-code developer-tools javascript llm mcp productivity by affaan-m JavaScript

mcp-server-for-oscal 📁v0.4.0🌱 Seedling⭐33

OSCAL tools for AI agents

compliance-as-code compliance-automation continuous-compliance mcp-server oscal python security-as-code security-assurance strands-agents by awslabs Python

kubernetes-mcp-server 📁v0.0.60🌱 Seedling⭐1,422

Model Context Protocol (MCP) server for Kubernetes and OpenShift

containers context go kubernetes kubernetes-mcp mcp model modelcontextprotocol openshift by containers Go

vikramaditya 📁main@2026-04-20🌱 Seedling⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.

ai-security autonomous-agent bash bug-bounty penetration-testing python recon security by venkatas Python

mcp-scan 📁v2.0.0🌱 Seedling⭐22

Security scanner for MCP server configurations. Detects secrets, CVEs, permission issues, and exfiltration vectors across 10 AI tool clients.

ai-security ai-tools claude cli cursor devsecops devtools github-action typescript by rodolfboctor TypeScript

Pentest-Skill 📁0.0.0🌱 Seedling⭐2

Transform any LLM into an autonomous security testing agent with structured prompts for seven-phase vulnerability hunting.

ai-agent attack-surface autonomous-agent bug-bounty claude-code codex cybersecurity ethical-hacking by NeaByteLab

agentshield 📁v1.4.0🌱 Seedling⭐361

AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, ECC plugin, and GitHub App integration. 🛡️

ai-agent anthropic claude-code hackathon mcp opus security typescript by affaan-m TypeScript

synapse-ai 📁v1.0.0🌱 Seedling⭐1

Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM — local or cloud.

custom-agents custom-tools directed-acyclic-graph llm-agent mcp-client mcp-server mcp-servers mcp-tools python by naveenraj-17 Python

ai-runbook 📁master@2026-04-20🌱 Seedling⭐2

A dotfiles repo that treats AI agent behavior as infrastructure

ai-prompts ai-skills ai-tools artificial-intelligence dotfiles productivity-tools prompt-engineering shell skills-md by pbierkortte Shell

watchtower 📁1.0.2🌱 Seedling⭐51

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us

ai-cybersecurity automation-testing claude langgraph openrouter pentest pentesting python red-team by fzn0x Python

LightAgent 📁v0.5.0🌱 Seedling⭐831

LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE prot

python by wanxingai Python

mcp-task-orchestrator 📁v1.8.0💤 Dormant⭐25

A Model Context Protocol server that provides task orchestration capabilities for AI assistants

ai-agents ai-automation ai-framework ai-orchestration ai-tools anthropic claude claude-desktop python by EchoingVesper Python

Zen-Ai-Pentest 📁v3.0.0🌱 Seedling⭐279

🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️

ai automation compliance cybersecurity ethical-hacking framework penetration-testing pentesting python by SHAdd0WTAka Python

awesome-agent-benchmarks 📁master@2026-04-21🌱 Seedling⭐3

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

agent-based-modeling agent-benchmark agentic agentic-ai ai ai-agent ai-models awesome by axxafo

VectorDBBench 📁v1.0.20🌱 Seedling⭐1,068

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordb by zilliztech Python

ai-agents 📁v0.3.0🌱 Seedling⭐20

Multi-agent system for software development

agentic-ai ai-agents ai-assistant anthropic-claude automation ci-cd claude-code code-generation markdown by rjmurillo Markdown

polymarket-trader-mcp 📁v1.6.7🌱 Seedling⭐1

The most comprehensive MCP server for Polymarket — 48 tools spanning direct trading, market discovery, smart money tracking, copy trading, backtesting, risk management, and portfolio optimization. Wor

ai-agent ai-trading anthropic blockchain claude copy-trading defi mcp model-context-protocol typescript by demwick TypeScript

skill-evolution 📁main@2026-04-21🌱 Seedling⭐2

Enable AI agents to autonomously create, evaluate, and evolve skills across any marketplace without user intervention.

agent agent-memory agents ai-assessment ai-security claude claude-skills competence rag by kledidoda

DOX 📁main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls

hallucination llm python rag retrieval-augmented-generation vibecoding by AmMoPy Python

EliteAgent 📁main@2026-04-17🌱 Seedling⭐1

The ultimate native macOS AI Agent. Blends local MLX SLMs with 3D cognitive Metal rendering and autonomous system integrations.

apple-silicon autonomous-agents hybrid-intelligence llm-agent local-llm macos metal mlx swift by trgysvc Swift

SYNARA 📁v1.2🌱 Seedling⭐1

The limbic layer. Personality, register, and internal state monitoring. ALBEDO lives here — the session instrument that governs tone, emotional signal detection, and the felt layer of every response.

agi agi-archiitecture agi-architect ai-architect ai-assessment ai-auditing ai-brain ai-framework by AionSystem

TSUKUYOMI 📁2.6.0💤 Dormant⭐86

TSUKUYOMI is an advanced modular intelligence framework designed for the democratization of Intelligence Analysis via systematic analysis, processing, and reporting across multiple domains. Built on a

ai ai-agent ai-framework js json osint osint-tool by savannah-i-g

ai-dev-guides 📁0.0.0💤 Dormant⭐32

These guides are designed to help teams and individuals leverage AI tools like GitHub Copilot, OpenAI, and Claude to build software projects efficiently and effectively

ai-development ai-driven ai-framework development-workflow project-management project-template prompt-engineering prompt-management by betmoar

judge0 📁v1.13.1⚰️ Archived⭐4,082

Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.

ai-agent-tools ai-agents ai-tools code-execution code-executor code-runner competitive-programming html online-compiler by judge0 HTML

Promptgpt 📁v1.2⚰️ Archived⭐119

PromptGPT is an open-source framework that enables users to automatically generate high-quality prompts with no installation, coding, or technical knowledge necessary. Promptgpt follows industry best

by howard9192