freshcrate

Search results for "eval"

Clear filters
38 results found (TypeScript)
voratiq๐Ÿ“main@2026-04-21๐ŸŒฟ Growingโญ67

Agent ensembles to design, generate, and select the best code for every task.

agenta๐Ÿ“v0.96.7๐ŸŒณ Matureโญ4,045

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

langwatch๐Ÿ“python-sdk@v0.21.0๐ŸŒณ Matureโญ3,206

The platform for LLM evaluations and AI agent testing

langfuse๐Ÿ“v3.169.0๐Ÿ›๏ธ Flagshipโญ25,291

๐Ÿชข Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. ๐ŸŠYC W23

promptfoo๐Ÿ“code-scan-action-0.1.5๐Ÿ›๏ธ Flagshipโญ20,382

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

prism-mcp๐Ÿ“v9.3.0๐ŸŒฟ Growingโญ128

The Mind Palace for AI Agents โ€” Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

mastra๐Ÿ“@mastra/core@1.24.0๐Ÿ›๏ธ Flagshipโญ23,202

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

codingbuddy๐Ÿ“v5.6.3๐ŸŒฑ Seedlingโญ31

Codingbuddy orchestrates 29 specialized AI agents to deliver code quality comparable to a team of human experts through a PLAN โ†’ ACT โ†’ EVAL workflow.

latitude-llm๐Ÿ“claude-code-telemetry-0.0.6๐ŸŒณ Matureโญ3,957

Latitude is the open-source agent engineering platform

neurolink๐Ÿ“v9.56.1๐ŸŒฟ Growingโญ83

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.

strudel-mcp-server๐Ÿ“v2.0.0๐ŸŒฟ Growingโญ193

A Model Context Protocol (MCP) server that gives Claude direct control over Strudel.cc for AI-assisted music generation and live coding.

OmniRoute๐Ÿ“v3.6.9๐ŸŒณ Matureโญ3,250

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

node9-proxy๐Ÿ“v1.11.3๐ŸŒฟ Growingโญ118

The Execution Security Layer for the Agentic Era. Providing deterministic "Sudo" governance and audit logs for autonomous AI agents.

codebase-context๐Ÿ“v2.3.0๐ŸŒฑ Seedlingโญ43

Generate a map of your codebaseto help AI Agents understand your architecture, coding conventions and patterns. Discoverable with Semantic Search

vobase๐Ÿ“create-vobase@0.6.2๐ŸŒฑ Seedlingโญ44

The app framework built for AI coding agents. Own every line. Your AI already knows how to build on it.

panguard-ai๐Ÿ“v1.4.19๐ŸŒฑ Seedlingโญ38

Open-source security platform for AI agents -- audits skills before install, monitors 24/7, shares threat intelligence across all users. | AI Agent ้–‹ๆบๅฎ‰ๅ…จๅนณๅฐ -- ๅฎ‰่ฃๅ‰ๅฏฉ่จˆ skillใ€24/7 ๅณๆ™‚็›ฃๆŽงใ€็คพ็พคๅ…ฑไบซๅจ่„…ๆƒ…ๅ ฑใ€‚

sentry-mcp๐Ÿ“0.32.0๐ŸŒณ Matureโญ658

An MCP server for interacting with Sentry via LLMs.

voltagent๐Ÿ“@voltagent/server-elysia@2.0.7๐Ÿ›๏ธ Flagshipโญ8,380

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

agent-skills-standard๐Ÿ“php-v1.3.2๐ŸŒฟ Growingโญ428

A collection of Agent Skills Standard and Best Practice for Programming Languages, Frameworks that help our AI Agent follow best practies on frameworks and programming laguages

camofox-browser๐Ÿ“v2.1.1๐ŸŒฟ Growingโญ80

Anti-detection browser server for AI agents โ€” REST API wrapping Camoufox engine with OpenClaw plugin support

MiniSearch๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ558

Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

magi-markdown๐Ÿ“main@2026-04-11๐ŸŒฟ Growingโญ552

MAGI: Markdown for Agent Guidance & Instruction - A next-generation markdown extension designed specifically for AI systems. MAGI enhances standard markdown with structured metadata, embedded AI instr

karpathy-llm-wiki๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ43

The Self-Growing Karpathy LLM Wiki โ€” grown by an AI agent yoyo from Karpathy's founding prompt

deep-code-reasoning-mcp๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ105

A Model Context Protocol (MCP) server that provides advanced code analysis and reasoning capabilities powered by Google's Gemini AI

agentshield๐Ÿ“v1.4.0๐ŸŒฟ Growingโญ522

AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, ECC plugin, and GitHub App integration. ๐Ÿ›ก๏ธ

Cogitator-AI๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ36

๐Ÿค– Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.

polymarket-trader-mcp๐Ÿ“v1.6.7๐ŸŒฑ Seedlingโญ5

The most comprehensive MCP server for Polymarket โ€” 48 tools spanning direct trading, market discovery, smart money tracking, copy trading, backtesting, risk management, and portfolio optimization. Wor

mayros๐Ÿ“v0.3.2๐ŸŒฑ Seedlingโญ10

Production-ready AI agent framework โ€” semantic memory, multi-agent mesh, MCP server, intelligent routing, governance, and 67+ platform integrations.

elsium-ai๐Ÿ“elsium-ai@0.10.0๐ŸŒฑ Seedlingโญ8

Production-grade TypeScript AI runtime focused on reliability, governance, and reproducible LLM systems. Multi-provider gateway, agents, RAG, workflows, policy engine, audit trails, and deterministic

kernel๐Ÿ“v3.97.0๐ŸŒฑ Seedlingโญ12

kbot โ€” the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

cf-browser๐Ÿ“v2.0.0๐ŸŒฑ Seedlingโญ5

Open-source Cloudflare Browser Rendering proxy โ€” 10 MCP tools for Claude Code (content, screenshot, PDF, markdown, scrape, JSON AI extraction, links, a11y, crawl)

@poofnew/vibe-check๐Ÿ“0.1.1๐ŸŒฑ Seedlingโญ5

AI agent evaluation framework for Claude and beyond

agent-regression-testing๐Ÿ“0.1.14๐ŸŒฑ Seedling

A standalone library for AI agent regression testing using LLM-as-judge evaluation

CodeRAG๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ1

Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.

harness๐Ÿ“master@2026-04-21๐ŸŒฑ Seedlingโญ1

Define and control AI agents in markdown with full prompt transparency, persistent memory, and integrated tools via the Claude Agent SDK.

chat-flow๐Ÿ“0.0.0โšฐ๏ธ Archivedโญ687

ChatFlow - AI-based chat flow framework, personalize your ChatGPT workflows and build the road to automationใ€‚ChatFlow โ€”โ€” ๆ‰“้€ ไธชๆ€งๅŒ– ChatGPT ๆต็จ‹๏ผŒๆž„ๅปบ่‡ชๅŠจๅŒ–ไน‹่ทฏ