freshcrate — Search

Search results for "eval"

38 results found (TypeScript)

voratiq 📁main@2026-04-21🌿 Growing⭐67

Agent ensembles to design, generate, and select the best code for every task.

agent-orchestration claude-code cli code-generation codex coding-agents evals gemini-cli typescriptby voratiqTypeScript

agenta 📁v0.96.7🌳 Mature⭐4,045

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

agents evaluation llm-as-a-judge llm-evaluation llm-framework llm-monitoring llm-observability llm-platform prompt-engineering typescriptby Agenta-AITypeScript

langwatch 📁python-sdk@v0.21.0🌳 Mature⭐3,206

The platform for LLM evaluations and AI agent testing

ai analytics datasets dspy evaluation gpt llm llm-ops typescriptby langwatchTypeScript

langfuse 📁v3.169.0🏛️ Flagship⭐25,291

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

analytics autogen evaluation langchain large-language-models llama-index llm llm-evaluation prompt-engineering typescriptby langfuseTypeScript

promptfoo 📁code-scan-action-0.1.5🏛️ Flagship⭐20,382

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

ci ci-cd cicd evaluation evaluation-framework llm llm-eval llm-evaluation typescriptby promptfooTypeScript

prism-mcp 📁v9.3.0🌿 Growing⭐128

The Mind Palace for AI Agents — Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

agent-memory ai-agent anti-sycophancy claude-desktop cognitive-architecture google-gemini hebbian-learning llm-tools typescriptby dcostencoTypeScript

mastra 📁@mastra/core@1.24.0🏛️ Flagship⭐23,202

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

agents ai chatbots evals javascript llm mcp nextjs typescriptby mastra-aiTypeScript

codingbuddy 📁v5.6.3🌱 Seedling⭐31

Codingbuddy orchestrates 29 specialized AI agents to deliver code quality comparable to a team of human experts through a PLAN → ACT → EVAL workflow.

ai-agents ai-coding ai-coding-assistant ai-rules claude-code code-quality coding-assistant cursor model-context-protocol typescriptby JeremyDev87TypeScript

latitude-llm 📁claude-code-telemetry-0.0.6🌳 Mature⭐3,957

Latitude is the open-source agent engineering platform

typescriptby latitude-devTypeScript

neurolink 📁v9.56.1🌿 Growing⭐83

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.

agents ai ai-development ai-platform automation developer-tools llm local-first typescriptby juspayTypeScript

strudel-mcp-server 📁v2.0.0🌿 Growing⭐193

A Model Context Protocol (MCP) server that gives Claude direct control over Strudel.cc for AI-assisted music generation and live coding.

typescriptby williamzujkowskiTypeScript

OmniRoute 📁v3.6.9🌳 Mature⭐3,250

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

typescriptby diegosouzapwTypeScript

node9-proxy 📁v1.11.3🌿 Growing⭐118

The Execution Security Layer for the Agentic Era. Providing deterministic "Sudo" governance and audit logs for autonomous AI agents.

ai-safety ai-security claude-code gemini gemini-cli llm llm-agent mcp-server typescriptby node9-aiTypeScript

codebase-context 📁v2.3.0🌱 Seedling⭐43

Generate a map of your codebaseto help AI Agents understand your architecture, coding conventions and patterns. Discoverable with Semantic Search

ai-agents ai-coding claude code-intelligence context-engineering copilot cursor developer-tools model-context-protocol typescriptby PatrickSysTypeScript

vobase 📁create-vobase@0.6.2🌱 Seedling⭐44

The app framework built for AI coding agents. Own every line. Your AI already knows how to build on it.

better-auth bun claude drizzle-orm flyio honojs mcp rag typescriptby vobaseTypeScript

panguard-ai 📁v1.4.19🌱 Seedling⭐38

Open-source security platform for AI agents -- audits skills before install, monitors 24/7, shares threat intelligence across all users. | AI Agent 開源安全平台 -- 安裝前審計 skill、24/7 即時監控、社群共享威脅情報。

ai-agent ai-security cybersecurity llm-security mcp open-source prompt-injection sigma-rules typescriptby panguard-aiTypeScript

sentry-mcp 📁0.32.0🌳 Mature⭐658

An MCP server for interacting with Sentry via LLMs.

mcp-server tag-production typescriptby getsentryTypeScript

voltagent 📁@voltagent/server-elysia@2.0.7🏛️ Flagship⭐8,380

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

agents ai ai-agents ai-agents-framework aiagentframework chatbots chatgpt framework typescriptby VoltAgentTypeScript

agent-skills-standard 📁php-v1.3.2🌿 Growing⭐428

A collection of Agent Skills Standard and Best Practice for Programming Languages, Frameworks that help our AI Agent follow best practies on frameworks and programming laguages

agent-agentic-ai android angular best-practices coding-standards cursor-rules flutter typescriptby HoangNguyen0403TypeScript

camofox-browser 📁v2.1.1🌿 Growing⭐80

Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support

ai-agent anti-detection automation bot-detection browser-automation browser-server camofox camoufox typescriptby redf0x1TypeScript

MiniSearch 📁main@2026-04-20🌿 Growing⭐558

Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

ai ai-search-engine artificial-intelligence generative-ai gpu-accelerated information-retrieval llm llm-inference rag typescriptby felladrinTypeScript

clawtrace 📁main@2026-04-16🌱 Seedling⭐28

Make your OpenClaw agents better, cheaper, and faster.

agent-observability agent-telemetry ai-agent ai-agent-observability ai-evaluation ai-observability automomous-agents claude-harness typescriptby epsilla-cloudTypeScript

magi-markdown 📁main@2026-04-11🌿 Growing⭐552

MAGI: Markdown for Agent Guidance & Instruction - A next-generation markdown extension designed specifically for AI systems. MAGI enhances standard markdown with structured metadata, embedded AI instr

ai ai-agents ai-native document-retrieval embeddings graph-knowledge-distillation kag llm typescriptby sno-aiTypeScript

moss 📁c-sdk-v0.9.0🌿 Growing⭐316

Official Repo of Moss

ai-agents ai-infra hybrid-search rag real-time retrieval semantic-search typescript voice-aiby usemossTypeScript

karpathy-llm-wiki 📁main@2026-04-21🌱 Seedling⭐43

The Self-Growing Karpathy LLM Wiki — grown by an AI agent yoyo from Karpathy's founding prompt

ai-agent karpathy knowledge-base llm typescript wikiby yologdevTypeScript

deep-code-reasoning-mcp 📁main@2026-04-20🌿 Growing⭐105

A Model Context Protocol (MCP) server that provides advanced code analysis and reasoning capabilities powered by Google's Gemini AI

ai ai-tools claude code-analysis code-intelligence code-reasoning debugging developer-tools typescriptby evalopsTypeScript

agentshield 📁v1.4.0🌿 Growing⭐522

AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, ECC plugin, and GitHub App integration. 🛡️

ai-agent anthropic claude-code hackathon mcp opus security typescriptby affaan-mTypeScript

Cogitator-AI 📁main@2026-04-21🌱 Seedling⭐36

🤖 Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.

agent agentic-ai agentic-framework agentic-workflow ai ai-framework automation gemini typescriptby cogitator-aiTypeScript

polymarket-trader-mcp 📁v1.6.7🌱 Seedling⭐5

The most comprehensive MCP server for Polymarket — 48 tools spanning direct trading, market discovery, smart money tracking, copy trading, backtesting, risk management, and portfolio optimization. Wor

ai-agent ai-trading anthropic blockchain claude copy-trading defi mcp model-context-protocol typescriptby demwickTypeScript

mayros 📁v0.3.2🌱 Seedling⭐10

Production-ready AI agent framework — semantic memory, multi-agent mesh, MCP server, intelligent routing, governance, and 67+ platform integrations.

agent-orchestration ai-agents ai-framework chatbot claude cli developer-tools discord-bot typescriptby ApiliumCodeTypeScript

elsium-ai 📁elsium-ai@0.10.0🌱 Seedling⭐8

Production-grade TypeScript AI runtime focused on reliability, governance, and reproducible LLM systems. Multi-provider gateway, agents, RAG, workflows, policy engine, audit trails, and deterministic

agent-framework ai-compliance ai-framework ai-governance ai-infrastructure ai-production ai-reliability ai-runtime typescriptby elsium-aiTypeScript

kernel 📁v3.97.0🌱 Seedling⭐12

kbot — the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

ai-agent anthropic cli coding-agent cybersecurity defi kbot llm typescriptby isaacsightTypeScript

cf-browser 📁v2.0.0🌱 Seedling⭐5

Open-source Cloudflare Browser Rendering proxy — 10 MCP tools for Claude Code (content, screenshot, PDF, markdown, scrape, JSON AI extraction, links, a11y, crawl)

browser-rendering claude-code cloudflare-workers markdown mcp-server pdf python-sdk screenshot typescriptby claude-worldTypeScript

@poofnew/vibe-check 📁0.1.1🌱 Seedling⭐5

AI agent evaluation framework for Claude and beyond

agent ai anthropic claude evaluation llm npm testingby Poof LabsTypeScript

agent-regression-testing 📁0.1.14🌱 Seedling

A standalone library for AI agent regression testing using LLM-as-judge evaluation

agent ai evaluation llm npm regression testingby GitHub ActionsTypeScript

CodeRAG 📁main@2026-04-21🌱 Seedling⭐1

Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.

ai ai-tools code-analysis embeddings execution-based-evaluation game-development game-programming game-source rag typescriptby Eyram233TypeScript

harness 📁master@2026-04-21🌱 Seedling⭐1

Define and control AI agents in markdown with full prompt transparency, persistent memory, and integrated tools via the Claude Agent SDK.

ai claude claude-code claude-skills code-repository evaluation-framework gemini git llm-agent typescriptby heba-ramdanTypeScript

chat-flow 📁0.0.0⚰️ Archived⭐687

ChatFlow - AI-based chat flow framework, personalize your ChatGPT workflows and build the road to automation。ChatFlow —— 打造个性化 ChatGPT 流程，构建自动化之路

typescriptby prompt-engineeringTypeScript