freshcrate — Search

Search results for "evaluation"

24 results found (TypeScript)

agenta 📁v0.96.7🌳 Mature⭐4,011

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

agents evaluation llm-as-a-judge llm-evaluation llm-framework llm-monitoring llm-observability llm-platform prompt-engineering typescript · by Agenta-AI · TypeScript

openclaw-engram 📁v9.3.142🌿 Growing⭐54

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction, plain markdown storage, hybrid search via QMD. Gives agents persistent long-term memory across conversations.

ai-agent ai-memory conversational-ai engram knowledge-graph llm local-first long-term-memory typescript · by joshuaswarren · TypeScript

agentmemory 📁v0.9.1🌳 Mature⭐738

Persistent memory for AI coding agents

typescript · by rohitg00 · TypeScript

AgentWard 📁main@2026-04-20🌱 Seedling⭐30

AgentWard – Built for all, hardened for OpenClaw.

agent-security defense-in-depth llm-agent llm-security openclaw openclaw-plugin openclaw-security prompt-injection-defense typescript · by FIND-Lab · TypeScript

neurolink 📁v9.56.0🌿 Growing⭐121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.

agents ai ai-development ai-platform automation developer-tools llm local-first typescript · by juspay · TypeScript

piclaw 📁v1.8.3🌿 Growing⭐467

I'm going to build my own OpenClaw, with blackjack... and bun!

adaptive-cards ai-agent bun coding-agent docker llm pi-agent self-hosted typescript · by rcarmo · TypeScript

langfuse 📁v3.169.0🌿 Growing⭐24,578

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

analytics autogen evaluation langchain large-language-models llama-index llm llm-evaluation prompt-engineering typescript · by langfuse · TypeScript

promptfoo 📁code-scan-action-0.1.5🌿 Growing⭐19,943

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

ci ci-cd cicd evaluation evaluation-framework llm llm-eval llm-evaluation typescript · by promptfoo · TypeScript

langwatch 📁skills@v0.3.0🌿 Growing⭐3,193

The platform for LLM evaluations and AI agent testing

ai analytics datasets dspy evaluation gpt llm llm-ops typescript · by langwatch · TypeScript

latitude-llm 📁claude-code-telemetry-0.0.5🌿 Growing⭐3,955

Latitude is the open-source agent engineering platform

typescript · by latitude-dev · TypeScript

prism-mcp 📁v9.3.0🌿 Growing⭐116

The Mind Palace for AI Agents — Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

agent-memory ai-agent anti-sycophancy claude-desktop cognitive-architecture google-gemini hebbian-learning llm-tools typescript · by dcostenco · TypeScript

panguard-ai 📁v1.4.19🌱 Seedling⭐37

Open-source security platform for AI agents -- audits skills before install, monitors 24/7, shares threat intelligence across all users.

ai-agent ai-security cybersecurity llm-security mcp open-source prompt-injection sigma-rules typescript · by panguard-ai · TypeScript

mission-control 📁v2.5.0🌿 Growing⭐1,853

The world's first Autonomous Product Engine (APE): AI agents research your market, generate features, and ship code as PRs. Convoy mode, crash recovery, cost tracking, 80+ API endpoints. Self-hosted v

aiagent automation openclaw typescript · by crshdn · TypeScript

karpathy-llm-wiki 📁main@2026-04-21🌱 Seedling⭐34

The Self-Growing Karpathy LLM Wiki — grown by an AI agent yoyo from Karpathy's founding prompt

ai-agent karpathy knowledge-base llm typescript wiki · by yologdev · TypeScript

Cogitator-AI 📁main@2026-04-21🌱 Seedling⭐35

🤖 Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.

agent agentic-ai agentic-framework agentic-workflow ai ai-framework automation gemini typescript · by cogitator-ai · TypeScript

vexa 📁v0.10.2🌿 Growing⭐1,862

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

google-meet meeting-assistant meeting-minutes meeting-notes ms-teams ms-teams-app notetaker python zoom · by Vexa-ai · TypeScript

voltagent 📁@voltagent/server-elysia@2.0.7🌿 Growing⭐7,851

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

agents ai ai-agents ai-agents-framework aiagentframework chatbots chatgpt framework typescript · by VoltAgent · TypeScript

clawtrace 📁main@2026-04-16🌱 Seedling⭐10

Make your OpenClaw agents better, cheaper, and faster.

agent-observability agent-telemetry ai-agent ai-agent-observability ai-evaluation ai-observability automomous-agents claude-harness typescript · by epsilla-cloud · TypeScript

mastra 📁@mastra/core@1.24.0🌱 Seedling⭐22,899

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

agents ai chatbots evals javascript llm mcp nextjs typescript · by mastra-ai · TypeScript

kernel 📁v3.97.0🌱 Seedling⭐12

kbot — the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

ai-agent anthropic cli coding-agent cybersecurity defi kbot llm typescript · by isaacsight · TypeScript

bisheng 📁v2.3.0🌱 Seedling⭐11,293

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF

agent ai chatbot enterprise finetune genai gpt langchian typescript · by dataelement · TypeScript

CodeRAG 📁main@2026-04-21🌱 Seedling⭐1

Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.

ai ai-tools code-analysis embeddings execution-based-evaluation game-development game-programming game-source rag typescript · by Eyram233 · TypeScript

harness 📁master@2026-04-21🌱 Seedling⭐1

Define and control AI agents in markdown with full prompt transparency, persistent memory, and integrated tools via the Claude Agent SDK.

ai claude claude-code claude-skills code-repository evaluation-framework gemini git llm-agent typescript · by heba-ramdan · TypeScript

Neuroverseos-governance 📁v0.3.0🌱 Seedling⭐1

Deterministic governance engine for AI agents. Enforce rules defined in .md governance files across AI systems.

agent-framework agent-guardrails agent-harness ai ai-agents ai-governance ai-guardrails ai-safety mcp-server typescript · by NeuroverseOS · TypeScript