freshcrate

Search results for "evaluation"

24 results found (TypeScript)
agenta v0.96.7 🌳 Mature ⭐ 4,011

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

openclaw-engram v9.3.142 🌿 Growing ⭐ 54

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction, plain markdown storage, hybrid search via QMD. Gives agents persistent long-term memory across conversations.

agentmemory v0.9.1 🌳 Mature ⭐ 738

Persistent memory for AI coding agents

neurolink v9.56.0 🌿 Growing ⭐ 121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.

piclaw v1.8.3 🌿 Growing ⭐ 467

I'm going to build my own OpenClaw, with blackjack... and bun!

langfuse v3.169.0 🌿 Growing ⭐ 24,578

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊 YC W23

promptfoo code-scan-action-0.1.5 🌿 Growing ⭐ 19,943

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

langwatch skills@v0.3.0 🌿 Growing ⭐ 3,193

The platform for LLM evaluations and AI agent testing

latitude-llm claude-code-telemetry-0.0.5 🌿 Growing ⭐ 3,955

Latitude is the open-source agent engineering platform

prism-mcp v9.3.0 🌿 Growing ⭐ 116

The Mind Palace for AI Agents: Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

panguard-ai v1.4.19 🌱 Seedling ⭐ 37

Open-source security platform for AI agents -- audits skills before install, monitors 24/7, shares threat intelligence across all users.

mission-control v2.5.0 🌿 Growing ⭐ 1,853

The world's first Autonomous Product Engine (APE): AI agents research your market, generate features, and ship code as PRs. Convoy mode, crash recovery, cost tracking, 80+ API endpoints. Self-hosted v

karpathy-llm-wiki main@2026-04-21 🌱 Seedling ⭐ 34

The Self-Growing Karpathy LLM Wiki, grown by an AI agent yoyo from Karpathy's founding prompt

Cogitator-AI main@2026-04-21 🌱 Seedling ⭐ 35

🤖 Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.

vexa v0.10.2 🌿 Growing ⭐ 1,862

Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.

voltagent @voltagent/server-elysia@2.0.7 🌿 Growing ⭐ 7,851

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

mastra @mastra/core@1.24.0 🌱 Seedling ⭐ 22,899

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

kernel v3.97.0 🌱 Seedling ⭐ 12

kbot: the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

bisheng v2.3.0 🌱 Seedling ⭐ 11,293

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF

CodeRAG main@2026-04-21 🌱 Seedling ⭐ 1

Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.

harness master@2026-04-21 🌱 Seedling ⭐ 1

Define and control AI agents in markdown with full prompt transparency, persistent memory, and integrated tools via the Claude Agent SDK.

Neuroverseos-governance v0.3.0 🌱 Seedling ⭐ 1

Deterministic governance engine for AI agents. Enforce rules defined in .md governance files across AI systems.