freshcrate

Search results for "evals"

22 results found (Python)
trulens · trulens-2.7.2 · 🌳 Mature · ⭐ 3,261

Evaluation and Tracking for LLM Experiments and AI Agents

jarvis · v1.28.0 · 🌿 Growing · ⭐ 174

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

gptme · v0.31.1.dev20260420 · 🌳 Mature · ⭐ 4,274

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

pydantic-ai · v1.84.1 · 🌳 Mature · ⭐ 16,274

AI Agent Framework, the Pydantic way

langchain · langchain-core==1.3.0 · 🌳 Mature · ⭐ 133,178

The agent engineering platform

instructor · v1.15.1 · 🏛️ Flagship · ⭐ 12,802

structured outputs for llms

logfire · v4.32.1 · 🌿 Growing · ⭐ 4,161

AI observability platform for production LLM and agent systems.

fast-agent · v0.6.17 · 🌿 Growing · ⭐ 3,740

Code, Build and Evaluate agents - excellent Model and Skills/MCP/ACP Support

evals · v0.1.15 · 🌿 Growing · ⭐ 103

A comprehensive evaluation framework for AI agents and LLM applications.

sec-edgar-mcp · v1.0.8 · 🌿 Growing · ⭐ 253

A SEC EDGAR MCP (Model Context Protocol) Server

ragas · v0.4.3 · 🌳 Mature · ⭐ 13,569

Supercharge Your LLM Application Evaluations 🚀

honcho · main@2026-04-21 · 🌿 Growing · ⭐ 2,030

Memory library for building stateful agents

LLM-Agent-Paper-daily · main@2026-04-21 · 🌱 Seedling · ⭐ 20

Automatically updates LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

atomic-knowledge · v0.2.0 · 🌱 Seedling · ⭐ 36

Markdown-first work-memory protocol for existing agents, with maintained knowledge, candidate notes, evals, and an example KB.

sv-excel-agent · 0.0.0 · 🌱 Seedling · ⭐ 179

An Excel AI agent that uses MCP tools to let LLMs read, edit, and automate Excel spreadsheets.

uipath-ai-skills · 0.0.0 · 🌱 Seedling · ⭐ 81

AI skills that turn coding agents into UiPath experts.

dory · v0.1.0 · 🌱 Seedling · ⭐ 14

One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.

agent2 · v0.1.0 · 🌱 Seedling · ⭐ 25

The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.

Agentic-AI-Pipeline · v1.0.0 · 💀 Dormant · ⭐ 63

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

inspect-ai · 0.3.209 · 🌱 Seedling

Framework for large language model evaluations

google-cloud-aiplatform · 1.148.1 · 🌱 Seedling

Vertex AI API client library