freshcrate

Search results for "experiments"

Clear filters
25 results found (Python)
ai-experimentsπŸ“0.0.0🌿 Growing⭐168

AI Experiments A public repository of AI/ML projects exploring generative models, NLP, computer vision, and autonomous agents. Includes code, documentation, and demos for educational purposes.

opikπŸ“2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Auto-claude-code-research-in-sleepπŸ“v0.4.4🌳 Mature⭐6,182

ARIS βš”οΈ (Auto-Research-In-Sleep) β€” Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in β€” works wi

langchainπŸ“langchain-core==1.3.0🌳 Mature⭐133,178

The agent engineering platform

llm-rl-environments-lil-courseπŸ“main@2026-04-17🌿 Growing⭐57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

LRATπŸ“0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

auto-deep-researcher-24x7πŸ“main@2026-04-19🌿 Growing⭐261

πŸ”₯ An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

Agentic-RAG-R1πŸ“0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

ISC-BenchπŸ“v0.0.5🌿 Growing⭐786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

Dragon-BrainπŸ“v1.1.0🌱 Seedling⭐43

Dragon Brain β€” persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

AGI-Alpha-Agent-v0πŸ“main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI πŸ‘οΈβœ¨ β€” Mission 🎯 End‑to‑end: Identify πŸ” β†’ Out‑Learn πŸ“š β†’ Out‑Think 🧠 β†’ Out‑Design 🎨 β†’ Out‑Strategise β™ŸοΈ β†’ Out‑Execute ⚑

evalsπŸ“v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

maverick-mcpπŸ“main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

trulensπŸ“trulens-2.7.2🌱 Seedling⭐3,237

Evaluation and Tracking for LLM Experiments and AI Agents

llmwareπŸ“v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

UltraRAGπŸ“v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

AgentQuantπŸ“0.0.0🌱 Seedling⭐87

Autonomous quantitative trading research platform that transforms stock lists into fully backtested strategies using AI agents, real market data, and mathematical formulations, all without requiring a

Open-SableπŸ“v1.7.0🌱 Seedling⭐18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

mlflowπŸ“v3.11.1🌱 Seedling⭐25,285

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin

edslπŸ“wasm-wheel🌱 Seedling⭐454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

PolyCouncilπŸ“v1.1.1🌱 Seedling⭐28

PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,

camelπŸ“v0.2.90🌱 Seedling⭐16,654

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

RAGEloπŸ“0.4.0🌱 Seedling⭐128

RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker

p4mcp-serverπŸ“2025.2.2901372🌱 Seedling⭐76

[Community Supported] Perforce P4 MCP Server is a Model Context Protocol (MCP) server that integrates with the Perforce P4 version control system.

PromptManagerπŸ“master@2026-04-12🌱 Seedling⭐3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.