freshcrate

Search results for "experiments"

45 results found
ai-experiments | 0.0.0 | 🌿 Growing | ⭐ 168

AI Experiments: A public repository of AI/ML projects exploring generative models, NLP, computer vision, and autonomous agents. Includes code, documentation, and demos for educational purposes.

opik | 2.0.6 | 🌳 Mature | ⭐ 18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

neurolink | v9.56.0 | 🌿 Growing | ⭐ 121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.

Auto-claude-code-research-in-sleep | v0.4.4 | 🌳 Mature | ⭐ 6,182

ARIS ⚔️ (Auto-Research-In-Sleep) - Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in - works wi

langchain | langchain-core==1.3.0 | 🌳 Mature | ⭐ 133,178

The agent engineering platform

llm-rl-environments-lil-course | main@2026-04-17 | 🌿 Growing | ⭐ 57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

LRAT | 0.0.0 | 🌱 Seedling | ⭐ 34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

gemini-autoresearch | 0.0.0 | 🌱 Seedling | ⭐ 27

Autonomous goal-directed iteration for Gemini CLI. Inspired by Karpathy's autoresearch. Modify → Verify → Keep/Discard → Repeat forever.

aitools_client | 0.0.0 | 🌿 Growing | ⭐ 182

Seth's AI Tools: A Unity-based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes, and posters

Autonomous-Agents | main@2026-04-16 | 🌿 Growing | ⭐ 1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

quint-llm-kit | 0.0.0 | 🌿 Growing | ⭐ 53

Agents and tools for using Quint with LLMs

auto-deep-researcher-24x7 | main@2026-04-19 | 🌿 Growing | ⭐ 261

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

latitude-llm | claude-code-telemetry-0.0.5 | 🌿 Growing | ⭐ 3,955

Latitude is the open-source agent engineering platform

Agentic-RAG-R1 | 0.0.0 | 🌿 Growing | ⭐ 412

Agentic RAG R1 Framework via Reinforcement Learning

convoke-agents | v3.3.0 | 🌱 Seedling | ⭐ 42

Convoke extends BMAD Method AI agents with two types of installable modules: Teams bring new agents for a domain, Skills add new capabilities to existing agents. Install them independently or combine

ISC-Bench | v0.0.5 | 🌿 Growing | ⭐ 786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

awesome-prompts | main@2026-04-21 | 🌿 Growing | ⭐ 7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt engineering, prompt attack, and prompt protection. Advanced prompt engineering papers.

laravel-travel-agent | 0.0.0 | 🌱 Seedling | ⭐ 63

Multi-agent workflow running in a Laravel application with the Neuron PHP AI framework

Dragon-Brain | v1.1.0 | 🌱 Seedling | ⭐ 43

Dragon Brain - persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

AGI-Alpha-Agent-v0 | main@2026-04-18 | 🌿 Growing | ⭐ 283

META‑AGENTIC α‑AGI 👁️✨ - Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

evals | v0.1.15 | 🌿 Growing | ⭐ 103

A comprehensive evaluation framework for AI agents and LLM applications.

maverick-mcp | main@2026-04-17 | 🌿 Growing | ⭐ 479

MaverickMCP - Personal Stock Analysis MCP Server

trulens | trulens-2.7.2 | 🌱 Seedling | ⭐ 3,237

Evaluation and Tracking for LLM Experiments and AI Agents

autoresearch | v1.9.12 | 🌿 Growing | ⭐ 3,546

Claude Autoresearch Skill - Autonomous goal-directed iteration for Claude Code. Inspired by Karpathy's autoresearch. Modify → Verify → Keep/Discard → Repeat forever.

llmware | v0.4.6 | 🌿 Growing | ⭐ 14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

awesome-vector-database | main@2026-04-13 | 🌿 Growing | ⭐ 341

A curated list of awesome works related to high dimensional structure/vector search & database

next-plaid | v1.2.0 | 🌿 Growing | ⭐ 331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

UltraRAG | v0.3.0.2 | 🌿 Growing | ⭐ 5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

AgentQuant | 0.0.0 | 🌱 Seedling | ⭐ 87

Autonomous quantitative trading research platform that transforms stock lists into fully backtested strategies using AI agents, real market data, and mathematical formulations, all without requiring a

Open-Sable | v1.7.0 | 🌱 Seedling | ⭐ 18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

mlflow | v3.11.1 | 🌱 Seedling | ⭐ 25,285

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin

tensorzero | 2026.4.0 | 🌱 Seedling | ⭐ 11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

edsl | wasm-wheel | 🌱 Seedling | ⭐ 454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

deep-research-agent | 0.0.0 | 💀 Dormant | ⭐ 18

Deep research agent built with the Neuron PHP AI framework

PolyCouncil | v1.1.1 | 🌱 Seedling | ⭐ 28

PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,

camel | v0.2.90 | 🌱 Seedling | ⭐ 16,654

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

devkit | v2.1.29 | 🌱 Seedling | ⭐ 2

A deterministic development harness for Claude Code - MCP workflow engine, enforcement hooks, YAML workflows, and multi-agent consensus (Claude + Codex + Gemini)

rex-cli | v0.17.0 | 🌱 Seedling | ⭐ 27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

RAGElo | 0.4.0 | 🌱 Seedling | ⭐ 128

RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker

p4mcp-server | 2025.2.2901372 | 🌱 Seedling | ⭐ 76

[Community Supported] Perforce P4 MCP Server is a Model Context Protocol (MCP) server that integrates with the Perforce P4 version control system.

autonomous-agentic-research-swarm | main@2026-04-11 | 🌱 Seedling | ⭐ 4

File-based autonomous agentic research swarm template (Planner/Worker/Judge) with contracts, workstreams, and deterministic quality gates.

PromptManager | master@2026-04-12 | 🌱 Seedling | ⭐ 3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.

llm-agents.nix | assets | 🌱 Seedling | ⭐ 988

Nix packages for AI coding agents and development tools. Automatically updated daily.

redesigned-pancake | 0.0.0 | ⚰️ Archived | ⭐ 222


aiflows | v1.1.1 | ⚰️ Archived | ⭐ 275

🤖🌊 aiFlows: The building blocks of your collaborative AI