freshcrate — Search

Search results for "experiments"

45 results found

AI Experiments A public repository of AI/ML projects exploring generative models, NLP, computer vision, and autonomous agents. Includes code, documentation, and demos for educational purposes.

pythonby vivekpathaniaPython

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability pythonby comet-mlPython

neurolink 📁v9.56.0🌿 Growing⭐121

Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.

agents ai ai-development ai-platform automation developer-tools llm local-first typescriptby juspayTypeScript

Auto-claude-code-research-in-sleep 📁v0.4.4🌳 Mature⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi

ai-research ai-tools aris autonomous-agent claude claude-code claude-code-skills codex pythonby wanshuiyinPython

langchain 📁langchain-core==1.3.0🌳 Mature⭐133,178

The agent engineering platform

agents ai ai-agents anthropic chatgpt deepagents enterprise framework pythonby langchain-aiPython

llm-rl-environments-lil-course 📁main@2026-04-17🌿 Growing⭐57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

course grpo language-models llm llm-agent python reinforcement-learning reinforcement-learning-environments rlvrby anakin87Python

LRAT 📁0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

agent agentic llm python searchby Yuqi-ZhouPython

gemini-autoresearch 📁0.0.0🌱 Seedling⭐27

Autonomous goal-directed iteration for Gemini CLI. Inspired by Karpathy's autoresearch. Modify → Verify → Keep/Discard → Repeat forever.

ai-agent autonomous-agent autoresearch gemini gemini-cli javascript karpathy skillby supratikpmJavaScript

aitools_client 📁0.0.0🌿 Growing⭐182

Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters

anthropic automatic1111 c#comfyui llm ollama openai tabbyapiby SethRobinsonC#

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

quint-llm-kit 📁0.0.0🌿 Growing⭐53

Agents and tools for using Quint with LLMs

bluespecby informalsystemsBluespec

auto-deep-researcher-24x7 📁main@2026-04-19🌿 Growing⭐261

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

ai-agent autonomous-agent claude-code deep-learning experiment-automation gpu hyperparameter-tuning llm-agent pythonby Xiangyue-ZhangPython

latitude-llm 📁claude-code-telemetry-0.0.5🌿 Growing⭐3,955

Latitude is the open-source agent engineering platform

typescriptby latitude-devTypeScript

Agentic-RAG-R1 📁0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

agentic grpo python rag rlby jiangxinkePython

convoke-agents 📁v3.3.0🌱 Seedling⭐42

Convoke extends BMAD Method AI agents with two types of installable modules: Teams bring new agents for a domain, Skills add new capabilities to existing agents. Install them independently or combine

agentic agentic-ai bmad-method claude-code intent-driven-development javascript meta-agent pdlc product-discoveryby amalikJavaScript

ISC-Bench 📁v0.0.5🌿 Growing⭐786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

adversarial-attacks agent-safety ai-safety benchmark frontier-models jailbreak large-language-models llm-safety pythonby wuyoscarPython

awesome-prompts 📁main@2026-04-21🌿 Growing⭐7,572

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

awesome awesome-list chatgpt gpt4 gpts gptstore papers prompt prompt-engineeringby ai-boost

laravel-travel-agent 📁0.0.0🌱 Seedling⭐63

Multi-Agent workflow running into a Laravel application with Neuron PHP AI framework

agent agentic-framework agentic-workflow ai ai-agents ai-framework ai-workflow blade laravelby neuron-coreBlade

Dragon-Brain 📁v1.1.0🌱 Seedling⭐43

Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

ai-memory claude codex-cli cursor falkordb gemini-cli knowledge-graph llm-tools pythonby iikarusPython

AGI-Alpha-Agent-v0 📁main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

agentic agentic-ai agentic-framework ai aiagent aiagents llm meta-agentic pythonby MontrealAIPython

evals 📁v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

agentic agentic-ai ai evaluation machine-learning python strands-agentsby strands-agentsPython

maverick-mcp 📁main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

anthropic artificial-intelligence claude equities fastmcp finance financial-analysis fintech pythonby wshobsonPython

trulens 📁trulens-2.7.2🌱 Seedling⭐3,237

Evaluation and Tracking for LLM Experiments and AI Agents

agent-evaluation agentops ai-agents ai-monitoring ai-observability evals explainable-ml llm-eval pythonby trueraPython

autoresearch 📁v1.9.12🌿 Growing⭐3,546

Claude Autoresearch Skill — Autonomous goal-directed iteration for Claude Code. Inspired by Karpathy's autoresearch. Modify → Verify → Keep/Discard → Repeat forever.

ai autonomous-agent autoresearch claude claude-code iteration karpathy productivity shellby uditgoenkaShell

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

awesome-vector-database 📁main@2026-04-13🌿 Growing⭐341

A curated list of awesome works related to high dimensional structure/vector search & database

approximate-nearest-neighbor-search embedding-similarity embeddings-similarity nearest-neighbor-search search-engine similarity-search vector-database vector-searchby dangkhoasdc

next-plaid 📁v1.2.0🌿 Growing⭐331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

agentic-rag cli grep multi-vector rust vector-databaseby lightonaiRust

UltraRAG 📁v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

deepseek demo easy embedding flask gpt huggingface-transformers llm pythonby OpenBMBPython

AgentQuant 📁0.0.0🌱 Seedling⭐87

Autonomous quantitative trading research platform that transforms stock lists into fully backtested strategies using AI agents, real market data, and mathematical formulations, all without requiring a

agentic-ai ai-for-trading ai-trading ai-trading-agent algorithmic-trading autonomous-agent finance-ai fintech pythonby OnePunchMonkPython

Open-Sable 📁v1.7.0🌱 Seedling⭐18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

agentic agentic-ai ai ai-assistant open-source pythonby IdeoaLabsPython

mlflow 📁v3.11.1🌱 Seedling⭐25,285

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin

agentops agents ai ai-governance apache-spark evaluation langchain llm-evaluation pythonby mlflowPython

tensorzero 📁2026.4.0🌱 Seedling⭐11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt rustby tensorzeroRust

edsl 📁wasm-wheel🌱 Seedling⭐454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

anthropic data-labeling deepinfra domain-specific-language experiments llama2 llm llm-agent pythonby expectedparrotPython

deep-research-agent 📁0.0.0💤 Dormant⭐18

Deep research agent built with Neuron PHP AI framewokrk

agent agentic-ai agentic-framework ai ai-framework ai-workflow deep-research deepresearch phpby neuron-corePHP

PolyCouncil 📁v1.1.1🌱 Seedling⭐28

PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,

ai ai-council ai-experiments ai-framework ai-research artificial-intelligence asyncio concensus pythonby TrentPiercePython

camel 📁v0.2.90🌱 Seedling⭐16,654

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

agent ai-societies artificial-intelligence communicative-ai cooperative-ai deep-learning large-language-models multi-agent-systems pythonby camel-aiPython

devkit 📁v2.1.29🌱 Seedling⭐2

A deterministic development harness for Claude Code — MCP workflow engine, enforcement hooks, YAML workflows, and multi-agent consensus (Claude + Codex + Gemini)

ai-agents claude-code code-quality developer-tools devops go mcp mcp-server multi-agentby 5uck1essGo

rex-cli 📁v0.17.0🌱 Seedling⭐27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

ai-agent automation browser-automation claude-code cli codex-cli contextdb gemini-cli javascriptby rexleimoJavaScript

RAGElo 📁0.4.0🌱 Seedling⭐128

RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker

pythonby zetaalphavectorPython

p4mcp-server 📁2025.2.2901372🌱 Seedling⭐76

[Community Supported] Perforce P4 MCP Server is a Model Context Protocol (MCP) server that integrates with the Perforce P4 version control system.

mcp mcp-server p4 p4-code-review p4-mcp p4-mcp-server p4python perforce pythonby perforcePython

autonomous-agentic-research-swarm 📁main@2026-04-11🌱 Seedling⭐4

File-based autonomous agentic research swarm template (Planner/Worker/Judge) with contracts, workstreams, and deterministic quality gates.

agentic automation claude codex git-worktrees html reproducible-research research swarmby AysajanEHTML

PromptManager 📁master@2026-04-12🌱 Seedling⭐3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.

prompt-engineering pythonby voytas75Python

llm-agents.nix 📁assets🌱 Seedling⭐988

Nix packages for AI coding agents and development tools. Automatically updated daily.

buildbot-numtide nixby numtideNix

redesigned-pancake 📁0.0.0⚰️ Archived⭐222

Skip to content github / docs Code Issues 80 Pull requests 35 Discussions Actions Projects 2 Security Insights Merge branch 'main' into 1862-Add-Travis-CI-migration-table 1862-Add-Travis-CI-migration

by Sfedfcv

aiflows 📁v1.1.1⚰️ Archived⭐275

🤖🌊 aiFlows: The building blocks of your collaborative AI

agent agents ai ai-framework ai-frameworks chatgpt copilot gpt jupyter notebookby epfl-dlabJupyter Notebook