Search results for "reinforcement"
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym).
A little course on Reinforcement Learning Environments for evaluating and training Language Models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
DSPy: The framework for programming, not prompting, language models
Build and run agents you can see, understand and trust.
Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI agent state management for next-generation autonomous systems.
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
NEXO Brain: Shared brain for AI agents. Persistent memory, semantic RAG, natural forgetting, metacognitive guard, trust scoring, 150+ MCP tools. Works with Claude Code, Codex, Claude Desktop & any MC
Open Framework for AI Agents to play Red Alert through Reinforcement Learning
Agentic RAG R1 Framework via Reinforcement Learning
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution, inspired by Herme
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding: they're redefining how software changes the world.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Computer Environments Elicit General Agentic Intelligence in LLMs
Automatically Update LLM-Agent Papers Daily using GitHub Actions (Update Every 12 hours)
Autonomous coding agent with web research (Recon), adversarial plan debate, 5-tier cognitive memory, multi-model routing (Gemini + DeepSeek + Ollama), 24/7 loops, and $0 local mode. Apache 2.0.
🦾 A production-ready research outreach AI agent that plans, discovers, reasons, uses tools, auto-builds cited briefings, and drafts tailored emails with tool-chaining, memory, tests, and turnkey Dock
