Search results for "reinforcement"
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym).
A little course on Reinforcement Learning Environments for evaluating and training Language Models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
DSPy: The framework for programming, not prompting, language models
Multi-agent code review mesh: orchestrates AI agents from multiple providers to review code in parallel, cross-review each other's findings, and build accuracy profiles over time. Agents that catch r
Build and run agents you can see, understand and trust.
Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI agent state management for next-generation autonomous systems.
CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
The Mind Palace for AI Agents: Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
NEXO Brain: Shared brain for AI agents. Persistent memory, semantic RAG, natural forgetting, metacognitive guard, trust scoring, 150+ MCP tools. Works with Claude Code, Codex, Claude Desktop & any MC
Open Framework for AI Agents to play Red Alert through Reinforcement Learning
Agentic RAG R1 Framework via Reinforcement Learning
A selective learning and memory substrate for agentic systems: typed, revisable, decayable memory with competence learning and trust-aware retrieval.
Autonomous Agents (LLMs) research papers. Updated Daily.
Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution, inspired by Herme
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding, they're redefining how software changes the world.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs: long-term context, retrieval, and reasoning.
Must-read papers on Repository-level Code Generation & Issue Resolution
The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil
FinGPT: Open-Source Financial Large Language Models! Revolutionize. We release the trained model on HuggingFace.
PromptOS is a centralized prompt intelligence system that understands, evolves, and adapts across domains. Acting as a Prompt Operating System, it continuously improves using user feedback and reinfor
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Computer Environments Elicit General Agentic Intelligence in LLMs
Automatically update LLM-Agent papers daily using GitHub Actions (updated every 12 hours)
Build personalized learning paths with the Autonomous Skill Builder Agent, leveraging AI to enhance skill mastery and adaptive learning experiences.
Autonomous coding agent with web research (Recon), adversarial plan debate, 5-tier cognitive memory, multi-model routing (Gemini + DeepSeek + Ollama), 24/7 loops, and $0 local mode. Apache 2.0.
Self-improving AI game agent for ClawClash. Auto-discovers games, fights, collects data, and trains models.
A production-ready research outreach AI agent that plans, discovers, reasons, uses tools, auto-builds cited briefings, and drafts tailored emails with tool-chaining, memory, tests, and turnkey Dock
TENET: The operating system for AI agent teams
ASAN: A conceptual architecture for a self-creating (autopoietic), energy-efficient, and governable multi-agent AI system.
