freshcrate

Search results for "reinforcement"

38 results found
gymnasium 1.2.3 🏛️ Flagship ⭐11,766

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym).

llm-rl-environments-lil-course main@2026-04-17 🌿 Growing ⭐140

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

AReaL v1.0.3 🏛️ Flagship ⭐5,075

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent-framework dotnet-1.2.0 🏛️ Flagship ⭐9,666

A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.

dspy 3.2.0 🏛️ Flagship ⭐33,896

DSPy: The framework for programming - not prompting - language models

gossipcat-ai v0.4.15 🌱 Seedling ⭐22

Multi-agent code review mesh - orchestrates AI agents from multiple providers to review code in parallel, cross-review each other's findings, and build accuracy profiles over time. Agents that catch r

agentscope v1.0.19 🏛️ Flagship ⭐24,189

Build and run agents you can see, understand and trust.

MemMachine v0.3.5 🌳 Mature ⭐4,036

Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI agent state management for next-generation autonomous systems.

camel v0.2.91a1 🏛️ Flagship ⭐16,753

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

agentic-memory 0.0.0 🌿 Growing ⭐179

No description

by lhl
prism-mcp v9.3.0 🌿 Growing ⭐128

The Mind Palace for AI Agents - Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

tensorzero 2026.4.0 🏛️ Flagship ⭐11,261

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

nexo v7.1.6 🌱 Seedling ⭐11

NEXO Brain - Shared brain for AI agents. Persistent memory, semantic RAG, natural forgetting, metacognitive guard, trust scoring, 150+ MCP tools. Works with Claude Code, Codex, Claude Desktop & any MC

OpenRA-RL v0.4.1 🌿 Growing ⭐120

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

Agentic-RAG-R1 0.0.0 🌿 Growing ⭐413

Agentic RAG R1 Framework via Reinforcement Learning

membrane v0.2.0 🌿 Growing ⭐80

A selective learning and memory substrate for agentic systems - typed, revisable, decayable memory with competence learning and trust-aware retrieval.

Autonomous-Agents main@2026-04-16 🌿 Growing ⭐1,232

Autonomous Agents (LLMs) research papers. Updated Daily.

Awesome-Context-Engineering 0.0.0 🌳 Mature ⭐3,075

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

RAGEN main@2026-04-14 🌿 Growing ⭐2,629

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

learn-hermes-agent 0.0.0 🌱 Seedling ⭐16

A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution - inspired by Herme

cookbook main@2026-04-21 🌿 Growing ⭐144

Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.

Awesome-World-Models main@2026-04-21 🌿 Growing ⭐1,542

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

awesome-code-agents main@2026-04-20 🌿 Growing ⭐98

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding - they're redefining how software changes the world.

awesome-opensource-ai main@2026-04-20 🌿 Growing ⭐2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

Awesome-Agent-Memory main@2026-04-16 🌿 Growing ⭐363

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

Awesome-Repo-Level-Code-Generation main@2026-04-10 🌿 Growing ⭐280

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

Ultimate-Agent-Directory 0.0.0 🌱 Seedling ⭐51

🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil

FinGPT v1.0.0 🌱 Seedling ⭐19,689

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

PromptOS v2.2.1 🌱 Seedling ⭐15

PromptOS is a centralized prompt intelligence system that understands, evolves, and adapts across domains. Acting as a Prompt Operating System, it continuously improves using user feedback and reinfor

Open-Sable v1.7.0 🌱 Seedling ⭐19

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

llm-in-sandbox v0.2.0 🌱 Seedling ⭐221

Computer Environments Elicit General Agentic Intelligence in LLMs

LLM-Agent-Paper-daily main@2026-04-21 🌱 Seedling ⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12 hours)

Autonomous-Skill-Builder-Agent main@2026-04-20 🌱 Seedling ⭐1

🛠️ Build personalized learning paths with the Autonomous Skill Builder Agent, leveraging AI to enhance skill mastery and adaptive learning experiences.

forgegod main@2026-04-19 🌱 Seedling ⭐4

Autonomous coding agent with web research (Recon), adversarial plan debate, 5-tier cognitive memory, multi-model routing (Gemini + DeepSeek + Ollama), 24/7 loops, and $0 local mode. Apache 2.0.

appback-ai-agent 2.1.3 🌱 Seedling ⭐1

Self-improving AI game agent for ClawClash. Auto-discovers games, fights, collects data, and trains models.

Agentic-AI-Pipeline v1.0.0 💀 Dormant ⭐63

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

@10et/cli 1.15.9 🌱 Seedling

TENET - The operating system for AI agent teams

ASAN-Architecture 0.0.0 🌱 Seedling ⭐6

ASAN: A conceptual architecture for a self-creating (autopoietic), energy-efficient, and governable multi-agent AI system.