freshcrate

Search results for "grpo"

10 results found
llm-rl-environments-lil-course 📁 main@2026-04-17 🌿 Growing 57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

agentic-memory 📁 0.0.0 🌿 Growing 162

No description

by lhl
Agentic-RAG-R1 📁 0.0.0 🌿 Growing 412

Agentic RAG R1 Framework via Reinforcement Learning

Autonomous-Agents 📁 main@2026-04-16 🌿 Growing 1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

unsloth-buddy 📁 main@2026-04-15 🌿 Growing 212

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

awesome-prompts 📁 main@2026-04-21 🌿 Growing 7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt engineering, prompt attack & prompt protection. Advanced prompt engineering papers.

memory_agent_hub 📁 main@2026-04-20 🌱 Seedling 38

2026, the year of the swarm Agent: a collection of AI Agent resources covering swarm Agent, Agent team, AI coding, skill, memory, evolve, agentic RL, and more

AReaL 📁 v1.0.3 🌿 Growing 5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

OpenRA-RL 📁 v0.4.1 🌱 Seedling 118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

judge0 📁 v1.13.1 ⚰️ Archived 4,082

Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.