freshcrate — Search

Search results for "grpo"

10 results found

llm-rl-environments-lil-course 📁main@2026-04-17🌿 Growing⭐57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

course grpo language-models llm llm-agent python reinforcement-learning reinforcement-learning-environments rlvr by anakin87 Python

agentic-memory 📁0.0.0🌿 Growing⭐162

No description

by lhl

Agentic-RAG-R1 📁0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

agentic grpo python rag rl by jiangxinke Python

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagents by tmgthb

unsloth-buddy 📁main@2026-04-15🌿 Growing⭐212

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

apple-silicon claude-code dpo fine-tuning gaslamp grpo huggingface lora python by TYH-labs Python

awesome-prompts 📁main@2026-04-21🌿 Growing⭐7,572

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

awesome awesome-list chatgpt gpt4 gpts gptstore papers prompt prompt-engineering by ai-boost

memory_agent_hub 📁main@2026-04-20🌱 Seedling⭐38

2026, the year of the swarm agent: a collection of AI agent resources covering swarm agents, agent teams, AI coding, skills, memory, evolution, agentic RL, and more.

ai-memory elasticsearch graphrag jupyter notebook knowledge-graph llm-agent milvus neo4j rag-technology by 1850298154 Jupyter Notebook

AReaL 📁v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rl by inclusionAI Python

OpenRA-RL 📁v0.4.1🌱 Seedling⭐118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

python by yxc20089 Python

judge0 📁v1.13.1⚰️ Archived⭐4,082

Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.

ai-agent-tools ai-agents ai-tools code-execution code-executor code-runner competitive-programming html online-compiler by judge0 HTML