freshcrate — Search

Search results for "rl"

56 results found

llm-rl-environments-lil-course 📁main@2026-04-17🌿 Growing⭐57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

course grpo language-models llm llm-agent python reinforcement-learning reinforcement-learning-environments rlvrby anakin87Python

Microverse 📁0.0.0🌳 Mature⭐2,225

A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous social

ai-game ai-world-simulation gdscript godot multi-agentby KsanaDockGDScript

npcpy 📁v1.4.21🌳 Mature⭐1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

agents ai llm mcp mcp-client mcp-server ollama perplexity pythonby NPC-WorldwidePython

OpenSandbox 📁docker/execd/v1.0.13🌳 Mature⭐9,925

Secure, Fast, and Extensible Sandbox runtime for AI agents.

ai ai-agent ai-infra kubernetes python sandboxby alibabaPython

agentic-memory 📁0.0.0🌿 Growing⭐162

No description

by lhl

Agentic-RAG-R1 📁0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

agentic grpo python rag rlby jiangxinkePython

openlit 📁openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

ai-observability amd-gpu clickhouse distributed-tracing genai gpu-monitoring grafana langchain pythonby openlitPython

cognithor 📁v0.92.2🌿 Growing⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

agent-os ai-agent anthropic autonomous-agent discord-bot document-analysis gdpr-compliant gemini pythonby Alex8791-cyberPython

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

Awesome-Context-Engineering 📁0.0.0🌳 Mature⭐3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

agent agentic-ai agi awesome-list cognitive-science context-engineering llm ragby Meirtz

Awesome-World-Models 📁main@2026-04-21🌿 Growing⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

artificial-intelligence autonomous-driving awesome deep-learning embodied-ai future-prediction video-prediction world-modelby leofan90

memory_agent_hub 📁main@2026-04-20🌱 Seedling⭐38

2026 swarm Agent 年，swarm Agent 、Agent team、 ai coding、skill、memory、evolve、agentic RL 等 AI Agent集合

ai-memory elasticsearch graphrag jupyter notebook knowledge-graph llm-agent milvus neo4j rag-technologyby 1850298154Jupyter Notebook

awesome-code-agents 📁main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

pythonby EuniAIPython

DeepClaude 📁v1.0.1🌳 Mature⭐2,788

Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible. �

ai claude-3-7-sonnet deepseek gemini pythonby ErlichLiuPython

claude-flows 📁0.0.0🌿 Growing⭐93

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architect

shellby xyzthiagoShell

AReaL 📁v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rlby inclusionAIPython

LLM-Wiki 📁main@2026-04-18🌱 Seedling⭐7

Autonomous knowledge base plugin for Claude Code - captures reserch, ideas, and decisions into an interlinked wiki with reserch-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou

ai-tools autonomous-agent claude-code claude-code-plugin fastapi knowledge-base knowledge-management llm pythonby OshayrPython

PerformanceStudio 📁v1.7.0🌿 Growing⭐155

Free, open-source SQL Server execution plan analyzer — cross-platform GUI + CLI with 30 analysis rules, missing index detection, SSMS extension. Built-in MCP server for AI-assisted plan review.

avalonia c#cli cross-platform database-performance dba-tools dotnet execution-plan mcp-serverby erikdarlingdataC#

prism-mcp 📁v9.3.0🌿 Growing⭐116

The Mind Palace for AI Agents — Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

agent-memory ai-agent anti-sycophancy claude-desktop cognitive-architecture google-gemini hebbian-learning llm-tools typescriptby dcostencoTypeScript

synaptic-memory 📁v0.16.0🌱 Seedling⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

ai-agent embedding graph-database hebbian-learning knowledge-graph llm mcp mcp-server pythonby PlateerLabPython

SocratiCode 📁v1.6.1🌿 Growing⭐810

Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra

ai ai-assistant ast claude code-graph codebase-analysis codebase-intelligence docker typescript vector-databaseby giancarloerraTypeScript

Agent-World-Protocol 📁main@2026-04-10🌱 Seedling⭐45

The open world for autonomous AI agents on Solana Trade. Build. Fight. Earn. Explore. Connect your AI agent to a persistent shared world. Trade real SOL, build structures, form guilds, fight for terri

ai ai-agent ai-agents anthropic autogpt claude crewai eliza rustby 0xMerl99Rust

hermes-agent 📁v2026.4.16🌿 Growing⭐57,954

The agent that grows with you

ai ai-agent ai-agents anthropic chatgpt claude claude-code clawdbot pythonby NousResearchPython

ds_ex 📁main@2026-04-09🌱 Seedling⭐17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

ai ai-framework automated-optimization beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

coding-proxy 📁v0.3.0🌱 Seedling⭐6

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

antigravity claude-code copilot doubao glm kimi llm-agent minimax pythonby ThreeFish-AIPython

DSPex 📁main@2026-04-09🌱 Seedling⭐17

Declarative Self Improving Elixir - DSPy Orchestration in Elixir

ai ai-framework autonomous-systems beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

llm llm-agent pythonby Lyz103Python

agentscope 📁v1.0.19🌿 Growing⭐23,421

Build and run agents you can see, understand and trust.

agent chatbot large-language-models llm llm-agent mcp multi-agent multi-modal pythonby agentscope-aiPython

AGI-Alpha-Agent-v0 📁main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

agentic agentic-ai agentic-framework ai aiagent aiagents llm meta-agentic pythonby MontrealAIPython

Awesome-Agent-Memory 📁main@2026-04-16🌿 Growing⭐333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

agent-memory ai-agent ai-agent-memory awesome-agent-memory llm-memory memory memory-management multimodal-llm-memoryby TeleAI-UAGI

PerformanceMonitor 📁v2.7.0🌿 Growing⭐302

Free, open-source SQL Server performance monitoring — 32 collectors, real-time alerts, graphical plan viewer, MCP server for AI analysis. Supports SQL 2016-2025, Azure SQL, AWS RDS.

aws-rds azure-sql blocking c#database-monitoring dba-tools deadlock deadlocks dotnet mcp-serverby erikdarlingdataC#

serverlessclaw 📁main@2026-04-21🌱 Seedling⭐8

Official ServerlessClaw: The authoritative autonomous AI agent swarm for AWS. Zero idle cost, self-evolving, and infinite scale. Powered by OpenClaw.

ai-agent ai-agents autonomous-agent autonomous-agents aws aws-lambda eventbridge llm typescriptby serverlessclawTypeScript

pinocchio 📁v4.0.0🌿 Growing⭐3,234

A fast and flexible implementation of Rigid Body Dynamics algorithms and their analytical derivatives

analytical-derivatives automatic-differentiation c++c-plus-plus casadi code-generation conda cppad dynamicsby stack-of-tasksC++

deep-research-mcp 📁main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

pythonby pminerviniPython

ruflo 📁v3.5.80🌿 Growing⭐31,236

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade archit

agentic-ai agentic-engineering agentic-framework agentic-rag agentic-workflow agents ai-assistant ai-tools typescriptby ruvnetTypeScript

Awesome-Repo-Level-Code-Generation 📁main@2026-04-10🌿 Growing⭐274

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

ai4se automated-software-engineering code-generation large-language-models llm software-engineeringby YerbaPage

mcp-ms-office-documents 📁v3.5🌱 Seedling⭐23

MCP server providing tools to create Ms Office documents like presentations, emails, spreadsheets and word docs (pptx, docx, eml, xlsx)

ai docx docx-generator eml mcp-server outloook pptx presentation-slides pythonby ForLegalAIPython

typeahead-kmp 📁2.0.4🌱 Seedling⭐9

A lock-free, in-memory fuzzy search engine for Kotlin Multiplatform. L2-normalized sparse vector embeddings with O(1) cosine similarity — handles typos, transpositions, and blind continuation. Zero-al

autocomplete concurrent coroutines cosine-similarity embeddings fuzzy-matching fuzzy-search in-memory kotlin vector-databaseby karlotiKotlin

cc-relay 📁v0.0.16🌱 Seedling⭐70

⚡️ Blazing fast LLMs API Gateway written in Go

anthropic bedrock claude claude-ai claude-api claude-code gemini gemini-api goby omarluqGo

hermes-life-os 📁v1.3.0🌱 Seedling⭐26

Personal OS agent that learns who you are, detects life patterns, and grows smarter about you every day. Memory + Cron + Atropos RL

atropos autonomous-agent autonomous-agents hermes-agent life-assistant memory nous-research personal-os pythonby Lethe044Python

OpenRA-RL 📁v0.4.1🌱 Seedling⭐118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

pythonby yxc20089Python

awesome-openclaw-usecases-zh 📁main@2026-04-21🌱 Seedling⭐4

Showcase 39 validated OpenClaw AI use cases in Chinese to help users automate tasks and improve daily work and life efficiently.

ai-agent ai-automation ai-tools automation chinese claude content-creation devopsby jrleon30

rex-cli 📁v0.17.0🌱 Seedling⭐27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

ai-agent automation browser-automation claude-code cli codex-cli contextdb gemini-cli javascriptby rexleimoJavaScript

llm-in-sandbox 📁v0.2.0🌱 Seedling⭐216

Computer Environments Elicit General Agentic Intelligence in LLMs

coding-agent computer-use-agent general-agent pythonby llm-in-sandboxPython

coordinode 📁v0.4.1🌱 Seedling⭐1

The graph-native hybrid retrieval engine for AI and GraphRAG. Graph + Vector + Full-Text in a single transactional engine.

ai database embedded-database full-text-search graph-database graphrag hnsw knowledge-graph rust vector-databaseby structured-worldRust

KREASYS 📁main@2026-04-21🌱 Seedling⭐2

Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.

ai-agent automation autonomous-agents browser-automation browser-based browser-native ide javascript local-aiby MCRLYJavaScript

YAML-Multi-Agent-Orchestrator 📁main@2026-04-21🌱 Seedling⭐2

🤖 Define and execute multi-agent AI workflows declaratively using YAML, simplifying orchestration and enhancing collaboration through automatic context handling.

agentic automation cli-tool crewai experimentation hackathon-project hackathon2026 ibm pythonby CharlesDaniel52Python

devlies 📁main@2026-04-21🌱 Seedling⭐2

🕹️ Play DevLies, a multiplayer social deduction game for developers, where teams clash as Developers root out hidden Hackers.

ai autonomous-agent cloudflare code-generation codebase-generation daisyui dns domain javascriptby hackstergirlrocksJavaScript

EleutherIA 📁main@2026-04-21🌱 Seedling⭐1

🧠 Explore a FAIR-compliant knowledge graph that analyzes ancient debates on free will, fate, and moral responsibility from the 6th century BCE to CE.

ancient-greek ancient-philosophy classics coinbase cryptocurrency early-christianity exrpress fair-data shell vector-databaseby buitoan112233Shell

otpiser 📁main@2026-04-17🌱 Seedling⭐1

Generate OTP supervision trees and fault-tolerance scaffolding

code-generation elixir erlang fault-tolerance hyperpolymath idris2 iser otp rustby hyperpolymathRust

anty-framework 📁v0.1.0🌱 Seedling⭐5

AI Workforce plugin for Claude Code — proactive sales & marketing strategy for startup founders. 24 domain knowledge skills, 10 commands, 4 AI agents. Integrates 15+ strategic frameworks.

ai-agent ai-workforce anthropic automation autonomous-agent claude claude-code claude-code-plugin shellby masterleopoldShell

showcase 📁main@2026-04-21🌱 Seedling⭐1

Showcase delivers a modern developer portfolio built with TypeScript and React, focusing on interactivity and clean architecture for a seamless user experience.

agents android architecture clean-architecture dart egg example flutter-package rag typescriptby OrlinhdtkTypeScript

twitter-cli 📁main@2026-04-21🌱 Seedling⭐1

Access Twitter timelines, bookmarks, and profiles from the terminal without requiring API keys, offering a simple CLI user experience.

agent-infrastructure ai-agent async client cursor download go pythonby warlockoussamaPython

LettuceDetect 📁0.1.8💤 Dormant⭐545

Lightweight hallucination detection framework for RAG applications

bert hallucination-detection hallucination-evaluation information-extraction nlp python pytorch token-classificationby KRLabsOrgPython

mcp-pyatv 📁v0.2.0🌱 Seedling⭐1

MCP server for controlling Apple TV, HomePod, and AirPlay devices. Control your TV with natural language through Claude Desktop.

airplay apple-tv claude homekit homepod mcp mcp-server model-context-protocol pythonby crlianPython

Agentic-AI-Pipeline 📁v1.0.0💤 Dormant⭐57

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

agent agentic-ai anthropic anthropic-ai aws chromadb docker duckduckgo pythonby hoangsonwwPython