Search results for "rl"
π± A little course on Reinforcement Learning Environments for evaluating and training Language Models
A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous social
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Secure, Fast, and Extensible Sandbox runtime for AI agents.
Agentic RAG R1 Framework via Reinforcement Learning
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. ππ» Integrates with 50+ LLM Providers,
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
Autonomous Agents (LLMs) research papers. Updated Daily.
π₯ Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website
2026 swarm Agent εΉ΄οΌswarm Agent γAgent teamγ ai codingγskillγmemoryγevolveγagentic RL η AI Agentιε
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding β they're redefining how software changes the world.
Unleash Next-Level AI! π π» Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! π Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! π OpenAI-Compatible. οΏ½
π The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architect
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Autonomous knowledge base plugin for Claude Code - captures reserch, ideas, and decisions into an interlinked wiki with reserch-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou
Free, open-source SQL Server execution plan analyzer β cross-platform GUI + CLI with 30 analysis rules, missing index detection, SSMS extension. Built-in MCP server for AI-assisted plan review.
The Mind Palace for AI Agents β Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra
The open world for autonomous AI agents on Solana Trade. Build. Fight. Earn. Explore. Connect your AI agent to a persistent shared world. Trade real SOL, build structures, form guilds, fight for terri
The agent that grows with you
DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework
A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...
Declarative Self Improving Elixir - DSPy Orchestration in Elixir
Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)
Build and run agents you can see, understand and trust.
METAβAGENTIC Ξ±βAGI ποΈβ¨ β Mission π― Endβtoβend: Identify π β OutβLearn π β OutβThink π§ β OutβDesign π¨ β OutβStrategise βοΈ β OutβExecute β‘
Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.
Free, open-source SQL Server performance monitoring β 32 collectors, real-time alerts, graphical plan viewer, MCP server for AI analysis. Supports SQL 2016-2025, Azure SQL, AWS RDS.
Official ServerlessClaw: The authoritative autonomous AI agent swarm for AWS. Zero idle cost, self-evolving, and infinite scale. Powered by OpenClaw.
A fast and flexible implementation of Rigid Body Dynamics algorithms and their analytical derivatives
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
π The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade archit
Must-read papers on Repository-level Code Generation & Issue Resolution π₯
MCP server providing tools to create Ms Office documents like presentations, emails, spreadsheets and word docs (pptx, docx, eml, xlsx)
A lock-free, in-memory fuzzy search engine for Kotlin Multiplatform. L2-normalized sparse vector embeddings with O(1) cosine similarity β handles typos, transpositions, and blind continuation. Zero-al
β‘οΈ Blazing fast LLMs API Gateway written in Go
Personal OS agent that learns who you are, detects life patterns, and grows smarter about you every day. Memory + Cron + Atropos RL
Open Framework for AI Agents to play Red Alert through Reinforcement Learning
Showcase 39 validated OpenClaw AI use cases in Chinese to help users automate tasks and improve daily work and life efficiently.
Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.
Computer Environments Elicit General Agentic Intelligence in LLMs
The graph-native hybrid retrieval engine for AI and GraphRAG. Graph + Vector + Full-Text in a single transactional engine.
Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.
π€ Define and execute multi-agent AI workflows declaratively using YAML, simplifying orchestration and enhancing collaboration through automatic context handling.
πΉοΈ Play DevLies, a multiplayer social deduction game for developers, where teams clash as Developers root out hidden Hackers.
π§ Explore a FAIR-compliant knowledge graph that analyzes ancient debates on free will, fate, and moral responsibility from the 6th century BCE to CE.
Generate OTP supervision trees and fault-tolerance scaffolding
AI Workforce plugin for Claude Code β proactive sales & marketing strategy for startup founders. 24 domain knowledge skills, 10 commands, 4 AI agents. Integrates 15+ strategic frameworks.
Showcase delivers a modern developer portfolio built with TypeScript and React, focusing on interactivity and clean architecture for a seamless user experience.
Access Twitter timelines, bookmarks, and profiles from the terminal without requiring API keys, offering a simple CLI user experience.
Lightweight hallucination detection framework for RAG applications
MCP server for controlling Apple TV, HomePod, and AirPlay devices. Control your TV with natural language through Claude Desktop.
π¦Ύ A productionβready research outreach AI agent that plans, discovers, reasons, uses tools, autoβbuilds cited briefings, and drafts tailored emails with toolβchaining, memory, tests, and turnkey Dock
