Search results for "rl"
A little course on Reinforcement Learning Environments for evaluating and training Language Models
A Python-based low-modeling low-code open-source platform for smart and AI-enhanced software
The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Secure, Fast, and Extensible Sandbox runtime for AI agents.
Open Framework for AI Agents to play Red Alert through Reinforcement Learning
Agentic RAG R1 Framework via Reinforcement Learning
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. Integrates with 50+ LLM Providers,
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding, they're redefining how software changes the world.
Unleash Next-Level AI! Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! OpenAI-Compatible.
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Autonomous knowledge base plugin for Claude Code - captures research, ideas, and decisions into an interlinked wiki with research-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
The agent that grows with you
A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...
Charles Proxy MCP server for AI agents with live capture, structured traffic analysis, and agent-friendly tool contracts
Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf
Automatically Update LLM-Agent Papers Daily using GitHub Actions (Update Every 12 Hours)
Build and run agents you can see, understand and trust.
META-AGENTIC α-AGI. Mission End-to-end: Identify → Out-Learn → Out-Think → Out-Design → Out-Strategise → Out-Execute
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution, inspired by Herme
MCP server providing tools to create MS Office documents like presentations, emails, spreadsheets and Word docs (pptx, docx, eml, xlsx)
Personal OS agent that learns who you are, detects life patterns, and grows smarter about you every day. Memory + Cron + Atropos RL
Lightweight hallucination detection framework for RAG applications
Computer Environments Elicit General Agentic Intelligence in LLMs
Define and execute multi-agent AI workflows declaratively using YAML, simplifying orchestration and enhancing collaboration through automatic context handling.
Access Twitter timelines, bookmarks, and profiles from the terminal without requiring API keys, offering a simple CLI user experience.
MCP server for controlling Apple TV, HomePod, and AirPlay devices. Control your TV with natural language through Claude Desktop.
A production-ready research outreach AI agent that plans, discovers, reasons, uses tools, auto-builds cited briefings, and drafts tailored emails with tool-chaining, memory, tests, and turnkey Dock
A framework for optimizing textual system components (AI prompts, code snippets, etc.) using LLM-based reflection and Pareto-efficient evolutionary search.
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym).
pytest plugin for URL based testing
A Git URL parsing module (supports parsing and rewriting)
CloudEvents Python SDK
Python client library for Modal
Open World Holidays Framework
Jupyter interactive widgets for JupyterLab
A set of server components for JupyterLab and JupyterLab like applications.
Jupyter Notebook - A web-based notebook environment for interactive computing
Dynamic version generation
