freshcrate

Search results for "rl"

Clear filters
42 results found (Python)
llm-rl-environments-lil-courseπŸ“main@2026-04-17🌿 Growing⭐57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

BESSERπŸ“v7.1.7🌿 Growing⭐160

A Python-based low-modeling low-code open-source platform for smart and AI-enhanced software

npcpyπŸ“v1.4.21🌳 Mature⭐1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

OpenSandboxπŸ“docker/execd/v1.0.13🌳 Mature⭐9,925

Secure, Fast, and Extensible Sandbox runtime for AI agents.

OpenRA-RLπŸ“v0.4.1🌿 Growing⭐120

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

Agentic-RAG-R1πŸ“0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

openlitπŸ“openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. πŸš€πŸ’» Integrates with 50+ LLM Providers,

cognithorπŸ“v0.92.2🌿 Growing⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

awesome-code-agentsπŸ“main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding β€” they're redefining how software changes the world.

DeepClaudeπŸ“v1.0.1🌳 Mature⭐2,788

Unleash Next-Level AI! πŸš€ πŸ’» Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! πŸ“ Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! πŸ”Œ OpenAI-Compatible. οΏ½

AReaLπŸ“v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

LLM-WikiπŸ“main@2026-04-18🌱 Seedling⭐7

Autonomous knowledge base plugin for Claude Code - captures reserch, ideas, and decisions into an interlinked wiki with reserch-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou

synaptic-memoryπŸ“v0.16.0🌱 Seedling⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

hermes-agentπŸ“v2026.4.16🌿 Growing⭐57,954

The agent that grows with you

coding-proxyπŸ“v0.3.0🌱 Seedling⭐6

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

Charles-mcpπŸ“v3.0.3🌱 Seedling⭐179

Charles Proxy MCP server for AI agents with live capture, structured traffic analysis, and agent-friendly tool contracts

sinain-hudπŸ“overlay-v2.8.0🌱 Seedling⭐5

Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf

LLM-Agent-Paper-dailyπŸ“main@2026-04-21🌱 Seedling⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

agentscopeπŸ“v1.0.19🌿 Growing⭐23,421

Build and run agents you can see, understand and trust.

AGI-Alpha-Agent-v0πŸ“main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI πŸ‘οΈβœ¨ β€” Mission 🎯 End‑to‑end: Identify πŸ” β†’ Out‑Learn πŸ“š β†’ Out‑Think 🧠 β†’ Out‑Design 🎨 β†’ Out‑Strategise β™ŸοΈ β†’ Out‑Execute ⚑

deep-research-mcpπŸ“main@2026-04-13🌿 Growing⭐58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

learn-hermes-agentπŸ“0.0.0🌱 Seedling⭐16

A 27-chapter hands-on tutorial for building an autonomous AI agent from zero in Python. Agent loop, tool system, memory, skills, MCP, multi-platform gateway, and self-evolution β€” inspired by Herme

mcp-ms-office-documentsπŸ“v3.5🌱 Seedling⭐23

MCP server providing tools to create Ms Office documents like presentations, emails, spreadsheets and word docs (pptx, docx, eml, xlsx)

hermes-life-osπŸ“v1.3.0🌱 Seedling⭐26

Personal OS agent that learns who you are, detects life patterns, and grows smarter about you every day. Memory + Cron + Atropos RL

LettuceDetectπŸ“0.1.8πŸ’€ Dormant⭐565

Lightweight hallucination detection framework for RAG applications

llm-in-sandboxπŸ“v0.2.0🌱 Seedling⭐221

Computer Environments Elicit General Agentic Intelligence in LLMs

YAML-Multi-Agent-OrchestratorπŸ“main@2026-04-21🌱 Seedling⭐2

πŸ€– Define and execute multi-agent AI workflows declaratively using YAML, simplifying orchestration and enhancing collaboration through automatic context handling.

twitter-cliπŸ“main@2026-04-21🌱 Seedling⭐1

Access Twitter timelines, bookmarks, and profiles from the terminal without requiring API keys, offering a simple CLI user experience.

mcp-pyatvπŸ“v0.2.0🌱 Seedling⭐1

MCP server for controlling Apple TV, HomePod, and AirPlay devices. Control your TV with natural language through Claude Desktop.

Agentic-AI-PipelineπŸ“v1.0.0πŸ’€ Dormant⭐63

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

gepaπŸ“0.1.1🌱 Seedling

A framework for optimizing textual system components (AI prompts, code snippets, etc.) using LLM-based reflection and Pareto-efficient evolutionary search.

gymnasiumπŸ“1.2.3🌱 Seedling

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym).

pytest-base-urlπŸ“2.1.0🌱 Seedling

pytest plugin for URL based testing

giturlparseπŸ“0.14.0🌱 Seedling

A Git URL parsing module (supports parsing and rewriting)

cloudeventsπŸ“2.0.0🌱 Seedling

CloudEvents Python SDK

modalπŸ“1.4.2🌱 Seedling

Python client library for Modal

holidaysπŸ“0.95🌱 Seedling

Open World Holidays Framework

jupyterlab-serverπŸ“2.28.0🌱 Seedling

A set of server components for JupyterLab and JupyterLab like applications.

notebookπŸ“7.5.5🌱 Seedling

Jupyter Notebook - A web-based notebook environment for interactive computing

django-model-utilsπŸ“5.0.0🌱 Seedling

Django model mixins and utilities

dunamaiπŸ“1.26.1🌱 Seedling

Dynamic version generation