Search results for "dataset"
OpenSource MCP Marketplace | MCP Servers Tools Meta Dataset | Web API | Web Client Integration
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI
"RAG-Anything: All-in-One RAG Framework"
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
High-Performance Engine for Multi-Vector Search
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. ππ» Integrates with 50+ LLM Providers,
Knowledge Engine for AI Agent Memory in 6 lines of code
Make AI work for Everyone - Monitoring and governing for your AI/ML
Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.
Benchmark for vector databases.
A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
Supercharge Your LLM Application Evaluations π
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding β they're redefining how software changes the world.
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
π₯ An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.
METAβAGENTIC Ξ±βAGI ποΈβ¨ β Mission π― Endβtoβend: Identify π β OutβLearn π β OutβThink π§ β OutβDesign π¨ β OutβStrategise βοΈ β OutβExecute β‘
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.
Unified framework for building enterprise RAG pipelines with small, specialized models
π¬ Harness Vibe Research with Self-evolving AI Scientists
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Multi-agent swing trading system β automated screening, research, and execution with backtesting and live trading
Continuous prompt optimization for AI applications. Collect feedback, auto-optimize with DSPy, deliver as reviewable PRs.
No description
The LLM Evaluation Framework
Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) β it hunts, it reports. Inspired by the Obsidian Order.
Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal
LLM proxy to observe and debug what your AI agents are doing.
Lightweight hallucination detection framework for RAG applications
Control robots and physical hardware with natural language through Strands Agents.
A MCP server to use StatCAN data
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal
π€ Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.
HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research
A framework for optimizing textual system components (AI prompts, code snippets, etc.) using LLM-based reflection and Pareto-efficient evolutionary search.
Official Python package for working with the Roboflow API
A record linkage toolkit for linking and deduplication
A tool to determine the content type of a file with deep learning
Fiona reads and writes spatial data files
A light-weight and flexible data validation and testing tool for statistical data objects.
dlt is an open-source python-first scalable data loading library that does not require any backend to run.
Toolbox for imbalanced dataset in machine learning
No description
Medical-AI is a AI framework specifically for Medical Applications https://aibharata.github.io/medicalAI/
