freshcrate

Search results for "dataset"

60 results found (Python)
mcp-marketplace 0.0.0 🌱 Seedling ⭐31

OpenSource MCP Marketplace | MCP Servers Tools Meta Dataset | Web API | Web Client Integration

CodeGen 0.0.0 🌳 Mature ⭐773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

claude-code-plugins-plus-skills v4.26.0 🌳 Mature ⭐1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

camel v0.2.91a1 🏛️ Flagship ⭐16,753

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

LLM-Agents-Ecosystem-Handbook 0.0.0 🌳 Mature ⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

LRAT 0.0.0 🌱 Seedling ⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

GEA 0.0.0 🌱 Seedling ⭐23

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

AutoRAG v0.3.22 🌳 Mature ⭐4,712

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

sage 0.0.0 🌿 Growing ⭐244

Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

RAG-Anything v1.2.10 🏛️ Flagship ⭐16,761

"RAG-Anything: All-in-One RAG Framework"

claude-codex-settings v2.3.0 🌳 Mature ⭐623

My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.

fast-plaid 1.4.5 🌿 Growing ⭐245

High-Performance Engine for Multi-Vector Search

openlit openlit-1.18.1 🌿 Growing ⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

arthur-engine 2.1.529 🌿 Growing ⭐75

Make AI work for Everyone - Monitoring and governance for your AI/ML

ISC-Bench v0.0.5 🌿 Growing ⭐786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

synaptic-memory v0.16.0 🌱 Seedling ⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

datagouv-mcp v0.2.23 🌿 Growing ⭐1,216

Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.

arag v0.1.0 🌿 Growing ⭐247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

ragas v0.4.3 🌳 Mature ⭐13,569

Supercharge Your LLM Application Evaluations 🚀

awesome-code-agents main@2026-04-20 🌿 Growing ⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding, they're redefining how software changes the world.

GTA v0.2.0 🌿 Growing ⭐143

[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2

auto-deep-researcher-24x7 main@2026-04-19 🌿 Growing ⭐261

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

AGI-Alpha-Agent-v0 main@2026-04-18 🌿 Growing ⭐283

META‑AGENTIC α‑AGI 👁️✨ - Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

medusa v2026.5.5 🌿 Growing ⭐252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

AReaL v1.0.3 🌿 Growing ⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

claw-eval main@2026-04-15 🌿 Growing ⭐394

Claw-Eval is an evaluation harness for LLMs acting as agents. All tasks are verified by humans.

llmware v0.4.6 🌿 Growing ⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

EvoScientist v0.0.7 🌿 Growing ⭐2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

UltraRAG v0.3.0.2 🌿 Growing ⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

swing-trading-agent 0.0.0 🌱 Seedling ⭐7

Multi-agent swing trading system: automated screening, research, and execution with backtesting and live trading

kaizen 0.0.0 🌱 Seedling ⭐6

Continuous prompt optimization for AI applications. Collect feedback, auto-optimize with DSPy, deliver as reviewable PRs.

vikramaditya main@2026-04-20 🌱 Seedling ⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR); it hunts, it reports. Inspired by the Obsidian Order.

neurostack v0.11.1 🌱 Seedling ⭐40

Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal

invariant-gateway 0.0.0 🌱 Seedling ⭐69

LLM proxy to observe and debug what your AI agents are doing.

LettuceDetect 0.1.8 💀 Dormant ⭐565

Lightweight hallucination detection framework for RAG applications

robots v0.3.8 🌱 Seedling ⭐44

Control robots and physical hardware with natural language through Strands Agents.

mcp-statcan main@2026-04-12 🌱 Seedling ⭐3

An MCP server to use StatCAN data

RagaAI-Catalyst v2.2.4 💀 Dormant ⭐16,141

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

ai-dataset-generator main@2026-04-21 🌱 Seedling ⭐1

🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.

HealthFlow datasets 💀 Dormant ⭐40

HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research

pycocotools 2.0.11 🌱 Seedling

Official APIs for the MS-COCO dataset

gepa 0.1.1 🌱 Seedling

A framework for optimizing textual system components (AI prompts, code snippets, etc.) using LLM-based reflection and Pareto-efficient evolutionary search.

roboflow 1.3.3 🌱 Seedling

Official Python package for working with the Roboflow API

recordlinkage 0.16 🌱 Seedling

A record linkage toolkit for linking and deduplication

magika 1.0.2 🌱 Seedling

A tool to determine the content type of a file with deep learning

fiona 1.10.1 🌱 Seedling

Fiona reads and writes spatial data files

pandera 0.31.1 🌱 Seedling

A light-weight and flexible data validation and testing tool for statistical data objects.

dlt 1.25.0 🌱 Seedling

dlt is an open-source python-first scalable data loading library that does not require any backend to run.

imbalanced-learn 0.14.1 🌱 Seedling

Toolbox for imbalanced datasets in machine learning

keras 3.14.0 🌱 Seedling

Multi-backend Keras

spacy-loggers 1.0.5 🌱 Seedling

Logging utilities for SpaCy

ua-parser-builtins 202603 🌱 Seedling

Precompiled rules for User Agent Parser

awswrangler 3.16.0 🌱 Seedling

Pandas on AWS.

google-cloud-aiplatform 1.148.1 🌱 Seedling

Vertex AI API client library

medicalAI v1.2.9-rc ⚰️ Archived ⭐21

Medical-AI is an AI framework specifically for Medical Applications https://aibharata.github.io/medicalAI/