freshcrate — Search

Search results for "dataset"

80 results found

phoenix 📁arize-phoenix-v14.9.1🌳 Mature⭐9,209

AI Observability & Evaluation

agents ai-monitoring ai-observability aiengineering anthropic datasets evals jupyter notebook langchain prompt-engineeringby Arize-aiJupyter Notebook

mcp-marketplace 📁0.0.0🌱 Seedling⭐31

OpenSource MCP Marketplace | MCP Servers Tools Meta Dataset | Web API | Web Client Integration

mcp mcp-client mcp-marketplace mcp-server pythonby aiagenta2zPython

CodeGen 📁0.0.0🌳 Mature⭐773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

pythonby facebookresearchPython

openclaw-engram 📁v9.3.142🌿 Growing⭐54

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction, plain markdown storage, hybrid search via QMD. Gives agents persistent long-term memory across conversations.

ai-agent ai-memory conversational-ai engram knowledge-graph llm local-first long-term-memory typescriptby joshuaswarrenTypeScript

agentmemory 📁v0.9.1🌳 Mature⭐738

Persistent memory for AI coding agents

typescriptby rohitg00TypeScript

RAGHub 📁main@2026-04-17🌳 Mature⭐1,712

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

ai artificial-intelligence large-language-models llm machine-learning natural-language-processing nlp open-sourceby Andrew-Jang

agentic-memory 📁0.0.0🌿 Growing⭐162

No description

by lhl

LLM-Agents-Ecosystem-Handbook 📁0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops pythonby oxbshwPython

LRAT 📁0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

agent agentic llm python searchby Yuqi-ZhouPython

cactus 📁0.0.0🌿 Growing⭐50

LLM Agent that leverages cheminformatics tools to provide informed responses.

cheminformatics chemistry foundation-models jupyter notebook llm llm-agent nlp scienceby pnnlJupyter Notebook

RAGMeUp 📁scala-ui🌳 Mature⭐675

Generic rag framework to apply the power of LLMs on any given dataset

javascriptby SensAI-PTJavaScript

GEA 📁0.0.0🌱 Seedling⭐23

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

code-generation group-evolving-agents open-ended-evolution open-endedness python research-agents self-evolving-agentsby eric-ai-labPython

sage 📁0.0.0🌿 Growing⭐244

Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

pythonby NVlabsPython

openlit 📁openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

ai-observability amd-gpu clickhouse distributed-tracing genai gpu-monitoring grafana langchain pythonby openlitPython

cognee 📁v1.0.1🌿 Growing⭐15,104

Knowledge Engine for AI Agent Memory in 6 lines of code

ai ai-agents ai-memory cognitive-architecture cognitive-memory context-engineering contributions-welcome good-first-issue pythonby topoteretesPython

capsule 📁v0.8.8🌿 Growing⭐276

A secure, durable runtime to sandbox AI agent tasks. Run untrusted code in isolated WebAssembly environments.

agentic-workflow ai-agents code-execution code-interpreter javascript llm python rustby mavdolRust

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

semiont 📁v0.4.20🌱 Seedling⭐44

Semiont supports human+ai collaborative knowledge work. Use it as: a Wiki, Semantic Layer, Context Graph, Knowledge Base, Annotator, Research Tool, or Agentic Memory...

ai annotation knowledge typescript wikiby The-AI-AllianceTypeScript

Awesome-Context-Engineering 📁0.0.0🌳 Mature⭐3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

agent agentic-ai agi awesome-list cognitive-science context-engineering llm ragby Meirtz

AgentLint 📁v0.8.5🌱 Seedling⭐12

Lint your repo for AI agent compatibility.

agentic agents-md ai-agent ai-friendly ai-tools anthropic claude-code claude-code-plugin javascriptby 0xmariowuJavaScript

langwatch 📁skills@v0.3.0🌿 Growing⭐3,193

The platform for LLM evaluations and AI agent testing

ai analytics datasets dspy evaluation gpt llm llm-ops typescriptby langwatchTypeScript

arthur-engine 📁2.1.529🌿 Growing⭐75

Make AI work for Everyone - Monitoring and governing for your AI/ML

agentic benchmarking evaluation genai guardrails llm ml monitoring pythonby arthur-aiPython

ISC-Bench 📁v0.0.5🌿 Growing⭐786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

adversarial-attacks agent-safety ai-safety benchmark frontier-models jailbreak large-language-models llm-safety pythonby wuyoscarPython

synaptic-memory 📁v0.16.0🌱 Seedling⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

ai-agent embedding graph-database hebbian-learning knowledge-graph llm mcp mcp-server pythonby PlateerLabPython

datagouv-mcp 📁v0.2.23🌿 Growing⭐1,216

Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.

mcp mcp-server open-data opendata pythonby datagouvPython

arag 📁v0.1.0🌿 Growing⭐247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

agent agentic-ai agenticrag deepresearch evaluation graphrag llm llmagents pythonby Ayanami0730Python

memind 📁main@2026-04-21🌿 Growing⭐360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

ai ai-agent ai-agents ai-memory context-engineering java memory openclawby openmemindJava

Awesome-World-Models 📁main@2026-04-21🌿 Growing⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

artificial-intelligence autonomous-driving awesome deep-learning embodied-ai future-prediction video-prediction world-modelby leofan90

memory_agent_hub 📁main@2026-04-20🌱 Seedling⭐38

2026 swarm Agent 年，swarm Agent 、Agent team、 ai coding、skill、memory、evolve、agentic RL 等 AI Agent集合

ai-memory elasticsearch graphrag jupyter notebook knowledge-graph llm-agent milvus neo4j rag-technologyby 1850298154Jupyter Notebook

awesome-code-agents 📁main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

pythonby EuniAIPython

auto-deep-researcher-24x7 📁main@2026-04-19🌿 Growing⭐261

🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

ai-agent autonomous-agent claude-code deep-learning experiment-automation gpu hyperparameter-tuning llm-agent pythonby Xiangyue-ZhangPython

AGI-Alpha-Agent-v0 📁main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

agentic agentic-ai agentic-framework ai aiagent aiagents llm meta-agentic pythonby MontrealAIPython

medusa 📁v2026.5.5🌿 Growing⭐252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

agent-security ai-security code-analysis cve-detection devsecops llm-security mcp nextjs pythonby Pantheon-SecurityPython

biomcp 📁v0.8.21🌿 Growing⭐488

BioMCP: Biomedical Model Context Protocol

ai bioinformatics clinical-trials genomics llm mcp mcp-server medical rustby genomoncologyRust

AReaL 📁v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rlby inclusionAIPython

Awesome-Agent-Memory 📁main@2026-04-16🌿 Growing⭐333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

agent-memory ai-agent ai-agent-memory awesome-agent-memory llm-memory memory memory-management multimodal-llm-memoryby TeleAI-UAGI

paiml-mcp-agent-toolkit 📁v3.14.0🌿 Growing⭐148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

agentic c deno kotlin mcp mcp-server paiml paiml-active-tool rustby paimlRust

claw-eval 📁main@2026-04-15🌿 Growing⭐394

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

agent harness llm openclaw pythonby claw-evalPython

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

octocode 📁0.14.0🌿 Growing⭐319

Semantic code searcher and codebase utility

ai ai-tools cli cli-app code-search developer-tool developer-tools doc-search mcp-server rustby MuvonRust

EvoScientist 📁v0.0.7🌿 Growing⭐2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

ai-agent ai4science multi-agent-system python vibe-researchby EvoScientistPython

next-plaid 📁v1.2.0🌿 Growing⭐331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

agentic-rag cli grep multi-vector rust vector-databaseby lightonaiRust

Awesome-Repo-Level-Code-Generation 📁main@2026-04-10🌿 Growing⭐274

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

ai4se automated-software-engineering code-generation large-language-models llm software-engineeringby YerbaPage

ds_ex 📁main@2026-04-09🌱 Seedling⭐17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

ai ai-framework automated-optimization beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

UltraRAG 📁v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

deepseek demo easy embedding flask gpt huggingface-transformers llm pythonby OpenBMBPython

DSPex 📁main@2026-04-09🌱 Seedling⭐17

Declarative Self Improving Elixir - DSPy Orchestration in Elixir

ai ai-framework autonomous-systems beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

agent-arch 📁main@2026-04-21🌱 Seedling⭐10

No description

agentic-ai agentic-workflow agents ai ai-architect ai-architecture ai-architecture-compliance architecture llm-agent pythonby agent-axiomPython

deepeval 📁v3.9.5🌳 Mature⭐14,701

The LLM Evaluation Framework

evaluation-framework evaluation-metrics llm-evaluation llm-evaluation-framework llm-evaluation-metrics pythonby confident-aiPython

db-mcp-server 📁v1.9.0🌱 Seedling⭐362

A powerful multi-database server implementing the Model Context Protocol (MCP) to provide AI assistants with structured access to databases.

database-mcp-server go mcp-serverby FreePeakGo

typeahead-kmp 📁2.0.4🌱 Seedling⭐9

A lock-free, in-memory fuzzy search engine for Kotlin Multiplatform. L2-normalized sparse vector embeddings with O(1) cosine similarity — handles typos, transpositions, and blind continuation. Zero-al

autocomplete concurrent coroutines cosine-similarity embeddings fuzzy-matching fuzzy-search in-memory kotlin vector-databaseby karlotiKotlin

kernel 📁v3.97.0🌱 Seedling⭐12

kbot — the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

ai-agent anthropic cli coding-agent cybersecurity defi kbot llm typescriptby isaacsightTypeScript

AutoRAG 📁v0.3.22🌱 Seedling⭐4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

analysis automl benchmarking document-parser embeddings evaluation llm llm-evaluation pythonby Marker-Inc-KoreaPython

tensorzero 📁2026.4.0🌱 Seedling⭐11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt rustby tensorzeroRust

spiceai 📁v1.11.5🌱 Seedling⭐2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

artificial-intelligence data data-federation developers full-text-search infrastructure llm-inference machine-learning rustby spiceaiRust

vikramaditya 📁main@2026-04-20🌱 Seedling⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.

ai-security autonomous-agent bash bug-bounty penetration-testing python recon securityby venkatasPython

neurostack 📁v0.11.1🌱 Seedling⭐40

Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal

ai-memory knowledge-graph local-ai local-first markdown mcp mcp-server neuroscience pythonby raphasouthallPython

RAG-Anything 📁v1.2.10🌱 Seedling⭐15,557

"RAG-Anything: All-in-One RAG Framework"

multi-modal-rag python retrieval-augmented-generationby HKUDSPython

claude-codex-settings 📁v2.3.0🌱 Seedling⭐587

My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.

ai-agents ai-tools claude-ai claude-code claude-code-plugin claude-skills claudecode claudecode-config pythonby fcakyonPython

fast-plaid 📁1.4.5🌱 Seedling⭐239

High-Performance Engine for Multi-Vector Search

colbert colpali information-retrieval python rust vector-databaseby lightonaiPython

invariant-gateway 📁0.0.0🌱 Seedling⭐69

LLM proxy to observe and debug what your AI agents are doing.

ai-agents debugging guardrails llm observability proxy pythonby invariantlabs-aiPython

camel 📁v0.2.90🌱 Seedling⭐16,654

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

agent ai-societies artificial-intelligence communicative-ai cooperative-ai deep-learning large-language-models multi-agent-systems pythonby camel-aiPython

membrane 📁v0.2.0🌱 Seedling⭐75

A selective learning and memory substrate for agentic systems — typed, revisable, decayable memory with competence learning and trust-aware retrieval.

agent agent-framework agent-memory agent-skills agentic ai-agents autonomous-agents collaborate goby GustyCubeGo

mcp-use 📁python-v1.7.0🌱 Seedling⭐9,760

The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.

agentic-framework ai apps-sdk chatgpt claude-code llms mcp mcp-apps typescriptby mcp-useTypeScript

bisheng 📁v2.3.0🌱 Seedling⭐11,293

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF

agent ai chatbot enterprise finetune genai gpt langchian typescriptby dataelementTypeScript

php-sdk 📁v0.4.0🌱 Seedling⭐1,440

The official PHP SDK for Model Context Protocol servers and clients. Maintained in collaboration with The PHP Foundation.

phpby modelcontextprotocolPHP

mcp-statcan 📁main@2026-04-12🌱 Seedling⭐3

A MCP server to use StatCAN data

data-analysis mcp mcp-server pythonby Aryan-JhaveriPython

VectorDBBench 📁v1.0.20🌱 Seedling⭐1,068

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordbby zilliztechPython

OriginDL 📁v1.0.0🌱 Seedling⭐245

Implement a Pytorch-like DL library in C++ from scratch, step by step

ai-framework ai-infra c++cuda deeplearning pytorch yoloby jinbooooomC++

eywa 📁main@2026-04-21🌱 Seedling⭐1

🧠 Capture and manage your team's knowledge effortlessly with Eywa, ensuring no valuable memory is ever lost.

chatbot chatui datasets elasticsearch embeddings gemini-pro graphql iam rust vector-databaseby nans28Rust

ragas 📁v0.4.3🌱 Seedling⭐13,329

Supercharge Your LLM Application Evaluations 🚀

evaluation llm llmops pythonby explodinggradientsPython

ai-dataset-generator 📁main@2026-04-21🌱 Seedling⭐1

🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.

ai code-generation codebert data-poisoning-attacks dataset dataset-generation finetune-gpt gpt4o pythonby bosszii2709Python

fluid 📁v1.0.8🌱 Seedling⭐1,908

Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)

ai-framework alluxio big-data data-abstraction distributed-cache go kubernetesby fluid-cloudnativeGo

inAI-wiki 📁v0.1.0💤 Dormant⭐50

🌍 The open-source Wikipedia of AI — 2M+ apps, agents, LLMs & datasets. Updated daily with tools, tutorials & news.

agents ai aitools artificial-intelligence chrome-extensions database dataset llm mcpby inai-sandy

mcp-brunella-core 📁main@2026-04-20🌱 Seedling⭐1

BRUNELLA AGENT SYSTEM (BAS) – A JÖVŐ DIGITÁLIS SZERVEZETE

ai c++claude cloudflare copilot enterprise gemini i-love-agents lama vector-databaseby pohi99999C++

LettuceDetect 📁0.1.8💤 Dormant⭐545

Lightweight hallucination detection framework for RAG applications

bert hallucination-detection hallucination-evaluation information-extraction nlp python pytorch token-classificationby KRLabsOrgPython

HealthFlow 📁datasets💤 Dormant⭐40

HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research

ai-for-healthcare ai-for-science ehr llm llm-agent multi-agent pythonby yhzhu99Python

RagaAI-Catalyst 📁v2.2.4💤 Dormant⭐16,130

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

agentic-ai agentic-ai-development agentneo agents ai-agent-monitoring ai-application-debugging ai-evaluation-tools ai-performance-optimization pythonby raga-ai-hubPython

mcp-bigquery-server 📁v1.0.3💤 Dormant⭐136

A Model Context Protocol (MCP) server that provides secure, read-only access to BigQuery datasets. Enables Large Language Models (LLMs) to safely query and analyze data through a standardized interfac

bigquery google-cloud mcp mcp-servers model-context-protocol sql typescriptby ergutTypeScript

judge0 📁v1.13.1⚰️ Archived⭐4,082

Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.

ai-agent-tools ai-agents ai-tools code-execution code-executor code-runner competitive-programming html online-compilerby judge0HTML

medicalAI 📁v1.2.9-rc⚰️ Archived⭐21

Medical-AI is a AI framework specifically for Medical Applications https://aibharata.github.io/medicalAI/

ai-framework keras medical-applications medical-imaging pdf-report prediction python tensorflow tensorflow2by aibharataPython