freshcrate — Search

Search results for "datasets"

83 results found

phoenix 📁arize-phoenix-v14.9.1🌳 Mature⭐9,209

AI Observability & Evaluation

agents ai-monitoring ai-observability aiengineering anthropic datasets evals jupyter notebook langchain prompt-engineeringby Arize-aiJupyter Notebook

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability pythonby comet-mlPython

llm-rl-environments-lil-course 📁main@2026-04-17🌿 Growing⭐57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

course grpo language-models llm llm-agent python reinforcement-learning reinforcement-learning-environments rlvrby anakin87Python

RAGHub 📁main@2026-04-17🌳 Mature⭐1,712

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

ai artificial-intelligence large-language-models llm machine-learning natural-language-processing nlp open-sourceby Andrew-Jang

Constrained-Text-Generation-Studio 📁0.0.0🌿 Growing⭐216

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

pythonby HellisotherpeoplePython

LLM-Agents-Ecosystem-Handbook 📁0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops pythonby oxbshwPython

LRAT 📁0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

agent agentic llm python searchby Yuqi-ZhouPython

langfuse 📁v3.169.0🌿 Growing⭐24,578

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

analytics autogen evaluation langchain large-language-models llama-index llm llm-evaluation prompt-engineering typescriptby langfuseTypeScript

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

CodeGen 📁0.0.0🌳 Mature⭐773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

pythonby facebookresearchPython

langwatch 📁skills@v0.3.0🌿 Growing⭐3,193

The platform for LLM evaluations and AI agent testing

ai analytics datasets dspy evaluation gpt llm llm-ops typescriptby langwatchTypeScript

latitude-llm 📁claude-code-telemetry-0.0.5🌿 Growing⭐3,955

Latitude is the open-source agent engineering platform

typescriptby latitude-devTypeScript

Agentic-RAG-R1 📁0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

agentic grpo python rag rlby jiangxinkePython

holmesgpt 📁0.24.4🌿 Growing⭐2,193

SRE Agent - CNCF Sandbox Project

aiops chatbot chatops devops devops-tools incident incident-management incident-response llm-agent pythonby HolmesGPTPython

ICMP-Ghost-A-Fileless-x64-Assembly-C2-Agent 📁v3.6.2🌿 Growing⭐163

Fileless C2 agent written in pure x64 Assembly for Linux. Features stealth ICMP tunneling, memory-only execution via memfd_create, and terminal-independent daemonization.

assembly bypassing c2-framework cybersecurity evasion fileless icmp-tunnel linux-kernelby JM00NJAssembly

chinese-llm-benchmark 📁v5.9🌿 Growing⭐5,841

ReLE评测：中文AI大模型能力评测（持续更新）：目前已囊括359个大模型，覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3-max、qwen3.5-plus、百川、讯飞星火、商汤senseChat等商用模型，以及step3.5-flash、kimi-k2.5、ernie4.5、Min

agentic-ai artificial-intelligence llm-agent llm-evaluationby jeinlee1991

arthur-engine 📁2.1.529🌿 Growing⭐75

Make AI work for Everyone - Monitoring and governing for your AI/ML

agentic benchmarking evaluation genai guardrails llm ml monitoring pythonby arthur-aiPython

synaptic-memory 📁v0.16.0🌱 Seedling⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

ai-agent embedding graph-database hebbian-learning knowledge-graph llm mcp mcp-server pythonby PlateerLabPython

datagouv-mcp 📁v0.2.23🌿 Growing⭐1,216

Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.

mcp mcp-server open-data opendata pythonby datagouvPython

arag 📁v0.1.0🌿 Growing⭐247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

agent agentic-ai agenticrag deepresearch evaluation graphrag llm llmagents pythonby Ayanami0730Python

vector-io 📁0.0.0🌱 Seedling⭐266

Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data fro

chromadb data-backup data-exploration-and-preprocessing data-export data-import datastax huggingface huggingface-datasets jupyter notebookby AI-Northstar-TechJupyter Notebook

memind 📁main@2026-04-21🌿 Growing⭐360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

ai ai-agent ai-agents ai-memory context-engineering java memory openclawby openmemindJava

Awesome-World-Models 📁main@2026-04-21🌿 Growing⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

artificial-intelligence autonomous-driving awesome deep-learning embodied-ai future-prediction video-prediction world-modelby leofan90

awesome-code-agents 📁main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

pythonby EuniAIPython

excalibase-graphql 📁main@2026-04-19🌱 Seedling⭐31

Excalibase GraphQL instantly turns your database into a GraphQL API. Built with Spring Boot, it supports schema discovery, subscriptions, and type handling — no manual resolvers needed.

api api-automation api-generator code-generation database-to-graphql developer-tools graphql graphql-server javaby excalibaseJava

skills-vote 📁main@2026-04-19🌱 Seedling⭐31

The Next-Gen Agent-Native Skill Recommendation Engine

agent-skill agent-skills llm llm-agent pythonby MemTensorPython

OmicsClaw 📁main@2026-04-18🌿 Growing⭐116

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

bioinformatics knowledge-graph llm-agent multi-agents multi-omics python single-cell spatial-transcriptomicsby TianGzlabPython

biomcp 📁v0.8.21🌿 Growing⭐488

BioMCP: Biomedical Model Context Protocol

ai bioinformatics clinical-trials genomics llm mcp mcp-server medical rustby genomoncologyRust

OpenClawProBench 📁main@2026-04-15🌿 Growing⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

agent benchmark evaluation harness leaderboard llm openclaw pythonby suyoumoPython

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

cognitive-dissonance-dspy 📁main@2026-04-14🌿 Growing⭐276

A multi-agent LLM system for detecting and resolving cognitive dissonance.

pythonby evalopsPython

rag-chatbot 📁main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

chatbot chromadb gpu lamacpp llama3 llm python qwen3-5 ragby umbertogriffoPython

awesome-vector-database 📁main@2026-04-13🌿 Growing⭐341

A curated list of awesome works related to high dimensional structure/vector search & database

approximate-nearest-neighbor-search embedding-similarity embeddings-similarity nearest-neighbor-search search-engine similarity-search vector-database vector-searchby dangkhoasdc

next-plaid 📁v1.2.0🌿 Growing⭐331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

agentic-rag cli grep multi-vector rust vector-databaseby lightonaiRust

Awesome-Repo-Level-Code-Generation 📁main@2026-04-10🌿 Growing⭐274

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

ai4se automated-software-engineering code-generation large-language-models llm software-engineeringby YerbaPage

ds_ex 📁main@2026-04-09🌱 Seedling⭐17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

ai ai-framework automated-optimization beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

UltraRAG 📁v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

deepseek demo easy embedding flask gpt huggingface-transformers llm pythonby OpenBMBPython

Suganthans-BigQuery-MCP-Server 📁0.0.0🌱 Seedling⭐25

BigQuery MCP server for Claude — query any BigQuery dataset in natural language, with built-in SEO analysis tools for GSC bulk export data

typescriptby Suganthan-MohanadasanTypeScript

swing-trading-agent 📁0.0.0🌱 Seedling⭐7

Multi-agent swing trading system — automated screening, research, and execution with backtesting and live trading

ai-agent algorithmic-trading backtesting llm-agent multi-agent paper-trading python quantitative-finance stock-marketby kevmyungPython

deepeval 📁v3.9.5🌳 Mature⭐14,701

The LLM Evaluation Framework

evaluation-framework evaluation-metrics llm-evaluation llm-evaluation-framework llm-evaluation-metrics pythonby confident-aiPython

agent-skills-standard 📁php-v1.3.2🌱 Seedling⭐391

A collection of Agent Skills Standard and Best Practice for Programming Languages, Frameworks that help our AI Agent follow best practies on frameworks and programming laguages

agent-agentic-ai android angular best-practices coding-standards cursor-rules flutter typescriptby HoangNguyen0403TypeScript

cortex-hub 📁v0.7.0🌱 Seedling⭐48

Self-hosted AI Agent Memory + Code Intelligence Platform — one MCP endpoint for persistent memory, AST-aware code search, shared knowledge, and quality enforcement across all your AI coding agents.

ai-agents claude-code code-intelligence cursor developer-tools docker knowledge-base mcp typescriptby lktiepTypeScript

pinecone-ts-client 📁v7.2.0🌱 Seedling⭐269

The official TypeScript/Node client for the Pinecone vector database

llm pinecone semantic-search similarity-search typescript vector-databaseby pinecone-ioTypeScript

sqlite-vector 📁0.9.95🌱 Seedling⭐832

SQLite-Vector is a cross-platform, ultra-efficient SQLite extension that brings vector search capabilities to your embedded database.

cby sqliteaiC

VecturaKit 📁5.3.0🌱 Seedling⭐280

Swift-based vector database for on-device RAG using MLTensor and MLX Embedders

mlx-swift rag swiftby rryamSwift

jupyter-mcp-server 📁v1.0.0🌱 Seedling⭐1,025

🪐 🔧 Model Context Protocol (MCP) Server for Jupyter.

ai jupyter mcp mcp-server python toolsby datalayerPython

instructor 📁v1.15.1🌱 Seedling⭐12,743

structured outputs for llms

openai openai-function-calli openai-functions pydantic-v2 python validationby jxnlPython

tensorzero 📁2026.4.0🌱 Seedling⭐11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt rustby tensorzeroRust

mcp-devkit-server 📁v0.6.0🌱 Seedling⭐46

Developer-focused Mapbox MCP Server

typescriptby mapboxTypeScript

spiceai 📁v1.11.5🌱 Seedling⭐2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

artificial-intelligence data data-federation developers full-text-search infrastructure llm-inference machine-learning rustby spiceaiRust

infinity 📁v0.7.0-dev5🌱 Seedling⭐4,476

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

ai-native approximate-nearest-neighbor-search bm25 c++cpp20 cpp20-modules embedding full-text-search hnswby infiniflowC++

RAG-Anything 📁v1.2.10🌱 Seedling⭐15,557

"RAG-Anything: All-in-One RAG Framework"

multi-modal-rag python retrieval-augmented-generationby HKUDSPython

fast-plaid 📁1.4.5🌱 Seedling⭐239

High-Performance Engine for Multi-Vector Search

colbert colpali information-retrieval python rust vector-databaseby lightonaiPython

camel 📁v0.2.90🌱 Seedling⭐16,654

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

agent ai-societies artificial-intelligence communicative-ai cooperative-ai deep-learning large-language-models multi-agent-systems pythonby camel-aiPython

arcadedb 📁26.3.2🌱 Seedling⭐793

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vecto

arcadedb database dbms distributed docker document embedded graph javaby ArcadeDataJava

edsl 📁wasm-wheel🌱 Seedling⭐454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

anthropic data-labeling deepinfra domain-specific-language experiments llama2 llm llm-agent pythonby expectedparrotPython

synapse-ai 📁v1.0.0🌱 Seedling⭐1

Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM — local or cloud.

custom-agents custom-tools directed-acyclic-graph llm-agent mcp-client mcp-server mcp-servers mcp-tools pythonby naveenraj-17Python

awesome-agent-benchmarks 📁master@2026-04-21🌱 Seedling⭐3

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

agent-based-modeling agent-benchmark agentic agentic-ai ai ai-agent ai-models awesomeby axxafo

VectorDBBench 📁v1.0.20🌱 Seedling⭐1,068

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordbby zilliztechPython

ragflow 📁v0.24.0🌱 Seedling⭐77,784

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

agent agentic agentic-ai agentic-workflow ai context-engineering context-retrieval deep-research pythonby infiniflowPython

eywa 📁main@2026-04-21🌱 Seedling⭐1

🧠 Capture and manage your team's knowledge effortlessly with Eywa, ensuring no valuable memory is ever lost.

chatbot chatui datasets elasticsearch embeddings gemini-pro graphql iam rust vector-databaseby nans28Rust

axon 📁main@2026-04-21🌱 Seedling⭐2

Enable autonomous AI workflows with a local-first, zero-trust Rust framework for high-performance multi-agent orchestration and deterministic execution.

agents ai anthropic-api axon-framework code-analysis cost-management deepseek domain-driven-design mcp rustby RandallRORust

autonomous-agentic-research-swarm 📁main@2026-04-11🌱 Seedling⭐4

File-based autonomous agentic research swarm template (Planner/Worker/Judge) with contracts, workstreams, and deterministic quality gates.

agentic automation claude codex git-worktrees html reproducible-research research swarmby AysajanEHTML

ragas 📁v0.4.3🌱 Seedling⭐13,329

Supercharge Your LLM Application Evaluations 🚀

evaluation llm llmops pythonby explodinggradientsPython

ai-dataset-generator 📁main@2026-04-21🌱 Seedling⭐1

🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.

ai code-generation codebert data-poisoning-attacks dataset dataset-generation finetune-gpt gpt4o pythonby bosszii2709Python

modular-image-classification-framework 📁main@2026-04-20🌱 Seedling⭐1

A modular deep learning framework for training and evaluating image classification models on datasets like CIFAR-10 and MNIST. Supports configurable CNN architectures, automated training, and performa

ai-framework cnn computer-vision deep-learning keras machine-learning ml-classification ml-tools pythonby RafiShaik-AIPython

fluid 📁v1.0.8🌱 Seedling⭐1,908

Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)

ai-framework alluxio big-data data-abstraction distributed-cache go kubernetesby fluid-cloudnativeGo

inAI-wiki 📁v0.1.0💤 Dormant⭐50

🌍 The open-source Wikipedia of AI — 2M+ apps, agents, LLMs & datasets. Updated daily with tools, tutorials & news.

agents ai aitools artificial-intelligence chrome-extensions database dataset llm mcpby inai-sandy

vector-cache-optimizer 📁base-setup@2026-04-21🌱 Seedling⭐1

⚡ Optimize vector searches with a hyper-efficient cache that uses machine learning for faster, smarter data access and reduced costs.

ai ai-assisted backend compiler database distributed-systems mathematics matrix-multiplication python vector-databaseby Ronakagrwal000Python

langgraph-rag-assistant 📁main@2026-04-21🌱 Seedling⭐1

🚀 Build an enterprise-ready RAG system to enhance technical documentation querying with LangGraph and multi-step reasoning workflows.

ai bigdata chromadb deepseek-r1 echarts fastapi lamaindex langgraph python vector-databaseby deepashreesivaPython

tokenizer 📁1.0.0💤 Dormant⭐6

High-Performance Tokenizer implementation in PHP.

agentic-framework agentic-workflow ai ai-agents ai-framework bpe-tokenizer llm phpby neuron-corePHP

roboflow 📁1.3.3🌱 Seedling

Official Python package for working with the Roboflow API

pypiby RoboflowPython

recordlinkage 📁0.16🌱 Seedling

A record linkage toolkit for linking and deduplication

pypiby pypiPython

azure-ai-projects2.1.0🌱 Seedling

Microsoft Corporation Azure AI Projects Client Library for Python

azure pypi sdkby pypiPython

dlt 📁1.25.0🌱 Seedling

dlt is an open-source python-first scalable data loading library that does not require any backend to run.

etl pypiby pypiPython

imbalanced-learn 📁0.14.1🌱 Seedling

Toolbox for imbalanced dataset in machine learning

pypiby pypiPython

keras 📁3.14.0🌱 Seedling

Multi-backend Keras

pypiby pypiPython

transformers 📁5.5.4🌱 Seedling

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

deep-learning llm machine-learning nlp pypi python pytorch transformer vlmby The Hugging Face team (past and future) with the help of all our contributors (https://github.com/huPython

KAG 📁v0.8.0💤 Dormant⭐8,668

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base

knowledge-graph large-language-model logical-reasoning multi-hop-question-answering python trustfulnessby OpenSPGPython

RagaAI-Catalyst 📁v2.2.4💤 Dormant⭐16,130

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

agentic-ai agentic-ai-development agentneo agents ai-agent-monitoring ai-application-debugging ai-evaluation-tools ai-performance-optimization pythonby raga-ai-hubPython

SSUI 📁0.1.1💤 Dormant⭐35

A Python-Script Based Generative AI platform

3d-models-generation ai ai-framework diffusion generative-ai generative-ai-tools image-generation-ai pythonby sunxfancyPython

mcp-bigquery-server 📁v1.0.3💤 Dormant⭐136

A Model Context Protocol (MCP) server that provides secure, read-only access to BigQuery datasets. Enables Large Language Models (LLMs) to safely query and analyze data through a standardized interfac

bigquery google-cloud mcp mcp-servers model-context-protocol sql typescriptby ergutTypeScript

dingo 📁v0.9.0⚰️ Archived⭐1,699

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and

embedding-search embedding-store hybrid-search java key-value-distributed-store mysql-compatibility real-time-semantic-search serving structured-databy dingodbJava