ragas

Supercharge Your LLM Application Evaluations 🚀

Made with Python · License: Apache-2.0

Objective metrics, intelligent test generation, and data-driven insights for LLM apps

Ragas is your ultimate toolkit for evaluating and optimizing Large Language Model (LLM) applications. Say goodbye to time-consuming, subjective assessments and hello to data-driven, efficient evaluation workflows. Don't have a test dataset ready? We also do production-aligned test set generation.

Key Features

  • 🎯 Objective Metrics: Evaluate your LLM applications with precision using both LLM-based and traditional metrics.
  • 🧪 Test Data Generation: Automatically create comprehensive test datasets covering a wide range of scenarios.
  • 🔗 Seamless Integrations: Works flawlessly with popular LLM frameworks like LangChain and major observability tools.
  • 📊 Build feedback loops: Leverage production data to continually improve your LLM applications.

🛡️ Installation

PyPI:

pip install ragas

Alternatively, from source:

pip install git+https://github.com/vibrantlabsai/ragas
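After either install path, it can help to confirm which version is active in the current environment. The following is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical and not part of Ragas.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str = "ragas") -> Optional[str]:
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(f"ragas version: {found}" if found else "ragas is not installed")
```

Returning None instead of raising keeps the check usable in setup scripts that want to decide whether to install.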

🔥 Quickstart

Clone a Complete Example Project

The fastest way to get started is to use the ragas quickstart command:

# List available templates
ragas quickstart

# Create a RAG evaluation project
ragas quickstart rag_eval

# Specify where you want to create it.
ragas quickstart rag_eval -o ./my-project

Available templates:

  • rag_eval - Evaluate RAG systems

Coming Soon:

  • agent_evals - Evaluate AI agents
  • benchmark_llm - Benchmark and compare LLMs
  • prompt_evals - Evaluate prompt variations
  • workflow_eval - Evaluate complex workflows
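If you prefer to drive the quickstart command from a Python script (for example in a project bootstrap step), the commands above can be assembled as an argument list. This is only a sketch: the helper names `quickstart_command` and `run_quickstart` are hypothetical, and the only CLI arguments used are the ones shown above (the template name and `-o`).

```python
import shutil
import subprocess
from typing import List, Optional


def quickstart_command(template: Optional[str] = None,
                       output_dir: Optional[str] = None) -> List[str]:
    """Build the argv for `ragas quickstart`, mirroring the commands shown above."""
    argv = ["ragas", "quickstart"]
    if template is not None:
        argv.append(template)          # e.g. "rag_eval"
    if output_dir is not None:
        argv += ["-o", output_dir]     # e.g. "./my-project"
    return argv


def run_quickstart(template: str, output_dir: str) -> int:
    """Run the CLI if it is on PATH; return its exit code, or -1 if `ragas` is missing."""
    if shutil.which("ragas") is None:
        return -1
    completed = subprocess.run(quickstart_command(template, output_dir), check=False)
    return completed.returncode


if __name__ == "__main__":
    # Only prints the command; call run_quickstart(...) to actually execute it.
    print(" ".join(quickstart_command("rag_eval", "./my-project")))
```

Passing a list to `subprocess.run` avoids shell quoting issues when the output path contains spaces.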

Evaluate your LLM App

ragas comes with pre-built metrics for common evaluation tasks. For example, Aspect Critique evaluates any aspect of your output using DiscreteMetric:

import asyncio
from openai import AsyncOpenAI
from ragas.metrics import DiscreteMetric
from ragas.llms import llm_factory

# Setup your LLM
client = AsyncOpenAI()
llm = llm_factory("gpt-4o", client=client)

# Create a custom aspect evaluator
metric = DiscreteMetric(
    name="summary_accuracy",
    allowed_values=["accurate", "inaccurate"],
    prompt="""Evaluate if the summary is accurate and captures key information.

Response: {response}

Answer with only 'accurate' or 'inaccurate'."""
)

# Score your application's output
async def main():
    score = await metric.ascore(
        llm=llm,
        response="The summary of the text is..."
    )
    print(f"Score: {score.value}")  # 'accurate' or 'inaccurate'
    print(f"Reason: {score.reason}")


if __name__ == "__main__":
    asyncio.run(main())
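To score several outputs instead of one, the same awaitable call can be fanned out with `asyncio.gather` and the labels tallied. The sketch below deliberately does not call Ragas or OpenAI: `FakeResult` and `fake_ascore` are hypothetical stand-ins so the pattern runs without an API key. In a real project you would replace the body of `fake_ascore` with the `metric.ascore(...)` call from the example above.

```python
import asyncio
from collections import Counter
from dataclasses import dataclass
from typing import List


@dataclass
class FakeResult:
    """Hypothetical stand-in for a score object exposing .value and .reason."""
    value: str
    reason: str


async def fake_ascore(response: str) -> FakeResult:
    """Hypothetical scorer; swap in `await metric.ascore(llm=llm, response=response)`."""
    await asyncio.sleep(0)  # yield control, as a real network call would
    if response.strip():
        return FakeResult("accurate", "non-empty response")
    return FakeResult("inaccurate", "empty response")


async def score_all(responses: List[str]) -> Counter:
    """Score every response concurrently and count how often each label occurs."""
    results = await asyncio.gather(*(fake_ascore(r) for r in responses))
    return Counter(result.value for result in results)


if __name__ == "__main__":
    counts = asyncio.run(score_all(["A short summary.", "", "Another summary."]))
    print(dict(counts))
```

A label count like this gives a quick pass rate over a test set, which is often the first number worth tracking between prompt or model changes.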

Note: Make sure your OPENAI_API_KEY environment variable is set.
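A small guard can surface a missing key before any request is made. This is a minimal sketch using only the standard library; the helper name `has_api_key` is hypothetical and not part of Ragas or the OpenAI client.

```python
import os


def has_api_key(name: str = "OPENAI_API_KEY") -> bool:
    """Return True if the environment variable `name` is set to a non-empty value."""
    return bool(os.environ.get(name, "").strip())


if __name__ == "__main__":
    if has_api_key():
        print("OPENAI_API_KEY found.")
    else:
        print("OPENAI_API_KEY is not set; export it before running the example.")
```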

Find the complete Quickstart Guide in the Ragas documentation.

Want help in improving your AI application using evals?

In the past 2 years, we have seen and helped improve many AI applications using evals. If you want help improving and scaling up your AI application using evals, get in touch.

🔗 Book a slot or drop us a line: founders@vibrantlabs.com.

🫂 Community

If you want to get more involved with Ragas, check out our Discord server. It's a fun community where we geek out about LLMs, retrieval, production issues, and more.

Contributors

+----------------------------------------------------------------------------+
|     +----------------------------------------------------------------+     |
|     | Developers: Those who built with `ragas`.                      |     |
|     | (You have `import ragas` somewhere in your project)            |     |
|     |     +----------------------------------------------------+     |     |
|     |     | Contributors: Those who make `ragas` better.       |     |     |
|     |     | (You make PR to this repo)                         |     |     |
|     |     +----------------------------------------------------+     |     |
|     +----------------------------------------------------------------+     |
+----------------------------------------------------------------------------+

We welcome contributions from the community! Whether it's bug fixes, feature additions, or documentation improvements, your input is valuable.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

🔍 Open Analytics

At Ragas, we believe in transparency. We collect minimal, anonymized usage data to improve our product and guide our development efforts.

✅ No personal or company-identifying information

✅ Open-source data collection code

✅ Publicly available aggregated data

To opt out, set the RAGAS_DO_NOT_TRACK environment variable to true.
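The variable is normally exported in the shell, but it can also be set from Python. In this sketch, setting it before `ragas` is imported is a cautious assumption rather than documented behavior, and the import itself is left commented out so the snippet runs anywhere.

```python
import os

# Set the opt-out flag first so it is already in the environment
# by the time `ragas` is imported (assumed to be the safest ordering).
os.environ["RAGAS_DO_NOT_TRACK"] = "true"

# import ragas  # uncomment in a real project

print(os.environ["RAGAS_DO_NOT_TRACK"])
```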

Cite Us

@misc{ragas2024,
  author       = {VibrantLabs},
  title        = {Ragas: Supercharge Your LLM Application Evaluations},
  year         = {2024},
  howpublished = {\url{https://github.com/vibrantlabsai/ragas}},
}

Release History

Version | Changes | Urgency | Date
v0.4.3## What's Changed * feat: add `DSPyOptimizer` with `MIPROv2` for advanced prompt optimization by @anistark in https://github.com/vibrantlabsai/ragas/pull/2537 * feat(docs): add llms.txt generation for LLM-friendly documentation by @sanjeed5 in https://github.com/vibrantlabsai/ragas/pull/2539 * feat: dspy caching by @anistark in https://github.com/vibrantlabsai/ragas/pull/2542 * feat: add system prompt support for `InstructorLLM` and `LiteLLMStructuredLLM` by @anistark in https://github.com/vibraLow1/13/2026
v0.4.2## What's Changed * feat: migrate SQLSemanticEquivalence to collections API by @anistark in https://github.com/vibrantlabsai/ragas/pull/2496 * feat: migrate DataCompyScore to collections API by @anistark in https://github.com/vibrantlabsai/ragas/pull/2499 * fix: migrate CHRF Score to new metrics collections by @anistark in https://github.com/vibrantlabsai/ragas/pull/2500 * Feat/improve rag quickstart by @anistark in https://github.com/vibrantlabsai/ragas/pull/2501 * fix: handle classification arLow12/23/2025
v0.4.1## What's Changed * feat: add save/load methods to BasePrompt by @anistark in https://github.com/vibrantlabsai/ragas/pull/2465 * docs: update run_config guide to use collections API by @sanjeed5 in https://github.com/vibrantlabsai/ragas/pull/2468 * fix: add anthropic and gemini clients for custom clients by @anistark in https://github.com/vibrantlabsai/ragas/pull/2472 * feat: migrate `ToolCallAccuracy` to collections API by @anistark in https://github.com/vibrantlabsai/ragas/pull/2476 * chore: aLow12/10/2025
v0.4.0## What's Changed * docs: complete collections API documentation for remaining metrics by @sanjeed5 in https://github.com/vibrantlabsai/ragas/pull/2420 * feat: support GPT-5 and o-series models with automatic `temperature` and `top_p` constraint handling by @anistark in https://github.com/vibrantlabsai/ragas/pull/2418 * update: add llm options as tabs to quickstart by @anistark in https://github.com/vibrantlabsai/ragas/pull/2421 * feat: migrate to instructor.from_provider for universal provider Low12/3/2025
v0.3.9## What's Changed * fix(docs): add missing line break so the step title and description a… by @nkch1k in https://github.com/explodinggradients/ragas/pull/2391 * Migrate SummaryScore by @rhlbhatnagar in https://github.com/explodinggradients/ragas/pull/2376 * feat: add metadata fields for synthetic data traceability by @dev-jonathan in https://github.com/explodinggradients/ragas/pull/2389 * Migrate noise sensitivity by @rhlbhatnagar in https://github.com/explodinggradients/ragas/pull/2379 * docs: Low11/11/2025
v0.3.8## What's Changed * feat: semantic similarity migrated to collections by @anistark in https://github.com/explodinggradients/ragas/pull/2361 * feat: Add reusable testing infrastructure for metrics migration by @jjmachan in https://github.com/explodinggradients/ragas/pull/2370 * add: console scripts for ragas_examples by @anistark in https://github.com/explodinggradients/ragas/pull/2367 * feat: add quickstart cmd with templates to run by @anistark in https://github.com/explodinggradients/ragas/pulLow10/28/2025
v0.3.7## What's Changed * refactor: improve metrics code quality by @anistark in https://github.com/explodinggradients/ragas/pull/2337 * chore: remove old analtyics by @jjmachan in https://github.com/explodinggradients/ragas/pull/2338 * Fix/query distribution robustness by @yatoyun in https://github.com/explodinggradients/ragas/pull/2340 * Simplify earlier how to guides in docs by @sanjeed5 in https://github.com/explodinggradients/ragas/pull/2319 * docs: reorganize prompt evaluation guides in navigatiLow10/14/2025
v0.3.6## What's Changed * Feature/chrf score by @kauabh in https://github.com/explodinggradients/ragas/pull/2221 * Fix/asyncio by @anistark in https://github.com/explodinggradients/ragas/pull/2294 * Fix: update simple RAG init to use embed_text(s) (docs) by @s3pi in https://github.com/explodinggradients/ragas/pull/2292 * Update _bleu_score.py by @kauabh in https://github.com/explodinggradients/ragas/pull/2297 * Refactor/update gemini to genai sdk by @sahusiddharth in https://github.com/explodinggradieLow10/3/2025
v0.3.5## What's Changed * Docs/howto-texttosqlagent by @sanjeed5 in https://github.com/explodinggradients/ragas/pull/2264 * fix: preview logo was too small. by @anistark in https://github.com/explodinggradients/ragas/pull/2277 * modified the documentation to be in sync with current output format by @kotalaraghava in https://github.com/explodinggradients/ragas/pull/2281 * removed some meta properties to test by @jjmachan in https://github.com/explodinggradients/ragas/pull/2278 * feature: improve asyncLow9/17/2025
v0.3.5rc2Release v0.3.5rc2Low9/17/2025
v0.3.5rc1## What's Changed * Docs/howto-texttosqlagent by @sanjeed5 in https://github.com/explodinggradients/ragas/pull/2264 * fix: preview logo was too small. by @anistark in https://github.com/explodinggradients/ragas/pull/2277 * modified the documentation to be in sync with current output format by @kotalaraghava in https://github.com/explodinggradients/ragas/pull/2281 * removed some meta properties to test by @jjmachan in https://github.com/explodinggradients/ragas/pull/2278 * feature: improve asyncLow9/17/2025
v0.3.4## What's Changed * Update context_precision.md by @anupamck in https://github.com/explodinggradients/ragas/pull/2262 * Reduce find_indirect_clusters() runtime through neighborhood detection and sampling by @ahgraber in https://github.com/explodinggradients/ragas/pull/2144 * fix: handle langchain multiple batching by @anistark in https://github.com/explodinggradients/ragas/pull/2257 * chore: fix the dead space in the header for docsite by @jjmachan in https://github.com/explodinggradients/ragas/Low9/10/2025
v0.3.3## What's Changed * feat: refactor Ragas `embeddings` to maintain backward compatibility by @NirantK in https://github.com/explodinggradients/ragas/pull/2184 * feat: move `tracing` from experimental to ragas/integrations by @NirantK in https://github.com/explodinggradients/ragas/pull/2188 * feat: Refactor LLM structure and add InstructorLLM by @NirantK in https://github.com/explodinggradients/ragas/pull/2191 * fix: setup and local ci by @anistark in https://github.com/explodinggradients/ragas/puLow9/4/2025
v0.3.3rc1## What's Changed * feat: refactor Ragas `embeddings` to maintain backward compatibility by @NirantK in https://github.com/explodinggradients/ragas/pull/2184 * feat: move `tracing` from experimental to ragas/integrations by @NirantK in https://github.com/explodinggradients/ragas/pull/2188 * feat: Refactor LLM structure and add InstructorLLM by @NirantK in https://github.com/explodinggradients/ragas/pull/2191 * fix: setup and local ci by @anistark in https://github.com/explodinggradients/ragas/puLow9/4/2025
v0.3.2## What's Changed * feat: save and load prompt object by @jjmachan in https://github.com/explodinggradients/ragas/pull/2152 * refactor: use basemetric as base for metric by @anistark in https://github.com/explodinggradients/ragas/pull/2163 * add: pre-commit hook to run ci on every commit by @anistark in https://github.com/explodinggradients/ragas/pull/2164 * Make `AnswerSimilarity` a dataclass by @hutec in https://github.com/explodinggradients/ragas/pull/2159 * docs: Add discord link to CommunitLow8/19/2025
v0.3.2rc3Release v0.3.2rc3Low8/19/2025
v0.3.2-rc2## What's Changed * fix: pypi requirements and image absolute path by @anistark in https://github.com/explodinggradients/ragas/pull/2196 **Full Changelog**: https://github.com/explodinggradients/ragas/compare/v0.3.2-rc1...v0.3.2-rc2Low8/19/2025
v0.3.2-rc1## What's Changed * feat: save and load prompt object by @jjmachan in https://github.com/explodinggradients/ragas/pull/2152 * refactor: use basemetric as base for metric by @anistark in https://github.com/explodinggradients/ragas/pull/2163 * add: pre-commit hook to run ci on every commit by @anistark in https://github.com/explodinggradients/ragas/pull/2164 * Make `AnswerSimilarity` a dataclass by @hutec in https://github.com/explodinggradients/ragas/pull/2159 * docs: Add discord link to CommunitLow8/19/2025
v0.3.1## What's Changed * docs: improve hello world by @shahules786 in https://github.com/explodinggradients/ragas/pull/2127 * examples: fix api changes by @shahules786 in https://github.com/explodinggradients/ragas/pull/2129 * docs: add explanation by @shahules786 in https://github.com/explodinggradients/ragas/pull/2139 * adds Google Drive backend for storing datasets as sheets by @dmikey in https://github.com/explodinggradients/ragas/pull/2138 * Fix experimental docs navigation, fix broken tutorialsLow8/11/2025
v0.3.0## What's Changed * llamaindex agentic integration by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/2026 * CVE-2025-45691 Fix: Arbitrary File Read Vulnerability + Feat: Security settings by @adithyan-ak in https://github.com/explodinggradients/ragas/pull/1991 * feat: add devin's deepwiki by @jjmachan in https://github.com/explodinggradients/ragas/pull/2039 * feat: added `ragas_experimental` back into main by @jjmachan in https://github.com/explodinggradients/ragas/pull/2031 Low7/17/2025
v0.3.0-rc2Release v0.3.0-rc2Low7/17/2025
v0.3.0-rc1Release v0.3.0-rc1Low7/17/2025
v0.2.15## What's Changed * md of vertexai notebooks by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1957 * Single-hop query testset generation tutorial by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1959 * AWS Bedrock Integration by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1974 * Benchmarking Tutorial by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1984 * Create SECURITY.md by @adithyan-ak in https://github.comLow4/24/2025
v0.2.14## What's Changed * feat: add http request-response logging with env flag control by @ganeshrvel in https://github.com/explodinggradients/ragas/pull/1903 * Metric with different parameter gives different score by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1908 * Updated the instruction of `StatementGeneratorPrompt` in faithfulness by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1906 * Fixed simple criteria metric by @sahusiddharth in https://github.cLow3/4/2025
v0.2.13## What's Changed * Improved the documentation in the migration docs by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1873 * docs: match default `timeout` in `RunConfig` by @neo in https://github.com/explodinggradients/ragas/pull/1872 * fix(prompt/mixin): Add name property and add it to saving/loading path. by @tim-hilde in https://github.com/explodinggradients/ragas/pull/1853 * Adding missing evaluate import to evals guide by @carlosgsouza in https://github.com/explodinggraLow2/4/2025
v0.2.12## What's Changed * Add token parser for Bedrock & fix anthropic typo by @michaelromagne in https://github.com/explodinggradients/ragas/pull/1851 * fix tp&fp calc error by @oslijunw in https://github.com/explodinggradients/ragas/pull/1848 * docs: added canonical_url by @jjmachan in https://github.com/explodinggradients/ragas/pull/1855 * docs: be consistent with the variable naming of `evaluator_embeddings` by @neo in https://github.com/explodinggradients/ragas/pull/1858 * Add optional use_effectLow1/21/2025
v0.2.11## What's Changed * fix: add min requirement for optimisation by @shahules786 in https://github.com/explodinggradients/ragas/pull/1820 * fix: rubrics based metrics by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1821 * Enhance ToolCall message to support all argument types by @sahusiddharth in https://github.com/explodinggradients/ragas/pull/1823 * Added from_url method to fetch evaluation annotation from the api. by @ganeshrvel in https://github.com/explodinggradients/ragaLow1/14/2025
v0.2.10## What's Changed * docs: new get started guide by @shahules786 in https://github.com/explodinggradients/ragas/pull/1786 * Update README.md by @shahules786 in https://github.com/explodinggradients/ragas/pull/1794 * docs: fix typo by @shahules786 in https://github.com/explodinggradients/ragas/pull/1801 * fix: llamaindex testset generator by @jjmachan in https://github.com/explodinggradients/ragas/pull/1804 * Docs improvements by @EricGustin in https://github.com/explodinggradients/ragas/pull/1805Low1/8/2025
v0.2.9## What's Changed * feat: replaced NLTK's implementation of BLEU with sacrebleu's implementation by @trent-sp in https://github.com/explodinggradients/ragas/pull/1744 * doc: add note by @shahules786 in https://github.com/explodinggradients/ragas/pull/1753 * docs: add blog to readme by @shahules786 in https://github.com/explodinggradients/ragas/pull/1755 * fixes: some minor fixes by @shahules786 in https://github.com/explodinggradients/ragas/pull/1756 * docs: added ragas-app to getting started byLow12/24/2024
v0.2.8## What's Changed * fix: Update README.md by @avanco in https://github.com/explodinggradients/ragas/pull/1738 * feat: genetic algo based optimizer by @shahules786 in https://github.com/explodinggradients/ragas/pull/1724 * Added native support for Gemini models generation completion signals in LangchainLLMWrapper class / is_finished method by @cnemri in https://github.com/explodinggradients/ragas/pull/1727 * Added custom is_finished_parser logic to Google Vertex AI customizati… by @cnemri in httpLow12/10/2024
v0.2.7## What's Changed * feat: removed critics and rubrics from examples by @jjmachan in https://github.com/explodinggradients/ragas/pull/1691 * feat: test-generation improvements by @shahules786 in https://github.com/explodinggradients/ragas/pull/1698 * chore: updated tracking for single scores by @jjmachan in https://github.com/explodinggradients/ragas/pull/1716 * feat: remove examples by @jjmachan in https://github.com/explodinggradients/ragas/pull/1708 * Bug in documentation rag_testset_generatioLow12/6/2024
v0.2.6## What's Changed * Avoid unnecessary decompose_claims call when mode='precision' in _factual_correctness.py by @dfhssilva in https://github.com/explodinggradients/ragas/pull/1669 * fixes: instruction format by @shahules786 in https://github.com/explodinggradients/ragas/pull/1672 * docs: updated readme by @jjmachan in https://github.com/explodinggradients/ragas/pull/1674 * docs: update readme by @jjmachan in https://github.com/explodinggradients/ragas/pull/1675 * fixes: add extraction limit to Low11/19/2024
v0.2.5## What's Changed * feat: Unify the use of sentence_segmenter by @jasinliu in https://github.com/explodinggradients/ragas/pull/1629 * feat: aspect critic with reference by @shahules786 in https://github.com/explodinggradients/ragas/pull/1638 * docs: change setting llm by @shahules786 in https://github.com/explodinggradients/ragas/pull/1639 * fixed wrong variable names in starter guide using azure openai by @MichaelPerger in https://github.com/explodinggradients/ragas/pull/1641 * feat: improvemenLow11/12/2024
v0.2.4## What's Changed * Fix : Filename in the path by @satodayo in https://github.com/explodinggradients/ragas/pull/1605 * fix: remove abstract class in Metric by @jjmachan in https://github.com/explodinggradients/ragas/pull/1603 * Added pandas as a method for reading datasets by @joaorura in https://github.com/explodinggradients/ragas/pull/1595 * fix: treat anthropic-direct API the same as bedrock in llamaindex wrapper by @ahgraber in https://github.com/explodinggradients/ragas/pull/1584 * feat: adLow11/7/2024
v0.2.3## What's Changed * feat: measure cost of testset generator by @jjmachan in https://github.com/explodinggradients/ragas/pull/1560 * docs: added luka's blogs by @jjmachan in https://github.com/explodinggradients/ragas/pull/1554 * Fix: add LLMContextPrecisionWithReference to __init__.py by @licux in https://github.com/explodinggradients/ragas/pull/1561 * chore: add verbose option (V=1) to make targets by @trevorbowen in https://github.com/explodinggradients/ragas/pull/1556 * add embeddings to TestLow10/29/2024
v0.2.2## What's Changed * docs: update summarization calculation by @jCobbSK in https://github.com/explodinggradients/ragas/pull/1510 * fix: use get_temperature in base LLM generate by @sapphire-arches in https://github.com/explodinggradients/ragas/pull/1520 * doc: Update modifying prompts metrics doc by @FrankyCTY in https://github.com/explodinggradients/ragas/pull/1525 * fixbug: Expand sentence-ending punctuation check in _create_statements method by @Jeff-67 in https://github.com/explodinggradientsLow10/22/2024
v0.2.1## What's Changed * Fix Discord link by @NirantK in https://github.com/explodinggradients/ragas/pull/1498 * Fix Quickstart link by @NirantK in https://github.com/explodinggradients/ragas/pull/1499 * Rewrite Open Analytics by @NirantK in https://github.com/explodinggradients/ragas/pull/1500 * Suggestions for Introduction and Key Features by @NirantK in https://github.com/explodinggradients/ragas/pull/1501 * docs: fix broken links in testset generation custimisation by @jjmachan in https://github.Low10/16/2024
v0.2.0## What's Changed * feat: topic adherence metric by @shahules786 in https://github.com/explodinggradients/ragas/pull/1327 * feat: tool call accuracy by @shahules786 in https://github.com/explodinggradients/ragas/pull/1311 * feat: sql sementic equivalence by @shahules786 in https://github.com/explodinggradients/ragas/pull/1323 * feat: Factual correctnes metric by @shahules786 in https://github.com/explodinggradients/ragas/pull/1334 * docs: migrating over to mkdocs by @jjmachan in https://github.cLow10/14/2024
v0.1.21**Full Changelog**: https://github.com/explodinggradients/ragas/compare/v0.1.16...v0.1.21 - Pins to langchain<0.3 - based on v0.1.16 which was the last working version for 0.1 - hopefully last release before v0.2Low10/3/2024
v0.2.0-beta## What's Changed * feat: topic adherence metric by @shahules786 in https://github.com/explodinggradients/ragas/pull/1327 * feat: tool call accuracy by @shahules786 in https://github.com/explodinggradients/ragas/pull/1311 * feat: sql sementic equivalence by @shahules786 in https://github.com/explodinggradients/ragas/pull/1323 * feat: Factual correctnes metric by @shahules786 in https://github.com/explodinggradients/ragas/pull/1334 * docs: migrating over to mkdocs by @jjmachan in https://github.cLow10/3/2024
v0.1.20## What's Changed * fix: pin langchain_core to <3 by @jjmachan in https://github.com/explodinggradients/ragas/pull/1329 **Full Changelog**: https://github.com/explodinggradients/ragas/compare/v0.1.19...v0.1.20Low9/18/2024
v0.1.19## What's Changed * feat: add non llm based context recall by @shahules786 in https://github.com/explodinggradients/ragas/pull/1266 * feat: Non LLM based context precision by @shahules786 in https://github.com/explodinggradients/ragas/pull/1264 * feat: added n and temperature passthrough to prompt by @jjmachan in https://github.com/explodinggradients/ragas/pull/1281 * Add option to disable progress bar in `evaluate` by @freddiev4 in https://github.com/explodinggradients/ragas/pull/1279 * feat: nLow9/18/2024
v0.1.18## What's Changed * fix: inverse distance by @shahules786 in https://github.com/explodinggradients/ragas/pull/1267 * fix: non llm based metrics by @shahules786 in https://github.com/explodinggradients/ragas/pull/1268 * fix: v1 to v2 dataset by @shahules786 in https://github.com/explodinggradients/ragas/pull/1275 * fix: make score nested if loop_is_running by @jjmachan in https://github.com/explodinggradients/ragas/pull/1276 **Full Changelog**: https://github.com/explodinggradients/ragas/compaLow9/11/2024
v0.1.17## What's Changed * update documentation for faithfulness with HHEM by @Miaoranmmm in https://github.com/explodinggradients/ragas/pull/1242 * New data representation by @shahules786 in https://github.com/explodinggradients/ragas/pull/1222 * Feat: Opik Integration by @jverre in https://github.com/explodinggradients/ragas/pull/1256 * Add support to MultiTurn Metrics by @shahules786 in https://github.com/explodinggradients/ragas/pull/1249 * fix: add single_turn_ascore by @shahules786 in https://gitLow9/10/2024
v0.1.16## What's Changed * load HHEM on specified device by @Miaoranmmm in https://github.com/explodinggradients/ragas/pull/1235 * feat: new Prompt object and faithfulness metric by @jjmachan in https://github.com/explodinggradients/ragas/pull/1232 * feat (experimental) added new prompt and metric into `ragas.experimental` by @jjmachan in https://github.com/explodinggradients/ragas/pull/1240 ## New Contributors * @Miaoranmmm made their first contribution in https://github.com/explodinggradients/ragas/Low9/3/2024
v0.1.15## What's Changed * fix: typo in model name by @shahules786 in https://github.com/explodinggradients/ragas/pull/1198 * metrics: add domain specific rubrics based scoring by @vaishakhRaveendran in https://github.com/explodinggradients/ragas/pull/1189 * docs: rubrics based metrics by @shahules786 in https://github.com/explodinggradients/ragas/pull/1200 * docs: add context utilisation to the index by @shahules786 in https://github.com/explodinggradients/ragas/pull/1202 * Docs: fix typo in metrics bLow8/27/2024
v0.1.14## What's Changed * Add question to context recall doc by @shahules786 in https://github.com/explodinggradients/ragas/pull/1173 * Add context utilization by @shahules786 in https://github.com/explodinggradients/ragas/pull/1174 * doc: fix output for testset generation and missing files by @jjmachan in https://github.com/explodinggradients/ragas/pull/1176 * Helicone Integration by @chitalian in https://github.com/explodinggradients/ragas/pull/1178 * docs: minor fixes by @shahules786 in https://gitLow8/14/2024
v0.1.13## What's Changed * Fix "global flags not at the start" raised in extract_text function by @vaishakhRaveendran in https://github.com/explodinggradients/ragas/pull/1113 * Random seed fixed. by @Gwenn-LR in https://github.com/explodinggradients/ragas/pull/1142 * feat: Usage tracking for evaluations by @jjmachan in https://github.com/explodinggradients/ragas/pull/1129 * Fixing Demo Code - Compare Embeddings for retriever Docs by @matheusft in https://github.com/explodinggradients/ragas/pull/1153 * Low8/5/2024
v0.1.12## What's Changed * [Fix bug] The adaptation for `rewrite_invalid_question_prompt` was not saved by @psydok in https://github.com/explodinggradients/ragas/pull/1071 * [Fix bug] utils.py by @jrredondo in https://github.com/explodinggradients/ragas/pull/1083 * Llama Index integration: optional RunConfig for evaluate() by @antono in https://github.com/explodinggradients/ragas/pull/1085 * Create a custom directory loader for RAGAS by @vaishakhRaveendran in https://github.com/explodinggradients/ragasLow7/30/2024
v0.1.11## What's Changed * Testgeneration v3 by @shahules786 in https://github.com/explodinggradients/ragas/pull/1024 * fix: import errors and make `experimental` runnable by @jjmachan in https://github.com/explodinggradients/ragas/pull/1078 * chore: added analytics to count the num of testsets generated with experimental by @jjmachan in https://github.com/explodinggradients/ragas/pull/1081 * Fix mutable dataclass defaults by @yasyf in https://github.com/explodinggradients/ragas/pull/1082 * fix(executoLow7/22/2024
