trulens

Evaluation and Tracking for LLM Experiments and AI Agents

agent-evaluation agentops ai-agents ai-monitoring ai-observability evals explainable-ml llm-eval python

Why this rank: Strong adoption, recent release, healthy release cadence

README

🦑 Welcome to TruLens!

Don't just vibe-check your LLM app! Systematically evaluate and track your LLM experiments with TruLens. As you develop your app, including prompts, models, retrievers, knowledge sources, and more, TruLens is the tool you need to understand its performance.

Fine-grained, stack-agnostic instrumentation and comprehensive evaluations help you identify failure modes and iterate systematically to improve your application.

Read more about the core concepts behind TruLens including Feedback Functions, The RAG Triad, and Honest, Harmless and Helpful Evals.
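
As a rough illustration of the idea behind feedback functions and the RAG Triad, the sketch below scores a question, a retrieved context, and an answer on three axes: context relevance, groundedness, and answer relevance. It does not use the TruLens API. Every function name here is hypothetical, and crude token overlap stands in for the LLM-based judges a real evaluation would typically use.

```python
import re


def _tokens(text):
    """Lowercased word tokens; a crude stand-in for an LLM judge or embedding model."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap(source, target):
    """Fraction of the tokens in `target` that also appear in `source`, in [0, 1]."""
    target_tokens = _tokens(target)
    if not target_tokens:
        return 0.0
    return len(target_tokens & _tokens(source)) / len(target_tokens)


def context_relevance(question, context):
    # Does the retrieved context address the question?
    return overlap(context, question)


def groundedness(context, answer):
    # Is the answer supported by the retrieved context?
    return overlap(context, answer)


def answer_relevance(question, answer):
    # Does the answer address the question?
    return overlap(answer, question)


def rag_triad(question, context, answer):
    """Bundle the three scores for one question/context/answer record."""
    return {
        "context_relevance": context_relevance(question, context),
        "groundedness": groundedness(context, answer),
        "answer_relevance": answer_relevance(question, answer),
    }


scores = rag_triad(
    question="What is TruLens?",
    context="TruLens is a library for evaluating LLM apps.",
    answer="TruLens is a library.",
)
print(scores)
```

The point of the sketch is the shape, not the scoring method: each feedback function maps parts of a record to a number between 0 and 1, so a low score on one axis points at a specific failure mode (poor retrieval, an unsupported answer, or an off-topic answer).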

TruLens in the development workflow

Build your first prototype, then connect instrumentation and logging with TruLens. Decide which feedbacks you need and specify them with TruLens to run alongside your app. Then iterate and compare versions of your app in an easy-to-use user interface.
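
The workflow above (instrument, log, score, compare versions) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the TruLens API: `record_calls`, `conciseness`, `leaderboard`, and the two toy app versions are all hypothetical names, and the in-memory list stands in for a real logging backend.

```python
from collections import defaultdict
from functools import wraps
from statistics import mean

RECORDS = []  # in-memory log; a real setup would persist these records


def record_calls(app_version):
    """Hypothetical decorator: log each call's input and output under an app version."""
    def decorator(fn):
        @wraps(fn)
        def wrapper(prompt):
            output = fn(prompt)
            RECORDS.append({"version": app_version, "input": prompt, "output": output})
            return output
        return wrapper
    return decorator


def conciseness(record):
    """Toy feedback: 1.0 for outputs of at most 5 words, lower for longer outputs."""
    n_words = len(record["output"].split())
    return min(1.0, 5 / max(n_words, 1))


def leaderboard(records, feedback):
    """Average a feedback score per app version, highest average first."""
    scores = defaultdict(list)
    for record in records:
        scores[record["version"]].append(feedback(record))
    return sorted(
        ((version, mean(values)) for version, values in scores.items()),
        key=lambda item: item[1],
        reverse=True,
    )


@record_calls("v1")
def app_v1(prompt):
    # Stand-in for a first prototype with a verbose prompt template.
    return "Well, the answer to your question about " + prompt + " is probably yes"


@record_calls("v2")
def app_v2(prompt):
    # Stand-in for a revised version of the same app.
    return "Yes"


for question in ["caching", "retries"]:
    app_v1(question)
    app_v2(question)

for version, score in leaderboard(RECORDS, conciseness):
    print(version, round(score, 2))
```

Keeping the feedback function separate from the app means the same scorer can run over every version's records, which is what makes a side-by-side comparison meaningful.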

Installation and Setup

Install the trulens pip package from PyPI.

    pip install trulens

Quick Usage

Walk through how to instrument and evaluate a RAG built from scratch with TruLens.

💡 Contributing & Community

Interested in contributing? See our contributing guide for more details.

The best way to support TruLens is to give us a ⭐ on GitHub and join our discourse community!

Release History

Version	Changes	Urgency	Date
trulens-2.9.0	# TruLens 2.9.0 The TruLens community tent got a lot bigger in this release, and we all get to benefit. Fourteen new contributors and many other repeat contributors came together over the last couple months to give TruLens new capabilities for maturing your evaluation system, more robust and comprehensive tracing, and lots of new documentation and examples for CI/CD, agentic evals, and MCP instrumentation. Most of the new eval tooling even came from first-time contributors. ## New Features	High	7/23/2026
trulens-2.8.1	## What's Changed * feat: GEPA (Genetic Prompt Optimization) Integration by @bashward in https://github.com/truera/trulens/pull/2464 * update default models for bedrock, cortex, and litellm providers by @ykd007 in https://github.com/truera/trulens/pull/2511 * add py.typed markers to otel/semconv and hotspots; add SpanType.GUARDRAIL by @ykd007 in https://github.com/truera/trulens/pull/2512 * refactor: decouple Snowflake-specific logic from trulens-core by @joshreini1 in https://github.com/tru	High	5/14/2026
trulens-2.8.0	## What's Changed * Add SchemaValidation: non-LLM programmatic feedback for JSON schema / Pydantic output validation by @bashward in https://github.com/truera/trulens/pull/2427 * [fix]Generate testset examples and usage details addition by @lucifertrj in https://github.com/truera/trulens/pull/2384 * Add OpenAI Agent SDK examples by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2388 * Dashboard Performance: SQL-level aggregation for dashboard leaderboard by @sfc-gh-jreini in https:	High	4/30/2026
trulens-2.7.2	## What's Changed * Python 3.14 support: Traversable import path changed from `importlib.abc` to `importlib.resources.abc` by @rudolfolah in https://github.com/truera/trulens/pull/2386 * Add password-free auth params to SnowflakeConnector, rewrite Snowflake auth docs and block starting trulens dashboard when connected to snowflake by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2380 * Deprecate run_dashboard_sis by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2370 #	High	4/9/2026
trulens-2.7.1	## Bug Fixes * Fix papercuts with Metric API by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2362 ## Tech Debt Reduction - refactor "v2" feedback organization from monorepo by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2354 ## Test Fixes * Fix race condition for otel test db creation by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2368 ## Docs and Examples * update experimental examples and test with metricconfig by @sfc-gh-jreini in https://git	Low	3/10/2026
trulens-2.7.0	## New Unified Metric API We're introducing the `Metric` class as the new primary API for evaluation metrics in TruLens. This unified interface replaces both `Feedback` and `MetricConfig` with a cleaner, more explicit design. ### Why This Matters - Clearer, more readable code: Replace chained `.on_()` methods with an explicit `selectors={}` dictionary—no more guessing which argument maps to which selector - Single unified class*: No need to choose between `Feedback` and `Metric	Low	2/19/2026
trulens-2.6.0	## What's Changed ### Agent Skills and Agents.MD * agents.md and skills by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2337 * Skill and helpers for more beautiful tracing by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2340 ### Support for waiting for evaluation completion * Fix bug and allow waiting in otel by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2339 ### Postgres Support * Postgres support by @sfc-gh-jreini in https://github.com/truera/	Low	2/4/2026
trulens-2.5.3	## What Changed * Direct registration of LangGraph graph compatibility with AI Observability Run by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2311 * Additional Instructions for all feedback functions by @sfc-gh-ajia in https://github.com/truera/trulens/pull/2313 * Loosen trace compression by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2312 ## Bug Fixes * [Minor fix] Remove staticmethod descriptor for trace_with_run by @sfc-gh-dhuang in https://github.com/true	Low	1/15/2026
trulens-2.5.2	## What's Changed * fix: add additionalProperties: false to structured output schemas for Databricks compatibility by @debu-sinha in https://github.com/truera/trulens/pull/2308 * Make command for updating meta.yaml files by @sfc-gh-jreini in #2309 ## New Contributors * @debu-sinha made their first contribution in https://github.com/truera/trulens/pull/2308 Full Changelog: https://github.com/truera/trulens/compare/trulens-2.5.1...trulens-2.5.2	Low	12/11/2025
trulens-2.5.1	# TruLens 2.5.1 Patch Release ## Bug Fixes * [Snowflake] Enable get_records to fetch serverside metrics. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2293 * [Snowflake] Enable serverside metric computation kickoff from .py scripts by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2297 * Fix internal method to comply with pydantic change introduced in pydantic 2.10 by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2296	Low	11/21/2025
trulens-2.5.0	## What's Changed * Langchain 1.0 support by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2279 * Update pip install in notebook command with version constraints for langchain packages by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2275 * Remove noisy log msg by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2277 * track eval costs, surface total costs on leaderboard by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2278 ## New examples * Exa	Low	11/14/2025
trulens-2.4.2	## What's Changed * Tracing for LlamaIndex AgentWorkflow + notebook example with GPA evals by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2249 * Tracing and Eval for MCP in LangGraph by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2257 * Updating GPA prompts by @sfc-gh-ajia in https://github.com/truera/trulens/pull/2261 * quiet logger on context manager warning by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2262 * [Snowflake] SDK changes to use latest f	Low	10/21/2025
trulens-2.4.1	## What's Changed * [Snowflake] web search agent example with snowflake ai obs by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2232 * Handle timestamps in `get_events` for Snowflake AI observability event table. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2226 * data agent with cortex example by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2236 * Update meta.yaml files for release 2.4.0 by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/223	Low	10/8/2025
trulens-2.4.0	## New Features * Tracing for LlamaIndex Workflows + Improved Async OpenAI support by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2212 * Add `TruSession::compute_feedbacks_on_events` function to compute feedbacks on arbitrary records and feedbacks. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2206 * trace compression by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2225 * Reranker span type type and inclusion of reranker span type in LlamaIndex auto	Low	9/23/2025
trulens-2.3.1	## What's Changed * Langchain provider & Deepseek reasoning model support by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2191 * Allow for adding feedback (such as in human feedback). by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2139 * Add Litellm structured outputs by @sfc-gh-chu in https://github.com/truera/trulens/pull/2115 ## Bug Fixes * Cleanup bedrock provider imports by @Labreo in https://github.com/truera/trulens/pull/2202 * Explicitly turn off OTel	Low	9/4/2025
trulens-2.3.0	## What's Changed * add gpt-5 support, openai capabilities retry and caching, Context-Free-Grammar Support & Answer Relevance prompt fixes by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2189 * Allow main input specification for trace_with_run decorator (live tracing) by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2182 * Enable OTEL tracing by default by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2112 ## Examples * Add Tutorial for Multimodal Evalua	Low	8/28/2025
trulens-2.2.4	## What's Changed * Allow the use of your own lock for `compute_now`. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2184 * Bugfix: Clean up syntax, hanging vars, bugs in LangGraph quickstart nb by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/2186 * Batch insert events. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2183 Full Changelog: https://github.com/truera/trulens/compare/trulens-2.2.3...trulens-2.2.4	Low	8/20/2025
trulens-2.2.3	## What's Changed * TruLens version 2.2.2. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2177 * feat: Added Google provider to enable evaluations with Gemini models by @sahusiddharth in https://github.com/truera/trulens/pull/2153 * Use app versions instead of app ids when we can. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2179 ## New Contributors * @sahusiddharth made their first contribution in https://github.com/truera/trulens/pull/2153 **Full Cha	Low	8/20/2025
trulens-2.2.2	## What's Changed * Used `streamlit.Pages` instead of the `pages` folder. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2176 Full Changelog: https://github.com/truera/trulens/compare/trulens-2.2.1...trulens-2.2.2	Low	8/20/2025
trulens-2.2.1	## New Features * [Langgraph auto instrumentation] Improve instrumentation for Graph API by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2172 ## Bug Fixes * Record page UI updates by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2171 Full Changelog: https://github.com/truera/trulens/compare/trulens-2.2.0...trulens-2.2.1	Low	8/13/2025
trulens-2.2.0	## New Features * Dropped "trajectory" prefix from agent evals by @sfc-gh-ajia in https://github.com/truera/trulens/pull/2163 * Add context for inline evals by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2169 ## Bug Fixes * Supply host to dashboard. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2165 ## Documentation and Examples * update notebooks to new gpa-naming for agent evals by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2168 **Full C	Low	8/11/2025
trulens-2.1.4	## What's Changed * add trulens-apps-langgraph to optional requirements by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2159 * Update Langgraph version requirements to ensure we instrument TaskFunction / _TaskFunction by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2160 * [Snowflake] Live OTEL tracing with Snowflake Run by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2158 * custom attribute instrumentation + langgraph instrumentation docs by @sfc-gh-jre	Low	8/6/2025
trulens-2.1.2	## What's Changed * Address feedback on the data agents notebook by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2146 * Make empty events warning quieter by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2145 * improve groundedness reliability by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2147 * Sort siblings in the record viewer by their start time. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2149 * Lower Langgraph version required in	Low	7/31/2025
trulens-2.1.1	## What's Changed * Allow TruLens to use reasoning models in OpenAI provider by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2138 * Fixed Score Parsing error & Added Plan Adherence and Plan Quality by @sfc-gh-ajia in https://github.com/truera/trulens/pull/2137 * Improve main_input guessing logic by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2142 * In-line evals notebook by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2143 Full Changelog: https:/	Low	7/25/2025
trulens-2.1.0	# TruLens v2.1 TruLens 2.1 includes a number of new features and bug fixes to support tracing and evaluation of agents including Inline evals, trajectory evals, native LangGraph instrumentation (via `TruGraph`). Additionally, we made a variety of stability improvements to evaluators benefiting both OSS and Snowflake users including structured output support and shifting to new more stable serverside metric computation in Snowflake. ## New Features * Create `@inline_evaluations` decorator.	Low	7/22/2025
trulens-1.5.3	## What's Changed * Get more quickstarts OTEL-ready by @sfc-gh-jreini in https://github.com/truera/trulens/pull/2076 * [Docs] Nit: Update grammar for guardrails doc by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/2077 * [Docs] Improve and standardize language for instrumentation-specific docs, fix grammar and spelling errors by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/2078 * [Docs] Fix code examples, update grammar, spelling for logging-specific docs by @sfc-gh-nv	Low	7/1/2025
trulens-1.5.2	## What's Changed * Call `model_fields` and `model_computed_fields` from the class and not the instance. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2018 * [Nit] Fix grammar for otel blog banner shoutout by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/2053 * [Nit] Sans serif font fix by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/2054 * Improve error messaging for no records found error (check cross-format records) by @sfc-gh-nvytla in https://gith	Low	6/18/2025
trulens-1.5.1	## What's Changed * Clean up `TruSession` before tests as previous tests can interfere with each other. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2044 * Have eval spans hold all direct function calls. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/2043 * Summit agentic eval demo - streamlit feedback pill and result display by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/2035 * OTel Blog + Examples for pre-release by @sfc-gh-jreini in https://g	Low	6/5/2025
trulens-1.5.0	# Telemetry for the Agentic World: TruLens + OpenTelemetry Agents are rapidly gaining traction across AI applications. With this growth comes a new set of challenges: how do we trace, observe, and evaluate these dynamic, distributed systems? Today, we’re excited to share that TruLens now supports OpenTelemetry (OTel), unlocking powerful, interoperable observability for the agentic world. --- ## Challenge for Tracing Agents Tracing agentic applications is fundamentally differen	Low	6/2/2025
trulens-1.4.9	## What's Changed * Update poetry to trulens 1.4.8 by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/1898 * Doc string spelling by @cronoik-inceptionai in https://github.com/truera/trulens/pull/1895 * [SNOW-2030130] Standardize proper names across TruLens docs by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/1899 * Add Dashboard GIF to Viewing Results Page + update pypi downloads icon to trulens-core by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1900 * [SN	Low	4/11/2025
trulens-1.4.8	## What's Changed * Fix `tests/e2e/test_providers.py::TestProviders::test_llmcompletion_calibration`. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1881 * Fix azure openai e2e tests by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1886 * Fix `test_endpoints.py`. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1883 * [Nit] Update maintainer list, fix broken link to contributing guide by @sfc-gh-nvytla in https://github.com/truera/trulens/pull/1889	Low	4/3/2025
trulens-1.4.7	## What's Changed * Fix some e2e test issues. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1875 * Add quickstart notebook for PuPr by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1876 * Rename PuPr quickstart path to be lowercase by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1877 * [SNOW-1967293] Fix cortex cost tracking - only tallying requests at the end of the streamed SSE by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1878 * [SN	Low	3/20/2025
trulens-1.4.6	## What's Changed * bump otel version by @sfc-gh-chu in https://github.com/truera/trulens/pull/1871 * Sync conda deps with pyproject deps by @sfc-gh-chu in https://github.com/truera/trulens/pull/1872 * Enable logging in MySQL by @calabozo in https://github.com/truera/trulens/pull/1870 * Put costs in current span. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1873 * SDK: Fix run metadata field masks logic and expose description and label as optional and update-able fields b	Low	3/13/2025
trulens-1.4.5	## What's Changed * Fix double quotes issue in run operation system function / method by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1853 * Get E2E tests running (yes, not even working just running). by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1850 * Create load test. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1854 * Require newer versions of OTEL packages. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1855 * Update Snowfl	Low	3/6/2025
trulens-1.4.4	## What's Changed * Add hotpotqa data to pupr notebook by @sfc-gh-chu in https://github.com/truera/trulens/pull/1845 * Have all of the `TestOtelTruBasic`/`TestOtelTruChain`/`TestOtelTruCustom`/`TestOtelTruLlama` be subclasses of a superclass that has shared tests for all of them. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1844 * Handle Japanese (and other non-ascii languages) better. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1848 * Check before trying	Low	2/27/2025
trulens-1.4.3	## What's Changed ## Bug Fixes * Filipg/fix hotspots for release by @sfc-gh-fgralinski in https://github.com/truera/trulens/pull/1821 * LiteLLM fixes by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1829 * Don't track feedback functions. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1833 * Use context variables instead of thread_local since the latter doesn't actually inherit values from the parent threads. by @sfc-gh-dkurokawa in https://github.com/truera/trul	Low	2/26/2025
trulens-1.4.2	## What's Changed * [SNOW-1901834] SDK: Run APIs: CREATE, GET, LIST, and DELETE and run.start() by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1784 Full Changelog: https://github.com/truera/trulens/compare/trulens-1.4.1...trulens-1.4.2	Low	2/25/2025
trulens-1.4.1	## What's Changed * Bugfix: Avoid ingestion of incorrectly generated spans. by @sfc-gh-apgupta in https://github.com/truera/trulens/pull/1809 * Clean up semantic conventions. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1811 * Don't output a lot of span attributes if they're `None`. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1812 * Set default host for snowflake connector if not provided by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1814 *	Low	2/20/2025
trulens-1.4.0	## What's Changed * Create very general function to compute feedbacks. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1794 * even quieter logging by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1795 * Create distributed OTEL test. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1789 * Trulens hotspots by @sfc-gh-fgralinski in https://github.com/truera/trulens/pull/1757 * Clean up tests a bit and remove `ai.observability.domain` as a span attribute.	Low	2/13/2025
trulens-1.3.5	## What's Changed * Fix `snowflake-sqlalchemy` autocommit issue. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1792 Full Changelog: https://github.com/truera/trulens/compare/trulens-1.3.4...trulens-1.3.5	Low	2/6/2025
trulens-1.3.4	## What's Changed * Validation for sis dashboard names by @sfc-gh-chu in https://github.com/truera/trulens/pull/1750 * Handle sync and async generators with the OTEL `@instrument` decorator. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1748 * Require version of snowflake ml python >= 1.7.2 by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1752 * Write returns for "UNKNOWN" spans. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1753 * Set up basic E	Low	2/6/2025
trulens-1.3.3	## What's Changed * Enable exporting spans to snowflake stage if a `TruLensSnowflakeSpanExporter` is provided by @sfc-gh-gtokernliang in https://github.com/truera/trulens/pull/1708 * Allow `TruChain`/`TruLlama`/`TruRails` to use OTEL spans. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1727 * move `poetry-core` restrictions to az pipeline by @sfc-gh-chu in https://github.com/truera/trulens/pull/1732 * Updated context relevance chain-of-thought prompting by @sfc-gh-dhuang in	Low	1/27/2025
trulens-1.3.2	## Bug Fixes * Handle pydantic upgrade that now handles `model_fields` as a `property` that can resolve to a `dict` when there's nothing. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1726 * Don't create event tables unless `TRULENS_OTEL_TRACING` env variable is set. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1724 Full Changelog: https://github.com/truera/trulens/compare/trulens-1.3.1...trulens-1.3.2	Low	1/16/2025
trulens-1.3.1	## What's Changed * Update credit consumption table for Cortex LLM by @sfc-gh-dhuang : https://github.com/truera/trulens/pull/1721 ## Bug Fixes * Fix Snowflake SQL alchemy breaking behavior and ensure `AUTOCOMMIT` is enabled to fix trulens ingestion by @sfc-gh-dhuang https://github.com/truera/trulens/pull/1719 * Fix and update Snowflake quickstart notebook by @sfc-gh-dhuang https://github.com/truera/trulens/pull/1722 Full Changelog: https://github.com/truera/trulens/compare/trulens-1.	Low	1/16/2025
trulens-1.3.0	## Optimizing Feedback Functions In this release, we add important changes for improving the alignment of their LLM-Judge evals to human evaluations. ### Global Improvement of Groundedness Feedback The first is the global improvement of the groundedness feedback function (benchmarks and methods forthcoming). We invite any users to submit feedback (positive or negative) on the effectiveness of the new groundedness function using GitHub Issues or Discussions. You can view the addit	Low	1/10/2025
trulens-1.2.11	## What's Changed * Add snowflake PrPr notebook by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1683 * Support types for Python `< 3.9`. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1675 * Change issue assignee to Prudhvi to triage by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1686 * TREC DL and LLM AggreFact experiments for relevance benchmark + prompts comparisons and groundedness vs Bespoke Minicheck 7B by @sfc-gh-dhuang in https://github.com/tr	Low	12/16/2024
trulens-1.2.10	## What's Changed * decode app and record json in get_df_and_cols by @sfc-gh-chu in https://github.com/truera/trulens/pull/1672 * reset_database only drops trulens tables by @sfc-gh-chu in https://github.com/truera/trulens/pull/1676 * trulens-dashboard: handle selected_rows is None and fix fallback to st.dataframe in SiS by @sfc-gh-chu in https://github.com/truera/trulens/pull/1677 * Fix Cortex complete SDK response parsing. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/167	Low	12/6/2024
trulens-1.2.9	## What's Changed * adding jenkins file for e2e tests by @sfc-gh-srudenko in https://github.com/truera/trulens/pull/1661 * Memoize base endpoint creation for cost tracking by @sfc-gh-chu in https://github.com/truera/trulens/pull/1659 * add importlib resources to conda build by @sfc-gh-chu in https://github.com/truera/trulens/pull/1662 * Switch from SQL function to REST API backend for Cortex Complete - cost tracking of both feedback computations and app generation by @sfc-gh-dhuang in http	Low	12/4/2024
trulens-1.2.6	## What's Changed * Allow Cortex provider to only take a connection object. by @sfc-gh-pdharmana in https://github.com/truera/trulens/pull/1617 * Fix code example formatting in docs by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1610 * Fix old references to mae in GroundTruthAgreement feedback function by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1622 * Add try on tag creation since its an enterprise feature by @sfc-gh-pdharmana in https://github.com/truera/trulen	Low	11/6/2024
trulens-1.2.4	## What's Changed * try catch on tag creation (#1623) by @sfc-gh-pdharmana in https://github.com/truera/trulens/pull/1624 Full Changelog: https://github.com/truera/trulens/compare/trulens-1.2.2...trulens-1.2.4	Low	11/5/2024
trulens-1.2.2	## What's Changed * Use snowflake connector over snowpark session in trulens Snowflake DB connector as snowpark session isn't thread-safe. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1604 * Don't open extra Snowflake connections and don't recycle connections as quickly. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1609 * Remove unnecessary deps from `trulens-connectors-snowflake`. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1611 **F	Low	10/30/2024
trulens-1.2.1	## Bug Fixes * Don't check for user and account in snowpark sessions because Streamlit apps might hide them. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1600 * catch source code not available in `code_line` by @sfc-gh-pmardziel in https://github.com/truera/trulens/pull/1592 * use float nan in place of numpy for skipped evals by @sfc-gh-chu in https://github.com/truera/trulens/pull/1595 * Fix the misspelled `trulens-providers-openai` package in examples by @SSK-14 in https:	Low	10/29/2024
trulens-1.2.0	## What's Changed * Blocking guardrails by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1584 * and add dataset preprocessing utils used in benchmarking by @sfc-gh-dhuang in https://github.com/truera/trulens/pull/1559 * Use ggshield for local secret scanning by @sfc-gh-jreini in https://github.com/truera/trulens/pull/1585 * Clean before uploading docs. by @sfc-gh-dkurokawa in https://github.com/truera/trulens/pull/1594 * Update dev guide with git lfs instructions by @sfc-gh-chu i	Low	10/28/2024

Similar Packages

phoenix: AI Observability & Evaluation (arize-phoenix-v19.0.0)

giskard-oss: 🐢 Open-Source Evaluation & Testing library for LLM Agents (giskard-scan/v1.0.0b3)

logfire: AI observability platform for production LLM and agent systems. (v4.38.0)

agent-actions: Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates. (v0.2.7)

ring: 89 skills and 38 specialized agents that enforce proven engineering practices for AI-assisted development. TDD, systematic debugging, parallel code review, and 10-gate development cycles, as a Claude... (main@2026-07-17)

More in Testing

multi-agent-ralph-loop: Autonomous orchestration framework for Claude Code with MemPalace-inspired memory (4-layer stack, 818-token wake-up), parallel-first Agent Teams (6 teammates), Aristotle First Principles methodology,

pilot: #1 Terminal Benchmark 2.0, AI that ships your tickets.

Observal: Observal is an AI agent registry with a first-in-class observability and eval framework

phoenix: AI Observability & Evaluation