# onnxruntime

> ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

- **URL**: https://www.freshcrate.ai/projects/onnxruntime
- **Author**: microsoft
- **Category**: Frameworks
- **Latest version**: `v1.26.0` (2026-05-08)
- **License**: MIT
- **Source**: https://github.com/microsoft/onnxruntime
- **Homepage**: https://onnxruntime.ai
- **Language**: C++
- **GitHub**: 19,924 stars, 3,842 forks
- **Registry**: github
- **Tags**: `ai-framework`, `c++`, `deep-learning`, `hardware-acceleration`, `machine-learning`, `neural-networks`, `onnx`, `pytorch`, `scikit-learn`

## Description

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

## Recent releases

| Version | Date | Urgency | Changes |
| --- | --- | --- | --- |
| `v1.26.0` | 2026-05-08 | High | n.b. The following was generated via LLM from Git history. Only the contributor list has been verified.  # ONNX Runtime Release 1.26.0  ## Announcement - Breaking Changes  - **Support for CUDA 12 will be removed in 1.27.0.**   - CUDA 13 will continue to be published as `onnxruntime-<os>-<arch>-gpu_cuda13-<version>.<ext>` - CUDA runtime will be moving soon to a dedicated Execution Provider (EP) instead of a published package from ORT core.  ## Highlights  - Added optional memory mappi |
| `v1.25.1` | 2026-04-27 | High | # ONNX Runtime Release 1.25.1  ## 📢 Announcements & Breaking Changes  ### ONNX Op Updates  *   **Enhanced ONNX operator support** with new opset versions: Reshape (opset 25), Transpose (opset 24) ([#27752](https://github.com/microsoft/onnxruntime/pull/27752))  ---  ## ✨ New Features  ### 📊 New ONNX Ops & Model Support  *   **LinearAttention and CausalConvState operators** for Qwen3.5 model support ([#27907](https://github.com/microsoft/onnxruntime/pull/27907)) *   **RotaryEmbedd |
| `v1.25.0` | 2026-04-20 | High | ## 📢 Announcements & Breaking Changes  ### Build & Platform  *   **C++20 is now required** to build ONNX Runtime from source. Minimum toolchains: MSVC 19.29+, GCC 10+, Clang 10+. Users of prebuilt packages are unaffected. ([#27178](https://github.com/microsoft/onnxruntime/pull/27178)) *   **CUDA minimum version raised to 12.0** — CUDA 11.x is no longer supported. Users pinned to CUDA 11.x should stay on ORT 1.24.x or upgrade their CUDA toolkit/driver. ([#27570](https://github.com/microsoft |
| `v1.24.4` | 2026-03-17 | Medium | This is a patch release for ONNX Runtime 1.24, containing bug fixes and execution provider updates.  ## Bug Fixes - **Core**: Added PCI bus fallback for Linux GPU device discovery in containerized environments (e.g., AKS/Kubernetes) where `nvidia-drm` is not loaded but GPU PCI devices are still exposed via sysfs. ([#27591](https://github.com/microsoft/onnxruntime/pull/27591)) - **Plugin EP**: Fixed null pointer dereference when iterating output spans in `GetOutputIndex`. ([#27644](https://gi |
| `v1.24.3` | 2026-03-05 | Low | This is a patch release for ONNX Runtime 1.24, containing bug fixes, security improvements, performance enhancements, and execution provider updates.  ## Security Fixes - **Core**: Fixed GatherCopyData integer truncation leading to heap out-of-bounds read/write. ([#27444](https://github.com/microsoft/onnxruntime/pull/27444)) - **Core**: Fixed RoiAlign heap out-of-bounds read via unchecked batch_indices. ([#27543](https://github.com/microsoft/onnxruntime/pull/27543)) - **Core**: Prevent heap |
| `v1.24.2` | 2026-02-19 | Low | This is a patch release for ONNX Runtime 1.24, containing several bug fixes, security improvements, and execution provider updates.  ## Bug Fixes - **NuGet**: Fixed native library loading issues in the ONNX Runtime NuGet package on Linux and macOS. ([#27266](https://github.com/microsoft/onnxruntime/pull/27266)) - **macOS**: Fixed Java support and Jar testing on macOS ARM64. ([#27271](https://github.com/microsoft/onnxruntime/pull/27271)) - **Core**: Enable Robust Symlink Support for External |
| `v1.24.1` | 2026-02-06 | Low | ## 📢 Announcements & Breaking Changes  ### Platform Support Changes - **Python 3.10 wheels are no longer published** — Please upgrade to Python 3.11+ - **Python 3.14 support added** - **Free-threaded Python (PEP 703)** — Added support for Python 3.13t and 3.14t in Linux ([#26786](https://github.com/microsoft/onnxruntime/pull/26786)) - **x86_64 binaries for macOS/iOS are no longer provided and minimum macOS is raised to 14.0**  ### API Version - **ORT_API_VERSION** updated to **24** ([# |
| `v1.23.2` | 2025-10-25 | Low | Release v1.23.2 |
| `v1.23.1` | 2025-10-08 | Low | ## What's Changed - Fix Attention GQA implementation on CPU (#25966)   - Address edge GetMemInfo edge cases (#26021)   - Implement new Python APIs (#25999)   - MemcpyFromHost and MemcpyToHost support for plugin EPs (#26088)   - [TRT RTX EP] Fix bug for generating the correct subgraph in GetCapability (#26132) - add session_id_ to LogEvaluationStart/Stop, LogSessionCreationStart (#25590)   - [build] fix WebAssembly build on macOS/arm64 (#25653)   - [CPU] MoE Kernel (#25958)   - [CPU] Blo |
| `v1.23.0` | 2025-09-26 | Low | # Announcements  - This release introduces Execution Provider (EP) Plugin API, which is a new infrastructure for building plugin-based EPs. (#24887 , #25137, #25124, #25147, #25127, #25159, #25191, #2524)  - This release introduces the ability to dynamically download and install execution providers. This feature is exclusively available in the WinML build and requires Windows 11 version 25H2 or later. To leverage this new capability, C/C++/C# users should use the builds distributed through t |

## Citation

- HTML: https://www.freshcrate.ai/projects/onnxruntime
- Markdown: https://www.freshcrate.ai/projects/onnxruntime.md
- Dependencies JSON: https://www.freshcrate.ai/api/projects/onnxruntime/deps

_Generated by freshcrate.ai. Indexes github releases for AI-agent ecosystem packages._