onnxruntime

Home > Frameworks > onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

ai-framework c++deep-learning hardware-acceleration machine-learning neural-networks onnx pytorch scikit-learn

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

README

ONNX Runtime is a cross-platform inference and training machine-learning accelerator.

ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as PyTorch and TensorFlow/Keras as well as classical machine learning libraries such as scikit-learn, LightGBM, XGBoost, etc. ONNX Runtime is compatible with different hardware, drivers, and operating systems, and provides optimal performance by leveraging hardware accelerators where applicable alongside graph optimizations and transforms. Learn more →

ONNX Runtime training can accelerate the model training time on multi-node NVIDIA GPUs for transformer models with a one-line addition for existing PyTorch training scripts. Learn more →

Get Started & Resources

General Information: onnxruntime.ai
Usage documentation and tutorials: onnxruntime.ai/docs
YouTube video tutorials: youtube.com/@ONNXRuntime
Upcoming Release Roadmap
Companion sample repositories:
- ONNX Runtime Inferencing: microsoft/onnxruntime-inference-examples
- ONNX Runtime Training: microsoft/onnxruntime-training-examples

Releases

The current release and past releases can be found here: https://github.com/microsoft/onnxruntime/releases.

For details on the upcoming release, including release dates, announcements, features, and guidance on submitting feature requests, please visit the release roadmap: https://onnxruntime.ai/roadmap.

Data/Telemetry

Windows distributions of this project may collect usage data and send it to Microsoft to help improve our products and services. See the privacy statement for more details.

Contributions and Feedback

We welcome contributions! Please see the contribution guidelines.

For feature requests or bug reports, please file a GitHub Issue.

For general discussion or questions, please use GitHub Discussions.

Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

License

This project is licensed under the MIT License.

Release History

Version	Changes	Urgency	Date
v1.26.0	n.b. The following was generated via LLM from Git history. Only the contributor list has been verified. # ONNX Runtime Release 1.26.0 ## Announcement - Breaking Changes - Support for CUDA 12 will be removed in 1.27.0. - CUDA 13 will continue to be published as `onnxruntime-<os>-<arch>-gpu_cuda13-<version>.<ext>` - CUDA runtime will be moving soon to a dedicated Execution Provider (EP) instead of a published package from ORT core. ## Highlights - Added optional memory mappi	High	5/8/2026
v1.25.1	# ONNX Runtime Release 1.25.1 ## 📢 Announcements & Breaking Changes ### ONNX Op Updates * Enhanced ONNX operator support with new opset versions: Reshape (opset 25), Transpose (opset 24) ([#27752](https://github.com/microsoft/onnxruntime/pull/27752)) --- ## ✨ New Features ### 📊 New ONNX Ops & Model Support * LinearAttention and CausalConvState operators for Qwen3.5 model support ([#27907](https://github.com/microsoft/onnxruntime/pull/27907)) * **RotaryEmbedd	High	4/27/2026
v1.25.0	## 📢 Announcements & Breaking Changes ### Build & Platform * C++20 is now required to build ONNX Runtime from source. Minimum toolchains: MSVC 19.29+, GCC 10+, Clang 10+. Users of prebuilt packages are unaffected. ([#27178](https://github.com/microsoft/onnxruntime/pull/27178)) * CUDA minimum version raised to 12.0 — CUDA 11.x is no longer supported. Users pinned to CUDA 11.x should stay on ORT 1.24.x or upgrade their CUDA toolkit/driver. ([#27570](https://github.com/microsoft	High	4/20/2026
v1.24.4	This is a patch release for ONNX Runtime 1.24, containing bug fixes and execution provider updates. ## Bug Fixes - Core: Added PCI bus fallback for Linux GPU device discovery in containerized environments (e.g., AKS/Kubernetes) where `nvidia-drm` is not loaded but GPU PCI devices are still exposed via sysfs. ([#27591](https://github.com/microsoft/onnxruntime/pull/27591)) - Plugin EP: Fixed null pointer dereference when iterating output spans in `GetOutputIndex`. ([#27644](https://gi	Medium	3/17/2026
v1.24.3	This is a patch release for ONNX Runtime 1.24, containing bug fixes, security improvements, performance enhancements, and execution provider updates. ## Security Fixes - Core: Fixed GatherCopyData integer truncation leading to heap out-of-bounds read/write. ([#27444](https://github.com/microsoft/onnxruntime/pull/27444)) - Core: Fixed RoiAlign heap out-of-bounds read via unchecked batch_indices. ([#27543](https://github.com/microsoft/onnxruntime/pull/27543)) - Core: Prevent heap	Low	3/5/2026
v1.24.2	This is a patch release for ONNX Runtime 1.24, containing several bug fixes, security improvements, and execution provider updates. ## Bug Fixes - NuGet: Fixed native library loading issues in the ONNX Runtime NuGet package on Linux and macOS. ([#27266](https://github.com/microsoft/onnxruntime/pull/27266)) - macOS: Fixed Java support and Jar testing on macOS ARM64. ([#27271](https://github.com/microsoft/onnxruntime/pull/27271)) - Core: Enable Robust Symlink Support for External	Low	2/19/2026
v1.24.1	## 📢 Announcements & Breaking Changes ### Platform Support Changes - Python 3.10 wheels are no longer published — Please upgrade to Python 3.11+ - Python 3.14 support added - Free-threaded Python (PEP 703) — Added support for Python 3.13t and 3.14t in Linux ([#26786](https://github.com/microsoft/onnxruntime/pull/26786)) - x86_64 binaries for macOS/iOS are no longer provided and minimum macOS is raised to 14.0 ### API Version - ORT_API_VERSION updated to 24 ([#	Low	2/6/2026
v1.23.2	Release v1.23.2	Low	10/25/2025
v1.23.1	## What's Changed - Fix Attention GQA implementation on CPU (#25966) - Address edge GetMemInfo edge cases (#26021) - Implement new Python APIs (#25999) - MemcpyFromHost and MemcpyToHost support for plugin EPs (#26088) - [TRT RTX EP] Fix bug for generating the correct subgraph in GetCapability (#26132) - add session_id_ to LogEvaluationStart/Stop, LogSessionCreationStart (#25590) - [build] fix WebAssembly build on macOS/arm64 (#25653) - [CPU] MoE Kernel (#25958) - [CPU] Blo	Low	10/8/2025
v1.23.0	# Announcements - This release introduces Execution Provider (EP) Plugin API, which is a new infrastructure for building plugin-based EPs. (#24887 , #25137, #25124, #25147, #25127, #25159, #25191, #2524) - This release introduces the ability to dynamically download and install execution providers. This feature is exclusively available in the WinML build and requires Windows 11 version 25H2 or later. To leverage this new capability, C/C++/C# users should use the builds distributed through t	Low	9/26/2025
v1.22.2	# What's new? This release adds an optimized CPU/MLAS implementation of DequantizeLinear (8 bit) and introduces the build option client_package_build, which enables default options that are more appropriate for client/on-device workloads (e.g., disable thread spinning by default). ## Build System & Packages - Add –client_package_build option ([#25351](https://github.com/microsoft/onnxruntime/pull/25351)) - @jywu-msft - Remove the python installation steps from win-qnn-arm64-ci-pipe	Low	8/13/2025
v1.22.1	# What's new? This release replaces static linking of dxcore.lib with optional runtime loading, lowering the minimum supported version from Windows 10 22H2 (10.0.22621) to 20H1 (10.0.19041). This enables compatibility with Windows Server 2019 (10.0.17763), where dxcore.dll may be absent. - change dependency from gitlab eigen to github eigen-mirror #24884 - @prathikr - Weaken dxcore dependency #24845 - @skottmckay - [DML] Restore compatibility with Windows Sdk 10.0.17134.0 #24950 - @J	Low	7/8/2025
v1.22.0	## Announcements * This release introduces new API's for Model Editor, Auto EP infrastructure, and AOT Compile * OnnxRuntime GPU packages require CUDA 12.x , packages built for CUDA 11.x are no longer published. * The min supported Windows version is now 10.0.19041. ## GenAI & Advanced Model Features * Constrained Decoding: Introduced new capabilities for constrained decoding, offering more control over generative AI model outputs. ## Execution & Core Optimizations ### Co	Low	5/10/2025
v1.21.1	# What's new? - Extend CMAKE_CUDA_FLAGS with all Blackwell compute capacity #23928 - @yf711 - [ARM CPU] Fix fp16 const initialization on no-fp16 platform #23978 - @fajin-corp - [TensorRT EP] Call cudaSetDevice at compute function for handling multithreading scenario #24010 - @chilo-ms - Fix attention bias broadcast #24017 - @tianleiwu - Deleted the constant SKIP_CUDA_TEST_WITH_DML #24113 - @CodingSeaotter - [QNN EP] ARM64EC python package remove --vcpkg in build #24174 - @jywu-msft	Low	4/21/2025
v1.21.0	## Announcements - No large announcements of note this release! We've made a lot of small refinements to streamline your ONNX Runtime experience. ## GenAI & Advanced Model Features ### Enhanced Decoding & Pipeline Support - Added "chat mode" support for CPU, GPU, and WebGPU. - Provided support for decoder model pipelines. - Added support for Java API for MultiLoRA. ### API & Compatibility Updates - Chat mode introduced breaking changes in the API (see [migration guide](https://onnx	Low	3/8/2025
v1.20.2	# What's new? ## Build System & Packages - Merge Windows machine pools for Web CI pipeline to reduce maintenance costs (#23243) - @snnn - Update boost URL for React Native CI pipeline (#23281) - @jchen351 - Move ORT Training pipeline to GitHub actions and enable CodeQL scan for the source code (#22543) - @snnn - Move Linux GitHub actions to a dedicated machine pool (#22566) - @snnn - Update Apple deployment target to iOS 15.1 and macOS 13.3 (#23308) - @snnn - Deprecate macOS 12 in pa	Low	2/12/2025
v1.20.1	# What's new? ## Python Quantization Tool - Prevent int32 quantized bias from clipping by adjusting the weight's scale (#22020) - @adrianlizarraga - Update QDQ Pad, Slice, Softmax (#22676) - @adrianlizarraga - Introduce get_qdq_config() helper to get QDQ configurations (#22677) - @adrianlizarraga - Add reduce_range option to get_qdq_config() (#22782) - @adrianlizarraga - Flaky test due to Pad reflect bug (#22798) - @adrianlizarraga ## CPU EP - Refactor SkipLayerNorm implementati	Low	11/21/2024
v1.20.0	Release Manager: @apsonawane # Announcements - All ONNX Runtime Training packages have been deprecated. ORT 1.19.2 was the last release for which onnxruntime-training (PyPI), onnxruntime-training-cpu (PyPI), Microsoft.ML.OnnxRuntime.Training (Nuget), onnxruntime-training-c (CocoaPods), onnxruntime-training-objc (CocoaPods), and onnxruntime-training-android (Maven Central) were published. - ONNX Runtime packages will stop supporting Python 3.8 and Python 3.9. This decision align	Low	11/1/2024
v1.19.2	## Announcements * ORT 1.19.2 is a small patch release, fixing some broken workflows and introducing bug fixes. ## Build System & Packages * Fixed the signing of native DLLs. * Disabled absl symbolize in Windows Release build to avoid dependency on dbghelp.dll. ## Training * Restored support for CUDA compute capability 7.0 and 7.5 with CUDA 12, and 6.0 and 6.1 with CUDA 11. * Several fixes for training CI pipelines. ## Mobile * Fixed ArgMaxOpBuilder::AddToModelBuilderImpl() nullpt	Low	9/4/2024
v1.19.0	## Announcements * Note that the wrong commit was initially tagged with v1.19.0. The final commit has since been correctly tagged: https://github.com/microsoft/onnxruntime/commit/26250ae74d2c9a3c6860625ba4a147ddfb936907. This shouldn't effect much, but sorry for the inconvenience! ## Build System & Packages * Numpy support for 2.x has been added * Qualcomm SDK has been upgraded to 2.25 * ONNX has been upgraded from 1.16 → 1.16.1 * Default GPU packages use CUDA 12.x and Cudnn 9.x (pre	Low	8/19/2024
v1.18.1	## What's new? Announcements: - ONNX Runtime Python packages now have numpy dependency >=1.21.6, <2.0. Support for numpy 2.0 will be added in a future release. - CUDA 12.x ONNX Runtime GPU packages are now built against cuDNN 9.x (1.18.0 packages previously depended on cuDNN 8.x). CUDA 11.x ONNX Runtime GPU packages continue to depend on CuDNN 8.x. - Windows packages require installation of Microsoft Visual C++ Redistributable Runtime 14.38 or newer. TensorRT EP: - TensorRT We	Low	6/28/2024
v1.18.0	## Announcements * Windows ARM32 support has been dropped at the source code level. * Python version >=3.8 is now required for build.bat/build.sh (previously >=3.7). Note: If you have Python version <3.8, you can bypass the tools and use CMake directly. * The [onnxruntime-mobile](https://mvnrepository.com/artifact/com.microsoft.onnxruntime/onnxruntime-mobile) Android package and onnxruntime-mobile-c/onnxruntime-mobile-objc iOS cocoapods are being deprecated. Please use the [onnx	Low	5/21/2024
v1.17.3	# What's new? General: - Update copying API header files to make Linux logic consistent with Windows ([#19736](https://github.com/microsoft/onnxruntime/pull/19736)) - @mszhanyi - Pin ONNX version to fix DML and Python packaging pipeline exceptions ([#20073](https://github.com/microsoft/onnxruntime/pull/20073)) - @mszhanyi Build System & Packages: - Fix minimal build with training APIs enabled bug affecting Apple framework ([#19858](https://github.com/microsoft/onnxruntime/pull/	Low	4/18/2024
v1.17.1	This patch release includes the following updates: # General - Update thread affinity on server so it is only set with auto affinity ([#19318](https://github.com/microsoft/onnxruntime/pull/19318)) - @ivberg # Build System and Packages - Fix bug that was breaking arm64 build by disabling __cpuid check on arm64 builds since intrinsic is not available ([#19574](https://github.com/microsoft/onnxruntime/pull/19574)) - @smk2007 # Core - Add capturestate / rundown ETW support loggin	Low	2/27/2024
v1.17.0	# Announcements In the next release, we will totally drop support for Windows ARM32. # General - Added support for new ONNX 1.15 opsets: [IsInf-20](https://github.com/onnx/onnx/blob/main/docs/Changelog.md#IsInf-20), [IsNaN-20](https://github.com/onnx/onnx/blob/main/docs/Changelog.md#IsNaN-20), [DFT-20](https://github.com/onnx/onnx/blob/main/docs/Changelog.md#DFT-20), [ReduceMax-20](https://github.com/onnx/onnx/blob/main/docs/Changelog.md#ReduceMax-20), [ReduceMin-20](https://github.com/on	Low	2/3/2024
v1.16.3	## What's Changed 1. Stable Diffusion XL demo update by @tianleiwu in https://github.com/microsoft/onnxruntime/pull/18496 2. Fixed a memory leak issue(#18466) in TensorRT EP by @chilo-ms in https://github.com/microsoft/onnxruntime/pull/18467 3. Fix a use-after-free bug in SaveInputOutputNamesToNodeMapping function by @snnn in https://github.com/microsoft/onnxruntime/pull/18456 . The issue was found by AddressSanitizer.	Low	11/20/2023
v1.16.2	The patch release includes updates on: * Performance optimizations for Llama2 on CUDA EP and DirectML EP * Performance optimizations for Stable Diffusion XL model for CUDA EP * [Demos](https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/python/tools/transformers/models/stable_diffusion/README.md) for text to image generation * Mobile bug fixes for crash on some older 64-bit ARM devices and AOT inlining issue on iOS with C# bindings * TensorRT EP bug fixes for user provided co	Low	11/9/2023
v1.16.1	Patch release for 1.16 - Fix type of weights and activations in the ONNX quantizer - Fix quantization bug in historic quantizer #17619 - Enable session option access in nodejs API - Update nodejs to v18 - Align ONNX Runtime extensions inclusion in source and build - Limit per thread context to 1 in the TensorRT EP to avoid error caused by input shape changes	Low	10/11/2023
v1.16.0	# General * Support for serialization of models >=2GB # APIs * New session option to disable default CPU EP fallback `session.disable_cpu_ep_fallback` * Java * Support for fp16 and bf16 tensors as inputs and outputs, along with utilities to convert between these and fp32 data. On JDK 20 and newer the fp16 conversion methods use the JDK's Float.float16ToFloat and Float.floatToFloat16 methods which can be hardware accelerated and vectorized on some platforms. * Support for external in	Low	9/20/2023
v1.15.1	This release fixed the following issues: 1. A coding problem in test/shared_lib/test_inference.cc that it should use ASSERT_NEAR to test float values instead of ASSERT_EQ. Without this change, some DNNL/OpenVino tests would fail on some AMD CPUs. 2. A misaligned error in cublasGemmBatchedHelper function. The error only occurs when CUDA version = 11.8 and the GPU's CUDA Compute capability >=80. (In other words: with TensorFloat-32 support) (#15981) 3. A build issue that build with onnxruntim	Low	6/16/2023
v1.15.0	# Announcements Starting from the next release(ONNX Runtime 1.16.0), at operating system level we will drop the support for - iOS 11 and below. iOS 12 will be the minimum supported version. - CentOS 7, Ubuntu 18.04, and any Linux distro without glibc version >=2.28. At compiler level we will drop the support for - GCC version <= 9 - Visual Studio 2019 Also, we will remove the onnxruntime_DISABLE_ABSEIL build option since we will upgrade protobuf and the new protobuf version wil	Low	5/25/2023
v1.14.1	This patch addresses packaging issues and bug fixes on top of v1.14.0: * Mac OS Python build for x86 arch (issue: #14663) * DirectML EP fixes: sequence ops (#14442), package naming to remove -dev suffix * CUDA12 build compatibility (#14659) * Performance regression fixes: IOBinding input (#14719), Transformer models (#14732, #14517, #14699) * ORT Training kernel fix (#14727) Only select packages were published for this patch release; others can be found in the attachments below: * Pypi:	Low	3/2/2023
v1.14.0	# Announcements * Building ORT from source will require cmake version >=3.24 instead of >=3.18. # General * [ONNX 1.13](https://github.com/onnx/onnx/releases/tag/v1.13.0) support (opset 18) * Threading * ORT Threadpool is now NUMA aware [(details)](https://onnxruntime.ai/docs/performance/tune-performance.html#numa-support-and-performance-tuning) * New API to set thread affinity ([details](https://onnxruntime.ai/docs/performance/tune-performance.html#set-intra-op-thread-affinity))	Low	2/11/2023
v1.13.1	# Announcements * Security issues addressed by this release 1. A protobuf security issue CVE-2022-1941 that impact users who load ONNX models from untrusted sources, for example, a deep learning inference service which allows users to upload their models then runs the inferences in a shared environment. 2. An ONNX security vulnerability that allows reading of tensor_data outside the model directory, which allows attackers to read or write arbitrary files on an affected system that load	Low	10/24/2022
v1.12.1	This patch addresses packaging issues and bug fixes on top of v1.12.0. - Java package: MacOS M1 support folder structure fix - Android package: enable optimizations - GPU (TensorRT provider): bug fixes - DirectML: package fix - WinML: bug fixes See #12418 for full list of specific fixes included	Low	8/4/2022
v1.12.0	# Announcements * For Execution Provider maintainers/owners: the [lightweight compile API](https://github.com/microsoft/onnxruntime/blob/master/include/onnxruntime/core/framework/execution_provider.h#L249) is now the default compiler API for all Execution Providers (this was previously only available for the mobile build). If you have an EP using the [legacy compiler API](https://github.com/microsoft/onnxruntime/blob/master/include/onnxruntime/core/framework/execution_provider.h#L237), please m	Low	7/22/2022
v1.11.1	This is a patch release on 1.11.0 with the following fixes: - Symbolic shape infer error (https://github.com/microsoft/onnxruntime/pull/10674) - Quantization tool bug (https://github.com/microsoft/onnxruntime/pull/10940) - Adds missing numpy type when looking for the ort correspondance (https://github.com/microsoft/onnxruntime/pull/10943) - Profiling tool JSON format bug (https://github.com/microsoft/onnxruntime/pull/11046) - Function bug fix (https://github.com/microsoft/onnxruntime/pull	Low	4/27/2022
v1.11.0	# Key Updates ## General * Support for ONNX 1.11 with opset 16 * Updated protobuf version to 3.18.x * Enable usage of Mimalloc ([details](https://onnxruntime.ai/docs/performance/tune-performance.html#mimalloc-allocator-usage)) * Transformer model helper scripts * [T5 conversion script](https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/python/tools/transformers/models/t5/convert_to_onnx.py) * [GPT2 conversion script](https://github.com/microsoft/onnxruntime/tree/maste	Low	3/26/2022
v1.10.0	# Announcements * As noted in the [deprecation notice](https://github.com/microsoft/onnxruntime/blob/4daa14bc74b5378d5fcb0d6de063a9fa8bd42eac/onnxruntime/python/onnxruntime_inference_collection.py#L350) in ORT 1.9, InferenceSession now requires the providers parameters to be set when enabling Execution Providers other than default CPUExecutionProvider. e.g. InferenceSession('model.onnx', providers=['CUDAExecutionProvider']) * Python 3.6 support removed for Mac builds. Since 3.6 is end-of-life	Low	12/8/2021
v1.9.1	This is a patch release on 1.9.0 with the following fixes: - Microsoft.AI.MachineLearning NuGet Package Fixes - Bug fix for the issue that fails GPU execution if the executable is on the path that contained the unicode characters - [9229](https://github.com/microsoft/onnxruntime/pull/9229). - Bug fix for the NuGet package to be installed on UWP apps with 1.9 - [9182](https://github.com/microsoft/onnxruntime/pull/9182). - Bug fix for OpenVino EP Python API- [9166](https://github.com	Low	10/5/2021
v1.9.0	# Announcements * GCC version < 7 is no longer supported * CMAKE_SYSTEM_PROCESSOR needs be set when cross-compiling on Linux because pytorch cpuinfo was introduced as a dependency for ARM big.LITTLE support. Set it to the value of `uname -m` output of your target device. # General * ONNX 1.10 support * opset 15 * ONNX IR 8 (SparseTensor type, model local functionprotos, Optional type not yet fully supported this release) * Improved documentation of [C/C++ APIs](https://onnxruntime	Low	9/23/2021
v1.8.2	This is a minor patch release on [1.8.1](https://github.com/microsoft/onnxruntime/releases/tag/v1.8.1) with the following changes: ## Inference * Fix a crash issue when optimizing `Conv->Add->Relu` for CUDA EP * ORT Mobile updates * Change [Pre-built iOS package](https://onnxruntime.ai/docs/how-to/mobile/overview.html#pre-built-package) to static framework to fix App Store submission issue * Support for metadata in ORT format models * Additional operators * Bug fixes ## Know	Low	8/6/2021
v1.8.1	This release contains fixes and key updates for 1.8.0. For all package installation details, please refer to https://www.onnxruntime.ai. ## Inference * Fixes for GPU package loading issues * Fix for memory issue for models with convolution nodes while using the EXHAUSTIVE algo search mode * ORT Mobile updates * CoreML EP enabled in iOS mobile package * Additional operators * Bug fixes * [React Native package](https://www.npmjs.com/package/onnxruntime-react-native) now availabl	Low	7/8/2021
v1.8.0	# Announcements * This release * Building onnxruntime from source now requires a C++ compiler with full C++14 support. * Builds with OpenMP are no longer published. They can still be [built from source](http://www.onnxruntime.ai/docs/how-to/build/inferencing.html#openmp) if needed. The default threadpool option should provide optimal performance for the majority of models. * New dependency for Python package: flatbuffers * Next release (v1.9) * Builds will require C++ 17 compiler	Low	6/3/2021
v1.7.2	This is a minor patch release on [1.7.1](https://github.com/microsoft/onnxruntime/releases/tag/v1.7.1) with the following changes: - Fix [Microsoft.AI.MachineLearning](https://www.nuget.org/packages/Microsoft.AI.MachineLearning/) NuGet package to correctly install on C# UWP projects in Visual Studio.	Low	4/8/2021
v1.7.1	The [Microsoft.ML.OnnxRuntime.Gpu](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.Gpu/) and [Microsoft.ML.OnnxRuntime.Managed](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.Managed/) packages are uploaded to Nuget.org. Please note the version numbers for the Microsoft.ML.OnnxRuntime.Managed package.	Low	3/4/2021
v1.7.0	# Announcements Starting from this release, all ONNX Runtime CPU packages are now built without OpenMP. A version with OpenMP is available on Nuget ([Microsoft.ML.OnnxRuntime.OpenMP](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.OpenMP)) and PyPi ([onnxruntime-openmp](https://pypi.org/project/onnxruntime-openmp/)). Please report any issues in [GH Issues](https://github.com/microsoft/onnxruntime/issues). Note: The 1.7.0 GPU package is uploaded on [this Azure DevOps Feed](htt	Low	3/3/2021
v1.6.0	# Announcements * OpenMP will be disabled in future official builds (build option will still be available). A NoOpenMP version of ONNX Runtime is now available with this release on [Nuget](http://nuget.org/packages/Microsoft.ML.OnnxRuntime.NoOpenMP) and [PyPi](https://pypi.org/project/onnxruntime/) for C/C++/C#/Python users. * In the next release, MKL-ML, openblas, and jemallac build options will be removed, and the [Microsoft.ML.OnnxRuntime.MKLML](https://www.nuget.org/packages/Micr	Low	12/11/2020
v1.5.3	This is a minor patch release on [1.5.2](https://github.com/Microsoft/onnxruntime/releases/tag/v1.5.2) with the following changes: * Fix shared provider unload crash #5553 * Minor minimal build header fix	Low	10/29/2020
v1.5.2	This is a minor patch release on [1.5.1](https://github.com/Microsoft/onnxruntime/releases/tag/v1.5.1) with the following changes: * Remove dependency on cudnn64_7.dll for GPU C# nuget: https://github.com/microsoft/onnxruntime/pull/5386 * Add config keys header file in the packages for Linux and Mac: https://github.com/microsoft/onnxruntime/pull/5388 * Add flatbuffers verifier for ORT format buffer: https://github.com/microsoft/onnxruntime/pull/5378 * Use official flatbuffers v1.12: https://	Low	10/15/2020
orttraining_rc3.1	Fixes issue discovered during validation. Changes: - https://github.com/microsoft/onnxruntime/pull/5350	Low	10/8/2020
orttraining_rc3	See: https://github.com/microsoft/onnxruntime/releases/tag/v1.5.1	Low	9/30/2020

Dependencies & License Audit

Loading dependencies...

Similar Packages

onnxruntime-javaA type-safe, lightweight, modern, and performant binding Java binding of Microsoft's ONNX Runtimev1.26.0-1

transformersTransformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.v5.10.1

modular-image-classification-frameworkA modular deep learning framework for training and evaluating image classification models on datasets like CIFAR-10 and MNIST. Supports configurable CNN architectures, automated training, and performamain@2026-05-28

OriginDLImplement a Pytorch-like DL library in C++ from scratch, step by stepv1.0.0

cONNXrPure C ONNX runtime with zero dependancies for embedded devices0.0.0

More from microsoft

generative-ai-for-beginners21 Lessons, Get Started Building with Generative AI

autogenA programming framework for agentic AI

playwright-mcpPlaywright MCP server

semantic-kernelIntegrate cutting-edge LLM technology quickly and easily into your apps

More in Frameworks

langchainThe agent engineering platform

deer-flowAn open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

tqdmFast, Extensible Progress Meter

simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.