new-api

Description

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥

README

New API

๐Ÿฅ Next-Generation LLM Gateway and AI Asset Management System

简体中文 | 繁體中文 | English | Français | 日本語


Quick Start • Key Features • Deployment • Documentation • Help

๐Ÿ“ Project Description


๐Ÿค Trusted Partners

In no particular order

Cherry Studio Aion UI Peking University UCloud Alibaba Cloud IO.NET


๐Ÿ™ Special Thanks

Thanks to JetBrains for providing a free open-source development license for this project.


🚀 Quick Start

Using Docker Compose (Recommended)

# Clone the project
git clone https://github.com/QuantumNous/new-api.git
cd new-api

# Edit docker-compose.yml configuration
nano docker-compose.yml

# Start the service
docker-compose up -d

Using Docker Commands

# Pull the latest image
docker pull calciumion/new-api:latest

# Using SQLite (default)
docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

# Using MySQL
docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

💡 Tip: -v ./data:/data saves data in the data folder of the current directory. You can also change it to an absolute path, e.g. -v /your/custom/path:/data


🎉 After deployment is complete, visit http://localhost:3000 to get started!

📖 For more deployment methods, please refer to the Deployment Guide
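Once the gateway is up, any OpenAI-compatible client can talk to it. The sketch below constructs a standard chat-completion request body; the token and model name are placeholders (a real token is created in the New API console, and the model must be one your channels expose). It only builds and prints the JSON rather than sending it, so it does not depend on a running instance.

```python
import json

# Hypothetical values: replace with your gateway URL and a token
# created in the New API console.
BASE_URL = "http://localhost:3000/v1"
API_KEY = "sk-your-token"

# Standard OpenAI-compatible chat-completion request body.
payload = {
    "model": "gpt-4o-mini",  # any model name your channels expose
    "messages": [
        {"role": "user", "content": "Hello from New API!"},
    ],
}

# This body would be POSTed to f"{BASE_URL}/chat/completions" with
# header {"Authorization": f"Bearer {API_KEY}"}; shown here without
# sending so the sketch stays network-free.
body = json.dumps(payload)
print(body)
```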


📚 Documentation

Quick Navigation:

Category Link
🚀 Deployment Guide Installation Documentation
⚙️ Environment Configuration Environment Variables
📡 API Documentation API Documentation
❓ FAQ FAQ
💬 Community Interaction Communication Channels

✨ Key Features

For detailed features, please refer to Features Introduction

🎨 Core Functions

Feature Description
🎨 New UI Modern user interface design
🌐 Multi-language Supports Simplified Chinese, Traditional Chinese, English, French, Japanese
🔄 Data Compatibility Fully compatible with the original One API database
📈 Data Dashboard Visual console and statistical analysis
🔒 Permission Management Token grouping, model restrictions, user management

💰 Payment and Billing

  • ✅ Online recharge (EPay, Stripe)
  • ✅ Pay-per-use model pricing
  • ✅ Cache billing support (OpenAI, Azure, DeepSeek, Claude, Qwen, and all supported models)
  • ✅ Flexible billing policy configuration

๐Ÿ” Authorization and Security

  • 😈 Discord authorization login
  • 🤖 LinuxDO authorization login
  • 📱 Telegram authorization login
  • 🔑 OIDC unified authentication
  • 🔍 Key quota query (with neko-api-key-tool)

🚀 Advanced Features

Intelligent Routing:

  • โš–๏ธ Channel weighted random
  • ๐Ÿ”„ Automatic retry on failure
  • ๐Ÿšฆ User-level model rate limiting
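The weighted-random idea can be sketched in a few lines. The channel names and weights below are hypothetical (real ones are configured in the New API console), and this is an illustration of the technique, not the project's actual Go implementation.

```python
import random

# Hypothetical channels: (name, weight). Higher weight => picked more often.
channels = [("openai-primary", 5), ("openai-backup", 1), ("azure", 2)]

def pick_channel(channels, rng=random):
    """Return one channel name with probability proportional to its weight."""
    names, weights = zip(*channels)
    return rng.choices(names, weights=weights, k=1)[0]

# Sample many picks: "openai-primary" should dominate at roughly 5:1:2.
counts = {name: 0 for name, _ in channels}
for _ in range(1000):
    counts[pick_channel(channels)] += 1
print(counts)
```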

Format Conversion:

  • 🔄 OpenAI Compatible ⇄ Claude Messages
  • 🔄 OpenAI Compatible → Google Gemini
  • 🔄 Google Gemini → OpenAI Compatible (text only; function calling not supported yet)
  • 🚧 OpenAI Compatible ⇄ OpenAI Responses (in development)
  • 🔄 Thinking-to-content functionality
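As an illustration of the first conversion direction, here is a minimal sketch (not New API's actual code) of mapping an OpenAI-style chat request onto the Claude Messages shape, where the system prompt moves to a top-level `system` field and `max_tokens` is required:

```python
def openai_to_claude(req, default_max_tokens=1024):
    """Rough sketch: OpenAI chat-completion body -> Claude Messages body."""
    system_parts = [m["content"] for m in req["messages"] if m["role"] == "system"]
    return {
        "model": req["model"],
        # Claude takes the system prompt as a top-level field, not a message.
        "system": "\n".join(system_parts) or None,
        "messages": [m for m in req["messages"] if m["role"] != "system"],
        # Claude requires max_tokens; fall back to an assumed default.
        "max_tokens": req.get("max_tokens", default_max_tokens),
    }

openai_req = {
    "model": "claude-3-5-sonnet",
    "messages": [
        {"role": "system", "content": "Be terse."},
        {"role": "user", "content": "Hi"},
    ],
}
claude_req = openai_to_claude(openai_req)
print(claude_req["system"])  # Be terse.
```

A production converter also has to map tool calls, images, and streaming chunks; this sketch covers only the message-shape difference.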

Reasoning Effort Support:

View detailed configuration

OpenAI series models:

  • o3-mini-high - High reasoning effort
  • o3-mini-medium - Medium reasoning effort
  • o3-mini-low - Low reasoning effort
  • gpt-5-high - High reasoning effort
  • gpt-5-medium - Medium reasoning effort
  • gpt-5-low - Low reasoning effort

Claude thinking models:

  • claude-3-7-sonnet-20250219-thinking - Enable thinking mode

Google Gemini series models:

  • gemini-2.5-flash-thinking - Enable thinking mode
  • gemini-2.5-flash-nothinking - Disable thinking mode
  • gemini-2.5-pro-thinking - Enable thinking mode
  • gemini-2.5-pro-thinking-128 - Enable thinking mode with thinking budget of 128 tokens
  • You can also append -low, -medium, or -high to any Gemini model name to request the corresponding reasoning effort (no extra thinking-budget suffix needed).
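A minimal sketch of how such a suffix convention can be parsed (a hypothetical helper, not the gateway's actual routine): strip a trailing -low/-medium/-high to recover the base model name and the requested effort.

```python
EFFORTS = ("low", "medium", "high")

def split_effort(model):
    """Split 'gpt-5-high' -> ('gpt-5', 'high'); leave other names untouched."""
    base, _, suffix = model.rpartition("-")
    if suffix in EFFORTS:
        return base, suffix
    return model, None

print(split_effort("gpt-5-high"))      # ('gpt-5', 'high')
print(split_effort("gemini-2.5-pro"))  # ('gemini-2.5-pro', None)
```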

🤖 Model Support

For details, please refer to API Documentation - Relay Interface

Model Type Description Documentation
🤖 OpenAI-Compatible OpenAI compatible models Documentation
🤖 OpenAI Responses OpenAI Responses format Documentation
🎨 Midjourney-Proxy Midjourney-Proxy(Plus) Documentation
🎵 Suno-API Suno API Documentation
🔄 Rerank Cohere, Jina Documentation
💬 Claude Claude Messages format Documentation
🌐 Gemini Google Gemini format Documentation
🔧 Dify ChatFlow mode -
🎯 Custom Supports complete call address -

📡 Supported Interfaces

View complete interface list

🚢 Deployment

Tip

Latest Docker image: calciumion/new-api:latest

📋 Deployment Requirements

Component Requirement
Local database SQLite (Docker deployments must mount the /data directory)
Remote database MySQL ≥ 5.7.8 or PostgreSQL ≥ 9.6
Container engine Docker / Docker Compose

โš™๏ธ Environment Variable Configuration

Common environment variable configuration
Variable Name Description Default Value
SESSION_SECRET Session secret (required for multi-machine deployment) -
CRYPTO_SECRET Encryption secret (required for Redis) -
SQL_DSN Database connection string -
REDIS_CONN_STRING Redis connection string -
STREAMING_TIMEOUT Streaming timeout (seconds) 300
STREAM_SCANNER_MAX_BUFFER_MB Max per-line buffer (MB) for the stream scanner; increase when upstream sends huge image/base64 payloads 64
MAX_REQUEST_BODY_MB Max request body size (MB, counted after decompression; prevents huge requests/zip bombs from exhausting memory). Exceeding it returns 413 32
AZURE_DEFAULT_API_VERSION Azure API version 2025-04-01-preview
ERROR_LOG_ENABLED Error log switch false
PYROSCOPE_URL Pyroscope server address -
PYROSCOPE_APP_NAME Pyroscope application name new-api
PYROSCOPE_BASIC_AUTH_USER Pyroscope basic auth user -
PYROSCOPE_BASIC_AUTH_PASSWORD Pyroscope basic auth password -
PYROSCOPE_MUTEX_RATE Pyroscope mutex sampling rate 5
PYROSCOPE_BLOCK_RATE Pyroscope block sampling rate 5
HOSTNAME Hostname tag for Pyroscope new-api

📖 Complete configuration: Environment Variables Documentation
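SQL_DSN uses the Go MySQL driver's DSN shape shown in the Docker examples above, user:password@tcp(host:port)/database. A small sketch with hypothetical credentials and host:

```python
# Hypothetical values; substitute your own database settings.
user, password = "root", "123456"
host, port, db = "db.internal", 3306, "new-api"

# Go-style MySQL DSN as expected by SQL_DSN.
sql_dsn = f"{user}:{password}@tcp({host}:{port})/{db}"
print(sql_dsn)  # root:123456@tcp(db.internal:3306)/new-api
```

Note that inside a container, localhost refers to the container itself, so the host part must be a name or IP the container can actually reach.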

🔧 Deployment Methods

Method 1: Docker Compose (Recommended)

# Clone the project
git clone https://github.com/QuantumNous/new-api.git
cd new-api

# Edit configuration
nano docker-compose.yml

# Start service
docker-compose up -d

Method 2: Docker Commands

Using SQLite:

docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

Using MySQL:

docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

💡 Path explanation:

  • ./data:/data - relative path; data is saved in the data folder of the current directory
  • You can also use an absolute path, e.g. /your/custom/path:/data

Method 3: BaoTa Panel
  1. Install BaoTa Panel (version ≥ 9.2.0)
  2. Search for New-API in the application store
  3. One-click installation

📖 Tutorial with images

โš ๏ธ Multi-machine Deployment Considerations

Warning

  • Must set SESSION_SECRET - otherwise login state will be inconsistent across nodes
  • Deployments sharing Redis must also set CRYPTO_SECRET - otherwise encrypted data cannot be decrypted
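One convenient way to produce strong values for both secrets (an assumed workflow, not a project requirement) is a few random hex bytes per variable, generated once and then shared across all nodes:

```python
import secrets

# Generate one distinct 32-byte (64 hex chars) value per variable;
# reuse the same values on every node of the deployment.
session_secret = secrets.token_hex(32)
crypto_secret = secrets.token_hex(32)
print(f"SESSION_SECRET={session_secret}")
print(f"CRYPTO_SECRET={crypto_secret}")
```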

🔄 Channel Retry and Cache

Retry configuration: Settings โ†’ Operation Settings โ†’ General Settings โ†’ Failure Retry Count

Cache configuration:

  • REDIS_CONN_STRING: Redis cache (recommended)
  • MEMORY_CACHE_ENABLED: Memory cache

🔗 Related Projects

Upstream Projects

Project Description
One API Original project base
Midjourney-Proxy Midjourney interface support

Supporting Tools

Project Description
neko-api-key-tool Key quota query tool
new-api-horizon High-performance optimized build of New API

💬 Help Support

📖 Documentation Resources

Resource Link
📘 FAQ FAQ
💬 Community Interaction Communication Channels
🐛 Issue Feedback Issue Feedback
📚 Complete Documentation Official Documentation

๐Ÿค Contribution Guide

Welcome all forms of contribution!

  • ๐Ÿ› Report Bugs
  • ๐Ÿ’ก Propose New Features
  • ๐Ÿ“ Improve Documentation
  • ๐Ÿ”ง Submit Code

📜 License

This project is licensed under the GNU Affero General Public License v3.0 (AGPLv3).

This is an open-source project developed based on One API (MIT License).

If your organization's policies do not permit the use of AGPLv3-licensed software, or if you wish to avoid the open-source obligations of AGPLv3, please contact us at: support@quantumnous.com


🌟 Star History

Star History Chart


💖 Thank you for using New API

If this project is helpful to you, please consider giving us a ⭐️ Star!

Official Documentation • Issue Feedback • Latest Release

Built with โค๏ธ by QuantumNous

Release History

v0.12.14 (High, 4/17/2026)
Improvements: improved user-visible management and top-up logs with a cleaner display, preserved row expansion, and clearer handling of legacy records. Full changelog: https://github.com/QuantumNous/new-api/compare/v0.12.13...v0.12.14

v0.12.10 (High, 4/15/2026)
New features: added passthrough support for Claude `cache_control` and `speed` options, giving more control over request behavior when using Claude models (#4247). Bug fixes: fixed OpenAI Responses API compatibility issues when `instructions` contain structured content instead of plain text (#4260). Improvements: improved Stripe payment processing to better handle asynchronous webhook events, making delayed payment confirmations more reliable; quota adjustment logs now record…

v0.12.9 (High, 4/13/2026)
New features: subscription cards now show the next quota reset time, making it easier to see when usage limits will refresh (#4181). Bug fixes: Claude requests are now more reliable by avoiding incompatible Top P settings when needed; Azure channels now correctly support the `/v1/responses/compact` route (#4149); editing values in GroupTable no longer forces the cursor to jump to the end on each keystroke (#4208). Improvements: dashboard charts have been refined to present rank…

nightly-20260409 (High, 4/9/2026)
Highlights: comprehensive support for tiered billing, with flexible pricing-expression parsing, custom tool-call quota calculation, channel test support for tiered models, and refined UI configuration options. Docker image: `calciumion/new-api:nightly`

v0.12.5 (High, 4/8/2026)
New features: add support for Minimax image generation relay (#4103). Improvements: redesign group ratio rules with a collapsible layout for better usability; enhance dashboard chart axes and sorting logic; add a convenient copy button next to the API link in the dashboard info panel. Bug fixes: resolve email delivery issues for Outlook and similar providers (#4112).
