freshcrate
Skin:/
Home > Infrastructure > new-api

new-api

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for pers

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥

README

new-api

New API

🍥 Next-Generation LLM Gateway and AI Asset Management System

简体中文 | 繁體中文 | English | Français | 日本語

license release docker GoReportCard

QuantumNous%2Fnew-api | Trendshift
Featured|HelloGitHub New API - All-in-one AI asset management gateway. | Product Hunt

Quick StartKey FeaturesDeploymentDocumentationHelp

📝 Project Description

Important


🤝 Trusted Partners

No particular order

Cherry Studio Aion UI Peking University UCloud Alibaba Cloud IO.NET


🙏 Special Thanks

JetBrains Logo

Thanks to JetBrains for providing free open-source development license for this project


🚀 Quick Start

Using Docker Compose (Recommended)

# Clone the project
git clone https://github.com/QuantumNous/new-api.git
cd new-api

# Edit docker-compose.yml configuration
nano docker-compose.yml

# Start the service
docker-compose up -d
Using Docker Commands
# Pull the latest image
docker pull calciumion/new-api:latest

# Using SQLite (default)
docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

# Using MySQL
docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

💡 Tip: -v ./data:/data will save data in the data folder of the current directory, you can also change it to an absolute path like -v /your/custom/path:/data


🎉 After deployment is complete, visit http://localhost:3000 to start using!

📖 For more deployment methods, please refer to Deployment Guide


📚 Documentation

Quick Navigation:

Category Link
🚀 Deployment Guide Installation Documentation
⚙️ Environment Configuration Environment Variables
📡 API Documentation API Documentation
❓ FAQ FAQ
💬 Community Interaction Communication Channels

✨ Key Features

For detailed features, please refer to Features Introduction

🎨 Core Functions

Feature Description
🎨 New UI Modern user interface design
🌍 Multi-language Supports Simplified Chinese, Traditional Chinese, English, French, Japanese
🔄 Data Compatibility Fully compatible with the original One API database
📈 Data Dashboard Visual console and statistical analysis
🔒 Permission Management Token grouping, model restrictions, user management

💰 Payment and Billing

  • ✅ Online recharge (EPay, Stripe)
  • ✅ Pay-per-use model pricing
  • ✅ Cache billing support (OpenAI, Azure, DeepSeek, Claude, Qwen and all supported models)
  • ✅ Flexible billing policy configuration

🔐 Authorization and Security

  • 😈 Discord authorization login
  • 🤖 LinuxDO authorization login
  • 📱 Telegram authorization login
  • 🔑 OIDC unified authentication
  • 🔍 Key quota query usage (with neko-api-key-tool)

🚀 Advanced Features

API Format Support:

Intelligent Routing:

  • ⚖️ Channel weighted random
  • 🔄 Automatic retry on failure
  • 🚦 User-level model rate limiting

Format Conversion:

  • 🔄 OpenAI Compatible ⇄ Claude Messages
  • 🔄 OpenAI Compatible → Google Gemini
  • 🔄 Google Gemini → OpenAI Compatible - Text only, function calling not supported yet
  • 🚧 OpenAI Compatible ⇄ OpenAI Responses - In development
  • 🔄 Thinking-to-content functionality

Reasoning Effort Support:

View detailed configuration

OpenAI series models:

  • o3-mini-high - High reasoning effort
  • o3-mini-medium - Medium reasoning effort
  • o3-mini-low - Low reasoning effort
  • gpt-5-high - High reasoning effort
  • gpt-5-medium - Medium reasoning effort
  • gpt-5-low - Low reasoning effort

Claude thinking models:

  • claude-3-7-sonnet-20250219-thinking - Enable thinking mode

Google Gemini series models:

  • gemini-2.5-flash-thinking - Enable thinking mode
  • gemini-2.5-flash-nothinking - Disable thinking mode
  • gemini-2.5-pro-thinking - Enable thinking mode
  • gemini-2.5-pro-thinking-128 - Enable thinking mode with thinking budget of 128 tokens
  • You can also append -low, -medium, or -high to any Gemini model name to request the corresponding reasoning effort (no extra thinking-budget suffix needed).

🤖 Model Support

For details, please refer to API Documentation - Relay Interface

Model Type Description Documentation
🤖 OpenAI-Compatible OpenAI compatible models Documentation
🤖 OpenAI Responses OpenAI Responses format Documentation
🎨 Midjourney-Proxy Midjourney-Proxy(Plus) Documentation
🎵 Suno-API Suno API Documentation
🔄 Rerank Cohere, Jina Documentation
💬 Claude Messages format Documentation
🌐 Gemini Google Gemini format Documentation
🔧 Dify ChatFlow mode -
🎯 Custom Supports complete call address -

📡 Supported Interfaces

View complete interface list

🚢 Deployment

Tip

Latest Docker image: calciumion/new-api:latest

📋 Deployment Requirements

Component Requirement
Local database SQLite (Docker must mount /data directory)
Remote database MySQL ≥ 5.7.8 or PostgreSQL ≥ 9.6
Container engine Docker / Docker Compose

⚙️ Environment Variable Configuration

Common environment variable configuration
Variable Name Description Default Value
SESSION_SECRET Session secret (required for multi-machine deployment) -
CRYPTO_SECRET Encryption secret (required for Redis) -
SQL_DSN Database connection string -
REDIS_CONN_STRING Redis connection string -
STREAMING_TIMEOUT Streaming timeout (seconds) 300
STREAM_SCANNER_MAX_BUFFER_MB Max per-line buffer (MB) for the stream scanner; increase when upstream sends huge image/base64 payloads 64
MAX_REQUEST_BODY_MB Max request body size (MB, counted after decompression; prevents huge requests/zip bombs from exhausting memory). Exceeding it returns 413 32
AZURE_DEFAULT_API_VERSION Azure API version 2025-04-01-preview
ERROR_LOG_ENABLED Error log switch false
PYROSCOPE_URL Pyroscope server address -
PYROSCOPE_APP_NAME Pyroscope application name new-api
PYROSCOPE_BASIC_AUTH_USER Pyroscope basic auth user -
PYROSCOPE_BASIC_AUTH_PASSWORD Pyroscope basic auth password -
PYROSCOPE_MUTEX_RATE Pyroscope mutex sampling rate 5
PYROSCOPE_BLOCK_RATE Pyroscope block sampling rate 5
HOSTNAME Hostname tag for Pyroscope new-api

📖 Complete configuration: Environment Variables Documentation

🔧 Deployment Methods

Method 1: Docker Compose (Recommended)
# Clone the project
git clone https://github.com/QuantumNous/new-api.git
cd new-api

# Edit configuration
nano docker-compose.yml

# Start service
docker-compose up -d
Method 2: Docker Commands

Using SQLite:

docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

Using MySQL:

docker run --name new-api -d --restart always \
  -p 3000:3000 \
  -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" \
  -e TZ=Asia/Shanghai \
  -v ./data:/data \
  calciumion/new-api:latest

💡 Path explanation:

  • ./data:/data - Relative path, data saved in the data folder of the current directory
  • You can also use absolute path, e.g.: /your/custom/path:/data
Method 3: BaoTa Panel
  1. Install BaoTa Panel (≥ 9.2.0 version)
  2. Search for New-API in the application store
  3. One-click installation

📖 Tutorial with images

⚠️ Multi-machine Deployment Considerations

Warning

  • Must set SESSION_SECRET - Otherwise login status inconsistent
  • Shared Redis must set CRYPTO_SECRET - Otherwise data cannot be decrypted

🔄 Channel Retry and Cache

Retry configuration: Settings → Operation Settings → General Settings → Failure Retry Count

Cache configuration:

  • REDIS_CONN_STRING: Redis cache (recommended)
  • MEMORY_CACHE_ENABLED: Memory cache

🔗 Related Projects

Upstream Projects

Project Description
One API Original project base
Midjourney-Proxy Midjourney interface support

Supporting Tools

Project Description
neko-api-key-tool Key quota query tool
new-api-horizon New API high-performance optimized version

💬 Help Support

📖 Documentation Resources

Resource Link
📘 FAQ FAQ
💬 Community Interaction Communication Channels
🐛 Issue Feedback Issue Feedback
📚 Complete Documentation Official Documentation

🤝 Contribution Guide

Welcome all forms of contribution!

  • 🐛 Report Bugs
  • 💡 Propose New Features
  • 📝 Improve Documentation
  • 🔧 Submit Code

📜 License

This project is licensed under the GNU Affero General Public License v3.0 (AGPLv3).

This is an open-source project developed based on One API (MIT License).

If your organization's policies do not permit the use of AGPLv3-licensed software, or if you wish to avoid the open-source obligations of AGPLv3, please contact us at: support@quantumnous.com


🌟 Star History

Star History Chart


💖 Thank you for using New API

If this project is helpful to you, welcome to give us a ⭐️ Star!

Official DocumentationIssue FeedbackLatest Release

Built with ❤️ by QuantumNous

Release History

VersionChangesUrgencyDate
v1.0.0-rc.10## Highlights v1.0.0-rc.10 focuses on UI polish, relay reliability, and admin workflow fixes. It adds new theme options and subscription balance purchases while tightening logs, channel behavior, webhooks, payments, and provider tool-call handling. ## New Features - New theme customization options add Anthropic and Simple Large presets, configurable serif typography, and an extra-large scaling option. - Subscription billing now supports balance purchases. ## Bug Fixes - Automatically disabled High5/26/2026
v1.0.0-rc.9### Highlights v1.0.0-rc.9 adds subscription balance purchases and introduces a broader UI refresh across settings, channels, logs, themes, and the home page. It also improves relay performance and fixes several channel, webhook, payment, image, and tool-call reliability issues. ### New Features - Added support for purchasing account balance from the subscription flow. - Added an Anthropic-inspired theme preset with configurable serif typography. ### Bug Fixes - Fixed webhook processing failurHigh5/26/2026
v1.0.0-rc.7### New Features - Added support for `request_header` key source (#4903). ### Bug Fixes - Batch fixed multiple UI issues in the default web interface (#4880, #4893, #4817, #4877, #4898, #4876). - Addressed an issue where channel remarks could not be cleared (#4886). - Corrected the filtering behavior for usage logs (#4883). - Fixed validation for unlimited API key quotas in the web interface (#4881). - Enforced access control on header navigation for public modules (#4889). - Applied gHigh5/19/2026
v1.0.0-rc.6## Highlights This release improves observability by recording upstream request IDs in logs and adds a compliance confirmation step for paid features. If your `logs` table is large, pre-apply the `upstream_request_id` column and index with online DDL before upgrading to avoid a potentially long migration. ## New Features - Added upstream request ID tracking in request logs for easier upstream call tracing, while avoiding unintended response header overrides. - Added a compliance confirmation stHigh5/13/2026
v1.0.0-rc.5## Highlights The default dashboard has been refreshed with built-in model performance metrics and a clearer health view. This release also adds DeepChat deeplink support and fixes several default UI, wallet, playground, and ratio-display regressions. ## New Features - Dashboard overview has been redesigned and now includes model performance metrics with a clearer performance health panel. - DeepChat deeplinks are now supported, making it easier to open supported chat links directly in DeepChatHigh5/12/2026
v1.0.0-rc.4## Highlights OpenAI image editing is more reliable: image edit requests now preserve reference image data and related metadata, preventing edits from losing their intended context (#4646). This release also adds model performance badges and fixes several billing and dropdown UI issues. ## New Features - Added model performance badges to model lists so users can compare available models at a glance. ## Bug Fixes - Fixed OpenAI image edit requests so reference images and related edit metHigh5/6/2026
v1.0.0-rc.2## Highlights v1.0.0-rc.2 officially opens up switching between the new and classic frontend, making it easier to test the redesigned UI, compare workflows, and fall back instantly when needed. This RC also refreshes key console pages and fixes several web issues affecting email binding, tiered pricing, API key grouping, and custom Vertex base URLs. ## New Features - Added a built-in switch between the new and classic frontend, so teams can adopt the redesigned UI gradually without losing aHigh4/30/2026
v0.13.2## New Features - Model sync now detects upstream models more accurately, refreshes pricing automatically from provider pricing endpoints, shows removed upstream models in the fetch-models dialog, and keeps pricing available during sync delays (#4452). - User records now include account creation and last login timestamps for easier auditing and administration (#4450). - Added configurable model matching for Ali native Messages APIs, making it easier to route requests to compatible upstream modelHigh4/27/2026
v0.12.15### New Features - Added Waffo Pay support for account top-ups and payment workflows (#4089). ### Bug Fixes - Fixed a recharge card issue in the top-up flow that could break parts of the card interface (#4388). ### Improvements - Added support for Gemini's `IncludeServerSideToolInvocations` setting in `ToolConfig`, improving compatibility with newer Gemini tool-calling behavior (#4311). - Improved Codex channel reliability and usability by refreshing credentials for auto-disabled channels, usiHigh4/22/2026
v0.12.14## Improvements - Improved user-visible management and top-up logs with a cleaner display, preserved row expansion, and clearer handling of legacy records. --- **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.12.13...v0.12.14High4/17/2026
v0.12.13This release improves admin workflows for top-ups and makes account status changes apply more consistently. ### Bug Fixes - Disabling a user now takes effect immediately instead of being delayed by stale cached account data. ### Improvements - Top-up logs now include additional admin-only audit details to make review and troubleshooting easier. ### Breaking Changes - Top-up searches are now limited to a maximum 30-day date range to keep queries responsive and reliable. --- **Full Changelog*High4/17/2026
v0.12.12This release adds support for Claude Opus 4.7, expanding the set of available models in the API. ## New Features - Added support for the Claude Opus 4.7 model, making it available through the API alongside existing model options (#4293). --- ## What's Changed * feat: support claude-opus-4-7 by @prnake in https://github.com/QuantumNous/new-api/pull/4293 * chore(deps): bump github.com/jackc/pgx/v5 from 5.7.1 to 5.9.0 by @dependabot[bot] in https://github.com/QuantumNous/new-api/pull/4294 **FuHigh4/17/2026
v0.12.11### New Features * Added payment method tracking and validation for top-ups. ### Bug Fixes * Improved handling of empty string content during OpenAI to Claude message conversion. --- **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.12.10...v0.12.11High4/16/2026
v0.12.10## New Features - Added passthrough support for Claude `cache_control` and `speed` options, giving you more control over request behavior when using Claude models (#4247). ## Bug Fixes - Fixed OpenAI Responses API compatibility issues when `instructions` contain structured content instead of plain text (#4260). ## Improvements - Improved Stripe payment processing to better handle asynchronous webhook events, making delayed payment confirmations more reliable. - Quota adjustment logs now recordHigh4/15/2026
v0.12.9## New Features - Subscription cards now show the next quota reset time, making it easier to see when usage limits will refresh (#4181). ## Bug Fixes - Claude requests are now more reliable by avoiding incompatible Top P settings when needed. - Azure channels now correctly support the `/v1/responses/compact` route (#4149). - Editing values in GroupTable no longer forces the cursor to jump to the end on each keystroke (#4208). ## Improvements - Dashboard charts have been refined to present rankHigh4/13/2026
v0.12.8### Bug Fixes * Correctly report the stream status in error logs instead of defaulting to false (#4195). --- ## What's Changed * fix: isStream status in error logs instead of hardcoded false by @feitianbubu in https://github.com/QuantumNous/new-api/pull/4195 * chore(deps): bump axios from 1.13.5 to 1.15.0 in /web by @dependabot[bot] in https://github.com/QuantumNous/new-api/pull/4201 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.12.7...v0.12.8Medium4/12/2026
v0.12.7## Improvements - Quota management now uses an amount-first input flow and applies changes atomically, making balance adjustments simpler and more reliable. - Model pricing errors now use cleaner, role-aware messages so it is easier to understand and resolve failed pricing actions. --- **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.12.6...v0.12.7Medium4/12/2026
nightly-20260409### Highlights / 重点更新 This release introduces comprehensive support for **tiered billing (阶梯计费)**. It allows for flexible pricing expressions, detailed tool call quota calculations, and channel test support for tiered models. 本次发布重点增加了对**阶梯计费**的全面支持,支持灵活的计费表达式解析、自定义工具调用额度计算,并完善了UI配置项和渠道测试支持。 **Docker image:** `calciumion/new-api:nightly` **Docker 镜像:** `calciumion/new-api:nightly` <img width="500" alt="" src="https://github.com/user-attachments/assets/893d5789-64b3-414d-8e71-cc178b72High4/9/2026
v0.12.6### New Features * Fill in custom fields for vllm-omini (#4154) ### Bug Fixes * Accept string usage values in Alibaba task polling (#4155) * Use correct endpoint for coding plan image generation in Zhipu 4V (#4146) * Prefer explicit pricing for compact models (#4156) * Fix document rendering (#4153) ### Improvements * Refine PR template and add PR submission checks (#4076) --- ## What's Changed * fix(zhipu_4v): use correct endpoint for coding plan image generation by @NyaMisty in https://gitMedium4/9/2026
v0.12.5### New Features * Add support for Minimax image generation relay (#4103). ### Improvements * Redesign group ratio rules with a collapsible layout for better usability. * Enhance dashboard chart axes and sorting logic. * Add a convenient copy button next to the API link in the dashboard info panel. ### Bug Fixes * Resolve email delivery issues for Outlook and similar providers (#4112). <img width="2848" height="966" alt="" src="https://github.com/user-attachments/assets/8cfdd943-3bHigh4/8/2026
v0.12.4## New Features * Added an `IncludeModelName` option to channel affinity rules for per-model affinity tracking (#3488) * Added an ErrorBoundary component to the web UI to prevent full-page crashes (#3350) * Added support for `Duration` in `TaskSubmitReq` (#4068) * Added admin user analytics to the dashboard ## Bug Fixes * Fixed pricing filtering based on usable groups (#4123) * Fixed an issue where Claude streaming interruptions incorrectly overwrote usage data instead of preserving cache billiMedium4/8/2026
v0.12.3This is a small polish release focused on making configuration easier to navigate by simplifying the settings layout and consolidating model pricing controls. ## Improvements - Refined the settings interface with a unified model pricing view and a cleaner tab structure to make configuration easier to navigate. --- ## What's Changed * feat(token): add batch API for fetching token keys by @RedwindA in https://github.com/QuantumNous/new-api/pull/4114 **Full Changelog**: https://github.com/QuanMedium4/7/2026
v0.12.2### New Features * Add support for PDF conversion between OpenAI and Claude APIs. * Support differential billing for Seedance 2.0 based on video vs. no-video input. * Enhance max_tokens handling and input sanitization in the playground (#4106). * Enhance stream status display with error tooltips in usage logs. ### Bug Fixes * Fix usage calculation issues when converting between Claude and OpenAI APIs by emitting `message_delta` for the final stream chunk (#4090). * Fix Gemini stream detection bMedium4/6/2026
v0.12.1This release adds Seedance 2.0 video support and broadens media compatibility across providers. It also makes failures easier to understand with clearer Seedance and performance-related error messages. ## New Features - Added Seedance 2.0 video generation API support with duration control in seconds, clearer failure reporting, and better prompt handling for single-text requests (#4042). - Added HEIC/HEIF image support, including Gemini channel compatibility and more reliable handling of HEIF imMedium4/2/2026
v0.12.0v0.12.0 adds Wan 2.7 image generation support and streamlines channel setup in the admin UI. It also improves billing accuracy and fixes several model and dashboard issues. ## New Features - Added support for Wan 2.7 image generation, including control over how many images are generated per request (#3526). ## Bug Fixes - Fixed image-generation billing and usage statistics so request counts and costs are reported more accurately without double-counting (#3512). - Fixed a UI issue that caused tMedium4/1/2026
v0.12.0-alpha.2## Release Highlights This release includes minor improvements to the `EditChannelModal` component to enhance clipboard handling reliability. ## Bug Fixes - **EditChannelModal:** Enhance clipboard handling with error checks (`670abee`) --- **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.12.0-alpha.1...v0.12.0-alpha.2Medium3/31/2026
v0.12.0-alpha.1## Highlights This release introduces a new clipboard magic string feature for quick channel creation and includes important security and UI improvements. ## New Features * Add clipboard magic string for quick channel creation from token copy ## Bug Fixes * 修复消耗分布图表悬浮时滚动条闪烁 (Fix dashboard scrollbar flickering) (#3474) * Fix xAI MaxTokens mapping for grok-3-mini model ## Improvements * Security improvements * Enhance footer layout and styling * Update Traditional Chinese README with missing coMedium3/31/2026
v0.11.9## Release Notes for v0.11.9 ### New Features * Add slide-in animations and update translations for new UI elements * Prevent metadata from overriding model fields (#3461) * Record stream interruption reasons via StreamStatus * Expose i18n instance to the global window object for easier access ### Bug Fixes * Restore pre-3400 OpenRouter billing semantics (#3438) * Restore doubao coding plan deprecation and regex ignored models lost during conflict resolution * Preserve cache usage in openai-toMedium3/31/2026
v0.11.9-alpha.1## What's Changed * fix: disable doubao coding plan selection by @seefs001 in https://github.com/QuantumNous/new-api/pull/3379 * fix: oauth bind callback handling by @seefs001 in https://github.com/QuantumNous/new-api/pull/3393 * feat: support regex-prefixed ignored upstream models by @seefs001 in https://github.com/QuantumNous/new-api/pull/3381 * adjuct default settings by @seefs001 in https://github.com/QuantumNous/new-api/pull/3335 * fix: honor channel affinity skip-retry when channel isMedium3/23/2026
v0.11.8## What's Changed * fix: normalize generic oauth bearer token type by @seefs001 in https://github.com/QuantumNous/new-api/pull/3359 * feat: add server log file management to performance settings by @RedwindA in https://github.com/QuantumNous/new-api/pull/3369 * feat: Add support for counting cache-hit tokens in llama.cpp by @wenyifancc in https://github.com/QuantumNous/new-api/pull/3357 * docs: 完善宝塔面板部署教程并修复链接错误 by @lcq225 in https://github.com/QuantumNous/new-api/pull/3360 ## New ContribLow3/22/2026
v0.11.7## What's Changed * chore: refine PR template by @seefs001 in https://github.com/QuantumNous/new-api/pull/3287 * feat: 支持通过环境变量配置搜索接口限流参数 by @ywandy in https://github.com/QuantumNous/new-api/pull/3313 * fix: 修正 Codex free 账号用量显示到每周窗口 by @Honghurumeng in https://github.com/QuantumNous/new-api/pull/3316 * feat(waffo): Waffo payment gateway integration by @zhongyuanzhao-alt in https://github.com/QuantumNous/new-api/pull/3293 * fix: redirect OAuth login in current page by @seefs001 in https://gLow3/19/2026
v0.11.6-patch.1Release v0.11.6-patch.1Low3/17/2026
nightly-20260317-44fc10bfeat: implement tiered billing expression evaluation Docker image: calciumion/new-api:nightly <img width="500" alt="" src="https://github.com/user-attachments/assets/893d5789-64b3-414d-8e71-cc178b724ac6" />Low3/17/2026
v0.11.6参数覆盖记录到日志 错误日志恢复tooltip显示 ## What's Changed * feat: params override log by @seefs001 in https://github.com/QuantumNous/new-api/pull/3285 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.5...v0.11.6Low3/17/2026
v0.11.5## What's Changed * Round remaining balance by @KiGamji in https://github.com/QuantumNous/new-api/pull/3233 * enhance channel key viewing by @seefs001 in https://github.com/QuantumNous/new-api/pull/3257 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.4...v0.11.5Low3/14/2026
v0.11.4-patch.1**Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.4...v0.11.4-patch.1Low3/14/2026
v0.11.4## What's Changed * Feature/param override wildcard path by @seefs001 in https://github.com/QuantumNous/new-api/pull/3129 * fix: fetch model add header passthrough rule key check by @feitianbubu in https://github.com/QuantumNous/new-api/pull/3130 * feats: repair the thinking of claude to openrouter convert by @nekohy in https://github.com/QuantumNous/new-api/pull/3120 * feat: kling cost quota support use FinalUnitDeduction as totalToken by @feitianbubu in https://github.com/QuantumNous/new-aLow3/12/2026
v0.11.4-alpha.5请求头覆盖新增追加功能 ## What's Changed * fix: kling risk fail return openAIVideo error by @feitianbubu in https://github.com/QuantumNous/new-api/pull/3148 * fix: add explicit docker-compose networks by @pigletfly in https://github.com/QuantumNous/new-api/pull/3147 * feat:support $keep_only_declared and deduped $append for header override by @seefs001 in https://github.com/QuantumNous/new-api/pull/3182 ## New Contributors * @pigletfly made their first contribution in https://github.com/QuantumNoLow3/9/2026
v0.11.4-alpha.4为渠道参数覆盖可视化规则提供拖拽排序支持 后端Token相关API变更 ## What's Changed * 为渠道参数覆盖可视化规则提供拖拽排序支持 by @somnifex in https://github.com/QuantumNous/new-api/pull/3166 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.4-alpha.3...v0.11.4-alpha.4Low3/8/2026
v0.11.4-alpha.3## Main Changes 重构日志价格显示,默认隐藏倍率逻辑,可在列设置中切换 Refactor the log price display: hide the multiplier logic by default, with an option to toggle it in the column settings. <img width="3552" height="774" alt="" src="https://github.com/user-attachments/assets/1f8ca60c-3589-483a-bcf5-faa27c10e01e" /> **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.4-alpha.2...v0.11.4-alpha.3Low3/6/2026
v0.11.4-alpha.2## Main Changes 模型广场增加缓存等价格显示 Unify pricing labels and expand marketplace pricing display <img width="3166" height="1202" src="https://github.com/user-attachments/assets/243c9bb8-5d27-4378-914d-2194a952f690" /> **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.4-alpha.1...v0.11.4-alpha.2Low3/6/2026
v0.11.4-alpha.1## Main Changes 完善可视化价格设置,为后续阶梯定价的开发做准备 Improve the visual pricing settings to prepare for subsequent development of tiered pricing. <img width="4022" height="1742" src="https://github.com/user-attachments/assets/f36f0637-79ff-43bf-983f-778b6a2d6f73" /> ## What's Changed * fix(relay): skip retries for bad response body errors by @seefs001 in https://github.com/QuantumNous/new-api/pull/3151 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.2-patch.2...v0.11.4-Low3/6/2026
v0.11.2-patch.2优化kling模型计费 ## What's Changed * feat: kling cost quota support use FinalUnitDeduction as totalToken by @feitianbubu in https://github.com/QuantumNous/new-api/pull/2769 * fix: If top_p is not provided, ignore it by @seefs001 in https://github.com/QuantumNous/new-api/pull/3141 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.2-patch.1...v0.11.2-patch.2Low3/6/2026
v0.11.2-patch.1gpt-5.4补全倍率 ## What's Changed * Feature/param override wildcard path by @seefs001 in https://github.com/QuantumNous/new-api/pull/3129 * fix: fetch model add header passthrough rule key check by @feitianbubu in https://github.com/QuantumNous/new-api/pull/3130 * feats: repair the thinking of claude to openrouter convert by @nekohy in https://github.com/QuantumNous/new-api/pull/3120 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.2...v0.11.2-patch.1Low3/6/2026
v0.11.2## Main changes 增加定时获取上游模型更新功能 异步任务在没有设置价格时抛出错误 渠道参数覆盖功能增强,提供可视化编辑,整合请求头覆盖和透传功能 Add a scheduled feature to fetch upstream model updates. Throw an error in the asynchronous task when the price is not set. Enhance the channel parameter override feature by integrating request header overrides and passthrough functionality. ## Enhance the channel parameter override feature 整合请求头覆盖功能,可视化编辑,并提供基础模板 <img width="2216" height="1560" alt="1772641010569" src="https://github.com/user-attachmentLow3/4/2026
v0.11.2-alpha.2优化检测上游模型功能 美化模型广场 Optimize the upstream model fetch feature Beautify the Model Marketplace ## What's Changed * fix: use default model price for radio price model by @feitianbubu in https://github.com/QuantumNous/new-api/pull/3090 * Fix/auto fetch upstream model tips by @seefs001 in https://github.com/QuantumNous/new-api/pull/3096 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.2-alpha.1...v0.11.2-alpha.2Low3/3/2026
v0.11.2-alpha.1feat: auto fetch upstream models 新增自动检测上游模型列表变动功能 <img width="1386" height="976" alt="image" src="https://github.com/user-attachments/assets/3effc3d4-8d5c-4987-b7d9-861a4943a839" /> <img width="1224" height="512" alt="image" src="https://github.com/user-attachments/assets/771b1670-b36e-445e-89cd-fde5d389d1ad" /> ## What's Changed * fix: change token model_limits column from varchar(1024) to text by @RedwindA in https://github.com/QuantumNous/new-api/pull/3037 * Return error when modeLow3/2/2026
v0.11.1-alpha.7Support `aionui://` deep link protocol 支持AionUi项目一键填入 <img width="2374" height="960" alt="1772457674576" src="https://github.com/user-attachments/assets/9f53bf2c-8670-4c02-a3a7-b2b6c169e269" /> **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.1-alpha.6...v0.11.1-alpha.7Low3/2/2026
v0.11.1-alpha.6## What's Changed * Revert v0.11.1-alpha.5 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.1-alpha.4...v0.11.1-alpha.6Low3/2/2026
v0.11.1-alpha.4## What's Changed * fix: aws text content blocks must be non-empty by @seefs001 in https://github.com/QuantumNous/new-api/pull/3077 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.1-alpha.3...v0.11.1-alpha.4Low3/2/2026
v0.11.1-alpha.3支持cc-switch一键填入 <img alt="1772378663623" src="https://github.com/user-attachments/assets/e0b8ece0-b137-4853-a01c-fea43c40d151" /> ## What's Changed * chore(deps): bump axios from 1.12.0 to 1.13.5 in /web by @dependabot[bot] in https://github.com/QuantumNous/new-api/pull/3061 * chore(deps): bump tar and electron-builder in /electron by @dependabot[bot] in https://github.com/QuantumNous/new-api/pull/2964 * build(deps-dev): bump lodash from 4.17.21 to 4.17.23 in /electron by @dependabot[boLow3/1/2026
v0.11.1-alpha.2参数覆盖增加UI可视化操作,支持更多操作。 渠道亲和支持绑定参数覆盖,适用于codex/claude code等的请求头透传。 ## What's Changed * feat: improve channel override ui/ux by @seefs001 in https://github.com/QuantumNous/new-api/pull/3009 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.1-alpha.1...v0.11.1-alpha.2Low2/28/2026
v0.11.1-alpha.1完善veo适配生成(Vertex,AI Studio) 修复订阅支付完成后的跳转路径错误 Improve Veo adaptation generation (Vertex, AI Studio) Fix incorrect redirection path after subscription payment completion ## What's Changed * fix: redirect subscription payment return to user-accessible page by @seefs001 in https://github.com/QuantumNous/new-api/pull/3052 * feat: add image model to supported image presets by @seefs001 in https://github.com/QuantumNous/new-api/pull/3029 * feat: minimax reasoning_split by @seefs001 in https:Low2/28/2026
v0.11.0## What's Changed * refactor: async task by @Calcium-Ion in https://github.com/QuantumNous/new-api/pull/2985 * Feature/channel retry warning by @seefs001 in https://github.com/QuantumNous/new-api/pull/2987 * fix: claude affinity cache counter by @seefs001 in https://github.com/QuantumNous/new-api/pull/2980 * feat: add missing OpenAI/Claude/Gemini request fields by @seefs001 in https://github.com/QuantumNous/new-api/pull/2971 * feat: codex oauth proxy by @seefs001 in https://github.com/QuantLow2/28/2026
v0.11.0-alpha.9修复Sora接口传图报错`unsupported mimetype (\'application/octet-stream\')` Fix Sora API image upload error `unsupported mimetype ('application/octet-stream')` **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.0-alpha.8...v0.11.0-alpha.9Low2/25/2026
v0.11.0-alpha.8修复GetUserSetting永远无法读取旧版本用户设置的问题 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.0-alpha.7...v0.11.0-alpha.8Low2/24/2026
v0.11.0-alpha.7第三方登录支持优先使用第三方用户名 请求头透传功能优化 加强渠道模型选择防呆设计 Third-party login support, prioritize using third-party username Request header pass-through function optimization Strengthen channel model selection foolproof design ## What's Changed * fix: skip Accept-Encoding during header passthrough (#2214) by @Caisin in https://github.com/QuantumNous/new-api/pull/2997 * feat: move user bindings to dedicated management modal by @seefs001 in https://github.com/QuantumNous/new-api/pull/2993 * Fix/pr 2900 Low2/24/2026
v0.11.0-alpha.6grok 违规扣费完善 ## What's Changed * fix: violation fee check by @seefs001 in https://github.com/QuantumNous/new-api/pull/2994 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.0-alpha.5...v0.11.0-alpha.6Low2/23/2026
v0.11.0-alpha.5修复PostForm下重试出现`multipart: NextPart: EOF`错误 **Full Changelog**: https://github.com/QuantumNous/new-api/compare/v0.11.0-alpha.4...v0.11.0-alpha.5Low2/23/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

llmioLLM API load-balancing gateway. LLM API 负载均衡网关.v0.8.12
cc-relay⚡️ Blazing fast LLMs API Gateway written in Gov0.0.17
ai-guideProvide free, open access to comprehensive AI tools, guides, reviews, and resources to reduce knowledge gaps and empower users.main@2026-06-04
lobehubThe ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effov2.2.2
axonhub⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.v1.0.0-beta2

More in Infrastructure

tensorzeroTensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
planoPlano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.
modelsThis repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.
edgeeOpen-source AI gateway written in Rust, with token compression for Claude Code, Codex... and any other LLM client.