freshcrate
Skin:/
Home > Databases > OpenViking

OpenViking

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need throu

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolving.

README

OpenViking

OpenViking: The Context Database for AI Agents

English / 中文 / 日本語

Website · GitHub · Issues · Docs

👋 Join our Community

📱 Lark Group · WeChat · Discord · X

volcengine%2FOpenViking | Trendshift


Overview

Challenges in Agent Development

In the AI era, data is abundant, but high-quality context is hard to come by. When building AI Agents, developers often face these challenges:

  • Fragmented Context: Memories are in code, resources are in vector databases, and skills are scattered, making them difficult to manage uniformly.
  • Surging Context Demand: An Agent's long-running tasks produce context at every execution. Simple truncation or compression leads to information loss.
  • Poor Retrieval Effectiveness: Traditional RAG uses flat storage, lacking a global view and making it difficult to understand the full context of information.
  • Unobservable Context: The implicit retrieval chain of traditional RAG is like a black box, making it hard to debug when errors occur.
  • Limited Memory Iteration: Current memory is just a record of user interactions, lacking Agent-related task memory.

The OpenViking Solution

OpenViking is an open-source Context Database designed specifically for AI Agents.

We aim to define a minimalist context interaction paradigm for Agents, allowing developers to completely say goodbye to the hassle of context management. OpenViking abandons the fragmented vector storage model of traditional RAG and innovatively adopts a "file system paradigm" to unify the structured organization of memories, resources, and skills needed by Agents.

With OpenViking, developers can build an Agent's brain just like managing local files:

  • Filesystem Management ParadigmSolves Fragmentation: Unified context management of memories, resources, and skills based on a filesystem paradigm.
  • Tiered Context LoadingReduces Token Consumption: L0/L1/L2 three-tier structure, loaded on demand, significantly saving costs.
  • Directory Recursive RetrievalImproves Retrieval Effect: Supports native filesystem retrieval methods, combining directory positioning with semantic search to achieve recursive and precise context acquisition.
  • Visualized Retrieval TrajectoryObservable Context: Supports visualization of directory retrieval trajectories, allowing users to clearly observe the root cause of issues and guide retrieval logic optimization.
  • Automatic Session ManagementContext Self-Iteration: Automatically compresses content, resource references, tool calls, etc., in conversations, extracting long-term memory, making the Agent smarter with use.

Quick Start

Prerequisites

Before starting with OpenViking, please ensure your environment meets the following requirements:

  • Python Version: 3.10 or higher
  • Go Version: 1.22 or higher (Required for building AGFS components)
  • C++ Compiler: GCC 9+ or Clang 11+ (Required for building core extensions)
  • Operating System: Linux, macOS, Windows
  • Network Connection: A stable network connection is required (for downloading dependencies and accessing model services)

1. Installation

Python Package

pip install openviking --upgrade --force-reinstall

Rust CLI (Optional)

curl -fsSL https://raw.githubusercontent.com/volcengine/OpenViking/main/crates/ov_cli/install.sh | bash

Or build from source:

cargo install --git https://github.com/volcengine/OpenViking ov_cli

2. Model Preparation

OpenViking requires the following model capabilities:

  • VLM Model: For image and content understanding
  • Embedding Model: For vectorization and semantic retrieval

Supported VLM Providers

OpenViking supports three VLM providers:

Provider Description Get API Key
volcengine Volcengine Doubao Models Volcengine Console
openai OpenAI Official API OpenAI Platform
litellm Unified access to various third-party models (Anthropic, DeepSeek, Gemini, vLLM, Ollama, etc.) See LiteLLM Providers

💡 Tip:

  • litellm supports unified access to various models. The model field must follow the LiteLLM format specification
  • The system auto-detects common models (e.g., claude-*, deepseek-*, gemini-*, hosted_vllm/*, ollama/*, etc.). For other models, use the full prefix according to LiteLLM format

Provider-Specific Notes

Volcengine (Doubao)

Volcengine supports both model names and endpoint IDs. Using model names is recommended for simplicity:

{
  "vlm": {
    "provider": "volcengine",
    "model": "doubao-seed-2-0-pro-260215",
    "api_key": "your-api-key",
    "api_base": "https://ark.cn-beijing.volces.com/api/v3"
  }
}

You can also use endpoint IDs (found in Volcengine ARK Console:

{
  "vlm": {
    "provider": "volcengine",
    "model": "ep-20241220174930-xxxxx",
    "api_key": "your-api-key",
    "api_base": "https://ark.cn-beijing.volces.com/api/v3"
  }
}
OpenAI

Use OpenAI's official API:

{
  "vlm": {
    "provider": "openai",
    "model": "gpt-4o",
    "api_key": "your-api-key",
    "api_base": "https://api.openai.com/v1"
  }
}

You can also use a custom OpenAI-compatible endpoint:

{
  "vlm": {
    "provider": "openai",
    "model": "gpt-4o",
    "api_key": "your-api-key",
    "api_base": "https://your-custom-endpoint.com/v1"
  }
}
LiteLLM (Anthropic, DeepSeek, Gemini, Qwen, vLLM, Ollama, etc.)

LiteLLM provides unified access to various models. The model field should follow LiteLLM's naming convention. Here we use Claude and Qwen as examples:

Anthropic:

{
  "vlm": {
    "provider": "litellm",
    "model": "claude-3-5-sonnet-20240620",
    "api_key": "your-anthropic-api-key"
  }
}

Qwen (DashScope):

{
  "vlm": {
    "provider": "litellm",
    "model": "dashscope/qwen-turbo", // see https://docs.litellm.ai/docs/providers/dashscope for more details
    "api_key": "your-dashscope-api-key",
    "api_base": "https://dashscope.aliyuncs.com/compatible-mode/v1"
  }
}

💡 Tip for Qwen:

  • For China/Beijing region, use api_base: https://dashscope.aliyuncs.com/compatible-mode/v1
  • For International region, use api_base: https://dashscope-intl.aliyuncs.com/compatible-mode/v1

Common model formats:

Provider Model Example Notes
Anthropic claude-3-5-sonnet-20240620 Auto-detected, uses ANTHROPIC_API_KEY
DeepSeek deepseek-chat Auto-detected, uses DEEPSEEK_API_KEY
Gemini gemini-pro Auto-detected, uses GEMINI_API_KEY
Qwen dashscope/qwen-turbo Set api_base based on region (see above)
OpenRouter openrouter/openai/gpt-4o Full prefix required
vLLM hosted_vllm/llama-3.1-8b Set api_base to vLLM server
Ollama ollama/llama3.1 Set api_base to Ollama server

Local Models (vLLM / Ollama):

# Start Ollama
ollama serve
// Ollama
{
  "vlm": {
    "provider": "litellm",
    "model": "ollama/llama3.1",
    "api_base": "http://localhost:11434"
  }
}

For complete model support, see LiteLLM Providers Documentation.

3. Environment Configuration

Server Configuration Template

Create a configuration file ~/.openviking/ov.conf, remove the comments before copy:

{
  "storage": {
    "workspace": "/home/your-name/openviking_workspace"
  },
  "log": {
    "level": "INFO",
    "output": "stdout"                 // Log output: "stdout" or "file"
  },
  "embedding": {
    "dense": {
      "api_base" : "<api-endpoint>",   // API endpoint address
      "api_key"  : "<your-api-key>",   // Model service API Key
      "provider" : "<provider-type>",  // Provider type: "volcengine" or "openai" (currently supported)
      "dimension": 1024,               // Vector dimension
      "model"    : "<model-name>"      // Embedding model name (e.g., doubao-embedding-vision-250615 or text-embedding-3-large)
    },
    "max_concurrent": 10               // Max concurrent embedding requests (default: 10)
  },
  "vlm": {
    "api_base" : "<api-endpoint>",     // API endpoint address
    "api_key"  : "<your-api-key>",     // Model service API Key
    "provider" : "<provider-type>",    // Provider type (volcengine, openai, deepseek, anthropic, etc.)
    "model"    : "<model-name>",       // VLM model name (e.g., doubao-seed-2-0-pro-260215 or gpt-4-vision-preview)
    "max_concurrent": 100              // Max concurrent LLM calls for semantic processing (default: 100)
  }
}

Note: For embedding models, supported providers are volcengine (Doubao), openai, jina, voyage, minimax, vikingdb, and gemini (requires pip install "google-genai>=1.0.0"). For VLM models, we support three providers: volcengine, openai, and litellm. The litellm provider supports various models including Anthropic (Claude), DeepSeek, Gemini, Moonshot, Zhipu, DashScope, MiniMax, vLLM, Ollama, and more.

Server Configuration Examples

👇 Expand to see the configuration example for your model service:

Example 1: Using Volcengine (Doubao Models)
{
  "storage": {
    "workspace": "/home/your-name/openviking_workspace"
  },
  "log": {
    "level": "INFO",
    "output": "stdout"                 // Log output: "stdout" or "file"
  },
  "embedding": {
    "dense": {
      "api_base" : "https://ark.cn-beijing.volces.com/api/v3",
      "api_key"  : "your-volcengine-api-key",
      "provider" : "volcengine",
      "dimension": 1024,
      "model"    : "doubao-embedding-vision-250615"
    },
    "max_concurrent": 10
  },
  "vlm": {
    "api_base" : "https://ark.cn-beijing.volces.com/api/v3",
    "api_key"  : "your-volcengine-api-key",
    "provider" : "volcengine",
    "model"    : "doubao-seed-2-0-pro-260215",
    "max_concurrent": 100
  }
}
Example 2: Using OpenAI Models
{
  "storage": {
    "workspace": "/home/your-name/openviking_workspace"
  },
  "log": {
    "level": "INFO",
    "output": "stdout"                 // Log output: "stdout" or "file"
  },
  "embedding": {
    "dense": {
      "api_base" : "https://api.openai.com/v1",
      "api_key"  : "your-openai-api-key",
      "provider" : "openai",
      "dimension": 3072,
      "model"    : "text-embedding-3-large"
    },
    "max_concurrent": 10
  },
  "vlm": {
    "api_base" : "https://api.openai.com/v1",
    "api_key"  : "your-openai-api-key",
    "provider" : "openai",
    "model"    : "gpt-4-vision-preview",
    "max_concurrent": 100
  }
}
Example 3: Using Google Gemini Embedding

Install the required package first:

pip install "google-genai>=1.0.0"
{
  "storage": {
    "workspace": "/home/your-name/openviking_workspace"
  },
  "embedding": {
    "dense": {
      "provider": "gemini",
      "api_key": "your-google-api-key",
      "model": "gemini-embedding-2-preview",
      "dimension": 3072
    },
    "max_concurrent": 10
  },
  "vlm": {
    "api_base" : "https://api.openai.com/v1",
    "api_key"  : "your-openai-api-key",
    "provider" : "openai",
    "model"    : "gpt-4o",
    "max_concurrent": 100
  }
}

Get your Google API key at https://aistudio.google.com/apikey

Set Server Configuration Environment Variable

After creating the configuration file, set the environment variable to point to it (Linux/macOS):

export OPENVIKING_CONFIG_FILE=~/.openviking/ov.conf # by default

On Windows, use one of the following:

PowerShell:

$env:OPENVIKING_CONFIG_FILE = "$HOME/.openviking/ov.conf"

Command Prompt (cmd.exe):

set "OPENVIKING_CONFIG_FILE=%USERPROFILE%\.openviking\ov.conf"

💡 Tip: You can also place the configuration file in other locations, just specify the correct path in the environment variable.

CLI/Client Configuration Examples

👇 Expand to see the configuration example for your CLI/Client:

Example: ovcli.conf for visiting localhost server

{
  "url": "http://localhost:1933",
  "timeout": 60.0,
  "output": "table"
}

After creating the configuration file, set the environment variable to point to it (Linux/macOS):

export OPENVIKING_CLI_CONFIG_FILE=~/.openviking/ovcli.conf # by default

On Windows, use one of the following:

PowerShell:

$env:OPENVIKING_CLI_CONFIG_FILE = "$HOME/.openviking/ovcli.conf"

Command Prompt (cmd.exe):

set "OPENVIKING_CLI_CONFIG_FILE=%USERPROFILE%\.openviking\ovcli.conf"

4. Run Your First Example

📝 Prerequisite: Ensure you have completed the configuration (ov.conf and ovcli.conf) in the previous step.

Now let's run a complete example to experience the core features of OpenViking.

Launch Server

openviking-server

or you can run in background

nohup openviking-server > /data/log/openviking.log 2>&1 &

Run the CLI

ov status
ov add-resource https://github.com/volcengine/OpenViking # --wait
ov ls viking://resources/
ov tree viking://resources/volcengine -L 2
# wait some time for semantic processing if not --wait
ov find "what is openviking"
ov grep "openviking" --uri viking://resources/volcengine/OpenViking/docs/zh

Congratulations! You have successfully run OpenViking 🎉

VikingBot Quick Start

VikingBot is an AI agent framework built on top of OpenViking. Here's how to get started:

# Option 1: Install VikingBot from PyPI (recommended for most users)
pip install "openviking[bot]"

# Option 2: Install VikingBot from source (for development)
uv pip install -e ".[bot]"

# Start OpenViking server with Bot enabled
openviking-server --with-bot

# In another terminal, start interactive chat
ov chat

If you use the official Docker image, vikingbot is already bundled in the image and starts by default together with the OpenViking server and console UI. You can disable it at runtime with either --without-bot or -e OPENVIKING_WITH_BOT=0.


Server Deployment Details

For production environments, we recommend running OpenViking as a standalone HTTP service to provide persistent, high-performance context support for your AI Agents.

🚀 Deploy OpenViking on Cloud: To ensure optimal storage performance and data security, we recommend deploying on Volcengine Elastic Compute Service (ECS) using the veLinux operating system. We have prepared a detailed step-by-step guide to get you started quickly.

👉 View: Server Deployment & ECS Setup Guide

OpenClaw Context Plugin Details

  • Test Dataset: Effect testing based on LoCoMo10 (https://github.com/snap-research/locomo) long-range dialogues (1,540 cases in total after removing category5 without ground truth)
  • Experimental Groups: Since users may not disable OpenClaw's native memory when using OpenViking, we added experimental groups with native memory enabled or disabled
  • OpenViking Version: 0.1.18
  • Model: seed-2.0-code
  • Evaluation Script: https://github.com/ZaynJarvis/openclaw-eval/tree/main
Experimental Group Task Completion Rate Cost: Input Tokens (Total)
OpenClaw(memory-core) 35.65% 24,611,530
OpenClaw + LanceDB (-memory-core) 44.55% 51,574,530
OpenClaw + OpenViking Plugin (-memory-core) 52.08% 4,264,396
OpenClaw + OpenViking Plugin (+memory-core) 51.23% 2,099,622
  • Experimental Conclusions: After integrating OpenViking:
  • With native memory enabled: 43% improvement over original OpenClaw with 91% reduction in input token cost; 15% improvement over LanceDB with 96% reduction in input token cost.
  • With native memory disabled: 49% improvement over original OpenClaw with 83% reduction in input token cost; 17% improvement over LanceDB with 92% reduction in input token cost.

👉 View: OpenClaw Context Plugin

👉 View: OpenCode Memory Plugin Example

👉 View: Claude Code Memory Plugin Example

--

Core Concepts

After running the first example, let's dive into the design philosophy of OpenViking. These five core concepts correspond one-to-one with the solutions mentioned earlier, together building a complete context management system:

1. Filesystem Management Paradigm → Solves Fragmentation

We no longer view context as flat text slices but unify them into an abstract virtual filesystem. Whether it's memories, resources, or capabilities, they are mapped to virtual directories under the viking:// protocol, each with a unique URI.

This paradigm gives Agents unprecedented context manipulation capabilities, enabling them to locate, browse, and manipulate information precisely and deterministically through standard commands like ls and find, just like a developer. This transforms context management from vague semantic matching into intuitive, traceable "file operations". Learn more: Viking URI | Context Types

viking://
├── resources/              # Resources: project docs, repos, web pages, etc.
│   ├── my_project/
│   │   ├── docs/
│   │   │   ├── api/
│   │   │   └── tutorials/
│   │   └── src/
│   └── ...
├── user/                   # User: personal preferences, habits, etc.
│   └── memories/
│       ├── preferences/
│       │   ├── writing_style
│       │   └── coding_habits
│       └── ...
└── agent/                  # Agent: skills, instructions, task memories, etc.
    ├── skills/
    │   ├── search_code
    │   ├── analyze_data
    │   └── ...
    ├── memories/
    └── instructions/

2. Tiered Context Loading → Reduces Token Consumption

Stuffing massive amounts of context into a prompt all at once is not only expensive but also prone to exceeding model windows and introducing noise. OpenViking automatically processes context into three levels upon writing:

  • L0 (Abstract): A one-sentence summary for quick retrieval and identification.
  • L1 (Overview): Contains core information and usage scenarios for Agent decision-making during the planning phase.
  • L2 (Details): The full original data, for deep reading by the Agent when absolutely necessary.

Learn more: Context Layers

viking://resources/my_project/
├── .abstract               # L0 Layer: Abstract (~100 tokens) - Quick relevance check
├── .overview               # L1 Layer: Overview (~2k tokens) - Understand structure and key points
├── docs/
│   ├── .abstract          # Each directory has corresponding L0/L1 layers
│   ├── .overview
│   ├── api/
│   │   ├── .abstract
│   │   ├── .overview
│   │   ├── auth.md        # L2 Layer: Full content - Load on demand
│   │   └── endpoints.md
│   └── ...
└── src/
    └── ...

3. Directory Recursive Retrieval → Improves Retrieval Effect

Single vector retrieval struggles with complex query intents. OpenViking has designed an innovative Directory Recursive Retrieval Strategy that deeply integrates multiple retrieval methods:

  1. Intent Analysis: Generate multiple retrieval conditions through intent analysis.
  2. Initial Positioning: Use vector retrieval to quickly locate the high-score directory where the initial slice is located.
  3. Refined Exploration: Perform a secondary retrieval within that directory and update high-score results to the candidate set.
  4. Recursive Drill-down: If subdirectories exist, recursively repeat the secondary retrieval steps layer by layer.
  5. Result Aggregation: Finally, obtain the most relevant context to return.

This "lock high-score directory first, then refine content exploration" strategy not only finds the semantically best-matching fragments but also understands the full context where the information resides, thereby improving the globality and accuracy of retrieval. Learn more: Retrieval Mechanism

4. Visualized Retrieval Trajectory → Observable Context

OpenViking's organization uses a hierarchical virtual filesystem structure. All context is integrated in a unified format, and each entry corresponds to a unique URI (like a viking:// path), breaking the traditional flat black-box management mode with a clear hierarchy that is easy to understand.

The retrieval process adopts a directory recursive strategy. The trajectory of directory browsing and file positioning for each retrieval is fully preserved, allowing users to clearly observe the root cause of problems and guide the optimization of retrieval logic. Learn more: Retrieval Mechanism

5. Automatic Session Management → Context Self-Iteration

OpenViking has a built-in memory self-iteration loop. At the end of each session, developers can actively trigger the memory extraction mechanism. The system will asynchronously analyze task execution results and user feedback, and automatically update them to the User and Agent memory directories.

  • User Memory Update: Update memories related to user preferences, making Agent responses better fit user needs.
  • Agent Experience Accumulation: Extract core content such as operational tips and tool usage experience from task execution experience, aiding efficient decision-making in subsequent tasks.

This allows the Agent to get "smarter with use" through interactions with the world, achieving self-evolution. Learn more: Session Management


Advanced Reading

Documentation

For more details, please visit our Full Documentation.

Community & Team

For more details, please see: About Us

Join the Community

OpenViking is still in its early stages, and there are many areas for improvement and exploration. We sincerely invite every developer passionate about AI Agent technology:

  • Light up a precious Star for us to give us the motivation to move forward.
  • Visit our Website to understand the philosophy we convey, and use it in your projects via the Documentation. Feel the change it brings and give us feedback on your truest experience.
  • Join our community to share your insights, help answer others' questions, and jointly create an open and mutually helpful technical atmosphere:
  • Become a Contributor, whether submitting a bug fix or contributing a new feature, every line of your code will be an important cornerstone of OpenViking's growth.

Let's work together to define and build the future of AI Agent context management. The journey has begun, looking forward to your participation!

Star Trend

Star History Chart

License

The OpenViking project uses different licenses for different components:

  • Main Project: AGPLv3 - see the LICENSE file for details
  • crates/ov_cli: Apache 2.0 - see the LICENSE for details
  • examples: Apache 2.0 - see the LICENSE for details
  • third_party: Respective original licenses of third-party projects

Release History

VersionChangesUrgencyDate
v0.3.23# OpenViking v0.3.23 Release Notes / 发布说明 Release date / 发布日期: 2026-06-03 Full Changelog / 完整变更记录: https://github.com/volcengine/OpenViking/compare/v0.3.22...v0.3.23 Compare range / 对比范围: `v0.3.22...v0.3.23` (42 commits) --- ## 中文 ### 版本概览 v0.3.23 是一次偏体验与可靠性的版本:原生 `ov` CLI 改为更完整的交互式配置与诊断入口,Web Studio 增加 Playground 与连接身份管理,VikingBot 的经验召回改为配置驱动,同时补齐一批资源导入、会话、插件和 VLM 链路的可靠性修复。 ### 主要更新 - **原生 `ov` CLI 体验重构**:`ov config` 现在是配置管理入口,可交互添加、编辑、删除、切换配置;`ov config show`、`ov conHigh6/3/2026
v0.3.21Release date / 发布日期: 2026-05-27 Full Changelog / 完整变更记录: https://github.com/volcengine/OpenViking/compare/v0.3.20...v0.3.21 Compare range / 对比范围: `v0.3.20..v0.3.21` Commits / 提交数: 23 --- ## 中文 ### 版本概览 v0.3.21 主要强化 Memory V2 的 agent trajectory/experience 链路、批量会话写入、OpenClaw 工具命名、资源解析与文件系统错误处理,并修复 pip/pipx 安装下 `/studio` 静态资源缺失的问题。这个版本也补充了 tau2/VikingBot 自改进评测链路和一批文档、测试与安全清理。 ### 主要更新 - **Trajectory 记忆更适合检索与复盘**:trajectory schema 新增 `retrieval_anchor` 和 `embedding_templatHigh5/27/2026
v0.3.19OpenViking `v0.3.19` 已发布。 这个版本是基于 `v0.3.18` 后当前 `main` 的一次小版本发布,重点收敛 Console / Web Studio 的多时区数据口径。核心变化是把 Usage/Audit BFF 的时间分桶语义前移到服务端:统计数据统一按 UTC 写入,查询时再按请求里的 viewer timezone 做读端重分桶,前端和外部调用方不再需要、也不应该再做客户端二次时区转换。 相比 `v0.3.18` 的广泛产品化推进,`v0.3.19` 更聚焦于一个会影响 BFF 消费方式的 breaking change:`/api/v1/console/*` 的 `date` / `hour` bucket 现在应被视为服务端已经按用户时区解释后的结果。这个改动修复了 Docker / Railway 等 UTC 容器环境下“今天”统计边界不符合用户本地时间的问题,也修复了 Web Studio heatmap 在 UTC+ 用户侧被二次平移到“明天”的问题。 ## 重点变更 ### 1. Console BFF 改为服务端High5/22/2026
v0.3.17# OpenViking v0.3.17 Release Notes / 发布说明 Release date / 发布日期: 2026-05-15 Full Changelog / 完整变更记录: https://github.com/volcengine/OpenViking/compare/v0.3.16...v0.3.17 Range / 范围: `v0.3.16..v0.3.17`, 80 commits, 494 files changed. --- ## 中文 ### 版本概览 OpenViking v0.3.17 是一次面向 Agent 集成、数据迁移、可观测性和存储可靠性的综合发布。这个版本新增 LangChain / LangGraph 适配器、OVPack v2 迁移与备份能力、npm 原生 CLI 分发、Codex / OpenCode 插件改造,以及 Console 使用统计与请求审计 BFF;同时补强检索层级过滤、WebDAV 资源访问、VLM 失败切换、任务状态持久化和并发写入锁。 ### 主要更新 - High5/15/2026
cli@0.3.14# OpenViking CLI v0.3.14 ## Installation ### Quick Install (macOS/Linux) ```bash curl -fsSL https://raw.githubusercontent.com/volcengine/OpenViking/refs/tags/cli@0.3.14/crates/ov_cli/install.sh | bash ``` ### Manual Installation Download the appropriate binary for your platform below, extract it, and add it to your PATH. The CLI command is simply `ov`: ```bash # After extraction chmod +x ov # Unix only mv ov /usr/local/bin/ # or any directory in your PATH # Verify installation ov --versioHigh5/8/2026
v0.3.13# OpenViking v0.3.13 Release Notes / 发布说明 Release date / 发布日期: 2026-04-29 Full Changelog / 完整变更记录: https://github.com/volcengine/OpenViking/compare/v0.3.12...v0.3.13 Commit range / 提交范围: `v0.3.12...v0.3.13` (57 commits) --- ## 中文 ### 版本概览 OpenViking v0.3.13 是一个面向服务化、Agent 集成和生产可观测性的增强版本。它在 `openviking-server` 内置了标准 MCP HTTP 端点,新增用户级 skill 隐私配置与版本管理,补齐 Prometheus / OpenTelemetry 配置入口,并进一步收紧 API 错误、URI 校验和 bot 工具沙箱边界。 升级时最需要关注两点:`encryption.api_key_hashing.enabled` 现在需要显式配置High4/29/2026
v0.3.12## What's Changed * feat(semantic): add output_language_override to pin summary/overview language by @0xble in https://github.com/volcengine/OpenViking/pull/1607 * docs(bot): sync gateway config example with #1640 security defaults by @r266-tech in https://github.com/volcengine/OpenViking/pull/1649 * fix(parser): validate feishu config limits by @duyua9 in https://github.com/volcengine/OpenViking/pull/1645 * fix(code-hosting): recognize SSH repository hosts with userinfo by @officialasishkumHigh4/24/2026
v0.3.10# OpenViking v0.3.10 Release Notes / 发布说明 Release date / 发布日期: 2026-04-22 Full Changelog / 完整变更记录: https://github.com/volcengine/OpenViking/compare/v0.3.9...v0.3.10 --- ## 中文 ### 版本概览 v0.3.10 重点增强了 VLM provider、OpenClaw 插件生态、VikingDB 数据面接入,以及文件写入、QueueFS、Bot/CLI 的稳定性。本次发布包含 46 个提交,覆盖新功能、兼容性修复、安全修复和测试补强。 ### 主要更新 - 新增 Codex、Kimi、GLM VLM provider,并支持 `vlm.timeout` 配置。 - 新增 VikingDB `volcengine.api_key` 数据面模式,可通过 API Key 访问已创建好的云上 VikingDB collection/index。 - `write()` 新增 High4/23/2026
v0.3.9## What's Changed * reorg: remove golang depends by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/1339 * Feat/mem opt by @chenjw in https://github.com/volcengine/OpenViking/pull/1349 * fix: openai like embedding models fix, no more matryoshka error by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/1350 * feat(bot): Add disable OpenViking config for channels. by @yeshion23333 in https://github.com/volcengine/OpenViking/pull/1352 * fix(config): point missing-conHigh4/18/2026
v0.3.8- Date: 2026-04-15 - Tag: `v0.3.8` - Compare: https://github.com/volcengine/OpenViking/compare/v0.3.5...v0.3.8 ## 中文说明 OpenViking v0.3.8 主要聚焦于 Memory V2、Agent/插件生态增强,以及一批配置、检索和稳定性修复。本次版本共整理了 35 个变更项,覆盖 Memory V2、CLI、OpenClaw、Codex、VikingBot、检索性能和部署体验,并包含 8 位新贡献者。 ### Memory V2 专题 Memory V2 是 v0.3.8 的核心主题之一。本节重点介绍其记忆格式设计,以及模板化、结构化更新和可扩展性带来的架构优化。 - 记忆格式: - Memory V2 不再把长期记忆限制在 v1 的固定类别里,而是改成基于 YAML 模板定义记忆类型。 - 每种记忆模板都可以定义 `directory`、`filename_template`、`fields`、`merge_op`,必High4/15/2026
v0.3.6- Date: 2026-04-14 - Tag: `v0.3.6` - Compare: https://github.com/volcengine/OpenViking/compare/v0.3.5...v0.3.6 ## 中文说明 OpenViking v0.3.6 主要聚焦于 Memory V2、Agent/插件生态增强,以及一批配置、检索和稳定性修复。本次版本共整理了 35 个变更项,覆盖 Memory V2、CLI、OpenClaw、Codex、VikingBot、检索性能和部署体验,并包含 8 位新贡献者。 ### Memory V2 专题 Memory V2 是 v0.3.6 的核心主题之一。本节重点介绍其记忆格式设计,以及模板化、结构化更新和可扩展性带来的架构优化。 - 记忆格式: - Memory V2 不再把长期记忆限制在 v1 的固定类别里,而是改成基于 YAML 模板定义记忆类型。 - 每种记忆模板都可以定义 `directory`、`filename_template`、`fields`、`merge_op`,必High4/14/2026
v0.3.5## What's Changed * fix(memory): define config before v2 memory lock retry settings access by @heaoxiang-ai in https://github.com/volcengine/OpenViking/pull/1317 * fix: 优化测试关键词匹配和移除 Release Approval Gate by @kaisongli in https://github.com/volcengine/OpenViking/pull/1313 * fix: sanitize internal error details in bot proxy responses by @sjhddh in https://github.com/volcengine/OpenViking/pull/1310 * feat: add scenario-based API tests by @kaisongli in https://github.com/volcengine/OpenViking/pHigh4/10/2026
v0.3.5## What's Changed * fix(memory): define config before v2 memory lock retry settings access by @heaoxiang-ai in https://github.com/volcengine/OpenViking/pull/1317 * fix: 优化测试关键词匹配和移除 Release Approval Gate by @kaisongli in https://github.com/volcengine/OpenViking/pull/1313 * fix: sanitize internal error details in bot proxy responses by @sjhddh in https://github.com/volcengine/OpenViking/pull/1310 * feat: add scenario-based API tests by @kaisongli in https://github.com/volcengine/OpenViking/pHigh4/10/2026
v0.3.4# OpenViking v0.3.4 本次 `v0.3.4` 版本主要围绕 OpenClaw 插件与评测链路、Memory / 存储与写入稳定性、安全边界与网络控制,以及发布流程、Docker 与 CI 体系做了持续增强。相较 `v0.3.3`,这一版本一方面补齐了 OpenClaw 默认行为、eval 脚本、provider 扩展和多项兼容性问题,另一方面也显著加强了会话写入等待、锁与压缩器重试、HTTP 资源导入 SSRF 防护、trusted mode 限制和整体发布交付链路。 ## 版本亮点 - **OpenClaw 插件与评测体验继续完善**:调整 `recallPreferAbstract` 与 `ingestReplyAssist` 的默认值以降低意外行为,[PR #1204](https://github.com/volcengine/OpenViking/pull/1204) [PR #1206](https://github.com/volcengine/OpenViking/pull/1206);新增 OpenClaw eval shell 脚Medium4/9/2026
v0.3.3# OpenViking v0.3.3 本次 `v0.3.3` 版本主要围绕评测与写入能力、OpenClaw 插件与集成体验、会话与资源导入链路、稳定性与安全性,以及测试与 CI 体系做了集中增强。相较 `v0.3.2`,这一版本一方面补齐了 benchmark / eval / write 等面向落地使用的能力,另一方面也明显加强了 OpenClaw 插件的可观测性、健康检查、容错和测试覆盖,同时修复了若干会直接影响生产使用的锁处理、任务权限、ZIP 编码、资源导入与 embedder 参数问题。 ## Highlights - **评测与写入能力继续扩展**:新增 RAG benchmark 评测框架 [PR #825](https://github.com/volcengine/OpenViking/pull/825),补充 OpenClaw 的 LoCoMo eval 脚本与说明 [PR #1152](https://github.com/volcengine/OpenViking/pull/1152),并新增内容写入接口 [PR #1151](https:/Medium4/3/2026
v0.3.2## What's Changed * Chore/pr agent ark token billing by @qin-ptr in https://github.com/volcengine/OpenViking/pull/1117 * Fix HTTPX recognition issue with SOCKS5 proxy causing OpenViking crash by @wlff123 in https://github.com/volcengine/OpenViking/pull/1118 * refactor(model): unify config-driven retry across VLM and embedding by @qin-ctx in https://github.com/volcengine/OpenViking/pull/926 * fix(bot): import eval session time by @yeshion23333 in https://github.com/volcengine/OpenViking/pull/Medium4/1/2026
v0.3.1## What's Changed * feat(ast): add PHP tree-sitter support by @yangxinxin-7 in https://github.com/volcengine/OpenViking/pull/1087 * feat(ci): add multi-platform API test support for 5 platforms by @kaisongli in https://github.com/volcengine/OpenViking/pull/1093 * fix(ci): refresh uv.lock for docker release build by @zhoujh01 in https://github.com/volcengine/OpenViking/pull/1094 * fix(openclaw-plugin): simplify install flow and harden helpers by @qin-ctx in https://github.com/volcengine/OpenVMedium3/31/2026
v0.2.14# OpenViking v0.2.14 本次 `v0.2.14` 版本主要围绕多租户能力、解析导入链路、OpenClaw 插件体验、Bot/Feishu 集成,以及服务端稳定性与安全性做了集中增强。 ## Highlights - 多租户与身份管理进一步完善。CLI 已支持租户身份默认值与覆盖,文档新增多租户使用指南,memory 也支持仅按 agent 维度隔离的 `agent-only` scope。 - 解析与导入链路更完整。图片解析新增 OCR 文本提取,目录导入识别 `.cc` 文件,重复标题导致的文件名冲突得到修复,HTTP 上传链路改为更稳妥的 upload id 流程。 - OpenClaw 插件显著增强。安装器与升级流程统一,默认按最新 Git tag 安装,session API 与 context pipeline 做了统一重构,并补齐了 Windows、compaction、compact result mapping、子进程重拉起等多处兼容性与稳定性问题。 - Bot 与 Feishu 集成可用性继续提升。修复了 bot proxy Medium3/30/2026
v0.2.13## What's Changed * test: add comprehensive unit tests for core utilities by @xingzihai in https://github.com/volcengine/OpenViking/pull/990 * fix(vlm): scope LiteLLM thinking param to DashScope providers only by @deepakdevp in https://github.com/volcengine/OpenViking/pull/958 * Api test:improve API test infrastructure with dual-mode CI by @kaisongli in https://github.com/volcengine/OpenViking/pull/950 * docs: Add basic usage example and Chinese documentation for examples by @xingzihai in htMedium3/26/2026
v0.2.12## What's Changed * Use uv sync --locked in Dockerfile by @mtthidoteu in https://github.com/volcengine/OpenViking/pull/963 * fix(server): handle CancelledError during shutdown paths by @ZaynJarvis in https://github.com/volcengine/OpenViking/pull/848 * fix(bot):rollback config by @yeshion23333 in https://github.com/volcengine/OpenViking/pull/973 ## New Contributors * @mtthidoteu made their first contribution in https://github.com/volcengine/OpenViking/pull/963 **Full Changelog**: https:Medium3/25/2026
v0.2.11# OpenViking v0.2.11 OpenViking v0.2.11 聚焦在四个方向:模型与检索生态扩展、解析与导入能力增强、服务端可观测性与运维能力补齐,以及多租户安全性和稳定性加固。相较 `v0.2.9`,这一版本不仅补上了 Helm 部署、Prometheus 指标、健康统计 API、`ov doctor` 与 `reindex` 等工程能力,也持续扩展了 embedding、rerank、VLM 与 bot 侧的模型接入面。 这次更新的代表性改动包括:新增 MiniMax embedding、Azure OpenAI、GeminiDenseEmbedder、LiteLLM embedding/rerank、OpenAI-compatible rerank 与 Tavily 搜索后端;新增 Whisper 音频转写与飞书/Lark 云文档解析;新增多租户文件加密与文档加密;增加 Prometheus 指标导出、内存健康统计接口、可信租户头鉴权模式,以及面向 Kubernetes 的 Helm Chart。与此同时,版本还集中修复了 Windows 锁文件、会Medium3/25/2026
v0.2.10# LiteLLM 安全热修复 Release Note 更新时间:2026-03-24 ## 背景 由于上游依赖 `LiteLLM` 出现公开供应链安全事件,OpenViking 在本次热修复中临时禁用所有 LiteLLM 相关入口,以避免继续安装或运行到受影响依赖。 ## 变更内容 - 移除根依赖中的 `litellm` - 移除根 `uv.lock` 中的 `litellm` - 禁用 LiteLLM 相关的 VLM provider 入口 - 禁用 bot 侧 LiteLLM provider 和图片工具入口 - 增加 LiteLLM 已禁用的回归测试 ## 建议操作 建议用户立即执行以下动作: 1. 检查运行环境中是否安装 `litellm` 2. 卸载可疑版本并重建虚拟环境、容器镜像或发布产物 3. 对近期安装过可疑版本的机器轮换 API Key 和相关凭证 4. 升级到本热修复版本 可用命令: ```bash python -m pip show litellm python -m pip unMedium3/24/2026
v0.2.9## What's Changed * fix(resource): enforce agent-level watch task isolation by @lyfmt in https://github.com/volcengine/OpenViking/pull/762 * feat(embedder): use summary for file embedding in semantic pipeline by @yangxinxin-7 in https://github.com/volcengine/OpenViking/pull/765 * Fix/bot readme by @chenjw in https://github.com/volcengine/OpenViking/pull/774 * Fix/increment update dir vector store by @myysy in https://github.com/volcengine/OpenViking/pull/773 * fix(plugin): restore bug fixesLow3/19/2026
v0.2.8# OpenViking v0.2.8 发布公告 OpenViking v0.2.8 现已发布。 这是一次围绕 **上下文工程能力、插件生态、检索与记忆链路、可观测性以及工程兼容性** 持续增强的版本更新。 整体来看,v0.2.8 以功能补强和稳定性修复为主,适合现有用户升级;如果你在使用 OpenClaw / OpenCode 插件、长会话记忆、资源同步或自定义模型接入,本次更新尤其值得关注。 ## 本次更新亮点 ### 1. 插件生态继续升级,OpenClaw / OpenCode 集成更完整 - `openclaw-plugin` 升级到 **2.0**,从 memory plugin 进一步演进为 **context engine**。 - 相关 PR:[#662](https://github.com/volcengine/OpenViking/pull/662) - 新增并完善 **OpenCode memory plugin example**,补充 attribution 与后续插件更新。 - 相关 PR:[#569](htLow3/19/2026
v0.2.6# OpenViking v0.2.6 发布公告 OpenViking v0.2.6 已发布。 这是一次聚焦体验优化和稳定性增强的小版本更新。相比 v0.2.5,这一版不仅带来了更顺手的命令行交互和全新的 Console,也补齐了会话异步提交、后台任务跟踪、资源导入目录结构保留等关键能力,同时在 openclaw memory plugin、安装流程、跨平台兼容性和 CI 稳定性上做了大量修复与打磨。 ## 重点更新 ### 1. CLI 与对话体验进一步升级 - `ov chat` 现在基于 `rustyline` 提供更完整的行编辑体验,终端交互更自然,不再出现常见的方向键控制字符问题。 - 新增 Markdown 渲染能力,终端中的回答展示更清晰,代码块、列表等内容可读性更好。 - 支持聊天历史记录,同时提供关闭格式化和关闭历史记录的选项,便于在不同终端环境中按需使用。 ### 2. 服务端异步能力增强,长任务不再轻易阻塞 - Session commit 新增异步提交能力,并支持通过 `wait` 参数控制是否同步等待结果。 - Low3/11/2026
v0.2.5## What's Changed * docs: use openviking-server to launch server by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/398 * fix: Session.add_message() support parts parameter by @qin-ctx in https://github.com/volcengine/OpenViking/pull/399 * feat: support GitHub tree/<ref> URL for code repository import by @yangxinxin-7 in https://github.com/volcengine/OpenViking/pull/400 * fix: improve ISO datetime parsing by @zztdandan in https://github.com/volcengine/OpenViking/pull/404 * feaLow3/6/2026
v0.2.3## Breaking Change After upgrading, datasets/indexes generated by historical versions are not compatible with the new version and cannot be reused directly. Please rebuild the datasets after upgrading (a full rebuild is recommended) to avoid retrieval anomalies, inconsistent filtering results, or runtime errors. Stop the service -> rm -rf ./your-openviking-workspace -> restart the service with the openviking-server command. ## What's Changed * Feat: CLI optimization by @MaojiaSheng in httpLow3/3/2026
v0.2.2## Breaking Change Warning: This Release includes Breaking Chage! Before upgrading, you should stop VikingDB Server and clear workspace dir first. ## What's Changed * fix ci by @zhoujh01 in https://github.com/volcengine/OpenViking/pull/363 * 在readme补充千问使用方法 by @BytedanceFu in https://github.com/volcengine/OpenViking/pull/364 * feat(parse): Add C# AST extractor support by @suraciii in https://github.com/volcengine/OpenViking/pull/366 * chore: bump CLI version to 0.2.1 by @ZaynJarvis inLow3/3/2026
v0.2.1This is a **core feature preview release**. Please note that performance and consistency have not been fully optimized, so use with caution. ``` Before you upgrade, please remove old ov.conf and old data directory, and then follow the new README.md to deploy! ``` ## 1. Core Capability Upgrades: Multi-tenancy, Cloud-Native & OpenClaw/OpenCode Adaptation - **Multi-tenancy**: Implemented foundational multi-tenancy support at the API layer (#260, #283), laying the groundwork for isolated usLow2/28/2026
cli@0.2.0# OpenViking CLI v0.2.0 ## Installation ### Quick Install (macOS/Linux) ```bash curl -fsSL https://raw.githubusercontent.com/volcengine/OpenViking/refs/tags/cli@0.2.0/crates/ov_cli/install.sh | bash ``` ### Manual Installation Download the appropriate binary for your platform below, extract it, and add it to your PATH. The CLI command is simply `ov`: ```bash # After extraction chmod +x ov # Unix only mv ov /usr/local/bin/ # or any directory in your PATH # Verify installation ov --version Low2/27/2026
v0.1.18## What's Changed * feat: add Rust CLI implementation [very fast] by @ZaynJarvis in https://github.com/volcengine/OpenViking/pull/162 * feat: make -o and --json global param by @ZaynJarvis in https://github.com/volcengine/OpenViking/pull/172 * feat: provide a test_ov.sh scripts as reference by @ZaynJarvis in https://github.com/volcengine/OpenViking/pull/173 * fix: short markdown parse filename by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/176 * Update 03-quickstart-serverLow2/23/2026
cli@0.1.0# OpenViking CLI v0.1.0 ## Installation ### Quick Install (macOS/Linux) ```bash curl -fsSL https://raw.githubusercontent.com/volcengine/OpenViking/refs/tags/cli@0.1.0/crates/ov_cli/install.sh | bash ``` ### Manual Installation Download the appropriate binary for your platform below, extract it, and add it to your PATH. The CLI command is simply `ov`: ```bash # After extraction chmod +x ov # Unix only mv ov /usr/local/bin/ # or any directory in your PATH # Verify installation ov --version Low2/14/2026
v0.1.17## What's Changed * Revert "feat: support dynamic project_name config in VectorDB / volcengine" by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/167 * Fix/ci clean workspace by @kkkwjx07 in https://github.com/volcengine/OpenViking/pull/170 * fix: tree uri output error, and validate ov.conf before start by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/169 **Full Changelog**: https://github.com/volcengine/OpenViking/compare/v0.1.16...v0.1.17Low2/14/2026
v0.1.16## What's Changed * fix: fix vectordb by @kkkwjx07 in https://github.com/volcengine/OpenViking/pull/164 * feat: make temp uri readable, and enlarge timeout of add-resource by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/165 * feat: support dynamic project_name config in VectorDB / volcengine by @baojun-zhang in https://github.com/volcengine/OpenViking/pull/161 * fix: server uvloop conflicts with nest_asyncio by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/1Low2/13/2026
v0.1.15## What's Changed Now you can try Server/CLI mode! * refactor(client): 拆分 HTTP 客户端,分离嵌入模式与 HTTP 模式 by @qin-ctx in https://github.com/volcengine/OpenViking/pull/141 * Transaction store by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/136 * fix CI: correct patch targets in test_quick_start_lite.py by @ZaynJarvis in https://github.com/volcengine/OpenViking/pull/140 * fix/lifecycle by @qin-ctx in https://github.com/volcengine/OpenViking/pull/144 * Fix/lifecycle by @qin-ctx Low2/13/2026
v0.1.14## What's Changed * build(deps): bump protobuf from 6.33.2 to 6.33.5 by @dependabot[bot] in https://github.com/volcengine/OpenViking/pull/104 * refactor: cpp bytes rows by @kkkwjx07 in https://github.com/volcengine/OpenViking/pull/105 * fix: agfs port by @qin-ctx in https://github.com/volcengine/OpenViking/pull/110 * refactor: refactor agfs s3 backend config by @chuanbao666 in https://github.com/volcengine/OpenViking/pull/113 * fix(depends): 修复pip依赖的安全漏洞 by @chuanbao666 in https://github.coLow2/12/2026
v0.1.12## What's Changed * feat: add search_with_sparse_logit_alpha by @kkkwjx07 in https://github.com/volcengine/OpenViking/pull/71 * refactor: Refactor S3 configuration structure and fix Python 3.9 compatibility issues by @baojun-zhang in https://github.com/volcengine/OpenViking/pull/73 * fix: fix ci by @kkkwjx07 in https://github.com/volcengine/OpenViking/pull/74 * refactor: unify async execution utilities into run_async by @qin-ctx in https://github.com/volcengine/OpenViking/pull/75 * docs: upLow2/9/2026
v0.1.11## What's Changed * support small github code repos by @MaojiaSheng in https://github.com/volcengine/OpenViking/pull/70 ## New Contributors * @MaojiaSheng made their first contribution in https://github.com/volcengine/OpenViking/pull/70 **Full Changelog**: https://github.com/volcengine/OpenViking/compare/v0.1.10...v0.1.11Low2/5/2026
v0.1.9## What's Changed * Bump github/codeql-action from 3 to 4 by @dependabot[bot] in https://github.com/volcengine/OpenViking/pull/5 * Bump actions/setup-go from 5 to 6 by @dependabot[bot] in https://github.com/volcengine/OpenViking/pull/4 * Bump actions/setup-python from 5 to 6 by @dependabot[bot] in https://github.com/volcengine/OpenViking/pull/3 * Bump astral-sh/setup-uv from 4 to 7 by @dependabot[bot] in https://github.com/volcengine/OpenViking/pull/2 * Bump actions/download-artifact from 4Low2/5/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

PageIndex📑 PageIndex: Document Index for Vectorless, Reasoning-based RAGmain@2026-06-02
MineContext🛠️ Simplify your tasks with MineContext, an open-source AI tool that provides context-aware support for clarity and efficiency in work and study.main@2026-06-05
Advanced_Graph_RAGNo descriptionmain@2026-06-05
txtai💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflowsv9.10.0
Ollama-Terminal-AgentAutomate shell tasks using a local Ollama model that plans, executes, and fixes commands without cloud or API dependencies.main@2026-06-04

More in Databases

WeKnoraLLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
orbitOne API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.
ai-real-estate-assistantAdvanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
alibabacloud-adb20211201Alibaba Cloud adb (20211201) SDK Library for Python