freshcrate Agent Edition — Linux for agent operators

Open-source, high-performance LLM gateway written in Rust. Connect to any LLM provider with a single API. Observability Included.

Get started »

Slack | Docs

Traceloop Hub is a next-gen high-performance LLM gateway written in Rust that centralizes control and tracing of all LLM calls. It provides a unified OpenAI-compatible API for connecting to multiple LLM providers with observability built-in.

Features

Multi-Provider Support: OpenAI, Anthropic, Azure OpenAI, Google VertexAI, AWS Bedrock
OpenAI Compatible API: Drop-in replacement for OpenAI API calls
Two Deployment Modes:
- YAML Mode: Simple static configuration with config files
- Database Mode: Dynamic configuration with PostgreSQL and Management API
Built-in Observability: OpenTelemetry tracing and Prometheus metrics
High Performance: Written in Rust with async/await support
Hot Reload: Dynamic configuration updates (Database mode)
Pipeline System: Extensible request/response processing
Unified Architecture: Single crate structure with integrated Management API

Quick Start

Using Docker

# YAML Mode (simple deployment)
docker run -p 3000:3000 -v $(pwd)/config.yaml:/app/config.yaml traceloop/hub

# Database Mode (with management API)
docker run -p 3000:3000 -p 8080:8080 \
  -e HUB_MODE=database \
  -e DATABASE_URL=postgresql://user:pass@host:5432/db \
  traceloop/hub

Using Cargo

# Clone and build
git clone https://github.com/traceloop/hub.git
cd hub
cargo build --release

# YAML Mode
./target/release/hub

# Database Mode  
HUB_MODE=database DATABASE_URL=postgresql://user:pass@host:5432/db ./target/release/hub

Architecture

The project uses a unified single-crate architecture:

hub/
├── src/                        # Main application code
│   ├── main.rs                 # Application entry point
│   ├── lib.rs                  # Library exports
│   ├── config/                 # Configuration management
│   ├── providers/              # LLM provider implementations
│   ├── models/                 # Data models
│   ├── pipelines/              # Request processing pipelines
│   ├── routes.rs               # HTTP routing
│   ├── state.rs                # Application state management
│   ├── management/             # Management API (Database mode)
│   │   ├── api/                # REST API endpoints
│   │   ├── db/                 # Database models and repositories
│   │   ├── services/           # Business logic
│   │   └── dto.rs              # Data transfer objects
│   └── types/                  # Shared type definitions
├── migrations/                 # Database migrations
├── helm/                       # Kubernetes deployment
├── tests/                      # Integration tests
└── docs/                       # Documentation

Configuration Modes

YAML Mode

Perfect for simple deployments and development environments.

Features:

Static configuration via config.yaml
No external dependencies
Simple provider and model setup
No management API
Single port (3000)

Example config.yaml:

providers:
  - key: openai
    type: openai
    api_key: sk-...

models:
  - key: gpt-4
    type: gpt-4
    provider: openai

pipelines:
  - name: chat
    type: Chat
    plugins:
      - ModelRouter:
          models: [gpt-4]

Database Mode

Ideal for production environments requiring dynamic configuration.

Features:

PostgreSQL-backed configuration
REST Management API (/api/v1/management/*)
Hot reload without restarts
Configuration polling and synchronization
SecretObject system for credential management
Dual ports (3000 for Gateway, 8080 for Management)

Setup:

Set up PostgreSQL database
Run migrations: sqlx migrate run

Set environment variables:

HUB_MODE=database
DATABASE_URL=postgresql://user:pass@host:5432/db

API Endpoints

Core LLM Gateway (Both Modes)

Port 3000:

POST /api/v1/chat/completions - Chat completions
POST /api/v1/completions - Text completions
POST /api/v1/embeddings - Text embeddings
GET /health - Health check
GET /metrics - Prometheus metrics
GET /swagger-ui - OpenAPI documentation

Management API (Database Mode Only)

Port 8080:

GET /health - Management API health check
GET|POST|PUT|DELETE /api/v1/management/providers - Provider management
GET|POST|PUT|DELETE /api/v1/management/model-definitions - Model management
GET|POST|PUT|DELETE /api/v1/management/pipelines - Pipeline management

Provider Configuration

OpenAI

providers:
  - key: openai
    type: openai
    api_key: sk-...
    # Optional
    organization_id: org-...
    base_url: https://api.openai.com/v1

Anthropic

providers:
  - key: anthropic
    type: anthropic
    api_key: sk-ant-...

Azure OpenAI

providers:
  - key: azure
    type: azure
    api_key: your-key
    resource_name: your-resource
    api_version: "2023-05-15"

AWS Bedrock

providers:
  - key: bedrock
    type: bedrock
    region: us-east-1
    # Uses IAM roles or AWS credentials

Google VertexAI

Supports two authentication modes that route to different Google APIs:

# Option 1: API Key (uses Gemini Developer API)
providers:
  - key: vertexai
    type: vertexai
    api_key: your-gemini-api-key

# Option 2: Service Account (uses Vertex AI)
providers:
  - key: vertexai
    type: vertexai
    project_id: your-project
    location: us-central1
    credentials_path: /path/to/service-account.json

Auth Method	API Endpoint	Use Case
API Key	`generativelanguage.googleapis.com`	Simple setup, development
Service Account	`{location}-aiplatform.googleapis.com`	Enterprise, GCP-integrated

Deployment

Helm Chart

# YAML Mode
helm install hub ./helm

# Database Mode
helm install hub ./helm \
  --set management.enabled=true \
  --set management.database.host=postgres \
  --set management.database.existingSecret=postgres-secret

Docker Compose

docker compose example

version: '3.8'
services:
  # YAML Mode
  hub-yaml:
    image: traceloop/hub
    ports:
      - "3000:3000"
    volumes:
      - ./config.yaml:/app/config.yaml

  # Database Mode
  hub-database:
    image: traceloop/hub
    ports:
      - "3000:3000"
      - "8080:8080"
    environment:
      - HUB_MODE=database
      - DATABASE_URL=postgresql://hub:password@postgres:5432/hub
    depends_on:
      - postgres

  postgres:
    image: postgres:15
    environment:
      - POSTGRES_DB=hub
      - POSTGRES_USER=hub
      - POSTGRES_PASSWORD=password

Environment Variables

Variable	Description	Default	Required
`HUB_MODE`	Deployment mode: `yaml` or `database`	`yaml`	No
`CONFIG_FILE_PATH`	Path to YAML config file	`config.yaml`	YAML mode
`DATABASE_URL`	PostgreSQL connection string	-	Database mode
`DB_POLL_INTERVAL_SECONDS`	Config polling interval	`30`	No
`PORT`	Gateway server port	`3000`	No
`MANAGEMENT_PORT`	Management API port	`8080`	Database mode
`TRACE_CONTENT_ENABLED`	Enable request/response tracing	`true`	No

Development

Prerequisites

Rust 1.87+
PostgreSQL 12+ (for database mode)
sqlx-cli (for migrations)

Commands

# Build OSS version
cargo build

# Test
cargo test

# Format
cargo fmt

# Lint
cargo clippy

# Run YAML mode
cargo run

# Run database mode
HUB_MODE=database DATABASE_URL=postgresql://... cargo run

Database Setup (for Database Mode)

# Install sqlx-cli
cargo install sqlx-cli --no-default-features --features postgres

# Run migrations
sqlx migrate run

# Use setup script for complete setup
./scripts/setup-db.sh

Project Structure

The project follows a unified single-crate architecture:

src/main.rs: Application entry point with mode detection
src/lib.rs: Library exports for all modules
src/config/: Configuration management and validation
src/providers/: LLM provider implementations
src/models/: Request/response data models
src/pipelines/: Request processing pipelines
src/management/: Management API (Database mode)
src/types/: Shared type definitions
src/state.rs: Thread-safe application state
src/routes.rs: Dynamic HTTP routing

Key Features

Hot Reload: Configuration changes without restarts (Database mode)
Atomic Updates: Thread-safe configuration updates
Dynamic Routing: Pipeline-based request steering
Comprehensive Testing: Integration tests with testcontainers
OpenAPI Documentation: Auto-generated API specs

Observability

OpenTelemetry Tracing

Configure in your pipeline:

pipelines:
  - name: traced-chat
    type: Chat
    plugins:
      - Tracing:
          endpoint: http://jaeger:14268/api/traces
          api_key: your-key
      - ModelRouter:
          models: [gpt-4]

Prometheus Metrics

Available at /metrics:

Request counts and latencies
Provider-specific metrics
Error rates
Active connections

Architecture

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│   Client App    │───▶│  Traceloop Hub   │───▶│   LLM Provider  │
└─────────────────┘    │                  │    │  (OpenAI, etc.) │
                       │  ┌─────────────┐ │    └─────────────────┘
                       │  │ Config Mode │ │
                       │  │ YAML | DB   │ │    ┌─────────────────┐
                       │  └─────────────┘ │───▶│   Observability │
                       │                  │    │ (OTel, Metrics) │
                       │  ┌─────────────┐ │    └─────────────────┘
                       │  │ Management  │ │
                       │  │ API (DB)    │ │
                       │  └─────────────┘ │
                       └──────────────────┘

Version	Changes	Urgency	Date
0.9.0	## v0.9.0 (2026-04-15) ### Feat - Expose model provider header (#100) ### Fix - anthropic: drop top_p when both temperature and top_p are set + flatten response (#99) [main 156dfa7] bump: version 0.8.0 → 0.9.0 4 files changed, 13 insertions(+), 3 deletions(-)	High	4/15/2026
0.8.0	## v0.8.0 (2026-03-04) ### Feat - add base_url support for OpenAI provider in managed mode (#98) [main 95b5880] bump: version 0.7.7 → 0.8.0 4 files changed, 9 insertions(+), 3 deletions(-)	Low	3/4/2026

hub

Description

README

Get started » Slack | Docs

Features

Quick Start

Using Docker

Using Cargo

Architecture

Configuration Modes

YAML Mode

Database Mode

API Endpoints

Core LLM Gateway (Both Modes)

Management API (Database Mode Only)

Provider Configuration

OpenAI

Anthropic

Azure OpenAI

AWS Bedrock

Google VertexAI

Deployment

Helm Chart

Docker Compose

Environment Variables

Development

Prerequisites

Commands

Database Setup (for Database Mode)

Project Structure

Key Features

Observability

OpenTelemetry Tracing

Prometheus Metrics

Architecture

License

Contributing

Support

Release History

Dependencies & License Audit

Similar Packages

Get started »

Slack | Docs