OpenViking Server provides comprehensive endpoints for monitoring system health and component status.
Health Check
The /health endpoint provides a simple liveness check with no authentication required.
```shell
curl http://localhost:1933/health
```
Response:
This endpoint is always accessible without authentication, making it ideal for load balancers and monitoring tools.
Readiness Check
The /ready endpoint checks all system components before accepting traffic.
```shell
curl http://localhost:1933/ready
```
Response:
```json
{
  "status": "ready",
  "checks": {
    "agfs": "ok",
    "vectordb": "ok",
    "api_key_manager": "ok"
  }
}
```
Use /health for Kubernetes liveness probes and /ready for readiness probes.
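If you script against /ready, a small helper can surface which checks failed. A minimal sketch, assuming the field names from the sample response above (`failing_checks` is a hypothetical helper, not part of the SDK):

```python
def failing_checks(ready_response: dict) -> list:
    """Return the names of checks that did not report "ok"."""
    checks = ready_response.get("checks", {})
    return [name for name, state in checks.items() if state != "ok"]

# Sample payload mirroring the /ready response above
sample = {
    "status": "ready",
    "checks": {"agfs": "ok", "vectordb": "ok", "api_key_manager": "ok"},
}
print(failing_checks(sample))  # []
```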
System Status
Get comprehensive status of all components.
Overall System Health
Python SDK:

```python
import openviking as ov

client = ov.SyncOpenViking(path="./data")
client.initialize()

# Get full status
status = client.get_status()
print(f"Healthy: {status['is_healthy']}")
print(f"Components: {status['components']}")

# Quick health check
if client.is_healthy():
    print("System OK")
else:
    print(f"Errors: {status['errors']}")
```
HTTP Client:

```python
import openviking as ov

client = ov.SyncHTTPClient(
    url="http://localhost:1933",
    api_key="your-key"
)

status = client.get_status()
print(f"Healthy: {status['is_healthy']}")
for name, info in status['components'].items():
    print(f"{name}: {info['is_healthy']}")
```
HTTP API:

```shell
curl http://localhost:1933/api/v1/observer/system \
  -H "X-API-Key: your-key"
```
Response:
```json
{
  "status": "ok",
  "result": {
    "is_healthy": true,
    "errors": [],
    "components": {
      "queue": {
        "name": "queue",
        "is_healthy": true,
        "has_errors": false
      },
      "vikingdb": {
        "name": "vikingdb",
        "is_healthy": true,
        "has_errors": false
      },
      "vlm": {
        "name": "vlm",
        "is_healthy": true,
        "has_errors": false
      }
    }
  }
}
```
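When consuming this payload programmatically, the `components` map can be reduced to a list of failing names. A minimal sketch against the response shape shown above (`unhealthy_components` is a hypothetical helper, not an SDK method):

```python
def unhealthy_components(result: dict) -> list:
    """Given the "result" object from /observer/system, list unhealthy components."""
    return [
        name
        for name, info in result.get("components", {}).items()
        if not info.get("is_healthy", False)
    ]

# Sample mirroring the response above
result = {
    "is_healthy": True,
    "errors": [],
    "components": {
        "queue": {"name": "queue", "is_healthy": True, "has_errors": False},
        "vikingdb": {"name": "vikingdb", "is_healthy": True, "has_errors": False},
        "vlm": {"name": "vlm", "is_healthy": True, "has_errors": False},
    },
}
print(unhealthy_components(result))  # []
```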
Component Status
Check individual component health.
Queue Status
Monitor the processing queue:
```shell
curl http://localhost:1933/api/v1/observer/queue \
  -H "X-API-Key: your-key"
```
VikingDB Status
Check vector database health:
```shell
curl http://localhost:1933/api/v1/observer/vikingdb \
  -H "X-API-Key: your-key"
```
VLM Status
Monitor Vision Language Model health:
```shell
curl http://localhost:1933/api/v1/observer/vlm \
  -H "X-API-Key: your-key"
```
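The three component endpoints share one URL pattern, so they can be polled with a single function. A sketch using only the standard library; the response envelope is assumed to match the /observer/system example, and the base URL and key are placeholders for your deployment:

```python
import json
import urllib.request

BASE_URL = "http://localhost:1933"

def observer_url(base: str, component: str) -> str:
    # component is one of the endpoints shown above: queue, vikingdb, vlm
    return f"{base}/api/v1/observer/{component}"

def component_status(component: str, api_key: str) -> dict:
    """Fetch one observer endpoint and return its parsed "result" object."""
    req = urllib.request.Request(
        observer_url(BASE_URL, component),
        headers={"X-API-Key": api_key},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp).get("result", {})
```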
Response Time Monitoring
Every API response includes processing time in the X-Process-Time header.
```shell
curl -v "http://localhost:1933/api/v1/fs/ls?uri=viking://" \
  -H "X-API-Key: your-key" 2>&1 | grep X-Process-Time
# < X-Process-Time: 0.0023
```
Time is measured in seconds and represents server-side processing time only.
Kubernetes Integration
Configure health probes for Kubernetes deployments:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: openviking
spec:
  template:
    spec:
      containers:
        - name: openviking
          image: ghcr.io/volcengine/openviking:main
          ports:
            - containerPort: 1933
          livenessProbe:
            httpGet:
              path: /health
              port: 1933
            initialDelaySeconds: 10
            periodSeconds: 10
            timeoutSeconds: 5
            failureThreshold: 3
          readinessProbe:
            httpGet:
              path: /ready
              port: 1933
            initialDelaySeconds: 5
            periodSeconds: 5
            timeoutSeconds: 3
            failureThreshold: 3
```
Liveness Probe
Uses /health to check if the container is alive. Kubernetes restarts the pod if this fails.
Readiness Probe
Uses /ready to check if the service is ready to accept traffic. Kubernetes removes the pod from service endpoints if this fails.
Monitoring Best Practices
Configure alerts for component health:

```python
import time

import openviking as ov

client = ov.SyncHTTPClient(url="http://localhost:1933", api_key="key")

while True:
    status = client.get_status()
    if not status['is_healthy']:
        # Send alert
        errors = status.get('errors', [])
        print(f"ALERT: System unhealthy - {errors}")
    for name, info in status['components'].items():
        if not info['is_healthy']:
            print(f"ALERT: {name} unhealthy")
    time.sleep(60)  # Check every minute
```

Watch for queue backlog using the same client:

```python
queue_status = client.observer.queue
if queue_status.get('queue_size', 0) > 100:
    print("WARNING: Queue backlog detected")
```
Track response times via the X-Process-Time header:

```python
import httpx

url = "http://localhost:1933/api/v1/fs/ls?uri=viking://"
headers = {"X-API-Key": "your-key"}

response = httpx.get(url, headers=headers)
process_time = float(response.headers.get("X-Process-Time", 0))
if process_time > 1.0:
    print(f"WARNING: Slow response: {process_time}s")
```
Log health metrics for trend analysis:

```python
import logging

logger = logging.getLogger(__name__)

status = client.get_status()
logger.info(f"System health: {status['is_healthy']}")
logger.info(f"Queue size: {client.observer.queue.get('queue_size', 0)}")
logger.info(f"VLM calls: {client.observer.vlm.get('total_calls', 0)}")
```
Prometheus Integration
While OpenViking doesn’t expose Prometheus metrics directly, you can create a simple exporter:
```python
import time

import openviking as ov
from prometheus_client import Gauge, start_http_server

# Define metrics
system_health = Gauge('openviking_system_healthy', 'System health status')
queue_size = Gauge('openviking_queue_size', 'Processing queue size')
response_time = Gauge('openviking_response_time', 'API response time')

client = ov.SyncHTTPClient(url="http://localhost:1933", api_key="key")

def collect_metrics():
    while True:
        try:
            # System health
            status = client.get_status()
            system_health.set(1 if status['is_healthy'] else 0)

            # Queue size
            queue_status = client.observer.queue
            queue_size.set(queue_status.get('queue_size', 0))
        except Exception as e:
            print(f"Metrics collection error: {e}")
        time.sleep(15)

if __name__ == '__main__':
    start_http_server(8000)  # Prometheus metrics on :8000
    collect_metrics()
```
Add to prometheus.yml:
```yaml
scrape_configs:
  - job_name: 'openviking'
    static_configs:
      - targets: ['localhost:8000']
```
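Once the exporter is scraped, basic alerting rules can fire on the gauges defined above. A sketch for a Prometheus rules file; the thresholds, durations, and labels are illustrative, not prescribed by OpenViking:

```yaml
groups:
  - name: openviking
    rules:
      - alert: OpenVikingUnhealthy
        expr: openviking_system_healthy == 0
        for: 2m
        labels:
          severity: critical
        annotations:
          summary: "OpenViking reports an unhealthy component"
      - alert: OpenVikingQueueBacklog
        expr: openviking_queue_size > 100
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: "OpenViking processing queue backlog"
```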
Troubleshooting
Symptom: is_healthy: false for a component

Solutions:
- Check component logs
- Verify configuration (API keys, endpoints)
- Test network connectivity
- Restart the service
```shell
# Check detailed status
curl http://localhost:1933/api/v1/observer/system \
  -H "X-API-Key: key" | jq '.result'
```
Symptom: Processing queue grows without clearing

Solutions:
- Increase embedding concurrency:
  ```json
  {"embedding": {"max_concurrent": 20}}
  ```
- Check VLM rate limits
- Scale horizontally with more workers
Symptom: X-Process-Time > 1 second

Solutions:
- Check vector database performance
- Optimize search parameters (limit, threshold)
- Use local storage for better latency
- Enable caching
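When hunting slow endpoints, the threshold check can be factored into a helper that works on any response's headers. A minimal sketch; X-Process-Time is the documented header, while `is_slow` and its default threshold are hypothetical:

```python
def is_slow(headers: dict, threshold: float = 1.0) -> bool:
    # X-Process-Time is reported in seconds of server-side processing time
    return float(headers.get("X-Process-Time", 0.0)) > threshold

print(is_slow({"X-Process-Time": "0.0023"}))  # False
print(is_slow({"X-Process-Time": "2.5"}))     # True
```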
See Also
- Deployment: Server deployment and configuration
- Configuration: System configuration reference
- Python SDK: Client health check methods
- CLI Usage: CLI monitoring commands