System
System Metrics
GET
System Metrics
Get detailed metrics and status for all system components including queue system, VikingDB, and VLM token usage.
Response:
Response:
The observer API provides component-level monitoring for production deployments.
Authentication
Requires API key authentication viaX-API-Key header.
Available Metrics
Queue System
Get status of embedding and semantic processing queues.VikingDB Status
Get VikingDB collection and vector count information.VLM Token Usage
Get Vision Language Model token usage statistics.Overall System Status
Get combined status of all components.Response Schemas
Component Status Response
Response status (
ok or error)Component status information
Request processing time in seconds
System Status Response
Response status (
ok or error)Overall system status
Request processing time in seconds
Response Examples
Wait for Processing
Wait for all asynchronous processing (embedding, semantic generation) to complete.Health Check
Quick boolean health check:Monitoring Best Practices
Use Readiness Probes in Kubernetes
Use Readiness Probes in Kubernetes
Configure the
/ready endpoint (no auth required) as your readiness probe:Monitor Queue Depths
Monitor Queue Depths
If
pending or in_progress counts remain high:- Check embedding service health
- Verify VikingDB connectivity
- Consider scaling workers
Track VLM Token Usage
Track VLM Token Usage
Monitor token consumption to:
- Predict costs
- Identify usage spikes
- Detect potential abuse
Related Endpoints
- System Status - Basic status check
- Wait for Processing - Ensure async operations complete
System Metrics
