
Production Monitoring

This guide sets up ongoing observability for a Livepeer Gateway running in production. It covers Prometheus metrics collection, Grafana dashboards, log monitoring, the Livepeer Explorer, and automated health alerts. This guide includes:
  • Monitoring setup for , Pipelines and service routing
  • Installation guides for (recommended), , and
This page is about ongoing production monitoring - watching the Gateway over time with metrics, dashboards, and alerts. For one-time verification that the Gateway started correctly, see the verify step in the setup journey.

Livepeer Explorer

The Livepeer Explorer provides a network-level view of all registered Gateways on Arbitrum One. It is accessible from any OS and requires no local installation.

Explorer columns

The Gateway Explorer table displays the following for each registered Gateway:
After you register an on-chain Gateway, it may take a few minutes to appear in the Explorer. Search by the ETH address shown in livepeer_cli Option 1 (under BROADCASTER STATS).
Off-chain Gateways do not appear in the Explorer. Use Livepeer Tools for network-wide AI Orchestrator visibility instead.

Gateway Explorer

View registered Gateways, deposit balances, and network activity on Arbitrum.

Livepeer Tools

Community dashboard for Gateway performance metrics and AI Orchestrator status.

Key Metrics Reference

All Gateway types (Video, AI, Dual) expose metrics on the same -httpAddr port (default 8935), so a single Prometheus scrape job covers all workload types. The full metric table is on the Prometheus Metrics page.
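A single scrape job of this kind might look like the sketch below. It is illustrative rather than an official configuration: the /metrics path, the localhost target, and the 15s interval are assumptions, and it presumes the Gateway was started with metrics enabled. The job name is chosen so that it matches the job=~"livepeer.*" selector used by the GatewayDown alert on this page.

```yaml
# prometheus.yml (fragment): one scrape job for a Gateway of any workload type.
scrape_configs:
  - job_name: livepeer-gateway     # matches the job=~"livepeer.*" selector in GatewayDown
    metrics_path: /metrics         # assumption: metrics are served at this path
    scrape_interval: 15s           # illustrative value, tune to taste
    static_configs:
      - targets:
          - "localhost:8935"       # the Gateway's -httpAddr host:port (default port 8935)
```

If Prometheus runs on a different host or in a separate container, replace localhost with an address at which the Gateway's -httpAddr port is reachable.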

Grafana Dashboards

The livepeer/livepeer-monitoring Docker stack includes three starter Grafana dashboards:
  • Active sessions, success rate, transcoding latency, and Orchestrator connectivity. The primary operational dashboard for day-to-day monitoring.
  • ETH deposit and reserve over time, ticket value sent, and payment creation errors. Critical for on-chain Gateways to track deposit depletion rate.
  • Realtime ratios, latency distribution (p50, p95, p99), and segment success/failure breakdown. Useful for identifying quality degradation patterns.
Custom panels to consider adding:
  • Deposit depletion forecast (linear regression on livepeer_gateway_deposit over 24h)
  • Session count vs -maxSessions limit as a percentage gauge
  • AI discovery error rate overlaid with session count (correlation check)
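The deposit depletion forecast can be sketched with PromQL's built-in linear regression functions, predict_linear and deriv. The recording rules below are one possible approach, not part of the livepeer-monitoring stack; the group name and both rule names are hypothetical. The same expressions can also be pasted directly into a Grafana panel query.

```yaml
groups:
  - name: livepeer-gateway-forecast          # hypothetical group name
    rules:
      # Projected deposit (wei) 24 hours from now, from a least-squares fit
      # over the last 24 hours of samples.
      - record: livepeer:gateway_deposit:predicted_24h     # hypothetical rule name
        expr: predict_linear(livepeer_gateway_deposit[24h], 86400)

      # Estimated seconds until the deposit reaches zero at the current trend.
      # Only produces a value while the 24h trend is downward.
      - record: livepeer:gateway_deposit:seconds_to_empty  # hypothetical rule name
        expr: |
          livepeer_gateway_deposit / -deriv(livepeer_gateway_deposit[24h])
          and deriv(livepeer_gateway_deposit[24h]) < 0
```

A 24h window smooths out short bursts of ticket activity. A shorter window reacts faster to a sudden increase in spend but produces a noisier forecast.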

Alert Rules

Configure these rules in Prometheus to receive automated notifications when the Gateway needs attention.

Critical

groups:
  - name: livepeer-gateway-critical
    rules:
      - alert: GatewayDown
        expr: up{job=~"livepeer.*"} == 0
        for: 1m
        labels:
          severity: critical
        annotations:
          summary: "Livepeer Gateway is unreachable"

      - alert: DepositExhausted
        expr: livepeer_gateway_deposit < 10000000000000000
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "Gateway ETH deposit critically low ({{ $value | humanize }} wei)"

      - alert: SuccessRateCritical
        expr: |
          rate(livepeer_segment_transcoded_total[5m]) /
          rate(livepeer_segment_source_appeared_total[5m]) < 0.90
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "Transcoding success rate below 90%"

Warning

groups:
  - name: livepeer-gateway-warning
    rules:
      - alert: DepositLow
        expr: livepeer_gateway_deposit < 50000000000000000
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: "Gateway ETH deposit below 0.05 ETH - top up soon"

      - alert: HighTranscodeLatency
        expr: |
          rate(livepeer_transcode_overall_latency_seconds_sum[5m]) /
          rate(livepeer_transcode_overall_latency_seconds_count[5m]) > 5
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "Average transcoding latency exceeds 5 seconds"

      - alert: PaymentCreateErrors
        expr: rate(livepeer_payment_create_errors[5m]) > 0
        for: 2m
        labels:
          severity: warning
        annotations:
          summary: "Ticket creation errors - check Arbitrum RPC health"

      - alert: AIDiscoveryErrors
        expr: rate(livepeer_discovery_errors_total[5m]) > 0
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "Orchestrator discovery failing - check -orchAddr list"
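
For Prometheus to evaluate alert rules and deliver notifications, the rule files must be listed in prometheus.yml and an Alertmanager must be configured. The fragment below is a minimal sketch: both file paths are hypothetical, and the Alertmanager target assumes a local instance on its default port 9093.

```yaml
# prometheus.yml (fragment): load the alert rules and route firing alerts.
rule_files:
  - "rules/livepeer-gateway-critical.yml"   # hypothetical path to the critical rules
  - "rules/livepeer-gateway-warning.yml"    # hypothetical path to the warning rules

alerting:
  alertmanagers:
    - static_configs:
        - targets:
            - "localhost:9093"              # assumption: Alertmanager on its default port
```

Each rule file needs its own top-level groups: key. Running promtool check rules against a file before reloading Prometheus catches YAML and PromQL syntax errors early.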

Prometheus Metrics

Full table of all livepeer_* metrics.

On-Chain Metrics

TicketBroker events and Arbitrum contract monitoring.

Tools & Dashboards

Explorer, Livepeer Tools, livepeer_cli.
Last modified on April 7, 2026