Monday, 26 October 2020

Prometheus

  • The Prometheus monitoring system and time series database
  • To monitor workloads
  • open source monitoring solution
  • Scape data/metrics using http end points
  • All data is stored as TimeSeries
    • Every time series is identified by metric name and a set of key-value pairs, called labels

  • Types of metrics
    • Counter
      • Value that only goes up
    • Gauge
      • Numeric value that can go up and down(like cpu load)
    • Histogram
      • Sample observations(request duration, response sizes)
    • Summary
  • PromQL
    • Provides builtin operators and functions
    • expressions over time-series vectors
  • Querying metrics
    • Instant Vector
        • single sample for each time series - all sharing the same time stamp
        • eg: node_cpu_seconds_total
    • Range Vector
      • range of data points over time for each time series
      • eg: node_cpu_seconds_total[5min]
    • Scalar
      • Simple numeric floating point value
  • Service discovery
    • Automatic detection of devices and services offered by these devices on a computer network
  • Exporters
    • Export third party metrics  to Prometheus metrics

Alerting

  • Alerting rules
    • Rules live in Prometheus server config
    •  
  • Alert manager

No comments:

Post a Comment