We started monitoring our infrastructure with Icinga2 and display performance metrics using Grafana and InfluxDB. It kind of works, but sometimes there are gaps in the graph:
But if I hover over the graph, I see that there are no data points between 29.9. 2:50 up until 29.9. 12:18.
The service checks are executed every 3 minutes. Sometimes Grafana/InfluxDB is randomly missing performance data
But I can’t figure out why we don’t get the performance data to display…
Has anyone faced the same issue or knows how I can solve it?
Hi @lgubler,
Might it just be the case of the Monitoring Plugin not reporting performance data in that time frame? Was there a different state in that time frame? An Alarm for example?
Thanks @lorenz for the reply earlier. I was able to figure out why it wasn’t working. Icinga2 itself and the monitoring were fine. The issue was with my Influxdb2Writer configuration.
When I had enable_send_metadata = true, InfluxDB wasn’t able to properly process the results. In the logs I saw lots of these warnings:
Please keep in mind that this is the first time I’m using Icinga2, InfluxDB and Grafana in production. If anyone knows about some improvements or explanations why InfluxDB failed when processing the metadata, please let me know!
Ok, the Influx error message gives us a hint. There are some edge cases where icinga2 tries to write “invalid” performance data (“invalid” in the sense that InfluxDB rejects them for some reason).
The leads to some weird effects, since icinga2 bundles a lot of data points (1024 (flush_threshold) by default or less depending on the flush_interval) and if the write request is rejected all of those are dropped.
You are currently circumventing this by writing every data point in it’s own request (which will be slower/more expensive).
In the log of icinga2 you should be able to find error message like
warning/Influxdb2Writer: Ignoring invalid perfdata for checkable
which should point you to the failing data point. Ideally you could report that so icinga2 can be modified to filter them out ahead of time.