Hello, we have custom script(plugin) to monitor device interfaces (96 of them) and few attributes of each interfaces. Script prints 5 x 96 times PerfData output which is all under single service. One of the attributes triggers the exit code.
Issue is that if there is an active “Critical” alert which is been ACK’ed. 2nd alert on different interface is not reported.
Issues could be solved by following:
(1) Alert on Performance Data. We noticed the threshold creates colored DOT but does does not create actual alert reporting the issue. Is there a way to make PerfData threshold a true alert?
(2) Somehow create service for each output of the script. Such that each interface is its own service with alerts.
Script takes advantage of bulk information pull so and we like to avoid running it 100(device) x 96 (interface) x 5(attributes) times.