I’ve been working with Icinga2 for a while now and was curious as to what everyone’s workflows are like.
For me, I use the Dashing dashboard for a high-level view of everything going on, which includes the number of current host/service problems and list of individual problems as well as rotating NagVis maps of high-level views of my environment integrated in an iFrame on the dashboard. If there is some problem that pops up, the NagVis map will visibly and audibly alert me (I’ll also get an email or SMS) and I can then click on the node to bring me to a more detailed NagVis map of the network device with its port mappings, health, and resource statuses (if applicable). Then, if I want to get an even more granular view, I can click on one of the health or resource nodes from NagVis to bring me to a Grafana graph of the performance history of that host/service. Through this process, I’m able to find where, when, and sometimes how the problem occurred.
I’m now wondering how you guys implement Icinga into your environments and if there is any room for improvement for my set up, plus, perhaps I’ve sparked some ideas for your Icinga set ups as well!