I have the following problem: Sometimes one of my VMs “freezes” (for 1-2 minutes). Then SSH is no longer possible and commands on the command line are executed afterwards.
I would like to monitor these dropouts, i.e. set a flag when this problem occurs again.
How can I solve this? I had thought of a cronjob which saves a timestamp to a file every minute and in case of a failure this entry is missing. But an evaluation of the file would run on the same machine and if the machine is frozen then no evaluation or transfer to CHECK_MK happens…
Anybody got an idea?