Nagios xi Not Sending Recoveries Intermittently

This support forum board is for support questions relating to Nagios xi, our flagship commercial network monitoring solution.
pcappell
Posts: 24
Joined: Tue Jul 28, 2020 2:43 pm

Nagios xi Not Sending Recoveries Intermittently

Post by pcappell »

Good Afternoon,

We have just experienced what we would consider an anomaly in our environment; several critical service alerts (which sent email notifications) did not send their respective recovery emails after the service had recovered.

We are using RHEL, and I have checked /usr/local/nagios/var/nagios.log and do not see any indication of an attempt in that log to send out an email, which I believe rules out any SMTP or sendmail (we do not use sendmail) issues.

These service checks have not experienced this issue in the past; this appears to be an isolated event that we are looking to investigate.

Thank you!
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Nagios xi Not Sending Recoveries Intermittently

Post by benjaminsmith »

Hi,

This could be the result of a number the things. The first step would be to verify the recovery notifications are enabled both for the service or host object and also for the contact. You can verify this in the CCM or search through the objects.cache file at:

Code: Select all

/usr/local/nagios/var/objects.cache
Another possibility would be the time periods, check the configurations to make sure the time periods are set corretly.

More details on checking the configurations in the KB article below.

Nagios xi - Notification Problems

If that's not the issue, then check to make sure that the host or service did actually recover. You can verify this by running a State History report on the object. Go to Reports > Available Reports > State History.

If you're not able to find the cause, please PM the system profile and the exact name of the service and I'll take a closer look for you.

To send us your system profile
.
Login to the Nagios xi GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" butt
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
pcappell
Posts: 24
Joined: Tue Jul 28, 2020 2:43 pm

Re: Nagios xi Not Sending Recoveries Intermittently

Post by pcappell »

Hello Benjamin,

I have verified the troubleshooting steps listed above and everything checks out.

I have sent over a System Profile via private message.

Best,
Paul
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Nagios xi Not Sending Recoveries Intermittently

Post by benjaminsmith »

Hi Paul,

I just checked my messages and I do not have a recent one from pcappell. Can you send this once more? Thanks, Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
pcappell
Posts: 24
Joined: Tue Jul 28, 2020 2:43 pm

Re: Nagios xi Not Sending Recoveries Intermittently

Post by pcappell »

Hi Ben,

Did you receive the system profile? I believe I sent it over on 4/24.

Best,
Paul
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Nagios xi Not Sending Recoveries Intermittently

Post by benjaminsmith »

Hi Paul,

Sorry for the delay, I do see the one from April 13 and looked up services you provided and the settings look good, as you mentioned. If the service is flapping ( go back and forth between OK and Non-OK), that would suppress any notifications. Another possibility is if the host was down.

At this point, we would need to search through the log files directly to track the status changes of these services and host. The log files are located in the following directory.

Code: Select all

/usr/local/nagios/var/archives
They are rotated every 24 hours, so the data in the file labeled 5-5-2021, would be from 5-04-2021. If this happened on 4/6, please zip the files from 4/3 to 4/9 for good measure.

Thanks!
Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!