Active checks stuck every day at same time

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
pdusza
Posts: 8
Joined: Mon Aug 01, 2011 12:47 pm

Active checks stuck every day at same time

Post by pdusza »

Hi,

From about three days I have a strange issue with nagiosXI. At about 1 am almost all active checks and performance graph stuck. What is really interesting 'Next check' date is set to another day at midnight. The only thing which I can do is to schedule immediate check on all host and services.
Some hosts/services are ok (updated and checked) but most of them just stuck. I checked the npcd logs, nagios logs and I don't see there any errors, NPCD deamon is running. There is no differences between hosts/services definition of those which stuck and running (same templates).

I have no idea what could be wrong.

Please look at screens:
Capture1.PNG
Capture3.PNG
Capture4.PNG
You do not have the required permissions to view the files attached to this post.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Active checks stuck every day at same time

Post by mguthrie »

That is pretty odd. Can you show us the output from the following command?

Code: Select all

chage -l nagios

First, lets make sure we don't have multiple instances of Nagios that have spawned. Run the following from the command-line:

Code: Select all

service nagios stop
killall -9 nagios
service nagios start

Then lets check to make sure there isn't any DB corruption:
http://assets.nagios.com/downloads/nagi ... tabase.pdf
pdusza
Posts: 8
Joined: Mon Aug 01, 2011 12:47 pm

Re: Active checks stuck every day at same time

Post by pdusza »

Hi,

here is a output of that commands:
Capture5.PNG
and here is an output of repairing process (no errors):
result.txt
You do not have the required permissions to view the files attached to this post.