Monitoring Engine Status Process State red

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
mejokj
Posts: 353
Joined: Mon Jul 22, 2013 10:31 pm

Monitoring Engine Status Process State red

Post by mejokj »

Hi Team,

We are continuously getting Monitoring Engine Status Process State red.

On checking, we could see that there is a TIMEOUT error on perfdata.log. Attached the screenshot.

So we are going to increase the timeout value for perfdata processing on process_perfdata.cfg from 40 to 60 or 80.

Our question is, Will this increase in timeout cause any performance issues?

===========================================================================

Current timeout value


[root@nagios pnp]# cat process_perfdata.cfg
#
# Config File for process_perfdata.pl
#
# $Id: process_perfdata.cfg-sample.in 520 2008-09-16 12:50:10Z pitchfork $
#
# process_perfdata.pl Timout
#
TIMEOUT = 40

================================================================

Thanks
You do not have the required permissions to view the files attached to this post.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Monitoring Engine Status Process State red

Post by benjaminsmith »

HI,
Our question is, Will this increase in timeout cause any performance issues?
No, that won't affect the performance. However, there is max load threshold for NPCD, and increasing this too much will cause problems since the performance process will take as many resources as you allow it to.

That said, issues with the performance graph normally do not cause the monitoring engine to stop.

Do you have enough memory and disk space? Check the nagios.log for error messages.

Code: Select all

tail -n 100 /usr/local/nagios/var/nagios.log
--Benjamin

If the issue persists, please send us a system profile.

To send us your system profile.
Login to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" button
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
mejokj
Posts: 353
Joined: Mon Jul 22, 2013 10:31 pm

Re: Monitoring Engine Status Process State red

Post by mejokj »

below is the ram and cpu of the server.


[root@nagios libexec]# free -m
total used free shared buff/cache available
Mem: 31897 2834 882 1621 28180 26858
Swap: 30719 341 30378


[root@nagios libexec]# lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 56
On-line CPU(s) list: 0-55
Thread(s) per core: 2
Core(s) per socket: 14
Socket(s): 2
NUMA node(s): 2
Vendor ID: GenuineIntel
CPU family: 6
Model: 79
Model name: Intel(R) Xeon(R) CPU E5-2660 v4 @ 2.00GHz
Stepping: 1
CPU MHz: 1999.921
BogoMIPS: 3996.49
Virtualization: VT-x
L1d cache: 32K
L1i cache: 32K
L2 cache: 256K
L3 cache: 35840K
NUMA node0 CPU(s): 0-13,28-41
NUMA node1 CPU(s): 14-27,42-55
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Monitoring Engine Status Process State red

Post by benjaminsmith »

Hi,

The specs look good. Did you find anything errors standing out in the nagios log? Please PM the system profile so we can help troubleshoot this for you.

Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
mejokj
Posts: 353
Joined: Mon Jul 22, 2013 10:31 pm

Re: Monitoring Engine Status Process State red

Post by mejokj »

Hi Benjamin,

We could not see anything unusual in the Nagios log files.

Send you the system profile as PM to you.

Please have a look.

Thanks
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Monitoring Engine Status Process State red

Post by benjaminsmith »

Hi,

I'm not seeing anything in the logs related to the nagios process crashing or stopping. Memory and disk space are good. Are you still getting notifications from the system? Are you setting check results being updated in the GUI? How often is the process stopping?

I noticed you have livestatus integrated, are there any other customizations on this server?

Since the database is offloaded the log was not included in the profile, can you retrieve this from the remote host and upload it to the ticket?

Thanks,
Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!