Monitoring Engine Status Process State red

mejokj · Post by **mejokj** » Thu Mar 04, 2021 3:33 am

Hi Team,

We keep getting a red Process State under Monitoring Engine Status.

On checking, we could see a TIMEOUT error in perfdata.log. A screenshot is attached.

So we are planning to increase the timeout value for perfdata processing in process_perfdata.cfg from 40 to 60 or 80.

Our question is: will this increase in the timeout cause any performance issues?

===========================================================================

Current timeout value

[root@nagios pnp]# cat process_perfdata.cfg
#
# Config File for process_perfdata.pl
#
# $Id: process_perfdata.cfg-sample.in 520 2008-09-16 12:50:10Z pitchfork $
#
# process_perfdata.pl Timout
#
TIMEOUT = 40

================================================================
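For anyone following along, here is a minimal sketch of how the log check and the config change described above could be scripted. The function names are hypothetical, and the paths in the usage comments are assumptions based on a typical Nagios XI layout, so adjust them to your install.

```shell
#!/bin/sh
# Hypothetical helpers; not part of Nagios XI or PNP.

# Count lines mentioning TIMEOUT in a perfdata log.
count_perfdata_timeouts() {
    grep -c 'TIMEOUT' "$1"
}

# Set the TIMEOUT value in a process_perfdata.cfg-style file,
# keeping a copy of the previous version as <file>.bak.
set_perfdata_timeout() {
    cfg="$1"
    secs="$2"
    cp -p "$cfg" "$cfg.bak" || return 1
    # Read from the backup and write into the original file,
    # so the original keeps its ownership and mode.
    sed "s/^TIMEOUT[[:space:]]*=.*/TIMEOUT = $secs/" "$cfg.bak" > "$cfg"
}

# Example usage (both paths are assumptions):
#   count_perfdata_timeouts /usr/local/nagios/var/perfdata.log
#   set_perfdata_timeout /usr/local/nagios/etc/pnp/process_perfdata.cfg 60
```

Counting the TIMEOUT lines before and after the change gives a rough idea of whether the new value actually helps.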

Thanks

benjaminsmith · Post by **benjaminsmith** » Thu Mar 04, 2021 6:09 pm

Hi,

Our question is, Will this increase in timeout cause any performance issues?

No, that won't affect performance. However, there is a max load threshold for NPCD, and raising that threshold too far will cause problems, since perfdata processing will take as many resources as you allow it to.

That said, issues with the performance graph normally do not cause the monitoring engine to stop.

Do you have enough memory and disk space? Check the nagios.log for error messages.

tail -n 100 /usr/local/nagios/var/nagios.log
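If the last 100 lines are mostly routine check output, a filter can help. The following is a rough sketch only: the function name is hypothetical and the keyword list is an assumption about what is worth looking at, not an official list of Nagios error strings.

```shell
#!/bin/sh
# Hypothetical helper: print lines from the last N lines of a log
# that mention errors, warnings, or timeouts (case-insensitive).
scan_recent_log() {
    log="$1"
    n="${2:-100}"    # default to the last 100 lines
    tail -n "$n" "$log" | grep -Ei 'error|warning|timed out|timeout'
}

# Example usage:
#   scan_recent_log /usr/local/nagios/var/nagios.log 1000
```

No output means none of the keywords appeared in the lines that were scanned.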

--Benjamin

If the issue persists, please send us a system profile.

To send us your system profile:
Log in to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" menu.
Click the "Download Profile" button.

mejokj · Post by **mejokj** » Fri Mar 05, 2021 9:53 am

Below are the RAM and CPU details of the server.

[root@nagios libexec]# free -m
              total        used        free      shared  buff/cache   available
Mem:          31897        2834         882        1621       28180       26858
Swap:         30719         341       30378

[root@nagios libexec]# lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 56
On-line CPU(s) list: 0-55
Thread(s) per core: 2
Core(s) per socket: 14
Socket(s): 2
NUMA node(s): 2
Vendor ID: GenuineIntel
CPU family: 6
Model: 79
Model name: Intel(R) Xeon(R) CPU E5-2660 v4 @ 2.00GHz
Stepping: 1
CPU MHz: 1999.921
BogoMIPS: 3996.49
Virtualization: VT-x
L1d cache: 32K
L1i cache: 32K
L2 cache: 256K
L3 cache: 35840K
NUMA node0 CPU(s): 0-13,28-41
NUMA node1 CPU(s): 14-27,42-55

benjaminsmith · Post by **benjaminsmith** » Fri Mar 05, 2021 6:12 pm

Hi,

The specs look good. Did you find any errors standing out in the Nagios log? Please PM the system profile so we can help troubleshoot this for you.

Benjamin

mejokj · Post by **mejokj** » Tue Mar 09, 2021 2:56 am

Hi Benjamin,

We could not see anything unusual in the Nagios log files.

I have sent you the system profile by PM.

Please have a look.

Thanks

benjaminsmith · Post by **benjaminsmith** » Tue Mar 09, 2021 6:20 pm

Hi,

I'm not seeing anything in the logs related to the nagios process crashing or stopping. Memory and disk space are good. Are you still getting notifications from the system? Are you seeing check results being updated in the GUI? How often is the process stopping?
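One rough way to estimate how often the engine has restarted is to count its startup lines in the log. This is a sketch under an assumption: that the log contains Nagios Core's usual startup message of the form "Nagios <version> starting... (PID=<n>)". The function name is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: count how many times the engine logged a start
# in the given log file. Assumes the startup line looks like
# "Nagios <version> starting... (PID=<n>)".
count_engine_starts() {
    grep -c 'Nagios .* starting\.\.\. (PID=' "$1"
}

# Example usage:
#   count_engine_starts /usr/local/nagios/var/nagios.log
# To see whether the engine is running right now (assumes the process
# is named "nagios"):
#   pgrep -x nagios >/dev/null && echo running || echo stopped
```

A count higher than the number of restarts you performed yourself would suggest the process is being stopped and started by something else.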

I noticed you have livestatus integrated. Are there any other customizations on this server?

Since the database is offloaded, its log was not included in the profile. Can you retrieve it from the remote host and upload it to the ticket?

Thanks,
Benjamin

Nagios Support Forum
