Performance graphs for Services always empty?

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
KDA
Posts: 16
Joined: Wed Aug 11, 2021 1:28 am

Performance graphs for Services always empty?

Post by KDA »

Hi

Whereas I get Performance graphs for Hosts, I have never seen anything for services other than: (when i click on the Performance graphs tab in a Service Details page)

"No performance graphs were found for this service. If you have just started monitoring this object then it may take up to 15 minutes for the performance graphs to appear."

...despite having run Nagios Xi literally for weeks.

My Administration/System Component Status/Performance grapher status is green/ok.

I checked these:
ls /usr/local/nagios/var/spool/perfdata/ | wc -l
ls /usr/local/nagios/var/spool/xidpe/ | wc -l

they are both 0 (zero)... so I don't know is this is a meaningful indicator of anything?

My perfdata.log says: (I don't know what to make of this?)

$ tail -f /usr/local/nagios/var/perfdata.log
2021-10-07 02:10:56 [11932] [0] *** TIMEOUT: Please check your npcd.cfg
2021-10-07 02:10:56 [11932] [0] *** TIMEOUT: Could not delete /usr/local/nagios/var/spool/perfdata//1633537059.perfdata.host-PID-11932:No such file or directory
2021-10-07 02:10:56 [11932] [0] *** Timeout while processing Host: ".pnp-internal" Service: "runtime"
2021-10-07 02:10:56 [11932] [0] *** process_perfdata.pl terminated on signal ALRM
2021-10-10 21:02:36 [3486] [0] *** TIMEOUT: Timeout after 5 secs. ***
2021-10-10 21:02:36 [3486] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2021-10-10 21:02:36 [3486] [0] *** TIMEOUT: Please check your npcd.cfg
2021-10-10 21:02:36 [3486] [0] *** TIMEOUT: Could not delete /usr/local/nagios/var/spool/perfdata//1633878333.perfdata.host-PID-3486:No such file or directory
2021-10-10 21:02:36 [3486] [0] *** Timeout while processing Host: ".pnp-internal" Service: "runtime"
2021-10-10 21:02:36 [3486] [0] *** process_perfdata.pl terminated on signal ALRM
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Performance graphs for Services always empty?

Post by pbroste »

Hello @KDA

Thanks for reaching out, looking over the details, want to verify that the 'npcd.service' is running:

Code: Select all

systemctl status npcd.service
Let's also make sure that the nagios User Account has become inactive:

Code: Select all

chage -I -1 -m 0 -M 99999 -E -1 nagios
Please send us the System Profile if things are not chugging along so we can see what is going on.

To send us your system profile.
  • Login to the Nagios XI GUI using a web browser.
  • Click the "Admin" > "System Profile" Menu
  • Click the "Download Profile" button
  • Save the profile.zip file and share
Or via command:

Code: Select all

rm -rf /usr/local/nagiosxi/var/components/profile.zip
/usr/local/nagiosxi/scripts/components/getprofile.sh SUPPORT

Then send the resulting /usr/local/nagiosxi/var/components/profile.zip​ file via Private Message.

Thanks,
Perry
KDA
Posts: 16
Joined: Wed Aug 11, 2021 1:28 am

Re: Performance graphs for Services always empty?

Post by KDA »

Hi again,

The npcd command gives me this, not sure what to make of it but it does not look good I presume?

[hb@localhost nagvis]$ systemctl status npcd.service
● npcd.service - LSB: Nagios NPCD Initscript
Loaded: loaded (/etc/rc.d/init.d/npcd; bad; vendor preset: disabled)
Active: active (running) since Mon 2021-10-25 06:19:27 UTC; 39min ago
Docs: man:systemd-sysv-generator(8)
Process: 1100 ExecStart=/etc/rc.d/init.d/npcd start (code=exited, status=0/SUCCESS)
Main PID: 1131 (npcd)
Tasks: 1
CGroup: /system.slice/npcd.service
└─1131 /usr/local/nagios/bin/npcd -d -f /usr/local/nagios/etc/pnp/npcd.cfg

Oct 25 06:19:27 localhost.localdomain systemd[1]: Starting LSB: Nagios NPCD Initscript...
Oct 25 06:19:27 localhost.localdomain npcd[1100]: NPCD started.
Oct 25 06:19:27 localhost.localdomain systemd[1]: Failed to parse PID from file /usr/local/nagiosxi/var/subsys/npcd.pid: Invalid argument
Oct 25 06:19:27 localhost.localdomain systemd[1]: Started LSB: Nagios NPCD Initscript.


With regards to this one: Let's also make sure that the nagios User Account has become inactive:
chage -I -1 -m 0 -M 99999 -E -1 nagios

...not sure I understand ... but i did it:

[hb@localhost nagvis]$ sudo chage -I -1 -m 0 -M 99999 -E -1 nagios
[sudo] password for hb:
[hb@localhost nagvis]$

I will send you the profile in a private msg.

-håkon
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Performance graphs for Services always empty?

Post by pbroste »

Hello @KDA

Thanks for following up with the details. In review, we see that there are tables in the database that are throwing errors and want to go ahead and run through the database repair.
  • Code: Select all

    /usr/local/nagiosxi/scripts/repair_databases.sh
Then restart the nagios.services and let us know how things look, and follow up with an updated System Profile if problems persist.

Also some housekeeping suggestions; we see that there is a continuous number of RSA certificate errors listed which are not match with the server name.
[ssl:warn] [pid 1137] AH01909: RSA certificate configured for localhost.localdomain:443 does NOT include an ID which matches the server name
Thanks,
Perry
KDA
Posts: 16
Joined: Wed Aug 11, 2021 1:28 am

Re: Performance graphs for Services always empty?

Post by KDA »

Hi again,

Thank you, i did the requested database repair (it appeared to fix this & that...) but the end results as far as the Performance graphs remains the same:

"No performance graphs were found for this service. If you have just started monitoring this object then it may take up to 15 minutes for the performance graphs to appear."

It has been running for about an hour now, no cigar... the host graph is there, though. Is it just me misunderstanding what I should be expecting to get in terms of graphs?

Maybe I could rephrase my question: my real issue is that I have a requirement from my end client (for whom I am setting up Nagios) that " (the system).....shall be able to display alarm history"

This requirement is not very specific... but maybe you could point me to what options i have with regards to displaying "alarm history" (I interpret this as stored results of OID/services polled which returned a non-OK result)

-håkon
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Performance graphs for Services always empty?

Post by pbroste »

Hello @KDA

Thanks for following up, there are some options to look at.

One; the option to use example command to process results:

Code: Select all

cat /usr/local/nagios/var/archives/* | grep ALERT
Secondly; to use the 'Core GUI' to list results as well:

Code: Select all

HTTP**(s)://xxx.xxx.xxx.xxx/nagios/
  • > Notifications > Pull-down-list (from upper-right)
**s: depending on the environment

Thanks,
Perry