low perfomance of nagiosxi

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
andaks
Posts: 32
Joined: Thu May 19, 2011 9:47 am

low perfomance of nagiosxi

Post by andaks »

Hello everybody:)
Guys, could you help me with nagiosxi perfomance optimization?
I have read several documents about nagiosxi optimization but I need detailed information.
Maybe some advices about apache-server optimization, about database opt, and of course nagios opt.
You do not have the required permissions to view the files attached to this post.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: low perfomance of nagiosxi

Post by scottwilkerson »

This picture isn't particularly useful in diagnosing the problem as it doesn't show what is using all the CPU...

One of our developer did an excellent writeup a couple months ago regarding performance tuning that have some additional items you may want to look into to reduce load
http://labs.nagios.com/2012/01/30/nagio ... g-disk-io/
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
andaks
Posts: 32
Joined: Thu May 19, 2011 9:47 am

Re: low perfomance of nagiosxi

Post by andaks »

oops...
Have some problems...
Try to optimize http://assets.nagios.com/downloads/nagi ... ios_XI.pdf
And now have no graphs. They stopped draw when i started installation of the latest rrdtool version from sources.
In the article I noticed the line where we should delete outdate rrdtool-perl. When I did it also have deleted dependence nagiosxideps.
After that I have no graphs. Althoug I was able to get rrd.journal file
Could you point me the right direction please.
andaks
Posts: 32
Joined: Thu May 19, 2011 9:47 am

Re: low perfomance of nagiosxi

Post by andaks »

rrdtool-perl...
Does it mean that I need a new version of it?
It was 1.38. But now I have rrdtool 1.47 from sources.
andaks
Posts: 32
Joined: Thu May 19, 2011 9:47 am

Re: low perfomance of nagiosxi

Post by andaks »

another question.
graphs appeared successfully from time to time. Everything can be fine within an hour, or may go wrong. In the second case graphs can not be updated some time and after that appear new created rrd.journal file. There are two files at this moment. Also rrd Displays graphics with a delay of two-three hours (in case when graphs are ok). I guess there is a problem with flushing data from the rrd.journal. Because when I do restart the service rrdcached, file rrd.journal become empty, without any data. How to fix this problem?
Thank you in advance guys. Will be waiting for feedback.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: low perfomance of nagiosxi

Post by scottwilkerson »

My guess this is caused by the load threshold in /usr/local/nagios/etc/pnp/npcd.cfg

With multi-CPU server we can increase this a lot
Edit /usr/local/nagios/etc/pnp/npcd.cfg and change

Code: Select all

load_threshold = 10.0
to

Code: Select all

load_threshold = 40.0
Also while in there, to decrease load you should make sure

Code: Select all

log_level = 0
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: low perfomance of nagiosxi

Post by mguthrie »

Can you post your /etc/sysconfig/rrdcached config?

Also, what is the size of your environment. How many hosts+services are you monitoring and how often do you have the checks running?
andaks
Posts: 32
Joined: Thu May 19, 2011 9:47 am

Re: low perfomance of nagiosxi

Post by andaks »

Hey guys! You are the best!!!
I have no problem with time now :)
Of course I almost always have threshold over 10, because of it I had such prroblem with graphs. Now its ok :)

But, my system is still overloaded.

rrdcached config:

Code: Select all

# Settings for rrdcached
OPTIONS="-l unix:/var/rrdtool/rrdcached/rrdcached.sock -F -s nagios -m 0660 -w 900 -z 90 -j /tmp/ -b /var/rrdtool/rrdcached -P FLUSH,PENDING"
RRDC_USER=nagios
There are 123 hosts, 1098 services. 4 CPU 4 Gb of RAM. Checking starts every 5 or 3 minutes. Mainly 5 minutes.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: low perfomance of nagiosxi

Post by mguthrie »

That CPU load seems awfully high for that kind of check load, unless it all happens to be checks against VMware. Are the performance graphs updating more consistently now?

Make sure you don't have corruption in your database, that'll eat CPU in a major way:
http://assets.nagios.com/downloads/nagi ... tabase.pdf

Also run:

Code: Select all

psql nagiosxi nagiosxi
vacuum;
vacuum analyze;
vacuum full;
\q
If your performance data processing gets behind, it can definitely create a snowball for your CPU usage. What kind of average CPU load are you running now the the graphs are updating more consistently?

I would take a look at this doc as well if you haven't already:
http://assets.nagios.com/downloads/nagi ... rmance.pdf
andaks
Posts: 32
Joined: Thu May 19, 2011 9:47 am

Re: low perfomance of nagiosxi

Post by andaks »

Yes, graphics are update every 15 minutes now :)
Already repaired mysql database nagios. But it seems that just indexes were fixed. Nothing more.
Run the following:

Code: Select all

#> psql nagiosxi nagiosxi
	nagiosxi=> vacuum;
	WARNING:  skipping "pg_database" --- only superuser can vacuum it
	WARNING:  skipping "pg_authid" --- only superuser can vacuum it
	WARNING:  skipping "pg_tablespace" --- only superuser can vacuum it
	WARNING:  skipping "pg_pltemplate" --- only superuser can vacuum it
	WARNING:  skipping "pg_shdepend" --- only superuser can vacuum it
	WARNING:  skipping "pg_shdescription" --- only superuser can vacuum it
	WARNING:  skipping "pg_auth_members" --- only superuser can vacuum it
	VACUUM
Now I have reduced load on my CPU but not enough:
You do not have the required permissions to view the files attached to this post.