Hi all.
I have some graphs for a Cisco router, and several of them show blank gaps in the middle of a traffic spike. How can I fix this?
Blank space in graphs
-
- Posts: 316
- Joined: Thu Mar 31, 2011 9:54 am
- Location: Patten, ME
Blank space in graphs
-
- Former Nagios Staff
- Posts: 13589
- Joined: Mon May 23, 2011 12:15 pm
Re: Blank space in graphs
Anything unusual in "/usr/local/nagios/var/npcd.log" or "/usr/local/nagios/var/perfdata.log"?
-
- Posts: 316
- Joined: Thu Mar 31, 2011 9:54 am
- Location: Patten, ME
Re: Blank space in graphs
Everything looked pretty normal in npcd.log. perfdata.log showed a lot of timing out though so I increased the parallel threads to 10 and wait time to 30 in npcd.cfg. Anything else I should look at?
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
Re: Blank space in graphs
cwscribner wrote: Everything looked pretty normal in npcd.log. perfdata.log showed a lot of timing out though so I increased the parallel threads to 10 and wait time to 30 in npcd.cfg. Anything else I should look at?
If you are getting timeouts, make sure that the server isn't too overloaded.
Also, make sure you restart npcd for the changes to take effect.
-
- Posts: 316
- Joined: Thu Mar 31, 2011 9:54 am
- Location: Patten, ME
Re: Blank space in graphs
The server is definitely overloaded and a new 16 core CPU is en route. So basically, the overload could be what's causing the blanks?
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
Re: Blank space in graphs
Well, the overloading is likely causing the timeouts that you described, which would keep the perfdata from being written correctly to the RRD file.
By default, npcd has a small safeguard against this: it checks the load on the server using the following line in npcd.cfg:
Code:
load_threshold = 10.0
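As a rough illustration of what a load guard like this does, here is a minimal Python sketch. It is not npcd's actual code: the function name `should_process` is hypothetical, and the comparison against the 1-minute load average is an assumption about how such a check typically works.

```python
import os


def should_process(load_threshold, load_1min=None):
    """Return True if the 1-minute load average is at or below the threshold.

    Hypothetical helper: illustrates the idea of a load guard only.
    """
    if load_1min is None:
        # os.getloadavg() returns the 1, 5 and 15 minute load averages (Unix only).
        load_1min = os.getloadavg()[0]
    return load_1min <= load_threshold


# Sample load values, chosen for illustration.
for load in (8.0, 12.0, 18.0):
    print(load, should_process(10.0, load))
```

If the load average regularly sits above the threshold, a guard of this kind would defer processing much of the time, so it limits extra load rather than fixing the underlying overload.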
-
- Posts: 316
- Joined: Thu Mar 31, 2011 9:54 am
- Location: Patten, ME
Re: Blank space in graphs
A few questions...
Is fixing the load problem the only way to fix this problem?
How much of a safeguard is that line? (The load is regularly in the 8-12 range and is often in the 15-20 range)
Will the npcd.cfg changes make a difference in lightening the load so the data can properly be processed?
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
Re: Blank space in graphs
cwscribner wrote: A few questions...
Is fixing the load problem the only way to fix this problem?
How much of a safeguard is that line? (The load is regularly in the 8-12 range and is often in the 15-20 range)
Will the npcd.cfg changes make a difference in lightening the load so the data can properly be processed?
Honestly, I doubt that will lighten the load, but I think it may help prevent the hard spikes if you set:
sleep_time = 10
npcd_max_threads = 5
I would also check the /usr/local/nagios/etc/pnp/npcd.cfg and /usr/local/nagios/etc/pnp/process_perfdata.cfg files and make sure logging is set to 0.
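To review what is currently set in those files, a small Python sketch like the one below can list the settings. It assumes the files use plain `key = value` lines with `#` comments; the function name `read_settings` is hypothetical. The exact name of the logging option is not given in this thread, so the sketch lists every setting rather than guessing at one.

```python
def read_settings(text):
    """Parse 'key = value' lines into a dict, skipping blanks and '#' comments."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip()
    return settings


# Sample content using the values mentioned in this thread.
sample = """
# illustrative npcd.cfg fragment
load_threshold = 10.0
sleep_time = 10
npcd_max_threads = 5
"""

for key, value in read_settings(sample).items():
    print(key, "=", value)
```

To run it against a real file, pass the file's contents instead of `sample`, for example the text read from /usr/local/nagios/etc/pnp/npcd.cfg.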
-
- Posts: 316
- Joined: Thu Mar 31, 2011 9:54 am
- Location: Patten, ME
Re: Blank space in graphs
Already decreased the logging verbosity. I figured that would help things along.
What do the sleep_time and npcd_max_threads parameters do in terms of system metrics and graphing? Will increasing these increase the server load? Will increasing them provide better or more accurate graphing? Are there implications if you do the inverse?
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
Re: Blank space in graphs
The amount of work to be done is the same, but if you increase them, npcd tries to do more work at the same time, which could cause timeouts if the load gets too high.
If you decrease them, it helps prevent large spikes in load and gives you a more even load instead.
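The tradeoff can be sketched with a toy model in Python. This is not how npcd schedules its work internally; `plan_batches` is a hypothetical helper that only shows that the total amount of work stays the same while the thread limit changes how much runs at once.

```python
def plan_batches(total_files, max_threads):
    """Split total_files into batches of at most max_threads files each.

    Toy model: each batch stands for files processed in parallel.
    """
    batches = []
    remaining = total_files
    while remaining > 0:
        batch = min(max_threads, remaining)
        batches.append(batch)
        remaining -= batch
    return batches


# Same backlog of 40 files (an illustrative number), two thread limits.
for threads in (5, 10):
    batches = plan_batches(40, threads)
    print("threads:", threads,
          "batches:", len(batches),
          "peak parallel:", max(batches),
          "total files:", sum(batches))
```

In this model a lower thread limit means more, smaller batches: the peak amount of parallel work drops, but every file still has to be processed.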