Hi,
I’m having a problem to show alerts from remote xi (5.5.8) to Fusion (4.1.6).
I can see that the credentials and the IP are in ok according to the fusion.
There are alerts in local xi and they are not appear in the fusion.
I ran /usr/local/nagiosfusion/cron/poll_subsys.php --debug --server 39 --user nagiosadmin and I see that the fusion poll all the servers and services.
Any ideas?
Fusion don't show alerts from local xi
-
- Madmin
- Posts: 9190
- Joined: Thu Oct 30, 2014 9:02 am
Re: Fusion don't show alerts from local xi
Can you login to the Fusion server as root and check the log file in this folder /usr/local/nagiosfusion/var/log for any errors when the system is polling the xi server?
Check these files for sure but the error may be in the other log file in that folder.
Check these files for sure but the error may be in the other log file in that folder.
Code: Select all
auth_subsys.log
poll_subsys.log
Be sure to check out our Knowledgebase for helpful articles and solutions!
-
- Posts: 144
- Joined: Wed Mar 28, 2018 6:23 am
Re: Fusion don't show alerts from local xi
HI,
auth_subsys.log
poll_subsys.log - is empty
I ran /usr/local/nagiosfusion/cron/poll_subsys.php --debug --server 39 --user nagiosadmin and i get the info.
I similate host and service alert but I can see only hosts alerts in the fusion.
From some reason I can' t see services alerts.
I cahnged the nagiosadmin password but it didn't help.
The fusion reach the remote xi (I ran xi system status using the API from Fusion and I get a result).
auth_subsys.log
Code: Select all
2019-02-13 21:05:34[s: 20, u: 0] Failed authentication check
2019-02-13 21:06:09[s: 20, u: 0] Failed authentication check
2019-02-13 21:06:32[s: 20, u: 0] Failed authentication check
2019-02-13 21:07:05[s: 20, u: 0] Failed authentication check
2019-02-13 21:07:35[s: 20, u: 0] Failed authentication check
2019-02-13 21:57:15[s: 11, u: 0] Failed authentication check
2019-02-13 21:57:35[s: 11, u: 0] Failed authentication check
2019-02-13 21:58:10[s: 11, u: 0] Failed authentication check
2019-02-13 21:58:34[s: 11, u: 0] Failed authentication check
2019-02-13 21:59:14[s: 11, u: 0] Failed authentication check
I ran /usr/local/nagiosfusion/cron/poll_subsys.php --debug --server 39 --user nagiosadmin and i get the info.
Code: Select all
OPERATING IN DEBUG MODE
2019-02-14 12:02:13[s: 0, u: 0] poll_server() unable to poll data for s:TEST, u:nagiosadmin, poll:servicestatus
2019-02-14 12:02:13[s: 0, u: 0] poll_server() CHECK YOUR LIVE_DATA_TIMEOUT SETTINGS. IT MAY NEED INCREASED
SERVER: TEST
USERNAME: nagiosadmin
POLLED_DATA:
Array
(
[server_id] => 39
[server_type] => 1
[authentication_type] => 0
[username] => nagiosadmin
[polled_time] => 1550138392
[data] => Array
(
[bpi] => Array
(
[linux-servers] => Array
(
[title] => HG: linux-servers
[state] => 0
[text] => Group health is 100% with 0 problem(s).
)
[all_dell_openmanage_oob_servers] => Array
(
[title] => SG: all_dell_openmanage_oob_servers
[state] => 0
[text] => Group health is 75% with 3 problem(s).
)
)
[hosts_pending] => 0
[hosts_up] => 34
[hosts_down] => 0
[hosts_unreachable] => 0
[hosts_count] => 34
[host_status] => Array
(
[localhost] => Array
(
[alias] => localhost
[current_state] => 0
[output] => OK - 127.0.0.1: rta 0.012ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.110ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.084ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.753ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.014ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.082ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.692ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.083ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.179ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.169ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.166ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.179ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.513ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.114ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.116ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.104ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.100ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.701ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.964ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.265ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.256ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.283ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.259ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.678ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.211ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.277ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.042ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.260ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.445ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.421ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.279ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.382ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 1.519ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
[10.10.10.10] => Array
(
[alias] => 10.10.10.10
[current_state] => 0
[output] => OK - 10.10.10.10: rta 0.244ms, lost 0%
[disabled] => 0
[downtime] => 0
[acknowledged] => 0
)
)
[hosts_problems] => 0
[hosts_problems_unhandled] => 0
[hosts_disabled] => 0
[hosts_acknowledged] => 0
[hosts_flapping] => 0
[hosts_downtime] => 0
[hosts_pending_disabled] => 0
[hosts_up_disabled] => 0
[hosts_up_downtime] => 0
[hosts_down_disabled] => 0
[hosts_down_acknowledged] => 0
[hosts_down_downtime] => 0
[hosts_unreachable_disabled] => 0
[hosts_unreachable_acknowledged] => 0
[hosts_unreachable_downtime] => 0
[monitoring_engine] => 1
[notifications] => 1
[active_checks] => 1
[passive_checks] => 1
[event_handlers] => 1
[users] => Array
(
[0] => Array
(
[user_id] => 3
[username] => operator
[name] => operator
[email] => operator@root.com
[enabled] => 1
)
[1] => Array
(
[user_id] => 1
[username] => nagiosadmin
[name] => Nagios Administrator
[email] => root@localhost
[enabled] => 1
)
)
[info] => Array
(
[product] => nagiosxi
[version] => 5.5.8
[version_major] => 5
[version_minor] => 5.8
[build_id] => 1544466765
[release] => 5508
)
[alert_list] => Array
(
[0] => Array
(
[time] => 2019-02-14 13:59:10
[type] => 65536
[host] => 10.10.10.10
[service] => check ntp time
[state_type] => SOFT
[state] => CRITICAL
[output] => NTP CRITICAL: Offset 652.473913 secs
)
[1] => Array
(
[time] => 2019-02-14 13:58:12
[type] => 65536
[host] => 10.10.10.10
[service] => check ntp time
[state_type] => HARD
[state] => CRITICAL
[output] => NTP CRITICAL: Offset 437.6006638 secs
)
[2] => Array
(
[time] => 2019-02-14 13:56:14
[type] => 65536
[host] => 10.10.10.10
[service] => check ntp time
[state_type] => SOFT
[state] => CRITICAL
[output] => NTP CRITICAL: Offset 437.599648 secs
)
[3] => Array
(
[time] => 2019-02-14 13:54:15
[type] => 65536
[host] => 10.10.10.10
[service] => check ntp time
[state_type] => SOFT
[state] => CRITICAL
[output] => NTP CRITICAL: Offset 437.5982324 secs
)
)
[hostgroup_members] => Array
(
[linux-servers] => Array
(
[alias] => linux-servers
[members] => Array
(
[0] => localhost
)
)
)
[servicegroup_members] => Array
(
[all_dell_openmanage_oob_servers] => Array
(
[alias] => all_dell_openmanage_oob_servers
[members] => Array
(
[0] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - PowerEdge Server
)
[1] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - PowerEdge Server
)
[2] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Operating System and Firmware Information
)
[3] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Operating System and Firmware Information
)
[4] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Model Service Tag Information
)
[5] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - PowerEdge Server
)
[6] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Operating System and Firmware Information
)
[7] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Model Service Tag Information
)
[8] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Model Service Tag Information
)
[9] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - PowerEdge Server
)
[10] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Operating System and Firmware Information
)
[11] => Array
(
[host_name] => 10.10.10.10
[service_description] => Dell OpenManage OOB - Model Service Tag Information
)
)
)
)
[dashlets_params_simple_hosts] => Array
(
[localhost] => localhost
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
[10.10.10.10] => 10.10.10.10
)
[dashlets_params_simple_hostgroups] => Array
(
[linux-servers] => linux-servers
)
[dashlets_params_simple_servicegroups] => Array
(
[all_dell_openmanage_oob_servers] => all_dell_openmanage_oob_servers
)
)
[debug_started_time] => 2019-02-14 11:59:52
[debug_completed_time] => 2019-02-14 12:04:04
)
MEMORY USED: 4,113,160 Bytes
MEMORY PEAK: 5,011,928 Bytes
From some reason I can' t see services alerts.
I cahnged the nagiosadmin password but it didn't help.
The fusion reach the remote xi (I ran xi system status using the API from Fusion and I get a result).
-
- Madmin
- Posts: 9190
- Joined: Thu Oct 30, 2014 9:02 am
Re: Fusion don't show alerts from local xi
If the poll_subsys.log file is empty then I would guess that the cron job that gathers the data may not be running.
First, lets stop all of the nagios processes by running the following as root.
Then restart cron by running
Let the system run for 10 to 15 minutes so it can gather the data and check the GUI to see if it is updated.
If not, try increasing the Polling Interval for the xi server by going to the Servers menu and editing the server settings.
Increase the Polling Interval. If you have any value in the field, double it. If it is empty, try 300 seconds.
Also, retype in the username and password and verify the Fusekey is correct. Update the settings.
Then check again to see if it updates the data in the GUI.
First, lets stop all of the nagios processes by running the following as root.
Code: Select all
pkill -u nagios
Code: Select all
service crond restart
If not, try increasing the Polling Interval for the xi server by going to the Servers menu and editing the server settings.
Increase the Polling Interval. If you have any value in the field, double it. If it is empty, try 300 seconds.
Also, retype in the username and password and verify the Fusekey is correct. Update the settings.
Then check again to see if it updates the data in the GUI.
Be sure to check out our Knowledgebase for helpful articles and solutions!
-
- Posts: 144
- Joined: Wed Mar 28, 2018 6:23 am
Re: Fusion don't show alerts from local xi
Hi,
I ran the commnds and still the same.
username, password and the Fusekey are correct (I did the test check got green light).
Server Setting:
Authentication Interval: 300
Polling Interval: 300
URL: http://X.X.X.X/nagiosxi/
Server type: Nagios xi
This server worked before on this Fusion.
Nothing was changed.
Still the poll don't show all the services.
In did debug mode and got the same as before.
My fuision has other xi servers (some in the same version 5.5.8) which are connect to it and all workign OK.
Even when I ran manually /usr/local/nagiosfusion/cron/poll_subsys.php --server 39 --user nagiosadmin I don't see any errors in logs file
I ran the commnds and still the same.
username, password and the Fusekey are correct (I did the test check got green light).
Server Setting:
Authentication Interval: 300
Polling Interval: 300
URL: http://X.X.X.X/nagiosxi/
Server type: Nagios xi
This server worked before on this Fusion.
Nothing was changed.
Still the poll don't show all the services.
In did debug mode and got the same as before.
My fuision has other xi servers (some in the same version 5.5.8) which are connect to it and all workign OK.
Even when I ran manually /usr/local/nagiosfusion/cron/poll_subsys.php --server 39 --user nagiosadmin I don't see any errors in logs file
-
- Madmin
- Posts: 9190
- Joined: Thu Oct 30, 2014 9:02 am
Re: Fusion don't show alerts from local xi
Maybe there is some bad data in the MYSQL table for the service details and it is causing it rom updating.
Lets truncate the polled data and to do that, run the following as root.
Let the system run for 10 minutes so it has time to retrieve the data and check that server to see if it has the Service data.
Lets truncate the polled data and to do that, run the following as root.
Code: Select all
cd /usr/local/nagiosfusion/scripts/
./truncate_polled.php
Be sure to check out our Knowledgebase for helpful articles and solutions!
-
- Posts: 144
- Joined: Wed Mar 28, 2018 6:23 am
Re: Fusion don't show alerts from local xi
Hi,
I did the command you wrote but still i don't see any alerts in the Fusion.
I waited more than an hour and ran the command twice.
Also when I ran curl from Fusion CLI witht the xi ip and API for getting service status I get a respond.
I did the command you wrote but still i don't see any alerts in the Fusion.
I waited more than an hour and ran the command twice.
Also when I ran curl from Fusion CLI witht the xi ip and API for getting service status I get a respond.
-
- Madmin
- Posts: 9190
- Joined: Thu Oct 30, 2014 9:02 am
Re: Fusion don't show alerts from local xi
How many Host and Services are defined on the xi server that is not showing the Service Data?
Be sure to check out our Knowledgebase for helpful articles and solutions!
-
- Posts: 144
- Joined: Wed Mar 28, 2018 6:23 am
Re: Fusion don't show alerts from local xi
34 Hosts
589 Services
589 Services
-
- Madmin
- Posts: 9190
- Joined: Thu Oct 30, 2014 9:02 am
Re: Fusion don't show alerts from local xi
Login to the Fusion GUI and go to the Admin > System Settings menu.
Change the Log Level to Trace and enable writing to the Fusion log and debug file. Update the settings.
Let the system run long enough for it to poll the xi server and check the following files for any errors / data when the xi server is being polled.
Couple of questions.
How many servers do you have fused on the server?
In the System Settings > Data & Polling tab, how many Simultaneous Pollers: do you have set?
Try setting that to the same number of fused systems and see if that helps.
Change the Log Level to Trace and enable writing to the Fusion log and debug file. Update the settings.
Let the system run long enough for it to poll the xi server and check the following files for any errors / data when the xi server is being polled.
Code: Select all
/usr/local/nagiosfusion/var/log/fusion.log
/usr/local/nagiosfusion/var/log/fusion.debug
How many servers do you have fused on the server?
In the System Settings > Data & Polling tab, how many Simultaneous Pollers: do you have set?
Try setting that to the same number of fused systems and see if that helps.
Be sure to check out our Knowledgebase for helpful articles and solutions!