Thruk Timeout when trying to display a Servicer

Hi,
when I try to open a service with a lot of performance data (interface usage of a core switch).
The “status information” is shown normally in the overview
Also the perfdate for this service seems not to make it into influx.
Similar swiches work fine

the error:
Internal Server Error

An internal error occurred. Details can be found in the Apache error log (/omd/sites/monitoring/var/log/apache/error_log)Here are the last couple of lines from that log file:

at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 725.         
Thruk::__ANON__('TERM') called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 1241         
eval {...} called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 1241         
Thruk::graceful_stop('Thruk=HASH(0xcd3d40)') called at /omd/sites/monitoring/share/thruk/lib/Thruk/Utils.pm line 2757         
Thruk::Utils::check_memory_usage('Thruk::Context=HASH(0x2887370)') called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 1097         
Thruk::_after_dispatch('Thruk::Context=HASH(0x2887370)', 'ARRAY(0x326b040)') called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 381         
Thruk::_dispatcher('[Mon Dec 09 23:04:53.347333 2019 core:error [pid 8100] [client 123.123.123.123:60754] End of script output before headers: fcgid.thruk, referer: https://123.123.123.111:60443/monitoring/thruk/cgi-bin/status.cgi?host=all
HASH(0x297ac30)') called at /omd/sites/monitoring/lib/perl5/lib/perl5/Plack/Util.pm line 145         
eval {...} called at /omd/sites/monitoring/lib/perl5/lib/perl5/Plack/Util.pm line 145         
Plack::Util::run_app('CODE(0x19c0570)', 'HASH(0x297ac30)') called at /omd/sites/monitoring/lib/perl5/lib/perl5/Plack/Handler/FCGI.pm line 147         
Plack::Handler::FCGI::run('Plack::Handler::FCGI=HASH(0xcb8c90)', 'CODE(0x19c0570)') called at /omd/sites/monitoring/share/thruk/script/thruk_fastcgi.pl line 27         
         
Mon Dec 09 23:04:58.208380 2019 fcgid:warn [pid 20925] mod_fcgid: cleanup zombie process 6193
ERROR: got signal TERM while handling request, possible timeout in https://123.123.123.111:60443/monitoring/thruk/cgi-bin/extinfo.cgi?type=2&host=SWITCH&service=Port-Auslastung&backend=c55c8         
ERROR: User:       omdadmin         
ERROR: Address:    123.123.123.123         
ERROR: Parameters: {'backend' => 'c55c8','host' => 'SWITCH','service' => 'Port-Auslastung','type' => '2'};         
ERROR: Stacktrace:         
at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 725.         
Thruk::__ANON__('TERM') called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 1241         
eval {...} called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 1241         
Thruk::graceful_stop('Thruk=HASH(0x2120d40)') called at /omd/sites/monitoring/share/thruk/lib/Thruk/Utils.pm line 2757         
Thruk::Utils::check_memory_usage('Thruk::Context=HASH(0x42b7920)') called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 1097         
Thruk::_after_dispatch('Thruk::Context=HASH(0x42b7920)', 'ARRAY(0x5855660)') called at /omd/sites/monitoring/share/thruk/lib/Thruk.pm line 381         
Thruk::_dispatche[Mon Dec 09 23:11:29.192571 2019 core:error [pid 8433] [client 123.123.123.123:33510] End of script output before headers: fcgid.thruk, referer: https://123.123.123.111:60443/monitoring/thruk/cgi-bin/status.cgi?s0_op=~&s0_type=search&add_default_service_filter=1&s0_value=ho%3ARPSW01_
r('HASH(0x45b9948)') called at /omd/sites/monitoring/lib/perl5/lib/perl5/Plack/Util.pm line 145         
eval {...} called at /omd/sites/monitoring/lib/perl5/lib/perl5/Plack/Util.pm line 145         
Plack::Util::run_app('CODE(0x2e0d5d0)', 'HASH(0x45b9948)') called at /omd/sites/monitoring/lib/perl5/lib/perl5/Plack/Handler/FCGI.pm line 147         
Plack::Handler::FCGI::run('Plack::Handler::FCGI=HASH(0x2105c90)', 'CODE(0x2e0d5d0)') called at /omd/sites/monitoring/share/thruk/script/thruk_fastcgi.pl line 27         

the command
/omd/sites/monitoring/lib/monitoring-plugins/check_nwc_health --hostname 123.123.123.123 --mode interface-usage --community STRING -t 120

OMD Version: 3.20
OS Version: CentOS Linux release 7.7.1908 (Core)

sample output
OK - interface gigabitethernet1/1/1 (alias gi1/1/1) usage is in:0.00% (0.00bit/s) out:0.00% (0.00bit/s) (down), interface gigabitethernet1/1/2 (alias gi1/1/2) usage is in:0.00% (0.00bit/s) out:0.00% (0.00bit/s) (down), interface gigabitethernet1/1/3 (alias gi1/1/3) usage is in:0.00% (0.00bit/s) out:0.00% (0.00bit/s) (down), interface gigabitethernet1/1/4 (alias gi1/1/4) usage is in:0.00% (0.00bit/s) out:0.00% (0.00bit/s) (down), ... 1 has no traffic | 'gigabitethernet1/1/1_usage_in'=0%;80;90;0;100 'gigabitethernet1/1/1_usage_out'=0%;80;90;0;100 'gigabitethernet1/1/1_traffic_in'=0;0;0;0;0 'gigabitethernet1/1/1_traffic_out'=0;0;0;0;0 ... 'Po32_traffic_out'=0;0;0;0;0

Any hints?

BR
Oliver

I don’t think this error is related to the check somehow. From the stacktrace we can see, that the error occurs during the post-request processing.
I improved that a bit, so it will hopefully run better with the next release.

thank you - looking forward to testing the new release :slight_smile: