FOG Server CPU usage 100%
-
@wayne-workman said:
Actually this line is using a ton of CPU:
131.09 87 0 0.0 1.69 1.69 158.227.4.135 10.0.15.4:80 POST /fog/status/getservertime.php HTTP/1.1
131 is a LOT.
Yeah that looks like there is something strange going on with this script. Though I have no idea what could be causing this. On the other hand the PHP code in that file looks kind of like it might take more time than usual. I guess only @Tom-Elliott can tell us what’s those calls to
ignore_user_abort()
andset_time_limit()
are used for. -
@Tom-Elliott about the tasks table:
MariaDB [fog]> show table status from fog where Name = 'tasks'; +-------+--------+---------+------------+------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+---------------------+---------------------+-----------------+----------+--------------------+---------+ | Name | Engine | Version | Row_format | Rows | Avg_row_length | Data_length | Max_data_length | Index_length | Data_free | Auto_increment | Create_time | Update_time | Check_time | Collation | Checksum | Create_options | Comment | +-------+--------+---------+------------+------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+---------------------+---------------------+-----------------+----------+--------------------+---------+ | tasks | MyISAM | 10 | Dynamic | 5217 | 125 | 653940 | 281474976710655 | 602112 | 0 | 5225 | 2017-06-14 18:23:02 | 2017-11-15 13:10:31 | 2017-09-07 15:31:22 | utf8_general_ci | NULL | row_format=DYNAMIC | | +-------+--------+---------+------------+------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+---------------------+---------------------+-----------------+----------+--------------------+---------+
-
@fernando-gietz Truncate it.
-
@Wayne-Workman I have truncated the table and nothing.
Srv PID Acc M CPU SS Req Conn Child Slot Client VHost Request 0-0 4154 0/133/2998 _ 15.91 0 175 0.0 0.18 3.69 158.227.129.66 10.0.15.4:80 POST /fog/management/index.php?sub=requestClientInfo&authorize& 1-0 22327 0/633/2833 _ 101.18 0 88 0.0 0.70 3.34 158.227.129.66 10.0.15.4:80 GET /fog/service/getversion.php?newService&json HTTP/1.1 2-0 10758 0/3575/3575 _ 583.44 0 0 0.0 4.46 4.46 158.227.129.66 10.0.15.4:80 GET /fog/management/other/ssl/srvpublic.crt HTTP/1.1 3-0 11712 0/1096/3380 _ 174.13 0 210 0.0 1.27 4.11 158.227.115.42 10.0.15.4:80 GET /fog/management/index.php?node=home&sub=bandwidth&url%5B%5D 4-0 17308 0/3259/3538 W 531.14 58 0 0.0 3.79 4.26 158.227.4.135 10.0.15.4:80 POST /fog/status/getservertime.php HTTP/1.1 5-0 22817 0/605/3241 _ 95.71 1 73 0.0 0.68 5.64 158.227.138.17 10.0.15.4:80 GET /fog/service/getversion.php?newService&json HTTP/1.1 6-0 13038 0/1037/3469 _ 166.05 0 74 0.0 1.20 4.01 158.227.129.66 10.0.15.4:80 GET /fog/service/getversion.php?clientver&newService&json HTTP/ 7-0 - 0/0/2866 . 2.96 167 0 0.0 0.00 3.64 ::1 10.0.15.4:80 OPTIONS * HTTP/1.0 8-0 11826 0/997/3504 W 165.00 115 0 0.0 1.13 4.20 158.227.4.135 10.0.15.4:80 GET /fog/management/index.php?node=group&sub=membership&id=334 9-0 3151 0/174/3003 _ 25.77 0 77 0.0 0.21 3.42 10.0.15.4 10.0.15.4:80 GET /fog/status/bandwidth.php?dev=ens192 HTTP/1.1 10-0 15893 0/3295/3295 W 551.38 0 0 0.0 3.74 3.74 158.227.4.135 10.0.15.4:80 GET /server-status HTTP/1.1```
-
@sebastian-roth The idea of those is specific to ajax calls, though I guess the ignore_user_abort could be done with. The set_time_limit basically tells the script that it can run indefinitely. Ignore_user_abort tells the script to keep processing the script even if the client (browser) has requested to abort/stop/cancel the script.
-
To don’t lose the thread.
Hi again!
I follow with this issue Toisolate the problem I have do a fresh install:
Enviroment:
FOG version: 1.5.0
Computer: HP 800 G2 with 8 GB RAM and SSD
OS: Centos 7 64 bitsI have imported 7000 new hosts to the database from webUI, later I have created a new group with one computer. If I try to list the membership of this new group, the browser needs 40 second to show me the computer.
If anybody wants to reply the issu, I can send the hosts import file.
-
@Tom-Elliott sounds like the group membership is doing a lot of unnecessary calculations.
-
@joe-schmitt somewhat yes, somewhat no. For membership it should just gather the names, but when the object loads it first has to get the ids so it knows which hostnames it needs to grab. This should happen fairly fast, but processing to display 7k items does take a bit.
-
@tom-elliott I can understand that the proccess takes a bit 10 seconds or 15 seconds, but in my producction environment takes 2 minutes and in the physical server 40 seconds.
Repeat, I can send you the import hosts file to do test and see in-situ the problem.
-
- thread note:
@Fernando-Gietz I have replicated the issue and am looking at it now.
-
@fernando-gietz If I can break some time free I’d like to look at it from a SQL database point of view. We may be missing an index that is causing such a bad performance.
-
Mind installing the
working-1.5.1
branch please?Significantly improved load for membership and taskings.
-
For benchmark/testing.
Using Joe’s server as the testbed, it was taking about 42 seconds to load the group membership page and around 30 seconds for the tasking page.
It now takes 3 seconds to load, and 5 (total) to render on membership. Tasking is relatively instant.
-
@tom-elliott Oh la la!! works very verry fine