Deployment stops/waits very long with: Reattempting to update database ....
-
@kratkale Interesting… On the fog server in /opt/fog/logs is there anything helpful in the multicast log? Or in /var/log/php-fpm logs? There has to be something unique going on there.
-
ls -trl
…
-rw-r–r-- 1 root root 81348 Aug 31 20:52 multicast.log.udpcast.68
-rw-r–r-- 1 root root 1950 Aug 31 23:42 fogimagesize.log
-rw-r–r-- 1 root root 2306 Sep 1 00:12 fogscheduler.log
-rw-r–r-- 1 root root 3175 Sep 1 00:12 pinghost.log
-rw-r–r-- 1 root root 5501 Sep 1 00:12 fogsnapinhash.log
-rw-r–r-- 1 root root 8243 Sep 1 00:12 fogreplicator.log
-rw-r–r-- 1 root root 150997 Sep 1 00:12 groupmanager.log
-rw-r–r-- 1 root root 12194 Sep 1 00:12 fogsnapinrep.log
-rw-r–r-- 1 root root 8733 Sep 1 00:13 multicast.log
-rw-r–r-- 1 root root 49638 Sep 1 00:13 servicemaster.logthe multicast log ends with:
[08-31-18 10:12:45 pm] * No tasks found!
[08-31-18 10:12:55 pm] * No tasks found!
[08-31-18 10:13:05 pm] * No tasks found!
[08-31-18 10:13:15 pm] * No tasks found!
[08-31-18 10:13:25 pm] * No tasks found!
[08-31-18 10:13:35 pm] * No tasks found!php7.0-fpm.log:
[31-Aug-2018 19:03:49] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 20:35:25] NOTICE: Terminating …
[31-Aug-2018 20:35:26] NOTICE: exiting, bye-bye!
[31-Aug-2018 20:36:10] NOTICE: fpm is running, pid 449
[31-Aug-2018 20:36:10] NOTICE: ready to handle connections
[31-Aug-2018 20:36:10] NOTICE: systemd monitor interval set to 10000ms
[31-Aug-2018 20:36:15] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 20:36:30] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 20:37:21] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 20:52:24] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 21:10:46] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 23:34:18] NOTICE: Terminating …
[31-Aug-2018 23:34:18] NOTICE: exiting, bye-bye!
[31-Aug-2018 23:35:10] ALERT: [pool www] pm.max_spare_servers(3) must not be less than pm.min_spare_servers(5)
[31-Aug-2018 23:35:10] ERROR: failed to post process the configuration
[31-Aug-2018 23:35:10] ERROR: FPM initialization failed
[31-Aug-2018 23:42:30] ALERT: [pool www] pm.max_spare_servers(3) must not be less than pm.min_spare_servers(5)
[31-Aug-2018 23:42:30] ERROR: failed to post process the configuration
[31-Aug-2018 23:42:30] ERROR: FPM initialization failed
[01-Sep-2018 00:15:48] NOTICE: fpm is running, pid 7989
[01-Sep-2018 00:15:48] NOTICE: ready to handle connections
[01-Sep-2018 00:15:48] NOTICE: systemd monitor interval set to 10000ms
[01-Sep-2018 00:16:45] WARNING: [pool www] seems busy (you may need to increase pm.start_servers, or pm.min/max_spare_servers), spawning 8 children, there are 1 idle, and 14 total children -
@kratkale said in Deployment stops/waits very long with: Reattempting to update database ....:
-rw-r–r-- 1 root root 81348 Aug 31 20:52 multicast.log.udpcast.68
This is probably the log you want to review.
[31-Aug-2018 20:36:15] WARNING: [pool www] server reached pm.max_children setting (5), consider raising it
[31-Aug-2018 23:42:30] ALERT: [pool www] pm.max_spare_servers(3) must not be less than pm.min_spare_servers(5)
[01-Sep-2018 00:16:45] WARNING: [pool www] seems busy (you may need to increase pm.start_servers, or pm.min/max_spare_servers), spawning 8 children, there are 1 idle, and 14 total childrenYou need to double check your settings in the php-fpm file. If these settings are not right and php-fpm does not start correctly, you won’t have a web gui.
-
@george1421
Good Morning Georg,
I’ve a working web GUI.
FOG was deploying 30 PC over night but without multicast. The image size is about 50GB.Sorry I don’t know what to change.
/home/kratkale# cat /etc/php/7.0/fpm/pool.d/www.conf | grep -v ‘;’| grep -v ^$[www]
user = www-data
group = www-data
listen = 127.0.0.1:9000
listen.owner = www-data
listen.group = www-data
pm = dynamic
pm.max_children = 50
pm.start_servers = 5
pm.min_spare_servers = 5
pm.max_spare_servers = 10
pm.max_requests = 2000
php_admin_value[memory_limit] = 256MSo what to do? Multicast stopped at the beginning of partclone;
Thank you Alexander
-
@george1421
another interesting point is:
I cannot deploy an image direct on the client Computer:
If I start the PC I get the normal FOG Welcome screen.
If I select Deploy Image, I’m asked for the password. But it does not Deploy the image; immediately I get the FOG Welcome screen again. Another attempt: The User and the password is still on the FOG Welcome screen???Another attempt with a different user: same behavior.
I can deploy the image from the Web Interface …
-
@kratkale This is really strange because the errors don’t align with what you have in your config file.
The logs clearly state:
[31-Aug-2018 20:37:21] WARNING: [pool www] server reached pm.max_children setting (5), consider raising itBut your config file has
pm.max_children = 50Are you SURE you don’t have 2 www.conf files? Or another .conf file in /etc/php/7.0/fpm/pool.d
If you stop php-fpm, nuke php7.0-fpm.log and then restart php-fpm are those error still present?
I don’t think the php-fpm errors are related to multicast stopping at the beginning of partclone. Is there anything on the screen of the target computers that indicate why its stopped? I clear screen shot of the error taken with a mobile phone might give us a clue to what is going on.
Also have you reviewed the multicast logs multicast.log and multicast.log.udpcast to see of there is any useful info in there?
And for some background information.
- The target computers you are deploying to, deploy OK with unicast, but fail with multicast?
- How many systems are you trying to multicast at one time?
- Are alll of the computers on the same subnet?
-
@kratkale said in Deployment stops/waits very long with: Reattempting to update database ....:
If I select Deploy Image, I’m asked for the password. But it does not Deploy the image; immediately I get the FOG Welcome screen again. Another attempt: The User and the password is still on the FOG Welcome screen???
For clarity what hardware are you trying to deploy to? (manufacturer and model). I can’t remember the normal sequence off the top of my head, in that when you create a direct deployment if it deploys immediately or automatically schedules a deployment, reboots, then deploys. I can see logic to both.
-
@george1421
Hi,
Are you SURE you don’t have 2 www.conf files?
Yes I think so:
-rw-r–r-- 1 root root 19K Sep 1 00:15 www.confI think it is ok! I changed the www.conf on the 1 Sep - the log entry is on 31-Aug-2018 at 20:37:21
So the log file informs about the old config?!
Also have you reviewed the multicast logs
Can I upload the log files? Yes there are a lot of informations - meaningless to me.
And for some background information.
1) The target computers deploy OK with unicast. The stop deploying with multicast at the first screen of partclone. It looks to me they are waiting that all computers get ready. But they never do … Finally all the PC’s show the same picture. I can do a picture when my deployment is finished.
I was able to deploy multicast with FOG 1.4.4 and 1.5.4. The Version 1.5.4 had the trouble with “Update Database”. The Deployment istself worked fine. On all PC’s the image was completely. Some of them didn’t join my domain. But all of them where running the windows 10 image2) I have one class with 25 PCs, the second with 36 PCs. There are three (a, b, c) D-Link Switches DGS1210
a) the FOG Server, the Green Interface from IP Fire, 25 PCs and the connection to the other two switches
b) 18 Clients of the 36 PCs
c) 18 Clients of the 36 PCs3) Yes all Computer are in one subnet
192.168.0.0\24
The Client PCs get their IP via DHCP form an IPFire. The IP adresses are fixed.All of them are nearly identical hardware
Lenovo I5 with SSD and 8GB RAM delivered 201808
The only difference is the size of the second hard disc - I think irrelevant to FOG. -
@george1421
I have another Lab with 24 PC’s. Also Lenove M910. I don t need these PC on Monday. These PC’s are in a different Sub net 192.168.1.0\24 and have their own FOG Server (running Debian 9.5). There is the Version 1.5.4 installed.
I can test there what ever you want … -
@george1421
Hi,
In the lab I have updated like in my main classroom. Multicast is working with 24 clients in the lab.
I’ll do another attempt in my main class room with 56 PCs. -
Ok - not very important but maybe it helps for the development. I’ve 2 old HP8000. I have done a Win10 1803 image on one PC and was unable to clone it with the FOG server.
Symptom: Windows stops with- Automatische Reperatur wird vorbereitet (automatic repair)
- Diagnose des PCs wird ausgeführt (diagnostic)
- Automatische Reperatur konnte Ihren PC nicht reparieren. (unable to repair)
So the cloned image is not working.
Another Test with clonzilla 2.5.6-22-amd64 works without any problem -
@george1421
so I’ve done now a new test with 62 identical computers .
This time the computers are cloned with multicast - but now again the problem with the update of the database. 50GB was done in 15 minutes - thats really great …
waiting for update of the database …But now all computers joined my domain.
-
@kratkale said in Deployment stops/waits very long with: Reattempting to update database ....:
waiting for update of the database …
Sorry I’ve been away for a bit. So the imaging process stops at the above message and never completes? Or is it working as you need? If the system is hanging at the update of the database, is there any error message thrown in either the apache error log or php-fpm error logs? The system should not just stop.
-
@george1421
no the imaging process is finished. It just takes a long time to update the database. Really long compared to the deployment of the image.If I reduce the amount of PC’s it is working fine. 25 is a good amount. I think it is a php Problem?
What is not working:
-) Printers: I want to run two TCP IP Printer on Port 9100. One is a HP LaserJet - the second one is a A0 HP 510 Plotter. In both cases the Network connection is named as 9100. It is the port name. If you do the installation by hand Windows names it with the IP adress. Which is of course different. As the connections have the same name I think the second one replaces the first one …-) Changing the printer setup on FOG changes sometimes nothing. (e.g. the name of the inf file - the printer …) My solution was to deploy the computer again …
-) Group management with Printers: If I update a group with the option “only Assigned Printers” the clients are not updated. The are still in the mode “No Printer Management”
-) If you are sitting direct on the host computer you can not start an individual Deployment. This is a pitty - because everyone cannot “repair” his computer with deployment … I loved this option. Meaning less work to me.
Sorry - I have the feeling my post is very senseless. …
There is actual a question: can I implement a pdf printer when I use “only assigned printers”. To use this option is very important to me because my students print several the same job if it is not printed on the desired printer. So I want to assign only the printer at the room where the pc is. But in this case I’m missing a pdf printer.
-
@kratkale Read through this topic, possibly you have many client’s talking to the FOG server (and the DB for that reason) which might cap the connections.
About the printers, I might ask you to open new topics so we don’t loose track of different issues in one thread.