@Sebastian-Roth Sorry for the delay. I didn’t see your response until yesterday and didn’t get a chance to try until today. So, I did a git pull but the replication branch seems to have vanished! When I checked GitHub it is not present there either. Did you already merge it with the working branch? I am still willing to test it if you have a branch you would like me to use.
Posts made by jflippen
-
RE: Replication problems 1.5.4 - always copying
-
Printer Management Level for a group not working
I am on the working branch for our school district’s FOG servers. When I assign printers to a group, the printer and default printer settings apply correctly, but the management level does not. Previously it would save as “only assigned printers” no matter which option I clicked and saved. When I tested just now, it would not change the setting for the group of computers at all, so I have to set the management level manually on each station in the group, which is a huge pain for large labs.
-
RE: Creating group, members add themselves.
@falko said in Creating group, members add themselves.:
When I have experienced the issue in the past I have always found that the phantom hosts added to new groups were only the most recent hosts that were registered. Whether that be 2 hosts or 5 phantoms
Running the SQL maintenance commands from the wiki (as mentioned I think updates also helped) seemed to resolve
I was also having this issue, finding that when my co-workers created a group from the hosts page with even just 1 selected computer, it would add over 200 computers to the new group! I was able to avoid this issue by going to groups, creating a new group there, then adding members from the groups screen.
A couple of weeks ago I came across the SQL maintenance commands, and they fixed the issue for me! I can now create groups again from the hosts screen (at least until it breaks again). For those wondering about the commands, they can be found here: https://wiki.fogproject.org/wiki/index.php?title=Troubleshoot_MySQL#Database_Maintenance_Commands
-
RE: Replication problems 1.5.4 - always copying
This sounds strange. Cannot think of why it shouldn’t properly upgrade from working-1.6 to replication branch. On the other hand I have not tested myself yet. So it’s not impossible to cause an issue. Do you think it’s worth looking into? How would I replicate the issue?
Maybe, but I know it is generally considered best practice to back up the database and start from scratch when upgrading the FOG server (at least I think I remember reading that somewhere on the wiki or forums). Every time I tried “uninstalling” FOG, it was never a clean uninstall. I generally haven’t had issues with doing an in-place update or switching branches, but as I said, when I tried to go from working 1.6 to the replication branch it kept the new web UI and the log was still written the old way. That makes me believe that either the install skipped files that needed to be replaced or something else failed.
I do have a few other bugs I noticed, but I’ll have to make a new thread for those. I see that there is a bug section in the forum, but wasn’t sure if there was a page with a list of current known bugs or if I just need to search the forum for the bugs I have been experiencing. I don’t want to start a new thread if I don’t need to!
-
RE: Replication problems 1.5.4 - always copying
@Sebastian-Roth Okay, so I managed to test out the new branch at home on my VMs (like I mentioned before, I am not going to touch our primary FOG server right now since we have a workaround). Here is what I found:
- The first time around I tried just doing an in-place upgrade of the working 1.6 branch. This, however, did not seem to work properly: nothing changed after I swapped branches and installed on all three VMs (stopping the service and rebooting on each VM, going from nodes to master).
- The second time I did a fresh install of CentOS 7 on each VM, followed the wiki guide, and went directly to the replication branch. I then created a fake image by creating a folder and putting some random files in it (I don’t have PXE boot or a server environment at home). When I checked the log and did some testing, it looked like it’s working! Previously, FOG would not delete mismatched files, meaning the files would never get updated on the node, but it would keep trying to run an lftp sync. At times it would even say files didn’t match when they had the same md5sum. This seems to have been fixed in your new branch! I have included my log below.
- During testing I noticed that the replication service won’t remove files that are on the node but not on the master (I found this by changing the files I was testing with on the master). Would it be possible to implement an existence check that compares the list of file names in an image folder on the node against the master? This would also be handy for keeping the driver injection folder we use in the images folder synced properly when removing deprecated models.
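The existence check suggested above could look roughly like the following. This is only a minimal Python sketch, not FOG's PHP replication code: it assumes both folders are reachable as local paths (the real service talks to the nodes over FTP), and the function name `stale_files` is made up for illustration.

```python
from pathlib import Path
from typing import List


def stale_files(master_dir: Path, node_dir: Path) -> List[str]:
    """Return file names that exist in node_dir but not in master_dir.

    Hypothetical helper: both directories are assumed to be locally
    accessible, which is a simplification of the real master/node setup.
    """
    master_names = {p.name for p in master_dir.iterdir() if p.is_file()}
    node_names = {p.name for p in node_dir.iterdir() if p.is_file()}
    # Anything left over on the node has no counterpart on the master
    # and would be a candidate for removal.
    return sorted(node_names - master_names)
```

A caller would then decide what to do with the returned names (for example, log them first and only delete once the list looks right).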
The Log, as promised
[11-07-18 6:34:23 am] ================================== === ==== ===== ==== === ========= == === == === === ======== ==== == ==== === === ======== ==== == ========= === ==== ==== == ========= === ======== ==== == === === === ======== ==== == ==== === === ========= == === == === === ========== ===== ==== ================================== ===== Free Opensource Ghost ====== ================================== ============ Credits ============= = https://fogproject.org/Credits = ================================== == Released under GPL Version 3 == ================================== [11-07-18 6:34:23 am] Interface Ready with IP Address: 127.0.0.1 [11-07-18 6:34:23 am] Interface Ready with IP Address: 127.0.1.1 [11-07-18 6:34:23 am] Interface Ready with IP Address: 192.168.1.16 [11-07-18 6:34:23 am] Interface Ready with IP Address: FOG-MASTER [11-07-18 6:34:23 am] FOGService: ImageReplicator - Waiting for mysql to be available [11-07-18 6:34:35 am] * Starting ImageReplicator Service [11-07-18 6:34:35 am] * Checking for new items every 600 seconds [11-07-18 6:34:35 am] * Starting service loop [11-07-18 6:34:36 am] * Starting Image Replication. [11-07-18 6:34:36 am] * We are group ID: 1. We are group name: default [11-07-18 6:34:36 am] * We are node ID: 1. We are node name: DefaultMember [11-07-18 6:34:36 am] * Attempting to perform Group -> Group image replication. [11-07-18 6:34:36 am] | Replicating postdownloadscripts [11-07-18 6:34:38 am] * Found Image to transfer to 2 nodes [11-07-18 6:34:38 am] | File Name: postdownloadscripts [11-07-18 6:34:38 am] | NODE-1 server does not appear to be online. [11-07-18 6:34:38 am] | NODE-2 server does not appear to be online. [11-07-18 6:34:38 am] | Replicating postinitscripts [11-07-18 6:34:40 am] * Found Image to transfer to 2 nodes [11-07-18 6:34:40 am] | File Name: dev/postinitscripts [11-07-18 6:34:40 am] | NODE-1 server does not appear to be online. [11-07-18 6:34:40 am] | NODE-2 server does not appear to be online. 
[11-07-18 6:34:40 am] * Not syncing Image between groups [11-07-18 6:34:40 am] | Image Name: test [11-07-18 6:34:40 am] | There are no other members to sync to. [11-07-18 6:34:40 am] * Attempting to perform Group -> Nodes image replication. [11-07-18 6:34:42 am] * Found Image to transfer to 2 nodes [11-07-18 6:34:42 am] | Image Name: test [11-07-18 6:34:42 am] | NODE-1 server does not appear to be online. [11-07-18 6:34:42 am] | NODE-2 server does not appear to be online. [11-07-18 6:44:42 am] * Starting Image Replication. [11-07-18 6:44:42 am] * We are group ID: 1. We are group name: default [11-07-18 6:44:42 am] * We are node ID: 1. We are node name: DefaultMember [11-07-18 6:44:42 am] * Attempting to perform Group -> Group image replication. [11-07-18 6:44:42 am] | Replicating postdownloadscripts [11-07-18 6:44:42 am] * Found Image to transfer to 2 nodes [11-07-18 6:44:42 am] | File Name: postdownloadscripts [11-07-18 6:44:43 am] # postdownloadscripts: No need to sync fog.postdownload (NODE-1) [11-07-18 6:44:43 am] * All files synced for this item. [11-07-18 6:44:43 am] # postdownloadscripts: No need to sync fog.postdownload (NODE-2) [11-07-18 6:44:43 am] * All files synced for this item. [11-07-18 6:44:43 am] | Replicating postinitscripts [11-07-18 6:44:44 am] * Found Image to transfer to 2 nodes [11-07-18 6:44:44 am] | File Name: dev/postinitscripts [11-07-18 6:44:44 am] # dev/postinitscripts: No need to sync fog.postinit (NODE-1) [11-07-18 6:44:44 am] * All files synced for this item. [11-07-18 6:44:45 am] # dev/postinitscripts: No need to sync fog.postinit (NODE-2) [11-07-18 6:44:45 am] * All files synced for this item. [11-07-18 6:44:45 am] * Not syncing Image between groups [11-07-18 6:44:45 am] | Image Name: test [11-07-18 6:44:45 am] | There are no other members to sync to. [11-07-18 6:44:45 am] * Attempting to perform Group -> Nodes image replication. 
[11-07-18 6:44:46 am] * Found Image to transfer to 2 nodes [11-07-18 6:44:46 am] | Image Name: test [11-07-18 6:44:46 am] # test: File does not exist 2014-10-31 11.52.32.jpg (NODE-1) [11-07-18 6:44:46 am] # test: File size mismatch - CentOS-7-x86_64-DVD-1804.iso: 4470079488 != 2903506944 [11-07-18 6:44:46 am] # test: Deleting remote file CentOS-7-x86_64-DVD-1804.iso [11-07-18 6:44:46 am] # test: File does not exist Fedora-Server-dvd-x86_64-28-1.1.iso (NODE-1) [11-07-18 6:44:46 am] # test: File does not exist FileZilla_3.38.1_win64-setup.exe (NODE-1) [11-07-18 6:44:46 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator.test.transfer.NODE-1.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/test" "/images/test"; exit' -u fog,[Protected] 192.168.1.17 [11-07-18 6:44:46 am] | Started sync for Image test - Resource id #714 [11-07-18 6:44:46 am] # test: File does not exist 2014-10-31 11.52.32.jpg (NODE-2) [11-07-18 6:44:46 am] # test: File size mismatch - CentOS-7-x86_64-DVD-1804.iso: 4470079488 != 2903506944 [11-07-18 6:44:46 am] # test: Deleting remote file CentOS-7-x86_64-DVD-1804.iso [11-07-18 6:44:46 am] # test: File does not exist Fedora-Server-dvd-x86_64-28-1.1.iso (NODE-2) [11-07-18 6:44:46 am] # test: File does not exist FileZilla_3.38.1_win64-setup.exe (NODE-2) [11-07-18 6:44:46 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator.test.transfer.NODE-2.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/test" "/images/test"; exit' -u fog,[Protected] 192.168.1.18 [11-07-18 6:44:46 am] | Started sync for Image test - Resource id #729 [11-07-18 6:48:10 am] | Sync finished - Resource id #714 [11-07-18 6:48:11 am] | Sync finished - Resource id #729 [11-07-18 6:54:42 am] * Starting Image Replication. 
[11-07-18 6:54:42 am] * We are group ID: 1. We are group name: default [11-07-18 6:54:42 am] * We are node ID: 1. We are node name: DefaultMember [11-07-18 6:54:42 am] * Attempting to perform Group -> Group image replication. [11-07-18 6:54:42 am] | Replicating postdownloadscripts [11-07-18 6:54:43 am] * Found Image to transfer to 2 nodes [11-07-18 6:54:43 am] | File Name: postdownloadscripts [11-07-18 6:54:43 am] # postdownloadscripts: File does not exist fog.custominstall (NODE-1) [11-07-18 6:54:43 am] # postdownloadscripts: No need to sync fog.postdownload (NODE-1) [11-07-18 6:54:43 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator..transfer.NODE-1.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/postdownloadscripts" "/images/postdownloadscripts"; exit' -u fog,[Protected] 192.168.1.17 [11-07-18 6:54:43 am] | Started sync for Image postdownloadscripts - Resource id #781 [11-07-18 6:54:44 am] # postdownloadscripts: File does not exist fog.custominstall (NODE-2) [11-07-18 6:54:44 am] # postdownloadscripts: No need to sync fog.postdownload (NODE-2) [11-07-18 6:54:44 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator..transfer.NODE-2.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/postdownloadscripts" "/images/postdownloadscripts"; exit' -u fog,[Protected] 192.168.1.18 [11-07-18 6:54:44 am] | Started sync for Image postdownloadscripts - Resource id #794 [11-07-18 6:54:44 am] | Replicating postinitscripts [11-07-18 6:54:45 am] * Found Image to transfer to 2 nodes [11-07-18 6:54:45 am] | File Name: dev/postinitscripts [11-07-18 6:54:45 am] # dev/postinitscripts: No need to sync fog.postinit (NODE-1) [11-07-18 6:54:45 am] * All files synced for this item. 
[11-07-18 6:54:45 am] # dev/postinitscripts: No need to sync fog.postinit (NODE-2) [11-07-18 6:54:45 am] * All files synced for this item. [11-07-18 6:54:46 am] * Not syncing Image between groups [11-07-18 6:54:46 am] | Image Name: test [11-07-18 6:54:46 am] | There are no other members to sync to. [11-07-18 6:54:46 am] * Attempting to perform Group -> Nodes image replication. [11-07-18 6:54:46 am] * Found Image to transfer to 2 nodes [11-07-18 6:54:46 am] | Image Name: test [11-07-18 6:54:47 am] # test: No need to sync 2014-10-31 11.52.32.jpg (NODE-1) [11-07-18 6:54:47 am] # test: No need to sync CentOS-7-x86_64-DVD-1804.iso (NODE-1) [11-07-18 6:54:48 am] # test: No need to sync Fedora-Server-dvd-x86_64-28-1.1.iso (NODE-1) [11-07-18 6:54:48 am] # test: No need to sync FileZilla_3.38.1_win64-setup.exe (NODE-1) [11-07-18 6:54:48 am] * All files synced for this item. [11-07-18 6:54:49 am] # test: No need to sync 2014-10-31 11.52.32.jpg (NODE-2) [11-07-18 6:54:49 am] # test: No need to sync CentOS-7-x86_64-DVD-1804.iso (NODE-2) [11-07-18 6:54:49 am] # test: No need to sync Fedora-Server-dvd-x86_64-28-1.1.iso (NODE-2) [11-07-18 6:54:50 am] # test: No need to sync FileZilla_3.38.1_win64-setup.exe (NODE-2) [11-07-18 6:54:50 am] * All files synced for this item. [11-07-18 6:54:50 am] | Sync finished - Resource id #781 [11-07-18 6:54:50 am] | Sync finished - Resource id #794 [11-07-18 7:04:42 am] * Starting Image Replication. [11-07-18 7:04:42 am] * We are group ID: 1. We are group name: default [11-07-18 7:04:42 am] * We are node ID: 1. We are node name: DefaultMember [11-07-18 7:04:42 am] * Attempting to perform Group -> Group image replication. 
[11-07-18 7:04:42 am] | Replicating postdownloadscripts [11-07-18 7:04:42 am] * Found Image to transfer to 2 nodes [11-07-18 7:04:42 am] | File Name: postdownloadscripts [11-07-18 7:04:42 am] # postdownloadscripts: File does not exist fog.copydrivers (NODE-1) [11-07-18 7:04:42 am] # postdownloadscripts: File size mismatch - fog.custominstall: 1673 != 235 [11-07-18 7:04:42 am] # postdownloadscripts: Deleting remote file fog.custominstall [11-07-18 7:04:43 am] # postdownloadscripts: File size mismatch - fog.postdownload: 271 != 235 [11-07-18 7:04:43 am] # postdownloadscripts: Deleting remote file fog.postdownload [11-07-18 7:04:43 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator..transfer.NODE-1.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/postdownloadscripts" "/images/postdownloadscripts"; exit' -u fog,[Protected] 192.168.1.17 [11-07-18 7:04:43 am] | Started sync for Image postdownloadscripts - Resource id #975 [11-07-18 7:04:43 am] # postdownloadscripts: File does not exist fog.copydrivers (NODE-2) [11-07-18 7:04:43 am] # postdownloadscripts: File size mismatch - fog.custominstall: 1673 != 235 [11-07-18 7:04:43 am] # postdownloadscripts: Deleting remote file fog.custominstall [11-07-18 7:04:43 am] # postdownloadscripts: File size mismatch - fog.postdownload: 271 != 235 [11-07-18 7:04:43 am] # postdownloadscripts: Deleting remote file fog.postdownload [11-07-18 7:04:43 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator..transfer.NODE-2.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/postdownloadscripts" "/images/postdownloadscripts"; exit' -u fog,[Protected] 192.168.1.18 [11-07-18 7:04:43 am] | Started sync for Image postdownloadscripts - Resource id #993 [11-07-18 7:04:43 am] | Replicating 
postinitscripts [11-07-18 7:04:44 am] * Found Image to transfer to 2 nodes [11-07-18 7:04:44 am] | File Name: dev/postinitscripts [11-07-18 7:04:44 am] # dev/postinitscripts: No need to sync fog.postinit (NODE-1) [11-07-18 7:04:44 am] * All files synced for this item. [11-07-18 7:04:44 am] # dev/postinitscripts: No need to sync fog.postinit (NODE-2) [11-07-18 7:04:44 am] * All files synced for this item. [11-07-18 7:04:45 am] * Not syncing Image between groups [11-07-18 7:04:45 am] | Image Name: test [11-07-18 7:04:45 am] | There are no other members to sync to. [11-07-18 7:04:45 am] * Attempting to perform Group -> Nodes image replication. [11-07-18 7:04:45 am] * Found Image to transfer to 2 nodes [11-07-18 7:04:45 am] | Image Name: test [11-07-18 7:04:46 am] # test: No need to sync 2014-10-31 11.52.32.jpg (NODE-1) [11-07-18 7:04:46 am] # test: No need to sync CentOS-7-x86_64-DVD-1804.iso (NODE-1) [11-07-18 7:04:46 am] # test: No need to sync Fedora-Server-dvd-x86_64-28-1.1.iso (NODE-1) [11-07-18 7:04:47 am] # test: No need to sync FileZilla_3.38.1_win64-setup.exe (NODE-1) [11-07-18 7:04:47 am] * All files synced for this item. [11-07-18 7:04:47 am] # test: No need to sync 2014-10-31 11.52.32.jpg (NODE-2) [11-07-18 7:04:47 am] # test: No need to sync CentOS-7-x86_64-DVD-1804.iso (NODE-2) [11-07-18 7:04:48 am] # test: No need to sync Fedora-Server-dvd-x86_64-28-1.1.iso (NODE-2) [11-07-18 7:04:48 am] # test: No need to sync FileZilla_3.38.1_win64-setup.exe (NODE-2) [11-07-18 7:04:48 am] * All files synced for this item. [11-07-18 7:04:48 am] | Sync finished - Resource id #975 [11-07-18 7:04:48 am] | Sync finished - Resource id #993
-
-
RE: Replication problems 1.5.4 - always copying
@Sebastian-Roth Definitely! Though I will probably test on some VMs first, because our actual FOG server is currently running scripts to back everything up to our nodes instead; we were having issues with both replication services. Will the test code be on the -working channel or a different one? I’ll try to provide feedback when I can. There are other bugs I need to report as well, but I haven’t had time to document them.
-
RE: FOG 1.5.4 Replication Issue
@Raymond-Bell @Tom-Elliott You had me update to the working-1.6 branch, and it fixed the issue in that replication now goes on to the next node even if it reports a non-matching file on the first node. The issue of it constantly thinking image files (usually the largest .img) don’t match has still not been fixed. I have been too busy with other issues in our school district to continue testing at the moment, so I have been manually updating images with rsync whenever we upload a new version, since the service doesn’t seem to delete non-matching files and/or thinks matching files don’t match. I am able to reproduce the issue on VMs at home, and with multiple test machines and our servers at work. However, until the work orders slow down I am unable to keep digging into the issue, and rsync has been working as a temporary solution.
-
RE: Replication problems 1.5.4 - always copying
@wayne-workman Thanks Wayne. Is it possible that the block times out when the remote server takes too long to create the md5sum, and that is why it keeps trying to replace the same file on every replication cycle? Is there a place where I can change the timeout as a troubleshooting step (or I might go with George’s technique and try to echo the two variables to a log during that pass to check whether it is truly grabbing the $hashRem
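The logging idea could be tried along these lines. This is a rough Python sketch of the technique rather than a patch for FOG's PHP code; the logger name and the function `hashes_match` are hypothetical, and treating an empty remote value as "hash never arrived" is an assumption about how a timeout might show up.

```python
import logging

# Hypothetical logger name, chosen only for this example.
logger = logging.getLogger("replication-debug")


def hashes_match(local_hash: str, remote_hash: str, file_name: str) -> bool:
    """Log both hash values, then report whether they agree."""
    logger.info("%s: local=%r remote=%r", file_name, local_hash, remote_hash)
    if not remote_hash:
        # Assumption: an empty value means the remote hash was never
        # returned (for example after a timeout), not that files differ.
        logger.warning("%s: remote hash is empty", file_name)
        return False
    return local_hash == remote_hash
```

Seeing both values side by side in the log should make it clear whether a reported mismatch is a real difference or just a missing remote hash.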
-
RE: Replication problems 1.5.4 - always copying
@jgallo @Tom-Elliott
In another thread I was looking into the code a bit, and Tom verified that the following code compares the files, with one being hashed on one server and the other being hashed on the other.

    private static function _filesAreEqual($size_a, $size_b, $file_a, $file_b, $avail)
    {
        // Different sizes can never be equal, so no hashing is needed.
        if ($size_a != $size_b) {
            return false;
        }
        if (false === $avail) {
            // Below the size threshold, hash both files from this side.
            if ($size_a < 1047685760) {
                $remhash = md5_file($file_b);
                $lochash = md5_file($file_a);
                return ($remhash == $lochash);
            }
            // Larger files are only checked for existence.
            return file_exists($file_b) && file_exists($file_a);
        }
        // Otherwise $file_b is used directly as the remote hash value.
        $hashLoc = self::getHash($file_a);
        $hashRem = $file_b;
        $hashCom = ($hashLoc == $hashRem);
        return $hashCom;
    }
Tom, is it possible that the function is checking the $hashRem variable before the other server finishes its MD5 hash, and therefore comes up with a mismatch? That still doesn’t explain why it won’t delete the file and replace it, though…
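To make the comparison order easier to reason about, here is a small Python sketch of the same idea: check the size first, then compare a locally computed MD5 against a hash supplied by the other side. It is not a translation of FOG's actual code; `md5_of_file` and `files_are_equal` are illustrative names, and the remote size and hash are assumed to arrive as plain values.

```python
import hashlib
import os


def md5_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Hash a file in chunks so large images do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_are_equal(local_path: str, remote_size: int, remote_hash: str) -> bool:
    """Cheap size check first, then the hash comparison."""
    if os.path.getsize(local_path) != remote_size:
        return False
    # If remote_hash is empty or stale (for example because the remote
    # side had not finished hashing), this reports a mismatch even when
    # the file contents are identical.
    return md5_of_file(local_path) == remote_hash
```

Under these assumptions, a remote hash that is read too early would produce exactly the kind of false mismatch described above, so it may be worth ruling that out before looking elsewhere.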
-
RE: Replication problems 1.5.4 - always copying
@hanz @Tom-Elliott I updated this morning and can confirm that my server will now check other nodes after the first one in line has files to replace. However, I believe it still reports files as different even though the md5sum is the same (and it is not deleting said file when the log says it will delete and transfer it).
[07-09-18 7:04:19 am] ================================== === ==== ===== ==== === ========= == === == === === ======== ==== == ==== === === ======== ==== == ========= === ==== ==== == ========= === ======== ==== == === === === ======== ==== == ==== === === ========= == === == === === ========== ===== ==== ================================== ===== Free Opensource Ghost ====== ================================== ============ Credits ============= = https://fogproject.org/Credits = ================================== == Released under GPL Version 3 == ================================== [07-09-18 7:04:19 am] Interface Ready with IP Address: 10.59.10.12 [07-09-18 7:04:19 am] Interface Ready with IP Address: 127.0.0.1 [07-09-18 7:04:19 am] Interface Ready with IP Address: 127.0.1.1 [07-09-18 7:04:19 am] Interface Ready with IP Address: 192.168.122.1 [07-09-18 7:04:19 am] Interface Ready with IP Address: do-isdir-01.tsd.tsdnet.turlock.k12.ca.us [07-09-18 7:04:19 am] Interface Ready with IP Address: fog-master-tsd-tsdnet-turlock-k12-ca-us [07-09-18 7:04:19 am] * Starting ImageReplicator Service [07-09-18 7:04:19 am] * Checking for new items every 900 seconds [07-09-18 7:04:19 am] * Starting service loop [07-09-18 7:04:19 am] * Starting Image Replication. [07-09-18 7:04:19 am] * We are group ID: 1. We are group name: default [07-09-18 7:04:19 am] * We are node ID: 1. We are node name: DO-FOG-MASTER [07-09-18 7:04:19 am] * Attempting to perform Group -> Group image replication. [07-09-18 7:04:19 am] | Replicating postdownloadscripts [07-09-18 7:04:20 am] * Found Image to transfer to 2 s [07-09-18 7:04:20 am] | File Name: postdownloadscripts [07-09-18 7:04:29 am] | postdownloadscripts: No need to sync fog.deletelog file to OC-FOG-NODE [07-09-18 7:04:29 am] | postdownloadscripts: No need to sync fog.drivers file to OC-FOG-NODE [07-09-18 7:04:29 am] | postdownloadscripts: No need to sync fog.postdownload file to OC-FOG-NODE [07-09-18 7:04:29 am] * All files synced for this item. 
[07-09-18 7:04:48 am] | postdownloadscripts: No need to sync fog.deletelog file to PH-FOG-NODE [07-09-18 7:04:48 am] | Files do not match on server: PH-FOG-NODE [07-09-18 7:04:48 am] | Deleting remote file: /images/postdownloadscripts/fog.drivers [07-09-18 7:04:48 am] | Files do not match on server: PH-FOG-NODE [07-09-18 7:04:48 am] | Deleting remote file: /images/postdownloadscripts/fog.postdownload [07-09-18 7:04:48 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator..transfer.PH-FOG-NODE.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/postdownloadscripts" "/images/postdownloadscripts"; exit' -u fog,[Protected] 10.60.219.12 [07-09-18 7:04:48 am] * Started sync for Image postdownloadscripts [07-09-18 7:04:48 am] | Replicating postinitscripts [07-09-18 7:04:48 am] * Found Image to transfer to 2 s [07-09-18 7:04:48 am] | File Name: dev/postinitscripts [07-09-18 7:04:57 am] | dev/postinitscripts: No need to sync fog.postinit file to OC-FOG-NODE [07-09-18 7:04:57 am] * All files synced for this item. [07-09-18 7:05:06 am] | dev/postinitscripts: No need to sync fog.postinit file to PH-FOG-NODE [07-09-18 7:05:06 am] * All files synced for this item. [07-09-18 7:05:06 am] * Not syncing Image between s [07-09-18 7:05:06 am] | Image Name: Win10BaseVM [07-09-18 7:05:06 am] | There are no other members to sync to. [07-09-18 7:05:06 am] * Not syncing Image between s [07-09-18 7:05:06 am] | Image Name: Win7BaseVM [07-09-18 7:05:06 am] | There are no other members to sync to. [07-09-18 7:05:06 am] * Attempting to perform Group -> Nodes image replication. 
[07-09-18 7:05:07 am] * Found Image to transfer to 2 s [07-09-18 7:05:07 am] | Image Name: Win10BaseVM [07-09-18 7:05:16 am] | Win10BaseVM: No need to sync d1.fixed_size_partitions file to OC-FOG-NODE [07-09-18 7:05:17 am] | Win10BaseVM: No need to sync d1.mbr file to OC-FOG-NODE [07-09-18 7:05:17 am] | Win10BaseVM: No need to sync d1.minimum.partitions file to OC-FOG-NODE [07-09-18 7:05:17 am] | Win10BaseVM: No need to sync d1.original.fstypes file to OC-FOG-NODE [07-09-18 7:05:17 am] | Win10BaseVM: No need to sync d1.original.swapuuids file to OC-FOG-NODE [07-09-18 7:05:18 am] | Win10BaseVM: No need to sync d1.original.uuids file to OC-FOG-NODE [07-09-18 7:05:18 am] | Win10BaseVM: No need to sync d1.partitions file to OC-FOG-NODE [07-09-18 7:05:18 am] | Win10BaseVM: No need to sync d1p1.img file to OC-FOG-NODE [07-09-18 7:05:19 am] | Win10BaseVM: No need to sync d1p2.img file to OC-FOG-NODE [07-09-18 7:05:19 am] | Win10BaseVM: No need to sync d1p3.img file to OC-FOG-NODE [07-09-18 7:07:17 am] | Files do not match on server: OC-FOG-NODE [07-09-18 7:07:17 am] | Deleting remote file: /images/Win10BaseVM/d1p4.img [07-09-18 7:07:17 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator.Win10BaseVM.transfer.OC-FOG-NODE.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/Win10BaseVM" "/images/Win10BaseVM"; exit' -u fog,[Protected] 10.59.181.12 [07-09-18 7:07:17 am] * Started sync for Image Win10BaseVM [07-09-18 7:07:26 am] | Win10BaseVM: No need to sync d1.fixed_size_partitions file to PH-FOG-NODE [07-09-18 7:07:26 am] | Win10BaseVM: No need to sync d1.mbr file to PH-FOG-NODE [07-09-18 7:07:27 am] | Win10BaseVM: No need to sync d1.minimum.partitions file to PH-FOG-NODE [07-09-18 7:07:27 am] | Win10BaseVM: No need to sync d1.original.fstypes file to PH-FOG-NODE [07-09-18 7:07:27 am] | Win10BaseVM: No need to sync d1.original.swapuuids file to 
PH-FOG-NODE [07-09-18 7:07:27 am] | Win10BaseVM: No need to sync d1.original.uuids file to PH-FOG-NODE [07-09-18 7:07:28 am] | Win10BaseVM: No need to sync d1.partitions file to PH-FOG-NODE [07-09-18 7:07:28 am] | Win10BaseVM: No need to sync d1p1.img file to PH-FOG-NODE [07-09-18 7:07:28 am] | Win10BaseVM: No need to sync d1p2.img file to PH-FOG-NODE [07-09-18 7:07:29 am] | Win10BaseVM: No need to sync d1p3.img file to PH-FOG-NODE [07-09-18 7:09:28 am] | Win10BaseVM: No need to sync d1p4.img file to PH-FOG-NODE [07-09-18 7:09:28 am] * All files synced for this item. [07-09-18 7:09:28 am] * Found Image to transfer to 2 s [07-09-18 7:09:28 am] | Image Name: Win7BaseVM [07-09-18 7:09:37 am] | Win7BaseVM: No need to sync d1.fixed_size_partitions file to OC-FOG-NODE [07-09-18 7:09:37 am] | Win7BaseVM: No need to sync d1.mbr file to OC-FOG-NODE [07-09-18 7:09:37 am] | Win7BaseVM: No need to sync d1.minimum.partitions file to OC-FOG-NODE [07-09-18 7:09:38 am] | Win7BaseVM: No need to sync d1.original.fstypes file to OC-FOG-NODE [07-09-18 7:09:38 am] | Win7BaseVM: No need to sync d1.original.swapuuids file to OC-FOG-NODE [07-09-18 7:09:38 am] | Win7BaseVM: No need to sync d1.partitions file to OC-FOG-NODE [07-09-18 7:11:56 am] | Win7BaseVM: No need to sync d1p1.img file to OC-FOG-NODE [07-09-18 7:11:56 am] * All files synced for this item. 
[07-09-18 7:12:05 am] | Win7BaseVM: No need to sync d1.fixed_size_partitions file to PH-FOG-NODE
[07-09-18 7:12:05 am] | Win7BaseVM: No need to sync d1.mbr file to PH-FOG-NODE
[07-09-18 7:12:05 am] | Win7BaseVM: No need to sync d1.minimum.partitions file to PH-FOG-NODE
[07-09-18 7:12:06 am] | Win7BaseVM: No need to sync d1.original.fstypes file to PH-FOG-NODE
[07-09-18 7:12:06 am] | Win7BaseVM: No need to sync d1.original.swapuuids file to PH-FOG-NODE
[07-09-18 7:12:06 am] | Win7BaseVM: No need to sync d1.partitions file to PH-FOG-NODE
[07-09-18 7:14:21 am] | Win7BaseVM: No need to sync d1p1.img file to PH-FOG-NODE
[07-09-18 7:14:21 am] * All files synced for this item.
[07-09-18 7:29:21 am] * Starting Image Replication.
[07-09-18 7:29:21 am] * We are group ID: 1. We are group name: default
[07-09-18 7:29:21 am] * We are node ID: 1. We are node name: DO-FOG-MASTER
[07-09-18 7:29:21 am] * Attempting to perform Group -> Group image replication.
[07-09-18 7:29:21 am] | Replicating postdownloadscripts
[07-09-18 7:29:21 am] * Found Image to transfer to 2 s
[07-09-18 7:29:21 am] | File Name: postdownloadscripts
[07-09-18 7:29:30 am] | postdownloadscripts: No need to sync fog.deletelog file to OC-FOG-NODE
[07-09-18 7:29:31 am] | postdownloadscripts: No need to sync fog.drivers file to OC-FOG-NODE
[07-09-18 7:29:31 am] | postdownloadscripts: No need to sync fog.postdownload file to OC-FOG-NODE
[07-09-18 7:29:31 am] * All files synced for this item.
[07-09-18 7:29:40 am] | postdownloadscripts: No need to sync fog.deletelog file to PH-FOG-NODE
[07-09-18 7:29:41 am] | postdownloadscripts: No need to sync fog.drivers file to PH-FOG-NODE
[07-09-18 7:29:41 am] | postdownloadscripts: No need to sync fog.postdownload file to PH-FOG-NODE
[07-09-18 7:29:41 am] * All files synced for this item.
[07-09-18 7:29:41 am] | Replicating postinitscripts
[07-09-18 7:29:41 am] * Found Image to transfer to 2 s
[07-09-18 7:29:41 am] | File Name: dev/postinitscripts
[07-09-18 7:29:51 am] | dev/postinitscripts: No need to sync fog.postinit file to OC-FOG-NODE
[07-09-18 7:29:51 am] * All files synced for this item.
[07-09-18 7:30:00 am] | dev/postinitscripts: No need to sync fog.postinit file to PH-FOG-NODE
[07-09-18 7:30:00 am] * All files synced for this item.
[07-09-18 7:30:00 am] * Not syncing Image between s
[07-09-18 7:30:00 am] | Image Name: Win10BaseVM
[07-09-18 7:30:00 am] | There are no other members to sync to.
[07-09-18 7:30:00 am] * Not syncing Image between s
[07-09-18 7:30:00 am] | Image Name: Win7BaseVM
[07-09-18 7:30:00 am] | There are no other members to sync to.
[07-09-18 7:30:00 am] * Attempting to perform Group -> Nodes image replication.
[07-09-18 7:30:00 am] * Found Image to transfer to 2 s
[07-09-18 7:30:00 am] | Image Name: Win10BaseVM
[07-09-18 7:30:10 am] | Win10BaseVM: No need to sync d1.fixed_size_partitions file to OC-FOG-NODE
[07-09-18 7:30:10 am] | Win10BaseVM: No need to sync d1.mbr file to OC-FOG-NODE
[07-09-18 7:30:10 am] | Win10BaseVM: No need to sync d1.minimum.partitions file to OC-FOG-NODE
[07-09-18 7:30:11 am] | Win10BaseVM: No need to sync d1.original.fstypes file to OC-FOG-NODE
[07-09-18 7:30:11 am] | Win10BaseVM: No need to sync d1.original.swapuuids file to OC-FOG-NODE
[07-09-18 7:30:11 am] | Win10BaseVM: No need to sync d1.original.uuids file to OC-FOG-NODE
[07-09-18 7:30:11 am] | Win10BaseVM: No need to sync d1.partitions file to OC-FOG-NODE
[07-09-18 7:30:12 am] | Win10BaseVM: No need to sync d1p1.img file to OC-FOG-NODE
[07-09-18 7:30:12 am] | Win10BaseVM: No need to sync d1p2.img file to OC-FOG-NODE
[07-09-18 7:30:13 am] | Win10BaseVM: No need to sync d1p3.img file to OC-FOG-NODE
[07-09-18 7:32:18 am] | Files do not match on server: OC-FOG-NODE
[07-09-18 7:32:18 am] | Deleting remote file: /images/Win10BaseVM/d1p4.img
[07-09-18 7:32:18 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file "/opt/fog/log/fogreplicator.Win10BaseVM.transfer.OC-FOG-NODE.log";set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/Win10BaseVM" "/images/Win10BaseVM"; exit' -u fog,[Protected] 10.59.181.12
[07-09-18 7:32:18 am] * Started sync for Image Win10BaseVM
[07-09-18 7:32:28 am] | Win10BaseVM: No need to sync d1.fixed_size_partitions file to PH-FOG-NODE
[07-09-18 7:32:28 am] | Win10BaseVM: No need to sync d1.mbr file to PH-FOG-NODE
[07-09-18 7:32:28 am] | Win10BaseVM: No need to sync d1.minimum.partitions file to PH-FOG-NODE
[07-09-18 7:32:29 am] | Win10BaseVM: No need to sync d1.original.fstypes file to PH-FOG-NODE
[07-09-18 7:32:29 am] | Win10BaseVM: No need to sync d1.original.swapuuids file to PH-FOG-NODE
[07-09-18 7:32:29 am] | Win10BaseVM: No need to sync d1.original.uuids file to PH-FOG-NODE
[07-09-18 7:32:29 am] | Win10BaseVM: No need to sync d1.partitions file to PH-FOG-NODE
[07-09-18 7:32:30 am] | Win10BaseVM: No need to sync d1p1.img file to PH-FOG-NODE
[07-09-18 7:32:30 am] | Win10BaseVM: No need to sync d1p2.img file to PH-FOG-NODE
[07-09-18 7:32:31 am] | Win10BaseVM: No need to sync d1p3.img file to PH-FOG-NODE
```
-
RE: Replication problems 1.5.4 - always copying
@tom-elliott Based off my last reply to you on this thread:
https://forums.fogproject.org/topic/12037/replication-only-working-on-1-5-nodes
I have noticed that it has not been deleting the files when it says it will in the log. I can manually delete them via FTP with the fog username and password. Also, the snapins update on the nodes just fine when I re-upload a snapin, so it seems to be an issue with the images only. I think what they are referring to is that since they have multiple nodes, as do I, the image never replicates to the other nodes because the replicator keeps finding a fault with the first node in line and never gets past updating it.
-
RE: Replication problems 1.5.4 - always copying
@george1421 Dear god I hope you’re right. I haven’t had a chance to upgrade to working branch 1.5.5 yet… been busy with other summer work. Oddly enough, I found my snapins update just fine when it comes to replication… it is just the images where the old files that didn’t match never get deleted. I will try this as soon as I get a chance.
-
RE: Replication: Only working on 1/5 nodes
@george1421 Sorry for the long delay, I’ve been gone on vacation for the last week. I just tried using the fog credentials stored for one of the nodes and was able to FTP in from my Windows workstation using clonezilla. I was able to delete the file manually. So now the question is:
Why isn’t the PHP command to do the FTP deletion for the replicator service working?
-
RE: PXE Boot Dell Optiplex 7050 fails in UEFI works in Legacy
@george1421 I have had to disable Secure Boot to PXE boot at all with FOG, but I think you can re-enable it after the image deploys but before it starts the OOBE (meaning you will need to have it shut down after imaging or catch it on reboot). One of the more experienced guys might prove me wrong though.
You will have to leave it disabled if you want to PXE boot in the future though.
-
RE: PXE Boot Dell Optiplex 7050 fails in UEFI works in Legacy
@dholland When we started rolling out the 7050 and the Latitude e5580 we were forced to go Windows 10 thanks to Intel not supporting updates in Win7 with their latest chipsets, so I had to figure out making a golden image for Win10 and using UEFI and Legacy on the same network… all sorts of fun.
-
RE: PXE Boot Dell Optiplex 7050 fails in UEFI works in Legacy
@dholland Hey, I think for UEFI booting you have to boot with ipxe.efi for option 67. Also, I would give this wiki page a read… I just set it up last month and it’s amazing!
https://wiki.fogproject.org/wiki/index.php?title=BIOS_and_UEFI_Co-Existence
-
RE: Replication: Only working on 1/5 nodes
@tom-elliott Okay, thanks. Back to the drawing board I go. Any ideas on why this line is not deleting the file on the node before the lftp call?
self::$FOGFTP->delete($remotefilescheck[$index]);
-
RE: Replication: Only working on 1/5 nodes
@george1421 @Tom-Elliott Is this a logic error in this method?
    private static function _filesAreEqual($size_a, $size_b, $file_a, $file_b, $avail)
    {
        if ($size_a != $size_b) {
            return false;
        }
        if (false === $avail) {
            if ($size_a < 1047685760) {
                $remhash = md5_file($file_b);
                $lochash = md5_file($file_a);
                return ($remhash == $lochash);
            }
            return file_exists($file_b) && file_exists($file_a);
        }
        $hashLoc = self::getHash($file_a);
        $hashRem = $file_b;
        $hashCom = ($hashLoc == $hashRem);
        return $hashCom;
    }
It looks like it skips md5sum for files larger than about 1GB and does the encode_64 from getHash instead. The method calls getHash for $file_a but not for $file_b, and I do not see a part in the PHP file that computes a hash for $file_b at all. Wouldn’t this mean $hashCom would be comparing a hashed file against a non-hashed file?
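One way to rule the hashing question in or out by hand is to compare the two copies outside of FOG. The sketch below is not FOG's code. It is a minimal shell check, assuming GNU coreutils (`stat -c`, `md5sum`) and that both copies are reachable as local paths, for example after downloading the node's copy. The function name `files_are_equal` is made up for illustration. It compares sizes first and then hashes both files the same way, so both sides are always compared like for like.

```shell
#!/usr/bin/env bash
# Illustrative only: not part of FOG. Assumes GNU stat and md5sum.

# Return 0 when both files have the same size and the same md5 hash.
files_are_equal() {
    local file_a="$1" file_b="$2"

    # Both files must exist before anything else is compared.
    [ -f "$file_a" ] && [ -f "$file_b" ] || return 1

    # Cheap check first: files of different sizes can never match.
    local size_a size_b
    size_a=$(stat -c %s "$file_a")
    size_b=$(stat -c %s "$file_b")
    [ "$size_a" -eq "$size_b" ] || return 1

    # Hash both sides with the same tool, then compare the digests.
    local hash_a hash_b
    hash_a=$(md5sum "$file_a" | cut -d' ' -f1)
    hash_b=$(md5sum "$file_b" | cut -d' ' -f1)
    [ "$hash_a" = "$hash_b" ]
}
```

A call such as `files_are_equal /images/Win10BaseVM/d1p4.img /tmp/node-copy/d1p4.img && echo match || echo differ` would then report the result, where `/tmp/node-copy` is a hypothetical location for the copy pulled from the node.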
-
RE: Replication: Only working on 1/5 nodes
@george1421 Thanks, George. I’ll have to give that a go once I get this replication issue resolved. Right now I’m trying to keep myself from going cross-eyed going over @Tom-Elliott 's PHP. It’s driving me nuts that I can’t figure out:
- Why it says the file is different when sometimes the md5sum matches for the two files
- Why it’s not deleting the file when they don’t match
- Why, when replacing a file with a different md5sum, running the lftp command manually with the correct credentials gives no errors but doesn’t change the existing file (I’m guessing because the deleting is handled by this line: self::$FOGFTP->delete($remotefilescheck[$index]);)
To be fair, I don’t know PHP, just some other languages. I am just trying to get a better understanding of what’s happening under the hood to see if it will help me resolve my issue.
I feel like there has to be something wrong with my environment, otherwise we should see way more people complaining about this issue, no? However, the fact that I can reproduce the same issues on the OptiPlex 380 test FOG group that I am having with my actual FOG servers means it’s likely not the servers themselves… or I’m doing it wrong after following the directions here:
https://wiki.fogproject.org/wiki/index.php?title=CentOS_7
Idk… I better start sweeping up all the hair I’ve pulled out
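For testing the delete step by hand, lftp can issue the removal itself before the mirror runs. This is a minimal sketch, not how FOG's replicator performs the delete. The helper names `lftp_delete_script` and `delete_remote_file` are hypothetical, the host and path in the usage note are taken from the replicator log, and the script only prints what it would run unless `RUN=1` is set.

```shell
#!/usr/bin/env bash
# Illustrative only: not how FOG's replicator performs the delete.

# Build the lftp command string that removes one remote file and exits.
lftp_delete_script() {
    local remote_path="$1"
    printf 'rm -f "%s"; exit' "$remote_path"
}

# Delete a remote file on a storage node, or just print the command (default).
delete_remote_file() {
    local host="$1" user="$2" pass="$3" remote_path="$4"
    local script
    script=$(lftp_delete_script "$remote_path")
    if [ "${RUN:-0}" = "1" ]; then
        # Same option layout as the CMD line in the replicator log.
        lftp -e "$script" -u "${user},${pass}" "$host"
    else
        # Dry run: show the command without revealing the password.
        printf "lftp -u %s,[Protected] -e '%s' %s\n" "$user" "$script" "$host"
    fi
}
```

For example, `delete_remote_file 10.59.181.12 fog "$FOG_PASS" /images/Win10BaseVM/d1p4.img` prints the command, and prefixing it with `RUN=1` executes it. The lftp documentation describes `--ignore-time` as ignoring timestamps when deciding whether to transfer, which would leave size as the deciding factor. Under that reading, a same-size file with different contents would stay in place unless it is deleted before the mirror.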
-
RE: Replication: Only working on 1/5 nodes
@george1421 Hey George, I thought the faux drivers folder would only work if there were no subdirectories in the drivers folder? Has this been modified to allow your previous setup of drivers/ModelName/x64/.ini files?
Also, I would like to note that I am not using NAS devices but three Dell rack-mounted servers that we bought a few months ago. I do have a hardware-level RAID 1, but today I repeated the test and set up some old OptiPlex 380 machines as test FOG servers (as their own master and 2 nodes, not as part of my actual server and its nodes). With these test FOG servers, it’s even more screwy. It says that files don’t match even though the md5sum is the same. I didn’t even get to updating the image. The imagereplicationservice is now hanging up after the first transfer (on the 380’s at least).
I was trying to find the code to get a better understanding of what’s happening at the actual service level. After some digging I found the imagereplicator.class.php file, which calls another function replicateItems for each image. I cannot find where this replicateItems function is defined. Basically, I am trying to find the class that is actually doing the lftp command and / or deleting the files, how it’s verifying if they are the same, etc.
You wouldn’t happen to know where these files are, would you?
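A recursive grep is one way to track down where a method such as replicateItems is defined. The sketch below assumes GNU grep. The helper name `find_function_def` is made up, and the web root in the comment is an assumption that depends on the install.

```shell
#!/usr/bin/env bash
# Illustrative only. Assumes GNU grep (for -r and --include).

# Print file:line:text for every PHP file under a directory that
# contains a definition (not a call) of the named function.
find_function_def() {
    local func_name="$1" search_dir="$2"
    grep -rnE --include='*.php' \
        "function[[:space:]]+${func_name}[[:space:]]*\(" "$search_dir"
}

# Example (hypothetical path, adjust to where FOG's web files live):
#   find_function_def replicateItems /var/www/html/fog
```

Matching on the word `function` before the name filters out the call sites, so only the file that defines the method is listed.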