Storage nodes not deploying images
-
Hi George,
I just watched the video tutorial on the location plugin, then installed it on the main server. I have created both the main server and two storage nodes in the location management tab. As all three are Fog servers, one normal and two storage, I selected the option to use kernels and inits from all locations. Time to test, I will post up once I have restarted my 6 test hosts and seen where they pull their deployment images from now.Thanks for the information, I didn’t know about the plugins as I only started Fog (to replace my WDS server) less than a week ago.
-
Hi George,
It seems that the hosts are still deploying from the main server only, I attached a screenshot showing the bandwidth and it shows that only FOG-Ubuntu (my normal server) is sending data, my 2 slave (storage servers) nodes are doing nothing.I am also watching ‘tcptrack’ on all three servers, there is a lot of activity between the 2 storage nodes and the main server, but all very small and very short connections (speed less than 5 KB/s), nothing like the traffic on the main server - I see all the small activities, but also the 6 deployments running around 20 MB/s.
I am wondering if this is because I am only selecting deploy at the Fog PXE window on the hosts and not registering the clients when they first connect?
-
@EuroEnglish Double check the image replicated properly. If the non-masters don’t have a copy, I think it’ll use the node that does have a copy.
Can you post a screenshot of your Storage Management page please? the one that lists all the storage nodes, groups, master status. Also - the image you’re deploying, can you post a screenshot of the Image’s storage group area please?
-
@EuroEnglish And also did you assign specific hosts to specific locations?
-
Hi Wayne,
I have attached the screenshots you requested and also the most recent Image Replication log file, in case that shows any issues. Right now it isn’t a problem, as I am only testing a few machines at a time, however once summer arrives I need to deploy around 300 laptops. I am hoping that setting up a main server and 3 storage nodes will allow me to push around 20 at a time without too much slow down (5 per node). -
@EuroEnglish Going to look through those closely, but this is what I was requesting for the image’s storage group settings:
-
@EuroEnglish Can you also please post a screenshot of your location management area - the one that lists all the locations please? Also - ensure as @george1421 already said - make sure the host you’re trying to image is assigned the location you want it to use.
-
@EuroEnglish said in Storage nodes not deploying images:
me to push around 20 at a time without too much slow down (
If you are going to do this, and all clients and fog server will be on the same subnet (VLAN) you might consider a multicast deployment instead of 20 unicast deployments. You can scale to 50 or more systems using a mutlicast deployment. It does take some work to set it up but you can deploy a whole classroom quickly with that route.
<edit> Also consider if your network is robust enough to support 20 unicast streams running at 4-6GB/min. That may be too much for your infrastructure. </edit>
-
Hi Wayne,
Sorry about that, I have now taken a screenshot of that section and attached it. I also checked the other 2 images I have and all are the same as the one attached. -
looking at the log, it might appear that your storage groups are not setup correctly.
[04-11-17 4:06:27 pm] | Image Name: Student-E460 [04-11-17 4:06:27 pm] * Found Image to transfer to 2 nodes [04-11-17 4:06:27 pm] * Attempting to perform Group -> Nodes image replication. [04-11-17 4:06:27 pm] | There are no other members to sync to. [04-11-17 4:06:27 pm] | Image Name: Student-Twist [04-11-17 4:06:27 pm] * Not syncing Image between groups [04-11-17 4:06:27 pm] | There are no other members to sync to. [04-11-17 4:06:27 pm] | Image Name: Student-T440-Old [04-11-17 4:06:27 pm] * Not syncing Image between groups [04-11-17 4:06:27 pm] | There are no other members to sync to. [04-11-17 4:06:27 pm] | Image Name: Student-E460 [04-11-17 4:06:27 pm] * Not syncing Image between groups
This part of the log may need some dev clarification (and some possible code changes). There is no need to sync because??? The file already exists on the target server or the FOG replicator is being a bit lazy??
[04-11-17 4:06:30 pm] | Student-E460: No need to sync d1.partitions file to FOG-Ubuntu-Slave1 [04-11-17 4:06:30 pm] | Student-E460: No need to sync d1.original.swapuuids file to FOG-Ubuntu-Slave1 [04-11-17 4:06:30 pm] | Student-E460: No need to sync d1.original.fstypes file to FOG-Ubuntu-Slave1 [04-11-17 4:06:29 pm] | Student-E460: No need to sync d1.minimum.partitions file to FOG-Ubuntu-Slave1 [04-11-17 4:06:29 pm] | Student-E460: No need to sync d1.mbr file to FOG-Ubuntu-Slave1 [04-11-17 4:06:29 pm] | Student-E460: No need to sync d1.fixed_size_partitions file to FOG-Ubuntu-Slave1
-
Hi Wayne,
Here is the location management screenshot. -
@EuroEnglish Sorry to keep asking you for screen shots and more screen shots. The one I see missing is the storage group association. All the other ones look good.
Just be aware that the storage nodes and location plugins were intending for a multi site or multi building(subnet) setup where you would dedicate certain workstation to specific locations. Its not really setup to do what I’m calling overflow imaging. (you deploy to server 1 until max clients is reached, then the next client is assigned to another storage node until its max client level is reached and so on). The storage node concept doesn’t work that way.
-
Thanks for the idea regarding multicasting, I will look into setting that up in the Wiki - Seems almost everything you would ever need about Fog is there somewhere. We were running WDS on a Windows server for the last few years, but something went wrong and it will not capture new images. Still deploys, but not a good idea to have old images that take longer to update than it did to deploy the image
I was researching how to fix the WDS server and came across a forum for Fog, this solution is a life-saver, no more sysprep and capturing images is much easier. I also find that deployment is much faster with Fog. We image the machines over the summer break, working in a high school, and bandwidth isn’t much of a problem, individual 1Gb Ethernet to each server and fiber point to point between the server room switches and my imaging location switches. However, I agree that multicasting might still speed things up.
-
@george1421 Something else to consider is that your image push time will be less than 5 min per system per unicast. Can you stage these systems (next system) that fast? It will take longer for windows to install than it will to push out 5 systems in series.
-
@george1421 He has another group called
default
that doesn’t have any nodes in it. I know from the screenshot he posted for the Images Storage Groups. default is listed there. I’m guessing that’s what the logs are referring to. -
Based on your thoughts about the Image replication log files I used Webmin to check the Images folder on each server. It seems that you might be correct, it shows that there is a small file size difference between the main server and the two storage servers. It also seems that rights didn’t transfer the same, both storage servers have different rights to the files.
I marked each file manager with the server for reference:
-
@EuroEnglish The other thing I see is file ownership is different on each system too. That’s not right.
Fog will use the user account listed for each storage node to copy the files from the master node to the remote nodes. This is done over ftp from the master server in the storage group to all slave nodes in the same storage group. Systems defined in FOG but in a different storage group will be left out of the replication cycle.
Did you seed the files on each storage node or did FOG do this?
-
Well spotted on the owner difference, I didn’t even notice that one - Strange as it pretty obvious now you mentioned it. All the image replication was done by Fog itself, however it could be that I messed something up when I was setting up the different users on the different servers - Again, could be part of the problem.
-
I have deleted the ‘Default’ storage group, it will never be used and so I guess there is no point leaving it there. I will add another group, ‘Teacher’, to separate the different images, but that can wait until I have everything running correctly. That should clear up those log file entries I would imagine.
-
@EuroEnglish The fog installer along with a couple extra commands can fix these things.
SSH into each box, become root with
sudo -i
Make sure that inside/opt/fog/.fogsettings
(as root) the username field is set tofog
. Then re-run the fog installer (again, as root). This will fix the passwords & user names both on the local system & in the DB. Then on each box, reset ownership & permissions of the images withchown -R fog:root /images;chmod -R 777 /images