Multicast problem - per host not group
-
Hopefully the attached images will help explain what’s going on a bit better.
When the machines are turned on from this point, each one starts imaging immediately instead of waiting for the others in its group. Turning on a single machine is enough for it to image on its own.
[url=“/_imported_xf_attachments/1/1214_Multicast1.PNG?:”]Multicast1.PNG[/url][url=“/_imported_xf_attachments/1/1215_Multicast2.PNG?:”]Multicast2.PNG[/url][url=“/_imported_xf_attachments/1/1216_Multicast3.PNG?:”]Multicast3.PNG[/url][url=“/_imported_xf_attachments/1/1217_Multicast4.PNG?:”]Multicast4.PNG[/url]
-
Understood.
I think this is related to the “delay” issue you’re seeing on logging into the GUI.
I think what’s happening is the “delay” is causing the tasks to be generated very slowly.
So by the time one host is “tasked”, the udp-sender command has already started up, and only then does the next task get created.
-
Thanks for the suggestions. The FOG server is connected to the internet via a proxy, but the proxy port the server connects to is unauthenticated (locked down only by network ACLs).
I’ve just tried removing the proxy information and this seems to have fixed the issues (both the slow task creation and the multicast grouping issue).
Thanks for your help!
-
One last thing on this issue: when the client boots into its multicast runtime we get an error saying:
/bin/fog.download: line 70: [: too many arguments
Is this normal? Just going to start googling it now!
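For anyone who lands here from a search: `[: too many arguments` is the generic message the shell's `[` test prints when it is handed more words than it can parse, most often because an unquoted variable expanded to several words. Line 70 of fog.download is not shown in this thread, so the sketch below only reproduces the general pattern; the variable name `img_type` and its value are made up for illustration.

```shell
# Hypothetical multi-word value (an assumption, not taken from fog.download).
img_type="multi part value"

# Unquoted: the shell splits the value into three words before `[` sees it,
# so the test is malformed and fails with an error such as
# "[: too many arguments" (silenced here so the script stays quiet).
unquoted_status=0
[ $img_type = "n" ] 2>/dev/null || unquoted_status=$?

# Quoted: `[` receives exactly one word on the left and compares normally.
quoted_status=0
[ "$img_type" = "n" ] || quoted_status=$?

echo "unquoted exit status: $unquoted_status"
echo "quoted exit status:   $quoted_status"
```

An exit status above 1 from `[` signals a usage error rather than a plain "false", which is why a script can carry on past such a line while still printing the message.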
-
You’re not likely to find anything from Google on that.
Thank you for reporting that bug, and let us know if the imaging task completes successfully.
-
Thanks Junkhacker. When the clients are set to multicast individually, the job completes just fine. When they are part of a group job they don’t even begin; they just sit at the blue screen as if they are waiting for another computer to join the group. I have left them for an hour in this state and nothing happens, I’m afraid!
-
subscribe
-
[quote=“Enrico, post: 34288, member: 25297”]subscribe[/quote]
What does “subscribe” mean?
-
I have the same issue:
when the clients are set to multicast individually, the job completes just fine, when they are part of a group job they don’t even begin, they just sit at the blue screen as if they are waiting for another computer to join the group, have left them for an hour in this state and nothing I’m afraid
So I “subscribed” to (bumped?) the thread to get notifications… Is that wrong?
-
So you’re not able to create a “group” multicast tasking? Or the multicast tasking never starts?
Have you verified that the FOGMulticastManager service is actually running?
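A rough way to check from a shell on the FOG server, as a sketch: filter the process list for the service name. The `[F]` bracket is a common grep idiom that stops the grep command from matching itself; the helper name `find_multicast_manager` is made up here, and the log path is the one mentioned later in this thread.

```shell
# Filter a `ps aux` style listing (read from stdin) for the service process.
# The [F] bracket keeps a grep command line in the listing from matching.
find_multicast_manager() {
    grep '[F]OGMulticastManager'
}

# Live check on the server (commented out so the sketch has no side effects):
#   ps aux | find_multicast_manager || echo "FOGMulticastManager is not running"
#   tail -n 20 /opt/fog/log/multicast.log
```

If the first command prints nothing but the fallback message, the service is not running and no multicast session will be started for the group.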
-
Hi Tom,
For us the multicast task never starts, it’s as if the computers are waiting for another computer to join - or for something to kick the session off.
How can I check if the FOGMulticastManager is running please?
Thanks,
IA
-
I create the task, then I boot the 2 group members and the task starts. I can see the udpcast process and the log entry:
cat /images/test1/test1|/usr/local/sbin/udp-sender --min-receivers 2 --portbase 57050 --interface eth0 --full-duplex --ttl 32 --nokbd;
The hosts start partclone (“starting to restore image…”) and hang (I don’t see any traffic).
Sending a multicast task to a single host works…
Thanks for your help
-
[quote=“ictadmin, post: 34293, member: 1004”]Hi Tom,
For us the multicast task never starts, it’s as if the computers are waiting for another computer to join - or for something to kick the session off.
How can I check if the FOGMulticastManager is running please?
Thanks,
IA[/quote]
This is the log file: /opt/fog/log/multicast.log
the process is:
ps aux|grep FOGMulticastManager
root 1716 0.0 2.0 333900 20888 ? S 12:31 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager
-
Thanks for the pointers Enrico.
The multicast task log (multicast.log.udpcast.tasknumber) looks like this:
[CODE]Udp-sender 20120424
Using mcast address 236.16.1.48
UDP sender for (stdin) at 172.16.1.48 on bond0
Broadcasting control to 224.0.0.1
New connection from 172.16.12.2 (#0) 00000009
New connection from 172.16.12.10 (#1) 00000009
New connection from 172.16.12.17 (#2) 00000009
Starting transfer: 00000009
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
[/CODE]
The multicast.log looks like:
[CODE][07-29-14 12:12:22 pm] CMD: cat /images/WIN7V13/sys.img.*|/usr/local/sbin/udp-sender --min-receivers 3 --portbase 57722 --interface bond0 --full-duplex --ttl 32 --nokbd;
[07-29-14 12:12:22 pm] | Task (45) test has started.
[07-29-14 12:12:32 pm] | Task (45) test is already running PID 22373
[07-29-14 12:12:42 pm] | Task (45) test is already running PID 22373
[07-29-14 12:12:52 pm] | Task (45) test is already running PID 22373
[07-29-14 12:13:02 pm] | Task (45) test is already running PID 22373
[07-29-14 12:13:12 pm] | Task (45) test is already running PID 22373
[/CODE]
This goes on until we switch the machines off, at which point the task is marked as complete.
The only other thing I can see is the /bin/fog.download line 70 error which I highlighted in an earlier post.
The FOGMulticastManager service is running.
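To make a udp-sender log like the one above easier to read at a glance, here is a small sketch that counts how many receivers connected and how many timeout rounds followed. It assumes the line prefixes shown in that output (`New connection from`, `Timeout`); the function name and the example file path are hypothetical.

```shell
# Summarise a udp-sender log: receivers that connected vs. timeout rounds.
# Reads the files given as arguments, or stdin when none are given.
summarise_udpcast_log() {
    awk '
        /^New connection from/ { receivers++ }
        /^Timeout /            { timeouts++ }
        END { printf "receivers=%d timeouts=%d\n", receivers, timeouts }
    ' "$@"
}

# Example call (path is an assumption built from the names in this thread):
#   summarise_udpcast_log /opt/fog/log/multicast.log.udpcast.45
```

For the log pasted above this reports 3 receivers and 14 timeouts. All three receivers register with the sender, but every timeout line lists all of them under notAnswered, so the hang happens after the session has formed rather than while waiting for clients to join.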
Thanks!
IA
-
[quote=“Enrico, post: 34299, member: 25297”]This is the log file: /opt/fog/log/multicast.log
the process is:
ps aux|grep FOGMulticastManager
root 1716 0.0 2.0 333900 20888 ? S 12:31 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager[/quote]
I am kind of getting the same thing. I click run multicast, PXE boot, and when all computers are ready to go it hangs. It seems as if it is waiting for another computer. On the screen I get “Partclone v0.2.69 [url]http://partclone.org[/url]” and “Starting to restore image (-) to device (/dev/sda1)”. What do I need to do?
Nate
-
[quote=“Tom Elliott, post: 34291, member: 7271”]So you’re not able to create a “group” multicast tasking? Or the multicast tasking never starts?
Have you verified that the FOGMulticastManager service is actually running?[/quote]
Hi Tom, I am having a similar issue. When I add the hosts to a group and then multicast, they just sit on the screen “Partclone v0.2.69 [url]http://partclone.org[/url]” “Starting to restore image (-) to Device (/dev/sda1)” … I can unicast just fine but cannot multicast. I must have a setting wrong but can’t figure it out. Can you help?
Nate
-
[quote=“Nathan Blehi, post: 34904, member: 25306”]Hi Tom, I am having a similar issue. When I add the hosts to a group and then multicast, they just sit on the screen “Partclone v0.2.69 [url]http://partclone.org[/url]” “Starting to restore image (-) to Device (/dev/sda1)” … I can unicast just fine but cannot multicast. I must have a setting wrong but can’t figure it out. Can you help?
Nate[/quote]
What version of FOG are you running?
-
[quote=“Tom Elliott, post: 34915, member: 7271”]What version of FOG are you running?[/quote]
Sorry for the late response. I didn’t see your message. I am running 1.2.0 on Ubuntu 12.04. I am running into another problem now where even unicasting is crashing the network… Any suggestions?