Multicast problem - per host not group
-
I have the same issue:
when the clients are set to multicast individually, the job completes just fine, when they are part of a group job they don’t even begin, they just sit at the blue screen as if they are waiting for another computer to join the group, have left them for an hour in this state and nothing I’m afraid
and so I subscribe (up?) the thread to get notifications… Wrong? -
So you’re not able to create a “group” multicast tasking? Or the multicast tasking never starts?
Have you verified that the FOGMulticastManager service is actually running?
-
Hi Tom,
For us the multicast task never starts, it’s as if the computers are waiting for another computer to join - or for something to kick the session off.
How can I check if the FOGMulitcastManager is running please?
Thanks,
IA
-
I create the task, then I boot the 2 group members and the task starts (I can see the udpcast process and the log entry:
cat /images/test1/test1|/usr/local/sbin/udp-sender --min-receivers 2 --portbase 57050 --interface eth0 --full-duplex --ttl 32 --nokbd;the hosts start partclone “starting to restore image…” and hang (I don’t see any traffic).
Sending a multicast task on a single host works…
Thanks for your help -
[quote=“ictadmin, post: 34293, member: 1004”]Hi Tom,
For us the multicast task never starts, it’s as if the computers are waiting for another computer to join - or for something to kick the session off.
How can I check if the FOGMulitcastManager is running please?
Thanks,
IA[/quote]
This is the log file: /opt/fog/log/multicast.log
the process is:
ps aux|grep FOGMulticastManager
root 1716 0.0 2.0 333900 20888 ? S 12:31 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager -
Thanks for the pointers Enrico.
The mulitcast task log (multicast.log.udpcast.tasknumber) looks like this:
[CODE]Udp-sender 20120424
Using mcast address 236.16.1.48
UDP sender for (stdin) at 172.16.1.48 on bond0
Broadcasting control to 224.0.0.1
New connection from 172.16.12.2 (#0) 00000009
New connection from 172.16.12.10 (#1) 00000009
New connection from 172.16.12.17 (#2) 00000009
Starting transfer: 00000009
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
[/CODE]The multicast.log looks like:
[CODE][07-29-14 12:12:22 pm] CMD: cat /images/WIN7V13/sys.img.*|/usr/local/sbin/udp-sender --min-receivers 3 --portbase 57722 --interface bond0 --full-duplex --ttl 32 --nokbd;
[07-29-14 12:12:22 pm] | Task (45) test has started.
[07-29-14 12:12:32 pm] | Task (45) test is already running PID 22373
[07-29-14 12:12:42 pm] | Task (45) test is already running PID 22373
[07-29-14 12:12:52 pm] | Task (45) test is already running PID 22373
[07-29-14 12:13:02 pm] | Task (45) test is already running PID 22373
[07-29-14 12:13:12 pm] | Task (45) test is already running PID 22373
[/CODE]and this goes on until we switch the machines off where the task is marked as complete.
The only other thing I can see is the /bon/fog.download line 70 error which I highlighted in a earlier post.
The FOGMulticastManagerService is running.
Thanks!
IA
-
[quote=“Enrico, post: 34299, member: 25297”]This is the log file: /opt/fog/log/multicast.log
the process is:
ps aux|grep FOGMulticastManager
root 1716 0.0 2.0 333900 20888 ? S 12:31 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager[/quote]I am kind of getting the same thing. I click run multicast, PXE boot, and when all computers are ready to go it hangs. It seems like if it is waiting for another computer. On the screen I get " Partclone v0.2.69 [url]http://partclone.org[/url]" "Starting to restore image (-) to device (/dev/sda1). What do I need to do.
Nate
-
[quote=“Tom Elliott, post: 34291, member: 7271”]So you’re not able to create a “group” multicast tasking? Or the multicast tasking never starts?
Have you verified that the FOGMulticastManager service is actually running?[/quote]
Hi Tom, I am having a similar issue. When I add the hosts to a group and then multicast, they just sit on a the screen “Partclone v0.2.69 [url]http://partlone.org[/url]” “Starting to restore image (-) to Device (/dev/sda1)” … I can unicast just fine but can not multicast. I must have a setting wrong but can’t figure it out. Can you help?
Nate
-
[quote=“Nathan Blehi, post: 34904, member: 25306”]Hi Tom, I am having a similar issue. When I add the hosts to a group and then multicast, they just sit on a the screen “Partclone v0.2.69 [url]http://partlone.org[/url]” “Starting to restore image (-) to Device (/dev/sda1)” … I can unicast just fine but can not multicast. I must have a setting wrong but can’t figure it out. Can you help?
Nate[/quote]
What version of FOG are you runnign?
-
[quote=“Tom Elliott, post: 34915, member: 7271”]What version of FOG are you runnign?[/quote]
Sorry for the late response. I didn’t see your message. I am running 1.2.0 on Ubuntu 12.04. I am running into another problem now where even unicasting is crashing the network… Any suggestions?