Multicast problem - per host not group



  • Hi there,
    When creating a Multicast task for a group of (30) hosts it looks as though the multicast task is created individually per host and when the suite of computers is booted, the download starts but works as if we had just set a unicast for all. In other words, unlike our old 0.32 setup, the server is not waiting for all hosts to get to the same ‘checkpoint’ before downloading.
    Hope that makes sense.
    Thanks for any help.



  • [quote=“Tom Elliott, post: 34915, member: 7271”]What version of FOG are you runnign?[/quote]

    Sorry for the late response. I didn’t see your message. I am running 1.2.0 on Ubuntu 12.04. I am running into another problem now where even unicasting is crashing the network… Any suggestions?


  • Senior Developer

    [quote=“Nathan Blehi, post: 34904, member: 25306”]Hi Tom, I am having a similar issue. When I add the hosts to a group and then multicast, they just sit on a the screen “Partclone v0.2.69 [url]http://partlone.org[/url]” “Starting to restore image (-) to Device (/dev/sda1)” … I can unicast just fine but can not multicast. I must have a setting wrong but can’t figure it out. Can you help?

    Nate[/quote]

    What version of FOG are you runnign?



  • [quote=“Tom Elliott, post: 34291, member: 7271”]So you’re not able to create a “group” multicast tasking? Or the multicast tasking never starts?

    Have you verified that the FOGMulticastManager service is actually running?[/quote]

    Hi Tom, I am having a similar issue. When I add the hosts to a group and then multicast, they just sit on a the screen “Partclone v0.2.69 [url]http://partlone.org[/url]” “Starting to restore image (-) to Device (/dev/sda1)” … I can unicast just fine but can not multicast. I must have a setting wrong but can’t figure it out. Can you help?

    Nate



  • [quote=“Enrico, post: 34299, member: 25297”]This is the log file: /opt/fog/log/multicast.log
    the process is:
    ps aux|grep FOGMulticastManager
    root 1716 0.0 2.0 333900 20888 ? S 12:31 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager[/quote]

    I am kind of getting the same thing. I click run multicast, PXE boot, and when all computers are ready to go it hangs. It seems like if it is waiting for another computer. On the screen I get " Partclone v0.2.69 [url]http://partclone.org[/url]" "Starting to restore image (-) to device (/dev/sda1). What do I need to do.

    Nate



  • Thanks for the pointers Enrico.

    The mulitcast task log (multicast.log.udpcast.tasknumber) looks like this:

    [CODE]Udp-sender 20120424
    Using mcast address 236.16.1.48
    UDP sender for (stdin) at 172.16.1.48 on bond0
    Broadcasting control to 224.0.0.1
    New connection from 172.16.12.2 (#0) 00000009
    New connection from 172.16.12.10 (#1) 00000009
    New connection from 172.16.12.17 (#2) 00000009
    Starting transfer: 00000009
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    Timeout notAnswered=[0,1,2] notReady=[0,1,2] nrAns=0 nrRead=0 nrPart=3 avg=10000
    [/CODE]

    The multicast.log looks like:

    [CODE][07-29-14 12:12:22 pm] CMD: cat /images/WIN7V13/sys.img.*|/usr/local/sbin/udp-sender --min-receivers 3 --portbase 57722 --interface bond0 --full-duplex --ttl 32 --nokbd;
    [07-29-14 12:12:22 pm] | Task (45) test has started.
    [07-29-14 12:12:32 pm] | Task (45) test is already running PID 22373
    [07-29-14 12:12:42 pm] | Task (45) test is already running PID 22373
    [07-29-14 12:12:52 pm] | Task (45) test is already running PID 22373
    [07-29-14 12:13:02 pm] | Task (45) test is already running PID 22373
    [07-29-14 12:13:12 pm] | Task (45) test is already running PID 22373
    [/CODE]

    and this goes on until we switch the machines off where the task is marked as complete.

    The only other thing I can see is the /bon/fog.download line 70 error which I highlighted in a earlier post.

    The FOGMulticastManagerService is running.

    Thanks!

    IA



  • [quote=“ictadmin, post: 34293, member: 1004”]Hi Tom,

    For us the multicast task never starts, it’s as if the computers are waiting for another computer to join - or for something to kick the session off.

    How can I check if the FOGMulitcastManager is running please?

    Thanks,

    IA[/quote]
    This is the log file: /opt/fog/log/multicast.log
    the process is:
    ps aux|grep FOGMulticastManager
    root 1716 0.0 2.0 333900 20888 ? S 12:31 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager



  • I create the task, then I boot the 2 group members and the task starts (I can see the udpcast process and the log entry:
    cat /images/test1/test1|/usr/local/sbin/udp-sender --min-receivers 2 --portbase 57050 --interface eth0 --full-duplex --ttl 32 --nokbd;

    the hosts start partclone “starting to restore image…” and hang (I don’t see any traffic).
    Sending a multicast task on a single host works…
    Thanks for your help



  • Hi Tom,

    For us the multicast task never starts, it’s as if the computers are waiting for another computer to join - or for something to kick the session off.

    How can I check if the FOGMulitcastManager is running please?

    Thanks,

    IA


  • Senior Developer

    So you’re not able to create a “group” multicast tasking? Or the multicast tasking never starts?

    Have you verified that the FOGMulticastManager service is actually running?



  • I have the same issue:
    when the clients are set to multicast individually, the job completes just fine, when they are part of a group job they don’t even begin, they just sit at the blue screen as if they are waiting for another computer to join the group, have left them for an hour in this state and nothing I’m afraid
    and so I subscribe (up?) the thread to get notifications… Wrong?


  • Senior Developer

    [quote=“Enrico, post: 34288, member: 25297”]subscribe[/quote]

    What does “subscribe” mean?



  • subscribe



  • Thanks Junkhacker, when the clients are set to multicast individually, the job completes just fine, when they are part of a group job they don’t even begin, they just sit at the blue screen as if they are waiting for another computer to join the group, have left them for an hour in this state and nothing I’m afraid!


  • Developer

    you’re not likely to find anything from google on that.
    thank you for reporting that bug, and let us know if the imaging task complete successfully



  • One last thing on this issue, when the client boots into it’s multicast runtime we get an error saying:

    /bin/fog.download: line 70: [: too many arguments

    Is this normal? Just going to start googling it now!



  • Thanks for the suggestions, the fog sever is connected to the internet via a proxy, but the proxy port the server connects to is unauthenticated (locked down only but network acls)

    I’ve just tried removing the proxy information and this seems to have fixed the issues (both slow task creation and the multicast grouping issue)

    Thanks for your help!


  • Senior Developer

    Understood.

    I think this is related to the “delay” issue you’re seeing on logging into the GUI.

    I think what’s happening is the “delay” is causing the tasks to be generated very slowly.

    So by the time one host is “tasked” the udp-sender command actually starts up. Then the next task get’s created.



  • Hopefully the attached images will help explain what’s going on a bit better.

    [ATTACH=full]1214[/ATTACH][ATTACH=full]1215[/ATTACH][ATTACH=full]1216[/ATTACH][ATTACH=full]1217[/ATTACH]

    When the machines are turned on from this point they will just image (they won’t wait for the others in their group) so if we were to turn one machine on it will just image.

    [url="/_imported_xf_attachments/1/1214_Multicast1.PNG?:"]Multicast1.PNG[/url][url="/_imported_xf_attachments/1/1215_Multicast2.PNG?:"]Multicast2.PNG[/url][url="/_imported_xf_attachments/1/1216_Multicast3.PNG?:"]Multicast3.PNG[/url][url="/_imported_xf_attachments/1/1217_Multicast4.PNG?:"]Multicast4.PNG[/url]


  • Senior Developer

    Are you saying you’re seeing the tasks in the active task list as individual or you saying you can start one system we’ve all the rest within the group turned off and that one system will image


Log in to reply
 

376
Online

38976
Users

10712
Topics

101675
Posts

Looks like your connection to FOG Project was lost, please wait while we try to reconnect.