Deploying via Multicast aborts midway



  • Hello,

    I have a problem with multicast.

    We are trying to switch our imaging system from Timago to FOG because we want to save on license costs.

    We are an IT service provider for various schools. This system is already running at another school.

    We have around 300 clients in different groups to which the image must be deployed.

    After testing the roll-out to a few clients via unicast, which worked perfectly fine, we tested multicast. Everything seemed normal until the clients just stopped downloading.

    The multicast.log just shows that we have a task running. We stopped the task after a while because nothing was happening.

    [03-24-20 10:37:38 am]  * No new tasks found
    [03-24-20 10:37:48 am]  | Task ID: 21 Name: Multi-Cast Task - Test is new
    [03-24-20 10:37:48 am]  | Task ID: 21 Name: Multi-Cast Task - Test image file found, file: /images/Image_Vorlage_24-03-2020
    [03-24-20 10:37:48 am]  | Task ID: 21 Name: Multi-Cast Task - Test 2 clients found
    [03-24-20 10:37:48 am]  | Task ID: 21 Name: Multi-Cast Task - Test sending on base port 53536
    [03-24-20 10:37:48 am]  | Command: /usr/local/sbin/udp-sender --interface ens192 --min-receivers 2 --max-wait 600 --portbase 53536 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/Image_Vorlage_24-03-2020/d1p1.img;/usr/local/sbin/udp-sender --interface ens192 --min-receivers 2 --max-wait 10 --portbase 53536 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/Image_Vorlage_24-03-2020/d1p2.img;
    [03-24-20 10:37:48 am]  | Task ID: 21 Name: Multi-Cast Task - Test has started
    [03-24-20 10:37:59 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:38:09 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:38:19 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:38:29 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:38:39 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:38:49 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:38:59 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:39:09 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:39:19 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:39:29 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:39:39 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:39:49 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:39:59 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:40:09 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:40:19 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:40:29 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:40:39 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:40:49 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:40:59 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:41:09 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:41:19 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:41:29 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:41:39 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:41:49 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:41:59 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:42:10 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:42:20 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:42:30 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:42:40 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:42:50 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:43:00 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:43:10 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:43:20 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:43:30 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:43:40 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:43:50 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:44:00 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:44:10 am]  | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687
    [03-24-20 10:44:20am]  * No new tasks found
    

    The other multicast.log shows this:

    [03-24-20 10:37:48 am] Task startedUdp-sender 20120424
    Using mcast address 234.200.0.20
    UDP sender for /images/Image_Vorlage_24-03-2020/d1p1.img at 10.200.0.20 on ens192 
    Broadcasting control to 224.0.0.1
    New connection from 10.200.20.7  (#0) 00000009
    New connection from 10.200.20.6  (#1) 00000009
    Starting transfer: 00000009
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=4868
    bytes=  4 892 160  re-xmits=0000114 (  3.3%) slice=0112 -   0
    bytes= 40 768 000  re-xmits=0004259 ( 15.2%) slice=0112 -   1
    bytes= 64 576 512  re-xmits=0008636 ( 19.4%) slice=0112 -   0
    bytes=104 855 296  re-xmits=0012732 ( 17.6%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=782
    bytes=126 706 944  re-xmits=0015950 ( 18.3%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=425
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=760
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=953
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=765
    bytes=159 484 416  re-xmits=0020762 ( 18.9%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=742
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=433
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=522
    bytes=196 012 544  re-xmits=0025253 ( 18.7%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=327
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=461
    bytes=220 799 488  re-xmits=0026092 ( 17.2%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=178
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=201
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=249
    bytes=236 128 256  re-xmits=0026403 ( 16.2%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=230
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=265
    bytes=242 977 280  re-xmits=0026698 ( 15.9%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=878
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=783
    bytes=263 035 136  re-xmits=0029636 ( 16.4%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=531
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=891
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=851
    bytes=281 625 344  re-xmits=0031800 ( 16.4%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=554
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=394
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=342
    bytes=316 033 536  re-xmits=0036837 ( 16.9%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=528
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=524
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=561
    bytes=354 518 528  re-xmits=0040732 ( 16.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=345
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=700
    bytes=384 849 920  re-xmits=0046298 ( 17.5%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=600
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=581
    bytes=393 655 808  re-xmits=0047982 ( 17.7%) slice=0112 -   1
    bytes=434 749 952  re-xmits=0051829 ( 17.3%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=564
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=578
    bytes=446 980 352  re-xmits=0053215 ( 17.3%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=495
    bytes=485 302 272  re-xmits=0057494 ( 17.2%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=871
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=2180
    bytes=509 436 928  re-xmits=0061383 ( 17.5%) slice=0112 -   1
    bytes=550 857 216  re-xmits=0064480 ( 17.0%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=646
    bytes=588 852 992  re-xmits=0068838 ( 17.0%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=617
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=589
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=631
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=621
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=378
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=565
    bytes=621 956 608  re-xmits=0073532 ( 17.2%) slice=0112 -   0
    bytes=667 127 552  re-xmits=0075901 ( 16.5%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=564
    bytes=705 938 688  re-xmits=0079407 ( 16.3%) slice=0112 -   0
    bytes=756 817 152  re-xmits=0080445 ( 15.4%) slice=0112 -   0
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=310
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=517
    bytes=793 671 424  re-xmits=0084059 ( 15.4%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=747
    bytes=826 285 824  re-xmits=0087687 ( 15.4%) slice=0112 -   0
    bytes=851 398 912  re-xmits=0092524 ( 15.8%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=257
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=467
    bytes=892 819 200  re-xmits=0094285 ( 15.3%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=284
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=348
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=465
    bytes=929 184 256  re-xmits=0098962 ( 15.5%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=927
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=1013
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=1094
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=1038
    bytes=939 620 864  re-xmits=0101138 ( 15.6%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=478
    bytes=980 388 864  re-xmits=0103847 ( 15.4%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=651
    bytes=984 465 664  re-xmits=0104473 ( 15.4%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=476
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=781
    bytes=992 619 264  re-xmits=0105114 ( 15.4%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=961
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=995
    bytes=     989 579K re-xmits=0107743 ( 15.4%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=375
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=519
    bytes=  1 028 755K re-xmits=0112584 ( 15.5%) slice=0112 -   1
    bytes=  1 072 230K re-xmits=0116554 ( 15.4%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=790
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=747
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=534
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=507
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=611
    bytes=  1 111 405K re-xmits=0120740 ( 15.4%) slice=0112 -   1
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=789
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=693
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=659
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=683
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=1098
    bytes=  1 118 253K re-xmits=0123502 ( 15.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1315
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1239
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1061
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1011
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=838
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=818
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=843
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=992
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=985
    bytes=  1 131 152K re-xmits=0126727 ( 15.9%) slice=0112 -   0
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=787
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=397
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=488
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=727
    bytes=  1 169 532K re-xmits=0131097 ( 15.9%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=613
    bytes=  1 217 944K re-xmits=0132611 ( 15.4%) slice=0112 -   1
    bytes=  1 268 585K re-xmits=0133203 ( 14.9%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=174
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=476
    bytes=  1 294 224K re-xmits=0134243 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=546
    bytes=  1 299 798K re-xmits=0134530 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=406
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=420
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=502
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=611
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=648
    bytes=  1 331 807K re-xmits=0137782 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=650
    bytes=  1 346 936K re-xmits=0138899 ( 14.6%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=802
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=806
    bytes=  1 387 704K re-xmits=0141898 ( 14.5%) slice=0112 -   1
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=690
    bytes=  1 417 165K re-xmits=0146170 ( 14.6%) slice=0112 -   0
    bytes=  1 462 870K re-xmits=0148104 ( 14.3%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=342
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=576
    bytes=  1 502 205K re-xmits=0151668 ( 14.3%) slice=0112 -   0
    bytes=  1 517 174K re-xmits=0153844 ( 14.4%) slice=0112 -   1
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=585
    bytes=  1 560 172K re-xmits=0155842 ( 14.2%) slice=0112 -   1
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=336
    bytes=  1 608 743K re-xmits=0157092 ( 13.8%) slice=0112 -   1
    bytes=  1 618 935K re-xmits=0157582 ( 13.8%) slice=0112 -   0
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=846
    bytes=  1 659 703K re-xmits=0159921 ( 13.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=911
    bytes=  1 665 436K re-xmits=0160684 ( 13.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=473
    bytes=  1 705 567K re-xmits=0163941 ( 13.6%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=532
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=535
    bytes=  1 718 626K re-xmits=0166400 ( 13.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=747
    bytes=  1 747 131K re-xmits=0168427 ( 13.7%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=918
    bytes=  1 785 033K re-xmits=0172380 ( 13.7%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=517
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=351
    bytes=  1 818 953K re-xmits=0176911 ( 13.8%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=400
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=544
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=135
    bytes=  1 865 613K re-xmits=0179763 ( 13.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=174
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=201
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=165
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=192
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=343
    bytes=  1 905 585K re-xmits=0184726 ( 13.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=392
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=513
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=832
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=731
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=757
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=581
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=517
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=496
    bytes=  1 934 409K re-xmits=0190701 ( 14.0%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=648
    bytes=  1 965 463K re-xmits=0197018 ( 14.2%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=535
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=370
    bytes=  2 003 205K re-xmits=0202271 ( 14.3%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=546
    bytes=  2 004 639K re-xmits=0202846 ( 14.3%) slice=0112 -   1
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=349
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=254
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=165
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=212
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=423
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=629
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=692
    bytes=  2 046 681K re-xmits=0205559 ( 14.2%) slice=0112 -   0
    bytes=  2 090 315K re-xmits=0209120 ( 14.2%) slice=0112 -   0
    bytes=  2 121 369K re-xmits=0215790 ( 14.4%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=738
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=498
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=630
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=626
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=652
    bytes=  2 139 046K re-xmits=0219085 ( 14.5%) slice=0112 -   1
    bytes=  2 180 769K re-xmits=0222653 ( 14.5%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=752
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=405
    bytes=  2 215 326K re-xmits=0226243 ( 14.5%) slice=0112 -   1
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=395
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=434
    bytes=  2 244 310K re-xmits=0230872 ( 14.6%) slice=0112 -   1
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=726
    bytes=  2 256 094K re-xmits=0233386 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=492
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=480
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=720
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=606
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=649
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=662
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=740
    bytes=  2 287 785K re-xmits=0238647 ( 14.8%) slice=0112 -   0
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=467
    bytes=  2 304 188K re-xmits=0239225 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=400
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=394
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=412
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=430
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=620
    bytes=  2 336 516K re-xmits=0243966 ( 14.8%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=913
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=938
    bytes=  2 364 703K re-xmits=0246284 ( 14.8%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=688
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=634
    bytes=  2 387 316K re-xmits=0248041 ( 14.7%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=587
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=715
    bytes=  2 424 422K re-xmits=0252167 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=322
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=363
    bytes=  2 461 368K re-xmits=0255875 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=738
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=543
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=573
    bytes=  2 475 859K re-xmits=0258024 ( 14.8%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=474
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=504
    bytes=  2 516 627K re-xmits=0261528 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=725
    bytes=  2 521 405K re-xmits=0262000 ( 14.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=477
    bytes=  2 562 014K re-xmits=0264860 ( 14.6%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=854
    bytes=  2 568 861K re-xmits=0265795 ( 14.7%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=641
    bytes=  2 613 133K re-xmits=0268208 ( 14.5%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=581
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=747
    bytes=  2 624 917K re-xmits=0269700 ( 14.6%) slice=0112 -   1
    bytes=  2 671 578K re-xmits=0271900 ( 14.4%) slice=0112 -   1
    bytes=  2 721 901K re-xmits=0272779 ( 14.2%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=125
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=163
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=208
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=123
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=152
    bytes=  2 771 109K re-xmits=0273976 ( 14.0%) slice=0112 -   0
    bytes=  2 820 476K re-xmits=0275173 ( 13.8%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=200
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=174
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=151
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=192
    bytes=  2 829 394K re-xmits=0275559 ( 13.8%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=240
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=266
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=281
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=294
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=294
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=305
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=311
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=395
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=389
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=438
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=528
    bytes=  2 864 589K re-xmits=0280165 ( 13.9%) slice=0112 -   0
    bytes=  2 900 420K re-xmits=0285084 ( 13.9%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=150
    bytes=  2 949 787K re-xmits=0286144 ( 13.7%) slice=0112 -   0
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=176
    Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=183
    bytes=  2 998 199K re-xmits=0287367 ( 13.6%) slice=0112 -   0
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=231
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=259
    .
    .
    .
    .
    .
    .
    Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=205
    Dropping client #0 because of timeout
    Disconnecting #0 (10.200.20.7)
    bytes=  3 026 227K re-xmits=0289381 ( 13.5%) slice=0112 -   1
    Timeout notAnswered=[1] notReady=[1] nrAns=0 nrRead=0 nrPart=1 avg=205
    .
    .
    .
    .
    .
    Timeout notAnswered=[1] notReady=[1] nrAns=0 nrRead=0 nrPart=1 avg=205
    Dropping client #1 because of timeout
    Disconnecting #1 (10.200.20.6)
    bytes=  3 026 227K re-xmits=0289381 ( 13.5%) slice=0112 -   1
    Transfer complete.
    
    Udp-sender 20120424
    Using mcast address 234.200.0.20
    UDP sender for /images/Image_Vorlage_24-03-2020/d1p2.img at 10.200.0.20 on ens192 
    Broadcasting control to 224.0.0.1
    

    As you can see, the task is running into a lot of timeouts.
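
To put numbers on the timeouts, the udp-sender log can be summarized with a small script. This is only a minimal sketch: the function name `summarize_udpcast_log` and the two regular expressions are my own assumptions, written against the line formats visible in the log above and not taken from FOG or udpcast. It counts how often each client index appears in `notAnswered` and collects the retransmit percentages.

```python
import re

# Assumed patterns, derived only from the log lines shown above.
TIMEOUT_RE = re.compile(r"Timeout notAnswered=\[([\d,]*)\]")
REXMIT_RE = re.compile(r"re-xmits=\d+\s*\(\s*([\d.]+)%\)")


def summarize_udpcast_log(lines):
    """Return (timeouts per client index, list of retransmit percentages)."""
    timeouts = {}
    rexmit_pcts = []
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            # notAnswered=[0,1] lists the indices of clients that did not answer.
            for idx in filter(None, match.group(1).split(",")):
                timeouts[int(idx)] = timeouts.get(int(idx), 0) + 1
            continue
        match = REXMIT_RE.search(line)
        if match:
            rexmit_pcts.append(float(match.group(1)))
    return timeouts, rexmit_pcts


# A few lines copied from the log above.
sample = [
    "Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=4868",
    "bytes=  4 892 160  re-xmits=0000114 (  3.3%) slice=0112 -   0",
    "bytes= 40 768 000  re-xmits=0004259 ( 15.2%) slice=0112 -   1",
    "Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=782",
    "Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=425",
]
timeouts, rexmit_pcts = summarize_udpcast_log(sample)
print("timeouts per client:", timeouts)
print("retransmit percentages:", rexmit_pcts)
```

Run against the full log file, a roughly even timeout count across both clients would point toward the network path and away from a single faulty machine.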

    My first thought was that this is a network problem, namely a wrong multicast configuration on the switches. IGMP snooping is enabled on every switch, and we configured one switch as the IGMP proxy. This configuration is mostly identical to a working multicast configuration at another school.

    At this point, I am at my wits' end.

    Best regards

    Dankau



  • @george1421 We have 3 switches between the computers and the FOG-server.

    The problem is resolved!

    Our Solution:
    The switch's web GUI showed the IGMP snooping options as enabled, but in the config file some of the required options were disabled. After enabling them in the file and uploading it, we tried another multicast task.

    The timeouts still appear in multicast.log.udpcast.x, but the task completes successfully.



  • Moderator

    @Dankau So how many hops (network switches) away from the FOG server is the target computer?



  • @george1421 We are using D-Link switches only.

    After testing multicast in the same subnet we got the same result, meaning we are still running into timeouts.
    I can confirm, after checking multiple times, that IGMP snooping is enabled on every switch and for the needed VLANs.

    Sadly, I can't test multicast on the same switch because every port on our 10 GbE switch is in use.


  • Moderator

    @Dankau Which manufacturer is your switching infrastructure from? Cisco Catalyst series?

    I remember a post from not too long ago saying that something in Cisco land needed to be turned on. I also found someone with almost the same initial post as yours, but there was no resolution.


  • Moderator

    @Dankau said in Deploying via Multicast aborts midway:

    Multicast is not working in the same subnet.

    You are getting the same timeout issue? Did the switches between the FOG server and the test computers have IGMP snooping enabled on the VLAN where the FOG server is connected? Ideally, the initial test would have the target computers on the same switch as the FOG server, as a baseline.

    The rest of your network is fine from a bottleneck standpoint.



  • Thank you for your fast reply.

    I do think that one option in our configuration is wrong, but I just don't know which one. IGMP snooping is configured on all switches and the VLANs are correctly assigned. On our core switch (also our router) we have the IGMP proxy enabled for the right VLANs. Multicast shouldn't just stop in the middle of the roll-out. All of the computers are new and have decent hardware.

    1.) We are running a virtual FOG server on VMware. The FOG server has 4 cores and 16 GB RAM, and the network card runs as VMXNET3
    2.) As of now, we only had maybe 1 computer talking to our FOG server, even with that PC
    3.) Our server has four 1 GbE network cards installed as a trunk
    4.) Our server is connected to a 10 GbE switch. That switch is directly connected over a 20 GbE trunk to our core switch. The classroom is connected to an access switch, which is connected over a 1 GbE link to our core switch. The infrastructure is pretty new
    5.) We tried to multicast to a complete room with 20 computers. After that didn't work, we made a group with 2 computers, but we couldn't get it to work either.
    6.) We haven't tried that yet. We will put 2 computers in the same VLAN as our FOG server and see if that works.
    7.) Electrical distance should not be an issue.

    I hope you can work with these answers.

    EDIT: We just tried to have a Multicast-Session in the same subnet as the FOG-Server. The result was the same. Multicast is not working in the same subnet.


  • Moderator

    I can tell you that multicasting is a beast unto its own. 90% of the problems are infrastructure and configuration, with the rest being something else (mostly not related to FOG).

    IGMP Snooping needs to be enabled on all network switches and assigned on the VLANs where the multicasts would occur. If your target computers are on different subnets than your FOG server then you will need to enable the igmp proxy service on your VLAN routers. I would suggest that you not try to multicast across a WAN link.

    When multicasting to multiple systems, the slowest system controls the speed of imaging. The image is sent out one block at a time. All clients must acknowledge a block before the FOG server (or any multicast imaging server) sends out the next one. If not all of the clients respond with an ack, the entire block must be retransmitted to all computers. If a client disappears, after a time it will be dropped by the multicast server and the rest will continue to image.
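
The behaviour described above can be illustrated with a toy simulation. This is only a rough sketch of the idea (resend a block until every remaining client has acknowledged it, drop a client that stays silent too long); it is not udpcast's actual protocol, and the function name `simulate_multicast`, the per-client `loss_rates` and the `max_retries` drop rule are hypothetical choices made for illustration.

```python
import random


def simulate_multicast(num_blocks, loss_rates, max_retries=50, seed=0):
    """Toy model: each block is resent until all active clients have acked it.

    loss_rates[i] is the assumed probability that client i fails to ack a
    given transmission. A client that misses max_retries transmissions of
    the same block is dropped. Returns (total transmissions, dropped clients).
    """
    rng = random.Random(seed)
    active = set(range(len(loss_rates)))
    dropped = []
    transmissions = 0
    for _ in range(num_blocks):
        if not active:
            break  # nobody left to send to
        pending = set(active)
        misses = {client: 0 for client in active}
        while pending:
            transmissions += 1  # one (re)transmission of the block to the group
            for client in sorted(pending):
                if rng.random() >= loss_rates[client]:
                    pending.discard(client)  # client acked the block
                else:
                    misses[client] += 1
                    if misses[client] >= max_retries:
                        # Client never answered: drop it and carry on.
                        pending.discard(client)
                        active.discard(client)
                        dropped.append(client)
    return transmissions, dropped


# Two healthy clients: every block goes out exactly once.
print(simulate_multicast(10, [0.0, 0.0]))
# One client never answers: it stalls the first block until it is dropped.
print(simulate_multicast(10, [0.0, 1.0], max_retries=3))
```

In the second call the silent client costs extra transmissions only until it is dropped, after which the remaining client proceeds at full speed. That mirrors the "Dropping client ... because of timeout" lines in the log.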

    The number of retransmits makes me think you have a networking issue.

    Will you tell us a bit more about this fog server and your environment?

    1. Describe the fog server’s hardware
    2. How many computers that have the fog client installed are talking to this FOG server?
    3. How is the FOG server connected to your network? Is it via a single 1GbE link, 10GbE link, etc.?
    4. What is your switch infrastructure?
    5. When you get these timeouts how many target computers are you trying to image at one time?
    6. If you move (as a test) computers to the same subnet as the FOG server, do you have similar results when you send out a multicast?
    7. Does electrical distance from the FOG server impact your ability to multicast (i.e. multicasting works in the building where the fog server is, but not in the building at the far end of your campus?)
