Deploying via Multicast aborts midway
-
Hello,
I have a Problem regarding Multicast.
We are trying to change our imaging-system from Timago to FOG, because we want to save the license costs.
We are an IT-service-provider at various schools. This system is already running at another school.
We have around 300 clients in different groups on which the image must be distributed.
After testing the Roll-out of a few clients via Unicast, which worked perfectly fine, we tested multicast. Everything seemed normal, until the clients just stopped downloading.
The multicast.log just shows, that we have an Task running. We stopped the task after a while, because nothing was happening.
[03-24-20 10:37:38 am] * No new tasks found [03-24-20 10:37:48 am] | Task ID: 21 Name: Multi-Cast Task - Test is new [03-24-20 10:37:48 am] | Task ID: 21 Name: Multi-Cast Task - Test image file found, file: /images/Image_Vorlage_24-03-2020 [03-24-20 10:37:48 am] | Task ID: 21 Name: Multi-Cast Task - Test 2 clients found [03-24-20 10:37:48 am] | Task ID: 21 Name: Multi-Cast Task - Test sending on base port 53536 [03-24-20 10:37:48 am] | Command: /usr/local/sbin/udp-sender --interface ens192 --min-receivers 2 --max-wait 600 --portbase 53536 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/Image_Vorlage_24-03-2020/d1p1.img;/usr/local/sbin/udp-sender --interface ens192 --min-receivers 2 --max-wait 10 --portbase 53536 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/Image_Vorlage_24-03-2020/d1p2.img; [03-24-20 10:37:48 am] | Task ID: 21 Name: Multi-Cast Task - Test has started [03-24-20 10:37:59 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:38:09 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:38:19 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:38:29 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:38:39 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:38:49 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:38:59 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:39:09 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:39:19 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:39:29 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:39:39 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:39:49 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:39:59 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:40:09 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:40:19 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:40:29 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:40:39 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:40:49 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:40:59 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:41:09 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:41:19 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:41:29 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:41:39 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:41:49 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:41:59 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:42:10 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:42:20 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:42:30 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:42:40 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:42:50 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:43:00 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:43:10 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:43:20 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:43:30 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:43:40 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:43:50 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:44:00 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:44:10 am] | Task ID: 21 Name: Multi-Cast Task - Test is already running with pid: 25687 [03-24-20 10:44:20am] * No new tasks found
The other multicast.log shows this:
[03-24-20 10:37:48 am] Task startedUdp-sender 20120424 Using mcast address 234.200.0.20 UDP sender for /images/Image_Vorlage_24-03-2020/d1p1.img at 10.200.0.20 on ens192 Broadcasting control to 224.0.0.1 New connection from 10.200.20.7 (#0) 00000009 New connection from 10.200.20.6 (#1) 00000009 Starting transfer: 00000009 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=4868 bytes= 4 892 160 re-xmits=0000114 ( 3.3%) slice=0112 - 0 bytes= 40 768 000 re-xmits=0004259 ( 15.2%) slice=0112 - 1 bytes= 64 576 512 re-xmits=0008636 ( 19.4%) slice=0112 - 0 bytes=104 855 296 re-xmits=0012732 ( 17.6%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=782 bytes=126 706 944 re-xmits=0015950 ( 18.3%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=425 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=760 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=953 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=765 bytes=159 484 416 re-xmits=0020762 ( 18.9%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=742 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=433 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=522 bytes=196 012 544 re-xmits=0025253 ( 18.7%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=327 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=461 bytes=220 799 488 re-xmits=0026092 ( 17.2%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=178 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=201 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=249 bytes=236 128 256 re-xmits=0026403 ( 16.2%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=230 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=265 bytes=242 977 280 re-xmits=0026698 ( 15.9%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=878 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=783 bytes=263 035 136 re-xmits=0029636 ( 16.4%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=531 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=891 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=851 bytes=281 625 344 re-xmits=0031800 ( 16.4%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=554 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=394 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=342 bytes=316 033 536 re-xmits=0036837 ( 16.9%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=528 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=524 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=561 bytes=354 518 528 re-xmits=0040732 ( 16.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=345 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=700 bytes=384 849 920 re-xmits=0046298 ( 17.5%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=600 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=581 bytes=393 655 808 re-xmits=0047982 ( 17.7%) slice=0112 - 1 bytes=434 749 952 re-xmits=0051829 ( 17.3%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=564 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=578 bytes=446 980 352 re-xmits=0053215 ( 17.3%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=495 bytes=485 302 272 re-xmits=0057494 ( 17.2%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=871 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=2180 bytes=509 436 928 re-xmits=0061383 ( 17.5%) slice=0112 - 1 bytes=550 857 216 re-xmits=0064480 ( 17.0%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=646 bytes=588 852 992 re-xmits=0068838 ( 17.0%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=617 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=589 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=631 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=621 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=378 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=565 bytes=621 956 608 re-xmits=0073532 ( 17.2%) slice=0112 - 0 bytes=667 127 552 re-xmits=0075901 ( 16.5%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=564 bytes=705 938 688 re-xmits=0079407 ( 16.3%) slice=0112 - 0 bytes=756 817 152 re-xmits=0080445 ( 15.4%) slice=0112 - 0 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=310 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=517 bytes=793 671 424 re-xmits=0084059 ( 15.4%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=747 bytes=826 285 824 re-xmits=0087687 ( 15.4%) slice=0112 - 0 bytes=851 398 912 re-xmits=0092524 ( 15.8%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=257 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=467 bytes=892 819 200 re-xmits=0094285 ( 15.3%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=284 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=348 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=465 bytes=929 184 256 re-xmits=0098962 ( 15.5%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=927 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=1013 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=1094 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=1038 bytes=939 620 864 re-xmits=0101138 ( 15.6%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=478 bytes=980 388 864 re-xmits=0103847 ( 15.4%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=651 bytes=984 465 664 re-xmits=0104473 ( 15.4%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=476 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=781 bytes=992 619 264 re-xmits=0105114 ( 15.4%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=961 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=995 bytes= 989 579K re-xmits=0107743 ( 15.4%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=375 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=519 bytes= 1 028 755K re-xmits=0112584 ( 15.5%) slice=0112 - 1 bytes= 1 072 230K re-xmits=0116554 ( 15.4%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=790 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=747 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=534 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=507 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=611 bytes= 1 111 405K re-xmits=0120740 ( 15.4%) slice=0112 - 1 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=789 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=693 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=659 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=683 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=1098 bytes= 1 118 253K re-xmits=0123502 ( 15.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1315 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1239 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1061 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=1011 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=838 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=818 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=843 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=992 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=985 bytes= 1 131 152K re-xmits=0126727 ( 15.9%) slice=0112 - 0 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=787 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=397 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=488 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=727 bytes= 1 169 532K re-xmits=0131097 ( 15.9%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=613 bytes= 1 217 944K re-xmits=0132611 ( 15.4%) slice=0112 - 1 bytes= 1 268 585K re-xmits=0133203 ( 14.9%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=174 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=476 bytes= 1 294 224K re-xmits=0134243 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=546 bytes= 1 299 798K re-xmits=0134530 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=406 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=420 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=502 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=611 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=648 bytes= 1 331 807K re-xmits=0137782 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=650 bytes= 1 346 936K re-xmits=0138899 ( 14.6%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=802 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=806 bytes= 1 387 704K re-xmits=0141898 ( 14.5%) slice=0112 - 1 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=690 bytes= 1 417 165K re-xmits=0146170 ( 14.6%) slice=0112 - 0 bytes= 1 462 870K re-xmits=0148104 ( 14.3%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=342 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=576 bytes= 1 502 205K re-xmits=0151668 ( 14.3%) slice=0112 - 0 bytes= 1 517 174K re-xmits=0153844 ( 14.4%) slice=0112 - 1 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=585 bytes= 1 560 172K re-xmits=0155842 ( 14.2%) slice=0112 - 1 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=336 bytes= 1 608 743K re-xmits=0157092 ( 13.8%) slice=0112 - 1 bytes= 1 618 935K re-xmits=0157582 ( 13.8%) slice=0112 - 0 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=846 bytes= 1 659 703K re-xmits=0159921 ( 13.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=911 bytes= 1 665 436K re-xmits=0160684 ( 13.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=473 bytes= 1 705 567K re-xmits=0163941 ( 13.6%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=532 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=535 bytes= 1 718 626K re-xmits=0166400 ( 13.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=747 bytes= 1 747 131K re-xmits=0168427 ( 13.7%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=918 bytes= 1 785 033K re-xmits=0172380 ( 13.7%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=517 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=351 bytes= 1 818 953K re-xmits=0176911 ( 13.8%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=400 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=544 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=135 bytes= 1 865 613K re-xmits=0179763 ( 13.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=174 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=201 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=165 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=192 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=343 bytes= 1 905 585K re-xmits=0184726 ( 13.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=392 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=513 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=832 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=731 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=757 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=581 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=517 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=496 bytes= 1 934 409K re-xmits=0190701 ( 14.0%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=648 bytes= 1 965 463K re-xmits=0197018 ( 14.2%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=535 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=370 bytes= 2 003 205K re-xmits=0202271 ( 14.3%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=546 bytes= 2 004 639K re-xmits=0202846 ( 14.3%) slice=0112 - 1 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=349 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=254 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=165 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=212 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=423 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=629 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=692 bytes= 2 046 681K re-xmits=0205559 ( 14.2%) slice=0112 - 0 bytes= 2 090 315K re-xmits=0209120 ( 14.2%) slice=0112 - 0 bytes= 2 121 369K re-xmits=0215790 ( 14.4%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=738 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=498 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=630 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=626 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=652 bytes= 2 139 046K re-xmits=0219085 ( 14.5%) slice=0112 - 1 bytes= 2 180 769K re-xmits=0222653 ( 14.5%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=752 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=405 bytes= 2 215 326K re-xmits=0226243 ( 14.5%) slice=0112 - 1 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=395 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=434 bytes= 2 244 310K re-xmits=0230872 ( 14.6%) slice=0112 - 1 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=726 bytes= 2 256 094K re-xmits=0233386 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=492 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=480 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=720 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=606 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=649 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=662 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=740 bytes= 2 287 785K re-xmits=0238647 ( 14.8%) slice=0112 - 0 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=467 bytes= 2 304 188K re-xmits=0239225 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=400 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=394 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=412 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=430 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=620 bytes= 2 336 516K re-xmits=0243966 ( 14.8%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=913 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=938 bytes= 2 364 703K re-xmits=0246284 ( 14.8%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=688 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=634 bytes= 2 387 316K re-xmits=0248041 ( 14.7%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=587 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=715 bytes= 2 424 422K re-xmits=0252167 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=322 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=363 bytes= 2 461 368K re-xmits=0255875 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=738 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=543 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=573 bytes= 2 475 859K re-xmits=0258024 ( 14.8%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=474 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=504 bytes= 2 516 627K re-xmits=0261528 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=725 bytes= 2 521 405K re-xmits=0262000 ( 14.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=477 bytes= 2 562 014K re-xmits=0264860 ( 14.6%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=854 bytes= 2 568 861K re-xmits=0265795 ( 14.7%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=641 bytes= 2 613 133K re-xmits=0268208 ( 14.5%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=581 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=747 bytes= 2 624 917K re-xmits=0269700 ( 14.6%) slice=0112 - 1 bytes= 2 671 578K re-xmits=0271900 ( 14.4%) slice=0112 - 1 bytes= 2 721 901K re-xmits=0272779 ( 14.2%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=125 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=163 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=208 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=123 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=152 bytes= 2 771 109K re-xmits=0273976 ( 14.0%) slice=0112 - 0 bytes= 2 820 476K re-xmits=0275173 ( 13.8%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=200 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=174 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=151 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=192 bytes= 2 829 394K re-xmits=0275559 ( 13.8%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=240 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=266 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=281 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=294 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=294 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=305 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=311 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=395 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=389 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=438 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=528 bytes= 2 864 589K re-xmits=0280165 ( 13.9%) slice=0112 - 0 bytes= 2 900 420K re-xmits=0285084 ( 13.9%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=150 bytes= 2 949 787K re-xmits=0286144 ( 13.7%) slice=0112 - 0 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=176 Timeout notAnswered=[1] notReady=[1] nrAns=1 nrRead=1 nrPart=2 avg=183 bytes= 2 998 199K re-xmits=0287367 ( 13.6%) slice=0112 - 0 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=231 Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=259 . . . . . . Timeout notAnswered=[0] notReady=[0] nrAns=1 nrRead=1 nrPart=2 avg=205 Dropping client #0 because of timeout Disconnecting #0 (10.200.20.7) bytes= 3 026 227K re-xmits=0289381 ( 13.5%) slice=0112 - 1 Timeout notAnswered=[1] notReady=[1] nrAns=0 nrRead=0 nrPart=1 avg=205 . . . . . Timeout notAnswered=[1] notReady=[1] nrAns=0 nrRead=0 nrPart=1 avg=205 Dropping client #1 because of timeout Disconnecting #1 (10.200.20.6) bytes= 3 026 227K re-xmits=0289381 ( 13.5%) slice=0112 - 1 Transfer complete. Udp-sender 20120424 Using mcast address 234.200.0.20 UDP sender for /images/Image_Vorlage_24-03-2020/d1p2.img at 10.200.0.20 on ens192 Broadcasting control to 224.0.0.1
As you can see, the Task is running into a lot of Timeouts.
The first thing I thought of was that this is a network problem, a wrong multicast configuration on the switches. IGMP snooping is on every switch enabled, we configured one switch as the IGMP Proxy. This configuration is mostly identical to a working multicast-configuration of another school.
As of now, i am at end of my wisdom.
Best regards
Dankau
-
@george1421 We have 3 switches between the computers and the FOG-server.
The problem is resolved!
Our Solution:
The WebGUI from the switch showed the options in the IGMP Snooping as enabled, in the config-file some of the needed options were disabled. After enabling them in the file and uploading that we tried another mutlicast task.The timeouts still are in the multicast.log.udpcast.x, but the task completes successfully.
-
I can tell you that multicasting is a beast onto it’s one. 90% of the problems are infrastructure and configuration with the rest something else (most not related to FOG).
IGMP Snooping needs to be enabled on all network switches and assigned on the VLANs where the multicasts would occur. If your target computers are on different subnets than your FOG server then you will need to enable the igmp proxy service on your VLAN routers. I would suggest that you not try to multicast across a WAN link.
When multicasting multiple systems the slowest system controls the speed of imaging. The target system image is sent out one block at a time. All clients must acknowledge the block before the next block is sent out by the FOG server (or any multicast imaging server). If not all of the clients respond with an ack to the FOG server the entire block must be restransmitted to all computers. If a client disappears, after a time it will be dropped from the multicast server and the rest will continue to image.
With the number of retransmits it makes me think you have a networking issue.
Will you tell us a bit more about this fog server and your environment?
- Describe the fog server’s hardware
- How many computers that have the fog client installed is talking to this FOG server?
- How is the FOG server connected to your network? Is it via a single 1GbE link, 10GbE link, etc??
- What is your switch infrastructure?
- When you get these timeouts how many target computers are you trying to image at one time?
- If you move (as a test) computers to the same subnet as the FOG server, do you have similar results when you send out a multicast?
- Does electrical distance from the FOG server impact your ability to multicast (i.e. multicasting works in the building where the fog server is, but not in the building at the far end of your campus?)
-
Thank you for your fast reply.
I do think that there is one option in our configuration wrong, but i just don´t know where. IGMP Snooping is configured on all switches and the VLANs are correctly assigned. On our Core-Switch (also our router) we have the IGMP Proxy enabled for the right VLANs. The multicast shouldn´t just stop in the middle of the Roll-Out. All of the computers are new and they have a decent hardware.
1.) We are running a virtual FOG-server on VMWare, the FOG-server has 4 cores and 16 GB RAM, the network-card runs as VMXnet3
2.) As of now, we only had maybe 1 computer talking to our FOG-server, even with that PC
3.) Our server has 4 1GbE network-cards as trunk installed
4.) Our server is connected to a 10 GbE-Switch. That switch is directly connected over a 20 GbE-trunk to our Core-Switch. The classroom ist connected to an access-swtich, which is connected over a 1 GbE-connection to our Core-switch. The infrastructure is pretty new
5.) We tried to multicast a complete room with 20 computers. After that didn´t work, we made a group with 2 computers, but we couldn´t get it to work either.
6.) We didn´t try that yet, we will set 2 computers in the same VLAN as out FOG-Server and see, if that works.
7.) Electrical distance should not be an issue.I hope you can work with these answers.
EDIT: We just tried to have a Multicast-Session in the same subnet as the FOG-Server. The result was the same. Multicast is not working in the same subnet.
-
@Dankau said in Deploying via Multicast aborts midway:
Multicast is not working in the same subnet.
You are getting the same timeout issue? The switches between the FOG server and test computers had igmp snooping enabled on the VLAN where the FOG server is connected? Ideally for the initial test to have the target computers on the same switch as the FOG server would be a baseline.
The rest of your network is fine from a bottleneck standpoint.
-
@Dankau What switch manufacture is your networking infrastructure? Cisco Catalyst series?
I remember a post from not to long ago that something in cisco land needed to be turned on. I also found someone with almost the same initial post as you have but there was no resolution.
-
@george1421 We are using D-Link switches only.
After testing the multicast in the same subnet we got the same result, that means we are running into timeouts.
I can confirm, after checking multiple times, that IGMP Snooping is enabled on every switch and for the needed VLANs.Sadly, I can´t test multicast on the same switch beacuse every port on our 10 GbE-switch is used.
-
@Dankau So how many hops (network switches) is the target computer away from the FOG server.
-
@george1421 We have 3 switches between the computers and the FOG-server.
The problem is resolved!
Our Solution:
The WebGUI from the switch showed the options in the IGMP Snooping as enabled, in the config-file some of the needed options were disabled. After enabling them in the file and uploading that we tried another mutlicast task.The timeouts still are in the multicast.log.udpcast.x, but the task completes successfully.