• Recent
  • Unsolved
  • Tags
  • Popular
  • Users
  • Groups
  • Search
  • Register
  • Login
  • Recent
  • Unsolved
  • Tags
  • Popular
  • Users
  • Groups
  • Search
  • Register
  • Login

FOG Multicast issue - stuck on starting to restore image - version 7547

Scheduled Pinned Locked Moved Solved
FOG Problems
4
57
19.7k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • A
    arainero @Sebastian Roth
    last edited by Jul 9, 2016, 5:25 PM

    @Sebastian-Roth I know, I turned into one of those people. But as far as I can tell nothing on the network has changed. It is a very basic setup. Internet comes in, hits the pfsense router which goes to a layer 2 meraki switch (which is trunked to another identical one) and out to the computers.

    The only other device on the network is a wireless AP and a few gaming consoles. Other than that every computer has a direct ethernet cord to it. All of these devices were here before when multicast was working.

    It truly sounds network related somehow, but I have no idea what could be causing it at this point.

    On Monday I plan to hook the router (for the pxe boot routing), 2 pc’s for multicasting, and an admin computer (for the web GUI access) to a dumb switch and see what happens when I cut everything else out. That should rule out any network issues, right?

    W 1 Reply Last reply Jul 9, 2016, 5:43 PM Reply Quote 0
    • W
      Wayne Workman @arainero
      last edited by Wayne Workman Jul 9, 2016, 11:45 AM Jul 9, 2016, 5:43 PM

      @arainero wait a second there, the multicast is going through the router? That could be the problem, it’s probably blocking the needed port and/or UDP traffic.

      Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
      Daily Clean Installation Results:
      https://fogtesting.fogproject.us/
      FOG Reporting:
      https://fog-external-reporting-results.fogproject.us/

      A 1 Reply Last reply Jul 9, 2016, 7:17 PM Reply Quote 0
      • A
        arainero @Wayne Workman
        last edited by Jul 9, 2016, 7:17 PM

        @Wayne-Workman The multicast doesn’t go through the router, it is just the edge device and I was just giving a brief rundown of the network and how basic it was.

        As far as I know the router can’t block interlan traffic. Also, no changes have been made to the router either (I know how much you guys love hearing that).

        W 1 Reply Last reply Jul 9, 2016, 10:17 PM Reply Quote 0
        • W
          Wayne Workman @arainero
          last edited by Wayne Workman Jul 9, 2016, 4:17 PM Jul 9, 2016, 10:17 PM

          @arainero

          What side of the router is the fog server on? You didn’t include that part, you just said three computers on a dumb switch. Where is the fog server?

          Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
          Daily Clean Installation Results:
          https://fogtesting.fogproject.us/
          FOG Reporting:
          https://fog-external-reporting-results.fogproject.us/

          A 1 Reply Last reply Jul 10, 2016, 4:29 PM Reply Quote 0
          • S
            Sebastian Roth Moderator
            last edited by Sebastian Roth Jul 10, 2016, 3:57 AM Jul 10, 2016, 9:55 AM

            @arainero Thanks for the new logs, tests and explanations. I am pretty sure we are headed the right way with that. Doing the test with a dump switch and just the FOG server and two clients on it is definitely a good idea - although your other setup also sounds quite simple. Still give it a try with that dump switch I’d say. A couple of things that got my attention:

            • The timestamps in the last log output you posted are in reverse?!? That does not make any sense to me.
            • two meraki switches being trunked (was this setup always like that? which trunking protocol is being used?) - I’ve never done multicasting over a trunked link
            • have you ever checked on your FOG server if those udp-sender processes are actually started? After checking the log also run ps ax | grep udp
            • AFAIK there should be no difference between running the command by hand and it being run by FOG when scheduling a task. But please give the following steps a try to see if it makes any difference. Schedule a multicast task for two of your clients plus unicast debug deploy tasks for another two clients. Make sure the multicast clients do not boot up (disconnect from the LAN so they don’t wake up) and start the two unicast clients. When they are up run udp-receiver --nokbd --ttl 32 --mcast-rdv-address 192.168.1.3 | gunzip > /dev/sda1

            Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

            Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

            W A 3 Replies Last reply Jul 10, 2016, 4:18 PM Reply Quote 0
            • W
              Wayne Workman @Sebastian Roth
              last edited by Jul 10, 2016, 4:18 PM

              @Sebastian-Roth said in FOG Multicast issue - stuck on starting to restore image - version 7547:

              The timestamps in the last log output you posted are in reverse?!?

              In Fog’s web interface, there’s a check box to reverse the logs, that way the newest ones appear at the top. I feel it should be default, because with a large log you can’t always get to the bottom of the file by using the web interface to show 1000 lines.

              Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
              Daily Clean Installation Results:
              https://fogtesting.fogproject.us/
              FOG Reporting:
              https://fog-external-reporting-results.fogproject.us/

              1 Reply Last reply Reply Quote 0
              • A
                arainero @Sebastian Roth
                last edited by Jul 10, 2016, 4:27 PM

                @Sebastian-Roth

                Could the time stamps be like that since I had the log sorted from top down?

                Trunking might have been the wrong term. They are simply connected to each other through SFP ports with twinax cable. Like plugging a switch into another switch with ethernet.

                The udp sender appears to be in working order. This is what I get after starting a multicast session:

                [root@fogserv ~]# ps ax | grep udp
                28869 ?        S      0:00 sh -c cat /images/6_16_16/d1p1.img | /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --mcast-data-address 232.168.1.0 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/6_16_16/d1p2.img | /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --mcast-data-address 232.168.1.0 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/6_16_16/d2p1.img | /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --mcast-data-address 232.168.1.0 --full-duplex --ttl 32 --nokbd --nopointopoint;
                28871 ?        S      0:00 /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --mcast-data-address 232.168.1.0 --full-duplex --ttl 32 --nokbd --nopointopoint
                28881 pts/1    S+     0:00 grep udp
                

                Doing the debug test seems to have worked.
                alt text

                One thing I noticed that might be odd is that when I went to the task management page and to active multicast tasks it said 4 hosts. There are only 2 members in the group. Is this indicative of anything wrong?

                alt text

                T 1 Reply Last reply Jul 11, 2016, 9:55 AM Reply Quote 0
                • A
                  arainero @Wayne Workman
                  last edited by Jul 10, 2016, 4:29 PM

                  @Wayne-Workman The fog server is on the LAN along with the client computers. The router is the edge device and handles the DHCP for the network.

                  1 Reply Last reply Reply Quote 0
                  • S
                    Sebastian Roth Moderator
                    last edited by Jul 11, 2016, 9:45 AM

                    @arainero Do I get this right? You did as I suggested: started a multicast using the web gui and started the multicast receive on two clients by hand. That worked!? So why wouldn’t it work when the other two clients run into the multicast session???

                    Could you please do this same test again but remove the --mcast-data-address 232.168.1.0 setting. This should not be necessary I reckon. After that give it another try with a plain multicast session. But first make sure that you don’t have any old udp-sender processes running on the FOG server (ps ax | grep udp and killall udp-sender in case there are orphaned processes around) and clear the (multicast) task lists in the web gui!

                    Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

                    Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

                    1 Reply Last reply Reply Quote 0
                    • T
                      Tom Elliott @arainero
                      last edited by Jul 11, 2016, 9:55 AM

                      @arainero I would think the reasoning for the maxClients to show up as 4 is because you’ve rebooted/reran the same multicast tasking for the host at least 1 per each host and possibly more times on each (maybe only one host) by hand?

                      Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG! Get in contact with me (chat bubble in the top right corner) if you want to join in.

                      Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

                      Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

                      1 Reply Last reply Reply Quote 0
                      • A
                        arainero @Sebastian Roth
                        last edited by Jul 11, 2016, 5:16 PM

                        @Sebastian-Roth Once I removed the --mcast-data-address 232.168.1.0 from the multicast settings page and attempted the manual debug multicast it didn’t work. I made sure no udp process was running in the background either. For some reason a mcast address is needed now for these tests. (This was never necessary when multicasting used to work)

                        alt text

                        1 Reply Last reply Reply Quote 0
                        • S
                          Sebastian Roth Moderator
                          last edited by Jul 12, 2016, 6:18 AM

                          @arainero Why were those different IPs (.109 and .110) than last time (.127 and .128)? Did you use different clients for the test or do your clients change IP address on every reboot because they get it from a dynamic pool?

                          Have you tried this on just the simple setup with a dump switch yet?

                          Could you please run the same test again but use --mcast-data-address 232.168.1.4 for example. Just want to see what exactly this is about. Not making sense to me that it does receive the data on 232.168.1.0 but not on 232.168.1.3.

                          Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

                          Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

                          A 1 Reply Last reply Jul 18, 2016, 3:32 PM Reply Quote 0
                          • A
                            arainero @Sebastian Roth
                            last edited by Jul 18, 2016, 3:32 PM

                            @Sebastian-Roth

                            The IP’s were different because they were different computers. The computers are all identical and are all connected to the same switch. Nothing is unique about them.

                            We just tried the test on a dumb switch, the multicast failed at the same spot as usual. This is what the multcast log shows.

                            [07-18-16 11:20:15 am] | CMD: cat /images/6_16_16/d1p1.img | /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/6_16_16/d1p2.img | /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/6_16_16/d2p1.img | /usr/local/sbin/udp-sender --interface eth0 --min-receivers 2 --max-wait 180 --full-duplex --ttl 32 --nokbd --nopointopoint;
                            [07-18-16 11:20:15 am] | Task (1) Multi-Cast Task sending on base port: 50362
                            [07-18-16 11:20:15 am] | Task (1) 2 client(s) found.
                            [07-18-16 11:20:15 am] | Task (1) /images/6_16_16 image file found.
                            [07-18-16 11:20:15 am] | Task (1) Multi-Cast Task has been cleaned.
                            

                            Using --mcast-data-address 232.168.1.4 worked when testing manually, like the others.

                            T 1 Reply Last reply Jul 18, 2016, 3:42 PM Reply Quote 0
                            • T
                              Tom Elliott @arainero
                              last edited by Jul 18, 2016, 3:42 PM

                              @arainero Then can you add your 232.168.1.4 address to the FOG Settings?

                              FOG Configuration Page->FOG Settings->Multicast Settings->FOG_MULTICAST_ADDRESS.

                              If you have to specify the port, also under the same area I think (FOG_MULTICAST_PORT_OVERRIDE) just update as well.

                              Then cancel the tasking.

                              Run these commands in MySQL (just to ensure all is well):

                              truncate table `fog`.`multicastSessions`;
                              truncate table `fog`.`multicastSessionsAssoc`;
                              DELETE FROM `fog`.`tasks` where `taskTypeID`='8';
                              

                              Then recreate your task and test again.

                              Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG! Get in contact with me (chat bubble in the top right corner) if you want to join in.

                              Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

                              Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

                              A 1 Reply Last reply Jul 24, 2016, 3:35 AM Reply Quote 0
                              • W
                                Wayne Workman
                                last edited by Jul 18, 2016, 4:25 PM

                                After doing the steps tom listed, it wouldn’t hurt to give the FOGMulticastManager a restart as well.

                                Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                Daily Clean Installation Results:
                                https://fogtesting.fogproject.us/
                                FOG Reporting:
                                https://fog-external-reporting-results.fogproject.us/

                                1 Reply Last reply Reply Quote 0
                                • A
                                  arainero @Tom Elliott
                                  last edited by Jul 24, 2016, 3:35 AM

                                  @Tom-Elliott Tried this with no success unfortunately. I just updated FOG to trunk to see if that would help, but that didn’t yield any positive results either.

                                  Would getting a pcap help at all with this? I wouldn’t know what to look for, but I could supply one I’m sure.

                                  1 Reply Last reply Reply Quote 0
                                  • A
                                    arainero
                                    last edited by arainero Jul 23, 2016, 11:12 PM Jul 24, 2016, 5:08 AM

                                    Well, I got some good news. I was able to get a successful multicast going tonight. I am not 100% sure of what fixed it. I did several things.

                                    Not in particular order:

                                    1. Updated FOG to trunk
                                    2. Disabled IGMP snooping from the switches
                                    3. Made a change to CentOS as seen here that has to do with routing (honestly don’t think this helped)
                                    4. Added 232.168.1.0 for FOG_MULTICAST_ADDRESS (it wouldn’t work without that)

                                    I plan to do more tests to really make sure it is working tomorrow and the following days. I will keep this updated if anything substantial changes.

                                    Thank you to everyone.

                                    1 Reply Last reply Reply Quote 0
                                    • 1
                                    • 2
                                    • 3
                                    • 3 / 3
                                    3 / 3
                                    • First post
                                      56/57
                                      Last post

                                    256

                                    Online

                                    12.0k

                                    Users

                                    17.3k

                                    Topics

                                    155.2k

                                    Posts
                                    Copyright © 2012-2024 FOG Project