• Recent
    • Unsolved
    • Tags
    • Popular
    • Users
    • Groups
    • Search
    • Register
    • Login

    Imaging stops after client boots up

    Scheduled Pinned Locked Moved Solved
    FOG Problems
    3
    29
    7.9k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • F
      Foglalt
      last edited by

      Hi!

      I have a strange issue. Task for deployment is given, pc restarted for imaging and it stops with a root prompt. Like if something the scripts failed to start their job on clientside. If i test a linuxrc script, it goes on and finishes without error, image deployed properly. But why it stucks right before doing imaging?

      Fog 1.2, stock install. Problem was not seen till a point which cannot be interfering, but let it be mentioned: an extra iso boot image option was added to the boot menu for ipxe.

      Any suggestion?

      (3-4 times it happened and kinda frustrating, no same visuals noticed. Last occurance was accompanied with a DHCP/BOOTP error displayed, as follows

      “DHCP/BOOTP: Reply not for us, op[2] xid[45d6eedc].”

      As I recall this is not a good simptom as not all same issue was followed by this dhcp message.

      1 Reply Last reply Reply Quote 0
      • Wayne WorkmanW
        Wayne Workman
        last edited by

        What’s the model of the target-host?

        Have you looked through the list of working devices and problematic devices?

        [url]http://fogproject.org/wiki/index.php/WorkingDevices[/url]
        [url]http://fogproject.org/wiki/index.php/ProblematicDevices[/url]

        Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
        Daily Clean Installation Results:
        https://fogtesting.fogproject.us/
        FOG Reporting:
        https://fog-external-reporting-results.fogproject.us/

        1 Reply Last reply Reply Quote 0
        • F
          Foglalt
          last edited by

          Hi!

          Yes, I checked the device list, not on badboys list. And, that same hardware worked before (2-3 days before). We have plenty of practically clone hw, and the one we happen to have problem, the exact one, worked even before. That is the most frustrating part of the story. We use fog for years and are happy to use it. And now this happens (well, we use 1.2 only for a few weeks, and problem only occures this version, not the previously used 0.32).

          1 Reply Last reply Reply Quote 0
          • Wayne WorkmanW
            Wayne Workman
            last edited by

            Can you look through this WiKi article and give these steps a try?

            [url]http://fogproject.org/wiki/index.php/Troubleshooting_an_image_push_to_a_client[/url]

            Lets see exactly where the problem is.

            Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
            Daily Clean Installation Results:
            https://fogtesting.fogproject.us/
            FOG Reporting:
            https://fog-external-reporting-results.fogproject.us/

            1 Reply Last reply Reply Quote 0
            • F
              Foglalt
              last edited by

              It has error massage at the 7th step. error says 2 things:

              • cannot read /proc/partitions
              • cannot access /proc/mounts

              for me it is a bit strange at this point, especially knowing that if it has a valid task to deploy, gets stuck at a prompt, THEN commint ./linuxrc command it goes into imaging process, and finishes it without any further error. additional, maybe useful info: the point where it stucks has a prompt:
              “root@146#”

              where the 146 is the first number from the cliens or server ip. does it have any additional meaning for you?

              [quote=“Wayne Workman, post: 46356, member: 28155”]Can you look through this WiKi article and give these steps a try?

              [url]http://fogproject.org/wiki/index.php/Troubleshooting_an_image_push_to_a_client[/url]

              Lets see exactly where the problem is.[/quote]

              1 Reply Last reply Reply Quote 0
              • F
                Foglalt
                last edited by

                I still have no clue on why imaging stops. Up to this point we take notes to find a pattern in hardware, etc, but still find no solution. And after a LOT of machines cloned it is a really frustrating to see that now it doesnt work more than 1 out of 10 (!) cases. Even worked-before hardwares stop working. Still no error messages at all!

                1 Reply Last reply Reply Quote 0
                • Wayne WorkmanW
                  Wayne Workman
                  last edited by

                  Rogue DHCP?

                  Perhaps FOG’s static IP got assigned to something else by accident?

                  Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                  Daily Clean Installation Results:
                  https://fogtesting.fogproject.us/
                  FOG Reporting:
                  https://fog-external-reporting-results.fogproject.us/

                  1 Reply Last reply Reply Quote 0
                  • Wayne WorkmanW
                    Wayne Workman
                    last edited by

                    You know,

                    You could capture the traffic on the FOG machine, but depending on how far imaging gets, it might be too much data to hold in RAM.

                    There’s an article in the WiKi called TCPDump. You could run that either on the FOG machine, or use a hub (not a switch) connected between the FOG server and your switch, and capture traffic on a separate device.

                    Kill the capture moments after the failure. Open the resulting file, and look down at the bottom… see what’s going on…

                    This is starting to sound a lot like a network issue… (still thinking rogue DHCP, possibly) because you had a working FOG server, and then all of a sudden, everything is not working right… And I assume if you made changes on the FOG server prior to these problems, you would have brought that up here.

                    Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                    Daily Clean Installation Results:
                    https://fogtesting.fogproject.us/
                    FOG Reporting:
                    https://fog-external-reporting-results.fogproject.us/

                    1 Reply Last reply Reply Quote 0
                    • Wayne WorkmanW
                      Wayne Workman
                      last edited by

                      You should also try to run the script found in this post:
                      [url]http://fogproject.org/forum/threads/lets-make-scripts.12551/#post-46119[/url]

                      It’s an info-gathering script. You can post the results here in a .zip file, but I also encourage you to look through it yourself, too.

                      Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                      Daily Clean Installation Results:
                      https://fogtesting.fogproject.us/
                      FOG Reporting:
                      https://fog-external-reporting-results.fogproject.us/

                      1 Reply Last reply Reply Quote 0
                      • F
                        Foglalt
                        last edited by

                        Rouge dhcp. Well, frist thought was this, as not on fog, but we had some same situation, so we go and rested it. Not this kind of problem here (as far as tests went). And, for the fun of it, now we make tons of tests with different hardwares to see any pattern if we may see, and latest test machines produced this: if ever occure, nothing can change it (well, 10-12 reboot, no change on same box). BUT, cos always there is a but… if it stucks after.xz file load, i get the root@146 like prompt i can ANYTIME go on with imaging commencing ./linuxrc. Anytime!. So, if network issue anyhow, it is only in the boot process of the ipxe downloaded OS. It can be a clue

                        Now we collect motherboard info for listing working and not working machines and still there are machines we find with working and not working status.

                        I even replaced the fog server machine with other one (not the best solution, but I relocated the disk to other hw). Same issues: sometimes works, sometimes not. Pretty frustrating.

                        I will go through the suggestions you made above as soon as i am able, but as always, we have to do many things separately. And, as we were after testing phase, this system got into production, so, not the best to fall back to previous versions or go backup; but I think I will not be able to avoid that… 😞

                        1 Reply Last reply Reply Quote 0
                        • Wayne WorkmanW
                          Wayne Workman
                          last edited by

                          What version of FOG are you running? What OS distribution?

                          Also, before you do any more of these labor-intense tests, can you try to re-run your installer?

                          Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                          Daily Clean Installation Results:
                          https://fogtesting.fogproject.us/
                          FOG Reporting:
                          https://fog-external-reporting-results.fogproject.us/

                          1 Reply Last reply Reply Quote 0
                          • F
                            Foglalt
                            last edited by

                            I run the troubleshoot.sh now, made a little change on host ip to be less… well, open 🙂 . Here is the log it generated.

                            [url]http://pastebin.com/1PKpRzmN[/url]

                            1 Reply Last reply Reply Quote 0
                            • Wayne WorkmanW
                              Wayne Workman
                              last edited by

                              [quote=“Foglalt, post: 46977, member: 26236”]I run the troubleshoot.sh now, made a little change on host ip to be less… well, open 🙂 . Here is the log it generated.

                              [url]http://pastebin.com/1PKpRzmN[/url][/quote]

                              The first thing I noticed is that a ton of your services aren’t even running/reporting (unless you deleted this from the output??):

                              [CODE]----------------------RPCBind status below
                              ----------------------NFS status below
                              ----------------------Firewall status below
                              ----------------------FOGMulticastManager status below
                              ----------------------FOGSnapinReplicator status below
                              ----------------------FOGImageReplicator status below
                              ----------------------FOGScheduler status below
                              ----------------------Installation log below
                              [/CODE]

                              Can you check your firewall?
                              Can you make sure NFS is running?
                              Can you make sure RPCBind is running?

                              Also, the FOG services should be running, too.

                              and, your ownership for things in /images is not uniform, however it shouldn’t matter considering everything has 777 permissions.

                              Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                              Daily Clean Installation Results:
                              https://fogtesting.fogproject.us/
                              FOG Reporting:
                              https://fog-external-reporting-results.fogproject.us/

                              1 Reply Last reply Reply Quote 0
                              • F
                                Foglalt
                                last edited by

                                Ask anything! All is welcome and maybe closer to solution. Btw, you said things strange to me (and I was puzzled to see them before posting log to you, but I thought i may be underinformed what is normal and what is not.

                                • Firewall iptables is totally empty (iptables -L, all default output)
                                • NSF can be reached (as I said linuxrc can read from it as manually started)
                                • rpcbind ha processes (well, I am not a pro of all, so, I olny tested processes; what should look for)
                                • Fog services. well, it surprised me to see this, I havent noticed. How could I acomplish that? It was a stock installation, so what went rong without notice…!? Well, ok, how to redo them without making a mess?
                                • ownership of files of the images are not uniform, yes, cos after testing was ok, some images from old storage was copied for further testing with new image vs old image context.

                                As for asking about removed parts of log: nothing was removed, only some personal info is hidden. 🙂

                                1 Reply Last reply Reply Quote 0
                                • Wayne WorkmanW
                                  Wayne Workman
                                  last edited by

                                  Can you take a picture of the error you’re seeing on clients? Or, the screen it sits on? Post the picture here, please.

                                  Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                  Daily Clean Installation Results:
                                  https://fogtesting.fogproject.us/
                                  FOG Reporting:
                                  https://fog-external-reporting-results.fogproject.us/

                                  1 Reply Last reply Reply Quote 0
                                  • F
                                    Foglalt
                                    last edited by

                                    well, actually nothing special (sorry, today i had zero time to test and make photo). after init files downloaded (.xz for example) it gets a normal looking root prompt. (will make photo, btw, next week)

                                    1 Reply Last reply Reply Quote 0
                                    • F
                                      Foglalt
                                      last edited by

                                      Well, I am sure not a debug friendly solution I did, but I had to move on to a working state with my fog server. So, I replaced the machine, do a full reinstall, redo the personalized changes of the server, then tadaaa, all works as charm (btw, I keep the actual fog server and the buggy one to have a bit-by-bit comparison to see what is the difference. Up to that point we will know nothing unfortunatelly.

                                      Wayne, I really appreciate your attention. If I find any detectable and – best case – reproducable reason for my issue, I will post it, send bug report if needed.

                                      Bests

                                      1 Reply Last reply Reply Quote 0
                                      • F
                                        Foglalt
                                        last edited by

                                        UPDATE: sad news, especially for me. Case reopened, as the fresh installation, on different box, with ultra zero modification (only pxe menu access was set) AND SAME ISSUE… means: I smell some bug in the infrastructure what the old version was resistent (or change was after v1.2 installed).

                                        I would welcome all debug methods to detect where the problem lies (sure not in fog basic setups, maybe not in hardware of the server, as 2 totally different instal does same problem)

                                        Any suggestion?

                                        1 Reply Last reply Reply Quote 0
                                        • Wayne WorkmanW
                                          Wayne Workman
                                          last edited by

                                          Try Trunk? [url]http://jbob.io/wiki/index.php/Upgrade_to_trunk[/url]

                                          Couldn’t hurt at this point.

                                          Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                          Daily Clean Installation Results:
                                          https://fogtesting.fogproject.us/
                                          FOG Reporting:
                                          https://fog-external-reporting-results.fogproject.us/

                                          1 Reply Last reply Reply Quote 0
                                          • Wayne WorkmanW
                                            Wayne Workman
                                            last edited by

                                            Tom thinks the drive HAD a GPT, but the update made it think it has MBR and that broke it. And Windows doesn’t properly remove GPT fragments from the disk.

                                            You can use fixparts to repair it.

                                            Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                            Daily Clean Installation Results:
                                            https://fogtesting.fogproject.us/
                                            FOG Reporting:
                                            https://fog-external-reporting-results.fogproject.us/

                                            1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 1 / 2
                                            • First post
                                              Last post

                                            297

                                            Online

                                            12.0k

                                            Users

                                            17.3k

                                            Topics

                                            155.2k

                                            Posts
                                            Copyright © 2012-2024 FOG Project