
ipxe booting "waiting for the link to come up"

FOG Problems
  • K
    kvothe88
    last edited by Jan 29, 2021, 7:46 AM

    Hi,

    I recently installed a FOG server on my domain. I use my own DHCP server (pfSense), not FOG’s DHCP. I was able to register and image a large number of different computers, but today I was trying to register a small Toshiba Satellite Pro.

    After PXE booting, FOG’s menu shows up. I then choose quick registration, and it shows “starting enp3s0 interface and waiting link to come up”, followed by “No link detected on enp3s0, skipping it. Failed to get an ip via DHCP!”
    That’s weird, because the laptop was able to get an IP and boot to the FOG server menu.

    Any suggestions? I’m stuck there.

    thanks in advance.

    • S
      Sebastian Roth Moderator
      last edited by Jan 29, 2021, 11:59 AM

      @kvothe88 So we know other devices work well from what you say. So this seems like a specific issue with the Toshiba Satellite pro device. Let me ask you a few questions and then I’ll explain some of the background.

      Which version of FOG and FOS kernel do you use? See in FOG web UI -> FOG Configuration -> Kernel Update

      What you need to know about the boot process is that different components are working together to make this work:

      • NIC ROM doing the initial PXE boot to pull an IP and get nextserver and filename information
      • with that info it downloads an iPXE binary from your FOG server, a “PXE bootloader” used to show the FOG boot menu or boot into a scheduled task; iPXE needs to query the DHCP server a second time
      • when a task is scheduled, iPXE will load and hand over to FOS (FOG OS, a tiny Linux with kernel and initrd doing all the work), which will boot up and again needs to get an IP from your DHCP server…

      Unfortunately those components are not able to hand over the IP information to the next part. And they all use their very own set of device drivers.

      So in your case the first two are going fine but the last one cannot get an IP.

      Please pay attention to the link-up LED on your Toshiba and the switch it is connected to. Is the link up at this stage where it says “waiting link to come up”?

      Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

      Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

      • K
        kvothe88 @Sebastian Roth
        last edited by Feb 3, 2021, 11:46 AM

        @sebastian-roth said in ipxe booting "waiting for the link to come up":

        Which version of FOG and FOS kernel do you use?

        Hi,

        Thank you for your answer, and sorry for the delay; I had no time to reply. My kernel version is 5.10.12 (from February 1st) for both bzImage and bzImage32. Also, the switch does show a link while the laptop is trying to get the IP address.

        I also registered the host manually using the LAN MAC address. At the first menu, before deploying an image or selecting quick registration, it seems to recognize the host by the name I gave it on the FOG server.

        After this, when I select an option, it tries to get a new IP and fails… that’s frustrating. Do all kernel versions include the same drivers, with new ones added in the newest versions?

        Any ideas? I also tried putting a plain switch between my computer and the regular switch to see whether it was a spanning tree problem, but no…

        Thanks in advance!

        • S
          Sebastian Roth Moderator
          last edited by Sebastian Roth Feb 3, 2021, 5:56 AM Feb 3, 2021, 11:55 AM

          @kvothe88 As you have registered the host, please schedule a debug task for it (same as a normal task but just before you click the button in the web UI there is a checkbox for debug) and boot it up. You will see the same issue with the network link but it should go ahead after a while and bring you to a command console. Maybe you need to hit ENTER a couple of times to get there.

          Now type the following commands, take a picture and upload that to the forum:

          dmesg | grep firmware
          ip a s
          lspci -nn | grep -i net
          

          The first command might come back blank but the other two should return something for sure! Keep the debug session open as we might have more commands needed to get the information needed.
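A minimal sketch of capturing the three commands above in one file instead of a photo. It assumes a POSIX shell in the debug console; the function name collect_nic_info and the default path /tmp/nic-info.txt are made up for illustration and are not part of FOG.

```shell
# Hypothetical helper: run each diagnostic and append its output to one file.
# A command that is missing or prints nothing is still recorded by name.
collect_nic_info() {
    out="${1:-/tmp/nic-info.txt}"
    : > "$out"    # start with an empty file
    for cmd in "dmesg | grep firmware" "ip a s" "lspci -nn | grep -i net"; do
        printf '### %s\n' "$cmd" >> "$out"
        # An empty grep result is expected here, so failures are ignored.
        sh -c "$cmd" >> "$out" 2>&1 || true
    done
    printf '%s\n' "$out"
}
```

Calling collect_nic_info with no argument writes to the default path and prints that path. Since FOS runs from an initrd, the file is gone after a reboot, so copy it off the machine before ending the debug session.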

          • K
            kvothe88 @Sebastian Roth
            last edited by Feb 3, 2021, 12:16 PM

            @sebastian-roth said in ipxe booting "waiting for the link to come up":

            The first command might come back blank

            Hi, thanks for the fast reply. Here is the image. The first command returned no results:

            0e07fa46-7dbd-4e59-9e1f-4b622c8b34fd.jpg

            • G
              george1421 Moderator @kvothe88
              last edited by Feb 3, 2021, 12:55 PM

              @kvothe88 Support for the Realtek NIC 10ec:8136 was first added to the Linux kernel in version 3.3, so the kernel FOG uses (even an old one) should support this NIC. My guess is that FOS Linux is missing a specific firmware patch for that NIC.

              Run this command from the debug console:
              dmesg | less
              Look through that log specifically for lines that say “failed to find fw file”. Each should list a file path and file name. Send us a screenshot of the missing firmware file message.

              Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!

              • K
                kvothe88 @george1421
                last edited by Feb 3, 2021, 1:15 PM

                @george1421

                Hi George, I could not find anything like that in the log. It’s very big and I could have missed something, but I checked it twice and saw nothing similar…

                Could those drivers be missing? Thanks.

                • G
                  george1421 Moderator @kvothe88
                  last edited by george1421 Feb 3, 2021, 7:29 AM Feb 3, 2021, 1:29 PM

                  @kvothe88 The driver should be in the FOS Linux kernel because it is a very common one. Sometimes the nic card requires a specific driver.

                  This is a slightly harder way to look if you don’t know vi commands, but key in vi /var/log/syslog

                  Then key in /Failed and press Enter (case is important). That will take you to the first occurrence of the word “Failed” in the log. If that is not it, press / and Enter again to go to the next “Failed”. If you have searched through the whole log and did not find a mention of missing fw (firmware), then that isn’t the problem. To exit vi, press ESC, then key in :q! and press Enter.
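For anyone not comfortable in vi, the same search can be sketched with grep. This assumes grep is available in the debug console and that the log is /var/log/messages or /var/log/syslog; the function name find_fw_errors and the search terms are illustrative choices, not an official FOG tool.

```shell
# Hypothetical helper: print matching log lines with their line numbers.
# The match is case-insensitive, so "Failed" and "failed" are both found.
find_fw_errors() {
    log="${1:-/var/log/messages}"
    if [ ! -r "$log" ]; then
        echo "cannot read $log" >&2
        return 1
    fi
    grep -n -i -E 'failed|firmware|fw file' "$log" || echo "no matches in $log"
}
```

Run it as find_fw_errors /var/log/syslog if that is the file present on the client. A "no matches" result points away from missing firmware, which is the same conclusion the manual vi search would give.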

                  This one will give us a bit more detail than the earlier lspci command. Key in the following: lspci -nn -k | more
                  Scroll down to where you see the line starting with 03:00. There should be two additional lines below it that we are interested in. Does it list a kernel module being used?

                  Here is an example of what I’m looking for. This is from my FOG server of course; the kernel lines are what is needed.

                  0b:00.0 Ethernet controller [0200]: VMware VMXNET3 Ethernet Controller [15ad:07b0] (rev 01)
                          Subsystem: VMware VMXNET3 Ethernet Controller [15ad:07b0]
                          Kernel driver in use: vmxnet3
                          Kernel modules: vmxnet3
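
Instead of scrolling through the pager, the driver line for one device can be pulled out with awk. A rough sketch, assuming awk is present in the debug console; the function name driver_for_slot is hypothetical.

```shell
# Hypothetical helper: read `lspci -nn -k` output on stdin and print the
# driver bound to the device whose line starts with the given slot.
driver_for_slot() {
    awk -v slot="$1" '
        # An unindented line starts a new device entry.
        /^[0-9a-f]/ { in_dev = (index($0, slot) == 1) }
        in_dev && /Kernel driver in use:/ {
            sub(/.*Kernel driver in use: */, "")
            print
            exit
        }
    '
}
# Usage in the debug console:
#   lspci -nn -k | driver_for_slot 03:00
```

An empty result means either the slot was not found or no driver is bound to that device.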
                  

                  • K
                    kvothe88 @george1421
                    last edited by Feb 3, 2021, 1:46 PM

                    @george1421 said in ipxe booting "waiting for the link to come up":

                    vi /var/log/syslog

                    vi /var/log/syslog: do you mean on the Toshiba laptop or on my FOG server? On the Toshiba, in the same console where I ran the debug task, there is no syslog file.

                    • K
                      kvothe88 @george1421
                      last edited by kvothe88 Feb 3, 2021, 8:08 AM Feb 3, 2021, 2:05 PM

                      @george1421

                      Here is everything I could get for your question:

                      ac1b412c-c780-4442-bdc0-2ac085ba08b3.jpg

                      227459d6-d431-4b27-a7a6-be8d85e86e6a.jpg

                      • G
                        george1421 Moderator @kvothe88
                        last edited by Feb 3, 2021, 2:14 PM

                        @kvothe88 said in ipxe booting "waiting for the link to come up":

                        On the Toshiba, in the same console where I ran the debug task, there is no syslog file

                        Yes, on the PXE-booting computer there should be a log file in /var/log; it’s called either syslog or messages. I can’t remember at the moment which.

                        I see that the right kernel driver, “r8169”, is being used. So it’s either missing the proper firmware (I need the info from above) or I need to send you an updated FOS Linux kernel that has the current Realtek driver. Let’s find out about the firmware bit first.

                        • K
                          kvothe88 @george1421
                          last edited by Feb 3, 2021, 2:23 PM

                          @george1421 Hi George,

                          I got nothing from that file. Searching for /Failed, /failed, /error, and /firmware (with and without a capital letter) found nothing.

                          3593a64e-f558-4a4c-8f11-fe83bc85c003.jpg 548e42a3-6baf-4d92-bcb5-548dfa1c4ec4.jpg

                          • G
                            george1421 Moderator @kvothe88
                            last edited by Feb 3, 2021, 2:46 PM

                            @kvothe88 I would like to see that log file myself. You can use the scp command on the PXE-booting client computer to copy the file to your FOG server, and from there upload it to this thread with a .txt file extension. The command is similar to this:
                            scp /var/log/messages root@192.168.1.10:/images
                            Of course, change the user and IP address to match your environment. The command will drop the file into /images on the FOG server.

                            OK, then let’s try one of the one-off kernels that has the updated Realtek driver in it.

                            https://drive.google.com/file/d/1vSu5L-DAZYK7VYiJtFfCYrrqJb963cMg/view?usp=sharing

                            Download that file from the link and save it as bzImageRT3 in /var/www/html/fog/service/ipxe directory.

                            Manually register this host with the fog server using the fog web ui then in the host registration page, set the kernel field to bzImageRT3. Save the settings and then pxe boot the target computer into the hardware compatibility check. See if it detects the NIC there.

                            Also, just so we know the scope, approx how old is this laptop?

                            • K
                              kvothe88 @george1421
                              last edited by Feb 3, 2021, 3:14 PM

                              @george1421 said in ipxe booting "waiting for the link to come up":

                              bzImageRT3

                              Hi George! Thanks again.

                              I was not able to copy the file to my FOG server; this was the error:

                              aee40515-2de8-4c83-9713-23fcc8dbfe59.jpg

                              Again, I was able to register the host in the FOG server UI. I copied the kernel where you told me and booted from PXE. I tried the compatibility check, and it was using bzImageRT3 as expected. I got some errors about dell_smbios, but unfortunately the link error showed up again:

                              f421a513-280b-4ba0-97ad-188cf122981e.jpg

                              • G
                                george1421 Moderator @kvothe88
                                last edited by Sebastian Roth Feb 3, 2021, 2:39 PM Feb 3, 2021, 3:25 PM

                                @kvothe88 Sorry, I’m trying to do too many things this AM. Yes, scp should have worked, but the issue we have is with the NETWORK, so no network functions are available.

                                OK, well, the updated Realtek drivers in the kernel I sent did not fix the issue. It’s still pointing back to the firmware.

                                If you have a usb flash drive we can use that to get the messages file out of the pxe booted computer.

                                1. Insert the usb drive into the target computer.
                                2. Issue this command from the debug console
                                  lsblk
                                3. Identify the usb flash drive by its size in the list. For the rest of this, let’s assume it shows up as /dev/sdb
                                4. Now let’s make a directory where we can mount that usb drive.
                                  mkdir /ext
                                5. Let’s connect the usb drive to that directory
                                  mount /dev/sdb1 /ext
                                6. That command will mount the first partition on that usb stick to the /ext directory. Issue this command to see the contents of the usb drive
                                  ls -la /ext
                                7. If you see what is expected, then copy the log file to the usb flash drive directory.
                                  cp /var/log/messages /ext
                                8. When done, unmount that usb drive with
                                  umount /ext
                                9. Now you will be able to remove the usb drive. Don’t remove it before you unmount it and the command completes.
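
The steps above can be sketched as one shell function. This is only an illustration: the name copy_log_to_usb and the DRY_RUN switch are made up here, and /dev/sdb1 is the assumption from the steps, so confirm the device with lsblk before running anything for real.

```shell
# Hypothetical helper wrapping the manual steps. With DRY_RUN=1 it only
# prints the commands it would run.
copy_log_to_usb() {
    part="${1:-/dev/sdb1}"
    log="${2:-/var/log/messages}"
    mnt="/ext"
    run() { if [ "${DRY_RUN:-0}" = 1 ]; then echo "$*"; else "$@"; fi; }
    # Stop at the first failing step; sync flushes pending writes first.
    # Note that the unmount command is spelled umount.
    run mkdir -p "$mnt" &&
    run mount "$part" "$mnt" &&
    run cp "$log" "$mnt"/ &&
    run sync &&
    run umount "$mnt"
}
```

Setting DRY_RUN=1 and calling copy_log_to_usb lists the five commands without touching the drive, which is a cheap way to check the device name before committing.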

                                • K
                                  kvothe88 @george1421
                                  last edited by Feb 3, 2021, 3:53 PM

                                  @george1421 Hi George, thanks for your reply. Here is the file:

                                  messages.txt

                                  • G
                                    george1421 Moderator @kvothe88
                                    last edited by Feb 3, 2021, 5:16 PM

                                    @kvothe88 Just to confirm: is “60:02:92:3e:ab:7a” the real MAC address of the network adapter in question? The log file says the link is down.

                                    Also how old is this computer?

                                    • K
                                      kvothe88 @george1421
                                      last edited by Feb 3, 2021, 5:52 PM

                                      @george1421 hi George,

                                      Yes, that’s the MAC. The interface is not able to get an IP, so that makes sense…

                                      The laptop is maybe 5 or 6 years old.

                                      • G
                                        george1421 Moderator @kvothe88
                                        last edited by Feb 3, 2021, 7:02 PM

                                        @kvothe88 After much head scratching I understand what is going on here. I don’t have a solution atm, but at least I have an idea.

                                        Realtek has a generic network driver called r8169 that supports (mostly) a large range of network adapters.

                                        Looking at the boot log in the file you sent me, I see this:

                                        r8169 0000:03:00.0 eth0: RTL8106e, 60:02:92:3e:ab:7a, XID 449, IRQ 91
                                        

                                        So the real nic is an RTL8106 not an r8168/r8169.
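
To find that line without reading the whole log, a small sed filter can print just the chip name. A sketch only: the function name r8169_chip is hypothetical, and the pattern assumes the log line has the same layout as the one quoted above.

```shell
# Hypothetical helper: read kernel log text on stdin and print the chip
# name the r8169 driver reports (the field right after "ethN: ").
r8169_chip() {
    sed -n 's/.*r8169 [^ ]* eth[0-9]*: \([^,]*\),.*/\1/p'
}
# Usage in the debug console:
#   dmesg | r8169_chip
#   r8169_chip < /var/log/messages
```

No output means no line in that layout was found, not necessarily that the driver failed to load.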

                                        I also found a reference saying there is a bug in the r8169 kernel driver in kernels after version 5.4, and the recommendation was to use the 4.19.x series kernel.

                                        ref: https://archived.forum.manjaro.org/t/realtek-rtl810xe-ethernet-not-working-after-kernel-5-4-31/135945/4

                                        It appears the issue was fixed in 5.4.33 (so it should also be fixed in 5.6.18).
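
A quick way to compare a running kernel against 5.4.33 is a version sort. This is a rough sketch: version_ge is a made-up helper name, and it relies on sort -V, which GNU coreutils provides but other sort implementations may lack. A higher version number alone does not prove that a given fix is present in that kernel branch, so treat the result as a hint.

```shell
# Hypothetical helper: succeed if version $1 is greater than or equal to $2.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}
# Usage:
#   version_ge "$(uname -r | cut -d- -f1)" 5.4.33 && echo "at or past 5.4.33"
```

By this check both 5.6.18 and 5.10.12 sort after 5.4.33, while any 4.19.x kernel sorts before it.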

                                        So what can we do quickly? See if downgrading your linux kernel to 4.19.x allows the system to boot.

                                        • S
                                          Sebastian Roth Moderator
                                          last edited by Feb 3, 2021, 8:13 PM

                                          @kvothe88 @george1421 It’s been busy at work and so I only just saw all your posts. Interesting. From the very first picture you posted we see that enp3s0 is there but says NO-CARRIER and so it seems to not be able to detect the link to be up. Firmware blob does not seem to be an issue from the information posted so far.

                                          Let’s try disabling auto-negotiation for testing to see if that makes a difference. Either do that on the switch if you can, or boot into a debug session and run ethtool -s enp3s0 autoneg off, then wait for a bit and check with ip a s to see if the NO-CARRIER flag goes away…
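
The check after the ethtool call can be sketched as a small filter over the ip output. The function name link_status is hypothetical, and the 10 second wait in the usage comment is an arbitrary stand-in for "wait for a bit".

```shell
# Hypothetical helper: read `ip a s <iface>` output on stdin and report
# whether the kernel sees a carrier, based on the flags inside <...>.
link_status() {
    flags=$(sed -n 's/^[0-9]*: [^:]*: <\([^>]*\)>.*/\1/p' | head -n 1)
    case ",$flags," in
        *,NO-CARRIER,*) echo "no carrier" ;;
        *,LOWER_UP,*)   echo "link up" ;;
        *)              echo "unknown" ;;
    esac
}
# Usage in the debug console:
#   ethtool -s enp3s0 autoneg off
#   sleep 10
#   ip a s enp3s0 | link_status
```

If the driver rejects the bare autoneg off setting, forcing a speed and duplex as well, for example ethtool -s enp3s0 speed 100 duplex full autoneg off, may be needed; those values are an assumption and were not confirmed in this thread.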

                                          Copyright © 2012-2024 FOG Project