Hyper V and Pxe boot to Fog problems
-
@lukebarone Found on the wiki. Only difference here is open general.h after downloading and change #define to #undefine for these lines:
#define DOWNLOAD_PROTO_HTTPS
#define IMAGE_TRUST_CMD
#define CERT_CMD
These lines aren’t consecutive in the file so you’ll have to look for each. -
@sebastian-roth Finally got it to work. Unsure if something is wrong on my side but I had to delete the sleep line to get it to compile properly. As a result, my images are dim since I had to pause the VMs to get screenshots.
Working:
No-worky:
-
@Paulman9 Great to see you got it working and figured a way to get the pictures. Sorry for the sleep compile issue. I forgot to tell you need to add the unistd header at the top (e.g. line 29) for that to work.
... #include <ipxe/device.h> #include <ipxe/console.h> #include <ipxe/init.h> #include <unistd.h> ...
Now the next step is to figure out which startup functions are called and where it hangs. Unfortunately there does not seem to be an easy way to get the function names from the pointers. So you need to add debug code to each of the startup functions by hand - sorry!
Here is a list of all eleven startup functions in iPXE (found runningfind ipxe/src/ -type f -exec grep "\.startup =" {} /dev/null \;
ipxe/src/hci/linux_args.c: .startup = linux_args_parse, ipxe/src/arch/x86/interface/pcbios/hidemem.c: .startup = hide_etherboot, ipxe/src/arch/x86/interface/pcbios/bios_console.c: .startup = bios_inject_startup, ipxe/src/arch/x86/image/initrd.c: .startup = initrd_startup, ipxe/src/arch/x86/core/cachedhcp.c: .startup = cachedhcp_startup, ipxe/src/arch/x86/core/runtime.c: .startup = runtime_init, ipxe/src/interface/linux/linux_console.c: .startup = linux_console_startup, ipxe/src/interface/efi/efi_timer.c: .startup = efi_tick_startup, ipxe/src/core/device.c: .startup = probe_devices, ipxe/src/crypto/rootcert.c: .startup = rootcert_init, ipxe/src/crypto/rbg.c: .startup = rbg_startup_fn,
Should maybe start with the crypto stuff as we think this might be causing it here. So edit
ipxe/src/crypto/rootcert.c
, jump to line 95 and add aDBGC
as first call after the function header:... static void rootcert_init ( void ) { DBGC(0x1, "rootcert start"); static int initialised; ...
Now compile the binary with
make ... DEBUG=init,rootcert
and try it out. Follow the same schema for all the other startup functions. The first parameter is just a color code, can be any hex number really. So you can use0x1
for the first,0x2
for the second if you like it colorful.Note that you have the
calling startup function 0x...
printout first and then your newly added output when it enters the particular startup routine. So I suspect it to halt after one of your newly added printouts. From there you can add more printouts throughout that function and those being called. Let me know what you find or if you get stuck at some point.PS: I’ve done those debugging steps a couple of times when trying to find out why iPXE would hang on some particular hardware. Usually I’d just compile the binary and give it to users for testing. So this is the first time I hand over the knowledge on how to debug iPXE init code and I am grateful @Paulman9 is keen to follow this. @Wayne-Workman mind adding that to the wiki as well?
-
@sebastian-roth Here is the output from a working vm:
I’m no programmer so you’re way over my head here haha Output seemed the same as before on the non-working one -
@Paulman9 Ok, so it’s definitely not the rootcert code causing the hang. Just keep going like this with all the other files. Try adding debug to
ipxe/src/crypto/rbg.c
next I’d suggest andpxe/src/interface/efi/efi_timer.c
is a good candidate for an issue as well!! Just don’t forget to add those to themake ... DEBUG=
command too when compiling. -
@sebastian-roth I suppose this is what I am looking for then?
Non working machine stalled here
-
@Paulman9 Yeah, exactly. Now from here just put in more
DBGC
startments in therbg_startup
function (line 73ff) to see if it gets past thefetch_uuid_setting
anddrbg_instantiate
calls. -
@Paulman9 Any news on this. Please let me know if you need further assistance.
-
I just came across this by accident and wondered if this was ever solved. Reading through it and the related iPXE forum post (link) it seems like this was caused and fixed by Microsoft. So if you have this issue, update and you should be fine.
-
@Sebastian-Roth Sorry, I completely forgot about this. Just updated to latest kernel on my server and tested on 1803, worked perfect. Thanks for the update.