@sebastian-roth said in Another TFTP timeout issue:
Do you still get the exact same error message “No DHCP or proxyDHCP offer received”? If not, post another picture.
No, I never got that error. I always got the TFTP timeout error.
@sebastian-roth I started from scratch with a new VM and used a different subnet to see if that was the issue but it’s all the same.
Thank you for reminding me about those changes. I added them to dhcpd.conf again but still no dice. I’m back to square one.
Yes, I work at a recycler, so there’s lots of different equipment I’ve checked.
@sebastian-roth It’s getting even weirder now. I thought I managed to fix it after trying another clean install.
I was even able to capture a few Windows images using VirtualBox. But I just figured out that everything works fine ONLY in VirtualBox. Nothing has changed when trying to PXE boot a physical laptop or PC. I still get a TFTP timeout error.
A dumb switch doesn’t make a difference for either the virtual or physical PXE attempt (virtual works every time, physical does not). I’m not using any fancy networking. The physical laptop uses the exact same adapter that is bridged to VirtualBox when it succeeds, so I have no idea what is going on. I have restarted both the VMs and the actual server but that hasn’t fixed it.
Server -> VirtualBox: Success
Server -> Dumb Switch -> VirtualBox: Success
Server -> Physical Laptop: Fail
Server -> Dumb Switch -> Physical Laptop: Fail
I captured two pcap files. One from the successful VirtualBox and one from the unsuccessful TFTP timeout physical laptop.
@sebastian-roth said in Another TFTP timeout issue:
With debug enabled you should get to a command shell after a while. Please run the following commands and take a picture:
Here you go:
By the way, did you see my post on extending your dhcpd.conf to get option 66/67? https://forums.fogproject.org/post/141631
Yes I made those changes but nothing changed with either the attempt to PXE boot VirtualBox or the physical laptop.
@sebastian-roth said in Another TFTP timeout issue:
Sorry my bad! Newer Ubuntu versions seem to log those requests to /var/log/apache2/other_vhosts_access.log…
Roger that. I tried it again and I do see the 3 requests for those files.
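For anyone following along, here is a rough sketch of how that check could be scripted instead of eyeballing the tail output. It only looks for the three file names mentioned in this thread (boot.php, bzImage, init.xz) anywhere in the log lines. The function names are my own, and the log path is the one Sebastian mentioned for newer Ubuntu versions, so adjust it if your install logs elsewhere.

```python
# Minimal sketch: check whether an Apache access log mentions the three
# requests a FOG PXE boot is expected to make. Names are hypothetical.
EXPECTED = ("boot.php", "bzImage", "init.xz")


def seen_boot_requests(log_lines):
    """Map each expected file name to True if any log line mentions it."""
    return {name: any(name in line for line in log_lines) for name in EXPECTED}


def check_log(path="/var/log/apache2/other_vhosts_access.log"):
    """Read the log at `path` and report which expected requests appear."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return seen_boot_requests(handle.readlines())
```

Calling `check_log()` on the FOG server after a PXE attempt returns a dictionary with one True/False entry per file, which makes it easy to see which request never arrived.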
@sebastian-roth said in Another TFTP timeout issue:
Can you please run tail -f /var/log/apache2/access.log while PXE booting the VirtualBox VM? You should see requests for boot.php, then bzImage and init.xz and lastly another request as connection check which we see in the picture. Do you see all those?
I PXE booted to the failed registration attempt but unless I’m doing it wrong this file is completely empty for me. See below:
@sebastian-roth Disabled the firewall but the issue persisted. I got something else though.
Here’s the pcap file for the registration attempt outlined below: output.pcap
I tried doing a registration like you suggested and I got this:
However, if I wait a couple of seconds while in the FOG menu, the registration will start (I tested waiting a few seconds on memtest and it started running fine) but DHCP will fail. See here:
As far as the setup, I have my physical Proxmox server with the FOG VM installed and set up to use its interfaces. I have a laptop with VirtualBox running that is plugged directly into the back of the server. To PXE boot, I have a bridged adapter configured in the VirtualBox settings.
I’ve also tried eliminating the complexity by just physically PXE booting the actual laptop. But I keep getting the same results anyway, so it’s just faster for me to use VirtualBox without having to restart over and over again.
Edit: Thought I should clarify, though, that the TFTP timeout issue when PXE booting the actual laptop hasn’t changed. I just thought that VirtualBox getting past that might say something about what the issue could be.
@george1421 I think I might have something.
I tried PXE booting using VirtualBox instead. IT DOES SUCCESSFULLY PXE BOOT, but take a look at the network interface. During the PXE boot you can see that it’s not up initially.
After getting to the FOG menu, this is what happens when you try to pick an item (in this case memtest). You can see that the connection is dropping for some unknown reason.
The cables are good. The equipment and ports are good, everything lights up. DHCP is working. The FOG management page is accessible.
I’m out of ideas here.
@george1421 I’m using Proxmox 6.2 and Ubuntu Server 20.04 for FOG.
Does the networking inside the VM and in Proxmox look right to you? I’m sorry for bothering but I’m a little out of my element here.
@george1421 Don’t really know where to look now. This is the output from ps aux | grep dhcp:
@george1421 Here is the dhcpd.conf
# DHCP Server Configuration file
#see /usr/share/doc/dhcp*/dhcpd.conf.sample
# This file was created by FOG
#Definition of PXE-specific options
# Code 1: Multicast IP Address of bootfile
# Code 2: UDP Port that client should monitor for MTFTP Responses
# Code 3: UDP Port that MTFTP servers are using to listen for MTFTP requests
# Code 4: Number of seconds a client must listen for activity before trying
# to start a new MTFTP transfer
# Code 5: Number of seconds a client must listen before trying to restart
# a MTFTP transfer
option space PXE;
option PXE.mtftp-ip code 1 = ip-address;
option PXE.mtftp-cport code 2 = unsigned integer 16;
option PXE.mtftp-sport code 3 = unsigned integer 16;
option PXE.mtftp-tmout code 4 = unsigned integer 8;
option PXE.mtftp-delay code 5 = unsigned integer 8;
option arch code 93 = unsigned integer 16;
use-host-decl-names on;
ddns-update-style interim;
ignore client-updates;
# Specify subnet of ether device you do NOT want service.
# For systems with two or more ethernet devices.
# subnet 136.165.0.0 netmask 255.255.0.0 {}
subnet 192.168.70.0 netmask 255.255.255.0{
option subnet-mask 255.255.255.0;
range dynamic-bootp 192.168.70.32 192.168.70.254;
default-lease-time 21600;
max-lease-time 43200;
option routers 192.168.70.1;
#option routers 0.0.0.0
next-server 192.168.70.11;
class "Legacy" {
match if substring(option vendor-class-identifier, 0, 20) = "PXEClient:Arch:00000";
filename "undionly.kkpxe";
}
class "UEFI-32-2" {
match if substring(option vendor-class-identifier, 0, 20) = "PXEClient:Arch:00002";
filename "i386-efi/ipxe.efi";
}
class "UEFI-32-1" {
match if substring(option vendor-class-identifier, 0, 20) = "PXEClient:Arch:00006";
filename "i386-efi/ipxe.efi";
}
class "UEFI-64-1" {
match if substring(option vendor-class-identifier, 0, 20) = "PXEClient:Arch:00007";
filename "ipxe.efi";
}
class "UEFI-64-2" {
match if substring(option vendor-class-identifier, 0, 20) = "PXEClient:Arch:00008";
filename "ipxe.efi";
}
class "UEFI-64-3" {
match if substring(option vendor-class-identifier, 0, 20) = "PXEClient:Arch:00009";
filename "ipxe.efi";
}
class "SURFACE-PRO-4" {
match if substring(option vendor-class-identifier, 0, 32) = "PXEClient:Arch:00007:UNDI:003016";
filename "ipxe.efi";
}
class "Apple-Intel-Netboot" {
match if substring(option vendor-class-identifier, 0, 14) = "AAPLBSDPC/i386";
option dhcp-parameter-request-list 1,3,17,43,60;
if (option dhcp-message-type = 8) {
option vendor-class-identifier "AAPLBSDPC";
if (substring(option vendor-encapsulated-options, 0, 3) = 01:01:01) {
# BSDP List
option vendor-encapsulated-options 01:01:01:04:02:80:00:07:04:81:00:05:2a:09:0D:81:00:05:2a:08:69:50:58:45:2d:46:4f:47;
filename "ipxe.efi";
}
}
}
}
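To make it easier to reason about which boot file a given client should be offered, here is a small sketch that mimics the plain `PXEClient:Arch:` class rules from the config above. It is only a model of the prefix comparison on option 60 (vendor-class-identifier), not how dhcpd itself is implemented; the function and table names are made up, and the SURFACE-PRO-4 and Apple classes are left out for brevity.

```python
# Hypothetical model of the "match if substring(...)" class rules above.
# Each rule compares the first 20 characters of DHCP option 60.
RULES = [
    ("PXEClient:Arch:00000", "undionly.kkpxe"),     # Legacy BIOS
    ("PXEClient:Arch:00002", "i386-efi/ipxe.efi"),  # UEFI-32-2
    ("PXEClient:Arch:00006", "i386-efi/ipxe.efi"),  # UEFI-32-1
    ("PXEClient:Arch:00007", "ipxe.efi"),           # UEFI-64-1
    ("PXEClient:Arch:00008", "ipxe.efi"),           # UEFI-64-2
    ("PXEClient:Arch:00009", "ipxe.efi"),           # UEFI-64-3
]


def boot_file_for(vendor_class):
    """Return the filename of the first rule whose prefix matches, else None."""
    for prefix, filename in RULES:
        if vendor_class[:len(prefix)] == prefix:
            return filename
    return None
```

Feeding it the vendor class string seen in a packet capture shows which `filename` the matching class would hand out; a result of `None` means none of these classes matched that client.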
@george1421 The FOG installer did. At least I think so. I never messed with any DHCP settings.
Here’s the .pcap file.
@george1421 When asked if I wanted FOG to handle DHCP I put yes.
Side note: I tried reverting to a snapshot and doing another FOG install, and now I can’t pull undionly.kpxe. I get this:
And when PXE booting I get this:
I’m going to just wipe the VM and try again. Think I messed something up.
Edit: Okay. I’m back to just the regular old TFTP timeout issue. Back to square one at least.
@george1421 Hey George. I set up the FOG server to provide DHCP during the install. I’m testing PXE by plugging a laptop directly into the back of the physical server hosting the FOG VM. The server I’m using is a Super Micro E200-8D.
I have a dedicated FOG server on a Proxmox box I’ve been setting up all day today. I get the following when trying to PXE boot a client.
I’ve read through all the other TFTP threads and tried all the troubleshooting steps I could find, doing clean re-installs multiple times along the way, but I still have the same issue. I’m able to successfully pull undionly.kpxe from a separate Windows machine, so I can see that TFTP is working.
No clue what’s going on.
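In case it helps anyone repeat the TFTP check from a machine without a TFTP client, below is a minimal sketch of a probe that sends one read request and reports the first reply. It assumes plain RFC 1350 TFTP on UDP port 69 with no options negotiated; the helper names are my own, and it only looks at the first packet rather than downloading the whole file.

```python
import socket
import struct

TFTP_PORT = 69  # standard TFTP port


def build_rrq(filename, mode="octet"):
    """Build a TFTP read request: opcode 1, filename, NUL, mode, NUL."""
    return (struct.pack("!H", 1) + filename.encode("ascii") + b"\0"
            + mode.encode("ascii") + b"\0")


def describe_reply(packet):
    """Summarise the first reply: DATA (opcode 3) or ERROR (opcode 5)."""
    opcode, number = struct.unpack("!HH", packet[:4])
    if opcode == 3:
        return "DATA block %d, %d bytes" % (number, len(packet) - 4)
    if opcode == 5:
        message = packet[4:].split(b"\0", 1)[0].decode("ascii", "replace")
        return "ERROR %d: %s" % (number, message)
    return "unexpected opcode %d" % opcode


def probe(server, filename, timeout=5.0):
    """Send one read request and report the first reply, or 'timeout'."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        sock.sendto(build_rrq(filename), (server, TFTP_PORT))
        try:
            packet, _ = sock.recvfrom(1024)
        except socket.timeout:
            return "timeout"
    return describe_reply(packet)
```

Something like `probe("192.168.70.11", "undionly.kpxe")` would use the next-server address from my dhcpd.conf. A "DATA block 1" result means the server answered the request, while "timeout" reproduces what the PXE client is seeing.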
@Sebastian-Roth said in Image replication failing due to syntax error?:
If I am correct you should be able to fix this by manually applying this minimal code change in /var/www/html/fog/lib/service/fogservice.class.php:
That did the trick!
I’ll look into upgrading the FOG server. Will running a new install script do an in-place upgrade without breaking anything?
I’ve set up a storage node and I discovered that the images are not being replicated from the master node. When looking at the log I see the following:
[09-18-20 9:51:18 am] * Found Image to transfer to 1 node
[09-18-20 9:51:18 am] | File Name: dev/postinitscripts
[09-18-20 9:51:18 am] * Not syncing Image between nodes
[09-18-20 9:51:18 am] | File Name: dev/postinitscripts
[09-18-20 9:51:18 am] | File or path cannot be reached.
[09-18-20 9:51:18 am] * Not syncing Image between groups
[09-18-20 9:51:18 am] | Image Name: Compudopt
[09-18-20 9:51:18 am] | There are no other members to sync to.
[09-18-20 9:51:19 am] * Not syncing Image between groups
[09-18-20 9:51:19 am] | Image Name: macOS 10.13 High Sierra
[09-18-20 9:51:19 am] | There are no other members to sync to.
[09-18-20 9:51:19 am] * Not syncing Image between groups
[09-18-20 9:51:19 am] | Image Name: macOS 10.14 Catalina
[09-18-20 9:51:19 am] | There are no other members to sync to.
[09-18-20 9:51:19 am] * Not syncing Image between groups
[09-18-20 9:51:19 am] | Image Name: Windows 10 Home UEFI
[09-18-20 9:51:19 am] | There are no other members to sync to.
[09-18-20 9:51:19 am] * Not syncing Image between groups
[09-18-20 9:51:19 am] | Image Name: Windows 10 Professional UEFI
[09-18-20 9:51:19 am] | There are no other members to sync to.
[09-18-20 9:51:19 am] * Attempting to perform Group -> Nodes image replication.
[09-18-20 9:51:20 am] * Found Image to transfer to 1 node
[09-18-20 9:51:20 am] | Image Name: Compudopt
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.fixed_size_partitions (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.has_grub (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.mbr (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.minimum.partitions (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.original.fstypes (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.original.swapuuids (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1.partitions (ssh-server)
[09-18-20 9:51:20 am] # Compudopt: File does not exist d1p1.img (ssh-server)
[09-18-20 9:51:20 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file /opt/fog/log/fogreplicator.Compudopt.transfer.ssh-server.log;set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/Compudopt" "/images/Compudopt"; exit' -u fogproject,[Protected] 192.168.1.152
[09-18-20 9:51:20 am] | Started sync for Image Compudopt - Resource id #2094204
sh: 1: Syntax error: "(" unexpected
[09-18-20 9:51:20 am] * Found Image to transfer to 1 node
[09-18-20 9:51:20 am] | Image Name: macOS 10.13 High Sierra
[09-18-20 9:51:21 am] # macOS 10.13 High Sierra: File does not exist d1.mbr (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.13 High Sierra: File does not exist d1.original.uuids (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.13 High Sierra: File does not exist d1.partitions (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.13 High Sierra: File does not exist d1p1.img (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.13 High Sierra: File does not exist d1p2.img (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.13 High Sierra: File does not exist d1p3.img (ssh-server)
[09-18-20 9:51:21 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file /opt/fog/log/fogreplicator.macOS 10.13 High Sierra.transfer.ssh-server.log;set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/macOS10.13HighSierra" "/images/macOS10.13HighSierra"; exit' -u fogproject,[Protected] 192.168.1.152
[09-18-20 9:51:21 am] | Started sync for Image macOS 10.13 High Sierra - Resource id #2094236
sh: 1: Syntax error: "(" unexpected
[09-18-20 9:51:21 am] * Found Image to transfer to 1 node
[09-18-20 9:51:21 am] | Image Name: macOS 10.14 Catalina
[09-18-20 9:51:21 am] # macOS 10.14 Catalina: File does not exist d1.mbr (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.14 Catalina: File does not exist d1.original.uuids (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.14 Catalina: File does not exist d1.partitions (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.14 Catalina: File does not exist d1p1.img (ssh-server)
[09-18-20 9:51:21 am] # macOS 10.14 Catalina: File does not exist d1p2.img (ssh-server)
[09-18-20 9:51:21 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file /opt/fog/log/fogreplicator.macOS 10.14 Catalina.transfer.ssh-server.log;set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/macOS10.14Catalina" "/images/macOS10.14Catalina"; exit' -u fogproject,[Protected] 192.168.1.152
[09-18-20 9:51:21 am] | Started sync for Image macOS 10.14 Catalina - Resource id #2094267
sh: 1: Syntax error: "(" unexpected
[09-18-20 9:51:21 am] * Found Image to transfer to 1 node
[09-18-20 9:51:21 am] | Image Name: Windows 10 Home UEFI
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.fixed_size_partitions (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.mbr (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.minimum.partitions (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.original.fstypes (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.original.swapuuids (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.original.uuids (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1.partitions (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1p1.img (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1p2.img (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1p3.img (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Home UEFI: File does not exist d1p4.img (ssh-server)
[09-18-20 9:51:22 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file /opt/fog/log/fogreplicator.Windows 10 Home UEFI.transfer.ssh-server.log;set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/Windows10HomeUEFI" "/images/Windows10HomeUEFI"; exit' -u fogproject,[Protected] 192.168.1.152
[09-18-20 9:51:22 am] | Started sync for Image Windows 10 Home UEFI - Resource id #2094304
sh: 1: Syntax error: "(" unexpected
[09-18-20 9:51:22 am] * Found Image to transfer to 1 node
[09-18-20 9:51:22 am] | Image Name: Windows 10 Professional UEFI
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.fixed_size_partitions (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.mbr (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.minimum.partitions (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.original.fstypes (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.original.swapuuids (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.original.uuids (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1.partitions (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1p1.img (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1p2.img (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1p3.img (ssh-server)
[09-18-20 9:51:22 am] # Windows 10 Professional UEFI: File does not exist d1p4.img (ssh-server)
[09-18-20 9:51:22 am] | CMD: lftp -e 'set xfer:log 1; set xfer:log-file /opt/fog/log/fogreplicator.Windows 10 Professional UEFI.transfer.ssh-server.log;set ftp:list-options -a;set net:max-retries 10;set net:timeout 30; mirror -c --parallel=20 -R --ignore-time -vvv --exclude ".srvprivate" "/images/Windows10ProfessionalUEFI" "/images/Windows10ProfessionalUEFI"; exit' -u fogproject,[Protected] 192.168.1.152
[09-18-20 9:51:22 am] | Started sync for Image Windows 10 Professional UEFI - Resource id #2094341
sh: 1: Syntax error: "(" unexpected
[09-18-20 9:51:22 am] | Sync finished - Resource id #2094204
[09-18-20 9:51:22 am] | Sync finished - Resource id #2094236
[09-18-20 9:51:22 am] | Sync finished - Resource id #2094267
[09-18-20 9:51:22 am] | Sync finished - Resource id #2094304
[09-18-20 9:51:22 am] | Sync finished - Resource id #2094341
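For reference, here is a rough sketch of how the log above could be scanned to list which image syncs hit the shell error. It assumes the format shown in this log, where a "Started sync for Image ..." line is immediately followed by the "Syntax error" line from sh; the function name is made up for illustration.

```python
import re

# Matches the "Started sync" lines as they appear in the log above.
START = re.compile(r"Started sync for Image (.+) - Resource id #(\d+)")


def failed_syncs(log_lines):
    """Return image names whose 'Started sync' line is directly
    followed by a shell 'Syntax error' line."""
    failed = []
    previous = None  # image name from the line just before, if any
    for line in log_lines:
        match = START.search(line)
        if match:
            previous = match.group(1)
            continue
        if previous and "Syntax error" in line:
            failed.append(previous)
        previous = None
    return failed
```

Passing it the lines of the replicator log returns the affected image names in order, which for the log above would be all five images.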