could not pxe boot (input/output error) when do multicast
-
I’m using version 7202.
Unicast works fine. The only plugin I use is Slack. Thanks,
-
@dvchuyen Hi, see https://forums.fogproject.org/topic/6910/could-not-boot-input-output-error-http-ipxe-org-1d0c6539. I don’t know if it’s useful.
-
I’ve already found and fixed this issue, just putting through a few last minute tests.
-
Should be fixed in latest. Please update and let me know. Thank you,
-
@Tom-Elliott
I can boot the machine now, but it gets stuck at this screen.
-
-
@Wayne-Workman
I am checking the topics. However, multicast worked fine until I updated FOG. Thanks,
-
@dvchuyen When did you update, and from which version to which version? I ask because the post doesn’t mention it.
I tested multicast and can verify that it works. When I have seen the issue shown in your picture before, the cause tended to be that the interface UDP-Cast was trying to use was incorrect. You can also provide the /opt/fog/log/multicast.log file to help us help you more effectively.
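One quick way to see which interface FOG is handing to UDP-Cast is to pull the `--interface` value out of the CMD lines in multicast.log. This is a minimal sketch, assuming a `grep` that supports `-o` (GNU and BSD grep do); the helper name `sender_interfaces` is made up for illustration and is not part of FOG.

```shell
#!/bin/sh
# Print each distinct interface name that was passed to udp-sender,
# reading multicast.log content from stdin.
# NOTE: sender_interfaces is a hypothetical helper, not a FOG command.
sender_interfaces() {
    # -o prints only the matching part; "--" stops grep option parsing
    # so the pattern itself may start with dashes.
    grep -o -- '--interface [^ ]*' | awk '{ print $2 }' | sort -u
}

# Example (log path as given in this thread):
#   sender_interfaces < /opt/fog/log/multicast.log
```

The printed name can then be compared with the interfaces that actually exist on the server, for example the entries under /sys/class/net.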
-
I remember it worked in v6566 (I can’t remember exactly). I updated through SVN. I have also updated to the latest version (7228), and the same thing happens. Nothing has changed in the network infrastructure. My FOG server runs CentOS 7 on ESXi. I created a test VM on the same network and it does not work either. I also rebooted the FOG server.
Here is the multicast.log:
[root@fog ~]# cat /opt/fog/log/multicast.log
[04-20-16 8:46:19 pm] | 0 tasks to be cleaned
[04-20-16 8:46:19 pm] | 1 task found
[04-20-16 8:46:19 pm] | Task (9) a is new!
[04-20-16 8:46:19 pm] | Task (9) a has been cleaned.
[04-20-16 8:46:19 pm] | Task (9) /images/Win10ESNoDriver image file found.
Udp-sender 20120424
Using mcast address 232.168.1.252
UDP sender for (stdin) at 192.168.1.252 on eno16777984
Broadcasting control to 224.0.0.1
[04-20-16 8:46:19 pm] | Task (9) 1 client(s) found.
[04-20-16 8:46:19 pm] | Task (9) a sending on base port: 63670
[04-20-16 8:46:19 pm] | CMD: cat /images/Win10ESNoDriver/d1p1.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 60 --portbase 63670 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/Win10ESNoDriver/d1p2.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 60 --portbase 63670 --full-duplex --ttl 32 --nokbd --nopointopoint;
[04-20-16 8:46:19 pm] | Task (9) a has started.
[04-20-16 8:46:29 pm] | 0 tasks to be cleaned
[04-20-16 8:46:29 pm] | 1 task found
[04-20-16 8:46:29 pm] | Task (9) a is already running PID 6099
[04-20-16 8:46:40 pm] | 0 tasks to be cleaned
[04-20-16 8:46:40 pm] | 1 task found
[04-20-16 8:46:40 pm] | Task (9) a is already running PID 6099
[04-20-16 8:46:50 pm] | 0 tasks to be cleaned
[04-20-16 8:46:50 pm] | 1 task found
[04-20-16 8:46:50 pm] | Task (9) a is already running PID 6099
[04-20-16 8:47:01 pm] | 0 tasks to be cleaned
[04-20-16 8:47:01 pm] | 1 task found
[04-20-16 8:47:01 pm] | Task (9) a is already running PID 6099
[04-20-16 8:47:11 pm] | 0 tasks to be cleaned
[04-20-16 8:47:11 pm] | 1 task found
[04-20-16 8:47:11 pm] | Task (9) a is already running PID 6099
-
@dvchuyen said in could not pxe boot (input/output error) when do multicast:
/usr/local/sbin/udp-sender --interface eno16777984
How many nodes do you have?
-
@Tom-Elliott
I have only 1 node. Images are stored on the FOG server. The server has 2 NICs: one for the management console and one for imaging.
-
It seems that the multicast job is cancelled automatically. I set the timeout to 10 minutes.
```
[04-25-16 5:29:51 pm] * No tasks found!
[04-25-16 5:30:02 pm] * No tasks found!
[root@fog ~]# tail /opt/fog/log/multicast.log
[04-25-16 5:30:23 pm] | Sleeping for 10 seconds to ensure tasks are properly submitted
[root@fog ~]# tail /opt/fog/log/multicast.log
[04-25-16 5:30:33 pm] | Task (13) a sending on base port: 63100
Udp-sender 20120424
bind socket to 192.168.1.252:63101 (Address already in use)
cat: write error: Broken pipe
[04-25-16 5:30:33 pm] | CMD: cat /images/Win10ESNoDriver/d1p1.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 600 --portbase 63100 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/Win10ESNoDriver/d1p2.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 600 --portbase 63100 --full-duplex --ttl 32 --nokbd --nopointopoint;
[04-25-16 5:30:33 pm] | Task (13) a has started.
[04-25-16 5:30:43 pm] | 0 tasks to be cleaned
[04-25-16 5:30:43 pm] | 1 task found
[04-25-16 5:30:43 pm] | Task (13) a is no longer running.
[04-25-16 5:30:43 pm] | Task (13) a has been cleaned as cancelled.
[root@fog ~]# vim /opt/fog/log/multicast.log
```
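The "bind socket to 192.168.1.252:63101 (Address already in use)" line in this log generally means some other process already holds that UDP port. One possibility, which is only an assumption and is not confirmed anywhere in this thread, is a udp-sender left over from an earlier task. A minimal sketch for checking whether a port is taken, assuming the `ss` tool from iproute2 is installed; the helper name `udp_port_bound` is made up for illustration:

```shell
#!/bin/sh
# Succeed (exit 0) if the `ss -uln` style output on stdin shows a UDP
# socket bound to the given local port, otherwise fail (exit 1).
# NOTE: udp_port_bound is a hypothetical helper, not a FOG command.
udp_port_bound() {
    awk -v port="$1" '
        # Check every field for one that ends in ":<port>"; the peer
        # column shows ":*" for unconnected sockets, so it cannot match.
        { for (i = 1; i <= NF; i++) if ($i ~ (":" port "$")) found = 1 }
        END { exit !found }
    '
}

# Example: if the port is taken, list any udp-sender processes.
#   ss -uln | udp_port_bound 63101 && pgrep -a udp-sender
```

If a stale sender does show up, stopping it before scheduling the next multicast task should free the port.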
-
@dvchuyen And is interface eno16777984 connected to the network that is expected to handle the multicast traffic? The fact that the task appears to cancel directly after starting leads me to believe that eno16777984 is not the NIC that is expected to be in use. Either the interface itself doesn’t exist, or it doesn’t have an address associated with it when you’re trying to run the multicast task.
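Both possibilities (the interface is missing, or it has no address) can be checked in one step. This is a minimal sketch, assuming the `ip` tool from iproute2 is available on the server; the helper name `iface_ipv4` is made up for illustration.

```shell
#!/bin/sh
# Print the IPv4 address(es) assigned to one interface, reading
# `ip -o -4 addr show` output from stdin. Prints nothing when the
# interface does not exist or has no IPv4 address.
# NOTE: iface_ipv4 is a hypothetical helper, not a FOG command.
iface_ipv4() {
    # In one-line (-o) mode each record looks like:
    #   2: eth0    inet 192.168.1.5/24 brd ... scope global eth0 ...
    awk -v ifc="$1" '$2 == ifc && $3 == "inet" { print $4 }'
}

# Example:
#   ip -o -4 addr show | iface_ipv4 eno16777984
```

Empty output for the interface named in the udp-sender command would point at one of the two causes above. The match is exact, so a single mistyped digit in the interface name also yields empty output.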
-
@Tom-Elliott said in could not pxe boot (input/output error) when do multicast:
Th
It does… so weird
```
[root@fog ~]# ifconfig
eno16777984: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 1500
        inet 192.168.1.252  netmask 255.255.255.0  broadcast 192.168.1.255
        inet6 fe80::20c:29ff:fe8f:977b  prefixlen 64  scopeid 0x20<link>
        ether 00:0c:29:8f:97:7b  txqueuelen 1000  (Ethernet)
        RX packets 93418  bytes 9024294 (8.6 MiB)
        RX errors 0  dropped 0  overruns 0  frame 0
        TX packets 55970  bytes 124838779 (119.0 MiB)
        TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0

eno33557248: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 1500
        inet 172.16.0.3  netmask 255.255.255.0  broadcast 172.16.0.255
        inet6 fe80::20c:29ff:fe8f:9785  prefixlen 64  scopeid 0x20<link>
        ether 00:0c:29:8f:97:85  txqueuelen 1000  (Ethernet)
        RX packets 2044261  bytes 204256783 (194.7 MiB)
        RX errors 0  dropped 0  overruns 0  frame 0
        TX packets 1839259  bytes 129113712 (123.1 MiB)
        TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0
```
-
@dvchuyen and that’s about how most of us feel when Tom posts to a thread.
-
I updated to the latest version and restarted the server. Now the task is not cancelled, but it is still stuck at “Starting to restore image”.
```
[04-26-16 8:18:42 am] | 0 tasks to be cleaned
[04-26-16 8:18:42 am] | 1 task found
[04-26-16 8:18:42 am] | Task (14) a is new!
[04-26-16 8:18:42 am] | Task (14) a has been cleaned.
[04-26-16 8:18:42 am] | Task (14) /images/Win10ESNoDriver image file found.
[04-26-16 8:18:42 am] | Task (14) 1 client(s) found.
[04-26-16 8:18:42 am] | Task (14) a sending on base port: 63100
Udp-sender 20120424
Using mcast address 232.168.1.252
UDP sender for (stdin) at 192.168.1.252 on eno16777984
Broadcasting control to 224.0.0.1
[04-26-16 8:18:42 am] | CMD: cat /images/Win10ESNoDriver/d1p1.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 600 --portbase 63100 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/Win10ESNoDriver/d1p2.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 600 --portbase 63100 --full-duplex --ttl 32 --nokbd --nopointopoint;
[04-26-16 8:18:42 am] | Task (14) a has started.
[04-26-16 8:18:52 am] | 0 tasks to be cleaned
[04-26-16 8:18:52 am] | 1 task found
[04-26-16 8:18:52 am] | Task (14) a is already running PID 3646
[04-26-16 8:19:03 am] | 0 tasks to be cleaned
[04-26-16 8:19:03 am] | 1 task found
[04-26-16 8:19:03 am] | Task (14) a is already running PID 3646
```
-
@dvchuyen There are commands in here to clear the tables related to multicast: https://wiki.fogproject.org/wiki/index.php?title=Troubleshoot_Downloading_-_Multicast
-
@Wayne-Workman Yes, I already cleared the tables and then re-ran the installer:
```
[04-26-16 1:28:48 pm] | 0 tasks to be cleaned
[04-26-16 1:28:48 pm] | 1 task found
[04-26-16 1:28:48 pm] | Task (1) a is new!
[04-26-16 1:28:48 pm] | Task (1) a has been cleaned.
Udp-sender 20120424
Using mcast address 232.168.1.252
UDP sender for (stdin) at 192.168.1.252 on eno16777984
Broadcasting control to 224.0.0.1
[04-26-16 1:28:48 pm] | Task (1) /images/Win10ESNoDriver image file found.
[04-26-16 1:28:48 pm] | Task (1) 1 client(s) found.
[04-26-16 1:28:48 pm] | Task (1) a sending on base port: 63100
[04-26-16 1:28:48 pm] | CMD: cat /images/Win10ESNoDriver/d1p1.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 600 --portbase 63100 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/Win10ESNoDriver/d1p2.img | /usr/local/sbin/udp-sender --interface eno16777984 --min-receivers 1 --max-wait 600 --portbase 63100 --full-duplex --ttl 32 --nokbd --nopointopoint;
[04-26-16 1:28:48 pm] | Task (1) a has started.
[04-26-16 1:28:59 pm] | 0 tasks to be cleaned
[04-26-16 1:28:59 pm] | 1 task found
[04-26-16 1:28:59 pm] | Task (1) a is already running PID 16550
[04-26-16 1:29:10 pm] | 0 tasks to be cleaned
[04-26-16 1:29:10 pm] | 1 task found
[04-26-16 1:29:10 pm] | Task (1) a is already running PID 16550
[04-26-16 1:29:21 pm] | 0 tasks to be cleaned
```
-
@dvchuyen Have you tried multicasting by hand, as shown in the wiki articles Wayne posted about a week ago (see the earlier posts in this thread)?
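For readers unfamiliar with what "by hand" looks like, here is a rough sketch of a manual sender and receiver pair. It is not the wiki's exact procedure. The sender flags are copied from the CMD lines in the logs above, except that `--file` replaces the `cat` pipe and `--max-wait` is left out so the sender simply waits for a receiver. The file paths and the helper names `manual_sender_cmd` and `manual_receiver_cmd` are made up for illustration. The functions only print the commands, so nothing is sent until the printed lines are run on the server and on a client.

```shell
#!/bin/sh
# Print (do not run) a udp-sender command for a small test file.
# NOTE: hypothetical helper; flags mirror the CMD lines in multicast.log.
manual_sender_cmd() {
    # $1 = interface, $2 = port base, $3 = file to send
    printf 'udp-sender --file %s --interface %s --min-receivers 1 --portbase %s --full-duplex --ttl 32 --nokbd --nopointopoint\n' "$3" "$1" "$2"
}

# Print the matching udp-receiver command for the client side.
# NOTE: hypothetical helper; the port base must match the sender's.
manual_receiver_cmd() {
    # $1 = port base, $2 = file to write the received data to
    printf 'udp-receiver --portbase %s --nokbd --file %s\n' "$1" "$2"
}

# Example values; the /tmp paths are placeholders.
manual_sender_cmd eno16777984 63100 /tmp/mcast-test.bin
manual_receiver_cmd 63100 /tmp/mcast-test.out
```

If a small file transfers this way but a FOG multicast task still hangs, the network path is probably fine and the problem is more likely on the FOG side. If the manual transfer hangs too, the interface, switch, or firewall are the more likely places to look.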