Struggling to get multi-casting to work.
-
Hi
I have just set up a new FOG server and I am in the process of testing multi-casting. I have created a task with 1 client in, I pxe boot the client and it starts the processs, but then it just hangs at the partclone screen. I have checked the log files on the server and I am getting this repeating over and over.
[04-12-19 10:02:59 am] | Task ID: 2 Name: Multi-Cast Task is new
[04-12-19 10:02:59 am] | Task ID: 2 Name: Multi-Cast Task image file found, file: /images/250G7Windows10
[04-12-19 10:02:59 am] | Task ID: 2 Name: Multi-Cast Task 1 client found
[04-12-19 10:02:59 am] | Task ID: 2 Name: Multi-Cast Task sending on base port 50814
[04-12-19 10:02:59 am] | Command: /usr/local/sbin/udp-sender --interface dev --min-receivers 1 --max-wait 600 --portbase 50814 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/250G7Windows10/d1p1.img;/usr/local/sbin/udp-sender --interface dev --min-receivers 1 --max-wait 10 --portbase 50814 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/250G7Windows10/d1p2.img;/usr/local/sbin/udp-sender --interface dev --min-receivers 1 --max-wait 10 --portbase 50814 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/250G7Windows10/d1p3.img;/usr/local/sbin/udp-sender --interface dev --min-receivers 1 --max-wait 10 --portbase 50814 --full-duplex --ttl 32 --nokbd --nopointopoint --file /images/250G7Windows10/d1p4.img;
[04-12-19 10:02:59 am] | Task ID: 2 Name: Multi-Cast Task has started
[04-12-19 10:03:09 am] | Task ID: 2 Name: Multi-Cast Task is no longer running
[04-12-19 10:03:09 am] | Task ID: 2 Name: Multi-Cast Task has been killedI have run through the troubleshooting steps on https://wiki.fogproject.org/wiki/index.php/Multicasting with 1 and 2 clients and they complete successfully.
Can you guys point me in the right direction as to what could be causing the issue.
Thanks.
-
It seems like this is the week for multicast issues. Lets start out with some basics.
- Is the fog server on the same subnet (vlan) as the target computers?
- Do you have igmp snooping enabled on your network switches?
- From the log file it appears that the fog server is sending out all 4 partitions. Are you saying that the target computer stops at partition 1?
- Do you have any kind of firewall or screen router between the FOG server and the pxe booting computer? (akin question to #1)
- Is your network at least 1GbE or do you have some 100Mb/s networking elements in the data path between the FOG server and target computer?
-
@lister From the logs we see that the multicast task was only scheduled for a single host. This is totally fine, just wanted to ask if you intended to do so for testing or if you actually wanted to multicast with more clients?
The other thing I noticed in the logs “udp-sender --interface dev …”. I guess
dev
is not the right name for a netword device in Linux. Please check the settings in the web UI (FOG Configuration -> FOG Settings -> Multicast Settings -> UDPCAST INTERFACE and while you are at it, also check Storage -> DefaultMember -> Interface) -
@Sebastian-Roth said in Struggling to get multi-casting to work.:
I guess dev is not the right name for a network device in Linux.
Oooooo good eyes and catch.
@lister Will you post the output of this command
ip addr show
? -
@george1421 said in Struggling to get multi-casting to work.:
It seems like this is the week for multicast issues. Lets start out with some basics.
- Is the fog server on the same subnet (vlan) as the target computers?
- Do you have igmp snooping enabled on your network switches?
- From the log file it appears that the fog server is sending out all 4 partitions. Are you saying that the target computer stops at partition 1?
- Do you have any kind of firewall or screen router between the FOG server and the pxe booting computer? (akin question to #1)
- Is your network at least 1GbE or do you have some 100Mb/s networking elements in the data path between the FOG server and target computer?
Thanks for the response.
- Yes they are all on the same vlan.
- No we don’t have it enabled.
- No it doesn’t even begin the first partition.
- No as mention in 1. all devices are on the same vlan.
- All devices are connected by 1GbE connections.
-
@Sebastian-Roth said in Struggling to get multi-casting to work.:
@lister From the logs we see that the multicast task was only scheduled for a single host. This is totally fine, just wanted to ask if you intended to do so for testing or if you actually wanted to multicast with more clients?
The other thing I noticed in the logs “udp-sender --interface dev …”. I guess
dev
is not the right name for a netword device in Linux. Please check the settings in the web UI (FOG Configuration -> FOG Settings -> Multicast Settings -> UDPCAST INTERFACE and while you are at it, also check Storage -> DefaultMember -> Interface)Checked both locations and it has the correct interface specified.
@george1421 said in Struggling to get multi-casting to work.:
@Sebastian-Roth said in Struggling to get multi-casting to work.:
I guess dev is not the right name for a network device in Linux.
Oooooo good eyes and catch.
@lister Will you post the output of this command
ip addr show
?See below, both settings mentioned have ens33 specified.
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: ens32: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc fq_codel state DOWN group default qlen 1000
link/ether 00:50:56:91:81:37 brd ff:ff:ff:ff:ff:ff
inet6 fe80::250:56ff:fe91:8137/64 scope link
valid_lft forever preferred_lft forever
3: ens33: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UP group default qlen 1000
link/ether 00:50:56:91:67:b9 brd ff:ff:ff:ff:ff:ff
inet 172.16.150.10/24 brd 172.16.150.255 scope global ens33
valid_lft forever preferred_lft forever
inet 172.16.150.100/24 brd 172.16.150.255 scope global secondary dynamic ens33
valid_lft 11609sec preferred_lft 11609sec
inet6 fe80::250:56ff:fe91:67b9/64 scope link
valid_lft forever preferred_lft forever -
@lister said in Struggling to get multi-casting to work.:
3: ens33: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UP group default qlen 1000
link/ether 00:50:56:91:67:b9 brd ff:ff:ff:ff:ff:ff
inet 172.16.150.10/24 brd 172.16.150.255 scope global ens33
valid_lft forever preferred_lft forever
inet 172.16.150.100/24 brd 172.16.150.255 scope global secondary dynamic ens33
valid_lft 11609sec preferred_lft 11609sec
inet6 fe80::250:56ff:fe91:67b9/64 scope link
valid_lft forever preferred_lft foreverSo your defined multicast network adapter is ens33 right? The web ui shows this value?
Can you explain why ens33 has 2 IP addresses. I can see this confusing both FOG and the udpsend program.
-
@george1421 said in Struggling to get multi-casting to work.:
@lister said in Struggling to get multi-casting to work.:
3: ens33: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UP group default qlen 1000
link/ether 00:50:56:91:67:b9 brd ff:ff:ff:ff:ff:ff
inet 172.16.150.10/24 brd 172.16.150.255 scope global ens33
valid_lft forever preferred_lft forever
inet 172.16.150.100/24 brd 172.16.150.255 scope global secondary dynamic ens33
valid_lft 11609sec preferred_lft 11609sec
inet6 fe80::250:56ff:fe91:67b9/64 scope link
valid_lft forever preferred_lft foreverSo your defined multicast network adapter is ens33 right? The web ui shows this value?
Can you explain why ens33 has 2 IP addresses. I can see this confusing both FOG and the udpsend program.
Correct, I don’t know why it has 2 IP addresses, it is configured in netplan with a static address and dhcp disabled.
-
Thanks both for your help, I managed to get rid of the second IP address and the server is now multi-casting to 2 laptops.