Multicast Issues on Centos 7
-
@BedCruncher said:
Just realized maybe you were referring to have me run the UDP-Sender / UDP-Receiver commands manually?
Nope, I was asking you to run the command
ps ax | grep udp
and show the output to us. From what I can tell, you don’t seem to have any udp-sender processes running after you started the multicast task. Please make sure you have a valid image on your server:
ls -al /images/<img-name>
(please post the output here as well!)…
-
Here’s the result of the
ls -al
ls -al /images/
total 4
drwxrwxrwx.  5 root root   81 Apr 27 10:39 .
dr-xr-xr-x. 20 root root 4096 Apr 27 10:08 ..
drwxrwxrwx.  2 root root   22 Apr 27 10:39 dev
-rwxrwxrwx.  1 root root    0 Apr 27 08:48 .mntcheck
drwxrwxrwx.  2 root root   29 Apr 27 08:48 postdownloadscripts
drwxrwxrwx.  2 root root   69 Apr 27 10:39 W7Px64PreSysprep
and
ps -aux | grep udp
ps aux | grep udp
root 25751 0.0 0.0 112644 960 pts/1 S+ 10:45 0:00 grep --color=auto udp
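The only match in that output is the grep process itself, which is easy to misread as a hit. A minimal sketch of a filter that ignores it is below; the function name `count_udp_senders` is made up for illustration and is not part of FOG.

```shell
#!/bin/sh
# Count lines of `ps ax`-style output (read from stdin) that mention
# udp-sender, skipping the grep process that matches its own pattern.
# count_udp_senders is a hypothetical helper name, not a FOG command.
count_udp_senders() {
    awk '/udp-sender/ && !/grep/ { n++ } END { print n + 0 }'
}

# Usage on the FOG server (not run here):
#   ps ax | count_udp_senders
```

A result of 0 right after starting a multicast task would suggest the FOGMulticastManager never launched udp-sender.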
-
@BedCruncher No UDP-sender processes… please check your multicast.log to see if it actually starts any…
As stated in one of my earlier posts I see this in my multicast.log:
[04-26-16 9:46:58 pm] | CMD: cat /images/foo/d1p1.img | /usr/local/sbin/udp-sender --interface ens4 ...
You might want to upgrade to the latest version as Tom has fixed a bug which truncated the log file every so often. So you might miss the output. Upgrade and then see what you have in the logfile!
As well, show us what you have in the image directory:
ls -al /images/W7Px64PreSysprep
-
@Sebastian-Roth
Results of
ls -al /images/W7Px64PreSysprep/
total 22338008
drwxrwxrwx. 2 root root          69 Apr 27 10:39 .
drwxrwxrwx. 5 root root          81 Apr 27 10:39 ..
-rwxrwxrwx. 1 root root     1048576 Apr 27 10:12 d1.mbr
-rwxrwxrwx. 1 root root     8561702 Apr 27 10:12 d1p1.img
-rwxrwxrwx. 1 root root 22864501047 Apr 27 10:39 d1p2.img
-rwxrwxrwx. 1 root root         190 Apr 27 10:12 d1.partitions
On a side note, I went in earlier and completely wiped out my current install according to Uninstall Fog, then reinstalled this morning, and the multicast portion didn’t show up at all until I manually ran
touch /opt/fog/log/multicast.log
and the selection was also missing from the Log Viewer menu. I verified the file permissions of that file compared to the other logs under that directory.
Just downloaded build 7348 and am now getting
-
Following that screen, I let it run for a bit, then while it was doing the invalid session scrolling, I deleted and reinitiated a multicast session and it’s hanging at the same spot as previous.
-
@BedCruncher I am very sorry but without information (log file for example) I am not able to help. Posting the partclone picture is nice but does not really help I am afraid. Hope this does not sound ignorant, it’s not meant to!
We need to get some log output from the FOGMulticastManager (multicast.log)! Or at least see the udp-sender processes on the FOG server. Otherwise multicast will not work. Please try this:
service FOGMulticastManager restart
tail -f /var/log/fog/multicast.log
Notice the log path! Multicast daemon writes the log to /var/log…
-
@Sebastian-Roth said in Multicast Issues on Centos 7:
service FOGMulticastManager restart
tail -f /var/log/fog/multicast.log
Below is what that shows.
tail -f /var/log/fog/multicast.log
Can you verify the permissions that you have on that logfile, such as user, group, and file permissions? No, I completely understand; I was just trying to post that in the hopes that it would provide at least some extra information for your troubleshooting.
On my system the path
/var/log/fog
is actually a symlink to /opt/fog/log
and I didn’t create this manually. It had to have been put in place by the installation. As near as I can tell, fog is never actually writing to the multicast.log file. It only contained what I had posted earlier, from when udp-sender was run manually during the udp-sender/udp-receiver testing.
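Since the log path goes through a symlink, it can help to resolve it and check the real file. The following is only a sketch: `log_status` is a made-up helper name, it assumes a `readlink` that supports `-f` (GNU coreutils or BusyBox), and the writability test applies to whichever user runs it, which may differ from the user the service runs as.

```shell
#!/bin/sh
# Resolve a log path through any symlinks and report whether the real
# file exists and is writable by the current user.
# log_status is a hypothetical helper name, not part of FOG.
log_status() {
    real=$(readlink -f "$1") || real="$1"
    if [ ! -e "$real" ]; then
        echo "missing: $real"
    elif [ -w "$real" ]; then
        echo "writable: $real"
    else
        echo "not writable: $real"
    fi
}

# Usage on the FOG server (not run here):
#   log_status /var/log/fog/multicast.log
```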
-
@BedCruncher Did you run the service restart??? What do you get from this command?
ps ax | grep FOG
-
Just finished the upgrade to 7356 and ran
ps ax | grep FOG
16001 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager &
16004 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager &
16020 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGImageReplicator/FOGImageReplicator &
16023 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGImageReplicator/FOGImageReplicator &
16039 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGSnapinReplicator/FOGSnapinReplicator &
16042 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGSnapinReplicator/FOGSnapinReplicator &
16058 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGTaskScheduler/FOGTaskScheduler &
16061 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGTaskScheduler/FOGTaskScheduler &
16073 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGPingHosts/FOGPingHosts &
16076 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGPingHosts/FOGPingHosts &
16388 pts/1 S+ 0:00 grep --color=auto FOG
I can see that they are running as expected.
-
@Tom-Elliott Turns out something is wrong with the services logs in current trunk! I don’t see multicast.log on a fresh latest install… Service is properly running. Probably something to do with the static/self switch I suppose?!?
FOGMulticastManager[16754]: PHP Warning: fopen(): Filename cannot be empty in /var/www/html/fog/lib/service/fogservice.class.php on line 82
FOGMulticastManager[16754]: PHP Warning: fopen(): Filename cannot be empty in /var/www/html/fog/lib/service/fogservice.class.php on line 91
-
@Sebastian-Roth Should be fixed in latest, sorry.
-
I’ve updated to version 7386 to try and get this part of things fixed. Will post back once I test this.
-
I am now getting logging working, but
[04-28-16 7:46:38 am]
        ___           ___           ___
       /\  \         /\  \         /\  \
      /::\  \       /::\  \       /::\  \
     /:/\:\  \     /:/\:\  \     /:/\:\  \
    /::\-\:\  \   /:/  \:\  \   /:/  \:\  \
   /:/\:\ \:\__\ /:/__/ \:\__\ /:/__/_\:\__\
   \/__\:\ \/__/ \:\  \ /:/  / \:\  /\ \/__/
        \:\__\    \:\  /:/  /   \:\ \:\__\
         \/__/     \:\/:/  /     \:\/:/  /
                    \::/  /       \::/  /
                     \/__/         \/__/
  ###########################################
  #     Free Computer Imaging Solution      #
  #     Credits:                            #
  #     http://fogproject.org/credits       #
  #     GNU GPL Version 3                   #
  ###########################################
[04-28-16 7:46:38 am] Interface Ready with IP Address: xx.xx.xx.xx
[04-28-16 7:46:38 am] Interface Ready with IP Address: xx.xx.xx.xx
[04-28-16 7:46:38 am] Interface Ready with IP Address: 192.168.240.10
[04-28-16 7:46:38 am] Interface Ready with IP Address: REMOVED
[04-28-16 7:46:38 am] Interface Ready with IP Address: REMOVED
[04-28-16 7:46:38 am] * Starting MulticastManager Service
[04-28-16 7:46:38 am] * Checking for new items every 10 seconds
[04-28-16 7:46:38 am] * Starting service loop
[04-28-16 7:46:38 am] * No tasks found!
[04-28-16 7:46:48 am] * No tasks found!
[04-28-16 7:46:58 am] * No tasks found!
[04-28-16 7:47:09 am] * No tasks found!
[04-28-16 7:47:19 am] * No tasks found!
[04-28-16 7:47:29 am] * No tasks found!
[04-28-16 7:47:39 am] * No tasks found!
[04-28-16 7:47:49 am] * No tasks found!
[04-28-16 7:48:00 am] * No tasks found!
[04-28-16 7:48:10 am] * No tasks found!
[04-28-16 7:48:20 am] * No tasks found!
[04-28-16 7:48:30 am] * No tasks found!
[04-28-16 7:48:40 am] * No tasks found!
[04-28-16 7:48:50 am] * No tasks found!
[04-28-16 7:49:01 am] * No tasks found!
[04-28-16 7:49:11 am] * No tasks found!
[04-28-16 7:49:21 am] * No tasks found!
[04-28-16 7:49:31 am] * No tasks found!
[04-28-16 7:49:41 am] * No tasks found!
[04-28-16 7:49:51 am] * No tasks found!
[04-28-16 7:50:02 am] | Sleeping for 10 seconds to ensure tasks are properly submitted
[04-28-16 7:50:12 am] | 0 tasks to be cleaned
[04-28-16 7:50:12 am] | 1 task found
[04-28-16 7:50:12 am] | Task (4) Multi-Cast Task is new!
[04-28-16 7:50:12 am] | Task (4) Multi-Cast Task has been cleaned.
Udp-sender 20120424
[04-28-16 7:50:12 am] | Task (4) /images/W7Px64PreSysprep image file found.
Using mcast address 232.168.240.10
UDP sender for (stdin) at 192.168.240.10 on em2
Broadcasting control to 224.0.0.1
[04-28-16 7:50:12 am] | Task (4) 2 client(s) found.
[04-28-16 7:50:12 am] | Task (4) Multi-Cast Task sending on base port: 50028
[04-28-16 7:50:12 am] | CMD: cat /images/W7Px64PreSysprep/d1p1.img | /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/W7Px64PreSysprep/d1p2.img | /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint;
[04-28-16 7:50:12 am] | Task (4) Multi-Cast Task has started.
[04-28-16 7:50:22 am] | 0 tasks to be cleaned
[04-28-16 7:50:22 am] | 1 task found
[04-28-16 7:50:22 am] | Task (4) Multi-Cast Task is already running PID 13546
[04-28-16 7:50:32 am] | 0 tasks to be cleaned
[04-28-16 7:50:32 am] | 1 task found
[04-28-16 7:50:32 am] | Task (4) Multi-Cast Task is already running PID 13546
[04-28-16 7:50:42 am] | 0 tasks to be cleaned
[04-28-16 7:50:42 am] | 1 task found
after letting it run for quite a few minutes, the ending of the log above gets entered repeatedly. I also ran the command
ps aux | grep udp
ps aux | grep udp
root 13546 0.0 0.0 115240 1456 ? S 07:50 0:00 sh -c cat /images/W7Px64PreSysprep/d1p1.img | /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/W7Px64PreSysprep/d1p2.img | /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint;
root 13548 0.0 0.0 8640 664 ? S 07:50 0:00 /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint
root 13753 0.0 0.0 112644 964 pts/1 S+ 07:57 0:00 grep --color=auto udp
Still hangs at the same partclone screen as before.
ps ax | grep FOG
13062 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager &
13065 ? S 0:06 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager &
13081 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGImageReplicator/FOGImageReplicator &
13084 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGImageReplicator/FOGImageReplicator &
13100 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGSnapinReplicator/FOGSnapinReplicator &
13103 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGSnapinReplicator/FOGSnapinReplicator &
13119 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGTaskScheduler/FOGTaskScheduler &
13122 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGTaskScheduler/FOGTaskScheduler &
13134 ? Ss 0:00 /usr/bin/php -q /opt/fog/service/FOGPingHosts/FOGPingHosts &
13137 ? S 0:00 /usr/bin/php -q /opt/fog/service/FOGPingHosts/FOGPingHosts &
ps aux | grep Multicast
root 13062 0.0 0.8 323076 15852 ? Ss 07:46 0:00 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager &
root 13065 0.4 0.6 411168 13028 ? S 07:46 0:05 /usr/bin/php -q /opt/fog/service/FOGMulticastManager/FOGMulticastManager &
root 14024 0.0 0.0 112648 960 pts/1 S+ 08:06 0:00 grep --color=auto Multicast
-
@BedCruncher What’s the output if you go to this URL in the browser?
http://ip.of.fog.here/fog/service/ipxe/boot.php?mac=macofhostwithcolons
Here macofhostwithcolons is the MAC address (with colons) of the host that’s set up to do this tasking.
-
@Tom-Elliott
Here’s the info. I also compared it to a second host that I had added to a group for multicasting, and it appears identical, barring the MAC address of course. Let me know what else you can recommend and I’ll be glad to accommodate.
#!ipxe
set fog-ip 192.168.240.10
set fog-webroot fog
set boot-url http://${fog-ip}/${fog-webroot}
kernel bzImage32 loglevel=4 initrd=init_32.xz root=/dev/ram0 rw ramdisk_size=127000 keymap= web=192.168.240.10/fog/ consoleblank=0 rootfstype=ext4 rootfstype=ext4 mac=00:26:b9:aa:14:ec ftp=192.168.240.10 storage=192.168.240.10:/images/ storageip=192.168.240.10 web=192.168.240.10/fog/ osid=5 consoleblank=0 irqpoll hostname=0026b9aa14ec chkdsk=0 img=W7Px64PreSysprep imgType=mps imgPartitionType=all imgid=1 imgFormat= PIGZ_COMP=-6 hostearly=1 port=50028 type=down mc=yes
imgfetch init_32.xz
boot
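One thing worth comparing in that output is the `port=` kernel argument against the `--portbase` value in the udp-sender command from multicast.log (both show 50028 here). A rough sketch for pulling a single argument out of the boot.php output follows; `kernel_arg` is a name invented for this example, not a FOG tool.

```shell
#!/bin/sh
# Print the value of one key=value kernel argument from boot.php output
# read on stdin. If the key appears more than once, the last value wins.
# kernel_arg is a hypothetical helper name, not part of FOG.
kernel_arg() {
    tr ' ' '\n' | sed -n "s/^$1=//p" | tail -n 1
}

# Usage (not run here), with the placeholders from the URL above:
#   curl -s "http://ip.of.fog.here/fog/service/ipxe/boot.php?mac=macofhostwithcolons" | kernel_arg port
```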
-
@BedCruncher When the client boots to begin the tasking, is it on a separate subnet from the 192.168.240.10 server?
-
@Tom-Elliott
No it’s not. I have the em2 interface controlling the DHCP server and have a cable running to a 5 port Gig switch. It’s a dumb switch with no special routing or anything. I even checked under the dhcpd.leases file and can find it there.
lease 192.168.240.30 {
  starts 4 2016/04/28 12:50:28;
  ends 4 2016/04/28 18:50:28;
  cltt 4 2016/04/28 12:50:28;
  binding state active;
  next binding state free;
  rewind binding state free;
  hardware ethernet 00:26:b9:aa:14:ec;
  uid "\001\000&\271\252\024\354";
}
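For anyone who wants to double-check the subnet question with numbers, here is a small sketch that compares two IPv4 addresses under a given prefix length. The /24 prefix in the usage line is an assumption, since the netmask is never stated in this thread, and `ip_to_int` and `same_subnet` are illustrative names only.

```shell
#!/bin/sh
# Convert a dotted-quad IPv4 address to a single integer.
# ip_to_int and same_subnet are hypothetical helper names.
ip_to_int() {
    old_ifs=$IFS
    IFS=.
    set -- $1          # split the address into its four octets
    IFS=$old_ifs
    echo $(( ($1 << 24) + ($2 << 16) + ($3 << 8) + $4 ))
}

# Succeed (exit status 0) if both addresses share the same network
# for the given prefix length, e.g. 24 for 255.255.255.0.
same_subnet() {
    # 4294967295 is 0xFFFFFFFF, the all-ones 32-bit mask.
    mask=$(( (4294967295 << (32 - $3)) & 4294967295 ))
    a=$(ip_to_int "$1")
    b=$(ip_to_int "$2")
    [ $(( a & mask )) -eq $(( b & mask )) ]
}

# Usage with the server and lease addresses from this thread
# (the /24 prefix is assumed):
#   same_subnet 192.168.240.10 192.168.240.30 24 && echo "same subnet"
```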
EDIT:
Updated to build 7410 this morning to hopefully see if it might have been fixed.
EDIT 2:
Updated and initiated a new test and still getting the same results.
-
Hey guys, I had to step away over the weekend. Have there been any new updates?
-
@BedCruncher Now that we have the logging back can you please try running udp-sender by hand again using the exact same command as seen in the logs?
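To avoid retyping the command, a sketch like the one below can pull the most recent `CMD:` entry out of the log. It assumes the log keeps the `| CMD: ` prefix seen in the entries posted earlier in this thread; `last_multicast_cmd` is a made-up name for illustration.

```shell
#!/bin/sh
# Print the most recent command logged after "| CMD: " in multicast.log
# content read from stdin. Prints nothing if no such entry exists.
# last_multicast_cmd is a hypothetical helper name, not part of FOG.
last_multicast_cmd() {
    sed -n 's/^.*| CMD: //p' | tail -n 1
}

# Usage on the FOG server (not run here):
#   last_multicast_cmd < /var/log/fog/multicast.log
```

The printed line can then be reviewed and pasted into a root shell by hand, rather than being piped straight into `sh`.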
-
@Sebastian-Roth
So I tried both of the udp-sender commands
udp-sender --file /opt/fog/.fogsettings --log /tmp/multicast.log --ttl 32 --nopointopoint --interface em2
which results in
13:41:14.387632 Using mcast address 232.168.240.10
13:41:14.387698 UDP sender for /opt/fog/.fogsettings at 192.168.240.10 on em2
13:41:14.387708 Broadcasting control to 224.0.0.1
displaying in the logs and then no task found
and also
cat /images/W7Px64PreSysprep/d1p1.img | /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint;cat /images/W7Px64PreSysprep/d1p2.img | /usr/local/sbin/udp-sender --interface em2 --min-receivers 2 --max-wait 600 --portbase 50028 --full-duplex --ttl 32 --nokbd --nopointopoint;
which results in the same message as above, but not the repeated no active task found entries. I also updated to build 7470 earlier. It is acting differently. I don’t see that the udp-sender and udp-receiver are talking any more.