Multicast not deploying
-
I searched and found loads of information, but none of the descriptions matched the issue I am facing.
I have a group created with 30 machines registered to it, and each MAC address is in the list only once.
I click the Deploy Multicast button and it says "All 30 machines were queued without error.", but then it never actually does anything. I check Task Management under “Active Multicasts” and I see the task there, stating it is at 0%. All machines reboot and load the PXE menu fine, but 3 seconds later the task is gone and no imaging has started. Each individual machine has a file created for it in the pxelinux.cfg folder. Help would be greatly appreciated. Where do I need to begin searching for errors?
How do I verify that Multicast is still enabled and I didn’t turn it off someplace?
Thanks in advance!!!
-
You should check the FOGMulticastManager log at /opt/fog/log/multicast.log. The problem is the FOGMulticastManager daemon: it is killing your multicast task and the tasks of the clients.
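If the log is long, a small script can boil it down. The following is only a minimal Python sketch and not part of FOG: `summarize_tasks` and `TASK_LINE` are made-up names, and the three status phrases it looks for are assumptions based on how task lines are worded in the log excerpt posted further down this thread.

```python
import re
from collections import Counter

# Assumed line shape, e.g.:
#   [06-06-13 9:31:39 am] | [06-06-13 9:31:39 am] Task (13) Redding Staff is no longer running.
TASK_LINE = re.compile(
    r"Task \((\d+)\) (.+?) "
    r"(is no longer running|will not be cleaned yet|has been cleaned)"
)


def summarize_tasks(lines):
    """Count how often each status message appears per (task id, task name)."""
    summary = {}
    for line in lines:
        match = TASK_LINE.search(line)
        if match:
            task_id, name, status = match.groups()
            key = (int(task_id), name)
            summary.setdefault(key, Counter())[status] += 1
    return summary
```

Passing an open file handle for /opt/fog/log/multicast.log to `summarize_tasks` gives one counter per task. A task that only ever shows "is no longer running" and then "has been cleaned" never actually started sending.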
-
I have run a couple of tests. I started with the log file, and at first the problem was my NIC being eth1 instead of eth0.
I adjusted the settings, changed the network card to eth0 and rebooted, and now it reports properly. But now my units only sit at the blue “Please Wait” screen. I can see the task is active and that it is a multicast, and I can even see that one of the machines is supposedly deploying while the rest are in the queue. When I physically check the machine, it is still sitting at the blue “Please Wait” screen. I left them overnight and this morning they are still at the blue screen.
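For reference, a quick way to confirm that the interface name in the settings really exists on the server. This is just a rough Python sketch: `interface_exists` is a made-up helper, not anything FOG ships, and the `available` argument is only there so it can be tried without touching the real network stack.

```python
import socket


def interface_exists(name, available=None):
    """Return True if `name` is one of the network interface names on this host."""
    if available is None:
        # socket.if_nameindex() returns (index, name) pairs for local interfaces.
        available = [if_name for _, if_name in socket.if_nameindex()]
    return name in available
```

On the server itself, `interface_exists("eth0")` should come back True for whichever interface the multicast sender is pointed at.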
So I began here: [url]http://fogproject.org/forum/threads/wiki-troubleshooting-multicast.22/[/url]
I ran the tests, and when I run the commands in debug mode they complete without a problem. I will post my log in just a moment when I get back on the server. It's not the WHOLE file, just a section covering the past hour.
I noticed the log didn’t show my commands from debug mode, but they did complete successfully.
But when I look in the other log, it appears there is a machine that is not reporting in. I’ve tried rebooting them all multiple times.
[code]
[06-06-13 9:31:39 am] | [06-06-13 9:31:39 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:31:39 am] | [06-06-13 9:31:39 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:31:49 am] * [06-06-13 9:31:49 am] Checking if I am the group manager.
[06-06-13 9:31:50 am] * [06-06-13 9:31:50 am] I am the group manager.
[06-06-13 9:31:50 am] | [06-06-13 9:31:50 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:31:50 am] | [06-06-13 9:31:50 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:32:00 am] * [06-06-13 9:32:00 am] Checking if I am the group manager.
[06-06-13 9:32:00 am] * [06-06-13 9:32:00 am] I am the group manager.
[06-06-13 9:32:00 am] | [06-06-13 9:32:00 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:32:00 am] | [06-06-13 9:32:00 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:32:10 am] * [06-06-13 9:32:10 am] Checking if I am the group manager.
[06-06-13 9:32:10 am] * [06-06-13 9:32:10 am] I am the group manager.
[06-06-13 9:32:10 am] | [06-06-13 9:32:10 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:32:10 am] | [06-06-13 9:32:10 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:32:20 am] * [06-06-13 9:32:20 am] Checking if I am the group manager.
[06-06-13 9:32:20 am] * [06-06-13 9:32:20 am] I am the group manager.
[06-06-13 9:32:20 am] | [06-06-13 9:32:20 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:32:20 am] | [06-06-13 9:32:20 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:32:30 am] * [06-06-13 9:32:30 am] Checking if I am the group manager.
[06-06-13 9:32:30 am] * [06-06-13 9:32:30 am] I am the group manager.
[06-06-13 9:32:30 am] | [06-06-13 9:32:30 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:32:30 am] | [06-06-13 9:32:30 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:32:40 am] * [06-06-13 9:32:40 am] Checking if I am the group manager.
[06-06-13 9:32:40 am] * [06-06-13 9:32:40 am] I am the group manager.
[06-06-13 9:32:40 am] | [06-06-13 9:32:40 am] Task (13) Redding Staff is no longer running.
[06-06-13 9:32:40 am] | [06-06-13 9:32:40 am] Task (13) Redding Staff will not be cleaned yet (5 min delay).
[06-06-13 9:32:50 am] * [06-06-13 9:32:50 am] Checking if I am the group manager.
[06-06-13 9:32:50 am] * [06-06-13 9:32:50 am] I am the group manager.
[06-06-13 9:32:50 am] | [06-06-13 9:32:50 am] Cleaning Task (13) Redding Staff
[06-06-13 9:32:50 am] | [06-06-13 9:32:50 am] Task (13) Redding Staff has been cleaned.
[06-06-13 9:33:00 am] * [06-06-13 9:33:00 am] Checking if I am the group manager.
[06-06-13 9:33:00 am] * [06-06-13 9:33:00 am] I am the group manager.
[06-06-13 9:33:10 am] * [06-06-13 9:33:10 am] Checking if I am the group manager.
[06-06-13 9:33:10 am] * [06-06-13 9:33:10 am] I am the group manager.
~
[06-06-13 10:29:26 am] * [06-06-13 10:29:26 am] I am the group manager.
[06-06-13 10:29:36 am] * [06-06-13 10:29:36 am] Checking if I am the group manager.
[06-06-13 10:29:36 am] * [06-06-13 10:29:36 am] I am the group manager.
[06-06-13 10:29:46 am] * [06-06-13 10:29:46 am] Checking if I am the group manager.
[06-06-13 10:29:46 am] * [06-06-13 10:29:46 am] I am the group manager.
[06-06-13 10:29:56 am] * [06-06-13 10:29:56 am] Checking if I am the group manager.
[06-06-13 10:29:56 am] * [06-06-13 10:29:56 am] I am the group manager.
[06-06-13 10:30:06 am] * [06-06-13 10:30:06 am] Checking if I am the group manager.
[06-06-13 10:30:06 am] * [06-06-13 10:30:06 am] I am the group manager.
[06-06-13 10:30:16 am] * [06-06-13 10:30:16 am] Checking if I am the group manager.
[06-06-13 10:30:16 am] * [06-06-13 10:30:16 am] I am the group manager.[/code][code]Terminal Excerpt:
root@REDDINGSSH:~# gunzip -c "/images/Windows7/d1p1.img" | /usr/local/sbin/udp-sender --min-receivers 2 --portbase 9000 --interface eth0 --half-duplex --ttl 32
Udp-sender 2007-12-28
Using mcast address 234.8.22.3
UDP sender for (stdin) at 10.8.22.3 on eth0
Broadcasting control to 224.0.0.1
New connection from 10.8.10.127 (#0) 00000009
Ready. Press any key to start sending data.
New connection from 10.8.10.181 (#1) 00000009
Ready. Press any key to start sending data.
Starting transfer: 00000009
bytes= 25 908 220 re-xmits=0000000 ( 0.0%) slice=1024 73 709 551 615 - 0
Transfer complete.
Disconnecting #0 (10.8.10.127)
Disconnecting #1 (10.8.10.181)
[/code][code]Udp-sender 2007-12-28
Using mcast address 234.8.22.3
UDP sender for (stdin) at 10.8.22.3 on eth0
Broadcasting control to 224.0.0.1
New connection from 10.8.10.127 (#0) 00000009
New connection from 10.8.10.181 (#1) 00000009
New connection from 10.8.10.26 (#2) 00000009
New connection from 10.8.10.62 (#3) 00000009
New connection from 10.8.10.198 (#4) 00000009
New connection from 10.8.11.19 (#5) 00000009
New connection from 10.8.10.60 (#6) 00000009
New connection from 10.8.11.16 (#7) 00000009
New connection from 10.8.10.53 (#8) 00000009
New connection from 10.8.10.54 (#9) 00000009
New connection from 10.8.10.83 (#10) 00000009
New connection from 10.8.10.51 (#11) 00000009
New connection from 10.8.10.244 (#12) 00000009
New connection from 10.8.10.8 (#13) 00000009
New connection from 10.8.10.142 (#14) 00000009
New connection from 10.8.10.138 (#15) 00000009
New connection from 10.8.11.48 (#16) 00000009
New connection from 10.8.10.67 (#17) 00000009
New connection from 10.8.10.145 (#18) 00000009
New connection from 10.8.11.27 (#19) 00000009
New connection from 10.8.10.61 (#20) 00000009
New connection from 10.8.10.115 (#21) 00000009
New connection from 10.8.10.254 (#22) 00000009
New connection from 10.8.10.100 (#23) 00000009
New connection from 10.8.11.24 (#24) 00000009
New connection from 10.8.10.177 (#25) 00000009
New connection from 10.8.10.46 (#26) 00000009
New connection from 10.8.10.239 (#27) 00000009
New connection from 10.8.11.72 (#28) 00000009[/code]
-
ALRIGHT I GOT IT!!!
I adjusted the MySQL information in both of the config files, found the sucker that wasn’t reporting, and rebooted it. Now I have transfer!!! Thanks for letting me bounce ideas around and keep track of my efforts.
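For anyone who lands here with the same symptom: the udp-sender output above lists 29 connections (#0 through #28) for a group of 30 machines, so one client never joined. A rough way to spot the straggler is to compare the "New connection from" lines against the IPs you expect. This is only a Python sketch; `connected_ips` and `missing_receivers` are made-up names, and the expected IP list is something you would have to supply from your own host records.

```python
import re

# Matches udp-sender lines such as: New connection from 10.8.10.127 (#0) 00000009
CONNECTION = re.compile(r"New connection from (\d+\.\d+\.\d+\.\d+) \(#\d+\)")


def connected_ips(sender_output):
    """Return the set of client IPs that udp-sender reported as connected."""
    return set(CONNECTION.findall(sender_output))


def missing_receivers(expected_ips, sender_output):
    """Return the expected IPs that never show up in the sender output, sorted as strings."""
    return sorted(set(expected_ips) - connected_ips(sender_output))
```

Feed it the captured udp-sender output as one string plus the list of client IPs in the group, and whatever comes back is the machine to go and reboot.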