So i haven been working to get multicast working on our FOG server.
The network is configured correctly as I have a basic VM loaded with FOG and it is able to multicast fine.
Fresh install of FOG on Ubuntu 10.04 on our physical host, ubuntu updated.
Physical host is a dell 2950 server with Ethernet controller: Broadcom Corporation NetXtreme II BCM5708 Gigabit E thernet (rev 12)
All passwords are defaulted for testing purposes.
Logs Below.
[CODE]Multicast LOG:
[05-24-12 11:51:12 am] * Starting FOG Multicast Manager Service
[05-24-12 11:51:17 am] * [05-24-12 11:51:17 am] Checking for new tasks every 10 seconds.
[05-24-12 11:51:17 am] * [05-24-12 11:51:17 am] Starting service loop.
[05-24-12 11:51:17 am] | [05-24-12 11:51:17 am] Failed to use database: fog
[05-24-12 11:51:27 am] | [05-24-12 11:51:27 am] Failed to use database: fog
[05-24-12 11:51:37 am] | [05-24-12 11:51:37 am] Failed to use database: fog
[05-24-12 11:51:47 am] | [05-24-12 11:51:47 am] Failed to use database: fog
[05-24-12 11:51:57 am] | [05-24-12 11:51:57 am] Failed to use database: fog
[05-24-12 11:52:07 am] | [05-24-12 11:52:07 am] Failed to use database: fog
[05-24-12 11:52:17 am] | [05-24-12 11:52:17 am] Failed to use database: fog
[05-24-12 12:18:19 pm] * [05-24-12 12:18:19 pm] I am the group manager.
[05-24-12 12:18:19 pm] | [05-24-12 12:18:19 pm] Task (1) 111 is new!
[05-24-12 12:18:19 pm] | [05-24-12 12:18:19 pm] Task (1) 111 image file found.
[05-24-12 12:18:19 pm] | [05-24-12 12:18:19 pm] Task (1) 111 2 clients found.
[05-24-12 12:18:19 pm] | [05-24-12 12:18:19 pm] Task (1) 111 sending on base port: 63100
[05-24-12 12:18:19 pm] CMD: gunzip -c “/images/745test” | /usr/local/sbin/udp-sender --min-receivers 2 --portbase 63100 --interface eth0 --half-duplex --ttl 32 --nokbd
[05-24-12 12:18:24 pm] | [05-24-12 12:18:24 pm] Task (1) 111 has started.
[05-24-12 12:18:34 pm] * [05-24-12 12:18:34 pm] Checking if I am the group manager.
[05-24-12 12:18:34 pm] * [05-24-12 12:18:34 pm] I am the group manager.
[05-24-12 12:18:44 pm] * [05-24-12 12:18:44 pm] Checking if I am the group
[05-24-12 12:23:45 pm] | [05-24-12 12:23:45 pm] Task (1) 111 is no longer running.
[05-24-12 12:23:45 pm] | [05-24-12 12:23:45 pm] Task (1) 111 will not be cleaned yet (5 min delay).
[05-24-12 12:23:55 pm] * [05-24-12 12:23:55 pm] Checking if I am the group manager.[/CODE]
[CODE]Scheduler
[05-24-12 11:51:12 am] * Starting FOG Task Scheduler Service
[05-24-12 11:51:12 am] * [05-24-12 11:51:12 am] Checking for new tasks every 60 seconds.
[05-24-12 11:51:12 am] * [05-24-12 11:51:12 am] Starting service loop.
[05-24-12 11:51:12 am] * [05-24-12 11:51:12 am] Looking for shared library information…
[05-24-12 11:51:12 am] * [05-24-12 11:51:12 am] shared library found.
[05-24-12 11:51:12 am] | [05-24-12 11:51:12 am] Failed to use database: fog
[05-24-12 11:52:12 am] | [05-24-12 11:52:12 am] Failed to use database: fog
[05-24-12 11:53:12 am] | [05-24-12 11:53:12 am] Failed to use database: fog
[05-24-12 11:54:12 am] * [05-24-12 11:54:12 am] Checking if I am the group manager.
[05-24-12 11:54:12 am] * [05-24-12 11:54:12 am] I am the group manager.
[05-24-12 11:54:12 am] * [05-24-12 11:54:12 am] No tasks found.
[05-24-12 11:55:12 am] * [05-24-12 11:55:12 am] Checking if I am the group manager.[/CODE]
The clients hang at the please wait screen, and the task is eventually cleaned.
I’m not sure what the “failed to use database” means, or if it means anything.
The fact I can send a mutlicast with a VM but not a physical box doesn’t make a lot of sense to me. same network, same switches, same clients.
Unicast works great and always has.
Any help is appreciate.
EDIT:
multicast.log.udpcast says:
tail -f multicast.log.udpcast.2
Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=10000
Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=10000
Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=10000
Timeout notAnswered=[0,1] notReady=[0,1] nrAns=0 nrRead=0 nrPart=2 avg=10000
Dropping client #0 because of timeout
Disconnecting #0 (IP)
Dropping client #1 because of timeout
Disconnecting #1 (IP)