FOG 33b - installation - HP Elite 8300 SFF
-
[quote=“Alistair, post: 25703, member: 1830”]Jaymes has found the problem!
sudo /etc/init.d/FOGMulticastManager stop
Stopping fog imaging solution: FOGMulticastManager
start-stop-daemon: warning: Failed to kill 1157: no such processsudo /etc/init.d/FOGMulticastManager start
Starting FOG solution: FOGMulticastManagerMulticast worked !!!
“no such process” !!!
It wasn’t running.[/quote]You may want to add an upstart job to start the multicast manager at a reboot.
-
[quote=“Alistair, post: 25701, member: 1830”]Jaymes - Thanks for those final tips on stopping and starting multicast.
I’ll try that. I’m not going back to 10.04.[/quote]I have a feeling there’s more here than meets the eye. The FOGMulticastManager service is what handles the starting and killing (once complete or timedout) of the multicast imaging types. What’s the sleep timer set to on your system? It’s usually defaulted to every 10 seconds, which worries me a little bit here. Is the firewall disabled on the FOG Server? [code]sudo ufw disable[/code]
[quote=“Alistair, post: 25701, member: 1830”][FONT=Times New Roman]I can’t spend any more time on this. I’m kicking a dead horse. I’m running with unicast. I know it’s slower and only 10 at a time.[/FONT][/quote][FONT=Times New Roman][/FONT]
[FONT=Times New Roman]What makes you think it’s slower? You can also adjust the number you want to queue. You just need to find the right balance. At home I do around 20 systems at a time just to test things out. As I stated earlier, multicast breaks my network but unicast sails just fine. It’s just as fast, if not faster than multicast even if I test from my work place.[/FONT][FONT=Times New Roman][quote=“Alistair, post: 25701, member: 1830”][/quote][/FONT][quote=“Alistair, post: 25701, member: 1830”]
[FONT=Times New Roman]I have no more time to use on this multicast problem. I’m usually good at fault finding, but here I have failed to find any pattern of success and failure. [/FONT][/quote][FONT=Times New Roman][/FONT]
[FONT=Times New Roman]Based on what I’ve obtained from the information, it seems that something is changed between the reboots that allows multicast to “suddenly” appear to work. Your restarting of the server shows that it’s another issue in whole. You restarted to do fault finding, but your restarting of the system was, seemingly, causing the failure in the first place.[/FONT]
[FONT=Times New Roman][/FONT]
[FONT=Times New Roman][quote=“Alistair, post: 25701, member: 1830”][/quote][/FONT][quote=“Alistair, post: 25701, member: 1830”][FONT=Times New Roman]One time in another school I had 50% of students who could print to a particular printer and 50% who could not. I eventually discovered that all those who had domain user names longer than 15 characters, could not print out on that particular printer. No problem in printing to other printers. Go figure. Over 20 years I’ve seen [U]many[/U] examples of 1 program creating a problem in another on various platforms. Drivers for smartboards screw up LibreOffice.[/FONT][/quote][FONT=Times New Roman][/FONT][FONT=Times New Roman][quote=“Alistair, post: 25701, member: 1830”][/quote][/FONT][quote=“Alistair, post: 25701, member: 1830”][FONT=Times New Roman]I suspect there is a problem in fog, maybe connected with firmware on my new HP Elite 8300 SFF and USDT clients.[/FONT][/quote][FONT=Times New Roman][/FONT]
[FONT=Times New Roman]What problem do you suspect? Didn’t you state that imaging works fine on these clients? Multicast (when working) and Unicast works perfectly fine?[/FONT]
[FONT=Times New Roman][quote=“Alistair, post: 25701, member: 1830”][/quote][/FONT][quote=“Alistair, post: 25701, member: 1830”][FONT=Times New Roman]I’m running fog 33b on one of the new SFF. But I also think it is strange and disappointing that I cannot unicast or multicast to my old HP DC7100 clients that I imaged many times from my old fogserver version 0.32.[/FONT][/quote][FONT=Times New Roman][/FONT]
[FONT=Times New Roman]What exactly happens on these systems? Is there a kernel panic, can you test the systems for compatibility? I can rebuild a kernel fairly quick, but I can’t fix a problem without information. All I’ve gotten from these DC7100’s is, they don’t work, but no messages showing what or why they may NOT be working in the firstplace.[/FONT]
[quote=“Alistair, post: 25701, member: 1830”]Thank you Tom, Jaymes and JunkHacker for all your input and time.
I will only give further comments in this thread if I discover something significant that you should know about.
[/quote]
I wish one wouldn’t just give up so quickly. I’ve spent a lot of time, as well as the rest of us trying to help. While I realize it’s frustrating, giving up is not the right approach. At least not in my eyes. -
I have spent weeks trying to sort out issues with these new HP PCs.
I’ve tried to get photos of the old HP PC screen output but school camera batteries go flat. Forget to take my camera with me.
The relevant error messages go by really fast. Cannot read them to write them down. I have been trying.
I have teachers coming in 3 days and no system ready. I have a crisis. I must move on.
I’m also not getting paid for my time here at all. I have done hundreds of hours in extra voluntary hours.Firewall is now disabled.
How do I access the sleep timer details to see if they are 10 seconds?
-
It’s obvious now that there is something wrong in my fogserver. Either firmware, hardware, linux, fog or a combination.
-
[quote=“Alistair, post: 25706, member: 1830”]I have spent weeks trying to sort out issues with these new HP PCs.
I’ve tried to get photos of the old HP PC screen output but school camera batteries go flat. Forget to take my camera with me.
The relevant error messages go by really fast. Cannot read them to write them down. I have been trying.
I have teachers coming in 3 days and no system ready. I have a crisis. I must move on.
I’m also not getting paid for my time here at all. I have done hundreds of hours in extra voluntary hours.Firewall is now disabled.
How do I access the sleep timer details to see if they are 10 seconds?[/quote]
Errors with units normally leads to kernel issues, you can try one of the updated kernels Tom provides, or you can build your own, there is a fabulous guide on the forums that will walk you through creating a kernel with specific drivers, but the ones Tom puts out are normally all inclusive.
I don’t think there is an option to change this setting, but if you check /opt/fog/log/multicast.log you will see every 10 seconds the multicast manager logs. Is it getting updated every 10 seconds? If so then it is working properly.
-
Yes every 10 seconds there are 6 items/lines
But the log stopped for 1 hour from 1230 to 13:30.
It also stopped 2 hours ago and has not recorded anything since then.The following is Chinese to me. I know nothing about kernels.
[I]Errors with units normally leads to kernel issues, you can try one of the updated kernels Tom provides, or you can build your own, there is a fabulous guide on the forums that will walk you through creating a kernel with specific drivers, but the ones Tom puts out are normally all inclusive.[/I]Are you saying that a part of the OS is corrupted and needs fixing? Or can I use the system temporarily as it is and just stop and start the FOGMulticastManager if I want to multicast?
[I][/I]
-
I’ve fixed the sleep 30 syntax.
-
the kernels he is talking about are the ones that the fog server sends to clients when they network boot
-
Ohhh
-
When the Client systems are loading, they load two files. First one is bzImage, second is init.xz.
The bzImage is a linux kernel, the init.xz file is the “OS” layer of the FOG System on the host.
The kernel Jaymes is referring to is the bzImage that’s loaded onto the client systems. There are no particular issues on the FOG Server itself. This is the kernel I’ve been maintaining as much as possible. It has NOTHING to do with the fog server, besides it is stored under /var/www/fog/service/ipxe/bzImage.
The fact that the FOGMulticastManager service is stopped leads me to think there is some problem, just not one that I can pinpoint without trying to get more information. If you’re willing to work forward I can try to help with this. I realize your not being paid for this and I feel the pain. Which is why I’m frustrated with you, from my eyes, seemingly just giving up.
-
Very helpful explanation.
Actually I haven’t given up. It’s just that I MUST get the staff system running in 2 days time. And that’s impossible. That’s the real source of my frustration. I’m happy to keep working at this for the sake of others in the future who might have similar problems and helping with 0.33b, but not for at least a week until I’ve got the school system up and running. I will keep you very briefly updated on anything I discover, but nothing time-consuming for the time being. I’m in the middle of what is, for me, a big migration and huge learning curve. All new PCs, Server, printers, VLANS, Firewall, multimedia systems. Going from XP to Win 8.1 is a big jump. I’ll be using SYSPREP for the first time and answer files, ADBA etc…
-
Have now connected everything to the school VLAN again.
Rebooted after fixing sleep 30.
Checked multicast log - no log for the last 3 hours
Multicast to 2 HP
TFTP is now running after reboot (problem fixed)
Multicast failed as expected
Stop and start multicast
Multicast to 2 HP
Multicast worked as expected
Checked multicast log - it is running now, BUT, the log shows 7:30 PM when it is actually 9:30 PM on the Ubuntu clock as it is in reality here in Norway. I just mention it in case it has significance.This multicast problem is now predictable and fixable for me. I’m heading home.