Hey Tom, take your time and get yourself settled!
I am wondering if this is a good point too where we reach out to the community for people to join into the FOG team to help work on the code, fix issues, improve stuff here and there.
@Tom-Elliott, @Developers, @Moderators
I got carried away trying to improve the current iPXE script. Now that I’ve dug into the syntax and found some interesting new stuff I want to see what you guys think about this:
#!ipxe
isset ${net0/mac} && dhcp net0 || goto dhcpnet1
echo Received DHCP answer on interface net0 && goto proxycheck
:dhcpnet1
isset ${net1/mac} && dhcp net1 || goto dhcperror
echo Received DHCP answer on interface net1 && goto proxycheck
:dhcperror
prompt --key s --timeout 10000 DHCP failed, hit 's' for the iPXE shell; reboot in 10 seconds && shell || reboot
:proxycheck
isset ${proxydhcp/next-server} && isset ${next-server} && echo Duplicate option 66 (next server) from DHCP proxy and DHCP server && echo Using IP sent by DHCP proxy ${proxydhcp/next-server} && prompt --timeout 5000 || goto nextservercheck
:nextservercheck
isset ${proxydhcp/next-server} && set next-server ${proxydhcp/next-server} ||
isset ${next-server} && goto netboot || goto setserv
:setserv
echo -n Please enter tftp server: && read next-server && goto netboot || goto setserv
:netboot
chain tftp://${next-server}/default.ipxe ||
prompt --key s --timeout 10000 Chainloading failed, hit 's' for the iPXE shell; reboot in 10 seconds && shell || reboot
Feel free to comment and improve. I’ve tested the script and tried to remember all the issues I came across in the last months but I am sure we’re not there yet.
@TrialAndError said:
Obviously the development of FOG stopped after a long time of hard work.
It would be friendly by the developers to inform the users about that.
I am not sure what exactly you mean. This is an open source project and we have no time schedule for new releases and we don’t have a set list of features to add or bugs to resolve. We simply do what we can and like.
It’s quite bizarre to state that FOG development stopped… totally wrong!
I want to thank Tom again for this! I tried to assist and help but I just don’t know the code (and its history) as much as Tom does. He’s done all the hard work to find and fix this issue. Along the way he also fixed a couple other things as well.
@Tom-Elliott Don’t say sorry. FOG is work in progress and you are pushing things way forward!! :metal:
We are working towards a most stable release of the 1.5.x line of FOG and will publish release candidates of FOG 1.5.9 for this over the next weeks. We ask people to participate and help test to get the final release as good as we can.
@Wirefall Great that you posted the full kernel message listing. At first I didn’t notice anything wrong, but looking closer I found the issue:
igb: probe of 0000:01:00.0 failed with error -2
The PCI ID perfectly matches the one you mentioned in your first post. It didn’t take long to find several reports on this issue that came in just lately:
https://lkml.org/lkml/2016/11/24/172
https://bugzilla.suse.com/show_bug.cgi?id=1009911
https://patchwork.ozlabs.org/patch/700615/
Some say that it might possibly work if you disable PXE boot for this NIC in BIOS. I don’t think this is a great solution as FOG heavily relies on PXE booting the clients. Let’s hope that this will be fixed in the latest kernel fairly soon!
@sudburr Why not re-install the whole system from scratch?
@Developers @Moderators Ok, I just opened a pull request to remove the 7156 binaries. Please all keep that in mind. Might cause some minor confusion in the next weeks. But it’s good to get rid of it.
As you can all see, the tests of the most recent iPXE version show that things are back to normal again. I reported back to Michael Brown on the iPXE devel mailing list, so he’s got that in mind as well.
@Avaryan To me this sounds like a different issue. Would you mind opening a new thread on this and posting more details (FOG version, USB ID of the NIC used as shown with lsusb, …)?
I’ll mark this solved/closed now. @Psycholiquid Thanks heaps for the great work on testing all the binaries and such!
@julio said in application fog:
packet tracer, visual studio code, notepad++, putty, googleChrome and adobe
Installing those on Linux doesn’t make much sense to me… are we talking about different things here?
@danieln said in Clients imaging despite recieving "Read ERROR: No such file or directory" and "ata1.00: failed command" errors:
The output of md5sum /images/DellE5450-80-Non-Office/d1p1.img on the Master node was:
e929a14a17c60b2b9a7dfdf18f526232 /images/DellE5450-80-Non-Office/d1p1.img
The output of md5sum /images/DellE5450-80-Non-Office/d1p1.img on the problematic node was:
1d4bf4ac2bcef83013fe4589149b0e30 /images/DellE5450-80-Non-Office/d1p1.img
That’s very interesting. I did not expect the checksums to be different, so it’s good that I asked. To me that means the file was not replicated properly from the master to the storage node. So please delete /images/DellE5450-80-Non-Office/d1p1.img on the storage node and wait until it has been replicated from the master again. Then check the md5sums again. The FOG replication service checks file size and checksums (the checksum check only happens for smaller files because calculating checksums for large files on every run puts too much load on the server), but this seems to be one of the rare cases where the file size matches but the checksum doesn’t.
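To make the comparison less error-prone, here is a minimal sketch of a size-then-checksum check. It is not the code the FOG replication service runs. The function names checksum_of and same_image are made up for illustration, and it assumes both copies are reachable as local paths (for example after copying the node’s file over or mounting its /images share).

```shell
#!/bin/bash
# Hypothetical helpers, NOT part of FOG.

# checksum_of FILE: print only the MD5 hash of FILE (without the file name).
checksum_of() {
    md5sum "$1" | awk '{print $1}'
}

# same_image FILE_A FILE_B: succeed only if size and MD5 checksum both match.
# Returns 2 if one of the files cannot be read, 1 if the copies differ.
same_image() {
    local size_a size_b
    size_a=$(wc -c < "$1") || return 2
    size_b=$(wc -c < "$2") || return 2
    # Cheap check first: different sizes can never be identical copies.
    [ "$size_a" -eq "$size_b" ] || return 1
    # Sizes match, so only the checksum can tell the copies apart.
    [ "$(checksum_of "$1")" = "$(checksum_of "$2")" ]
}
```

Something like `same_image /path/to/master/d1p1.img /path/to/node/d1p1.img || echo "copies differ"` would then flag exactly the situation above, where the size is equal but the content is not (both paths are placeholders).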
I am using Zstd for compression. Do you recommend Gzip? What are the pros/cons of both?
Both are fine. I tend to use Zstd more and more.
you’re thinking its more along the lines of hardware issues with the laptop and not with FOS or the Node itself? I feel like it only throws those ATA errors when connecting to that one node, but I could be wrong. Maybe that’s the next thing i’ll test.
Yes I would say it’s very unlikely to be caused by FOS or the node unless you have different FOG/kernel versions installed.
@klisza1993 said in Fresh FOG installation and no Client detected:
Middleware::Communication ERROR: Communication ERROR: Request Aborted: Unable to create SSL / TLS secure channel.
I have never seen this error before and I am not sure yet where it might come from. In your first post you said you didn’t enable HTTPS when running the installer, but your later post says you are using HTTPS as well. So I would imagine that a request to the HTTP URL is being redirected to HTTPS (the default when you enable HTTPS with the installer) and the error stems from an issue with the server certificate.
Did you manually change the webserver configuration?
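One way to check the redirect theory is to look at the response headers of a plain HTTP request. The following is only a sketch: the function name redirects_to_https is made up, and the server address in the usage comment is a placeholder for your own.

```shell
#!/bin/bash
# Hypothetical helper, NOT part of FOG.

# redirects_to_https: read HTTP response headers on stdin and succeed
# if a Location header points to an https:// URL.
redirects_to_https() {
    # Strip carriage returns, then look for the header case-insensitively.
    tr -d '\r' | grep -qi '^location: *https://'
}

# Usage (placeholder address, replace with your FOG server):
#   curl -sI http://fogserver.example/fog/ | redirects_to_https \
#       && echo "HTTP requests are redirected to HTTPS"
```

If the redirect is there, the next thing to look at would be the certificate the server presents on the HTTPS URL.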
During troubleshooting I discovered that I can communicate with the server by pasting this link into a browser:
https://192.168.205.132//fog/service/register.php?hostname=rk-test&mac=D6:C8:93:84:8B:24&newService&json
From my point of view it’s NOT SECURE.
Surely not ideal but you need to consider that FOG is not a secure product. Very few people help working on the code to find and fix bugs. You are more than welcome to join the force and get this out of the way.
@julio This is a public forum where people help others in their free time. You can’t expect anyone to help/answer especially not in such a short time.
You should be able to run snapins on Linux systems through the fog-client software. Most Linux software comes from external repositories, so to install such software you just need to call a bash script that does the job, for example:
#!/bin/bash
apt update
apt -y install software-xyz
or for CentOS/RHEL clients
#!/bin/bash
dnf -y install software-xyz
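If the same snapin has to work on both kinds of clients, the two scripts above could be merged along these lines. This is just a sketch: detect_pkg_manager and install_package are made-up names, software-xyz is the same placeholder as above, and it assumes the snapin runs with root privileges.

```shell
#!/bin/bash
# Hypothetical combined snapin script, NOT shipped with FOG.

# detect_pkg_manager: print "apt" or "dnf", whichever is found first in PATH.
detect_pkg_manager() {
    local candidate
    for candidate in apt dnf; do
        if command -v "$candidate" >/dev/null 2>&1; then
            echo "$candidate"
            return 0
        fi
    done
    return 1
}

# install_package NAME: install NAME with the detected package manager.
install_package() {
    local manager
    manager=$(detect_pkg_manager) || {
        echo "no supported package manager found" >&2
        return 1
    }
    case "$manager" in
        apt) apt update && apt -y install "$1" ;;
        dnf) dnf -y install "$1" ;;
    esac
}

# Example call (placeholder package name):
#   install_package software-xyz
```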
If you need further help you need to tell us what exactly you are trying to install.
@danieln said in Clients imaging despite recieving "Read ERROR: No such file or directory" and "ata1.00: failed command" errors:
I only appear to get the “No such file or directory” error on the node I set up yesterday.
I should have explained this a bit more in depth earlier. FOS (the Linux OS doing all the work) reads from the file (e.g. d1p1.img), piping it through a decompression fifo. So if partclone says “No such file” it’s very likely that the decompression fifo died for some reason (file corrupted, RAM issue, …) and partclone is not able to read from it anymore.
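To illustrate the mechanism, here is a rough sketch of reading an image through a named pipe. It is not the actual FOS code: gzip stands in for whichever compression is configured, cat stands in for partclone, and the function name read_through_fifo is made up.

```shell
#!/bin/bash
# Hypothetical demo of the fifo mechanism, NOT the real FOS script.

# read_through_fifo FILE: decompress FILE (gzip) into a named pipe and
# print whatever the reader gets from that pipe.
# Returns the exit status of the decompressor.
read_through_fifo() {
    local image="$1" workdir fifo writer status
    workdir=$(mktemp -d) || return 1
    fifo="$workdir/decompress.fifo"
    mkfifo "$fifo" || { rm -rf "$workdir"; return 1; }
    # Writer: decompress into the fifo in the background.
    gzip -dc "$image" > "$fifo" 2>/dev/null &
    writer=$!
    # Reader: stand-in for partclone, which only ever sees the fifo.
    cat "$fifo"
    # If the writer died (corrupt file, ...), the reader simply ran dry.
    wait "$writer"
    status=$?
    rm -rf "$workdir"
    return "$status"
}
```

The point of the sketch is that the reader never touches the image file itself. When the decompressor fails, the reader just runs out of data, which is why an error on the reading side can point to a problem with the file or the decompression.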
Please run file /images/DellE5450-80-Non-Office/d1p1.img and md5sum /images/DellE5450-80-Non-Office/d1p1.img on both your nodes and compare the output. Which compression do you use, Gzip or Zstd?
Do you think it’s maybe isolated to the image? I’d assume the ATA errors have something to do with the hard drive but I’m not sure what.
The ATA errors stem from the same FOS (FOG Linux OS) and I would read them as some kind of issue between the Linux kernel and those particular notebooks. It is possible the deploy is fine despite the messages, but I am not sure. When you search the web for those ATA messages, people say that very often the SATA cable or even the power supply (in PCs) can cause such messages. Windows is often less picky about this kind of thing, so I can imagine Linux complaining (while still trying hard) where Windows does not.
@danieln Have you tried imaging that exact same machine from both servers, and do you only get the “No such file or directory” error on the latter one?
And asking again, do you have another notebook - exact same model - that you can deploy to, just to see if you get the same ATA errors?!
@dominique said in Database engine MyISAM - upgrade FOG?:
Any idea when 1.6.0 will be general available?
There is no timeline for this yet. Many people use FOG but very few help working on it. So things are going very slow at the moment. We’ll bring out a last 1.5.x release (1.5.10) to finish up that line and then the next will be a first 1.6 release (months down the road).
If you are keen to get InnoDB earlier you can follow the topic George posted. From my point of view this should be possible in 1.5.x setups as well. But we won’t officially support this and don’t want to push a major change into the 1.5.x branch anymore.
@palloquin said in fog client no longer rebooting system to deploy, where to start?:
21/01/2021 11:26 Middleware::Communication URL: http://172.16.1.2/fog/management/index.php?sub=requestClientInfo&mac=00:FF:3E:84:3E:C1|40:A8:F0:61:47:B3&newService&json
21/01/2021 11:26 Middleware::Communication ERROR: Could not contact FOG server
21/01/2021 11:26 Middleware::Communication ERROR: The remote server returned an error: (500) Internal Server Error.
Those are the relevant lines from the log. Please check the Apache and PHP-FPM error logs (see my signature) to find out what is causing the HTTP 500 internal server error.
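A quick way to pull the most recent error lines out of a log could look like the sketch below. The function name recent_errors is made up, and the paths in the usage comment are only common defaults that I am assuming here; they differ between distributions and PHP versions, so adjust them to your system.

```shell
#!/bin/bash
# Hypothetical helper, NOT part of FOG.

# recent_errors LOGFILE [COUNT]: print the last COUNT (default 20) lines
# of LOGFILE that mention "error", case-insensitively.
recent_errors() {
    local logfile="$1" count="${2:-20}"
    [ -r "$logfile" ] || { echo "cannot read $logfile" >&2; return 1; }
    grep -i 'error' "$logfile" | tail -n "$count"
}

# Usage (assumed paths, check where your distribution puts the logs):
#   recent_errors /var/log/apache2/error.log
#   recent_errors /var/log/httpd/error_log 50
```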
@aschaffer What do you mean by “needs to be touched”?
By the way, this topic is years old and you might want to create a new one and post all your specific details there like version and a detailed description…
@danieln Please run ls -al /images/DellE5450-80-Non-Office/ on your FOG server console and post the output here.
It’s strange that you get that many ATA error messages and the deploy still finishes. I would never have expected that! Do you have another device of the exact same model? Does it show the same error messages when deploying to that?