• Recent
    • Unsolved
    • Tags
    • Popular
    • Users
    • Groups
    • Search
    • Register
    • Login

    Storage Nodes stop reporting after a while

    Scheduled Pinned Locked Moved Solved
    FOG Problems
    2
    28
    4.2k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Wayne WorkmanW
      Wayne Workman
      last edited by Wayne Workman

      I checked the nodes version reporting and interface reporting this morning - it’s still working.

      So - I have no idea why but - the moral of the story is if you change your fogstorage password on the main database, after changing the snmysqlpass field in /opt/fog/.fogsettings on each node and re-running the installer, reboot the node.

      Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
      Daily Clean Installation Results:
      https://fogtesting.fogproject.us/
      FOG Reporting:
      https://fog-external-reporting-results.fogproject.us/

      1 Reply Last reply Reply Quote 0
      • Wayne WorkmanW
        Wayne Workman
        last edited by

        I spoke too soon.

        They are all not reporting again. Considering - they stayed reporting all night long into the morning before anyone got here - this must be load related somehow.

        Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
        Daily Clean Installation Results:
        https://fogtesting.fogproject.us/
        FOG Reporting:
        https://fog-external-reporting-results.fogproject.us/

        1 Reply Last reply Reply Quote 0
        • Wayne WorkmanW
          Wayne Workman
          last edited by

          I wrote a script that reboots them all, and ran that. They are all now reporting again…

          This wasn’t an issue until I changed the password for fogstorage - Why would that cause this?

          Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
          Daily Clean Installation Results:
          https://fogtesting.fogproject.us/
          FOG Reporting:
          https://fog-external-reporting-results.fogproject.us/

          1 Reply Last reply Reply Quote 0
          • Wayne WorkmanW
            Wayne Workman
            last edited by Wayne Workman

            Below is the apache error log from one of the nodes, snipped from when the server rebooted to when the issue happened again. The server operated for exactly 37 minutes before falling off again. All storage nodes fell off at the same time - since they were all rebooted at the same time.

            [Thu Aug 18 08:29:09.737517 2016] [mpm_prefork:notice] [pid 1504] AH00170: caught SIGWINCH, shutting down gracefully
            [Thu Aug 18 08:29:22.823456 2016] [core:notice] [pid 1503] SELinux policy enabled; httpd running as context system_u:system_r:httpd_t:s0
            [Thu Aug 18 08:29:22.907692 2016] [suexec:notice] [pid 1503] AH01232: suEXEC mechanism enabled (wrapper: /usr/sbin/suexec)
            [Thu Aug 18 08:29:23.127056 2016] [ssl:warn] [pid 1503] AH02292: Init: Name-based SSL virtual hosts only work for clients with TLS server name indication support (RFC 4366)
            AH00558: httpd: Could not reliably determine the server's fully qualified domain name, using fe80::215:5dff:fe01:a00. Set the 'ServerName' directive globally to suppress this message
            [Thu Aug 18 08:29:23.149139 2016] [auth_digest:notice] [pid 1503] AH01757: generating secret for digest authentication ...
            [Thu Aug 18 08:29:23.149672 2016] [lbmethod_heartbeat:notice] [pid 1503] AH02282: No slotmem from mod_heartmonitor
            [Thu Aug 18 08:29:23.150447 2016] [ssl:warn] [pid 1503] AH02292: Init: Name-based SSL virtual hosts only work for clients with TLS server name indication support (RFC 4366)
            [Thu Aug 18 08:29:23.230529 2016] [mpm_prefork:notice] [pid 1503] AH00163: Apache/2.4.6 (CentOS) OpenSSL/1.0.1e-fips PHP/5.6.24 configured -- resuming normal operations
            [Thu Aug 18 08:29:23.230555 2016] [core:notice] [pid 1503] AH00094: Command line: '/usr/sbin/httpd -D FOREGROUND'
            [Thu Aug 18 09:07:34.373250 2016] [:error] [pid 2727] [client 10.51.1.53:44934] PHP Fatal error:  Uncaught exception 'UnexpectedValueException' with message 'RecursiveDirectoryIterator::__construct(/var/www/html/fog/lib/plugins/capone/html/images): failed to open dir: Too many open files' in /var/www/html/fog/lib/fog/eventmanager.class.php:62\nStack trace:\n#0 [internal function]: RecursiveDirectoryIterator->__construct('/var/www/html/f...', 4096)\n#1 [internal function]: RecursiveDirectoryIterator->getChildren()\n#2 [internal function]: FilterIterator->next()\n#3 /var/www/html/fog/lib/fog/eventmanager.class.php(62): iterator_to_array(Object(RegexIterator), false)\n#4 /var/www/html/fog/commons/init.php(137): EventManager->load()\n#5 /var/www/html/fog/commons/base.inc.php(10): require('/var/www/html/f...')\n#6 /var/www/html/fog/status/bandwidth.php(2): require_once('/var/www/html/f...')\n#7 {main}\n  thrown in /var/www/html/fog/lib/fog/eventmanager.class.php on line 62
            [Thu Aug 18 09:07:38.374827 2016] [:error] [pid 2727] [client 10.51.1.53:45122] PHP Fatal error:  Uncaught exception 'UnexpectedValueException' with message 'RecursiveDirectoryIterator::__construct(/var/www/html/fog/lib/plugins/capone/html/images): failed to open dir: Too many open files' in /var/www/html/fog/commons/init.php:14\nStack trace:\n#0 [internal function]: RecursiveDirectoryIterator->__construct('/var/www/html/f...', 4096)\n#1 [internal function]: RecursiveDirectoryIterator->getChildren()\n#2 [internal function]: FilterIterator->next()\n#3 /var/www/html/fog/commons/init.php(14): iterator_to_array(Object(RegexIterator))\n#4 /var/www/html/fog/commons/init.php(121): Initiator->__construct()\n#5 /var/www/html/fog/commons/base.inc.php(10): require('/var/www/html/f...')\n#6 /var/www/html/fog/status/bandwidth.php(2): require_once('/var/www/html/f...')\n#7 {main}\n  thrown in /var/www/html/fog/commons/init.php on line 14
            [Thu Aug 18 09:07:42.356020 2016] [:error] [pid 2727] [client 10.51.1.53:45330] PHP Fatal error:  Uncaught exception 'UnexpectedValueException' with message 'RecursiveDirectoryIterator::__construct(/var/www/html/fog/lib/plugins/capone/html/images): failed to open dir: Too many open files' in /var/www/html/fog/commons/init.php:14\nStack trace:\n#0 [internal function]: RecursiveDirectoryIterator->__construct('/var/www/html/f...', 4096)\n#1 [internal function]: RecursiveDirectoryIterator->getChildren()\n#2 [internal function]: FilterIterator->next()\n#3 /var/www/html/fog/commons/init.php(14): iterator_to_array(Object(RegexIterator))\n#4 /var/www/html/fog/commons/init.php(121): Initiator->__construct()\n#5 /var/www/html/fog/commons/base.inc.php(10): require('/var/www/html/f...')\n#6 /var/www/html/fog/status/bandwidth.php(2): require_once('/var/www/html/f...')\n#7 {main}\n  thrown in /var/www/html/fog/commons/init.php on line 14
            [Thu Aug 18 09:07:46.249272 2016] [:error] [pid 2727] [client 10.51.1.53:45518] PHP Fatal error:  Uncaught exception 'UnexpectedValueException' with message 'RecursiveDirectoryIterator::__construct(/var/www/html/fog/lib/plugins/capone/html/images): failed to open dir: Too many open files' in /var/www/html/fog/commons/init.php:14\nStack trace:\n#0 [internal function]: RecursiveDirectoryIterator->__construct('/var/www/html/f...', 4096)\n#1 [internal function]: RecursiveDirectoryIterator->getChildren()\n#2 [internal function]: FilterIterator->next()\n#3 /var/www/html/fog/commons/init.php(14): iterator_to_array(Object(RegexIterator))\n#4 /var/www/html/fog/commons/init.php(121): Initiator->__construct()\n#5 /var/www/html/fog/commons/base.inc.php(10): require('/var/www/html/f...')\n#6 /var/www/html/fog/status/bandwidth.php(2): require_once('/var/www/html/f...')\n#7 {main}\n  thrown in /var/www/html/fog/commons/init.php on line 14
            

            Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
            Daily Clean Installation Results:
            https://fogtesting.fogproject.us/
            FOG Reporting:
            https://fog-external-reporting-results.fogproject.us/

            1 Reply Last reply Reply Quote 1
            • Wayne WorkmanW
              Wayne Workman
              last edited by

              Found a blurb about this error on Stack Overflow:

              http://stackoverflow.com/questions/14748499/fatal-error-too-many-open-files

              Quote:

              Don’t store DirectoryIterator objects for later; you will get an error saying “too many open files” when you store more than the operating system limit (usually 256 or 1024).

              For example, this will yield an error if the directory has too many files:

              <?php 
              $files = array(); 
              foreach (new DirectoryIterator('myDir') as $file) { 
                  $files[] = $file; 
              } 
              ?>
              

              Presumably, this approach is memory intensive as well.

              source: http://php.net/manual/pt_BR/directoryiterator.construct.php#87425

              Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
              Daily Clean Installation Results:
              https://fogtesting.fogproject.us/
              FOG Reporting:
              https://fog-external-reporting-results.fogproject.us/

              1 Reply Last reply Reply Quote 0
              • Wayne WorkmanW
                Wayne Workman
                last edited by

                Here is a more detailed post about it:

                http://superuser.com/questions/871276/failed-to-open-stream-too-many-open-files-php-1024-maximum-limit-redhat-wh

                Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                Daily Clean Installation Results:
                https://fogtesting.fogproject.us/
                FOG Reporting:
                https://fog-external-reporting-results.fogproject.us/

                1 Reply Last reply Reply Quote 0
                • Wayne WorkmanW
                  Wayne Workman
                  last edited by Wayne Workman

                  I’ve found this site that shows how to set new limits properly on Ubuntu.

                  https://www.drupal.org/node/474152

                  I’ve converted the commands for CentOS 7/Fedora/RHEL - and made it into an SSH command ran against a remote server.

                  ssh aifog "echo 'apache soft nofile 1600' >> /etc/security/limits.conf;echo 'apache hard nofile 65535' >> /etc/security/limits.conf;systemctl restart httpd"

                  Note that the above is an ssh command that uses aliases and pki-based certificates - that my project called ssh-pki-setup has already established for me. That command executes the commands in double quotes against the remote server, and it successfully added the options to the remote server’s file and restarts httpd.

                  If successful, I’ll do this for all storage nodes - and post commands that can be ran manually too if you don’t have so many servers to manage as I do and prefer manual.

                  Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                  Daily Clean Installation Results:
                  https://fogtesting.fogproject.us/
                  FOG Reporting:
                  https://fog-external-reporting-results.fogproject.us/

                  1 Reply Last reply Reply Quote 0
                  • Wayne WorkmanW
                    Wayne Workman
                    last edited by

                    After the last reboot - all storage nodes are still reporting. But I assume they will eventually fail. I’m just waiting for that to happen - to see if the one I modified hangs in there.

                    In the mean time, I’ve been watching the number of files open by apache with this command I put together: lsof -l -u apache | wc -l

                    Open files have been steadily growing:

                    [root@wgfog ~]# lsof -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6316
                    [root@wgfog ~]# lsof -u apache -l | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6412
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6424
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6432
                    [root@wgfog ~]# clear
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6444
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6448
                    [root@wgfog ~]#
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6448
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6456
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6456
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6464
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6472
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    7010
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    7538
                    [root@wgfog ~]# [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6444
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6448
                    [root@wgfog ~]#
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6448
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6456
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6456
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6464
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    6472
                    [root@wgfog ~]# lsof -l -u apache | wc -l
                    lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/42/gvfs
                          Output information may be incomplete.
                    7010
                    

                    Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                    Daily Clean Installation Results:
                    https://fogtesting.fogproject.us/
                    FOG Reporting:
                    https://fog-external-reporting-results.fogproject.us/

                    1 Reply Last reply Reply Quote 0
                    • Wayne WorkmanW
                      Wayne Workman
                      last edited by

                      Ok, the node I’ve been monitoring open files on - just fell off.

                      And - the magical limit is… 8006 files open by Apache - causes everything to just stop.

                      And… the one storage node I modified… it stopped reporting too… 😞

                      Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                      Daily Clean Installation Results:
                      https://fogtesting.fogproject.us/
                      FOG Reporting:
                      https://fog-external-reporting-results.fogproject.us/

                      1 Reply Last reply Reply Quote 0
                      • Wayne WorkmanW
                        Wayne Workman
                        last edited by

                        This article seems relevant but I don’t see how it could help in the case of FOG. https://httpd.apache.org/docs/current/vhosts/fd-limits.html

                        Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                        Daily Clean Installation Results:
                        https://fogtesting.fogproject.us/
                        FOG Reporting:
                        https://fog-external-reporting-results.fogproject.us/

                        1 Reply Last reply Reply Quote 0
                        • Wayne WorkmanW
                          Wayne Workman
                          last edited by

                          I think this must be a file descriptors leak in the storage node configuration.

                          I’m going to patch it myself after lunch.

                          Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                          Daily Clean Installation Results:
                          https://fogtesting.fogproject.us/
                          FOG Reporting:
                          https://fog-external-reporting-results.fogproject.us/

                          1 Reply Last reply Reply Quote 0
                          • Wayne WorkmanW
                            Wayne Workman
                            last edited by

                            It would also appear that Apache is upping the file descriptors limits as it can - but it just can’t keep up, because there are so many files getting opened so fast, and not closed.

                            Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                            Daily Clean Installation Results:
                            https://fogtesting.fogproject.us/
                            FOG Reporting:
                            https://fog-external-reporting-results.fogproject.us/

                            1 Reply Last reply Reply Quote 0
                            • Wayne WorkmanW
                              Wayne Workman
                              last edited by Wayne Workman

                              Figured out the bandwidth monitor is the primary file that’s getting so many calls. bandwidth.php

                              Currently hunting down every place where it’s used - including it’s define line in config.class.php called STORAGE_BANDWIDTHPATH

                              Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                              Daily Clean Installation Results:
                              https://fogtesting.fogproject.us/
                              FOG Reporting:
                              https://fog-external-reporting-results.fogproject.us/

                              1 Reply Last reply Reply Quote 0
                              • Tom ElliottT
                                Tom Elliott
                                last edited by

                                [root@mastaweb fog]# grep -rl 'fopen' .
                                ./status/logtoview.php
                                ./lib/fog/eventmanager.class.php
                                ./lib/fog/reportmaker.class.php
                                ./lib/fog/hookmanager.class.php
                                ./lib/fog/tasktype.class.php
                                ./lib/fog/fogbase.class.php
                                ./lib/fog/schema.class.php
                                ./lib/fog/fogpage.class.php
                                ./lib/pages/fogconfigurationpage.class.php
                                ./lib/db/mysqldump.class.php
                                ./lib/service/fogservice.class.php
                                ./lib/client/snapinclient.class.php
                                ./client/download.php
                                [root@mastaweb fog]# grep -rl 'readfile' .
                                [root@mastaweb fog]# grep -rl 'file_get_content' .
                                ./status/bandwidth.php
                                ./lib/fog/fogbase.class.php
                                

                                These are a list of all files that are calling fopen and file_get_contents.

                                Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG! Get in contact with me (chat bubble in the top right corner) if you want to join in.

                                Web GUI issue? Please check apache error (debian/ubuntu: /var/log/apache2/error.log, centos/fedora/rhel: /var/log/httpd/error_log) and php-fpm log (/var/log/php*-fpm.log)

                                Please support FOG if you like it: https://wiki.fogproject.org/wiki/index.php/Support_FOG

                                1 Reply Last reply Reply Quote 1
                                • Wayne WorkmanW
                                  Wayne Workman
                                  last edited by

                                  I’m struggling to find the file descriptor leak.

                                  For now, I’ve patched together a way to keep things going. This is a CRON script that executes ssh commands against remote servers - and uses existing pki ssh authentication. I’ve set it up to run every 2 minutes.

                                  echo=$(command -v echo)
                                  ssh=$(command -v ssh)
                                  
                                  
                                  array=( aifog annex bmfog clfog cvfog ckfog dufog fmfog hffog jwfog lhfog prfog rofog wgfog )
                                  for i in "${array[@]}"
                                  do
                                  
                                      $ssh $i "lsof -l -u apache | wc -l > /root/apacheOpenFiles.txt" > /dev/null 2>&1
                                      openApacheFiles=$($ssh $i "cat /root/apacheOpenFiles.txt")
                                      if [[ "$openApacheFiles" -gt 8000 ]]; then
                                          $ssh $i "killall --user apache" > /dev/null 2>&1
                                      fi
                                  done
                                  

                                  Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                  Daily Clean Installation Results:
                                  https://fogtesting.fogproject.us/
                                  FOG Reporting:
                                  https://fog-external-reporting-results.fogproject.us/

                                  1 Reply Last reply Reply Quote 0
                                  • Wayne WorkmanW
                                    Wayne Workman
                                    last edited by

                                    Additionally - apache error logs are filling up really fast on all the storage nodes from this issue. I’m going to have to script clearing them.

                                    Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                    Daily Clean Installation Results:
                                    https://fogtesting.fogproject.us/
                                    FOG Reporting:
                                    https://fog-external-reporting-results.fogproject.us/

                                    1 Reply Last reply Reply Quote 0
                                    • Wayne WorkmanW
                                      Wayne Workman
                                      last edited by

                                      Cross-linking similar thread:
                                      https://forums.fogproject.org/topic/8314/fog-log-viewer-for-nodes-giving-error

                                      Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                      Daily Clean Installation Results:
                                      https://fogtesting.fogproject.us/
                                      FOG Reporting:
                                      https://fog-external-reporting-results.fogproject.us/

                                      1 Reply Last reply Reply Quote 0
                                      • Wayne WorkmanW
                                        Wayne Workman
                                        last edited by Wayne Workman

                                        Update on this issue - I can reproduce it at home quite easily.

                                        Using just my laptop, I simulated many users at once just by opening 10 tabs and letting them sit on the fog dashboard.

                                        Very, very quickly, my one storage node at home stopped reporting it’s version and interface.

                                        Even after closing all but one tab, it continued to not respond.

                                        Checking the number of open files by the user apache with the command lsof -l -u apache | wc -l the number of open files after just a few short moments is 47029.

                                        After waiting a while - and without taking any corrective action - the number of open files by apache dropped to 11812 and the node started reporting it’s version and interface again.

                                        My guess is either Linux, or httpd is closing the files on it’s own after some time, because it might have it’s very own built-in cleanup mechanisms. I don’t think the files are being closed properly by FOG.

                                        So - my guess is that 2 to 4 people at work have the fog dashboard open and just let it sit, all day long. This isn’t necessarily wrong to do, and FOG should be able to cope with this.

                                        Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                        Daily Clean Installation Results:
                                        https://fogtesting.fogproject.us/
                                        FOG Reporting:
                                        https://fog-external-reporting-results.fogproject.us/

                                        1 Reply Last reply Reply Quote 0
                                        • Wayne WorkmanW
                                          Wayne Workman
                                          last edited by Wayne Workman

                                          Tom worked on a patch quietly since I started this thread - and I’ve tested it - and it works.

                                          My storage node at home that I tested on is an old Pentium 4 with an IDE drive in it - so it’s very slow.
                                          Results at home:

                                          I could open 12 pages and let them sit on the fog dashboard - CPU usage on the storage node stayed below 4, there weren’t uncontrollable httpd processes spawning, Open files by the apache user stayed below 4k - and the bandwidth chart not only reported - but it’s now reporting more smoothly than ever, is actually accurate now, and doesn’t have spells of mental disability anymore. It’s like butter.

                                          The issue, as it was explained to me, is that the JS which renders the bandwidth chart and does the polling of bandwidth.php on all nodes enabled for bandwidth reporting - it wasn’t waiting for a response, it would just re-issue another poll before the response was received. The more dashboards open, the worse it got, until eventually apache was unable to do anything do to it’s tremendous load of ‘stacked’ processes that were back-logged. I didn’t tell anyone but Tom - but this (now solved) issue would break imaging at remote locations, too - because we use the location plugin and apache on the remote nodes is relied upon to get imaging done in this setup. That too is solved now.

                                          These fixes will all be in RC-9.

                                          Please help us build the FOG community with everyone involved. It's not just about coding - way more we need people to test things, update documentation and most importantly work on uniting the community of people enjoying and working on FOG!
                                          Daily Clean Installation Results:
                                          https://fogtesting.fogproject.us/
                                          FOG Reporting:
                                          https://fog-external-reporting-results.fogproject.us/

                                          1 Reply Last reply Reply Quote 1
                                          • 1
                                          • 2
                                          • 1 / 2
                                          • First post
                                            Last post

                                          233

                                          Online

                                          12.0k

                                          Users

                                          17.3k

                                          Topics

                                          155.2k

                                          Posts
                                          Copyright © 2012-2024 FOG Project