FOG Client log reporting old RC and odd behavior


  • Testers

    Server
    • FOG Version: RC-18
    • OS: UBUNTU 14.04
    Client
    • Service Version: 0.11.5
    • OS: Windows 7 Pro x64
    Description

    I should have mentioned this a while back as I have seen it a few times in the past. First thing that happens is some sort of odd behavior of the client.

    I have seen two variations of this:

    Most common is that the client will think it is communicating with the server, but will not “listen” to anything the server says. For example, I tell a snapin to deploy to a lab. 95% of the lab does so. The other few just sit. Mind you, they all were imaged with the same image at the same time. Re-installing the FOG client resolves this. I can try to provide logs for this, but I can tell you the pattern. Everything looks normal except for one glaring anomaly: The client is reporting the server as an older RC than the current one that is running. For example, the oddly behaving client reports RC-14 when we were on RC-17.

    The other newer issue is that the client may be rebooting PC’s randomly. It has been difficult to track down what is causing these reboots, but we may have caught one “in the act” on Friday. Please see the excerpt of the client log below:

    ------------------------------------------------------------------------------
    --------------------------------HostnameChanger-------------------------------
    ------------------------------------------------------------------------------
     10/28/2016 2:06 PM Client-Info Client Version: 0.11.5
     10/28/2016 2:06 PM Client-Info Client OS:      Windows
     10/28/2016 2:06 PM Client-Info Server Version: 1.3.0-RC-14
     10/28/2016 2:06 PM Middleware::Response Success
     10/28/2016 2:06 PM HostnameChanger Checking Hostname
     10/28/2016 2:06 PM HostnameChanger Hostname is correct
     10/28/2016 2:06 PM HostnameChanger Attempting to join domain
     10/28/2016 2:06 PM HostnameChanger Success, code =  0
     10/28/2016 2:06 PM Power Creating shutdown request
     10/28/2016 2:06 PM Power Parameters: /r /c "Host joined to Active Directory, restart required" /t 0
     10/28/2016 2:06 PM Bus {
      "self": true,
      "channel": "Power",
      "data": "{\r\n  \"action\": \"shuttingdown\"\r\n}"
    }
     10/28/2016 2:06 PM Bus Emmiting message on channel: Power
    ------------------------------------------------------------------------------
    
     10/28/2016 2:07 PM Main Overriding exception handling
     10/28/2016 2:07 PM Main Bootstrapping Zazzles
     10/28/2016 2:07 PM Controller Initialize
     10/28/2016 2:07 PM Zazzles Creating main thread
     10/28/2016 2:07 PM Zazzles Service construction complete
     10/28/2016 2:07 PM Controller Start
    
     10/28/2016 2:07 PM Service Starting service
     10/28/2016 2:07 PM Bus Became bus server
     10/28/2016 2:07 PM Bus {
      "self": true,
      "channel": "Status",
      "data": "{\r\n  \"action\": \"load\"\r\n}"
    }
     10/28/2016 2:07 PM Bus Emmiting message on channel: Status
     10/28/2016 2:07 PM Service Invoking early JIT compilation on needed binaries
    

    Mind you, this is not reporting the proper RC version. I find it very strange that it would randomly need to re-join the domain being that no changes were made in AD at the time of this occurrence. Please see below for Windows logs on the client at that exact moment

    0_1477920791906_Capturelog.PNG

    So to wrap things up, I am now getting random reboots such as this all around my district. FOG seems to think that these PC’s need to be joined, but evidence points to this being a false positive due to it being on the domain a minute earlier and no actions being taken by us to disjoin these PC’s from the domain. I just updated to RC-18 on Saturday, so if this was already addressed, I apologize. As always, if you need logs, I can provide them. The current work-around is disabling task-reboot and hostname changer globally.

    Thanks!
    Paul


  • Moderator

    @fry_p It could be a monitor, keyboard, mouse, patch cable, video cable, or power cable causing the issue too.

    We had a computer “Spot” once that was just voodoo’d. It blue screen’d, I said OK. Image it. Fixed. A few weeks later, blue screen. I said, OK it’s a hardware issue. PC Replaced, boom fixed. A few weeks later, another blue screen. I said, OK, this is highly unusual, but I’ll image it. Fixed. And again, another blue screen.

    After that last one, I went in with a replacement monitor, replacement VGA cables, replacement power cables for everything, replacement keyboard, mouse, surge protector, and replacement patch cable. The whole 9-yards. Everything but the physical desk and chair was replaced. And that fixed it. All the old crap I pulled from there was marked “bad” and surplussed, I didn’t even bother testing those items as they were old and low-cost.


  • Testers

    @Wayne-Workman
    We had one sitting on the desk putting it through any stress test we could think of for days at a time. To date we are unable to replicate the random shutdowns.


  • Moderator

    @fry_p You should begin using this model at your desk that way you’re present and knowing of what was happening if it happens.


  • Testers

    @Wayne-Workman
    I will check if there is anything weird, but I don’t use any scheduled shutdowns or power management settings. We have exhausted all other factors through weeks of troubleshooting. The only affected models are Optiplex 5040’s. Older PC’s and other equipment run great on the same strips and outlets. The strips used are cheap and old. No surge protection on most of them. We’ve also replaced the PC in the same room twice to no avail. That is the only room we are currently still having issues. We’ve tried BIOS updates, latest drivers, different video adapters, re-building the image from scratch, etc. It keeps me up at night worrying about the stability of the new most common model of PCs in our district.


  • Moderator

    @fry_p said in FOG Client log reporting old RC and odd behavior:

    We’ve discovered Daisy-Chained powerstrips on problem PC’s

    Personally I doubt this is the issue. At my next-last job and last job, we daisy-chained surge protectors together in order to power computer labs that simply didn’t have enough outlets. It was always fine. And my next-last and last job had no shortage of labs like this. I believe it to be quite a common setup. Even here at home, I have surge protectors daisy chained because the area I’ve setup for my home lab environment has only 2 outlets and it’s just not enough for 10 computers + networking gear. I’ve never had such an issue with random shutdowns.

    Something else is causing your issue. It could be that you have “Smart” surge protectors that have a “Master” outlet, and if the master outlet is drawing power then all the other outlets are turned on, and if the master outlet is not drawing power the other outlets get turned off. At my last job, we got a large order of these and all they do is cause confusion because nobody but the technicians understand how they work. It could be windows updates causing a restart, could be scheduled shutdowns in FOG are jacked up because of timezones not being set right in the server and on hosts. Could be something as simple as power management settings turning off computers after a few minutes activity. I really couldn’t tell you what is causing your issue without troubleshooting it myself.


  • Testers

    @Joe-Schmitt said in FOG Client stops reporting & working:

    @michael_f @Wayne-Workman @Hanz a patch has been developed and verified for this issue. It will be released with 0.11.6 (along with a few other new features) once release candidate testing finishes for the client.

    If that’s the case, this can be marked as solved. Thank you everyone! It appears that most of the random shutdowns in my district were caused by some sort of dirty power situation. We’ve discovered Daisy-Chained powerstrips on problem PC’s (GRRRRR) and in one case we think the actual wiring in the room (the building and wiring is really old) may be causing random shutdowns. We are installing a mini UPS there tomorrow. Just out of curiosity does anyone else have random shutdowns on Dell Optiplex 5040’s? It may be that they are very sensitive to dirty power, or maybe a design flaw… We pulled our hair out over this for weeks on this.


  • Testers

    @Wayne-Workman Windows 7 Pro x64


  • Moderator

    @fry_p Is your environment all Windows 7 pro? My environment is Windows 10 Enterprise LTSB x64.


  • Testers

    @Wayne-Workman

    I have none of the KB Windows updates, no flash player update, and my acrobat reader DC update version is 15.020.20039.


  • Moderator

    @fry_p Can you look at my list and see if you have any of those installed?


  • Testers

    @Wayne-Workman there are quite a few more updates on my side. I exported a nice html list of them but am unable to upload anything besides a picture to the forums. The list is too long for a screenshot. I have a csv of them also. Let me know how to proceed.


  • Moderator

    @fry_p We need to compare windows updates with each other.
    Control Panel -> Small Icons -> Programs and Features -> View Installed Updates

    Here’s mine, post yours too please from a system that was affected.

    0_1478196700935_upload-bbf21eb8-369f-4ea1-93d5-6cb1ac1794b5


  • Testers

    @Wayne-Workman I have the utmost confidence in the team. I patiently await the results.


  • Moderator

    Just wanted to post an update here, Joe has been helping me with figuring out what the problem is, he built a special copy of the fog client for me to use, and I’ve been using this along with some other scripting techniques to try to “cause” the issue to happen, and when it does this special fog client has mechanisms in it to immediately gather data that we need to figure out what’s happening.

    So… it’s just a waiting game right now.


  • Senior Developer

    @Wayne-Workman My description of elements searches was even further limited that how much is searched. If the string r01300 shows up in any of the inventory and what not. It’s difficult to express what and/or where, but I can try a session tonight with you if you’re up to it to see if we can figure out exactly where this is all occurring.


  • Moderator

    @Tom-Elliott I know how the search works, it’s very robust and useful. However I’ve looked through this host’s groups, inventory, snapins, printers. I can’t find any string with “r01300” in it in any of those areas for that host. I don’t know why it’s matching - and I think it odd that it also happened to suffer from the OP’s bug.


  • Testers

    @Joe-Schmitt This is the very end of the log in question in hex (no permissions to upload the file and too many lines for the full dump in the code window).

    0003B040  20 31 31 3A 34 32 20 41 4D 20 43 6C 69 65 6E 74   11:42 AM Client
    0003B050  2D 49 6E 66 6F 20 43 6C 69 65 6E 74 20 56 65 72  -Info Client Ver
    0003B060  73 69 6F 6E 3A 20 30 2E 31 31 2E 35 0D 0A 20 39  sion: 0.11.5.. 9
    0003B070  2F 31 35 2F 32 30 31 36 20 31 31 3A 34 32 20 41  /15/2016 11:42 A
    0003B080  4D 20 43 6C 69 65 6E 74 2D 49 6E 66 6F 20 43 6C  M Client-Info Cl
    0003B090  69 65 6E 74 20 4F 53 3A 20 20 20 20 20 20 57 69  ient OS:      Wi
    0003B0A0  6E 64 6F 77 73 0D 0A 20 39 2F 31 35 2F 32 30 31  ndows.. 9/15/201
    0003B0B0  36 20 31 31 3A 34 32 20 41 4D 20 43 6C 69 65 6E  6 11:42 AM Clien
    0003B0C0  74 2D 49 6E 66 6F 20 53 65 72 76 65 72 20 56 65  t-Info Server Ve
    0003B0D0  72 73 69 6F 6E 3A 20 31 2E 33 2E 30 2D 52 43 2D  rsion: 1.3.0-RC-
    0003B0E0  31 30 0D 0A 20 39 2F 31 35 2F 32 30 31 36 20 31  10.. 9/15/2016 1
    0003B0F0  31 3A 34 32 20 41 4D 20 4D 69 64 64 6C 65 77 61  1:42 AM Middlewa
    0003B100  72 65 3A 3A 52 65 73 70 6F 6E 73 65 20 53 75 63  re::Response Suc
    0003B110  63 65 73 73 0D 0A 20 39 2F 31 35 2F 32 30 31 36  cess.. 9/15/2016
    0003B120  20 31 31 3A 34 32 20 41 4D 20 50 6F 77 65 72 4D   11:42 AM PowerM
    0003B130  61 6E 61 67 65 6D 65 6E 74 20 43 61 6C 63 75 6C  anagement Calcul
    0003B140  61 74 69 6E 67 20 74 61 73 6B 73 20 74 6F 20 75  ating tasks to u
    0003B150  6E 73 63 68 65 64 75 6C 65 0D 0A 20 39 2F 31 35  nschedule.. 9/15
    0003B160  2F 32 30 31 36 20 31 31 3A 34 32 20 41 4D 20 50  /2016 11:42 AM P
    0003B170  6F 77 65 72 4D 61 6E 61 67 65 6D 65 6E 74 20 43  owerManagement C
    0003B180  61 6C 63 75 6C 61 74 69 6E 67 20 74 61 73 6B 73  alculating tasks
    0003B190  20 74 6F 20 73 63 68 65 64 75 6C 65 0D 0A 2D 2D   to schedule..--
    0003B1A0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B1B0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B1C0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B1D0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B1E0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 0D 0A 0D 0A  ------------....
    0003B1F0  0D 0A 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ..--------------
    0003B200  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B210  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B220  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B230  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B240  0D 0A 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ..--------------
    0003B250  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B260  2D 2D 2D 2D 55 73 65 72 54 72 61 63 6B 65 72 2D  ----UserTracker-
    0003B270  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B280  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B290  0D 0A 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ..--------------
    0003B2A0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B2B0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B2C0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B2D0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B2E0  0D 0A 20 39 2F 31 35 2F 32 30 31 36 20 31 31 3A  .. 9/15/2016 11:
    0003B2F0  34 32 20 41 4D 20 43 6C 69 65 6E 74 2D 49 6E 66  42 AM Client-Inf
    0003B300  6F 20 43 6C 69 65 6E 74 20 56 65 72 73 69 6F 6E  o Client Version
    0003B310  3A 20 30 2E 31 31 2E 35 0D 0A 20 39 2F 31 35 2F  : 0.11.5.. 9/15/
    0003B320  32 30 31 36 20 31 31 3A 34 32 20 41 4D 20 43 6C  2016 11:42 AM Cl
    0003B330  69 65 6E 74 2D 49 6E 66 6F 20 43 6C 69 65 6E 74  ient-Info Client
    0003B340  20 4F 53 3A 20 20 20 20 20 20 57 69 6E 64 6F 77   OS:      Window
    0003B350  73 0D 0A 20 39 2F 31 35 2F 32 30 31 36 20 31 31  s.. 9/15/2016 11
    0003B360  3A 34 32 20 41 4D 20 43 6C 69 65 6E 74 2D 49 6E  :42 AM Client-In
    0003B370  66 6F 20 53 65 72 76 65 72 20 56 65 72 73 69 6F  fo Server Versio
    0003B380  6E 3A 20 31 2E 33 2E 30 2D 52 43 2D 31 30 0D 0A  n: 1.3.0-RC-10..
    0003B390  20 39 2F 31 35 2F 32 30 31 36 20 31 31 3A 34 32   9/15/2016 11:42
    0003B3A0  20 41 4D 20 4D 69 64 64 6C 65 77 61 72 65 3A 3A   AM Middleware::
    0003B3B0  52 65 73 70 6F 6E 73 65 20 53 75 63 63 65 73 73  Response Success
    0003B3C0  0D 0A 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ..--------------
    0003B3D0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B3E0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B3F0  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B400  2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D 2D  ----------------
    0003B410  0D 0A 0D 0A 20 39 2F 31 35 2F 32 30 31 36 20 31  .... 9/15/2016 1
    0003B420  31 3A 34 32 20 41 4D 20 4D 69 64 64 6C 65 77 61  1:42 AM Middlewa
    0003B430  72 65 3A 3A 43 6F 6D 6D 75 6E 69 63 61 74 69 6F  re::Communicatio
    0003B440  6E 20 55 52 4C 3A 20 68 74 74 70 3A 2F 2F 31 30  n URL: http://10
    0003B450  2E 31 2E 31 30 36 2E 31 36 39 2F 66 6F 67 2F 6D  .1.106.169/fog/m
    0003B460  61 6E 61 67 65 6D 65 6E 74 2F 69 6E 64 65 78 2E  anagement/index.
    0003B470  70 68 70 3F 73 75 62 3D 72 65 71 75 65 73 74 43  php?sub=requestC
    0003B480  6C 69 65 6E 74 49 6E 66 6F 26 63 6F 6E 66 69 67  lientInfo&config
    0003B490  75 72 65 26 6E 65 77 53 65 72 76 69 63 65 26 6A  ure&newService&j
    0003B4A0  73 6F 6E 0D 0A 20 39 2F 31 35 2F 32 30 31 36 20  son.. 9/15/2016 
    0003B4B0  31 31 3A 34 32 20 41 4D 20 4D 69 64 64 6C 65 77  11:42 AM Middlew
    0003B4C0  61 72 65 3A 3A 52 65 73 70 6F 6E 73 65 20 53 75  are::Response Su
    0003B4D0  63 63 65 73 73 0D 0A 20 39 2F 31 35 2F 32 30 31  ccess.. 9/15/201
    0003B4E0  36 20 31 31 3A 34 32 20 41 4D 20 53 65 72 76 69  6 11:42 AM Servi
    0003B4F0  63 65 20 45 52 52 4F 52 3A 20 49 6E 76 61 6C 69  ce ERROR: Invali
    0003B500  64 20 70 72 6F 6D 70 74 54 69 6D 65 2C 20 75 73  d promptTime, us
    0003B510  69 6E 67 20 64 65 66 61 75 6C 74 0D 0A 20 39 2F  ing default.. 9/
    0003B520  31 35 2F 32 30 31 36 20 31 31 3A 34 32 20 41 4D  15/2016 11:42 AM
    0003B530  20 53 65 72 76 69 63 65 20 53 6C 65 65 70 69 6E   Service Sleepin
    0003B540  67 20 66 6F 72 20 31 32 38 20 73 65 63 6F 6E 64  g for 128 second
    0003B550  73 0D 0A                                         s..
    

  • Senior Developer

    @Wayne-Workman incorrectly shows up? The search at first searches for multiple Ares. If r01300 is group the host is in, a snapin, an image, description, etc… It will show in the list. Fog isn’t just searching data on one field.


  • Moderator

    Even after deleting all those junk entries in the DB, that one computer that did have the bug still incorrectly shows up in the search results.


Log in to reply
 

Looks like your connection to FOG Project was lost, please wait while we try to reconnect.