No server monitoring from Foglight

This question has been hounding me and figured I'd ask it here but keep searching for an answer.

One of my sql servers is running fine but Foglight' dashboard is showing dashes where the system utilization\up since are at.  I can ping it from the agent manager, and it is gathering sql server PI data as well.

What do I need to check or change to get this to start gathering OS information?

Parents
  • Hi John,

    The OS column shows that there is (at least looks to be) an agent. If you click the host name, it should take you to an OS dashboard - is data showing there?

    If not, can you verify in the Administration -> Agent Status page that the Windows/Linux agent for that host is up and collecting data? Does it show a recent value in the Last Data Submission column of that page? Can you also check the log file for any errors?

    In the Databases dashboard, click the box for the instance, then Admin -> Settings -> Collection Details. Can you test the connection (and verify there is a green check on the OS column)?

    Are there any credential errors?

    Aside from those checks.. if that doesn't resolve it, I would suggest a support case.

  • Darren,

    Thanks for the quick reply and the help.  Much appreciated!!

    I verified all the things you suggested and here are my results:

    1. OS Dashboard does not appear to be gathering data at all.  It shows a lot of green but nothing is "running" as it normally would.

    2.  It's showing "Last Data Submission" from about 5 hours ago on the "Monitor@servername" Agent Name.  I did indeed check the log for "monitor@servername" and found the following: 

    2020-01-15 05:57:15.114 ECHO    <fb8be857-fbcf-4747-973d-295a8570e581/HostAgents/5.9.5/WindowsAgent/Monitor@brokenservernamehere> VERBOSE [Quartz[0]-28746] com.quest.glue.core.remoteconnection.windows.wmi.WMIUtils - The connection to the remote system was lost and the query "select SystemUpTime from Win32_PerfFormattedData_PerfOS_System" could not be completed, no results will be returned.
    com.quest.glue.api.services.ConnectionBrokenException: The query 'select SystemUpTime from Win32_PerfFormattedData_PerfOS_System' has failed (0x80041032)
        at com.quest.glue.core.remoteconnection.windows.wmi.WMIUtils.createQueryFailedException(WMIUtils.java:393)
        at com.quest.glue.core.remoteconnection.windows.wmi.WMIUtils.createQueryConnectionBrokenException(WMIUtils.java:350)
        at com.quest.glue.core.remoteconnection.windows.wmi.BaseWMINativeConnectionImpl.executeQueryInternal(BaseWMINativeConnectionImpl.java:286)
        at com.quest.glue.core.remoteconnection.windows.wmi.BaseWMINativeConnectionImpl.executeQuery(BaseWMINativeConnectionImpl.java:255)
        at com.quest.glue.core.remoteconnection.windows.wmi.PoolableWMINativeConnectionImpl.executeQuery(PoolableWMINativeConnectionImpl.java:136)
        at sun.reflect.GeneratedMethodAccessor158.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:498)
        at com.quest.glue.core.services.EquivalenceInvocationHandler.handleInvocation(EquivalenceInvocationHandler.java:60)
        at com.quest.common.proxy.AbstractEquivalenceInvocationHandler.invoke(AbstractEquivalenceInvocationHandler.java:83)
        at com.quest.glue.core.remoteconnection.windows.wmi.WMIConnectionFactory$PoolableNativeWMIConnectionFactory$PoolableNativeWMIInvocationHandler$1.call(WMIConnectionFactory.java:425)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at com.quest.glue.core.util.ExecutionTaskThread$ExecutionTaskThreadFutureTask.run(ExecutionTaskThread.java:254)
        at com.quest.glue.core.util.ExecutionTaskThread.run(ExecutionTaskThread.java:119)
    Caused by: com.jacob.com.ComFailException: Invoke of: Count
    Source: SWbemObjectSet
    Description: Call cancelled

        at com.jacob.com.Dispatch.invokev(Native Method)
        at com.jacob.com.Dispatch.invokev(Dispatch.java:625)
        at com.jacob.com.Dispatch.get(Dispatch.java:788)
        at com.quest.glue.core.remoteconnection.windows.wmi.BaseWMINativeConnectionImpl.validateDispatch(BaseWMINativeConnectionImpl.java:332)
        at com.quest.glue.core.remoteconnection.windows.wmi.BaseWMINativeConnectionImpl.executeQueryInternal(BaseWMINativeConnectionImpl.java:272)
        ... 11 more

    3.  I ran a test connection per your suggestion and it came back green & verified.

    Not sure if you've run into this before.  If so, what was your solution?  I checked the timeline & it appears to have been working this morning. 

    Thanks again.

  • Hi John,

    Normally when I see the WMI calls failing I engage with support.

    It looks like the Windows agent lost connection to the host (for some reason, the log doesn't say).

    You could try to deactivate/activate the Windows agent to see if it resolves it. But if it doesn't or re-occurs, I would suggest a support case.

  • Tried activating\deactivating several times with no luck.  Not sure what happened so just this one server out of 33 started doing this.

    Anyone else have any suggestions?  It's still gathering SQL PI data and occasionally the "Up Since" column shows a date but then will go away only to popup again later.

    I'd like to try something else before I have to put a help ticket in.  I have a feeling the tech support will either want me to restart sql server services (not an easy task since both instances are reporting servers) or ask for a log file download.

Reply
  • Tried activating\deactivating several times with no luck.  Not sure what happened so just this one server out of 33 started doing this.

    Anyone else have any suggestions?  It's still gathering SQL PI data and occasionally the "Up Since" column shows a date but then will go away only to popup again later.

    I'd like to try something else before I have to put a help ticket in.  I have a feeling the tech support will either want me to restart sql server services (not an easy task since both instances are reporting servers) or ask for a log file download.

Children
No Data