Quantcast
Channel: THWACK: All Content - Network Performance Monitor
Viewing all 21870 articles
Browse latest View live

When you installed NPM, did you add Nodes manually or did you run discovery?

$
0
0

We would like to improve user experience and for such reason I'd like to better understand if our users prefers INITIALLY to add nodes manually or run product network discovery in order to import devices to NPM


Unmanaged Devices (Availability Report shows 0%)

$
0
0

Hi All,

 

I have some devices which have been down for a couple of months due to ongoing building renovations. I unmanaged them last month in order for the availability report to show 100% up. As I check the report for the last month its show 0% availability for these devices.

 

My understanding is that the availability for unmanaged devices is calculated as 100% while they are unmanaged.

 

Please help me understand, what could have caused the availability report to be 0%. One thing that I can guess is, I unmanaged these devices at a time when they were already down.

 

 

Thanks and Regards

 

Main Ahmad

Adding link to a hostname in a result of an UnDP, in custom table?

$
0
0

Hello!

 

I currently have a custom table that returns different values polled via SNMP. Basically, I'm asking an application who host it. Values returned are the same as caption of another monitored node. (Think of it as if you were asking a VM who's the active and standby host).

Polled Node it the node where the UnDP result is coming from, and Active_Host & Standby_host are text values returned from polls.

I would like to enable link to the page of the active_host and standby_host.

 

My data source is defined this way:

SELECT        N.NodeID,        '<img src="'+'/Orion/Images/StatusIcons/'+N.StatusLED+'" width="16" height="16">'  as Status,         '/Orion/NetPerfMon/NodeDetails.aspx?NetObject=N:'+CAST(N.NodeID as varchar(256)) as 'DetailsURL' ,        N.Caption as 'Polled Node',        N.System,        N.SysName,        Active_Host = ( SELECT TOP 1                            CPSD.Status                    FROM    CustomPollerAssignment CPA                            INNER JOIN CustomPollers CP                                ON CPA.CustomPollerID = CP.CustomPollerID                                   AND CP.UniqueName = 'eadsP25ActiveHostName'                            INNER JOIN CustomPollerStatistics_Detail CPSD                                ON CPA.CustomPollerAssignmentID = CPSD.CustomPollerAssignmentID                    WHERE   CPA.NodeID = N.NodeID                    ORDER BY CPSD.DATETIME DESC                  ),        Standby_Host = ( SELECT TOP 1                            CPSD.Status                     FROM   CustomPollerAssignment CPA                            INNER JOIN CustomPollers CP                                ON CPA.CustomPollerID = CP.CustomPollerID                                   AND CP.UniqueName = 'eadsP25StandByHostName'                            INNER JOIN CustomPollerStatistics_Detail CPSD                                ON CPA.CustomPollerAssignmentID = CPSD.CustomPollerAssignmentID                     WHERE  CPA.NodeID = N.NodeID                     ORDER BY CPSD.DATETIME DESC                   )
FROM    Nodes N
WHERE   NodeID IN ( SELECT  NodeID                    FROM    CustomPollerAssignment CPA                            INNER JOIN CustomPollers CP                                ON CPA.CustomPollerID = CP.CustomPollerID                                   AND CP.UniqueName IN ( 'eadsP25ActiveHostName', 'eadsP25StandByHostName')
AND N.System = 'XXXXX'
)
ORDER BY N.Caption ASC

 

How could I approach the issue and provide the links (and ideally the tooltips) in the active_host & standby_host column?

Device Add/Registration TimeStamp ?

$
0
0

I am trying to determine when a device was added into the SolarWinds DB.   I have looked through the following tables without luck

 

nodes.statistics

nodesdata

npm_nodes

 

Any help will result in a correct answer and or a potential like.   Have a great weekend my fellow SolarWinds compadres!

Upgrade to NPM 12 from 11.5 - Caveats

$
0
0

Guys, we are about to start monitoring a lot more data in our NPM instance. As such, it think it might better to upgrade from 11.5 to 12 before we start adding in tonnes of customized scripts etc. Just wondering how the community have upgrade their instances?

 

  • Have you done in place upgrades or build outs?
  • Any gotchas to be aware of that aren't documented?
  • Anything that plain doesn't work?

 

Just looking to see if it's worth doing the upgrade now which I am leaning to.

Availability report, not adding up

$
0
0

Hello,

 

I created a custom availability report that was just a modified version of the current "Group Availability (with members) - This Year" report.  The only thing I changed was that I dropped the up/down status and added Group_Member_Availability.  However, one discrepancy that I am seeing is that in some areas the group availability will show up as 99%, however the group members are all at 100%.  How can the entire group have a lower availability then the average of all the members in that group?

 

BadMath.png

 

GroupAvailability.png

NPM & NTA Vs Network Bandwidth Analyzer Pack

$
0
0

Heloo,

 

I am wondering what is the difference between buying NPM then buy NTA  and bying "Network Bandwidth Analyzer Pack" which is NPM+NTA ?  is there any benefit ?

Which Help Desk / Service Desk are you using?


Documentation of Solarwinds Services

$
0
0

I've been looking around on the Success Center for a document which explains which Orion services do what, ie the Orion Module Engine, the Solarwinds Information Service V3, etc.  I'm also interested in finding out how the different services interact with one another.  A good example of what I am looking  for would be how the individual services work together to send an email for an alert.

 

I can't really say that I found much.

 

I did find this Thwack post earlier (https://thwack.solarwinds.com/docs/DOC-170557#start=25 ), and it was a good place to start, but I didn't get much further.  I'll be grateful for any other resources that I can be pointed toward.

NPM 12.0.1 SSH Client settings

$
0
0

Are there any sort of settings to configure in the embedded SSH client built into NPM 12.0.1?  I don't see any in the web interface

 

Perhaps an ".ini" file with options?

NetPath agent management issues

$
0
0

Hi guys,

 

Can anybody help out here?

 

We have recently migrated our solarwinds server to a new server and new database and upgraded to the latest version. So this is what we did:

 

 

1. OlderServer moved to NewServer (which has different name and IP address temporarily)

2. OldServer was shutdown and NewServer's name was changed to the OldServer's name and IP address

3. All worked fine and polling continued as normal. (licensing was also migrated successfully)

 

The problem is, NetPath just doesn't work and never has. Under 'manage agents' the agent says the NewServer's temporary name (that was changed) and can't connect. I tried deleting it but it won't delete. I tried adding another agent and it throws up website errors. I tried installing the agent software manually and this is the type of error I keep seeing:

 

Error getting agent installer for download. There was no response from the Agent Management Control Service located at "SOLARWINDSSERVER:17777".

 

I've run the configuration wizard numerous times. I've tried re-installing NetPath alone. I'm getting no where. I've raised a call with the Solarwinds guys and they've told me to run some queries on the database to remove the incorrect name, but this hasn't worked. It looks like I need to re-install the agent management service or something like that but i'm not sure what it is I need to re-install. I've also found the table in the DB references the wrong name and removed it. Still nothing. Maybe I just need to do a complete re-install of solarwinds???

 

Any help or advice, would be great. Thanks.

The Ultimate CPU Alert for Large Environments

$
0
0

If you've been playing along at home you will have likely implemented adatole's The Ultimate CPU Alert and the slightly modified version (by yours truly!) for Linux The Ultimate CPU Alert ... for Linux!  While I don't have a problem with either alert for small and medium sized environments when you start scaling to large environments things can get a little hairy.

 

What is large?  We have nearly 11,000 nodes and almost 145 million entries in the CPUMultiLoad view.  (You can find out how many rows you have by running SELECT COUNT(NodeID) FROM CPUMultiLoad WITH (NOLOCK) against your DB)  While examining our database via Database Performance Analyzer (you have DPA, don't you!?) we noticed that the Alert Status query associated with our CPU alerts kept showing up as a source of blocking.  Blocking queries generate wait time for other queries while the consume resources to complete.  In this case the blocking appeared to be caused by a query that was taking a long time to execute.  I put on my SQL query detective's hat and went to work.

 

The core of the The Ultimate CPU Alert is a pair of INNER JOINS to get the number of CPUs for each node so that it can be compared to the Win_Processor_Queue_Len value that is captured by the Server and Application Monitor component.  The top of the query looks like this:

 

SELECT DISTINCT Nodes.NodeID AS NetObjectID, Nodes.Caption AS Name

   FROM Nodes

   INNER JOIN APM_AlertsAndReportsData  ON (Nodes.NodeID = APM_AlertsAndReportsData.NodeId)

   INNER JOIN

      (SELECT c1.NodeID, COUNT(c1.CPUIndex) as CPUCount

     FROM

         ( SELECT DISTINCT CPUMultiLoad.NodeID,  CPUMultiLoad.CPUIndex FROM CPUMultiLoad )  c1

      GROUP BY c1.NodeID

      )  c2

   ON Nodes.NodeID = c2.NodeID

 

That innermost SELECT statement is where we run through the entire CPUMultiLoad table so that we can then select the NodeID and count of the CPUIndex.  Given that this view (as CPUMultiLoad is a view, not a table) has 144 million rows in our DB (and since you don't really change the number of CPUs on a server all that often) there might be a better way to perform this query.  Here's what we did:

 

   INNER JOIN APM_AlertsAndReportsData

   ON (Nodes.NodeID = APM_AlertsAndReportsData.NodeId)

   INNER JOIN

      (

  SELECT DISTINCT CPUMultiLoad.NodeID, MAX(CPUMultiLoad.CPUIndex)+1 AS CPUCount

  FROM CPUMultiLoad

  WHERE DATEDIFF(hh,TimeStampUTC,GETDATE()) <=4

  GROUP BY CPUMultiLoad.NodeID

    ) c1

  ON Nodes.NodeID = c1.NodeID

 

Instead of grabbing the data from CPUMultiLoad and then selecting it again we removed an INNER JOIN and selected the NodeID and MAX(CPUMultiLoad.CPUIndex) instead.  Of course, CPUIndex values start with zero and, since we want a count for comparison against the number of processes running against those CPUs, we added 1 and called it CPUCount.  In order to trim the number of rows returned from this SELECT statement (remembering that we have to INNER JOIN the select results (we called it c1) with the Nodes table) we added a WHERE DATEDIFF clause.  We take the time stamp in the CPUMultiLoad table and compare the difference, in hours, against the current date and then return only those rows where the hours are less or equal to 4.  Why 4?  Our Orion environment is set to Eastern time and UTC is 4 hours difference from Eastern time.  (I might do 6 hours -- just to be safe for daylight savings time, etc. -- but you get the idea!)

 

When we ran the two queries back-to-back we found that the updated query returned results 250% faster!  I'm not a SQL whiz by any stretch, but I definitely think this is a great step in the right direction.

 

How would you improve the query?

 

For the record, this is the entire query for the The Ultimate CPU Alert:

 

SELECT DISTINCT Nodes.NodeID AS NetObjectID, 

      Nodes.Caption AS Name 

   FROM Nodes 

   INNER JOIN APM_AlertsAndReportsData 

   ON (Nodes.NodeID = APM_AlertsAndReportsData.NodeId) 

   INNER JOIN 

      (

  SELECT DISTINCT CPUMultiLoad.NodeID, 

        MAX(CPUMultiLoad.CPUIndex)+1 AS CPUCount

  FROM CPUMultiLoad

  WHERE DATEDIFF(hh,TimeStampUTC,GETDATE()) <=4

  GROUP BY CPUMultiLoad.NodeID

    ) c1 

     

   ON Nodes.NodeID = c1.NodeID 

   WHERE Nodes.n_mute <> 1 

  AND Nodes.Prod_State = 'PROD' 

  AND APM_AlertsAndReportsData.ComponentName = 'Win_Processor_Queue_Len' 

  AND APM_AlertsAndReportsData.StatisticData <= c1.CPUCount 

  AND ( (nodes.CPU_Crit is null 

  AND nodes.CPULoad < 90) 

   OR (nodes.CPU_Crit is not null 

  AND nodes.CPULoad < nodes.CPU_Crit) )

Wireless AP no longer detected after 12.0.1 upgrade from 12.0

$
0
0

We have Avaya Wireless APs, which are OEM Xirrus units.  SInce the 12.0 upgrade we were receiving AP-type stats on each of the endpoint's home pages. However, since the upgrade to NPM 12.0.1 these seem to have disappeared.

 

Also a possibility a recent software upgrade from 7.5.4 to 8.1 on the APs may have caused this.

SNMP requests for Serial Numbers of Cisco Devices

$
0
0

Hi

 

I am trying to find out how it is possible to display values for a Cisco Host from a specfic table.

 

The Table is from Cisco ENTITY-MIB

entPhysicalEntry (OID:1.3.6.1.2.1.47.1.1.1.1)

 

In the end I would like to have a table of the Serial Number associated with this device/host, e.g.

entPhysicalDesc (OID:1.3.6.1.2.1.47.1.1.1.1.2)

entPhysicalName (OID:1.3.6.1.2.1.47.1.1.1.1.7)

entPhysicalModelName (OID:1.3.6.1.2.1.47.1.1.1.1.27)

enyPhysicalSerialNum (OID:1.3.6.1.2.1.47.1.1.1.1.13)

 

And if possible I would like to filter the results displaying only entries where the serial number is defined.

 

Any help would be mush appreciated.

Thanks

Cisco ASR 1002-X, 1002-HX Node Details


What We're Working on for NPM (Updated October 25th, 2016)

$
0
0

Since the release on NPM 12.0 we've been hard at working building the next round of exciting functionality and improvements in existing functionality.  I'm pleased to share the following list of items we're working on:

 

  • Improved Meraki Wireless Support - Full support for Meraki wireless access points
  • Silence Alerts - Silence Alerts While Still Monitoring
  • Network Insight for Cisco ASA - Covering things like VPN tunnel monitoring, ACL visibility, and improved platform health (H/A status, hardware health).
  • Faster and web based simplified product upgrades and installation of hot fixes
  • Integrated Disaster Recovery Engine
  • Windows Authentication for SQL DB - Orion can optionally use AD user credentials to authenticate to the SQL database instead of SQL local credentials.
  • Performance Analyzer - Pull metrics, statuses, events and other data sets into a single view with a shared timeline for correlation and faster troubleshooting.
  • Import nodes from file with custom properties

Netpath not showing redundant LAN links

$
0
0

Between the probe and the destination is about 4 or 5 hops.  Some hops, with redundant links, aren't showing redundant paths - netpath is showing a straight hop to hop topology, like a straight line.

 

Specifically, the flow is crossing a router, that has four uplinks - 2 each to 2 parallel L3 switches, which also have 2 links each to the destination.

 

So, it should look like line to a baseball diamond.   I'm using Cisco EIGRP, with ECMP routing.  All the hops are monitored solarwinds nodes.

 

here's the routing table I'm worried about - as you can tell, the routing table has four hops, with equal metrics:

 

RMER1#sh ip route 10.24.2.5

Routing entry for 10.24.2.5/32

  Known via "eigrp 250", distance 170, metric 51210880, type external

  Redistributing via eigrp 250

  Last update from 10.24.90.13 on GigabitEthernet0/0/4, 1w0d ago

  Routing Descriptor Blocks:

    10.24.90.13, from 10.24.90.13, 1w0d ago, via GigabitEthernet0/0/4

      Route metric is 51210880, traffic share count is 1

      Total delay is 100011 microseconds, minimum bandwidth is 1000000 Kbit

      Reliability 255/255, minimum MTU 1500 bytes

      Loading 1/255, Hops 2

    10.24.90.9, from 10.24.90.9, 1w0d ago, via GigabitEthernet0/0/2

      Route metric is 51210880, traffic share count is 1

      Total delay is 100011 microseconds, minimum bandwidth is 1000000 Kbit

      Reliability 255/255, minimum MTU 1500 bytes

      Loading 1/255, Hops 2

  * 10.24.90.5, from 10.24.90.5, 1w0d ago, via GigabitEthernet0/0/3

      Route metric is 51210880, traffic share count is 1

      Total delay is 100011 microseconds, minimum bandwidth is 1000000 Kbit

      Reliability 255/255, minimum MTU 1500 bytes

      Loading 1/255, Hops 2

    10.24.90.1, from 10.24.90.1, 1w0d ago, via GigabitEthernet0/0/1

      Route metric is 51210880, traffic share count is 1

      Total delay is 100011 microseconds, minimum bandwidth is 1000000 Kbit

      Reliability 255/255, minimum MTU 1500 bytes

      Loading 1/255, Hops 2

WPM Transaction

$
0
0

I would appreciate if you can tell me if there is a option to create an alert in WPM Transaction Availability.  Want to trigger an alert event after two consecutive failures polls of 5 minutes?

 

Thanks

David

why does Min/Max/Average bps In/Out chart show 90th Percentile when 95th was picked.

$
0
0

I have searched and did not see a discussion on this so I hope I'm not being repetitive here.  We have a custom chart of Min/Max/average bps In/out and have the  show the 95th percentile line checked, but on the chart it is showing the 90th percentile. Any ideas?

11.5 and iframes

$
0
0

Prior to 11.5 I was able to embed selected orion views on other webpages using iframes with a simple html as below.  After upgrading to 11.5 I'm getting "This content cannot be displayed in a frame".  Any knows how to get this going again.

 

<html>
<body>

<iframe src="http://10.10.10.10/Orion/SummaryView.aspx?ViewID=56&isNOCView=true" width="1300" height="1200">
</iframe>

</body>
</html>

Viewing all 21870 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>