Quantcast
Channel: THWACK: All Content - Network Performance Monitor
Viewing all 21870 articles
Browse latest View live

95th Percentile Reporting per ISP - Custom SQL Report

$
0
0

We have several providers for our datacenters, and 10-20 datacenters across the world. Recently, we were hit with some higher-than-expected bills because of bandwidth overages, so we are hoping to take a proactive approach to our bandwidth monitoring to remedy it. Since we get billed on 95th percentile, we are looking to create a report that shows us the 95th percentile usage for each ISP over the last 30 days. We'll then use the reports to gauge where we are with our bandwidth usage and adjust as needed.

 

I had a call with Solarwinds and it was suggested that I post here for more help. Here's what I have so far... I'll use Level 3 as our ISP in the example.

 

SQL for report

DECLARE @StartDate DateTime

DECLARE @EndDate DateTime

 

SET @StartDate = ${FromTime}

SET @EndDate = ${ToTime}

 

SELECT    Interfaces.InterfaceId,

Nodes.NodeID,

Nodes.City,

Nodes.Caption AS NodeName,

Interfaces.Caption AS Interface_Caption,

Maxbps_In95,

Maxbps_Out95,

Maxbps_All95

FROM Nodes

INNER JOIN Interfaces ON Nodes.NodeID = Interfaces.NodeID

INNER JOIN (

SELECT InterfaceID,

dbo.GetInBpsPercentile(InterfaceID, @StartDate, @EndDate,95) AS Maxbps_In95,

dbo.GetOutBpsPercentile(InterfaceID, @StartDate, @EndDate,95) AS Maxbps_Out95,

dbo.GetInBpsPercentile(InterfaceID, @StartDate, @EndDate,95) +

dbo.GetOutBpsPercentile(InterfaceID, @StartDate, @EndDate,95) AS Maxbps_All95

FROM InterfaceTraffic

WHERE InterfaceTraffic.DateTime >= @StartDate AND InterfaceTraffic.DateTime <= @EndDate

GROUP BY InterfaceID

) TrafficStat

ON Interfaces.InterfaceID = TrafficStat.InterfaceID

WHERE (Interfaces.Caption LIKE '%level 3%' OR Interfaces.Caption LIKE '%level3%')

 

This produces a table that shows me the 95th percentile usage. However, it doesn't show me the sum of all locations. We want to see the ISP and the total 95th percentile usage across all sites.

 

Screen Shot 2015-09-18 at 2.29.52 PM.png

 

The other option I tried was to put it on a graph. However, I'm unable to find a Timestamp field so that I can associate the data with a time and put it on a graph. I've tried some different SQL queries but haven't been able to get it yet.

 

So... does anybody know of a way to get a total sum for the column OR how I can associate the data with a time so that it will fit on a graph? Let me know if you need more information and I'll work with you.


How to limit a result from a Universal Device Poller to specific text

$
0
0

All,

 

I am trying to find a way to determine how many sfp's a switch has. I figured out that I can use the 1.3.1.6.1.2.1.47.1.1.1.1.2 OID to get a physical description of the port. So if you do a show interface status, the last column on right will tell you if you have an SFP or not.

If the text is '1000BaseSX' or 'SFP-10Gbase-LR' or '1000BaseT' then it's an SFP. The problem is an SNMPWALK of the above OID table returns a value for every port. So what I'm trying to do is "grep" for only the text that is for SFP's. Does anybody know how I can filter results from a UDP?

So logically, I'm trying to return a value only if it's one of the 3 above, otherwise, don't list the result.

Below is an image of the result.   I'm trying to only have the items with 1000BaseSX or 1000BaseT show up in the list. I don't want Port Container or any other text.

Any help would be great! Thanks!

Screenshot of UDP text values

Let's talk about Solarwinds High Availability. Do you want HA? Why? Do you have it? Why'd you get it? Does it work well?

$
0
0

If you don't have Solarwinds HA:

  • Why not?
    • Cost?
    • Your company doesn't believe monitoring is important enough to support (with licenses, employee setup/support hours, hardware environment, etc.)?
    • You've never thought about it?
    • You just don't need it?
    • You don't have time to set it up or to maintain it?

 

If you DO have Solarwinds HA:

  • Why did you get it?
  • How did you convince your company it is necessary?
  • How satisfied are you with it?
  • Have you set up HA for ALL your polling engines?
  • What would you change about it?
  • Have you seen the new HA administration view that shows all your standby / HA pollers, and that highlights any differences between them?

 

You can check out some of the HA views on your own Main Poller here:

  • https://<YOUR SOLARWINDS MAIN POLLER ADDRESS OR DNS>/ui/ha/settings
  • https://<YOUR SOLARWINDS MAIN POLLER ADDRESS OR DNS>/ui/ha/summary

Null Engines in VoIPEngines Table

$
0
0

I looked into the VoIPEngines table, to see what there is to see, expecting to see 5 rows -- the same number of rows in Engines table.

SELECT * FROM [dbo].[Engines]
SELECT * FROM [dbo].[VoipEngines]

 

However, it seems that somehow there have been a lot of null engines added into the table every since July 31.

VoipEngineID KeepAlive                          BusinessLayerPort      EngineID

1                     2018-10-23 14:03:51.450 17777                          1

3                     2018-10-23 14:03:45.293 17777                          3

6                     2018-07-31 11:03:49.423 17777                          NULL

7                     2018-07-31 11:04:39.257 17777                          NULL

8                     2018-07-31 11:04:49.523 17777                          NULL

...

44990             2018-10-23 13:50:17.450 17777                          NULL

44991             2018-10-23 13:55:17.883 17777                          NULL

44992             2018-10-23 14:00:17.307 17777                          NULL

 

As of writing this question, there are 44987 too many engines in VoipEngineID, all with NULL EngineID. These do not show up in the [dbo].[Engines] table.

Does anyone know what causes this?

 

Also, can I just delete the NULL rows?

DELETE [ve]
FROM [dbo].[VoipEngineID][ve]  WHERE [EngineID] IS NULL

 

I suppose someone is going to say, "ask Tech Support". Sure, I can do that, I just thought that I maybe this would make a good question to post online.

High Availability - What's Up?

$
0
0

All,

 

Here is a correlation of various links and pages from everyone.  As I am moving in this direction, I wanted to get a page together so I have one place to refer instead of bouncing around.  If you come across a page or good information let me know and I will get it down here.

 

If you have deployed HA and would like to add to the conversation, please do.  I know there are pitfalls and gotch-ya's out there.

 

Thanks,

CourtesyIT.

 

Special Thwack-Out toabdulraheemsidzmad-stevooiram

 

Start Here: ---> TheAlwaysAwesomeaLTeReGo

Multi-Subnet Failover (WAN/DR) Deployment

Torture Testing High Availability

 

Second Stop: ---> Excellent @piharrison

Re: SolarWinds High Availability - HA - In a WAN Environment!!!

 

Thwack Postings:

Let's talk about Solarwinds High Availability.  Do you want HA?  Why?  Do you have it?  Why'd you get it?  Does it work well?rschroeder

HA Deployement Summary Red On Additonal

 

Success Center

Which IP address is used as the source? - SolarWinds Worldwide, LLC. Help and Support

Required DNS Permissions to set up a High Availability Pool and access Microsoft DNS - SolarWinds Worldwide, LLC. Help a…

Grant non-domain administrator account rights for IPAM DNS Monitoring - SolarWinds Worldwide, LLC. Help and Support

Required DNS Permissions to set up a High Availability Pool and access Microsoft DNS - SolarWinds Worldwide, LLC. Help a…

How to update multiple DNS servers when failover occurs - SolarWinds Worldwide, LLC. Help and Support

 

User Ideas

Can we have HA VIP used as source IP for Polling all nodes

 

Virtualization Help

https://www.virtualizationhowto.com/2016/02/force-primary-outgoing-ip-address-windows-server-2012-r2/

 

Microsoft

SQL Server Multi-Subnet Clustering (SQL Server) | Microsoft Docs

 

 

More to come.....

Feel free to bookmark for future reference

Poller Checker Tool - SNMPv3 Error

$
0
0

Hello,

 

I'm wondering if anyone has seen this error.

 

I need to change the OID that is being polled for my Cisco ISR 4321's for CPU monitoring.  By default it goes with n.Cpu.SNMP.CiscoGen3 and I need CiscoGen1.  I used to be able to do this with the poller checker tool but I get this error when I click Detect Pollers:

 

"SNMP Version 3 Authorization Requested, but no/short AuthKey is set"

 

SolarWinds is polling these devices using SNMPv3 properly.  I've tried changing the SNMPv3 auth and priv keys.  Deleted/re-added the devices.  I can't figure it out.  I know this worked before, but I think has stopped working since upgrading to NPM 12 or 12.1.

 

Here's what I'm currently running:

 

Orion Platform 2017.1.3 SP3, NCM 7.6, CloudMonitoring 1.0.0, NPM 12.1, DPAIM 11.0.0, QoE 2.3, VIM 7.1.0, SAM 6.4.0, NetPath 1.1.0

 

I have a ticket open with support, but they are not helping. 

 

Has anyone seen this?

Nodes showing down in NPM but are pingable from local user machine

$
0
0

I have a couple of nodes in NPM which are shown as down. But when I try to ping them from my machine, they are pingable and I also checked with the local IT on site, and they informed me the devices are up and running. I have already tried below links:

 

https://support.solarwinds.com/Success_Center/Network_Performance_Monitor_(NPM)/Knowledgebase_Articles/Node_marked_as_down_but_can_be_pinged_from_the_NPM_server

 

A node is showing as down but you can ping the same node successfully from the NPM Server - SolarWinds Worldwide, LLC. H…

 

But unfortunately that has not resolved the issue. I also tried to delete the nodes and re-add it, and when I re-add the nodes they are UP for a while, but after some time they automatically are again shown as down.

 

Can anyone please provide me a solution to resolve this issue?

Reset Polling/Database Retention Settings

$
0
0

Is there an easy way to reset these items to the defaults?


Where are Orion default setting for Temp ?

$
0
0

Hi

 

I can not find the "Temperature default setting" in NPM 12.2

:-(

 

Any one ?

 

 

 

 

 

Custom Table Status Icon

$
0
0

I created a new dashboard for my Helpdesk recently. I included a Custom Table that displays the top circuit utilization. I formatted the table layout to show the green status icon. Would it be possible to customize the status icon so that it turns red after the circuit hits a certain threshold? I'd like it to turn yellow when it hits 90% and red at 95%. Thank you.

 

Custom Table

 

Custom Table Layout

Report for Disk space in last 30 days

$
0
0

Hello, I am trying to create a report that will allow me to select a date range for what the disk volume was overtime, specifically the last 30 days. I want to see the trend of how the volume grew in that time range.

Hi Guys

Is it possible to use a variable in the recipient email address?

$
0
0

We would like to setup a single alert that monitors a couple of common conditions. When one of those conditions are exceeded we would like to email a list of people who are stored in a customer property. Is it possible?

Volume property not displaying in Alert Manager

$
0
0

I am running NPM 12.3 and am trying to setup low disk space alerts but there is no "Volume" property to alert on in Alert Manager. The only item that has volume in the text is related to the polling engine. There is a default alert about projecting disk space utilization that uses a SQL query but this does not do what I need (and returns values on RAM vs fixed disk as well).

 

Are others on 12.3 able to do this and see the Volume property that I am used to in prior versions?

Are your Orion server and SQL database server in the same Active Directory domain?


Discovery Log location

$
0
0

C:\Documents and Settings\All Users\Application Data\SolarWinds\Discovery Engine

When doing a Network Discovery it will fail and I'm told to check the Discovery Log. Had a hard time finding it.

I don't have a fix for the problem yet but here is the log.

 

Mike

VPN Tunnel Monitoring

$
0
0

I am trying to monitor VPN tunnels up/down status.  Can anyone tell me the best way to do this with the Solarwinds Product?  We are trying to avoid just pinging a server on the far side of the VPN tunnel as it is a third party device and we do not particularly care if their server is up and running, only that the tunnel is up.  I already have the Universal Pollers for the Cisco ASA that tells us the number of active tunnels, current connections, and the number of sessions. 

 

If using the cikeGlobalActiveTunnels OID is a good way to monitor active VPN's, how can I configure alerting if a critical VPN is no longer active?  The issue here is that not all of the VPN sessions are critical nor are they up all the time, so how can I differentiate the critical VPNs that should be up 24/7 from the ones that are only used periodically throughout the day?

 

I am aware of using the IP SLA Monitor option to ping an IP on the other end as well.  I am thinking that this may be my best bet, but I wanted to pose this question to the community just to be sure.

 

Thanks. 

Multiple Product Upgrade And Migration - NPM 12.1 - 12.3, SAM 6.4 - 6.7, NCM 7.6 - 7.8, And NTA 4.2 - 4.4

$
0
0

Please Help!!!

According to Upgrade and Migration Guide, I have to migrate the Engine and SQL servers to Windows 2016 and minimum SQL server 2016.

 

Current Config
SAM 6.4
NCM 7.6
NPM 12.1
NTA 4.2

Existing Servers
Engine
Windows 2008 R2

Database Server
Windows 2012
SQL Database 2014

New Servers
Engine
Windows 2016

Database Server
Windows 2016
Windows SQL Server 2016

I was wondering if someone can help with the migration steps that I need to take in order to upgrade all of the applications to the latest software.

SAM 6.7
NCM 7.8
NPM 12.3
NTA 4.4

 

Keep in mind that we have to migrate to new servers for the pollers and the SQL database.

Solarwind Alert integration app for servicenow

$
0
0

Hi All,

i have integrated servicenow with solarwinds for automatic incident generation with help of Solarwinds alert integration app.

Now i have a requirement where in i have modify the create ACL of incident table for this custom application on servicenow side to restrict creating a record in incident table from mobile interface.

i can have two options either i have to restrict the acl for mobile interface or i have to provide the required role in ACL to allow app to create a record in incident table.

now i am struglling as i can not restrict the custom ACL for this app but i can add the role but i am sure about which kind of access app needed to create a record in incident table.

Can anyone help me on this?

Potential ServiceNow Integration Bug Case # - 00162702

$
0
0

Currently, our ServiceNow SolarWinds integration is functioning to an extent but there is one major issue with it.

 

It changes State values in most cases as it should but I've been able to replicate our issue by doing the following

I bring node down, creates an incident if there isn't one

I bring Node back up, sets incident state to monitoring

I bring the Node back down, sets incident state to active

I bring Node back up, sets incident state to monitoring

 

Everything is functioning fine up until this point

So lets say we have verified functionality and their monitoring period is up

 

I set the incident state to resolved (value 6) within ServiceNow

I bring the Node back down, it sends over work notes but it does not send a state value with it

the payload within the Incident Integrations module shows

{"sys_id":"<incident sys_id>","work_notes":"Orion alert has been reactivated."}

with <incident sys_id> being the incidents sys id

 

I figured that it might be an issue with the orion integration account not being able to update incidents in a resolved state

to test it, i attempt to bring the node back up, it changes the state just fine on over to monitoring

 

I figure maybe its an issue with solarwinds attempting to update state immediately following someone manually updating state on the incident (maybe it throws it into an acknowledged state and following that, it doesnt update state)

Made sure that Do not execute this action if the alert has been acknowledged already (Recommended) was unchecked so that it will continue to update the incident following (even though it seems to update fine on node going back up using the same action)

Attempted to see if I could get it to change state from resolved immediately following manual resolve using same action

 

Brought node down, sets state to active

Manually change state to resolved in ServiceNow

Brought node back up, sets state to Monitoring successfully

 

Issue is with Reopen state specifically following a manual change within ServiceNow to Resolved

Reopen state works otherwise and if i bring node back up and back down, it will set state normally.

It is only immediately following doing a manual update changing the state to resolved that it does not send over an updated state value

 

I had thought that maybe it could be the Acknowledge State that is playing with it a bit but i set the Acknowledge State to active from resolved and was still getting the exact same issues

 

I ran a search within the incident integrations module to see if it ever was able to update the state from resolved to something else using that same action

query goes something like this

comment contains state: "6"

comment does not contain incident_state: "6"

payload contains Orion alert has been reactivated

 

I get 10 results but these are for things running different actions

In the actions, instead of setting incidents to monitoring, they set them to resolved when node goes back up

When they go back down, it is updating to active just fine meaning that its not an issue going from resolved to active using the Reopen state field

 

So the issue is narrowed down to,

following a manual modification of the state field, Reopen state fails to include state key value pair in the message it sends to update the incidents state

 

Any assistance would be greatly appreciated with this issue!

We opened a case number with support a few weeks back and recently, they suggested posting here for assistance along with including the case number

This is a huge issue because when nodes go back down following being resolved, no one is being notified because the state is not changing back to active

Viewing all 21870 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>