Channel: THWACK: All Content - Network Performance Monitor
Viewing all 21870 articles

SQL Server 2016 Support


Duplicate entries and missing entries


I’ve been noticing some erratic behavior in SolarWinds. I’ve seen one of these issues before.  The issues I’m seeing:

 

  1) For some reason SolarWinds is reporting the same interface twice and has dropped a similarly named one, i.e. 1/0/2 is duplicated and 1/1/2 is missing even though it has been added to SW.  Please see the pictures below.

 

Particular interfaces in question via HTC000JR01. 

sw1.PNG

 

I was able to add these to the SW map successfully but now 1/1/2 has disappeared and has been replaced with a duplicate 1/0/2 even on the backend Network Atlas Map.

Network Atlas Map.

sw2.PNG

 

Manage Nodes section of SW after clicking the plus sign next to HTC000JR01 to expand the added interfaces.  As you can see there are duplicate entries for 1/0/2.

sw3.PNG

 

Me performing a search for 1/1/2 from the Manage Nodes section of SW after clicking the expand symbol next to HTC000JR01.  Just to prove I’m not overlooking it.

 

 

This is what is produced when I click the checkbox next to HTC000JR01 and click on List Resources.  You can see that there is only 1/0/2 and 1/1/2 that have been added to SW.  I ran through this again and it still shows the duplicate 1/0/2 and NO 1/1/2.

SW4.PNG

 

 

List resources for HTC000JR01

 

  2) I will input information into Custom Properties for either interfaces or nodes and click Submit. Once I’ve been redirected to the node or interface page, it displays my changes. However, if I close the window and navigate back to that node or interface, the information is no longer saved and I have to submit it again.  This doesn’t happen every time, but it happens far more than it should.  I’m having to double my efforts in order to add circuits.  I’ve seen this issue in the past.  I’m unable to accurately and successfully complete the monitoring tasks for HTC-2016-0006 due to issue #1.

NPM 12 Message Center row color missing


For those who have upgraded to NPM 12 and are using Message Center, you may have noticed that the row background color no longer colors the entire row.

 

Get a copy of this change plus more here: NPM 12 Message Center mod

 

I have a jQuery code update that can be added to the file C:\inetpub\SolarWinds\Orion\NetPerfMon\OrionMessages.aspx

This code will update the background color for the entire row.

 

At the bottom of the file there is a section that starts:

    <script type="text/javascript">

 

Add this code right after that line and the entire row will once again be colored.

 

        <script type="text/javascript">
            // Copy each event icon's background color onto its whole table row.
            $('div.event-icon').each(function () {
                var $tr = $(this).closest('tr');
                var $color = $(this).css('background-color');
                $tr.css('background-color', $color);
            });

 

Here is the before and the after:

Before:

 

After:

Modify Alerts To Be Smarter With Node Thresholds


Here are some changes that I made to my Orion alert engine to better handle unique alert thresholds per node. I wish these were already set like this out of the box.

 

When you edit a node, there is a threshold section at the bottom of the page that lets you change that single node's thresholds based on your needs. This makes sense in several cases, for example where CPU at 50% is already critical for a server, or where a remote server has a poor connection and a 300ms response time is acceptable.

 

This requires you to modify the respective alerts to NOT use a static value but to reference the value set per node instead.

 

When you look at available alert variables there are several starting with Warning Value Reached.... or Critical Value Reached..., each with their respective category.

2016-10-13 09_44_29-Program Manager.png

 

 

What I did on each alert was remove the static value and replace it with the corresponding Warning/Critical Value Reached variable. This allows nodes to still receive whatever is set for the global threshold values, while letting me modify nodes on a one-off basis and still leverage the same alert.
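
The effect of swapping a static value for the Warning/Critical Value Reached variables can be sketched as a lookup that prefers a node's own override and otherwise falls back to the global threshold. This is only an illustration in JavaScript: `GLOBAL_THRESHOLDS`, `effectiveThreshold`, `evaluate`, and the node objects are hypothetical names, not part of Orion, and the threshold numbers are invented for the example.

```javascript
// Illustrative only: these names and numbers are assumptions, not Orion internals.
const GLOBAL_THRESHOLDS = { responseTimeMs: { warning: 150, critical: 250 } };

// Return the node's own threshold for a metric if one is set, else the global one.
function effectiveThreshold(node, metric) {
  const override = node.thresholds && node.thresholds[metric];
  return override || GLOBAL_THRESHOLDS[metric];
}

// Classify a polled value against whichever threshold applies to this node.
function evaluate(node, metric, value) {
  const t = effectiveThreshold(node, metric);
  if (value >= t.critical) return "critical";
  if (value >= t.warning) return "warning";
  return "ok";
}

// A remote server with a per-node override, and a server using the global values.
const remoteServer = {
  name: "remote-01",
  thresholds: { responseTimeMs: { warning: 350, critical: 500 } },
};
const localServer = { name: "local-01" };

console.log(evaluate(remoteServer, "responseTimeMs", 300)); // ok
console.log(evaluate(localServer, "responseTimeMs", 300));  // critical
```

The same 300ms reading is acceptable for the overridden node and critical for the node on global values, which is the behavior a single shared alert gets once it references the per-node variables.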

 

Response Time

2016-10-13 09_19_21-Edit Alert - _ SMS_Slack Alert me of High Response Time (custom)_.png

 

Interface Utilization: I added both receive and transmit values into the same alert. This was just my preference; by default there is one alert for receive and another for transmit.

2016-10-13 09_20_31-Edit Alert - _ SMS_Slack Alert me of Interface issues (custom)_.png

 

Packet Loss

2016-10-13 09_21_25-Edit Alert - _ SMS_Slack Alert me of Packet Loss (custom)_.png

 

CPU

2016-10-13 09_22_46-Edit Alert - _ SMS_Slack alert me when CPU load has an issue (custom)_.png

 

Memory

2016-10-13 09_23_47-Edit Alert - _ SMS_Slack alert me when Memory load has an issue (custom)_.png

 

 

 

I've put in a feature request for disk/volume alerting to also have warning/critical value reached. Those variables currently do not exist.

 

If you've found this useful, please rate this article.

Add Top Conversations to Alert Email


I have an alert set up so that any time one of our data T1s reaches >80%, an email is sent. I want to add the top 10 conversations from NTA to the email.

 

Is this possible?

Minimize down time due to database migration


Hi all,

 

Going to be migrating our database from one drive to another on the same server, and looking to minimize downtime. I'm wondering what others have done to reduce downtime? Only thought I have is to make a backup, transfer this over to the new drive, detach the current database and then restore from the backup, but that seems like a typical procedure.

 

 

Also worth mentioning, I know Solarwinds had an application that imported the list of nodes with node name and IP address and provided basic monitoring (ping), does anyone know if that is still around?

 

 

Any thoughts?

 

 

Thanks!

Two Factor Authentication


Does the latest version of NPM now support 2FA, such as SecurID, Vasco, etc.?

Two Factor authentication


Can NPM comply with two-factor authentication using a RADIUS or TACACS server?


Monitoring Via ILO


We have a server that is basically standalone for security reasons. However, it failed recently, and it was found that both hard drives had failed, probably over a period of time.

 

The only way our security team will entertain connecting this server to our network is via its ILO interface. I was wondering if anyone has ever tried hardware monitoring via the ILO interface and, if so, how you did it, specifically as this requires a logon to the ILO and then a further connection to the server from there.

 

Regards,

 

John

NetPath not showing hops


NetPath from my Solarwinds server is working great. I have two NetPath probes on Windows servers, pointed back to Solarwinds from remote sites. One works great, the other doesn't. The one that doesn't work shows no hops. It shows itself, Solarwinds, and nothing in between. A tracert from the server shows three hops in between, as expected.

 

Anyone know why? Not sure what other info I can provide.

 

The only difference is that pushing out an agent to this particular server failed, and I installed it manually as agent-initiated.

Accessing web console using HTTPS

Error when installing the NPM trial version


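When deploying NPM, do all of the application ports need to be opened, and should they be opened on the database server or on the NPM server?

After installation, the Orion Web Console cannot be opened and shows the error below. Thanks.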

8.JPG

Hardware requirements for additional polling engine using several modules?


Hello everyone, we are considering adding another additional polling engine to our SolarWinds environment, and we are re-evaluating our hardware needs for additional pollers. I am having a difficult time locating hardware requirements for this for more than just 1 particular module. We are running the following modules:

 

NPM 12.0

IPAM 4.3.2

SAM 6.2.4

DPA 10.0.1

NCM 7.5

IVIM 2.1.2

QoE 2.1.0

NetPath 1.0

NTA 4.2.0

UDT 3.2.4

 

We currently have 2 licenses per server, and all of the servers in our SolarWinds environment are virtual, if that matters. I would also like to confirm whether 12,000 total elements per additional poller is the maximum we should allow, and whether there is a lower number than 12,000 at which we should consider moving nodes to an additional polling engine. We currently do not have any nodes that we no longer need to monitor, and the polling rate on one of our additional pollers has been hovering at 60% with just under 9,000 total elements, so we are trying to be proactive before it reaches 85% of its polling rate and we have to scramble to come up with a proper resolution.
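
As a rough planning aid, the figures above can be extrapolated. The sketch below assumes the polling rate grows linearly with element count, which is an assumption made here for illustration and not a documented SolarWinds rule; `projectElements` and `projectRate` are hypothetical helper names.

```javascript
// Assumption: polling rate scales linearly with the number of elements.

// Estimate how many elements correspond to a target polling rate.
function projectElements(currentElements, currentRatePct, targetRatePct) {
  return (currentElements * targetRatePct) / currentRatePct;
}

// Estimate the polling rate at a given number of elements.
function projectRate(currentElements, currentRatePct, targetElements) {
  return (targetElements * currentRatePct) / currentElements;
}

// Figures from the post: just under 9,000 elements at about 60%.
console.log(projectElements(9000, 60, 85)); // 12750
console.log(projectRate(9000, 60, 12000));  // 80
```

Under that assumption the poller would reach 85% at roughly 12,750 elements, and 12,000 elements would sit near 80%. Real pollers may not scale linearly, since polling intervals and element types differ, so treat the numbers as a starting point only.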

 

I apologize if I'm leaving out any pertinent information in this-- I am a SolarWinds newbie and at the very beginning of the learning curve here. If there is anything I left out please let me know and I'll be happy to provide that information. Thanks in advance!!

Basic Email Acknowledgment


Can I have an acknowledgement link in an email alert which doesn't require an NPM account? Or is there a way I can automate the authentication?

 

We currently have hundreds of operations staff, and creating accounts or even groups of accounts isn't really an option at this stage.

 

Thanks

NPM Top XX Nodes by Percent Memory Using Dynamic Thresholds


Hi all,

 

I have a dashboard showing the "Top XX Nodes by Percent Memory Used" resource. However, as we have a lot of SQL databases, the list is mainly populated by these (as they use a lot of RAM).

 

 

I have created dynamic thresholds for all of these servers so that the memory used isn't in a warning state. Is there a way I can filter the results so it only shows servers that are exceeding their memory thresholds?

 

 

i.e. Server 1's RAM threshold is set to 93% warning and 94% critical; if RAM is at 92% it won't show on the list, and if it's at 93% it will?
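
The filter being asked for amounts to comparing each node against its own warning threshold instead of ranking by raw usage. A sketch of that logic follows; it is not a setting of the Top XX resource, and `exceedsOwnThreshold` and the node records are hypothetical.

```javascript
// Hypothetical node records: current memory use plus that node's own thresholds.
const nodes = [
  { name: "Server 1", memoryPct: 92, warningPct: 93, criticalPct: 94 },
  { name: "Server 2", memoryPct: 93, warningPct: 93, criticalPct: 94 },
  { name: "Server 3", memoryPct: 81, warningPct: 80, criticalPct: 90 },
];

// True when the node has reached its own warning threshold.
function exceedsOwnThreshold(node) {
  return node.memoryPct >= node.warningPct;
}

// Keep only offending nodes, highest memory use first.
const offenders = nodes
  .filter(exceedsOwnThreshold)
  .sort((a, b) => b.memoryPct - a.memoryPct);

console.log(offenders.map((n) => n.name).join(", ")); // Server 2, Server 3
```

Server 1 at 92% stays off the list because its own warning level is 93%, while Server 3 appears at only 81% because its threshold is lower.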


Bug with custom filter in /Orion/NetPerfMon/Events.aspx


We witnessed a bug in the /Orion/NetPerfMon/Events.aspx page (in both 12.0 & 12.0.1).

 

When filtering in the events page, using the custom filter, the time stamps used do not match.

 

 

We suspect that the custom filter does not take into account the difference between local time and GMT.
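
The suspected cause can be illustrated with a small sketch. It assumes, as suspected here but not confirmed, that event timestamps are stored in GMT while the custom filter bounds are entered in local time; `toUtcMs`, `filterEvents`, the GMT+2 offset, and the sample event are all hypothetical.

```javascript
// Convert a local wall-clock time (encoded as if it were UTC) to real UTC,
// given the local offset from GMT in minutes (e.g. +120 for GMT+2).
function toUtcMs(localWallClockMs, offsetMinutes) {
  return localWallClockMs - offsetMinutes * 60 * 1000;
}

// Keep events whose UTC timestamp falls inside [fromMs, toMs].
function filterEvents(events, fromMs, toMs) {
  return events.filter((e) => e.utcMs >= fromMs && e.utcMs <= toMs);
}

const offsetMinutes = 120; // hypothetical site at GMT+2

// One event stored at 08:30 GMT, which is 10:30 local time at this site.
const events = [{ id: 1, utcMs: Date.UTC(2016, 9, 14, 8, 30) }];

// The user asks for 10:00 to 11:00 local time.
const fromLocal = Date.UTC(2016, 9, 14, 10, 0);
const toLocal = Date.UTC(2016, 9, 14, 11, 0);

// Naive filter: local bounds compared directly against GMT timestamps.
const naive = filterEvents(events, fromLocal, toLocal);

// Corrected filter: bounds converted to GMT before comparing.
const corrected = filterEvents(
  events,
  toUtcMs(fromLocal, offsetMinutes),
  toUtcMs(toLocal, offsetMinutes)
);

console.log(naive.length);     // 0
console.log(corrected.length); // 1
```

If the page behaves like the naive filter, results would appear shifted by exactly the local offset from GMT, which is one way to check the theory.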

 

Doing the same thing thru the Message Center is working fine.

NAM vs NOM -- What am I missing?


I usually delete SolarWinds sales emails.  Sorry.  It's the truth.  It's not that I don't want to read them it's just that I usually have my fingers on the pulse of things via Thwack and other things.  So today when I got an email entitled "Two New Enterprise-Class Solutions From SolarWinds" I hesitated before hitting that delete key.  And, wow, I'm glad I did.

 

NAM is Network Automation Manager (Network Automation Tool | Solarwinds )

NOM is Network Operations Manager (Network Operations Management | Solarwinds )

 

As far as I can tell...

 

NAM (Network Automation Manager - SolarWinds Worldwide, LLC. Help and Support ) is a bulk license for NPM, NTA, VNQM, IPAM, NCM, UDT and SolarWinds HA. 

 

NOM (Network Operations Manager - SolarWinds Worldwide, LLC. Help and Support ) is a bulk license for NPM, NTA, UDT and VNQM.

 

At first I thought this was a marketing gimmick of some sort.  Don't get me wrong, I absolutely think that owning NPM, IPAM, NCM, etc. is essential to a NOC, but this is the part that got me.

 

2016-10-14 13_15_49-Two New Enterprise-Class Solutions from SolarWinds - jbiggley@gmail.com - Gmail.png

 

Given that we struggle with the size of our environment today, growth to "hundreds of thousands of elements" has me totally intrigued.  However, I am wondering if we aren't being toyed with a little here.  If you have 6 products each monitoring 25,000 or so of their respective elements (see Orion multi-module system guidelines for a SolarWinds NPM installation - SolarWinds Worldwide, LLC. Help and Support for deployment guidelines) then we would be monitoring "hundreds of thousands of elements".

 

Which Product Manager wants to field this clarification?  cobrien? aLTeReGo?

Nodes not responding to SNMP or WMI


There are times when the clients' device stops polling for whatever reason.  This could be an issue with the device or a change in credentials.  Almost all clients I have been involved with are not aware that the polling has stopped.

 

There is a simple way of noticing this, which is by looking at the timestamp of the CPU polling.  If it is more than 35 minutes behind the current time, the node is having an issue.

 

Here is the report for it:

 

SELECT n.Caption AS Node_Name, n.ip_address AS IP_Address, n.ObjectSubType AS Poll_Type
, CAST(DATEDIFF(day, MAX(c.datetime), GETDATE()) AS varchar) + ' Day(s) ' + CONVERT(char(8), DATEADD(second, DATEDIFF(second, MAX(c.datetime), GETDATE()), 0), 14) AS Duration
, DATEDIFF(mi, MAX(c.datetime), GETDATE()) minutes_since
FROM Nodes n
INNER JOIN CPUload c ON c.NodeID = n.NodeID
WHERE n.status = 1 AND (n.ObjectSubType = 'wmi' OR n.ObjectSubType = 'snmp')
GROUP BY n.Caption, n.StatusDescription, n.ip_address, n.ObjectSubType
HAVING DATEDIFF(mi, MAX(c.datetime), GETDATE()) > 35
ORDER BY minutes_since DESC

 

Reporting is nice, but a better way to notice this is by creating an alert for it, so it can be resolved in a timely manner.  For the alert, you would need to use a custom SQL query:

 

SELECT nodes.NodeID, nodes.caption FROM Nodes
INNER JOIN CPUload c ON c.NodeID = nodes.NodeID
WHERE nodes.status = 1 AND (nodes.ObjectSubType = 'wmi' OR nodes.ObjectSubType = 'snmp')
GROUP BY nodes.Caption, nodes.nodeid
HAVING DATEDIFF(mi, MAX(c.datetime), GETDATE()) > 35
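
The rule both queries apply, flagging a node whose newest CPU sample is more than 35 minutes old, can be sketched outside SQL as follows. This is an approximation of the DATEDIFF(mi, ...) comparison for illustration only; `minutesSince`, `staleNodes`, and the sample records are hypothetical, and only the 35-minute cutoff comes from the queries.

```javascript
const STALE_AFTER_MINUTES = 35; // cutoff used in the queries

// Whole minutes elapsed between the last poll and now.
function minutesSince(lastPollMs, nowMs) {
  return Math.floor((nowMs - lastPollMs) / 60000);
}

// Return nodes whose most recent CPU poll is older than the cutoff.
function staleNodes(nodes, nowMs) {
  return nodes.filter(
    (n) => minutesSince(n.lastCpuPollMs, nowMs) > STALE_AFTER_MINUTES
  );
}

// Hypothetical sample: one node polled 10 minutes ago, one 50 minutes ago.
const now = Date.UTC(2016, 9, 14, 12, 0);
const sample = [
  { caption: "core-sw-01", lastCpuPollMs: now - 10 * 60000 },
  { caption: "app-srv-02", lastCpuPollMs: now - 50 * 60000 },
];

console.log(staleNodes(sample, now).map((n) => n.caption).join(", ")); // app-srv-02
```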

 

Using both the report and alert will make sure you are getting data from all nodes and avoid the embarrassing situation when a server crashes due to high CPU and the boss comments - "I thought that SolarWinds was monitoring this".

 

 

Thanks

Amit Shah

Loop1 Systems

APC rPDU

OSPF Neighbor down Alert


We are using NPM 11.0.1 monitoring OSPF neighbor events on our Cisco equipment. Several times now this event was logged:

 

12/5/2014 2:22 AM  The neighbor 10.225.3.198 on Node ourhost.com went down.

 

When checking the switch for corresponding neighbor events on that date and time, none were found. Why is this alert being generated in Solarwinds when the L3 switch/router does not log any OSPF events?

 

Any help would be appreciated

 

Thanks!!


