Is there a way to keep the Nodes with Problems widgets to stop listing nodes with unknown interfaces when those unknown interfaces are all NTA Virtual Interfaces? It's cluttery and not at all useful since those interfaces can't be anything BUT unknown... or can they? (I don't think they can)
Nodes with problems showing nodes with NTA Virtual Interfaces
Axis Network Cameras
Has anyone had any experience using NPM to monitor Axis network cameras? They show up with the correct 'Machine Type' (Axis Communications AB), but are grouped by vendor under 'Unknown' with an APC logo??
I've tried using the MIB Browser and MIB Walk tools and they seem to identify the device correctly when given it's ip address and SNMP community string, but I dont see any vendor information in the MIB...any help would be GREATLY appreciated.
Thanks!
Ryan
SQL database doubled after upgrade from 2012 R2 to 2016
We ran into an issue we have a fairly new issue. I recently upgraded SW to the newest versions. In doing so, we needed to migrate our SW database from 2012 R2 over to 2016. When we did that and our database doubled. It was around 12 - 13 GB and now its about 25 - 330 GB. IF anyone has any suggestions or reasoning on this please let me know.
Solarwinds is still not stable
The other thread is closed so I figured I would start a new one I usually get more help here than actually contacting support.
So same issues as before but instead of the server not responding in 36 hours or so it took maybe a week but it is the SAME issues.
1. Server stopped sending alerts out sometime around 11AM on the 4th.
2. Logged onto server and opened Orion service manager and both the module engine and the administration service were going back and forth between running and stopping.
3. Orion could not connect to SQL
4. I have some alerts that at are going out but not sure if they are legit or not.
5. After the reboot I notice that a good chunk of my nodes interfaces are 'unknown' this looks like it fixes itself but again something else going on.
I have applied the 'hotfix' that you all pushed out to try to fix this.
I have done the change from streaming to buffered
I have done the registry change for the ports
The only thing I have not done is revert the snap shots back to June 14th prior to the update so Solarwinds is stable again.
At this point I am going to schedule a task in VM Ware to reboot the server every night. That is pretty much the only way I will know Solarwinds will actually work.
NPM/SAM Cert
Hello Thwack!
Anyone took before the certification for NPM/SAM? Understand from Solarwind that only online training.
Is there any documents can focus more on the exam for NPM/SAM?
Thanks!
NPM and Agent
I have NPM in my environment and not SAM. We use our Orion Environment for Network Infrastructure Management.
I do understand that with the agent that the NetPath feature becomes available.
My problem is that I loss any interface monitoring on a device where the agent is used.
So here are my questions -
- Have I miss configured the Agent in that it is not providing interface information?
- Within an NPM environment what is gained by using the Agent?
Thank You for your time and help
O365 MFA and Solarwinds alerting
Trying to configure email alerting from solarwinds with o365
So I have configured details of the for O365 in Solarwinds although it hasn't worked and I need to know how to configure 0365 account
We use MFA with o365, do I need to remove this from the sending account?
that have other people done?
Configuration Wizard error "No valid package combination was found for current system"
Running Versions:
Orion Platform 2016.2.100
NPM 12.0.1
SAM 6.3.0
NCM 7.5
NTA 4.2.0
IPAM 4.3.2
Configuration wizard will not complete due to the error "no valid package combination was found for current system" on 1 of 7 APE's.
The only info on resolving the issue points to PackageCleaner.exe which is contained in Orion Platform 2017, I'm not prepared to upgrade the total infrastructure at this time.
I have uninstalled all software from the APE and reinstalled and the error continues what am I missing has anybody run into this and have a resolution? I do have a support case started 1181939 the initial response was use the PackageCleaner.
Any thoughts or idea's would be greatly appreciated.
Thanks Tom
Changing polling method from ICMP to SNMP using SWIS
Hi All,
Working on a PowerShell script to import nodes into NPM but due to a business need i need to add them as ICMP and then update to SNMP once credentials have been added to the device.
Within SWQL studio i can update the ObjectSubType, Community and SNMPVersion for a V2 device but looking for a SWIS call to do this in my script, V3 seems a little more tricky.
Has anyone done this and can share some pointers or code that would help?
Thanks
Question for the community... Need your help with some issues I'm having.
Hi all,
So I have the perfect storm of issues I've been weathering for nearly 2 years now with no resolution. I was wondering if anyone had these issues and if you could share some tips that might help guide me in the right direction. Between unstable environment and early morning calls telling me the environment is down I have been living in stress and haven't been able to sleep yet. Have several tickets with support but yet unable to resolve.
1. Duplicates and triples in the environment. What I mean by this is for example I'll have one device three time with three different ip's. Or the other way around 3 devices 3 times added with three separate ips. Still haven't found a way to pull this on a report to go fix these devices.
2. Monitoring for snmp and wmi failures. It seems like creating a SAM template would be the best way to go. Can anyone confirm? Simply what I'm trying to do is create a way that solarwinds can send me an email when a device stops polling snmp or wmi.
3. Overloaded SAM. So with close to 300 sql's in appinisight for sql with about 2 to over 50 db's per server. It easily overloaded SAM in component count. What's a more efficient way to monitor sql? Suggestions welcomed.
4. Performance issues. This seems related to disk performance but I have no way to figure out what is the root cause.
5. data integrity in the database. I don't know how to run checks for integrity. and how to make sure I don't have corruption happening.
6. pollers all hanging due to collector and business layer peaking cpu and ram.
These are the top six pressing issues. Any help welcomed.
Index...on table...has fragmentation 90.00% and should be defragmented manually
Is this still the solution for SQL Express?
I have a SQL express server. I followed the instructions up to the part where it states in step 4, "Right-click any table and select Query table." Each table already has it's default or own query. If I erase that and execute the query in this process, what will happen to the deleted query? Will the table keep the new query I just executed for defragging? I just want to make sure I don't break anything by doing this process because I am not a Database guy. Please advise.
Hotfix 3 issue
So I have a case open but haven't got a response yet on it -
So I went to do the hotfix 3 install as the instability issues are becoming an issue with some members of my teams -
Ran the hotfix and then the configuration wizard and got almost done and tripped an error message -
so clicked finished and then started up the services - all came back up no issues but the web site no longer exists in the specified path -
Figured i would then go follow the uninstall directions in the notes but the hotfix is not listed in add/remove - I do see updated things from this morning but wasn't sure if I should remove them and then rerun the config wizard .. do not want to make things worse then they are at this moment
Thoughts ,
Fitz
Repetitive Email Alerts (Noise) - MUST HAVE REPORT!!!
< BACK TO TO THE MAGIC OF SQL SCRIPTS SERIES LIST
My Dear Thwackers,
If you are sick and tired of having to deal with continuous noise from hundreds of alerts and customer complaints due to missed problems - you must have this report, now!
Fact: 80% of your alerts and frustrations come from 20% of stuff within your infrastructure!
It is quite obvious that most of the negative impact on performance and productivity within the NOC is being caused by noise and repetitive alerts that is just keep coming through... non-stop. Ability to highlight them in a most efficient manner has never been easy, until now
I am here to offer you a gift - a custom made report that has served us for years in attacking and eliminating all the noise. It is compatible with Orion Platform 2015.1.2, NPM 11.5+ (also works with latest NPM 12)
Download > Import > Run > Action
... and just for fun - please publish a screenshot in the comments below of the top 5 stats that you will get- it is very interesting to know how this will play out in your environment
To your monitoring success,
Alex Soul
[UPDATES]
[30/08/2017]: Below we have discussed some bug fixes around extracting EmailTo from description. In the attached report this workaround was implemented, so, it should work for you straight out-of-the-box
[31/08/2017]: I have added % column, which will calculate percentage from total umber of email alerts sent over the defined period. I have also updated they way variables are being defined, so, you can now just open SQL script and change them all in one place defined in SET function on top of the script
[04/09/2017]: As suggested by tdanner - there is a better way of extracting EmailTo and EmailCC. This has been implemented in version 1.3 of the report
NetPath Custom Query Widget
I use NetPath in a number of ways, but the foremost is to validate that connections to SaaS applications are within tolerances all around my environment. Having that information in a summary is very helpful.
NetPath already does that, but having a filter would be nice to only show a specific service. What would be more awesome would be being able to give me summary information over time instead of just showing the most recent.
With that in mind, I started digging into the API to find if I could get a better summary of this information. Much to my pleasure, I was able to find the information, but it was strewn across a few tables. So I had to dust off some old SQL tactics in my brain to get the information. (I'm terrible at hand-writing join queries and almost as bad when doing GROUP BY for calculating minimums/averages/minimums).
The two "tables" that I need to use are Orion.NetPath.ServiceAssignments and Orion.NetPath.Tests.
What I cam up with was a little bit of SQL magic, a little bit of SWQL magic, and exactly what I needed in a resource. I've added some comments in the script to show what I'm doing. Without further ado, here's the SWQL script:
SELECT [SA].ProbeName AS [Source] , [SA].ServiceName AS [Destination] , [SA].DetailsUrl AS [_LinkFor_Source] , CONCAT('/Orion/images/StatusIcons/Small-', [SI].IconPostfix, '.gif') AS [_IconFor_Source] -- This is the status for the most recent poll only , ROUND([Tests].MinLatency, 2) AS [Min Latency (ms)] , ROUND([Tests].AvgLatency, 2) AS [Avg Latency (ms)] , ROUND([Tests].MaxLatency, 2) AS [Max Latency (ms)] , CONCAT(ROUND([Tests].MinLatency, 2), ' / ', ROUND([Tests].AvgLatency, 2), ' / ', ROUND([Tests].MaxLatency, 2) ) AS [Min/Avg/Max Latency (ms)] , ROUND([Tests].MinPacketLoss, 2) AS [Min Packet Loss (%)] , ROUND([Tests].AvgPacketLoss, 2) AS [Avg Packet Loss (%)] , ROUND([Tests].MaxPacketLoss, 2) AS [Max Packet Loss (%)] , CONCAT(ROUND([Tests].MinPacketLoss, 2), ' / ', ROUND([Tests].AvgPacketLoss, 2), ' / ', ROUND([Tests].MaxPacketLoss, 2) ) AS [Min/Avg/Max Packet Loss (%)] FROM Orion.NetPath.ServiceAssignments AS [SA] INNER JOIN Orion.StatusInfo AS [SI] ON [SA].Status = [SI].StatusID INNER JOIN ( SELECT EndpointServiceID , ProbeID , MIN(Rtt) AS MinLatency , AVG(Rtt) AS AvgLatency , MAX(Rtt) AS MaxLatency , MIN(PacketLoss) AS MinPacketLoss , AVG(PacketLoss) AS AvgPacketLoss , MAX(PacketLoss) AS MaxPacketLoss FROM Orion.NetPath.Tests WHERE ExecutedAt >= GETUTCDATE() - 1 -- ExecutedAt is stored in UTC, so we use 'GETUTCDATE() - 1' to get last 24 hours only GROUP BY EndpointServiceID, ProbeID ) AS [Tests] ON [Tests].ProbeID = [SA].ProbeID AND [Tests].EndpointServiceID = [SA].EndpointServiceID WHERE [SA].ServiceName = 'Office 365' -- This is the NetPath Service Name as displayed on your NetPath summary page AND [SA].Enabled = 'True' ORDER BY [SA].ProbeName
Let me break it down a little bit. First off, we are joining up three tables here. The one that I didn't mention earlier is Orion.ServiceInfo. This table has the status "names" based on the numbers. I use them for pulling in the icons. This leads to a bit of SWQL magic.
SWQL has the ability to recognize icons and links, but only if they are formatted properly. This is where lines 3 and 4 from the above script come into play. If you have an element in a custom query called "Stuff", then you can give it a URL named "_IconFor_Stuff" and it will be displayed to the left of the label. Similarly, you can use the "_LinkFor_Stuff" to provide a clickable link. As a note for the future, many tables provide a "DetailsUrl" field that's very useful for this.
I'm also using some specific SQL-like constructs - specifically joining a table to a table and joining a table to a query (which returns table-like content). In lines 14 and 15, I'm joining in the Orion.StatusInfo table which provides the IconPostfix needed for line 4.
Later in lines 16 through 28 I'm making a query from Orion.NetPath.Tests where I summarize information over the last 24 hours. Note that in line 26 I'm using a comparison to only pull back minimums, averages, and maximums over the last 24 hours. You can change this, but be aware that it will take more time to execute the query. On a 10 minute probe (the default), each probe provides 60 minutes / 10 minutes per test * 24 hours = 144 tests / probe. Multiple this by the number of probes (4 in my environment) means the query has to summarize 576 entries into six total statistics.
Thankfully SWQL is highly optimized. As an example me running this full query takes 0.023 seconds. If I change the number of days on line 26 to 7 (show me the last week), then it runs in 0.59 seconds. Like I said, the SWQL is highly optimized.
So what's all this look like? Just edit a page and add a Custom Query widget and paste in the above query (changing line 31 to match one of your own NetPath Services) and you should get something very pretty.
I've elected to show Min/Avg/Max in multiple ways, but in a production environment, I probably only care average. To get just the average to display, just comment out (or delete) the unneeded elements. In SQL (and SWQL) the double-dash indicates a comment.
SELECT [SA].ProbeName AS [Source] , [SA].ServiceName AS [Destination] , [SA].DetailsUrl AS [_LinkFor_Source] , CONCAT('/Orion/images/StatusIcons/Small-', [SI].IconPostfix, '.gif') AS [_IconFor_Source] -- This is the status for the most recent poll only -- , ROUND([Tests].MinLatency, 2) AS [Min Latency (ms)] -- , ROUND([Tests].AvgLatency, 2) AS [Avg Latency (ms)] -- , ROUND([Tests].MaxLatency, 2) AS [Max Latency (ms)] , CONCAT(ROUND([Tests].MinLatency, 2), ' / ', ROUND([Tests].AvgLatency, 2), ' / ', ROUND([Tests].MaxLatency, 2) ) AS [Min/Avg/Max Latency (ms)] -- , ROUND([Tests].MinPacketLoss, 2) AS [Min Packet Loss (%)] -- , ROUND([Tests].AvgPacketLoss, 2) AS [Avg Packet Loss (%)] -- , ROUND([Tests].MaxPacketLoss, 2) AS [Max Packet Loss (%)] , CONCAT(ROUND([Tests].MinPacketLoss, 2), ' / ', ROUND([Tests].AvgPacketLoss, 2), ' / ', ROUND([Tests].MaxPacketLoss, 2) ) AS [Min/Avg/Max Packet Loss (%)] FROM Orion.NetPath.ServiceAssignments AS [SA] INNER JOIN Orion.StatusInfo AS [SI] ON [SA].Status = [SI].StatusID INNER JOIN ( SELECT EndpointServiceID , ProbeID , MIN(Rtt) AS MinLatency , AVG(Rtt) AS AvgLatency , MAX(Rtt) AS MaxLatency , MIN(PacketLoss) AS MinPacketLoss , AVG(PacketLoss) AS AvgPacketLoss , MAX(PacketLoss) AS MaxPacketLoss FROM Orion.NetPath.Tests WHERE ExecutedAt >= GETUTCDATE() - 1 -- ExecutedAt is stored in UTC, so we use 'GETUTCDATE() - 1' to get last 24 hours only GROUP BY EndpointServiceID, ProbeID ) AS [Tests] ON [Tests].ProbeID = [SA].ProbeID AND [Tests].EndpointServiceID = [SA].EndpointServiceID WHERE [SA].ServiceName = 'Office 365' -- This is the NetPath Service Name as displayed on your NetPath summary page AND [SA].Enabled = 'True' ORDER BY [SA].ProbeName
So the above returns:
Hopefully this has helped a few of you with getting your feet wet within SWQL and the custom query. Now go forth and script around a bit.
What is difference between NPM 12.2 and 12.3
Hi guys,
Any one of you explain the what is the difference between the NPM 12.2 and 12.3 ?
Please update and bugs also
Thanks and Regards,
Kishore.
NPM (only) upgrade from 12.2 to 12.3 plus weird syslog issue
Getting ready to upgrade NPM (the only module we have) from 12.2 to 12.3. I've downloaded the upgrade executable and read the notes; so far as I can tell it's just run the new file and it will upgrade (after stopping the SW services).
Anything more to it than that? MS Server 2016, SQL Server, MS SQL Server Enterprise, latest version.
The only problem we're experiencing right now is that all our devices that were syslogging to our NPM server are still attempting to do so, but nothing shows up in the syslog in NPM. This behavior began right after Windows rebooted the operating system after a patch (automatically) instead of gracefully shutting down the SolarWinds services first. Was not noticed right away. The syslogs are still viewable in NPM from BEFORE that date and time, just nothing after that, and I've stopped and restarted the services several times in an attempt to get the syslogs recording into the database again.
Another Install recommendation question
My current environment
Nodes 1700
Volumes 1144
NCM 580
SAM 2110
NPM 19,600 interfaces
IPAM 2494 (can increase to 16,384)
UDT 2178 (can increase to 10,000)
I also have
NTA
VNQM
I have the main application server, and an additional poller and also a separate NTA server. I also just purchased an additional Web server license. Of course I have a separate SQL server
This means I have 4 servers (main, add poller, add web, NTA) along with the SQL server.
Does this sound logical to everyone? Does anything here jump out at anyone with "why in the world would you do that..."?
Thank you
Michael
Dev to Live Migration question.
Apologies if this is a dumb newbie question.... but I'm just getting started with this particular product set.
We have a development environment where we are trialling various aspects of the SolarWinds tooling (NPM, NCM, NTA, SAM HA etc).
This gives us a good feel for the product set, and as part of this we are looking to understand how to move some of the configuration into Live operation.
We have stringent operational Quality Control and Change Management procedures in place, so we need to be able to replicate any customisations in our live environment - without copying an entire database or having to re-type the configurations. Essentially we need to be able to export then import various configurations - but not the actual individual device/application settings. As in most commercial organisations our systems (dev & live) are air-gapped, so we want to maximise the control of what appears in Live.
For example the types of things that we would need to be ported through to live would be:
- definitions for Windows User Groups and permissions within SolarWinds
- Menu customisations
- application templates
- Polling configurations
- etc.
Is there a way to do this, ideally as simple as possible
All suggestions gratefully received - Thanks in advance
Network Discovery automatically add missing volumes to existing nodes
I can see a few have tried to solve this over the last few years and wanted to see if there are any new solutions based on the latest releases.
The goal is to automatically add new volumes to existing nodes. I am resigned to the fact Solarwinds is not going to detect that a volume's label has changed and make the update automatically so I have created a custom alert that deletes 'stale' volumes directly from SQL. The next stage is to use Network discover to bring those disks or newly added disks back in.
Currently I am trying to get Network Discover to run against a list of servers pulled from AD. Under the automatic monitoring settings I have deselected everything except fixed disks.
The issue is that although the nodes scanned, about 400, are already monitored by NPM the auto-import will fail because I only have 200 free licences. It errors with License Exceeded. I can do a manual import after the scan and it will pass the licence error and proceed to import missing disks but the goal is to have this be a background automatic process.
Has anyone faced the specific licence issue I have or does anyone have a fresh angle for adding volumes to existing nodes?
Switch uptime
Hi,
I need switch up time & down time report for last month can anyone please suggest.