So I am doing some training with some of my other team members on Solarwinds. This week during our meeting I went to show them the SolarWinds agents we have deployed (we have 18) and one I went to the agents page it showed only 3 online the rest you could not connect to. After our meeting I went to the agents and on the page under other selected Restart Agent Service. This restarted the agents and everything was up and connected. Has anyone else had this same issue? And how can I make sure that the agents are always running and I do not have to worry about checking them to make sure they are running? Thanks!
Solarwinds Agents - All Stopped
New Orion Installer not working on Large Scaled environment
Today i was planning the upgrade of my solarwinds installation.
We have additional polling engines in remote locations, the main poller is in Belgium, and the additional polling engines are in India, Brazil and the US.
The solarwinds orion installer was always failing because of timeouts.
There is a workaround described Installer fails with a socket connection message when installing from the main poller - SolarWinds Worldwide, LLC. Help …
But this can only be done when you start the upgrade, so planning is not an option here. before this new installer there where offline installer packages for the additional servers, now there isn't one anymore.
I'm currently already 8hours in the upgrade fase and still not done, it will probably still take me a couple of hours and this all because we can't plan ahead and copy all files over before we start.
It would be great if the offline installers are back available. i'm quite sure that the new installers works fine in a lab environment and when the polling engines are local for me this is not an option.
Error opening file for writing during NCM 7.5.1 install - Castle.Core.dll
When running the installer 'SolarWinds-NCM-v7.5.1.exe' I keep getting a Loading.... Error opening file for writing C:\....\Temp\2\SWOrionSetup\Castle.Core.dll
Migrating all Pollers to 2012 and Upgrading to 12.1
The imminent GA of npm 12.1 and sam 6.4 is the catalyst for me to finally migrate all of our pollers off of 2008 R2. I was thinking of timing this with upgrading to npm 12.1. I think this should work in theory, any gotchas to this? too many moving parts?
My though process:
*I will be reusing the ip addresses from old servers. Debating whether i should use old hostnames, according to the KB i should be able to update hostnames in SQL.
1. Deactivate licenses
2. Shutdown pollers
3. Install NPM/SAM/IPAM/NCM/NTA/WPM on new main poller
4. Activate Licenses for main poller
5. Run additional poller install and activate license for each additional poller.
How to convert SWQL "tostring" to number for reporting?
Hi all,
I'm trying to find a way to convert a TOSTRING value to a number so that in my reporting I can display "order by" correctly and not in text.
,case when ack.timestamp is null then 'N/A'
else tostring(minutediff(ah.TimeStamp,ack.timestamp)+0)
end as [Minutes Until Acknowledged]
Thank you!
A
Configuration Wizard error "No valid package combination was found for current system"
Running Versions:
Orion Platform 2016.2.100
NPM 12.0.1
SAM 6.3.0
NCM 7.5
NTA 4.2.0
IPAM 4.3.2
Configuration wizard will not complete due to the error "no valid package combination was found for current system" on 1 of 7 APE's.
The only info on resolving the issue points to PackageCleaner.exe which is contained in Orion Platform 2017, I'm not prepared to upgrade the total infrastructure at this time.
I have uninstalled all software from the APE and reinstalled and the error continues what am I missing has anybody run into this and have a resolution? I do have a support case started 1181939 the initial response was use the PackageCleaner.
Any thoughts or idea's would be greatly appreciated.
Thanks Tom
Upgrade of NPM, NCM and NTA - How long does it take?
I am in the early stages of planning an upgrade to the latest version of our Orion platform, we have three modules NPM (12.0.1), NCM (7.5.1) and NTA (4.2.1), and we run an HA setup with an additional poller server. All the servers run Windows 2012, the database is SQL 2016 and the separate Flow Database server for NTA.
I am aware looking at the guidelines that Solarwinds provide that I have to Disable HA, upgrade Primary then Secondary and then re-enable the HA. But how much downtime should I expect to experience?
We run 24x7 so there is never a good time to carry out an upgrade and I have to make the business aware and depending on how long may have an impact on time of day/week that the upgrade occurs.
The easy answer is 'It depends on the hardware, size of database etc', but any help or knowledge from anyone else that has already completed an upgrade will give me a valuable insight and is much appreciated.
Once I am done with the upgrade I will post details on my experience that may help others.
Many thanks in advance.
Showing a node as down (red)
I've created a custom poller. When that custom poller equals a certain value, I want that node to be displayed as down. I'm reading through the documentation but am not understanding if I can do this. Say one of my custom pollers is monitoring a fan on a box. If the MIB value for that fan indicates that it's not working I want to be able to show the end user which box it is just by looking at the network map on the web interface.
Spike in High Response Time on Node
I am trying to investigate an issue on random high response times upwards to 400+ ms. The response time ranges in the spike so its not already that high but over 100 in the spikes. They are very random so no set timing on this. The path between node and solarwinds server appear to be clean with no errors. Has anyone experienced high spike in response times to a network switch and how to isolate where the problem lies?
Interface Description on Nexus Leaves in ACI Mode
Hi There,
One of my customers is running a Nexus 9K fabrix in ACI mode. I recently went through and descriptions for all Leaf interfaces. I did this via the APICs as I want the interfaces in SolarWinds to update dynamically if any of the port descriptions change (saves having to do it on the switch and also in NPM). Once the switch was polled the descriptions started coming through in SolarWinds, but rather than showing the Interface number and description, it is showing the description twice e.g. rather than showing Ethernet0/1 - Description it is showing Description - Description. This means that the alerts coming through do not have the interface number in them, just the description (twice). This isn't what I want for troubleshooting.
Has anybody else experienced this or can anybody suggest a way of changing it (I know I can add the descriptions form SolarWinds, but I'd rather modify the switch and have it come through dynamically)? What I'd like to have for each interface is the interface number then the description e.g. Ethernet 1/42 - Firewall01 MGMT
Regards,
Felix
VRF support with Nexus 9K with NX-OS
Has anyone been successful monitoring VRF routes on Cisco NX-OS on the 9K hardware platform?
On older Cisco platforms, there is an OID list of VRFs on a system which Solarwinds uses. On newer platforms Cisco have removed this and now require the use of separate SNMP contexts mapped to each VRF.
Can Solarwinds support multiple contexts on the same device and consolidate the VRF route data against a single node?
Network Sonar Discovery Scheduler Not Working
I have been using Solarwinds for a number of years and have successfully used the scheduler for the network sonar discovery in the past. I had to setup the schedule discovery, do a run now and then the schedule would resume from that point in time.
Since the addition of the Starting From field in the advance, I've not been able to get them to work properly. I have a number of /16 networks that I divide into /18 networks throughout the day to run at a rolling interval (every 6 days at particular times of the day). As an example, I setup yesterday 4 schedules to run at 0800, 1000, 1300, and 1530 hrs starting today at the appropriate times...none of them have ran (still waiting for the 4th)...my experience is that they and all the others I have setup will run all at once on an arbitrary day...making the use of the schedule and the starting from, pointless.
The scheduled every 2nd Wednesday worked, but the others were set for every 6 days starting on different days, but ended up running on the same day as all others.
Any one with similar problem that have resolved this?
New Orion Installer not working on Large Scaled environment
Today i was planning the upgrade of my solarwinds installation.
We have additional polling engines in remote locations, the main poller is in Belgium, and the additional polling engines are in India, Brazil and the US.
The solarwinds orion installer was always failing because of timeouts.
There is a workaround described Installer fails with a socket connection message when installing from the main poller - SolarWinds Worldwide, LLC. Help …
But this can only be done when you start the upgrade, so planning is not an option here. before this new installer there where offline installer packages for the additional servers, now there isn't one anymore.
I'm currently already 8hours in the upgrade fase and still not done, it will probably still take me a couple of hours and this all because we can't plan ahead and copy all files over before we start.
It would be great if the offline installers are back available. i'm quite sure that the new installers works fine in a lab environment and when the polling engines are local for me this is not an option.
Time out errors when downloading APE updates
A couple of my sites have greater distance and latency involved when their APE's need to get Polling Engine updates from my main NPM instance during hot fix / patch episodes.
They regularly give me this error:
If I let these servers just sit without clicking Yes or No, I can see the progress bar in the background continue moving to the right until the download is complete, and it appears all I need do is click "Next. But that's not possible while the previous dialog box is unanswered, and I can't click X on it to remove it.
If I click No, the process ends and I have to start all over again.
If I wait until the back progress bar makes it all the way to the right, and then the Solarwinds Orion Compatibility Check Summary window (above) appears, then I can click Yes, and the front dialog box closes I can click "Next" and move through the patch/upgrade.
It seems as if that initial window about the Downloading Process being interrupted is caused by some arbitrary timer that expires.
1. Do you ever experience this?
2. Is there something I can do to prevent this? The download is not truly interrupted.
3. Can we get this corrected/repaired/fixed?
Report data for Alert hold off timer
I am wondering if anyone knows where the Alert Trigger condition - "Condition must exist for " exists in the database.
I am building a report of all defined alerts and this is the field that I am missing.
Thanks for any help
Chris
NPM Future Scalability
Hello,
We get a lot of requests from enterprise organisation interested is unsung SolarWinds ORION in Hugh global deployments. Therefore I am acutely interested in the following billed new features coming up:
- Scalability Improvements - For example, increasing total element count per instance and adjusting UI workflows to work better in scale environments.
- Remote Collector - New, agent based collector for distributed environments and hybrid deployments
- Next Generation Orion Mapping
- Centralized Upgrades
- New Scalability Engines Installer
Please let me know urgently how I can get on the BETA program to help you TEST/EVALUATE the new features as they are being developed.
Thanks,
Brian
Solarwinds Agents - All Stopped
So I am doing some training with some of my other team members on Solarwinds. This week during our meeting I went to show them the SolarWinds agents we have deployed (we have 18) and one I went to the agents page it showed only 3 online the rest you could not connect to. After our meeting I went to the agents and on the page under other selected Restart Agent Service. This restarted the agents and everything was up and connected. Has anyone else had this same issue? And how can I make sure that the agents are always running and I do not have to worry about checking them to make sure they are running? Thanks!
Upgrade to NPM 12.2
Hi Experts.
Upgraded our SolarWinds from 12.1 to 12.2 and after that, the Alert section is filled with thousands of alerts. Though apart from the upgrade nothing has been changed.
Any comments / Recommendation for this behavior is highly appreciated.
Thanks
AJ
Removing/Unregistering all instances from DPA
Hello,
My Database team decided to go with another product, and now I have to remove all the DPA instances and decomm our DPA server. I am looking for a way to un-register ALL instances at once, but I am not seeing it. We have MANY instances on there now and to un-register them all is going to take a very long time to do one at a time. Any thoughts on this? Can I do them all at once?
Thanks
Ron
How to create a view that only shows an interfaces that matches a specific criteria
I was tasked with this morning to create a view that would show the Network Team what circuits are down; and nothing else. Since all of the circuits have the carrier name in the Interface name. I was thinking of creating a view and doing an Advanced Filter to only show down devices that have 'Verizon' 'Sprint' etc in the interface name. I would think this would be pretty simple; and feel I am just missing something silly but as of now cannot find a resource/view or option that I could do this with. I also thought about doing this via an Alert resource and again trying to apply a filter but I still unable to do this.
I would think others have created a view very similar to what I'm trying to accomplish and hope someone can point me in the correct direction to accomplish this.
Thoughts????????????????????????
Thanks in advance