Dropping ports on new WS, what is wrong with my setup?

sirhc
Employee
 
Posts: 7416
Joined: Tue Apr 08, 2014 3:48 pm
Location: Lancaster, PA
Has thanked: 1608 times
Been thanked: 1325 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 5:44 pm

nickwhite wrote:And it just happened again. That was on 1.3.8, rolling it back to 1.4.0rc12.

Any thoughts? Anything I can do? This is happening pretty frequently, so if you guys want access to troubleshoot this, I can provide it.

It did look like the CPU was running at 98% the last two times this was happening. I didn't catch what process was taxing it though. I will try and catch it the next time.


Well, try to eliminate things by turning them off on the device/configuration tab.

Turn off and/or stop using all non-essential services such as SNMP, Discovery (all), and IGMP.

Some people have reported issues when polling the switch too fast with SNMP; Eric made a post about that recently.
Support is handled on the Forums not in Emails and PMs.
Before you ask a question use the Search function to see if it has been answered before.
To do an Advanced Search click the magnifying glass in the Search Box.
To upload pictures click the Upload attachment link below the BLUE SUBMIT BUTTON.

nickwhite
Member
 
Posts: 45
Joined: Fri Jul 03, 2015 10:17 am
Location: Austin, Texas
Has thanked: 9 times
Been thanked: 2 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 6:14 pm

sirhc wrote:Well, try to eliminate things by turning them off on the device/configuration tab.

Turn off and/or stop using all non-essential services such as SNMP, Discovery (all), and IGMP.

Some people have reported issues when polling the switch too fast with SNMP; Eric made a post about that recently.


Thanks, went through all that already. The only things enabled are SSH, HTTPS, Remote Syslog, SMTP config, Loop Protection, NTP.

Eric Stern
Employee
 
Posts: 532
Joined: Wed Apr 09, 2014 9:41 pm
Location: Toronto, Ontario
Has thanked: 0 times
Been thanked: 130 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 7:09 pm

Try turning off storm control (in a previous post you mentioned it was on).

adairw
Associate
 
Posts: 465
Joined: Wed Nov 05, 2014 11:47 pm
Location: Amarillo, TX
Has thanked: 98 times
Been thanked: 132 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 10:13 pm

We just crashed again... 1.4.0rc12 on all switches. The only services that were enabled were syslog (on a few), NTP and SMTP, which are now off.
All discovery, IGMP, LAG and STP were and are off. I guess tomorrow we will go throw some more hardware at it.
Why it sometimes runs longer than others is beyond me. Some nights it will lock up four times, others it won't at all.

It's like some certain amount of traffic triggers something. Is there not any deeper logging we could turn on or watch?

Dave
Employee
 
Posts: 726
Joined: Tue Apr 08, 2014 6:28 pm
Has thanked: 1 time
Been thanked: 158 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 10:22 pm

Did you have storm control on?

adairw
Associate
 
Posts: 465
Joined: Wed Nov 05, 2014 11:47 pm
Location: Amarillo, TX
Has thanked: 98 times
Been thanked: 132 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 10:24 pm

No sir. No services turned on except SSH and SSL.

sirhc
Employee
 
Posts: 7416
Joined: Tue Apr 08, 2014 3:48 pm
Location: Lancaster, PA
Has thanked: 1608 times
Been thanked: 1325 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 10:25 pm

adairw wrote:We just crashed again... 1.4.0rc12 on all switches. The only services that were enabled were syslog (on a few), NTP and SMTP, which are now off.
All discovery, IGMP, LAG and STP were and are off. I guess tomorrow we will go throw some more hardware at it.
Why it sometimes runs longer than others is beyond me. Some nights it will lock up four times, others it won't at all.

It's like some certain amount of traffic triggers something. Is there not any deeper logging we could turn on or watch?


Are all your switches locking up, or are you saying you have 1.4.0rc12 on all of them and it is just this one problem switch/location still giving you grief?

adairw
Associate
 
Posts: 465
Joined: Wed Nov 05, 2014 11:47 pm
Location: Amarillo, TX
Has thanked: 98 times
Been thanked: 132 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 10:33 pm

If you refer to the diagram I posted showing the switch layout, the only two switches that lock up are at Dumas and Morton.

We are trying to figure out how to set up remote access to these switches, since they lock up and a netbooter has to power cycle them to keep them running.

The Dumas and Morton switches are basically rebooting at the same time, since the netbooters are using the same settings. I think I will adjust Morton to a longer delay time so I can see whether Dumas rebooting on its own will bring it back up. I might regret this, as there is already about 10 minutes of downtime every time this happens.

All switches in this chain are running 1.4.0rc12

sirhc
Employee
 
Posts: 7416
Joined: Tue Apr 08, 2014 3:48 pm
Location: Lancaster, PA
Has thanked: 1608 times
Been thanked: 1325 times

Re: Dropping ports on new WS, what is wrong with my setup?

Mon May 02, 2016 11:51 pm

Yeah, this is the part that stumps me:

1) You have many switches that have no issues at all and work as desired all over your network.

2) All of your towers are set up essentially the same way, so why does this chain (2 towers) have issues?

3) I "think" you swapped out the switches causing the issues to ensure it is not just a bad switch (it does happen).

It will be interesting to see what your test of rebooting just 1 switch shows. Maybe it is just 1 bad switch, and only one of the two is causing problems for both. But as I said in 3 above, I "think" you tried swapping out the units having the problems?

If all your towers are set up the same and it is not a bad switch, then something is generating traffic that causes the switches to lock up, but what is it?

Is it a bad radio, a bad router, a misbehaving customer, or the moons of Jupiter aligned wrong? :headb:

I really, really, really wish we knew what it was, and if it is us we could fix it for you, Adair.

There will be a new v1.4.0rcX released "maybe" tomorrow, but since we do not know what the problem is, it is just a poke in the dark whether it will help.

nickwhite
Member
 
Posts: 45
Joined: Fri Jul 03, 2015 10:17 am
Location: Austin, Texas
Has thanked: 9 times
Been thanked: 2 times

Re: Dropping ports on new WS, what is wrong with my setup?

Tue May 03, 2016 7:31 am

So no lockups on our end overnight. Last one was yesterday around 5pm or so. EDIT: Apparently I missed it while reviewing, but there was a lockup around 2:40AM. We threw a DLI Web Power Switch on to power cycle if it locks.

I was going through and realized I had this switch being monitored in LibreNMS. Here are some interesting graphs:


Memory over last several months:

Image



Memory last 48 hours (I shut off SNMP yesterday, so it didn't catch the last 2 lockups):
Image


CPU the last 48 hours:
Image
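
For anyone who wants to check graphs like these numerically rather than by eye, here is a rough Python sketch that fits a straight line to exported memory samples and reports the slope. Everything in it is an assumption for illustration: the function names `memory_trend` and `looks_like_leak`, the sample format of (hours, memory used) pairs, and the threshold are all hypothetical, and nothing here comes from LibreNMS or the switch firmware. A steadily positive slope would only hint at growing memory use; it would not prove a leak.

```python
from statistics import mean


def memory_trend(samples):
    """Least-squares slope of memory use over time.

    samples: list of (hours_since_start, memory_used) pairs, e.g. exported
    by hand from a monitoring tool. The format is an assumption of this sketch.
    Returns the slope in memory units per hour.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    xs = [t for t, _ in samples]
    ys = [m for _, m in samples]
    x_bar, y_bar = mean(xs), mean(ys)
    denom = sum((x - x_bar) ** 2 for x in xs)
    if denom == 0:
        raise ValueError("samples must span more than one timestamp")
    numer = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return numer / denom


def looks_like_leak(samples, threshold_per_hour):
    """True if memory grows faster than the (hypothetical) threshold."""
    return memory_trend(samples) > threshold_per_hour
```

Feed it samples taken between two reboots only, since a power cycle resets memory use and would skew the fitted line.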
