Non-responsive WS-12-250-DC
Posted: Wed Apr 12, 2017 10:42 am
HI Folks,
I have two WS-12-250-DC switches on a tower. Both switches are in the same equipment box and have been running flawlessly since installed.
On Sunday (4/9) around 3pm I lost all access to the tower. (My first tower so I don't have redundant paths yet...) I located the problem to the Netonix switches but decided to power cycle them before isolating things further.
Yesterday around 6pm the same thing happened again but I took the time to isolate it to one switch. (I plugged into a spare port and could not ping a thing including the switch. I rushed there and didn't have a console cable with me... otherwise I would have checked that way as well.)
All radios are connected to routers running OSPF, and all routers are connected to both switches for redundancy.
The AF24 back haul to the data center is powered from one of the switches and has a dedicated vlan to its router.
Flow control is turned off on all AirFibers. The Netonix switches have negotiated full FC with the ERXs.
The switches were running 1.4.5rc2 which was the latest when I installed the switches last Oct. Yesterday, I upgraded both to 1.4.6.
Networking monitoring does not show anything strange just before the switch becomes non-responsive.
The equipment box housing the switches is in the shade for most of the day, though they catch some sun in the mid to late afternoon. The equip boxes are ventilated and have thermostat controlled fans. (I did test the fans yesterday but am wondering if the thermostat was sticky and didn't not kick in over the last few days. Needless to say I lowered the set point.)
I'm bringing up the fans because I'm thinking about the device temps given the charts below.
Switch 1 recent history (this is the switch that become non-responsive)
Switch 2 recent history
Switch 1 full history
Switch 2 full history
What I find interesting is that the temp profile for the two switches is so different. (Note that switch 1 powers the AF24, while switch 2 powers a AF5x.)
For instance the PHY Temp and the DCDC Control Temp.
I've ordered a new switch and will install it Sat morning.
In the mean time, a couple of questions:
1) What should I check if this happens again?
2) What are the odds this is temp related?
3) What else should I consider.
Thanks
Mark
I have two WS-12-250-DC switches on a tower. Both switches are in the same equipment box and have been running flawlessly since installed.
On Sunday (4/9) around 3pm I lost all access to the tower. (My first tower so I don't have redundant paths yet...) I located the problem to the Netonix switches but decided to power cycle them before isolating things further.
Yesterday around 6pm the same thing happened again but I took the time to isolate it to one switch. (I plugged into a spare port and could not ping a thing including the switch. I rushed there and didn't have a console cable with me... otherwise I would have checked that way as well.)
All radios are connected to routers running OSPF, and all routers are connected to both switches for redundancy.
The AF24 back haul to the data center is powered from one of the switches and has a dedicated vlan to its router.
Flow control is turned off on all AirFibers. The Netonix switches have negotiated full FC with the ERXs.
The switches were running 1.4.5rc2 which was the latest when I installed the switches last Oct. Yesterday, I upgraded both to 1.4.6.
Networking monitoring does not show anything strange just before the switch becomes non-responsive.
The equipment box housing the switches is in the shade for most of the day, though they catch some sun in the mid to late afternoon. The equip boxes are ventilated and have thermostat controlled fans. (I did test the fans yesterday but am wondering if the thermostat was sticky and didn't not kick in over the last few days. Needless to say I lowered the set point.)
I'm bringing up the fans because I'm thinking about the device temps given the charts below.
Switch 1 recent history (this is the switch that become non-responsive)
Switch 2 recent history
Switch 1 full history
Switch 2 full history
What I find interesting is that the temp profile for the two switches is so different. (Note that switch 1 powers the AF24, while switch 2 powers a AF5x.)
For instance the PHY Temp and the DCDC Control Temp.
I've ordered a new switch and will install it Sat morning.
In the mean time, a couple of questions:
1) What should I check if this happens again?
2) What are the odds this is temp related?
3) What else should I consider.
Thanks
Mark