OVHcloud Network Status

FS#16561 — rbx-g1-a9
Incident Report for Network & Infrastructure
Resolved
We just reloaded a card on rbx-g1-a9 due to an ECC error.

RP/0/RSP0/CPU0:Feb 11 14:30:57 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:IOS XR FAILURE
RP/0/RSP0/CPU0:Feb 11 14:30:57 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:BRINGDOWN

RP/0/RSP1/CPU0:Feb 11 14:31:31 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up
RP/0/RSP0/CPU0:Feb 11 14:31:32 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up
RP/0/RSP0/CPU0:Feb 11 14:31:33 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is down
RP/0/RSP0/CPU0:Feb 11 14:31:33 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up
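The state transitions of the card can be pulled out of log lines like the ones above with a small script. The following is only a minimal sketch: the regular expression is inferred from the lines quoted in this report, and the helper name node_state_changes is hypothetical, not part of any Cisco or OVHcloud tooling.

```python
import re

# Pattern inferred from the shelfmgr lines quoted in this report (an assumption,
# not an official description of the IOS XR syslog format).
STATE_CHANGE = re.compile(
    r"(?P<time>\w{3} +\d+ \d{2}:\d{2}:\d{2}) \w+: "
    r"shelfmgr\[\d+\]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : "
    r"(?P<node>\S+) (?P<card>\S+) state:(?P<state>.+)$"
)


def node_state_changes(lines):
    """Return (time, node, state) tuples for every NODE_STATE_CHANGE line."""
    events = []
    for line in lines:
        match = STATE_CHANGE.search(line)
        if match:
            events.append((match["time"], match["node"], match["state"].strip()))
    return events


# Three lines copied from the report; the last one is not a state change.
SAMPLE = [
    "RP/0/RSP0/CPU0:Feb 11 14:30:57 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:IOS XR FAILURE",
    "RP/0/RSP0/CPU0:Feb 11 14:30:57 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:BRINGDOWN",
    "RP/0/RSP1/CPU0:Feb 11 14:31:31 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up",
]

if __name__ == "__main__":
    for time, node, state in node_state_changes(SAMPLE):
        print(time, node, state)
```

Lines from other processes, such as ce_switch_srv, are skipped, so the output is limited to the line card's state timeline.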

Update(s):

Date: 2016-02-11 15:55:41 UTC
An RMA has been requested from Cisco.

Date: 2016-02-11 15:55:12 UTC
The card has reloaded successfully.
We are shutting it down to avoid any impact on the router.

Date: 2016-02-11 15:54:06 UTC
RP/0/RSP0/CPU0:Feb 11 14:40:17 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-RUNNING
RP/0/RSP0/CPU0:Feb 11 14:40:17 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-RUNNING

Date: 2016-02-11 15:52:26 UTC
The card appears to be reloading continuously.

RP/0/RSP0/CPU0:Feb 11 14:39:14 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR_HAL-6-BOOT_REQ_RECEIVED : Boot Request from 0/6/CPU0, RomMon Version: 1.3
RP/0/RSP0/CPU0:Feb 11 14:39:14 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-BOOTING
RP/0/RSP0/CPU0:Feb 11 14:39:14 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-BOOTING

We will investigate with Cisco and request an RMA if necessary.

Date: 2016-02-11 15:51:13 UTC
However, no impact was detected.

Date: 2016-02-11 15:50:47 UTC
LC/0/6/CPU0:Feb 11 14:15:34 CEST: pfm_node_lc[279]: %PLATFORM-NP-0-HW_DOUBLE_ECC_ERROR : Set|prm_server_tr[159813]|0x1008000|NP DOUBLE ECC ERROR, NP=0, memId=17, subMemId=0x1

Posted Feb 11, 2016 - 15:50 UTC