硬盤配置為12LFF背板(0302A6VS),P460卡接兩塊F0/F1兩塊SATA SSD,F8-F11為4塊NVMe。
客戶批量報修10台左右的R5300 G6服務器SEL中有硬盤線纜告警:Configuration Error-Incorrect cable connected / Incorrect interconnection---Incorrect SATA cable connection to the system board,實際使用沒有異常。
1、客戶提供其中兩台日誌,發現在11月15日 9:40左右升級HDM版本後出現的Incorrect SATA cable connection告警。懷疑為軟件誤報。
告警信息:
1821 Info NA NA NA From BMC 2024-11-15 09:41:33 UTC+08:00 Reboot Cause: [BMC] [warm reset] BMC occurred warm reset because of updating BMC. 2024-11-15 09:40:41
1830 Minor Cable / Interconnect Cable Asserted From BMC 2024-11-15 09:41:43 UTC+08:00 Configuration Error-Incorrect cable connected / Incorrect interconnection---Incorrect SATA cable connection to the system board
1877 Info NA NA NA From BMC 2024-11-15 09:49:24 UTC+08:00 Reboot Cause: [BMC][cold reset] BMC occurred cold reset because of resetting BMC. 2024-11-15 09:48:44 UTC+8
1883 Minor Cable / Interconnect Cable Asserted From BMC 2024-11-15 09:49:30 UTC+08:00 Configuration Error-Incorrect cable connected / Incorrect interconnection---Incorrect SATA cable connection to the system board
升級固件記錄:
%# 2024-11-15 09:39:49.771 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Update space preparation succeeded.
%# 2024-11-15 09:39:54.845 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update verification command successfully.
%# 2024-11-15 09:40:07.254 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update command successfully.
%# 2024-11-15 09:40:10.418 UTC+08:00 HDM210235A4GP********1S [BMC.update] 2667 [SUCCESS]: [root][redfish][10.x.x.x] Issued upgrade configuration. Module: HDM; Conf: Retain(Primary).
%# 2024-11-15 09:41:45.299 UTC+08:00 HDM210235A4GP********1S [BMC.update] 710 [SUCCESS]: [root][redfish][10.x.x.x] Module: HDM; Location: Primary; Model: R5300 G6; Version: 2.03 -> 2.08; Update result: Succeeded.
%# 2024-11-15 09:51:41.206 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Update space preparation succeeded.
%# 2024-11-15 09:51:41.711 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update verification command successfully.
%# 2024-11-15 09:51:48.275 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update command successfully.
%# 2024-11-15 09:51:49.783 UTC+08:00 HDM210235A4GP********1S [BMC.update] 3642 [SUCCESS]: [root][redfish][10.x.x.x] Issued upgrade configuration. Module: BIOS; Conf: Retain(BIOS and ME).
%# 2024-11-15 10:13:10.568 UTC+08:00 HDM210235A4GP********1S [BMC.update] 710 [SUCCESS]: [root][redfish][10.x.x.x] Module: BIOS; Location: BIOS; Model: R5300 G6; Version: 6.00.25 -> 6.10.40; Update result: Succeeded.
%# 2024-11-15 10:13:10.593 UTC+08:00 HDM210235A4GP********1S [BMC.update] 710 [SUCCESS]: [root][redfish][10.x.x.x] Module: BIOS; Location: ME; Model: R5300 G6; Version: 6.00.25 -> 6.10.40; Update result: Succeeded.
2、cpld版本是V005,故障版本是V004;在V005版本說明書上有解決該問題。觸發條件:特定機型配置才會有報錯;004邏輯循環檢測存在概率誤報,線纜告警檢測日誌隻會在BMC剛啟動時上報一次,隨後就不再重複上報。現場升級BMC後會重啟HDM,這時候上報了報警 。
升級CPLD版本至V005及以上。
該案例暫時沒有網友評論
✖
案例意見反饋
親~登錄後才可以操作哦!
確定你的郵箱還未認證,請認證郵箱或綁定手機後進行當前操作