本文檔介紹幾種命令列介面(CLI)命令以及其他故障排除技術,可幫助排除硬碟驅動器(HDD)問題。排查HDD問題的最佳方法是使用LED、GUI、BIOS、LSI選項ROM/MegaRaid GUI和日誌。但是,這些選項並非始終可用。在這種情況下,您可以使用CLI。
本文件沒有特定需求。
本文件所述內容不限於特定軟體和硬體版本。
本文中的資訊是根據特定實驗室環境內的裝置所建立。文中使用到的所有裝置皆從已清除(預設)的組態來啟動。如果您的網路正在作用,請確保您已瞭解任何指令可能造成的影響。
如需文件慣例的詳細資訊,請參閱思科技術提示慣例。
附註:本文檔中列出的一些命令取決於您是否有LSI MegaRaid控制器,因為1064/1068e LSI控制器並不支援所有這些命令。
輸入show pci-adapter命令以檢視產品名稱。此示例顯示LSI 1064e介面卡。
ucs-c2xx-m1 /chassis #show pci-adapter Slot Vendor ID Device ID SubVendor ID SubDevice ID Product Name ---- --------- --------- ------------ ------------ ------------------------ M 0x1000 0x0056 0x152d 0x896d Cisco LSI 1064E Mezzan...
輸入show hdd命令以檢視HDD的狀態。
ucs-c2xx-m1 /chassis #show hdd Name Status -------------------- -------------------- HDD_01_STATUS present HDD_02_STATUS absent HDD_03_STATUS absent HDD_04_STATUS absent
輸入show virtual-drive命令以檢視虛擬驅動器的狀態。此命令非常有用,因為它不需要您關閉伺服器並輸入BIOS來檢視資訊。
ucs-c210-m2/chassis #scope storageadapter SLOT-5 ucs-c210-m2/chassis/storageadapter #show virtual-drive Virtual Drive Status Name Size RAID Level -------------- ------------------ ---------------------- --------- ---------- 0 Optimal 139236 MB RAID 1 1 Degraded 974652 MB RAID 5
輸入show physical-drive命令以檢視物理驅動器的狀態。
ucs-c210-m2 /chassis/storageadapter #show physical-drive
Predictive
Slot Failure Drive Coerced
Number Controller Status Manufacturer Model Count Firmware Size Type
------ ---------- ------ ------------ ----------- ---------- -------- --------- ----
0 SLOT-5
1 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
2 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
3 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
4 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
5 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
6 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
7 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
9 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
10 SLOT-5 online SEAGATE ST9146852SS 0 0005 139236 MB HDD
輸入show error-counters命令以檢視可糾正和不可糾正的錯誤數量。
ucs-c210-m2 /chassis/storageadapter #show error-counters
PCI Slot SLOT-5:
Memory Correctable Errors: 0
Memory Uncorrectable Errors: 0
輸入show hw-config命令以檢視RAID控制器組態。
ucs-c210-m2 /chassis/storageadapter #show hw-config
PCI Slot SLOT-5:
SAS Address 0: 500e004aaaaaaa3f
SAS Address 1: 0000000000000000
SAS Address 2: 0000000000000000
SAS Address 3: 0000000000000000
SAS Address 4: 0000000000000000
SAS Address 5: 0000000000000000
SAS Address 6: 0000000000000000
SAS Address 7: 0000000000000000
BBU Present: true
NVRAM Present: true
Serial Debugger Present: true
Memory Present: true
Flash Present: true
Memory Size: 512 MB
Cache Memory Size: 394 MB
Number of Backend Ports: 8
輸入show physical-drive-count 命令以檢視HDD的數量。
ucs-c210-m2 /chassis/storageadapter #show physical-drive-count
PCI Slot SLOT-5:
Physical Drive Count: 9
Critical Physical Drive Count: 0
Failed Physical Drive Count: 0
如果您無法訪問CLI,您可以檢視技術支援檔案(/tmp/tech_support)以獲取有關HDD狀態的資訊。技術支援檔案的節選顯示來自智慧平台管理介面(IPMI)感測器的HDD:
Querying All IPMI Sensors: Sensor Name | Reading | Unit | Status | LNR | LC | LNC | UNC | UC | UNR HDD0_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na HDD1_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na HDD2_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na HDD3_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na HDD4_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na HDD5_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na HDD6_INFO | na | discrete | na | na | na | na | na | na | na HDD7_INFO | na | discrete | na | na | na | na | na | na | na
技術支援檔案的摘錄顯示硬碟狀態的細分:
Bit[15:10] - Unused Bit[9:8] - Fault Bit[7:4] â LED Color Bit[3:0] â LED State Fault: 0x100 â On Line 0x200 - Degraded LED Color: 0x10 â GREEN 0x20 â AMBER 0x40 â BLUE 0x80 â RED LED State: 0x01 â OFF 0x02 â ON 0x04 â FAST BLINK 0x08 â SLOW BLINK
以下是從技術支援檔案摘錄的硬碟狀態(狀態代碼為0x2181):
0x2181 Fault: 0x100 --- HDD is On Line LED Color: 0x80 --- RED LED State: 0x01 --- OFF
您可以選擇將電池備用單元(BBU)用於某些伺服器部署。BBU是一個智慧電池備份單元,可在斷電期間保護RAID控制器上的磁碟寫快取資料長達72小時。
此範例顯示如何使用MegaCli檢查BBU的狀態:
bash$ sudo /opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -a0 -NoLog Password: . . . Battery Replacement required : Yes . . . Relative State of Charge: 99 % Absolute State of charge: 76 % . . . Date of Manufacture: 11/08, 2008 Design Capacity: 700 mAh Design Voltage: 3700 mV Specification Info: 33 Serial Number: 243 Pack Stat Configuration: 0x6cb0 Manufacture Name: LSI113000G Device Name: 2970700 Device Chemistry: LION Battery FRU: N/A
此範例顯示如何使用CLI檢查BBU的狀態:
ucs-c200-m2 /chassis/storageadapter #show bbu detail
Controller SLOT-7:
Battery Type: iBBU
Battery Present: true
Voltage: 4.023 V
Current: 0.000 A
Charge: 100%
Charging State: fully charged
Temperature: 34 degrees C
Voltage Low: false
Temperature High: false
Learn Cycle Requested: false
Learn Cycle Active: false
Learn Cycle Failed: false
Learn Cycle Timeout: false
I2C Errors Detected: false
Battery Replacement Required: true
Remaining Capacity Low: true
| 修訂 | 發佈日期 | 意見 |
|---|---|---|
1.0 |
07-Dec-2012
|
初始版本 |