Solaris OS - Hardware/Disk - Recommended Guidelines for Status Check |
|
sdx:Error for Command:read (10) Error Level:Fatal Requested Block:x Error Block:x Vendor:x sdx:Error for Command:write(10) Error Level:Retryable Requested Block:x Error Block:x Vendor:x sdx:Error for Command:write Error Level:Retryable Requested Block:x Error Block:x Vendor:x sdx:Error for Command:[undecoded cmd 0x25]Error Level:Fatal Requested Block:x Error Block:x Vendor:x sdx:Error for Command:[undecoded cmd 0x3c]Error Level:Retryable Requested Block:x Error Block:x Vendor:x sdx:SCSI transport failed: reason 'reset':retrying command sdx:SCSI transport failed: reason 'timeout':retrying command sdx:SCSI transport failed: reason 'tran_err':giving up |
echo | /usr/sbin/format | more /usr/sbin/vxdisk list All configured disks should show a status of "online". /usr/sbin/vxprint -ht | egrep 'DISA|FAIL' If a volume is marked DISABLED: Check if the volume is in a restore operation, confirm normal configuration. If a volume is marked FAILED: Confirm the volume is mirrored (ie:The volume has redundant plexes). df -kF ufs; df -kF vxfs I/O errors or a hung listing will indicate you have a serious issue. Verify Solaris Version, if Solaris 10, go to Knowledge Base: EMC Procedures/Solaris 10. For Versions 5.9 and under,run: /etc/powermt display Both lun device paths should show a status of "optimal". I/O path totals should be equal. Errors should be 0. The error count is cumlative from uptime or from the last time a "restore" was run. If there is an error count, run: /etc/powermt restore Recheck. The restore will reset the error count to 0. If the error count resumes after the restore, see Knowledge Base: EMC Procedures for additional diagnostics and escalation procedures. Is this the first occurrence ? Has a disk repair or analyze already been run ? Has the issue already been escalated, if so, can fault monitoring be disabled til the disk is replaced ? |
See Knowledge Base: Solaris Architecture Table to see if it is hot swappable. Send notification to the client server owner groups to schedule for downtime or a low activity period for replacement. |
Knowledge Base: EMC Procedures Knowledge Base: EMC Procedures/Solaris 10 Knowledge Base: Solaris Architecture Table Knowledge Base: Solaris Disk Replacement Procedure Knowledge Base: Veritas Knowledge Base: Standard Procedures/Escalation Knowledge Base: Standard Procedures/Open Issues |