Oct 30, 1998    onl8v1: Testing the cc driver on Solaris 2.6.

Solaris 2.6 system installation --- recovering from a disk failure
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
(http://www-online.kek.jp/~inoue/Machine/onl8v1/
 Solaris2.6/system-install3.txt)

High Energy Accelerator Research Organization (KEK)
Institute of Particle and Nuclear Studies
Physics, Online Group
Eiji Inoue

(1). Current status

(A). Borrowed a SPARC CPU-8VT and the VME driver v2.3.1 from
     Shirata-san (白田様) of Logic House Co., Ltd. (株式会社ロジックハウス).
(B). Installed the Solaris 2.6 system on the CPU-8VT.
(C). The system disk failed partway through the configuration work
     under /etc.

(2). What needs to be done here

Recover the failed disk.

(3). Try booting again with "boot -r"

    ok
    E-Cache Tag Ram Test
    Snoop Ram Test
    IOMMU-TLB Test
    IOMMU Control Register Test
    IOMMU Base Addr Register Test
    IOMMU SBus Config Registers Test
    Set RAH to 3
    Set WS to 3
    DRAM Data Bus Test
    Probing Bank 0 -- 32 Megs Found
    Probing Bank 1 -- 32 Megs Found
    Probing Bank 2 -- 0 Megs Found
    Probing Bank 3 -- 0 Megs Found
    Probing Bank 4 -- 0 Megs Found
    Probing Bank 5 -- 0 Megs Found
    Probing Bank 6 -- 0 Megs Found
    Probing Bank 7 -- 0 Megs Found
    DRAM Address Bus Test
    Memory Address Pattern Test
    D-Cache Hit Miss Test
    D-TLB Hit Miss Test
    I-TLB Hit Miss Test
    I-Cache Hit Miss Test
    E-Cache Hit Miss Test, cache_size=0x00080000 bytes
    I-TLB Ram Test
    FPU Register File Test
    FPU Misaligned Reg Pair Test
    FPU Single-precision Tests
    FPU Double-precision Tests
    FPU SP Invalid CEXC Test
    FPU SP Overflow CEXC Test
    FPU SP Divide-by-0 CEXC Test
    FPU SP Inexact CEXC Test
    FPU SP Trap Priority > Test
    FPU SP Trap Priority < Test
    FPU DP Invalid CEXC Test
    FPU DP Overflow CEXC Test
    FPU DP Divide-by-0 CEXC Test
    FPU DP Inexact CEXC Test
    FPU DP Trap Priority > Test
    FPU DP Trap Priority < Test
    PROC0 Interrupt Regs Tests
    Soft Interrupts OFF Test
    Soft Interrupts ON Test
    PROC0 User Timer Test
    PROC0 Counter/Timer Test
    DMA2 E_CSR Register Test
    LANCE Address Port Tests
    LANCE Data Port Tests
    DMA2 D_CSR Register Test
    DMA2 D_ADDR Register Test
    DMA2 D_BCNT Register Test
    DMA2 D_NADDR Register Test
    ESP Registers Tests
    DMA2 P_CSR Register Test
    DMA2 P_ADDR Register Test
    DMA2 P_BCNT Register Test
    PPORT Registers Tests
    NVRAM Access Test
    TOD Registers Test
    screen not found.
    Can't open input device.
    Keyboard not present. Using tty for input and output.

    SPARC CPU-8VT, No Keyboard
    ROM Rev. 2.15.1, 64 MB memory installed, Serial #9172389.
    Ethernet address 0:80:42:b:85:a5, Host ID: 808bf5a5.

    Type help for more information
    ok
    ok boot -r
    Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args: -r
    SunOS Release 5.6 Version Generic [UNIX(R) System V Release 4.0]
    Copyright (c) 1983-1997, Sun Microsystems, Inc.
    configuring network interfaces: le0.
    Hostname: onl8v1
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            Error for Command: write        Error Level: Retryable
            Requested Block: 95328  Error Block: 95328
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Hardware Error
            ASC: 0x9 (track following error), ASCQ: 0x0, FRU: 0x0
    Type 'go' to resume
    ok

No good. The symptom is unchanged.

(4). Check whether fsck can recover the disk

Connect the failed disk to the onl7v2 system and try fsck.
The SCSI ID of the failed disk was set to 2 and the onl7v2 system
was booted. fsck is then run.

    onl7v2# fsck /dev/dsk/c0t2d0s0
    ** /dev/dsk/c0t2d0s0
    ** Last Mounted on /
    ** Phase 1 - Check Blocks and Sizes
    ** Phase 2 - Check Pathnames
    ** Phase 3 - Check Connectivity
    ** Phase 4 - Check Reference Counts
    ** Phase 5 - Check Cyl groups
    3022 files, 20024 used, 43135 free (55 frags, 5385 blocks, 0.0% fragmentation)
    onl7v2#
    onl7v2# fsck /dev/dsk/c0t2d0s6
    ** /dev/dsk/c0t2d0s6
    ** Last Mounted on /usr
    ** Phase 1 - Check Blocks and Sizes
    ** Phase 2 - Check Pathnames
    ** Phase 3 - Check Connectivity
    ** Phase 4 - Check Reference Counts
    ** Phase 5 - Check Cyl groups
    24804 files, 383722 used, 202357 free (957 frags, 25175 blocks, 0.1% fragmentation)
    onl7v2#
    onl7v2# fsck /dev/dsk/c0t2d0s5
    ** /dev/dsk/c0t2d0s5
    ** Last Mounted on /opt
    ** Phase 1 - Check Blocks and Sizes
    ** Phase 2 - Check Pathnames
    ** Phase 3 - Check Connectivity
    ** Phase 4 - Check Reference Counts
    ** Phase 5 - Check Cyl groups
    124 files, 10049 used, 131046 free (70 frags, 16372 blocks, 0.0% fragmentation)
    onl7v2#

fsck
in every case caused the WARNING and the error messages shown in
item (3) above to appear on the console.

At this point the failed disk was disconnected from onl7v2,
reconnected to onl8v1, and another boot was attempted.

    ok xx~
    Power-On Self Test
    IU Register File Test
    MMU Context Table Reg Test
    MMU Context Register Test
    MMU Sync Fault Stat Reg Test
    MMU Sync Fault Addr Reg Test
    D-Cache Tag Ram Test
    D-Cache Data Ram Testtpp
    Power-On Self Test
    IU Register File Test
    MMU Context Table Reg Test
    MMU Context Register Test
    MMU Sync Fault Stat Reg Test
    MMU Sync Fault Addr Reg Test
    D-Cache Tag Ram Test
    D-Cache Data Ram Test
    D-TLB CAM Test
    D-TLB RAM Test
    I-Cache Tag Ram Test
    I-Cache Data Ram Test
    E-Cache Addr Bus Test
    E-Cache Tag Bus Test
    E-Cache Data Ram Test
    E-Cache Tag Ram Test
    Snoop Ram Test
    IOMMU-TLB Test
    IOMMU Control Register Test
    IOMMU Base Addr Register Test
    IOMMU SBus Config Registers Test
    Set RAH to 3
    Set WS to 3
    DRAM Data Bus Test
    Probing Bank 0 -- 32 Megs Found
    Probing Bank 1 -- 32 Megs Found
    Probing Bank 2 -- 0 Megs Found
    Probing Bank 3 -- 0 Megs Found
    Probing Bank 4 -- 0 Megs Found
    Probing Bank 5 -- 0 Megs Found
    Probing Bank 6 -- 0 Megs Found
    Probing Bank 7 -- 0 Megs Found
    DRAM Address Bus Test
    Memory Address Pattern Test
    D-Cache Hit Miss Test
    D-TLB Hit Miss Test
    I-TLB Hit Miss Test
    I-Cache Hit Miss Test
    E-Cache Hit Miss Test, cache_size=0x00080000 bytes
    I-TLB Ram Test
    FPU Register File Test
    FPU Misaligned Reg Pair Test
    FPU Single-precision Tests
    FPU Double-precision Tests
    FPU SP Invalid CEXC Test
    FPU SP Overflow CEXC Test
    FPU SP Divide-by-0 CEXC Test
    FPU SP Inexact CEXC Test
    FPU SP Trap Priority > Test
    FPU SP Trap Priority < Test
    FPU DP Invalid CEXC Test
    FPU DP Overflow CEXC Test
    FPU DP Divide-by-0 CEXC Test
    FPU DP Inexact CEXC Test
    FPU DP Trap Priority > Test
    FPU DP Trap Priority < Test
    PROC0 Interrupt Regs Tests
    Soft Interrupts OFF Test
    Soft Interrupts ON Test
    PROC0 User Timer Test
    PROC0 Counter/Timer Test
    DMA2 E_CSR Register Test
    LANCE Address Port Tests
    LANCE Data Port Tests
    DMA2 D_CSR Register Test
    DMA2 D_ADDR Register Test
    DMA2 D_BCNT Register Test
    DMA2 D_NADDR Register Test
    ESP Registers Tests
    DMA2 P_CSR Register Test
    DMA2 P_ADDR Register Test
    DMA2 P_BCNT Register Test
    PPORT Registers Tests
    NVRAM Access Test
    TOD Registers Test
    screen not found.
    Can't open input device.
    Keyboard not present. Using tty for input and output.

    SPARC CPU-8VT, No Keyboard
    ROM Rev. 2.15.1, 64 MB memory installed, Serial #9172389.
    Ethernet address 0:80:42:b:85:a5, Host ID: 808bf5a5.

    Type help for more information
    ok boot -r
    Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args: -r
    SunOS Release 5.6 Version Generic [UNIX(R) System V Release 4.0]
    Copyright (c) 1983-1997, Sun Microsystems, Inc.
    configuring network interfaces: le0.
    Hostname: onl8v1
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            Error for Command: write        Error Level: Retryable
            Requested Block: 95328  Error Block: 95328
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Hardware Error
            ASC: 0x9 (track following error), ASCQ: 0x0, FRU: 0x0
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            disk not responding to selection
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            offline
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    NOTICE: /: out of inodes
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry

No good. The symptom is unchanged.

(5). Contact IAI Co., Ltd. (株式会社 アイ・エイ・アイ)

Following the questions asked by Matsuda-san (松田さん), an engineer at
IAI, several operational checks are carried out.

(5-1). Confirm that the system cannot boot from the failed disk

    ok boot -r
    Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args: -r
    SunOS Release 5.6 Version Generic [UNIX(R) System V Release 4.0]
    Copyright (c) 1983-1997, Sun Microsystems, Inc.
    configuring network interfaces: le0.
    Hostname: onl8v1
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            Error for Command: write        Error Level: Retryable
            Requested Block: 95328  Error Block: 95328
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Hardware Error
            ASC: 0x9 (track following error), ASCQ: 0x0, FRU: 0x0
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            disk not responding to selection
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            offline
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    NOTICE: /: out of inodes
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0 (sd3):
            i/o to invalid geometry
    NOTICE: /: out of inodes

The system cannot boot from the failed disk.

(5-2).
Run probe-scsi

    ok pp
    Power-On Self Test
    IU Register File Test
    MMU Context Table Reg Test
    MMU Context Register Test
    MMU Sync Fault Stat Reg Test
    MMU Sync Fault Addr Reg Test
    D-Cache Tag Ram Test
    D-Cache Data Ram Test
    D-TLB CAM Test
    D-TLB RAM Test
    I-Cache Tag Ram Test
    I-Cache Data Ram Test
    No Secondary Cache Exists
    No Secondary Cache Exists
    No Secondary Cache Exists
    Snoop Ram Test
    IOMMU-TLB Test
    IOMMU Control Register Test
    IOMMU Base Addr Register Test
    IOMMU SBus Config Registers Test
    Set RAH to 3
    Set WS to 3
    DRAM Data Bus Test
    Probing Bank 0 -- 32 Megs Found
    Probing Bank 1 -- 32 Megs Found
    Probing Bank 2 -- 0 Megs Found
    Probing Bank 3 -- 0 Megs Found
    Probing Bank 4 -- 0 Megs Found
    Probing Bank 5 -- 0 Megs Found
    Probing Bank 6 -- 0 Megs Found
    Probing Bank 7 -- 0 Megs Found
    DRAM Address Bus Test
    Memory Address Pattern Test
    D-Cache Hit Miss Test
    D-TLB Hit Miss Test
    I-TLB Hit Miss Test
    I-Cache Hit Miss Test
    No Secondary Cache Exists
    I-TLB Ram Test
    FPU Register File Test
    FPU Misaligned Reg Pair Test
    FPU Single-precision Tests
    FPU Double-precision Tests
    FPU SP Invalid CEXC Test
    FPU SP Overflow CEXC Test
    FPU SP Divide-by-0 CEXC Test
    FPU SP Inexact CEXC Test
    FPU SP Trap Priority > Test
    FPU SP Trap Priority < Test
    FPU DP Invalid CEXC Test
    FPU DP Overflow CEXC Test
    FPU DP Divide-by-0 CEXC Test
    FPU DP Inexact CEXC Test
    FPU DP Trap Priority > Test
    FPU DP Trap Priority < Test
    PROC0 Interrupt Regs Tests
    Soft Interrupts OFF Test
    Soft Interrupts ON Test
    PROC0 User Timer Test
    PROC0 Counter/Timer Test
    DMA2 E_CSR Register Test
    LANCE Address Port Tests
    LANCE Data Port Tests
    DMA2 D_CSR Register Test
    DMA2 D_ADDR Register Test
    DMA2 D_BCNT Register Test
    DMA2 D_NADDR Register Test
    ESP Registers Tests
    DMA2 P_CSR Register Test
    DMA2 P_ADDR Register Test
    DMA2 P_BCNT Register Test
    PPORT Registers Tests
    NVRAM Access Test
    TOD Registers Test

    SPARC CPU-7V, No Keyboard
    ROM Rev. 2.15.1, 64 MB memory installed, Serial #9159440.
    Ethernet address 0:80:42:b:53:10, Host ID: 808bc310.
    Initializing Memory -
    Type help for more information
    ok probe-scsi
    Target 3
      Unit 0   Disk     SEAGATE ST39140N        1206AY102791
                   Copyright (c) 1997 Seagate All rights reserved 1205
    ok

The failed disk is visible when the ROM monitor's probe-scsi command
is run.

(5-3). Attach the failed disk to onl7v2 and boot with "boot -rv"

    ok pp
    Power-On Self Test
    IU Register File Test
    MMU Context Table Reg Test
    MMU Context Register Test
    MMU Sync Fault Stat Reg Test
    MMU Sync Fault Addr Reg Test
    D-Cache Tag Ram Test
    D-Cache Data Ram Test
    D-TLB CAM Test
    D-TLB RAM Test
    I-Cache Tag Ram Test
    I-Cache Data Ram Test
    No Secondary Cache Exists
    No Secondary Cache Exists
    No Secondary Cache Exists
    Snoop Ram Test
    IOMMU-TLB Test
    IOMMU Control Register Test
    IOMMU Base Addr Register Test
    IOMMU SBus Config Registers Test
    Set RAH to 3
    Set WS to 3
    DRAM Data Bus Test
    Probing Bank 0 -- 32 Megs Found
    Probing Bank 1 -- 32 Megs Found
    Probing Bank 2 -- 0 Megs Found
    Probing Bank 3 -- 0 Megs Found
    Probing Bank 4 -- 0 Megs Found
    Probing Bank 5 -- 0 Megs Found
    Probing Bank 6 -- 0 Megs Found
    Probing Bank 7 -- 0 Megs Found
    DRAM Address Bus Test
    Memory Address Pattern Test
    D-Cache Hit Miss Test
    D-TLB Hit Miss Test
    I-TLB Hit Miss Test
    I-Cache Hit Miss Test
    No Secondary Cache Exists
    I-TLB Ram Test
    FPU Register File Test
    FPU Misaligned Reg Pair Test
    FPU Single-precision Tests
    FPU Double-precision Tests
    FPU SP Invalid CEXC Test
    FPU SP Overflow CEXC Test
    FPU SP Divide-by-0 CEXC Test
    FPU SP Inexact CEXC Test
    FPU SP Trap Priority > Test
    FPU SP Trap Priority < Test
    FPU DP Invalid CEXC Test
    FPU DP Overflow CEXC Test
    FPU DP Divide-by-0 CEXC Test
    FPU DP Inexact CEXC Test
    FPU DP Trap Priority > Test
    FPU DP Trap Priority < Test
    PROC0 Interrupt Regs Tests
    Soft Interrupts OFF Test
    Soft Interrupts ON Test
    PROC0 User Timer Test
    PROC0 Counter/Timer Test
    DMA2 E_CSR Register Test
    LANCE Address Port Tests
    LANCE Data Port Tests
    DMA2 D_CSR Register Test
    DMA2 D_ADDR Register Test
    DMA2 D_BCNT Register Test
    DMA2 D_NADDR Register Test
    ESP Registers Tests
    DMA2 P_CSR Register Test
    DMA2 P_ADDR Register Test
    DMA2 P_BCNT Register Test
    PPORT Registers Tests
    NVRAM Access Test
    TOD Registers Test

    SPARC CPU-7V, No Keyboard
    ROM Rev. 2.15.1, 64 MB memory installed, Serial #9159440.
    Ethernet address 0:80:42:b:53:10, Host ID: 808bc310.

    Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args:
    SunOS Release 5.6 Version Generic [UNIX(R) System V Release 4.0]
    Copyright (c) 1983-1997, Sun Microsystems, Inc.
    FRCvme VME Nexus V2.2.2 (FGA-5000 FGA-5100)
    NOTICE: VME: slavewin at vme=0x0, size=0x100000 space=0x2c020017
    configuring network interfaces: hme0 le0.
    Hostname: onl7v2
    The system is coming up.  Please wait.
    checking ufs filesystems
    /dev/rdsk/c0t3d0s5: is clean.
    /dev/rdsk/c0t3d0s7: is clean.
    add net default: gateway ICCFG1
    NIS domainname is kek.jp
    starting rpc services: rpcbind keyserv done.
    Type 'go' to resume
    Type help for more information
    ok boot -rv
    Resetting ...

    SPARC CPU-7V, No Keyboard
    ROM Rev. 2.15.1, 64 MB memory installed, Serial #9159440.
    Ethernet address 0:80:42:b:53:10, Host ID: 808bc310.

    Rebooting with command: -rv
    Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args: -rv
    Size: 243536+176918+41926 Bytes
    SunOS Release 5.6 Version Generic [UNIX(R) System V Release 4.0]
    Copyright (c) 1983-1997, Sun Microsystems, Inc.
    vac: enabled in writeback mode
    cpu0: FMI,MB86907 (mid 0 impl 0x0 ver 0x5 clock 171 MHz)
    mem = 65536K (0x4000000)
    avail mem = 61169664
    Ethernet address = 0:80:42:b:53:10
    root nexus = FORCE,CPU-7V
    iommu0 at root: obio 0x10000000
    sbus0 at iommu0: obio 0x10001000
    espdma0 at sbus0: SBus slot 5 0x8400000
    esp0: esp-options=0x46
    esp0 at espdma0: SBus slot 5 0x8800000 sparc ipl 4
    sd2 at esp0: target 2 lun 0
    sd2 is /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0
    sd3 at esp0: target 3 lun 0
    sd3 is /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0
    fas0: rev 2.2 FEPS chip
    SUNW,fas0 at sbus0: SBus slot 3 0x8800000 and SBus slot 3 0x8810000 SBus level 3 sparc ipl 5
    SUNW,fas0 is /iommu@0,10000000/sbus@0,10001000/SUNW,fas@3,8800000
    root on /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0:a fstype ufs
    obio0 at root
    zs0 at obio0: obio 0x100000 sparc ipl 12
    zs0 is /obio/zs@0,100000
    zs1 at obio0: obio 0x0 sparc ipl 12
    zs1 is /obio/zs@0,0
    VME: SBus device(s) present in physical SBus slots - SBus space for VME interface will be fragmented.
    FRCvme VME Nexus V2.2.2 (FGA-5000 FGA-5100)
    NOTICE: VME: slavewin at vme=0x0, size=0x100000 space=0x2c020017
    VME0 at sbus0: SBus slot 5 0xffffe00 sparc ipl 0, SBus level 1 sparc ipl 2, SBus level 2 sparc ipl 3, SBus level 3 sparc ipl 5, SBus level 4 sparc ipl 7, SBus level 5 sparc ipl 9, SBus level 6 sparc ipl 11, SBus level 7 sparc ipl 13
    configuring network interfaces:SUNW,hme0: FEPS (Rev Id = 22) Found
    SUNW,hme0 at sbus0: SBus slot 3 0x8c00000 and SBus slot 3 0x8c02000 and SBus slot 3 0x8c04000 and SBus slot 3 0x8c06000 and SBus slot 3 0x8c07000 SBus level 4 sparc ipl 7
    SUNW,hme0 is /iommu@0,10000000/sbus@0,10001000/SUNW,hme@3,8c00000
    hme0ledma0 at sbus0: SBus slot 5 0x8400010
    le0 at ledma0: SBus slot 5 0x8c00000 sparc ipl 6
    le0 is /iommu@0,10000000/sbus@0,10001000/ledma@5,8400010/le@5,8c00000
    le0.
    SUNW,hme0: Using Internal Transceiver
    SUNW,hme0: 100 Mbps full-duplex Link Up
    Hostname: onl7v2
    dump on /dev/dsk/c0t3d0s1 size 83988K
    The / file system (/dev/rdsk/c0t3d0s0) is being checked.
    /dev/rdsk/c0t3d0s0: INCORRECT BLOCK COUNT I=179 (2 should be 0) (CORRECTED)
    /dev/rdsk/c0t3d0s0: INCORRECT BLOCK COUNT I=12890 (2 should be 0) (CORRECTED)
    /dev/rdsk/c0t3d0s0: INCORRECT BLOCK COUNT I=12891 (8 should be 0) (CORRECTED)
    /dev/rdsk/c0t3d0s0: FREE BLK COUNT(S) WRONG IN SUPERBLK (SALVAGED)
    /dev/rdsk/c0t3d0s0: 3296 files, 23700 used, 42067 free
    /dev/rdsk/c0t3d0s0: (163 frags, 5238 blocks, 0.2% fragmentation)
    Configuring the /devices directory
    pem: no event handler
    CAMAC device driver V1.3x, 1991-1993 by Y.TAKEUCHI (T.I.T.)
    cc0 at VME0: vme16d16 0xff00 VME level 4 vector 0xff sparc ipl 7
    cc0 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/cc@2d,ff00
    vmemem0 at VME0: vme32d16 0x0
    vmemem0 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmemem@d,0
    vmemem1 at VME0: vme16d16 0x0
    vmemem1 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmemem@2d,0
    vmemem2 at VME0: vme24d16 0x0
    vmemem2 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmemem@3d,0
    vmemem3 at VME0: vme32d32 0x0
    vmemem3 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmemem@4d,0
    vmemem4 at VME0: vme16d32 0x0
    vmemem4 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmemem@6d,0
    vmemem5 at VME0: vme24d32 0x0
    vmemem5 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmemem@7d,0
    sbusmem0 at sbus0: SBus slot 0 0x0
    sbusmem0 is /iommu@0,10000000/sbus@0,10001000/sbusmem@0,0
    sbusmem1 at sbus0: SBus slot 1 0x0
    sbusmem1 is /iommu@0,10000000/sbus@0,10001000/sbusmem@1,0
    sbusmem2 at sbus0: SBus slot 2 0x0
    sbusmem2 is /iommu@0,10000000/sbus@0,10001000/sbusmem@2,0
    sbusmem3 at sbus0: SBus slot 3 0x0
    sbusmem3 is /iommu@0,10000000/sbus@0,10001000/sbusmem@3,0
    sbusmem4 at sbus0: SBus slot 4 0x0
    sbusmem4 is /iommu@0,10000000/sbus@0,10001000/sbusmem@4,0
    sbusmem5 at sbus0: SBus slot 5 0x0
    sbusmem5 is /iommu@0,10000000/sbus@0,10001000/sbusmem@5,0
    SUNW,bpp0 at sbus0: SBus slot 5 0xc800000 SBus level 2 sparc ipl 3
    SUNW,bpp0 is /iommu@0,10000000/sbus@0,10001000/SUNW,bpp@5,c800000
    pseudo-device: winlock0
    winlock0 is /pseudo/winlock@0
    sysconfig0: LCA is XC4003 compatible
    sysconfig0 at root: obio 0x71380000
    sysconfig0 is /sysconfig@0,71380000
    pseudo-device: lockstat0
    lockstat0 is /pseudo/lockstat@0
    pseudo-device: vol0
    vol0 is /pseudo/vol@0
    pseudo-device: llc10
    llc10 is /pseudo/llc1@0
    vmeplus0 at VME0: vme16d16 0x0 and vme24d16 0x0 and vme32d16 0x0 and vme16d32 0x0 and vme24d32 0x0 and vme32d32 0x0 and space 0x2f offset 0x0 and space 0x6f offset 0x0 and space 0x10 offset 0x0 and space 0x11 offset 0x0 and space 0x50 offset 0x0 and space 0x51 offset 0x0
    vmeplus0 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmeplus@2d,0
    pseudo-device: pm0
    pm0 is /pseudo/pm@0
    vmedma0 at sbus0
    vmedma0 is /iommu@0,10000000/sbus@0,10001000/vmedma
    vmefdma0 at sbus0
    vmefdma0 is /iommu@0,10000000/sbus@0,10001000/vmefdma
    vmedvma0 at sbus0
    vmedvma0 is /iommu@0,10000000/sbus@0,10001000/vmedvma
    vmectl0 at VME0
    vmectl0 is /iommu@0,10000000/sbus@0,10001000/VME@5,ffffe00/vmectl
    Configuring the /dev directory
    Configuring the /dev directory (compatibility devices)
    The system is coming up.  Please wait.
    checking ufs filesystems
    /dev/rdsk/c0t3d0s5: is stable.
    /dev/rdsk/c0t3d0s7: is stable.
    add net default: gateway ICCFG1
    NIS domainname is kek.jp
    starting rpc services: rpcbind keyserv done.
    Setting netmask of le0 to 255.255.252.0
    Setting default interface for multicast: add net 224.0.0.0: gateway onl7v2
    syslog service starting.
    Print services started.
    volume management starting.
    The system is ready.

    onl7v2 console login: root

onl7v2 was booted with the failed disk attached as sd2.
No error messages in particular were displayed for sd2.

(5-4). Examine the failed disk with the format command

(5-4-1).
Check the partition information

    onl7v2 console login: root
    Password:
    Last login: Thu Oct 29 17:43:09 on console
    Oct 29 17:55:39 onl7v2 login: ROOT LOGIN /dev/console
    Sun Microsystems Inc.   SunOS 5.6       Generic August 1997
    Starting OpenWindows in 5 seconds (type Control-C to interrupt)
    ^C#
    # csh
    onl7v2# source /.cshrc
    onl7v2# format
    Searching for disks...done

    AVAILABLE DISK SELECTIONS:
           0. c0t2d0
              /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0
           1. c0t3d0
              /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0
    Specify disk (enter its number): 0
    selecting c0t2d0
    [disk formatted]

    FORMAT MENU:
            disk       - select a disk
            type       - select (define) a disk type
            partition  - select (define) a partition table
            current    - describe the current disk
            format     - format and analyze the disk
            repair     - repair a defective sector
            label      - write label to the disk
            analyze    - surface analysis
            defect     - defect list management
            backup     - search for backup labels
            verify     - read and display labels
            save       - save new disk/partition definitions
            inquiry    - show vendor, product and revision
            volname    - set 8-character volume name
            !<cmd>     - execute <cmd>, then return
            quit
    format> partition

    PARTITION MENU:
            0      - change `0' partition
            1      - change `1' partition
            2      - change `2' partition
            3      - change `3' partition
            4      - change `4' partition
            5      - change `5' partition
            6      - change `6' partition
            7      - change `7' partition
            select - select a predefined table
            modify - modify a predefined partition table
            name   - name the current table
            print  - display the current table
            label  - write partition map and label to the disk
            !<cmd> - execute <cmd>, then return
            quit
    partition> print
    Current partition table (original):
    Total disk cylinders available: 9004 + 2 (reserved cylinders)

    Part      Tag    Flag     Cylinders        Size            Blocks
      0       root    wm       0 -   68       66.30MB    (69/0/0)      135792
      1       swap    wu      69 -  151       79.76MB    (83/0/0)      163344
      2     backup    wm       0 - 9003        8.45GB    (9004/0/0)  17719872
      3       home    wm     152 - 2234        1.95GB    (2083/0/0)   4099344
      4       home    wm    2235 - 4317        1.95GB    (2083/0/0)   4099344
      5 unassigned    wm    4318 - 4470      147.02MB    (153/0/0)     301104
      6        usr    wm    4471 - 5104      609.23MB    (634/0/0)    1247712
      7       home    wm    5105 - 9003        3.66GB    (3899/0/0)   7673232

    partition> q

No inconsistencies are seen in the partition information.

(5-4-2). Examine the disk with the analyze command

    partition> q

    FORMAT MENU:
            disk       - select a disk
            type       - select (define) a disk type
            partition  - select (define) a partition table
            current    - describe the current disk
            format     - format and analyze the disk
            repair     - repair a defective sector
            label      - write label to the disk
            analyze    - surface analysis
            defect     - defect list management
            backup     - search for backup labels
            verify     - read and display labels
            save       - save new disk/partition definitions
            inquiry    - show vendor, product and revision
            volname    - set 8-character volume name
            !<cmd>     - execute <cmd>, then return
            quit
    format> q
    onl7v2# format
    Searching for disks...done

    AVAILABLE DISK SELECTIONS:
           0. c0t2d0
              /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0
           1. c0t3d0
              /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0
    Specify disk (enter its number): 0
    selecting c0t2d0
    [disk formatted]

    FORMAT MENU:
            disk       - select a disk
            type       - select (define) a disk type
            partition  - select (define) a partition table
            current    - describe the current disk
            format     - format and analyze the disk
            repair     - repair a defective sector
            label      - write label to the disk
            analyze    - surface analysis
            defect     - defect list management
            backup     - search for backup labels
            verify     - read and display labels
            save       - save new disk/partition definitions
            inquiry    - show vendor, product and revision
            volname    - set 8-character volume name
            !<cmd>     - execute <cmd>, then return
            quit
    format> analyze

    ANALYZE MENU:
            read     - read only test   (doesn't harm SunOS)
            refresh  - read then write  (doesn't harm data)
            test     - pattern testing  (doesn't harm data)
            write    - write then read      (corrupts data)
            compare  - write, read, compare (corrupts data)
            purge    - write, read, write   (corrupts data)
            verify   - write entire disk, then verify (corrupts data)
            print    - display data buffer
            setup    - set analysis parameters
            config   - show analysis parameters
            !<cmd>   - exeucte <cmd>, then return
            quit
    analyze> read
    Ready to analyze (won't harm SunOS). This takes a long time,
    but is interruptable with CTRL-C. Continue? yes

            pass 0
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0 (sd2):
            Error for Command: read         Error Level: Retryable
            Requested Block: 756    Error Block: 756
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Media Error
            ASC: 0x11 (unrecovered read error), ASCQ: 0x0, FRU: 0x0
    Medium error during read: block 756 (0x2f4) (0/3/18)
    ASC: 0x11   ASCQ: 0x0
    Repairing hard error on 756 (0/3/18)...failed.

The error was confirmed.
According to Matsuda-san, it appears that for some reason the disk
surface has been scratched and the damage cannot be repaired. A new
disk will be sent, and the failed disk is to be sent back.

       9003/7/132

            pass 1
       9003/7/132

    Total of 0 defective blocks repaired.

    analyze> quit

    FORMAT MENU:
            disk       - select a disk
            type       - select (define) a disk type
            partition  - select (define) a partition table
            current    - describe the current disk
            format     - format and analyze the disk
            repair     - repair a defective sector
            label      - write label to the disk
            analyze    - surface analysis
            defect     - defect list management
            backup     - search for backup labels
            verify     - read and display labels
            save       - save new disk/partition definitions
            inquiry    - show vendor, product and revision
            volname    - set 8-character volume name
            !<cmd>     - execute <cmd>, then return
            quit
    format> quit
    onl7v2# df -k
    Filesystem            kbytes    used   avail capacity  Mounted on
    /dev/dsk/c0t3d0s0      65767   23735   41967    37%    /
    /dev/dsk/c0t3d0s6     586735  384578  201571    66%    /usr
    /proc                      0       0       0     0%    /proc
    fd                         0       0       0     0%    /dev/fd
    /dev/dsk/c0t3d0s7    7910181  152221 7678859     2%    /export/home
    /dev/dsk/c0t3d0s5      28719   11684   17007    41%    /opt
    swap                  115716      12  115704     1%    /tmp
    onl7v2# mount /dev/dsk/c0t2d0s0 /mnt
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0 (sd2):
            Error for Command: read         Error Level: Retryable
            Requested Block: 1920   Error Block: 1920
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Media Error
            ASC: 0x11 (unrecovered read error), ASCQ: 0x0, FRU: 0x0
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0 (sd2):
            Error for Command: read         Error Level: Retryable
            Requested Block: 1920   Error Block: 1920
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Media Error
            ASC: 0x11 (unrecovered read error), ASCQ: 0x0, FRU: 0x0
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0 (sd2):
            Error for Command: read         Error Level: Retryable
            Requested Block: 1920   Error Block: 1920
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Media Error
            ASC: 0x11 (unrecovered read error), ASCQ: 0x0, FRU: 0x0
    onl7v2#

An attempt was made to mount the failed disk, but errors were reported.

    onl7v2# df -k
    Filesystem            kbytes    used   avail capacity  Mounted on
    /dev/dsk/c0t3d0s0      65767   23736   41966    37%    /
    /dev/dsk/c0t3d0s6     586735  384578  201571    66%    /usr
    /proc                      0       0       0     0%    /proc
    fd                         0       0       0     0%    /dev/fd
    /dev/dsk/c0t3d0s7    7910181  152221 7678859     2%    /export/home
    /dev/dsk/c0t3d0s5      28719   11684   17007    41%    /opt
    swap                  115716      12  115704     1%    /tmp
    /dev/dsk/c0t2d0s0      63159   20024   43072    32%    /mnt
    onl7v2# umount /mnt
    WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@2,0 (sd2):
            Error for Command: write        Error Level: Retryable
            Requested Block: 1920   Error Block: 16
            Vendor: SEAGATE                 Serial Number: AY102791
            Sense Key: Media Error
            ASC: 0xc (), ASCQ: 0x2, FRU: 0x0
    onl7v2#
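
The analyze output above reports block 756 as (0/3/18), and the
partition table gives 135792 blocks for the 69 cylinders of slice 0.
As a rough cross-check, the following Python sketch converts the block
numbers seen in the warnings into cylinder/head/sector form and looks
up the slice that contains them. The geometry (246 sectors per track,
8 heads) is not printed anywhere in this log; it is an assumption
inferred from those two figures. The sketch also assumes that the
block numbers in the sd warnings count from the start of the disk.
The names block_to_chs and slice_for_block are made up for this
example.

```python
# Geometry inferred from this log (assumption, not read from the drive):
#   slice 0 spans 69 cylinders and 135792 blocks -> 1968 blocks per cylinder
#   block 756 is reported as (0/3/18)            -> 3 * 246 + 18 = 756
BLOCKS_PER_CYLINDER = 135792 // 69
SECTORS_PER_TRACK = 246
HEADS = BLOCKS_PER_CYLINDER // SECTORS_PER_TRACK

# (slice number, tag, first cylinder, last cylinder) copied from the
# "partition> print" output; slice 2 (whole disk) is left out on purpose.
SLICES = [
    (0, "root", 0, 68),
    (1, "swap", 69, 151),
    (3, "home", 152, 2234),
    (4, "home", 2235, 4317),
    (5, "unassigned", 4318, 4470),
    (6, "usr", 4471, 5104),
    (7, "home", 5105, 9003),
]


def block_to_chs(block):
    """Convert an absolute block number to (cylinder, head, sector)."""
    cylinder, remainder = divmod(block, BLOCKS_PER_CYLINDER)
    head, sector = divmod(remainder, SECTORS_PER_TRACK)
    return cylinder, head, sector


def slice_for_block(block):
    """Return the slice number whose cylinder range holds the block."""
    cylinder = block_to_chs(block)[0]
    for number, _tag, first, last in SLICES:
        if first <= cylinder <= last:
            return number
    return None


if __name__ == "__main__":
    # Block numbers taken from the warnings in this log.
    for block in (756, 1920, 95328):
        c, h, s = block_to_chs(block)
        print(f"block {block}: {c}/{h}/{s} in slice {slice_for_block(block)}")
```

Under these assumptions all three failing blocks (756, 1920 and 95328)
fall inside slice 0, the root file system, which fits the fact that the
errors appear while booting from the disk and while mounting
/dev/dsk/c0t2d0s0.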
Once the replacement disk arrives, the failed disk will be sent back.
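
The sd driver warnings recorded in this log all follow the same field
layout (command, error level, requested block, error block, vendor,
serial number, sense key, ASC/ASCQ). When reporting a failure like
this one to a vendor, it can help to tabulate those fields. The
following is a minimal Python sketch that does so; the regular
expression is written against the messages as they appear in this log
only and is not a general parser for Solaris messages. The names
SD_ERROR, SAMPLE_LOG and parse_sd_errors are made up for this example.

```python
import re

# Pattern written against the warnings in this log. \s+ between fields
# lets it match both the multi-line console layout and flattened text.
SD_ERROR = re.compile(
    r"\((?P<unit>sd\d+)\):\s+"
    r"Error for Command:\s+(?P<command>\S+)\s+"
    r"Error Level:\s+(?P<level>\S+)\s+"
    r"Requested Block:\s+(?P<requested_block>\d+)\s+"
    r"Error Block:\s+(?P<error_block>\d+)\s+"
    r"Vendor:\s+(?P<vendor>\S+)\s+"
    r"Serial Number:\s+(?P<serial>\S+)\s+"
    r"Sense Key:\s+(?P<sense_key>.+?)\s+"
    r"ASC:\s+(?P<asc>0x[0-9a-fA-F]+)\s+\((?P<asc_text>[^)]*)\),\s+"
    r"ASCQ:\s+(?P<ascq>0x[0-9a-fA-F]+)"
)

# Two warnings copied from this log: the boot-time write error on
# onl8v1 and the write error logged by "umount /mnt" on onl7v2.
SAMPLE_LOG = (
    "WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/"
    "esp@5,8800000/sd@3,0 (sd3): Error for Command: write "
    "Error Level: Retryable Requested Block: 95328 Error Block: 95328 "
    "Vendor: SEAGATE Serial Number: AY102791 Sense Key: Hardware Error "
    "ASC: 0x9 (track following error), ASCQ: 0x0, FRU: 0x0\n"
    "WARNING: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/"
    "esp@5,8800000/sd@2,0 (sd2): Error for Command: write "
    "Error Level: Retryable Requested Block: 1920 Error Block: 16 "
    "Vendor: SEAGATE Serial Number: AY102791 Sense Key: Media Error "
    "ASC: 0xc (), ASCQ: 0x2, FRU: 0x0\n"
)


def parse_sd_errors(text):
    """Return one dict per sd error report found in text."""
    records = []
    for match in SD_ERROR.finditer(text):
        record = match.groupdict()
        for key in ("requested_block", "error_block"):
            record[key] = int(record[key])
        for key in ("asc", "ascq"):
            record[key] = int(record[key], 16)
        records.append(record)
    return records


if __name__ == "__main__":
    for rec in parse_sd_errors(SAMPLE_LOG):
        print(rec["unit"], rec["command"], rec["requested_block"],
              rec["sense_key"], hex(rec["asc"]), hex(rec["ascq"]))
```

Messages without the sense data block, such as "offline" or "i/o to
invalid geometry", are skipped by this pattern.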