[arch-general] Problems about SSD

KangJing Huang huangkangjing at gmail.com
Sat Mar 15 12:30:04 EDT 2014


Hi, there.

I'm not sure if I should post this question here, maybe I should go to
somewhere else like archlinux bbs or LKML, but if I did do anything wrong,
please point it out...

I'm currently encountering some really weird hardware problems, that is my
newly installed SSD randomly refuses to work. I have looked up in my kernel
messages and found something like this:

Mar 15 23:03:34 hkj-laptop kernel: ata3: exception Emask 0x10 SAct 0x0 SErr
0x4090000 action 0xe frozen
Mar 15 23:03:34 hkj-laptop kernel: ata3: irq_stat 0x00400040, connection
status changed
Mar 15 23:03:34 hkj-laptop kernel: ata3: SError: { PHYRdyChg 10B8B DevExch }
Mar 15 23:03:34 hkj-laptop kernel: ata3: hard resetting link
Mar 15 23:03:34 hkj-laptop kernel: psmouse serio2: trackpoint: IBM
TrackPoint firmware: 0x0e, buttons: 3/3
Mar 15 23:03:35 hkj-laptop kernel: ata3: SATA link up 3.0 Gbps (SStatus 123
SControl 300)
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: ACPI cmd ef/02:00:00:00:00:a0
(SET FEATURES) succeeded
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: ACPI cmd f5/00:00:00:00:00:a0
(SECURITY FREEZE LOCK) filtered out
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: ACPI cmd ef/10:03:00:00:00:a0
(SET FEATURES) filtered out
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: ACPI cmd ef/02:00:00:00:00:a0
(SET FEATURES) succeeded
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: ACPI cmd f5/00:00:00:00:00:a0
(SECURITY FREEZE LOCK) filtered out
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: ACPI cmd ef/10:03:00:00:00:a0
(SET FEATURES) filtered out
Mar 15 23:03:35 hkj-laptop kernel: ata3.00: configured for UDMA/133
Mar 15 23:03:35 hkj-laptop kernel: ata3: EH complete
Mar 15 23:03:35 hkj-laptop kernel: input: TPPS/2 IBM TrackPoint as
/devices/platform/i8042/serio1/serio2/input/input15
Mar 15 23:03:36 hkj-laptop kernel: r8169 0000:0c:00.0 enp12s0: link down
Mar 15 23:03:36 hkj-laptop kernel: IPv6: ADDRCONF(NETDEV_UP): enp12s0: link
is not ready
Mar 15 23:03:39 hkj-laptop kernel: type=1006 audit(1394895819.257:2):
pid=351 uid=0 old auid=4294967295 new auid=1000 old ses=4294967295 new
ses=1 res=1
Mar 15 23:03:39 hkj-laptop kernel: type=1006 audit(1394895819.263:3):
pid=414 uid=0 old auid=4294967295 new auid=1000 old ses=4294967295 new
ses=2 res=1
Mar 15 23:04:05 hkj-laptop kernel: type=1006 audit(1394895845.503:4):
pid=395 uid=0 old auid=4294967295 new auid=1000 old ses=4294967295 new
ses=3 res=1
Mar 15 23:08:53 hkj-laptop kernel: r8169 0000:0c:00.0 enp12s0: link up
Mar 15 23:08:53 hkj-laptop kernel: IPv6: ADDRCONF(NETDEV_CHANGE): enp12s0:
link becomes ready
Mar 15 23:08:53 hkj-laptop kernel: ata3: exception Emask 0x10 SAct 0x0 SErr
0x4090000 action 0xe frozen
Mar 15 23:08:53 hkj-laptop kernel: ata3: irq_stat 0x00400040, connection
status changed
Mar 15 23:08:53 hkj-laptop kernel: ata3: SError: { PHYRdyChg 10B8B DevExch }
Mar 15 23:08:53 hkj-laptop kernel: ata3: hard resetting link
Mar 15 23:08:54 hkj-laptop kernel: ata3: SATA link down (SStatus 0 SControl
300)
Mar 15 23:08:59 hkj-laptop kernel: ata3: hard resetting link
Mar 15 23:08:59 hkj-laptop kernel: ata3: SATA link down (SStatus 0 SControl
300)
Mar 15 23:08:59 hkj-laptop kernel: ata3: limiting SATA link speed to 1.5
Gbps
Mar 15 23:09:04 hkj-laptop kernel: ata3: hard resetting link
Mar 15 23:09:04 hkj-laptop kernel: ata3: SATA link down (SStatus 0 SControl
310)
Mar 15 23:09:04 hkj-laptop kernel: ata3.00: disabled
Mar 15 23:09:04 hkj-laptop kernel: ata3: EH complete
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: rejecting I/O to offline
device
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb] killing request
Mar 15 23:09:04 hkj-laptop kernel: ata3.00: detaching (SCSI 2:0:0:0)
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb] Unhandled error code
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb]
Mar 15 23:09:04 hkj-laptop kernel: Result: hostbyte=0x01 driverbyte=0x00
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb] CDB:
Mar 15 23:09:04 hkj-laptop kernel: cdb[0]=0x28: 28 00 01 40 0a 48 00 00 10
00
Mar 15 23:09:04 hkj-laptop kernel: end_request: I/O error, dev sdb, sector
20974152
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 1,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 2,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 1786973475 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 3,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 4,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 4252743571 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 5,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 235249680 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 6,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 4252743571 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 7,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10738774016 csum 235249680 wanted 2242882195 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 8,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 33247 extent
5844008960 csum 1491206752 wanted 1630629534 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs: bdev /dev/sdb2 errs: wr 0, rd 9,
flush 0, corrupt 0, gen 0
Mar 15 23:09:04 hkj-laptop kernel: btrfs csum failed ino 33247 extent
5844008960 csum 2566472073 wanted 1630629534 mirror 0
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb] Synchronizing SCSI
cache
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb]
Mar 15 23:09:04 hkj-laptop kernel: Result: hostbyte=0x04 driverbyte=0x00
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb] Stopping disk
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb] START_STOP FAILED
Mar 15 23:09:04 hkj-laptop kernel: sd 2:0:0:0: [sdb]
Mar 15 23:09:04 hkj-laptop kernel: Result: hostbyte=0x04 driverbyte=0x00
Mar 15 23:09:06 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:06 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:06 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:06 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:06 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:08 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:08 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:08 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:08 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:08 hkj-laptop kernel: btrfs csum failed ino 145333 extent
10632859648 csum 2566472073 wanted 2242882195 mirror 0
Mar 15 23:09:08 hkj-laptop kernel: ata3: exception Emask 0x10 SAct 0x0 SErr
0x4050002 action 0xe frozen
Mar 15 23:09:08 hkj-laptop kernel: ata3: irq_stat 0x00000040, connection
status changed
Mar 15 23:09:08 hkj-laptop kernel: ata3: SError: { RecovComm PHYRdyChg
CommWake DevExch }
Mar 15 23:09:08 hkj-laptop kernel: ata3: hard resetting link
Mar 15 23:09:09 hkj-laptop kernel: ata3: SATA link down (SStatus 0 SControl
300)
Mar 15 23:09:09 hkj-laptop kernel: ata3: EH complete
Mar 15 23:09:10 hkj-laptop kernel: r8169 0000:0c:00.0 enp12s0: link down
Mar 15 23:09:11 hkj-laptop kernel: r8169 0000:0c:00.0 enp12s0: link up

As you can see, the kernel is reporting some ata errors, and with these
errors reporting, the kernel tries to hard reset the ata link with the SSD,
sometimes it succeeded sometimes failed, and when failed, the drive is
turned down and disabled by kernel, which would immediately lead to serious
problems and crashes in  both file system modules of kernel  and user space.

I have googled through the net, and found this
https://ata.wiki.kernel.org/index.php/Libata_error_messages wiki saying
that these error messages indicates "a hardware problem (often a bad SATA
cable or a bad or inadequate power supply)", but I'm not sure which
specific hardware problem this case really is. It can be an inadequate
power supply, or a hardware failure of the SSD ata controller.  As I have
bought this drive  in less than 3 months, I have to decide whether to start
a claim of refund, so I would like to know if these is anyone who
encountered any kind of similar problems here.

What bothers most in this case is that the problem seems to be completely
random and hard to reproduce. I have no idea how it is triggered, sometimes
when I start up virtual machine, sometimes when I plug in an external USB
device, sometimes when I have run the computer in long hours. But none of
the cases described above seems to be reproduce-able, this is driving me
mad...

Additional data about my hardware and software:
Model: Lenovo Thinkpad Edge E430 3254-A71
Hardware modifications:
An additional 4GB RAM is added.
Original HDD was replaced by a HGST HTS721010A9E630 7200 rpm drive
An external SanDisk SD6SF1M128G1022 solid drve was added to the spare
mSata/mPCIe port of the laptop.


Please help, any kind of reply will be greatly appreciated !

cheers,
Chaserhkj


More information about the arch-general mailing list