I’m starting to see more and more e-mails from my own Server indicating:
Device: /dev/sda [SAT], ATA error count increased from 588 to 601.
After doing some research, it turns out this is most likely a physical issue caused by loose cabling or controller card not seated correctly. So I did some hunting around inside the server chassis, checking cabling and reseating controller cards, unfortunately the trouble keeps incrementing every few weeks with a new e-mail. I’ll have to keep and eye on this going forward.
I find myself wishing that Steve Gibson’s SpinRite application was available on the various Linux environments, and on drives situated behind an HBA (Host Bus Adapter) Controller card (Hey! if I’m going to wish, might was well go big!). That program would have found the damaged sections, relocated any readable data from the impacted section to a known good section, and marked the spot so the OS didn’t bother to try using it again in the future. Ah the good ol’ days…