Sounds like the big thing everyone is concerned with is the high rate of failure with Seagate’s 1-1.5TB drives.  There’s a firmware update available, and it will probably be a good idea to upgrade the drive firmware if you are affected.  Actually updating the firmware from two drives plugged into my 323 will be a separate problem, but in this post I will describe how to extract all sorts of information from the drives you are running, and proactively monitor them for failures using smartmontools.

You know you want it.  Hit the jump for more.

Note:  Before we start, I need to tell you that it is possible to get the majority of this information out of the 323 by simply going to the “Status” page in the admin interface.  (On firmware 1.06 anyway).  But what’s the fun in that?

Anyway.  First thing we’re going to do is download and install smartmontools.

<Log into your 323 using ssh or telnet>.
cd /mnt/HD_a4/packages (or whatever temporary directory you use for packages)
wget http://www.inreto.de/dns323/fun-plug/0.5/packages/smartmontools-5.38-3.tgz
funpkg -i smartmontools-5.38-3.tgz

You should see:

Installing package smartmontools-5.38-3 …

Now run:

 smartctl -i -d marvell /dev/sda

And you get the drive information for your first drive (/dev/sda):

smartctl version 5.38 [arm-unknown-linux-uclibc] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.11
Device Model:     ST31000340AS
Serial Number:    XXXXXXXX
Firmware Version: SD15
User Capacity:    1,000,204,886,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Mon Jan 19 15:06:18 2009 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

Run it again for the second drive (/dev/sdb) :

 smartctl -i -d marvell /dev/sdb

 smartctl version 5.38 [arm-unknown-linux-uclibc] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.11
Device Model:     ST31000340AS
Serial Number:    XXXXXXXX
Firmware Version: SD15
User Capacity:    1,000,204,886,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Mon Jan 19 15:07:15 2009 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

I’ve X’d out my serial numbers, but if you are running Seagates you can cross-reference the Device Model & Firmware Versions to see if you are affected.

For a lot more information out of smartmontools, run the following (/dev/sda for disk1, /dev/sdb for disk 2), and look at the “RAW_VALUE” column.

smartctl -a -i -d marvell /dev/sda

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   108   099   006    Pre-fail  Always       -       17947361
  3 Spin_Up_Time            0x0003   092   085   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   098   098   020    Old_age   Always       -       2241
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   069   060   030    Pre-fail  Always       -       7451978
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       2215
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       6
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       22
184 Unknown_Attribute       0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   060   055   045    Old_age   Always       -       40 (Lifetime Min/Max 30/43)
194 Temperature_Celsius     0x0022   040   045   000    Old_age   Always       -       40 (0 21 0 0)
195 Hardware_ECC_Recovered  0x001a   029   025   000    Old_age   Always       -       17947361
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0

That’s all for now, but later I’ll show you how to set up smartd to monitor your drives and email you if it thinks they are about to fail.  I’ll also post information on how I upgraded the firmware of the these affected Seagate drives (once I figure it out!)

Update 19-Jan-09:  I logged a support ticket with Seagate.  I’ll report back on what they tell me.


11 Comments to “Seagate drive firmware issues, your DNS-323 and smartmontools”  

  1. 1 Jason

    Oh my god, I am so happy you’re covering this. I JUST installed two seagate drives in the dns-323, spent 3 days setting it up and transferring all my important data, and now I read that the drives are doomed (my model and firmware exactly).

    I run a mac, so will I have to install an ext2 driver, eject them from the NAS and flash them? Will I need with a bootable linux distro? Which one?

    I’ve got a billion questions/concerns, so I eagerly await your next article! Thanks!

  2. 2 admin

    Jason, it doesn’t sound promising dude… especially if you are mac-only. I am bummed too. Keep tabs on this forum thread.

  3. 3 Jason

    What a clusterf@ck. Oh how the mighty have fallen…Seagate used to be gold. =(

    Thanks for the forum link. I’ve been following this one.

  4. 4 Mo

    One of my drive (Samsung 1tb) is showing SMART as being disabled:

    === START OF INFORMATION SECTION ===
    Device Model: SAMSUNG HD103UJ
    Serial Number: XXXXXXXXXX
    Firmware Version: 1AA01113
    User Capacity: 1,000,204,886,016 bytes
    Device is: In smartctl database [for details use: -P show]
    ATA Version is: 8
    ATA Standard is: ATA-8-ACS revision 3b
    Local Time is: Sun Jan 25 19:43:10 2009 GMT

    ==> WARNING: May need -F samsung or -F samsung2 enabled; see manual for details.

    SMART support is: Available - device has SMART capability.
    SMART support is: Disabled

    SMART Disabled. Use option -s with argument ‘on’ to enable it.

    How do I enable SMART? (sorry, I’m a linux noob)

  5. 5 Mo

    Disregard my last post, figured out how to enable SMART:

    smartctl -s on -d marvell /dev/sda
    :)

  6. 6 Gary H

    Thank you! Exactly what I’m looking for. I can’t believe I found this site!!
    Thank you thank you!!
    Gary

  7. 7 Gary H

    I have high values for raw_read_error_rate and seek_error_rate.
    Do I need to worry about these values?

    Here is my data on one of my two 1.5T seagate drives:

    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0×000f 105 099 006 Pre-fail Always - 7404434
    3 Spin_Up_Time 0×0003 092 092 000 Pre-fail Always - 0
    4 Start_Stop_Count 0×0032 100 100 020 Old_age Always - 206
    5 Reallocated_Sector_Ct 0×0033 100 100 036 Pre-fail Always - 0
    7 Seek_Error_Rate 0×000f 071 060 030 Pre-fail Always - 14515558
    9 Power_On_Hours 0×0032 099 099 000 Old_age Always - 1086
    10 Spin_Retry_Count 0×0013 100 100 097 Pre-fail Always - 0
    12 Power_Cycle_Count 0×0032 100 100 020 Old_age Always - 7
    184 Unknown_Attribute 0×0032 100 100 099 Old_age Always - 0
    187 Reported_Uncorrect 0×0032 100 100 000 Old_age Always - 0
    188 Unknown_Attribute 0×0032 100 100 000 Old_age Always - 0
    189 High_Fly_Writes 0×003a 100 100 000 Old_age Always - 0
    190 Airflow_Temperature_Cel 0×0022 059 050 045 Old_age Always - 41 (Lifetime Min/Max 35/50)
    194 Temperature_Celsius 0×0022 041 050 000 Old_age Always - 41 (0 22 0 0)
    195 Hardware_ECC_Recovered 0×001a 056 036 000 Old_age Always - 7404434
    197 Current_Pending_Sector 0×0012 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0×0010 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0×003e 200 200 000 Old_age Always - 0
    240 Head_Flying_Hours 0×0000 100 253 000 Old_age Offline - 49868865274923
    241 Unknown_Attribute 0×0000 100 253 000 Old_age Offline - 953925977
    242 Unknown_Attribute 0×0000 100 253 000 Old_age Offline - 1122719573

    Thanks,
    Gary

  8. 8 admin

    @Gary H, I have no idea. :) Maybe ask Seagate or a sysadmin that knows smartctl better. Curious to know what the result is. Good luck!

  9. 9 Sash

    Hi there,

    Look forward to the email notification. I’m surprised the DNS 323 firmware doesn’t support email raid/HDD fail notification and smart info built right in ? (Or have I just not found it?)

    I’ve been looking around your site and want to know if it’s safe to do this on fimrware 1.08 (i.e hack it so I can run linux, smart tools, bittorrent and plug in USB devices?)

    Thanks again
    Sash

  1. 1 Some thoughts on your data, backups, failed drives, and architecting for availability at horto.ca
  2. 2 Upgrading the firmware on affected Seagate 1TB hdds at horto.ca


Leave a Reply