Author Topic: HDD errors!  (Read 872 times)

Agent007

  • Member
  • **
  • Posts: 120
  • Kudos: 0
HDD errors!
« on: 1 November 2002, 22:37 »
hi all,

I was going throught the logs and came across these errors. What do they mean?
This is a brand new disc, 2 days old. 40GB supports UDMA mode 4.

Following is the info:-

/dev/hda:
 multcount    = 16 (on)
 IO_support   =  1 (32-bit)
 unmaskirq    =  1 (on)
 using_dma    =  1 (on)
 keepsettings =  0 (off)
 readonly     =  0 (off)
 readahead    =  8 (on)
 geometry     = 4865/255/63, sectors = 78165360, start = 0


thanks & rgds,
007


 
quote:

Oct 30 22:00:37 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Oct 30 22:00:37 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Oct 30 22:00:37 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Oct 30 22:00:37 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Oct 30 22:07:27 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Oct 30 22:07:27 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Oct 31 22:53:10 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Oct 31 22:53:10 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Oct 31 23:25:38 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Oct 31 23:25:38 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Nov  1 18:50:23 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Nov  1 18:50:23 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Nov  1 19:02:03 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Nov  1 19:02:03 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Nov  1 19:02:03 localhost kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Nov  1 19:02:03 localhost kernel: hda: dma_intr: error=0x84 { DriveStatusError
BadCRC }
Nov  1 19:02:25 localhost login(pam_unix)[1510]: session opened for user root by


LOGIN(uid=0)

AMD Athlon processor
256MB SDRAM
Linux Distro - RedHat 9.0

voidmain

  • VIP
  • Member
  • ***
  • Posts: 5,605
  • Kudos: 184
    • http://voidmain.is-a-geek.net/
HDD errors!
« Reply #1 on: 1 November 2002, 23:00 »
I get these messages on old drives when they are about ready to die. I believe it is caused by bad sectors. If you can figure out exactly where the bad spot is you can partition around it and get longer life out of your drive. I find when you place data over this area with the bad sectors the app will come to a crawl when trying to read data from this sector until it is either successful or times out. Might also try reformatting with a "-c" (check for bad blocks). You can also pass the format command a list of bad blocks so it will format around it.

Since it is a new drive I would think about running some disk diagnostics on it and if it finds problems get it replaced. Nothing good has ever come from a drive I have had when you start seeing these messages.
Someone please remove this account. Thanks...

pkd_lives

  • Member
  • **
  • Posts: 554
  • Kudos: 0
HDD errors!
« Reply #2 on: 1 November 2002, 23:12 »
It looks to be failing a CRC and that is always bad. This error can be caused by corruption of files, it can be the hardrive failing in many ways - including physical deteriation. Generally a disk that fails CRC is trash ready.

Unless you can find out the specific cause - but I warn you trying to keep a CRC damaged disk going can be hell - it WILL make your system unstable.

I would suggest you get the disk replaced, and I would suggest you try going direct to the manufacturer. You will need to find the damaged files - if they exist and then reinstall them, which may well mean a complete reinstallation of the data. Out of interest which OS are you using?

If it is a new drive it was probably tuned slightly bad, maybe not bad enough to be spoted by the manufacturer.

It is possible to save these disks, but CRC generally comes back to haunt. This is low level stuff and is not really attributable in any way to a specific operating system, although I have found OS/2 Warp 4 does do this a lot to floppy disks, possibly as a result of using HPFS. And I have my suspicons about NTFS.

[ November 01, 2002: Message edited by: pkd ]

Tough - Adapt or die : Read The Fucking Manual.

Local Area Network in Australia: the LAN down under.


KernelPanic

  • VIP
  • Member
  • ***
  • Posts: 1,878
  • Kudos: 222
HDD errors!
« Reply #3 on: 2 November 2002, 01:05 »
so IBM or Western Digital  ;)
Contains scenes of mild peril.

Agent007

  • Member
  • **
  • Posts: 120
  • Kudos: 0
HDD errors!
« Reply #4 on: 2 November 2002, 22:46 »
Hi guys,

Thanks for the info. Btw, I'm on RedHat 8.0..I changed the drive cable to ide1, it was on ide0 first and the prob seems to have gone, atleast for now. This is a new disc, the previous one of the same make (seagate) crashed after a period of six months. Do u guys think that this disc is going too? How come?

thanks,
007
AMD Athlon processor
256MB SDRAM
Linux Distro - RedHat 9.0

voidmain

  • VIP
  • Member
  • ***
  • Posts: 5,605
  • Kudos: 184
    • http://voidmain.is-a-geek.net/
HDD errors!
« Reply #5 on: 2 November 2002, 23:21 »
There are some drive types, and some batches of drives within the same drive types that are just crap or have manufacturing defects. I don't know if this is one of them or if you have a bad cable. My suspicion is that you have a bad cable if moving it seemed to have an effect on the drive. Also make sure you don't have any master/slave conflicts with other drives.
Someone please remove this account. Thanks...

pkd_lives

  • Member
  • **
  • Posts: 554
  • Kudos: 0
HDD errors!
« Reply #6 on: 3 November 2002, 00:57 »
Okay you switched IDE port and the error vanished. I am pretty sure this is hardware related.

It could be the drive - I am guessing on the bad hardrive because you are getting CRC errors. By the time any hardrive is reaching Cyclic Redundancy Checks, then usually the connection hardware has been proven. However that is not always the case. I don't know the order of tasks from startup - It's probable that only a couple of people in the world actually know this, and that is most likely guess work.

So it's a process of elimination. You changed the port:

Switch back and see if the problem replicates. Use the same cable if possible. Switch back and see if the problem vanishes switching from prot to port. If it continually follows. That can indicate a bad port (which could mean a mobo failure).

If the problem comes and goes on both ports, then suspect the cable. If you have another cable try it out, and see if you can get it to repeat the fault. If the cable switch eliminates the prob, then dump the old cable. It's always good to suspect ribbon cable, it has failed on me many times - people like to crease that stuff like paper, when it should be allowed to curl in a more natural manner.

Now it could be the hardrive, the motherboard or the cable, and it could be intermittent (which means you may be fault finding this for a long time), if it is then I am afraid you will just have to keep watching it until something fails properly.

I doubt this is software related, but it is always possible.

You can always try a reinstall. Start up minimal and add a program, and keep checking and if it switches on at a certain programme's installation then bob's your uncle.
Tough - Adapt or die : Read The Fucking Manual.

Local Area Network in Australia: the LAN down under.


Agent007

  • Member
  • **
  • Posts: 120
  • Kudos: 0
HDD errors!
« Reply #7 on: 3 November 2002, 08:47 »
I forgot to ask, what tool can I use to check for bad blocks in Linux?

thanks,
007
AMD Athlon processor
256MB SDRAM
Linux Distro - RedHat 9.0

voidmain

  • VIP
  • Member
  • ***
  • Posts: 5,605
  • Kudos: 184
    • http://voidmain.is-a-geek.net/
HDD errors!
« Reply #8 on: 3 November 2002, 21:23 »
fsck. See "man fsck".
Someone please remove this account. Thanks...