Filed Under (raid) by Dave Mast on February-21-2007

Last week I came into the office to find that a drive on our domain controller had failed.  No big deal, right?  I overnighted 2 drives, and installed them both the next day; one as a replacement for the failed drive, and another as a hot spare.

I was hurried into work this morning by a voicemail from K.  She informed me that noone was able to log into the Exchange server.  I know I had everything running last night before I left, so this really caught me off-guard.

I rush into the office (fortunately I’m only about 5 miles from the church) and sit down to begin diagnosis.  My tests led me once again to the domain controller I had just worked on less than a week ago.  I tried to click on the start menu, and the whole system froze up.  Wow, the same symtoms that I experienced last week.  I winced, hit the reset button on the machine, and sure enough, the RAID controller’s BIOS shows a rebuild in progress.  Once the server was back up and running, I could see in the event log that the failure had taken place at 7:30

I was partially relieved to see that the drive I had just installed wasn’t the culprit, but was another one of the original drives for that array.  Wow…2 drives in under a week’s time.  These were both Maxtor DiamondMax hard drives that had failed on me, and they’ve been in use for less than a year.  The controller is a Promise Technology TX4310, and we’ve had no issues up to this point.

screenhunter_09-feb-21-1303.jpg

 

 

 

 

 

 

.

 screenhunter_10-feb-21-1312.jpg

 

 

 

 

 

 

 

Another weird point is that the rebuild actually started BEFORE the machine locked up.  The hot spare stepped in just like it was supposed to, but the controller shows 5 device timeouts before the system gave up on it.

Hopefully this is a drive issue and not a controller issue.  Nonetheless, the task of getting a second DC up and running just went up in priority by many notches.



Comments
todd on February 21st, 2007 at 7:47 pm #

Dave,

Your the man. You knew what to do and you fixed it!

Rhonda on February 22nd, 2007 at 12:15 pm #

hey dave!
u’ve all inspired me to enter the world of blogging…..i’ve caught the fever!

Shannon Bailey on March 1st, 2007 at 7:20 am #

Dave,

Do you still have the bad Maxtor Drives? I read in your blog that they where less than 1 year old. Maxtor usually has a three year warranty but they certainly have a 1 year warranty. Here is a link to the warranty verification page.

http://support.seagate.com/customer/warranty_validation.jsp

It says seagate because they bought Maxtor but it is the correct site. All you have to do is type in the serial numbers to see if they are still under warranty.

You may be able to get them replaced for nothing so that you can have a few more spares for free :>).

Dave Mast on March 2nd, 2007 at 4:29 pm #

Thanks for the tip, Shannon! I’ll give that a try when I get back in the office. :-)

Post a comment
Name: 
Email: 
URL: 
Comments: 

FireStats iconPowered by FireStats