<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Storage Architect &#187; RAID</title>
	<atom:link href="http://thestoragearchitect.com/tag/raid/feed/" rel="self" type="application/rss+xml" />
	<link>http://thestoragearchitect.com</link>
	<description>Storage, Virtualisation &#38; Cloud</description>
	<lastBuildDate>Tue, 07 Feb 2012 10:08:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Is RAID Dead?  No &#8211; Next Question</title>
		<link>http://thestoragearchitect.com/2010/12/01/is-raid-dead-no-next-question/</link>
		<comments>http://thestoragearchitect.com/2010/12/01/is-raid-dead-no-next-question/#comments</comments>
		<pubDate>Wed, 01 Dec 2010 09:23:10 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Ulitzer]]></category>
		<category><![CDATA[IBM 3380]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[Wikibon]]></category>
		<category><![CDATA[xiv]]></category>

		<guid isPermaLink="false">http://www.thestoragearchitect.com/?p=2110</guid>
		<description><![CDATA[<p><a href="http://www.portakabin.co.uk/what-is-modular.html" target="_blank">Portakabin Modular Construction</a></p> <p>I had a small discussion today with a few folks on Twitter that was a forerunner to a Wikibon &#8220;Peer Incite&#8221; discussion on whether RAID was still relevant or not (hence the title of this post).  It seems to me that perhaps the Wikibon discussion was a good way to [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p><span style="text-decoration: underline;"><strong><a href="http://www.portakabin.co.uk/what-is-modular.html"  target="_blank">Portakabin Modular Construction</a></strong></span></p>
<p>I had a small discussion today with a few folks on Twitter that was a forerunner to a Wikibon &#8220;Peer Incite&#8221; discussion on whether RAID was still relevant or not (hence the title of this post).  It seems to me that perhaps the Wikibon discussion was a good way to introduce a &#8220;RAID beating&#8221; technology to the market (and that&#8217;s a subject for another post), but regardless it may be worth reviewing where we are with RAID and it&#8217;s relevance in the future.</p>
<h3>Background</h3>
<div id="attachment_2112" class="wp-caption alignleft" style="width: 220px"><a href="http://31.222.189.99/wp-content/uploads/2010/12/PH3380CJ2.jpg" ><img class="size-medium wp-image-2112" title="PH3380CJ2" src="http://50.57.85.110/wp-content/uploads/2010/12/PH3380CJ2-210x300.jpg" alt="IBM 3380 Model CJ2" width="210" height="300" /></a><p class="wp-caption-text">IBM 3380 Model CJ2</p></div>
<p>The concept of RAID for disk devices was first discussed in a 1987 technical paper called <em>&#8220;<a href="http://www-2.cs.cmu.edu/~garth/RAIDpaper/Patterson88.pdf"  target="_blank">A Case for Redundant Arrays of Inexpensive Disks (RAID)&#8221;</a></em> by David A Patterson, Garth Gibson and Randy H Katz from the University of California at Berkeley.  The premise of their paper was to compare the price and performance of SLEDs (Single Large Expensive Disks) versus cheaper alternatives in a redundant form and get improvements in performance, scalability and reliability for a lower cost.  The paper compares the price of inexpensive disks to the IBM 3380, one of the first disks I ever used.  These were monster devices the size of fridges, as you can see from this image I&#8217;ve borrowed from IBM&#8217;s archives (full link <a href="http://www-03.ibm.com/ibm/history/exhibits/storage/storage_PH3380CJ2.html"  target="_blank">here</a>).</p>
<p>I spend many hours recovering individual files from failed devices, having to recover the lost files from backups.  I&#8217;m not saying these devices were unreliable, but unfortunately individual file restores isn&#8217;t a scalable solution to managing continually increasing capacities.  To give you an idea of scale, I managed a mere 300GB of IBM 3380 &amp; 3390 storage in one installation; what seems an incredibly small amount.</p>
<p>So, RAID promised to increase reliability significantly by providing a means to recover lost data.  RAID-1 is probably the simplest and easiest of the RAID levels to understand; write the data to two separate drives and if one fails, you can read/write from the other.  RAID-5 and RAID-6 offer distributed parity data recovery, with single and dual parity respectively.  This reduces the RAID overhead (RAID-1 was 100% more expensive than no RAID on the disk costs alone) while retaining the ability to recover.  However RAID is not infallible.  Double disk failures (where a second or third drive fails during recovery of the first failed drive) can occur, despite how small the chances seem.  This means RAID should be part of a portfolio of protection measures and not your only one.</p>
<p>Today we see RAID in even the smallest home storage devices, embedded as software options in operating systems and of course an mandatory component of enterprise storage arrays.  However, RAID suffering from potential scalability issues.</p>
<h3>The Scalability Problem</h3>
<p>Whilst RAID was good for the hard drives of 20 years ago, we now are starting to see issues with the RAID architecture in a number of areas.  These issues are being driven by the sheer increase in capacity we&#8217;ve seen for modern disk drives; an increase that hasn&#8217;t been matched by an equivalent improvement in performance and I/O throughput.  Here are some of the challenges:</p>
<ul>
<li>Capacity improvements have increased 100-fold between the IBM 3380 and the latest range of Savvio 15K drives (1.26GB to 146GB)</li>
<li>Performance has only increased 50-fold in the same period</li>
<li>Density (volume occupied by drives) has increased 2 million-fold in the same period</li>
<li>Price has dropped to 1/27,000th of the cost</li>
</ul>
<p>With storage arrays that can hold up to 2000 drives, we can see that I/O isn&#8217;t keeping up with capacity even for the fastest hard drives on the market today.  The problem is exacerbated when we look at high capacity SATA drives, currently pushing 3TB and set to go higher very quickly.  These drives have slower spin speeds and less throughput, meaning RAID rebuilds have to be counted in hours (and potentially days) rather than minutes.  This extended rebuild time has a number of implications:</p>
<ul>
<li>There is a greater risk of data loss, as rebuilds are taking substantially longer</li>
<li>There is a greater performance impact as rebuilds impact host I/O capacity</li>
</ul>
<p>There is one other problem that also can&#8217;t be ignored and that&#8217;s the chance of a non-recoverable bit read error &#8211; that is, the chance that your data cannot be re-read from disk.  This error is recoverable with RAID, but not if  RAID recovery is taking place, as this failed read may be essential to rebuild missing data.  As an example, the latest Seagate Constellation drives have a non-recoverable bit read error rate of 1 bit in 10E15 (in fact the failure is reading a whole sector).  A 1TB drive has 1E13 bits.  If we have to recover a disk by reading the whole RAID stripe (imagine it consists of 10 disks) then we have a 1 in 10 chance that we will be unable to recover some of that data.  This risk doubles to 1 in 5 with 2TB drives and so on.  As we reach 8-10TB drives, rebuilding the entire RAID group in this instance means we are almost always likely to fail to recover some data.  These values will change with different RAID group sizes and disk capacities but we&#8217;re pushing the technology close to the edge at this point.</p>
<h3>The Solutions</h3>
<p>We&#8217;re already seeing some workarounds to the RAID scalability problems.</p>
<ul>
<li><strong>Block-based RAID</strong> &#8211; these architectures change the way in which RAID is implemented to a block rather than whole disk level.  So, if a drive has a partial failure, only the failed areas of the disk are rebuilt.  In addition, if the RAID group isn&#8217;t full, the system doesn&#8217;t spend time rebuilding white space.</li>
<li><strong>Failure Prediction</strong> &#8211; Storage arrays like Hitachi&#8217;s USP &amp; VSP use SMART to predict when a drive looks likely to fail.  That drive is then &#8220;soft failed&#8221; and the data copied off it while it is still working.  This means data can simply be moved to another drive, rather than rebuilt from the other disks in the parity group.  This has a significant impact on improving recovery time and reducing the impact on performance, but can be more expensive in drive costs and maintenance.</li>
<li><strong>Data Distribution</strong> &#8211; The IBM XIV storage array distributes data across many drives.  This has the impact of dramatically improving the time taken to recover failed disks as all drives participate in recovery (and protection is only RAID-1).  Of course the tradeoffs are both the increase in cost and the still possible risk of a double disk failure, which will impact every single LUN on the system.</li>
</ul>
<p>Whilst the above solutions are good, I believe we need to see more from the drive manufacturers themselves.  Ultimately the reason RAID has become a problem is due to reliability and RAID rebuild times.  We need a new approach to the way host I/O and rebuild I/O is prioritised and managed by a drive.  For instance, with the ability to put large amounts of flash into a drive, flash could be used as a repository for RAID reads.  As a drive executes normal read/write operations, it caches any data needed for requested RAID rebuilds into flash.  This is then made available to the RAID controller via a separate channel to perform the RAID rebuild.  Effectively data is rebuilt on a failed drive only as that data is read/written from the original drive; any unaccessed data is rebuilt as a low priority task.  If this approach was coupled with the ability to highlight a failing sector (and so recover that first) then reliability improves.  This idea is only one thought; I expect extending RAID&#8217;s useful life will follow a similar path to that of the increase in drive capacities; lots of incremental improvements that over time move things forward.</p>
<p>RAID isn&#8217;t dead, merely evolving to meet new challenges.  In 20 years&#8217; time I suspect RAID will still exist but will be barely recognisable from the original Berkeley paper.</p>
<p><span style="text-decoration: underline;"><strong><a href="http://www.portakabin.co.uk/what-is-modular.html"  target="_blank">Portakabin Modular Construction</a></strong></span></p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2010/12/01/is-raid-dead-no-next-question/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Hardware Review: Promise SmartStor NS4600 &#8211; Part II</title>
		<link>http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/</link>
		<comments>http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/#comments</comments>
		<pubDate>Thu, 01 Jul 2010 11:00:34 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Reviews]]></category>
		<category><![CDATA[hard drive]]></category>
		<category><![CDATA[NAS]]></category>
		<category><![CDATA[NS4600]]></category>
		<category><![CDATA[Promise]]></category>
		<category><![CDATA[Promise inc]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[SmartStor]]></category>
		<category><![CDATA[SOHO]]></category>

		<guid isPermaLink="false">http://www.thestoragearchitect.com/?p=1395</guid>
		<description><![CDATA[<p>This is a series of posts on the Promise SmartStor NS4600 home storage server.  Previous posts:</p> <a href="http://www.thestoragearchitect.com/2010/06/25/hardware-review-promise-smartstor-ns4600-part-i/" target="_blank">Hardware Review: Promise SmartStor NS4600 &#8211; Part I</a> <p>In the first post of this series, we discussed the basic hardware configuration.  This post will look at connectivity and RAID configurations supported by the NS4600.</p> <p>A quick glance [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>This is a series of posts on the Promise SmartStor NS4600 home storage  server.  Previous posts:</p>
<ul>
<li><a href="http://www.thestoragearchitect.com/2010/06/25/hardware-review-promise-smartstor-ns4600-part-i/"  target="_blank">Hardware Review: Promise SmartStor NS4600 &#8211; Part I</a></li>
</ul>
<p>In the first post of this series, we discussed the basic hardware configuration.  This post will look at connectivity and RAID configurations supported by the NS4600.</p>
<p>A quick glance at the back of the unit provides a clue as to what connectivity exists.  See the first image in this post.  There is are 2x USB, 1x eSATA and 1x Ethernet ports available. The Gigabit Ethernet connection supports multiple host protocols, which we&#8217;ll discuss in more detail later.  The eSATA port provides connectivity to external devices for either backup of the NS4600 or backup of the external device to the NS4600.  The USB port also supports the same source and target backup functionality, meaning effectively all host connection protocols are IP-based.  The USB port also functions as a printer server for USB printers.</p>
<h3>RAID Support</h3>
<p>The NS4600 supports RAID levels 0, 1, 5 &amp; 10, implemented by the onboard Promise PDC42819 SATA RAID Controller.  The required RAID level is specified at the time a volume is created; multiple logical volumes are supported on the NS4600 as long as there are sufficient disks available.  This means, for example, two RAID-1 volumes could be created or a RAID-5 volume with another RAID-0 volume could be established.  RAID settings are configured from the &#8220;RAID Management&#8221; option in the online GUI (PASM).  A number of example screenshots showing various RAID configurations are displayed in the gallery at the end of this post.</p>
<p>Volumes form the foundation of how data is presented from the NS4600.  They give the options to use RAID to manage the trade-off between capacity and performance.  For example, a single RAID-0 volume could be used for backups, while the main data is RAID-1 protected, with a final drive kept as spare.  Of course drives are hot-pluggable, so not all slots need to be initially populated.  This means drives can be added to a RAID group to increase capacity over time.  For instance two drives could be used to populate the NS4600 in the first instance.  This can then be expanded dynamically, changing the RAID level or adding additional capacity.</p>
<p>Although RAID is implemented in hardware, the options available are reasonably flexible in offering multiple dynamically expandable configurations.  However, I&#8217;d question whether traditional RAID implementations are the way forward in home storage devices.  Bear in mind that 2TB drives are becoming the norm and that means within 18 months to 2 years, 3/4TB and even 5TB drives will become commonplace.  As we move to much larger capacities, unrecoverable read errors become a real issue, so rather than recovering an entire disk, the ability to recover individual chunks of data is more preferable.  This is the methodology <a href="http://www.drobo.com/"  target="_blank">Data Robotics</a> have implemented within their BeyondRAID technology.  Promise are playing the trade-off between rock-solid RAID-in-silicon versus RAID-in-software.  At the moment my money goes with software RAID and the enhanced flexibility it brings.</p>
<div id="attachment_1617" class="wp-caption alignleft" style="width: 310px"><a href="http://31.222.189.99/wp-content/uploads/2010/06/NS4600-SS9-RAID-Rebuild.png" ><img class="size-medium wp-image-1617" title="NS4600 SS#9 - RAID Rebuild" src="http://50.57.85.110/wp-content/uploads/2010/06/NS4600-SS9-RAID-Rebuild-300x112.png" alt="RAID Rebuild on NS4600" width="300" height="112" /></a><p class="wp-caption-text">RAID Rebuild on NS4600</p></div>
<p>Now I like a bit of fun with RAID systems and one of my favourite tricks is to exchange RAID drives within an array.  So, on one of the NS4600&#8242;s, I powered it down, removed all the drives and powered it back up.  Fortunately, the device configuration isn&#8217;t stored on the removed disks and the NS4600 remained accessible, although protesting at the fact no drives were present.  After adding the drives back (in a random order) the NS4600 detected them and recovered the RAID sets and I was back in business.  I wouldn&#8217;t advise removing all the drives in normal practice, however if a chassis completely fails, then presumably the data can be recovered in another unit (although I haven&#8217;t tested this).</p>
<h3>Volumes</h3>
<div id="attachment_1618" class="wp-caption alignleft" style="width: 310px"><a href="http://31.222.189.99/wp-content/uploads/2010/06/NS4600-SS3-Free-Disks.png" ><img class="size-medium wp-image-1618" title="NS4600 SS#3 - Free Disks" src="http://50.57.85.110/wp-content/uploads/2010/06/NS4600-SS3-Free-Disks-300x132.png" alt="NS4600 - Free Disks - Unconfigured Volumes" width="300" height="132" /></a><p class="wp-caption-text">NS4600 - Free Disks - Unconfigured Volumes</p></div>
<p>Volumes are the logical entity that are created when establishing the RAID configuration of the NS4600.  The screenshot below shows the NS4600 before any volumes have been created.  There are four volumes in the free pool.  The second screenshot shows two volumes that have been created from the four available drives in my test NS4600.  Volumes and the RAID set on which they are stored are a 1:1 relationship; a volume may not span a RAID set and a RAID set may not contain more than one volume.  This may seem a little restrictive, however as we&#8217;ll see later, file systems and iSCSI LUNs are</p>
<div id="attachment_1620" class="wp-caption alignleft" style="width: 310px"><a href="http://31.222.189.99/wp-content/uploads/2010/06/NS4600-SS10-Multiple-Volumes.png" ><img class="size-medium wp-image-1620" title="NS4600 SS#10 - Multiple Volumes" src="http://50.57.85.110/wp-content/uploads/2010/06/NS4600-SS10-Multiple-Volumes-300x109.png" alt="NS4600 Multiple Volumes" width="300" height="109" /></a><p class="wp-caption-text">NS4600 Multiple Volumes</p></div>
<p>contained within a volume and therefore volumes should be thought of as providing specific RAID availability.</p>
<p>Volumes can be expanded; see the screenshot, which shows a volume in the process of being expanded.  This volume is being converted from RAID-1 to RAID-5.  RAID-0 volumes can be expanded to larger RAID-0 volumes or moved to RAID-1 or RAID-5 configurations.  Basically, volumes can be increased in size or moved to a higher level of RAID protection &#8211; but not down.</p>
<div id="attachment_1621" class="wp-caption alignleft" style="width: 310px"><a href="http://31.222.189.99/wp-content/uploads/2010/06/NS4600-SS5-RAID-Disk-Creation-Step-2.png" ><img class="size-medium wp-image-1621" title="NS4600 SS#5 - RAID Disk Creation Step 2" src="http://50.57.85.110/wp-content/uploads/2010/06/NS4600-SS5-RAID-Disk-Creation-Step-2-300x111.png" alt="NS4600 Disk Formatting" width="300" height="111" /></a><p class="wp-caption-text">NS4600 Disk Formatting</p></div>
<p>The NS4600 has the concept of spares as opposed to free (unused) disks.  A hot spare can be used to dynamically rebuild a failed RAID group.  The screenshot shows a rebuild in place for a failed drive.  In this instance I&#8217;d left drive 4 as the hot spare and pulled drive 1 to simulate a failure.  The NS4600 automatically kicks off the rebuild and spare drive becomes part of the RAID group.  Now we see the second issue with the use of traditional RAID systems.  I simulated this RAID failure on RAID groups containing no data, yet the rebuild took hours due to the nature of the rebuild &#8211; a physical</p>
<div id="attachment_1622" class="wp-caption alignleft" style="width: 310px"><a href="http://31.222.189.99/wp-content/uploads/2010/06/NS4600-SS9-RAID-Rebuild1.png" ><img class="size-medium wp-image-1622" title="NS4600 SS#9 - RAID Rebuild" src="http://50.57.85.110/wp-content/uploads/2010/06/NS4600-SS9-RAID-Rebuild1-300x112.png" alt="NS4600 Raid Rebuild" width="300" height="112" /></a><p class="wp-caption-text">NS4600 Raid Rebuild</p></div>
<p>drive recovery.  Contrast this with more progressive RAID systems where only the active data is copied, significantly reducing recovery times.  Of course the trade-off here is whether in a home system you would be rebuilding on a regular basis.  Chances are you wouldn&#8217;t but if a failure did occur, you would want it to occur as quickly as possible.  For the record, the total rebuild of a 2TB drive in a mirrored RAID-1 pair took 12 hours to complete with no other workload on the device.</p>
<p>In the next post I&#8217;ll look at the logical level of file systems and iSCSI LUNs.  Comments always welcome as usual.</p>
<p><em><strong>Disclaimer: Promise have provided me with two NS4600 devices  for this review.  These devices will be returned at the end of this  period.  This is an independent review and has not been sponsored or  paid for by Promise.</strong></em></p>
<p><em><strong>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss9-raid-rebuild/' title='NS4600 SS#9 - RAID Rebuild'><img width="150" height="56" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS9-RAID-Rebuild.png" class="attachment-thumbnail" alt="RAID Rebuild on NS4600" title="NS4600 SS#9 - RAID Rebuild" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss3-free-disks/' title='NS4600 SS#3 - Free Disks'><img width="150" height="66" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS3-Free-Disks.png" class="attachment-thumbnail" alt="NS4600 - Free Disks - Unconfigured Volumes" title="NS4600 SS#3 - Free Disks" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss10-multiple-volumes/' title='NS4600 SS#10 - Multiple Volumes'><img width="150" height="54" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS10-Multiple-Volumes.png" class="attachment-thumbnail" alt="NS4600 Multiple Volumes" title="NS4600 SS#10 - Multiple Volumes" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss5-raid-disk-creation-step-2/' title='NS4600 SS#5 - RAID Disk Creation Step 2'><img width="150" height="55" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS5-RAID-Disk-Creation-Step-2.png" class="attachment-thumbnail" alt="NS4600 Disk Formatting" title="NS4600 SS#5 - RAID Disk Creation Step 2" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss9-raid-rebuild-2/' title='NS4600 SS#9 - RAID Rebuild'><img width="150" height="56" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS9-RAID-Rebuild1.png" class="attachment-thumbnail" alt="NS4600 Raid Rebuild" title="NS4600 SS#9 - RAID Rebuild" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss4-raid-disk-creation-step-1/' title='NS4600 SS#4 - RAID Disk Creation Step 1'><img width="150" height="60" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS4-RAID-Disk-Creation-Step-1.png" class="attachment-thumbnail" alt="NS4600 SS#4 - RAID Disk Creation Step 1" title="NS4600 SS#4 - RAID Disk Creation Step 1" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss2-file-system-status/' title='NS4600 SS#2 - File System Status'><img width="150" height="57" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS2-File-System-Status.png" class="attachment-thumbnail" alt="NS4600 SS#2 - File System Status" title="NS4600 SS#2 - File System Status" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss1-raid-status/' title='NS4600 SS#1 - RAID Status'><img width="150" height="60" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS1-RAID-Status.png" class="attachment-thumbnail" alt="NS4600 SS#1 - RAID Status" title="NS4600 SS#1 - RAID Status" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss8/' title='NS4600 SS#8'><img width="150" height="64" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS8.png" class="attachment-thumbnail" alt="NS4600 SS#8" title="NS4600 SS#8" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss7-raid-1-extension/' title='NS4600 SS#7 - RAID-1 Extension'><img width="150" height="66" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS7-RAID-1-Extension.png" class="attachment-thumbnail" alt="NS4600 SS#7 - RAID-1 Extension" title="NS4600 SS#7 - RAID-1 Extension" /></a>
<a href='http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/ns4600-ss6-raid-1-extension/' title='NS4600 SS#6 - RAID-1 Extension'><img width="150" height="65" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/NS4600-SS6-RAID-1-Extension.png" class="attachment-thumbnail" alt="NS4600 SS#6 - RAID-1 Extension" title="NS4600 SS#6 - RAID-1 Extension" /></a>
<br />
</strong></em></p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2010/07/01/hardware-review-promise-smartstor-ns4600-part-ii/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Hardware Review: Promise SmartStor NS4600 &#8211; Part I</title>
		<link>http://thestoragearchitect.com/2010/06/25/hardware-review-promise-smartstor-ns4600-part-i/</link>
		<comments>http://thestoragearchitect.com/2010/06/25/hardware-review-promise-smartstor-ns4600-part-i/#comments</comments>
		<pubDate>Fri, 25 Jun 2010 16:04:17 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Reviews]]></category>
		<category><![CDATA[hard drive]]></category>
		<category><![CDATA[NAS]]></category>
		<category><![CDATA[NS4600]]></category>
		<category><![CDATA[Promise]]></category>
		<category><![CDATA[Promise inc]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[SmartStor]]></category>
		<category><![CDATA[SOHO]]></category>

		<guid isPermaLink="false">http://www.thestoragearchitect.com/?p=1394</guid>
		<description><![CDATA[<p>This is a series of posts on the Promise SmartStor NS4600 home storage server.</p> Background <p><a href="http://www.promise.com/" target="_blank">Promise Technology Inc</a> have been manufacturing RAID controllers since 1988 and iSCSI storage systems since 2004.  In 2007, the company released the first of the SmartStor devices, the NS4300, a fully-functioned home NAS storage array.  That was followed [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>This is a series of posts on the Promise SmartStor NS4600 home storage server.</p>
<h3>Background</h3>
<p><a href="http://www.promise.com/"  target="_blank">Promise Technology Inc</a> have been manufacturing RAID controllers since 1988 and iSCSI storage systems since 2004.  In 2007, the company released the first of the SmartStor devices, the NS4300, a fully-functioned home NAS storage array.  That was followed up in 2009 with the second generation NS4600.  I must admit I&#8217;m not at all familiar with their products, however Promise have provided me two NS4600 units for a short term evaluation.  The home NAS server market has become pretty competitive, with lots of features built into todays&#8217; hardware.  What made the NS4600 interesting is the ability to backup between devices; something we will cover in these posts.  The ability to replicate to another unit will make it compelling for those advanced users who have large volumes of data to secure.</p>
<p>The NSx600 range are classed as high performance SOHO or home NAS devices and have the following specifications:</p>
<ul>
<li>Processor: Intel EP80579 600Mhz</li>
<li>Memory: 256MB DDRII</li>
<li>1x GbE port</li>
<li>1x eSATA port</li>
<li>2x USB ports</li>
<li>Four 3.5&#8243; SATA drive slots</li>
<li>Promise PDC42819 SATA RAID Controller</li>
</ul>
<p>The device itself is pretty solid despite feeling quite light without the drives installed.  In terms of design, the unit looks quite attractive, with blue LEDs on the front, showing drive and network activity.  Drives are installed behind a single door in a horizontal fashion and are required to be mounted in small caddies.  These are screwed to the drive itself and aren&#8217;t large and provide the runners for correct insertion.  Drives are hot-pluggable while the unit is running.</p>
<p>At the rear of the unit, there are the network connections and power socket for the integrated power supply.  There&#8217;s also a power button for turning the unit on and off.  For cooling, there&#8217;s an integrated fan;
<a href='http://thestoragearchitect.com/2010/06/25/hardware-review-promise-smartstor-ns4600-part-i/ds4600_front/' title='DS4600_front'><img width="150" height="90" src="http://thestoragearchitect.com/wp-content/uploads/2010/06/DS4600_front.jpg" class="attachment-thumbnail" alt="DS4600_front" title="DS4600_front" /></a>
</p>
<p>this kicks out a reasonable amount of heat when all four drive slots are occupied.  Although we&#8217;ll touch on software and management later, it&#8217;s worth mentioning here that the device (or enclosure as it is described) can be monitored through the built-in web server interface.  Screen shots are included at the end of this post and show the power, fan and temperature metrics being tracked.  I like this feature; it provides that extra level of information needed when doing problem determination.</p>
<p>Overall, the the NS4600 hardware is pretty cool.  In future posts, we&#8217;ll discuss software features and management.</p>
<p><em><strong>Disclaimer: Promise have provided me with two NS4600 devices for this review.  These devices will be returned at the end of this period.  This is an independent review and has not been sponsored or paid for by Promise.</strong></em></p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2010/06/25/hardware-review-promise-smartstor-ns4600-part-i/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Personal Computing: The End of The Parallel Interface</title>
		<link>http://thestoragearchitect.com/2009/12/30/personal-computing-the-end-of-the-parallel-interface/</link>
		<comments>http://thestoragearchitect.com/2009/12/30/personal-computing-the-end-of-the-parallel-interface/#comments</comments>
		<pubDate>Wed, 30 Dec 2009 18:15:31 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[hard drive]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[recycling]]></category>
		<category><![CDATA[Seagate Barracuda]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.com/?p=978</guid>
		<description><![CDATA[This is the time of year I like to do a little early spring cleaning and one group of storage devices due a clean out are my old hard drives.<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>This is the time of year I like to do a little early spring cleaning and one group of storage devices due a clean out are my old <strong>hard drives</strong>.  After many faithful years, I&#8217;m finally saying goodbye to all of my <strong>parallel ATA</strong> devices. </p>
<p>One the one hand, I hate throwing things out.  I like to use technology to the absolute limit, but eventually some things outlive their usefulness and so it has come to pass for PATA hard disks.  Unfortunately the <strong>maths</strong> and <strong>environmentals</strong> no longer stack up.</p>
<p><strong>Capacity and Power</strong></p>
<p>I&#8217;m disposing of <strong>twelve (12)</strong> 3.5&#8243; drives that have a total capacity of only <strong>344GB</strong>.  That&#8217;s a mere 28GB per HDD.  The lowest capacity drive is only <strong>1280MB</strong> in size.  Contrast that to the latest hard drive I&#8217;ve just purchased.  It&#8217;s (another) 2TB model &#8211; the Seagate Barracuda <strong>ST32000542AS</strong>.  This drive offers <strong>five (5) times</strong> more capacity for only <strong>1/12</strong> of the physical space and power requirements.  With these kind of gains in efficiency, why retain old drives any further? </p>
<p><strong>RAID and Speed</strong></p>
<p>Now of course, multiple drives have one or two benefits over a single hard drive; firstly they can be protected using <strong>RAID</strong> and second, multiple spindles provide improved <strong>throughput</strong>.  However the cost of 1TB and 2TB drives is now so <strong>low</strong>, they can be purchased in multiples reasonably cheaply and the RAID benefits retained.</p>
<p><strong>Recycling</strong></p>
<p>As a final thought, I think we should be doing more to recycle old hard drives.  After all, we recycle <strong>mundane </strong>components such as printer ink cartridges, batteries and so on.  Perhaps we could see a slot for hard disks at the local recycling centre?</p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2009/12/30/personal-computing-the-end-of-the-parallel-interface/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Personal Computing: Drobo Replacement</title>
		<link>http://thestoragearchitect.com/2009/11/20/personal-computing-drobo-replacement/</link>
		<comments>http://thestoragearchitect.com/2009/11/20/personal-computing-drobo-replacement/#comments</comments>
		<pubDate>Fri, 20 Nov 2009 17:13:27 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[Drive Replacement]]></category>
		<category><![CDATA[Drobo]]></category>
		<category><![CDATA[RAID]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.com/?p=910</guid>
		<description><![CDATA[<p>Following on from my <a href="http://thestoragearchitect.com/2009/10/19/personal-computing-drobo-weirdness/" >Personal Computing: Drobo Wierdness</a>, I have received a replacement unit from Data Robotics.  I thought for a change I&#8217;d produce a video and so here is my attempt to swap out my drives into the new unit.  In the end I formatted the existing drives as four new NAS shares.  [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>Following on from my <a href="http://thestoragearchitect.com/2009/10/19/personal-computing-drobo-weirdness/" >Personal Computing: Drobo Wierdness</a>, I have received a replacement unit from Data Robotics.  I thought for a change I&#8217;d produce a video and so here is my attempt to swap out my drives into the new unit.  In the end I formatted the existing drives as four new NAS shares.  So the unit is back and working. Enjoy.</p>
<p><a href="http://thestoragearchitect.com/2009/11/20/personal-computing-drobo-replacement/"><em>Click here to view the embedded video.</em></a></p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2009/11/20/personal-computing-drobo-replacement/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Personal Computing: Drobo Jitters</title>
		<link>http://thestoragearchitect.com/2009/09/14/personal-computing-drobo-jitters/</link>
		<comments>http://thestoragearchitect.com/2009/09/14/personal-computing-drobo-jitters/#comments</comments>
		<pubDate>Mon, 14 Sep 2009 20:44:18 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[Drobo]]></category>
		<category><![CDATA[file protection]]></category>
		<category><![CDATA[RAID]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.com/?p=728</guid>
		<description><![CDATA[<p>As regular readers will know, I&#8217;ve had a <a href="http://www.drobo.com/index.php" >Drobo</a> storage system for some time now.  I&#8217;ve not had a bad word to say about the product &#8211; it just works and does what I expect it to do.  However, there&#8217;s one function I think could do with some improvement and it&#8217;s the one [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>As regular readers will know, I&#8217;ve had a <a href="http://www.drobo.com/index.php" >Drobo</a> storage system for some time now.  I&#8217;ve not had a bad word to say about the product &#8211; <strong>it just works</strong> and does what I expect it to do.  However, there&#8217;s one function I think could do with some improvement and it&#8217;s the one thing that scares me most &#8211; <strong>removing drives</strong>.</p>
<p><img class="alignright size-full wp-image-730" title="DroboRemove" src="http://thestoragearchitect.files.wordpress.com/2009/09/droboremove1.png" alt="DroboRemove" width="558" height="472" />I&#8217;ve been reviewing <strong>2TB drives</strong> recently and now they&#8217;re destined to be used in the Drobo to expand my storage footprint.  This means I&#8217;ve been swapping drives in and out of the Drobo as I test them on another PC.  The process for drive removal is quite simple &#8211; just pull it out.  That&#8217;s where the problem lies. </p>
<p>I&#8217;m never keen about just removing a drive and letting the system reconfigure itself.  I see that as a &#8220;hard&#8221; failure which could have been avoided.  Before drive removal, surely it&#8217;s best to logically &#8220;evacuate&#8221; a drive and move the contents in a controlled fashion?  Physical removal must be causing the Drobo to perform rebuilds on data, which means if the other drives have faulty copies, then my data&#8217;s lost.</p>
<p>Have a look at the picture; you can see I&#8217;ve removed a drive and the Drobo is in a state of rebuild.  Fortunately the removed drive only had a 500GB capacity and hadn&#8217;t been in the device long.  Reconstruction of RAID only took about 7 minutes and I was protected again.  However I have swapped drives before and experienced a 13 hour rebuild, during which time data was unprotected. </p>
<p>Maybe I&#8217;m being paranoid, but Drobo guys, please give us a controlled way to remove a drive without forcing a rebuild.  It will help me sleep better at night!</p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2009/09/14/personal-computing-drobo-jitters/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Enterprise Computing: RAID Is Not Enough</title>
		<link>http://thestoragearchitect.com/2009/01/05/enterprise-computing-raid-is-not-enough/</link>
		<comments>http://thestoragearchitect.com/2009/01/05/enterprise-computing-raid-is-not-enough/#comments</comments>
		<pubDate>Mon, 05 Jan 2009 20:18:00 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[Journalspace]]></category>
		<category><![CDATA[RAID]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.wordpress.com/2009/01/05/enterprise-computing-raid-is-not-enough/</guid>
		<description><![CDATA[<p><a rel="nofollow" href="http://2.bp.blogspot.com/_b1B7GuxiR0o/SWJvvM3V9pI/AAAAAAAAAL8/fEaN2T42wJs/s1600-h/jamster-simpsons-homer.jpg" ></a> Happy New Year and welcome back to all my readers! <p> I&#8217;ve been messing about with some old hard drives this week and unusually for me, one is sounding decidedly sickly. I&#8217;ve never had a personal hard drive go on me (I guess I always upgrade/move on before it happens), but [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p><a rel="nofollow" href="http://2.bp.blogspot.com/_b1B7GuxiR0o/SWJvvM3V9pI/AAAAAAAAAL8/fEaN2T42wJs/s1600-h/jamster-simpsons-homer.jpg" ><img style="float:left;width:270px;cursor:hand;height:270px;margin:0 10px 10px 0;" alt="" src="http://2.bp.blogspot.com/_b1B7GuxiR0o/SWJvvM3V9pI/AAAAAAAAAL8/fEaN2T42wJs/s320/jamster-simpsons-homer.jpg" border="0" /></a>
<div>Happy New Year and welcome back to all my readers! </div>
<div> </div>
<div> <img src='http://thestoragearchitect.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </div>
<div></div>
<p>
<div>I&#8217;ve been messing about with some old hard drives this week and unusually for me, one is sounding decidedly sickly. I&#8217;ve never had a personal hard drive go on me (I guess I always upgrade/move on before it happens), but rest assured I&#8217;ve had plenty &#8220;fail&#8221; in the Enterprise arena. Usually those failures are pre-emptive microcode soft-fails and the array seamlessly rebuilds onto another spare device and no data is lost.</div>
<p>
<div></div>
<p>
<div>Pity poor <a href="http://journalspace.com/this_is_the_way_the_world_ends/not_with_a_bang_but_a_whimper.html" >JournalSpace</a> who managed to total their business this week by relying purely on RAID within their main database server.</div>
<p>
<div></div>
<p>
<div>The loss of the data is not clear &#8211; the server had a RAID-1 configuration; follow the link and have a read, but I quote:</div>
<p>
<div></div>
<p>
<div><span style="color:#000099;">&#8220;There was no hardware failure. Both drives are operating fine; DriveSavers had no problem in making images of the drives. The data was simply gone. Overwritten.&#8221;</span></div>
<p>
<div></div>
<p>
<div>Now RAID is a great technology for recovering from physical drive failure and that is all it is &#8211; a mechanism to reduce the risk of data loss from failure of a hard drive. It is not a solution for managing data correctly. In this instance Journalspace must have suffered from the other things all good storage admins think (worry) about;</div>
<p>
<div></div>
<p>
<ul>
<li>Sabotage</li>
<p>
<li>Server failure</li>
<p>
<li>Catastrophic array failure</li>
<p>
<li>Software bug</li>
<p>
<li>Site failure</li>
<p>
<li>User stupidity</li>
</ul>
<p>
<p>If data is the lifeblood of your organisation then you *must* replicate it onto another online copy or at least onto a backup and have multiple copies in multiple locations.</p>
<p>
<p>If anyone out there is not sure they&#8217;re protecting their data properly &#8211; then give me a call!</p>
<div class="blogger-post-footer">
<p>_uacct = &#8220;UA-1104321-2&#8243;;<br />
urchinTracker();
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2009/01/05/enterprise-computing-raid-is-not-enough/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Understanding EVA &#8211; revisited</title>
		<link>http://thestoragearchitect.com/2008/10/29/understanding-eva-revisited/</link>
		<comments>http://thestoragearchitect.com/2008/10/29/understanding-eva-revisited/#comments</comments>
		<pubDate>Wed, 29 Oct 2008 13:00:00 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[3par]]></category>
		<category><![CDATA[double disk failure]]></category>
		<category><![CDATA[HP EVA]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[xiv]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.wordpress.com/2008/10/29/understanding-eva-revisited/</guid>
		<description><![CDATA[<p>Thanks to all those who posted in response to <a rel="nofollow" href="http://storagearchitect.blogspot.com/2008/10/understanding-eva.html" >Understanding EVA</a> earlier this week, especially Cleanur who added a lot of detail. Based on the additional knowledge, I&#8217;d summarise again: <p> EVA disks are placed in groups &#8211; usually recommended to be one single group unless there&#8217;s a compelling reason not to [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>Thanks to all those who posted in response to <a rel="nofollow" href="http://storagearchitect.blogspot.com/2008/10/understanding-eva.html" >Understanding EVA</a> earlier this week, especially Cleanur who added a lot of detail.  Based on the additional knowledge, I&#8217;d summarise again:
<div></div>
<p>
<ul>
<li>EVA disks are placed in groups &#8211; usually recommended to be one single group unless there&#8217;s a compelling reason not to (like different disk types e.g. FC/FATA).</li>
<p>
<li>Disk groups are logically divided into Redundancy Storage Sets, which can be from 6-11 disks in size, depending on the number of disks in the group, but ideally 8 drives.</li>
<p>
<li>Virtual LUNs are created across all disks in a group, however to minimise the risk of data loss from disk failure, equal slices of LUNs (called PSEGs) are created in each RSS with additional parity to recreate the data within the RSS if a disk failure occurs.  PSEGs are 2MB in size.</li>
<p>
<li>In the event of a drive failure, data is moved dynamically/automagically to spare space reserved on each remaining disk.</li>
</ul>
<p><a rel="nofollow" href="http://3.bp.blogspot.com/_b1B7GuxiR0o/SQheZXxvOxI/AAAAAAAAAGQ/eiBh4MfHJCo/s1600-h/Understanding+EVA-1.jpg" ><img style="float:right;width:320px;cursor:hand;height:130px;margin:0 0 10px 10px;" alt="" src="http://3.bp.blogspot.com/_b1B7GuxiR0o/SQheZXxvOxI/AAAAAAAAAGQ/eiBh4MfHJCo/s320/Understanding+EVA-1.jpg" border="0" /></a>
<p>I&#8217;ve created a new diagram to show this relationship. The vRAID1 devices are pretty much as before, although now numbered as 1-1 &amp; 1-2 to show the two mirrors of each PSEG. For vRAID5, there are 4 data and 1 parity PSEG, which initially hits RSS1, then RSS2 then back to RSS1 again. I haven&#8217;t shown it, but presumably the EVA does a calculation to ensure that the data resides evenly on each disk. </p>
<p>So here&#8217;s some maths on the numbers.  There are many good links worth reading; try <a href="http://pages.cs.wisc.edu/~sschang/OS-Qual/reliability/RAID.htm" >here</a> and <a href="http://www.tcs.hut.fi/~hhk/phd/phd_Hannu_H_Kari.pdf" >here</a>.  I&#8217;ve taken the simplest formula and churned the numbers on a 168-drive array with a realistic MTBF (mean time before failure) of 100,000 hours.  Before people leap in and quote the manufacturers numbers that Seagate et al provide, which are higher figures, remember arrays will predictively fail a drive and in any case with temperature variation, heavy workload, manufacturing defects etc, the probability is lower than manufacturing figures (as Google <a rel="nofollow" href="http://research.google.com/archive/disk_failures.pdf" >have</a> already pointed out).  </p>
<p>I&#8217;ve also assumed a repair (i.e. replace) time of 8 hours, which seems reasonable for arrays unattended overnight.  If disks are not grouped, then the MTTDL (mean time to data loss) is about 44553 hours, or just over five years.  This is for a single array &#8211; imagine if you had 70-80 of them &#8211; the risk would be increased.  Now, with the disks in groups of 8 (meaning that data will be written across only 8 disks at a time), the double disk failure becomes 1,062,925 hours or just over 121 years.  This is without any parity.</p>
<p>Clearly grouping disks into RSSs does improve things and quite considerably so, even if no parity is implemented, so thumbs up to RSSs from a mathematical perspective.  However if a double disk failure does occur then every LUN in the disk group is impacted as data is spread across the whole disk group.  So it&#8217;s a case of very low probability, very high impact.</p>
<p>Mark &amp; Marc commented on 3Par&#8217;s implementation being similar to EVA.  I think XIV sounds similar too.  I&#8217;ll do more investigation on this as I&#8217;d like to understand the implications of double disk failures on all array types.  </p>
<div class="blogger-post-footer">
<p>_uacct = &#8220;UA-1104321-2&#8243;;<br />
urchinTracker();
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2008/10/29/understanding-eva-revisited/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Two for the price of one</title>
		<link>http://thestoragearchitect.com/2008/01/03/two-for-the-price-of-one/</link>
		<comments>http://thestoragearchitect.com/2008/01/03/two-for-the-price-of-one/#comments</comments>
		<pubDate>Thu, 03 Jan 2008 18:24:00 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[Onaro]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[sanscreen]]></category>
		<category><![CDATA[Tony Pearson]]></category>
		<category><![CDATA[xiv]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.wordpress.com/2008/01/03/two-for-the-price-of-one/</guid>
		<description><![CDATA[<p>The holidays are over and it&#8217;s back to work for me. In fact I returned yesterday; the break was good however it is also good to be back.</p> <p>It seems that I&#8217;ve returned to a flurry of acquisitions. Yesterday there was the heavily reported (on the blogosphere) purchase of <a href="http://www.xivstorage.com/" >XiV</a> by IBM. Tony [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>The holidays are over and it&#8217;s back to work for me. In fact I returned yesterday; the break was good however it is also good to be back.</p>
<p>It seems that I&#8217;ve returned to a flurry of acquisitions. Yesterday there was the heavily reported (on the blogosphere) purchase of <a href="http://www.xivstorage.com/" >XiV</a> by IBM. Tony Pearson gives a summary of the features on his <a href="http://www.ibm.com/developerworks/blogs/page/InsideSystemStorage?entry=ibm_acquires_xiv" >post</a>. One thing that interests me is the use of distributed writes across an entire array by creating 1MB blocks from (presumably) LUNs and filesystems. If a drive fails, then the data is still available on other disks in the system and spread across a great number rather than a single drive (RAID-1) or potentially a small number of drives (RAID5/6).</p>
<p>I&#8217;ve been trying to get my head around what this means. On the one hand it sounds like a real problem, as a double drive failure could impact a wide number of hosts; it all depends on how well the 1MB chunks are distributed. However maybe it isn&#8217;t that much of a problem as the issue only arises when both of the chunks that mirror a 1MB block both occur on failing drives. I would expect that as the number of physical drives increases then the impact of double failure reduces, as does the number of 1MB blocks affected. In addition, a drive may fail only in one area rather than on the whole device, so the affected blocks could be quite small; the remainder could be perfectly readable and be quickly moved. No doubt Moshe and the team have done the maths to know what the risk is and compared it to that of standard arrays and wouldn&#8217;t be selling the product if it was not inherently more safe.</p>
<p>The only other issue I can see is what market the product will slot into; Tony mentions that the product is not for structured data (although I guess it supports it) but was designed for unstructured data of large binary file types. So, why use RAID-1 compared to say a 14+2 RAID-6 configuration which would be much cheaper in terms of the disk cost? Presumably another selling point is performance, but I would expect the target data profile (medical, large binary objects) to be more sequential than random access and not be that impacted by using SATA.</p>
<p>I guess only time will tell. I look forward to seeing how things go.</p>
<p>The other purchase announced today was that of <a href="http://www.onaro.com/" >Onaro</a> by <a href="http://www.netapp.com/" >Netapp</a>. Onaro sell SANScreen, a tool to collect and analyse fibre channel SANs and to highlight configuration issues. Whilst I think it is a good product, I don&#8217;t see the fit with Netapp&#8217;s business in the NAS market (in fact I&#8217;m sure SANScreen doesn&#8217;t currently support NAS), so where&#8217;s the benefit here other than buying up a company which must be close to or is making money.</p>
<p>I wonder who will be bought tomorrow?
<div class="blogger-post-footer">
<p>_uacct = &#8220;UA-1104321-2&#8243;;<br />
urchinTracker();
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2008/01/03/two-for-the-price-of-one/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Giving RAID the thumbs up</title>
		<link>http://thestoragearchitect.com/2007/04/04/giving-raid-the-thumbs-up/</link>
		<comments>http://thestoragearchitect.com/2007/04/04/giving-raid-the-thumbs-up/#comments</comments>
		<pubDate>Wed, 04 Apr 2007 19:32:00 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[backup]]></category>
		<category><![CDATA[data storage]]></category>
		<category><![CDATA[RAID]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.wordpress.com/2007/04/04/giving-raid-the-thumbs-up/</guid>
		<description><![CDATA[<p>Just read Robin Harris&#8217; post at his new blog location; <a href="http://blogs.zdnet.com/storage/?p=116" >http://blogs.zdnet.com/storage/?p=116</a> and his comment on another blog discussing RAID. He quotes a VAR who has tracked disk failures and thinks RAID is an expensive luxury for desktops. </p> <p>It&#8217;s interesting to see the failure rates quoted, anywhere from 1-3%, which on the face [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>Just read Robin Harris&#8217; post at his new blog location; <a href="http://blogs.zdnet.com/storage/?p=116" >http://blogs.zdnet.com/storage/?p=116</a> and his comment on another blog discussing RAID.  He quotes a VAR who has tracked disk failures and thinks RAID is an expensive luxury for desktops. </p>
<p>It&#8217;s interesting to see the failure rates quoted, anywhere from 1-3%, which on the face of it, seems low.  However when its *your* disk that has failed and the data is irretrievable, there&#8217;s cold comfort to be had in failure rate statistics.  I run RAID1 on my server; I have two 500GB SATA drives.  Backing up that volume of data on a regular basis is a nightware without investing in a very expensive backup solution like LTO and it is a real disappointment to see tape hasn&#8217;t kept pace with disk in terms of the capacity/cost ratio.</p>
<p>So, I&#8217;m sticking with RAID.  I augment it with disk-to-disk backups because, yes, you do have to cater for the d&#8217;oh factor of user errors or even dodgy software which corrupts files too, but RAID works for me and that&#8217;s all I need to worry about.
<div class="blogger-post-footer">
<p>_uacct = &#8220;UA-1104321-2&#8243;;<br />
urchinTracker();
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2007/04/04/giving-raid-the-thumbs-up/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

