Enterprise Computing: The Benefits of Wide Striping – Avoiding A Long Tail
I took part in a podcast last night that discussed the XIV platform. One of the “key features” of XIV is the wide striping of data across all spindles. It’s a concept we’re seeing more and more in contemporary storage hardware architectures and one that’s being shoe-horned into older storage arrays too. Have you ever wondered what the point is? Take a look at the following graphic. It shows the number of write operations per RAID group, ordered by the busiest RAID group to the least active. It’s real data from a real system. What you see is the Long Tail effect, where a small number of RAID groups are doing most of the I/O. In this example, 80% of the workload is performed by 50% of the RAID groups; only 3 RAID groups account for 20% of the workload.
The chart shows that in some array designs (typically the older Enterprise arrays), I/O distribution was not evenly balanced and so not all drives were being used to their full capacity. This was mitigated by using tools to move LUNs or sub-LUNs around; alternatively concatenated devices like metas and LUSEs were employed to spread the load.
The only real solution to the I/O balancing problem is genuine wide striping. Manual or even automated rebalancing, or the use of metas are just workarounds. Once wide striping is in place, either more work can be performed or the number of spindles or their “quality” can be reduced, i.e. you can build a complete SATA array like XIV.
There are of course disadvantages to having your data more widely spread. The most obvious is the increased risk of data loss when the RAID system fails – i.e. a double disk failure. The wider the striping, the wider the impact. The tradeoff is the benefit of increased performance. You have to choose what level of risk/impact you consider acceptable versus the potential gains.
If you’re not doing wide striping today then you should seriously be considering it. After all, you’re only harnessing performance capacity within the array that you’ve already paid for.
10 Responses to Enterprise Computing: The Benefits of Wide Striping – Avoiding A Long Tail
Leave a Reply Cancel reply
You must be logged in to post a comment.
- Use Symantec and know your sensitive data is protected with industry-leading backup & recovery software.
Experience Symantec Backup Software
Popular Posts
- Netapp: The Inflexibility of Flexvols (3590)
- Back to Blogging (2157)
- The technical solution is not always the best (1897)
- Data ONTAP 8.0 – Part III (1719)
- EMC Releases All Flash VNX (1693)
- Solid State Arrays: Pure Storage Inc (1693)
- Enterprise Computing: Why Thin Provisioning Is Not The Holy Grail for Utilisation (1429)
- Who Will Be The First Solid State Array Vendor To Be Acquired? (1423)
- Drive Prices Increase – Who Will Suffer Most? (1395)
- VAAI Follow Up – VMware Recommend Disabling Thin Reclaim (1314)









Imagine if you could wide stripe using a mix of drive types instead of just SATA like XIV. Now imagine that your array could make sure the writes used only the fastest disks (FC 15K or SSD)in the stripe and moved older, infrequently accessed data (reads) to the slower, more cost efficient SATA drives?
You’d be a Compellent Storage Center customer!
Chris,
Can you comment on the sun open storage use of wide stripes.
From the whitepaper :
https://www.sun.com/offers/docs/Unified_Storage_Systems_Architecture.pdf
Page:15
Triple parity RAID, wide stripes — a RAID configuration where each stripe has three
disks for parity, and where wide stripes are configured to maximize capacity. This
configuration will yield high capacity, and high availability as data will remain
available even after sustaining three disk failures. The availability and capacity are
delivered at the expense of performance as this mode requires more calculations
than double parity RAID. Also, while bandwidth will be acceptable in this wide stripe
configuration, the number of I/O operations that the entire system can perform will
be diminished. As with other RAID configurations,
It is simple math, a 15K drive will generate sustained IOPS of about 170, a SATA drive about 70. If you create a “pool” of these drives and increase the number of spindles working with each IO, the IOPS increase with each drive used. We have been building these powerful performance pools for customers for over 5 years.
Fast, Simple, and Efficient – that is how a Compellent system was designed.
Paul Clifford
http://www.DavenportGroup.com
[...] The Storage Architect » Blog Archive » Enterprise Computing: The … [...]
[...] The Storage Architect » Blog Archive » Enterprise Computing: The … [...]
[...] The Storage Architect » Blog Archive » Enterprise Computing: The … [...]
[...] The Storage Architect » Blog Archive » Enterprise Computing: The … [...]
@ Paul:
Paul, how do you come to the conclusion that a SATA drive will do 70 IOPs only ?
As discussed some time ago in this blog space already, I think a FC or SAS Drive can be estimated at roughly 185-200 IOPs, and a SATA Drive at about 100…
Apart from that: do you Compellent Guys do anything else that sailing in XIV’s wake in blogs and forums ?
(Don’t get me wrong – I think both technologies are well worth a closer look…)
@Yves;
Yves maybe I can help out. You’re right technically you can run the disks to a much higher IOPs level. However you’ll begin to see a substantial increase in latency. This trade off of squeezing very high IOPS out of drives can wreak havoc on database/application systems.
I agree with Paul the IOPs numbers to use for production configuration design are 70-80 for SATA and 170-180 for FC.
My clients love their Compellent SANs because they deliver with out surprises.
Joel Carlson
storage.firsttech.com
Don’t you understand that this is the best time to receive the mortgage loans, which can realize your dreams.