<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Storage Architect &#187; Optimizer</title>
	<atom:link href="http://thestoragearchitect.com/tag/optimizer/feed/" rel="self" type="application/rss+xml" />
	<link>http://thestoragearchitect.com</link>
	<description>Storage, Virtualisation &#38; Cloud</description>
	<lastBuildDate>Wed, 23 May 2012 22:20:49 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Enterprise Computing: Do We Need FAST v1, EMC?</title>
		<link>http://thestoragearchitect.com/2009/10/18/enterprise-computing-do-we-need-fast-v1-emc/</link>
		<comments>http://thestoragearchitect.com/2009/10/18/enterprise-computing-do-we-need-fast-v1-emc/#comments</comments>
		<pubDate>Sun, 18 Oct 2009 09:50:45 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[GestaltIT]]></category>
		<category><![CDATA[barry whyte]]></category>
		<category><![CDATA[binfile]]></category>
		<category><![CDATA[Compellent]]></category>
		<category><![CDATA[Data Progression]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[FAST]]></category>
		<category><![CDATA[HiCommand Tiered Storage Manager]]></category>
		<category><![CDATA[hitachi]]></category>
		<category><![CDATA[hyper]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[Optimizer]]></category>
		<category><![CDATA[Storage Tiering 1.0]]></category>
		<category><![CDATA[Storage Tiering 2.0]]></category>
		<category><![CDATA[Symmetrix]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.com/?p=772</guid>
		<description><![CDATA[<p>So, here&#8217;s my rash statement from Twitter last night: &#8220;If FAST isn&#8217;t free, I don&#8217;t want it!  All it&#8217;s doing is automating process I could script/do manually&#8221;.  It&#8217;s a bold statement, I know, so is FAST really offering something better than what could be achieved today using EMC&#8217;s <a href="http://uk.emc.com/products/detail/software/symmetrix-optimizer.htm" >Symmetrix Optimizer</a>?</p> <p>Hot Spots</p> <p>EMC&#8217;s [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>So, here&#8217;s my rash statement from Twitter last night: <em><span style="color:#0000ff;">&#8220;If FAST isn&#8217;t free, I don&#8217;t want it!  All it&#8217;s doing is automating process I could script/do manually&#8221;</span></em>.  It&#8217;s a bold statement, I know, so is <strong>FAST</strong> really offering something better than what could be achieved today using EMC&#8217;s <a href="http://uk.emc.com/products/detail/software/symmetrix-optimizer.htm" >Symmetrix Optimizer</a>?</p>
<p><strong>Hot Spots</strong></p>
<p>EMC&#8217;s Symmetrix architecture (18 years old and counting, I believe) uses the concept of disk <strong>hypers</strong> to present LUNs.  Each physical disk is carved into a number of slices, which are then recombined to create LUNs to present to a host.  A mirrored (RAID-1) LUN uses two hypers, a RAID-5 (3+1) LUN uses 4.  EMC ensure general performance by setting standards on how LUNs are created from hypers and that&#8217;s reflected in a <strong>&#8220;binfile&#8221;</strong> layout.  However despite this sensible planning, it is possible (especially as hard drives are now much larger and contain many more hypers) that two hypers on a single physical disk could be highly active and so contend against each other &#8211; in other words <strong>&#8220;hot spots&#8221;</strong> on disk.</p>
<p>Optimizer helps alleviate the issue of hot spots by <strong>exchanging</strong> the high I/O hypers with low I/O ones, distributing busy LUNs across more physical spindles.  This is classic load balancing where resources are distributed across the available infrastructure in order to obtain better overall generic performance.  EMC have now rebranded Optimizer as part of <strong>Ionix</strong> for Storage Resource Managment, but it&#8217;s still effectively the same product.  Hyper swaps can be managed automatically, based on historical performance data.  They can also be user-defined &#8211; a manual swap at the users request.</p>
<p>Although tedious (and not as well automated as Hitachi&#8217;s HiCommand Tiered Storage Manager), in theory Optimizer could be used to manually move workload between storage tiers.  In fact, Optimizer is already aware of a tiered storage infrastructure.  Here&#8217;s a quote directly from the ControlCenter 6.1 manual:</p>
<blockquote><p><span style="color:#0000ff;">&#8220;Optimizer is also aware of physical drives that operate at different speeds, as well as location of the data on the physical media, which influences the I/O rate. This information is used when determining which logical devices to move.&#8221;</span></p></blockquote>
<p>So with a little bit of knowledge on the layout of data on a Symmetrix array, it would be possible today to use Optimizer to perform LUN-based FAST.</p>
<p><strong>Load-Balancing Versus Policy</strong></p>
<p>Unfortunately, simple load-balancing of I/O across a storage array doesn&#8217;t offer what should be seen as the next generation of storage tiering.  Where <strong>Storage Tiering 1.0</strong> was about offering multiple layers of storage within the same physical infrastructure and manually placing or moving LUNs to the appropriate tier, <strong>Storage Tiering 2.0</strong> will be about establishing policies that determine more service-based measurements of the performance and availability customers receive. </p>
<p>A policy-based approach would allow rules to be established on how <strong>data at the application layer</strong> moves between tiers.  This is a critical distinction from the load-balancing  methodology earlier described.  As an example, where an application was known to require higher performance at a certain time of day or day of the week, data could be moved proactively to a faster tier of storage, returning later once the high I/O workload had completed.  Whilst achievable using Optimizer, there&#8217;s no doubt the process of application migration would be tedious and time consuming.  I expect the v1.0 implementation of FAST will simply package up Optimizer into a tool that automates the migration of related data between tiers.  Don&#8217;t forget, other vendors have been <strong>offering this feature for some time</strong> &#8211; for example Hitachi and Tiered Storage Manager.</p>
<p><strong>Increasing Granularity</strong></p>
<p>Now LUN-based migration has its benefits.  Where large numbers of disks exist in an infrastructure, application data can be placed or moved to the most appropriate location as required.  However with the introduction of <strong>solid state disks</strong> (SSDs), a more granular approach is needed as the number of SSDs deployed in an array is likely to be low due to their excessive cost.  Moving an entire application (or even LUN) to SSD will be undesirable unless that application can take full use of the SSD hardware.  There are <strong>very few</strong>, if any, applications that require high-intensity read/write activity from every piece of application data all the time.</p>
<p>Block-level tiering offers a higher level of granularity to the placement of data.  A LUN can be split into blocks and placed across multiple layers of storage technology including traditional HDDs and faster SSDs.  Selective placement will ensure the more efficient use of expensive SSD media by placing only the highly active data onto it.</p>
<p>All of a sudden with increased granularity we&#8217;re back to Storage Tiering 1.0 where data is being placed on faster technology purely based on <strong>increasing overall system performance</strong>.  This is a feature <a href="http://www.compellent.com" >Compellent</a> have been offering for some time.  Data is migrated up or down the tier hierarchy on a <strong>daily basis</strong>, subject to performance figures over a 12-day period.  This level of granular performance management is possible because data is stored in a block-based structure.  Unfortunately for EMC, the <strong>hyper design legacy</strong> represents a technical challenge in making FAST version 2 a reality. </p>
<p><strong>Patent Rights</strong></p>
<p>As just mentioned, Compellent already offer block-based data migration in their products.  At a recent dinner in London with the Compellent team, they highlighted their strong position in the market, protected by patents covering block-level data migration between tiers.  You can find the filed patent <a href="http://www.patentstorm.us/patents/7398418/fulltext.html" >here</a>.  Compellent use the term <strong>&#8220;Data Progression&#8221;</strong> to describe how blocks are moved between tiers based on I/O activity.  As I/O activity is monitored over time, it is possible to determine the most appropriate tier of storage to use when expanding capacity.  Typically these are lower tier SATA drives, as initial performance requirements are usually over-estimated.  This metholodogy is very much Storage Tiering 1.0 discussed earlier.</p>
<p>Compellent aren&#8217;t the only people claiming rights to block-level tiering within a storage array.  I&#8217;ve also found the following <a href="http://www.patentstorm.us/patents/7421556/fulltext.html" >patent application</a> from <strong>IBM</strong>, filed by Barry Whyte, Steve Legg and others.  If IBM and Compellent both claim to have invented the FAST concept, how does that position EMC?  Do they have an earlier patent which trumps these two?</p>
<p><strong>Summary</strong></p>
<p>Storage Tiering 1.0 provides performance management of storage arrays.  Storage Tiering 2.0 extends this to offer policy-driven optimisation offerings.  Both of these technologies are available today from existing vendors in one format or another.  EMC will simply be playing catchup with these vendors once FAST 1 &amp; FAST 2 are released.  I&#8217;d like to be surprised and see EMC offer something the competition currently don&#8217;t.  I&#8217;m not holding my breath&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2009/10/18/enterprise-computing-do-we-need-fast-v1-emc/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Performance Part III</title>
		<link>http://thestoragearchitect.com/2007/07/09/performance-part-iii/</link>
		<comments>http://thestoragearchitect.com/2007/07/09/performance-part-iii/#comments</comments>
		<pubDate>Mon, 09 Jul 2007 19:39:00 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[array group]]></category>
		<category><![CDATA[Cruise Control]]></category>
		<category><![CDATA[Optimizer]]></category>
		<category><![CDATA[performance]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.wordpress.com/2007/07/09/performance-part-iii/</guid>
		<description><![CDATA[<p>Next under discussion for performance is array groups. </p> <p>First the background (and as usual, apologies to those who already know all this). HDS enterprise arrays lay their disks out in array groups, either RAID-1/0, RAID-5 and RAID-6. Variable size LUNs are then carved out of the array groups for presentation to hosts. Obviously it [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>Next under discussion for performance is array groups. </p>
<p>First the background (and as usual, apologies to those who already know all this).  HDS enterprise arrays lay their disks out in array groups, either RAID-1/0, RAID-5 and RAID-6.  Variable size LUNs are then carved out of the array groups for presentation to hosts.  Obviously it makes sense to ensure that every host has their LUNs selected from as many array groups as possible.  This means that large volumes of read and write data can be serviced quickly and on writes, the data written into cache can be destaged to disk quickly. </p>
<p>Pick any hour in the production day and use something like Tuning Manager and you&#8217;ll likely see (especially on unbalanced systems) the standard exponential curve for IOPS or MB/s written.  The 80/20 rule applies; around 20% of the array groups will be doing 80% of the I/O.  Ideally it would be best to have workload balanced evenly across all array groups, however the effort of rebalancing the data probably doesn&#8217;t justify the returns, unless of course you have some very busy array groups.  Personally I&#8217;d look to ensure no single array group exceeds 50% active and I&#8217;d want 25-50% read hits (remember that you need to make sure you have some spare capacity to cater for disk sparing).  Any array groups exceeding these metrics are candidates for data movement.  I&#8217;ve recently used Cruise Control and I found it a disappointment &#8211; EMC&#8217;s Optimiser swaps two LUNs using a third temporary LUN to manage the exchange.  Cruise Control expects you to provide a free LUN as a target of migration.  This may be difficult if a very quiet array group has been fully allocated.  Therefore I tend to recommend manual exchanges, especially if hosts have a Logical Volume Manager product installed.</p>
<p>Balancing workload will mean checking array groups and moving any &#8220;hot&#8221; LUNs away from each other.  This is where knowing your data becomes important and if possible knowing how the data is mapped on the host to make sure LUNs aren&#8217;t just busy with transient data  (for example, multiple LUNs that comprise a single concatentate volume on a host may be busy over time as more data is allocated to the volume).   It is also equally possible that a single host may be able to overload one array group, so in that instance, moving the LUN will provide no benefit and the data layout on the host will need to be addressed.</p>
<p>It&#8217;s possible to spend hours looking at array group balancing.  The key is to make sure the effort is worth the result.
<div class="blogger-post-footer">
<p>_uacct = &#8220;UA-1104321-2&#8243;;<br />
urchinTracker();
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2007/07/09/performance-part-iii/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Optimisation tools</title>
		<link>http://thestoragearchitect.com/2007/04/24/optimisation-tools/</link>
		<comments>http://thestoragearchitect.com/2007/04/24/optimisation-tools/#comments</comments>
		<pubDate>Tue, 24 Apr 2007 21:16:00 +0000</pubDate>
		<dc:creator>Chris M Evans</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Cruise Control]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[Optimizer]]></category>
		<category><![CDATA[Volume Migrator]]></category>

		<guid isPermaLink="false">http://thestoragearchitect.wordpress.com/2007/04/24/optimisation-tools/</guid>
		<description><![CDATA[<p>Large disk arrays can suffer from an imbalance of data across their RAID/parity groups. This is inevitable even if you plan your LUN allocation as data profiles change over time and storage is allocated and de-allocated.</p> <p>So, tools are available. Think of EMC Optimizer, HDS Cruise Control and Volume Migrator.</p> <p>I&#8217;ve put a poll up [...]<!--Begin ClixTrac.com Rotator Code -->
<script type="text/javascript" language="javascript" src="http://www.clixtrac.com/rotate/321"></script>
<!--End ClixTrac.com Rotator Code -->]]></description>
			<content:encoded><![CDATA[<p>Large disk arrays can suffer from an imbalance of data across their RAID/parity groups. This is inevitable even if you plan your LUN allocation as data profiles change over time and storage is allocated and de-allocated.</p>
<p>So, tools are available. Think of EMC Optimizer, HDS Cruise Control and Volume Migrator.</p>
<p>I&#8217;ve put a poll up on the blog to see what people think &#8211; I have my own views and I&#8217;ll save them until after the vote closes next week.
<div class="blogger-post-footer">
<p>_uacct = &#8220;UA-1104321-2&#8243;;<br />
urchinTracker();
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://thestoragearchitect.com/2007/04/24/optimisation-tools/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

