July 26, 2012 (updated September 7, 2020) by Dimitris

An explanation of IOPS and latency

IOPS: Possibly the most common measure of storage system performance.

IOPS means Input/Output (operations) Per Second. Seems straightforward. A measure of work vs time (not the same as MB/s, which is actually easier to understand – simply, MegaBytes per Second).
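
To make the IOPS vs MB/s distinction concrete, here is a minimal sketch of the arithmetic that connects them: throughput is simply IOPS multiplied by the I/O size. The function name and the sample sizes are my own illustrative choices, and MB is taken as 1,000,000 bytes.

```python
def throughput_mb_per_s(iops: float, io_size_bytes: int) -> float:
    """Throughput in MB/s (1 MB = 1,000,000 bytes) for an IOPS rate and I/O size."""
    return iops * io_size_bytes / 1_000_000


# The same IOPS figure at three illustrative I/O sizes.
for label, size in [("512 B", 512), ("4 KiB", 4096), ("256 KiB", 256 * 1024)]:
    mb_s = throughput_mb_per_s(100_000, size)
    print(f"100,000 IOPS at {label}: {mb_s:,.1f} MB/s")
```

The same "100,000 IOPS" moves wildly different amounts of data depending on the I/O size, which is one reason a bare IOPS number says so little.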

How many of you have seen storage vendors extolling the virtues of their storage by using large IOPS numbers to illustrate a performance advantage?

How many of you decide on storage purchases and base your decisions on those numbers?

However: how many times has a vendor actually specified what they mean when they utter “IOPS”? 🙂

For the impatient, I’ll say this: IOPS numbers by themselves are meaningless and should be treated as such. Without additional metrics such as RAID type, randomness, latency, read vs write % and I/O size (to name a few), an IOPS number is useless.

And now, let’s elaborate… (and, as a refresher regarding the perils of ignoring such things when it comes to sizing, you can always go back here).

One hundred billion IOPS…

I’ve competed with various vendors that promise customers high IOPS numbers. On a small system with under 100 standard 15K RPM spinning disks, a certain three-letter vendor was claiming half a million IOPS. Another, a million. Of course, my customer was impressed, since that was far, far higher than the number I was providing. But what’s reality?

Here, I’ll do one right now: an SSD can do a million IOPS. Maybe even two million.

Go ahead, prove otherwise.

It’s impossible, since there is no standard way to measure IOPS, and the official definition of IOPS (operations per second) does not specify certain extremely important parameters. By doing any sort of I/O test on the box, you are automatically imposing your benchmark’s definition of IOPS for that specific test.

Maybe I meant “1 million 512-byte sequential reads”. And maybe your test was “random 8K overwrites”.

What’s an operation? What kind of operations are there?

It can get complicated.

An I/O operation is simply some kind of work the disk subsystem has to do at the request of a host and/or some internal process. Typically a read or a write, with sub-categories (for instance read, re-read, write, re-write, random, sequential) and a size.

Depending on the operation, its size could range anywhere from bytes to kilobytes to several megabytes.

Now consider the following most assuredly non-comprehensive list of operation types:

A random 4KB read
A random 4KB read followed by more 4KB reads of blocks in logical adjacency to the first
A 512-byte metadata lookup and subsequent update
A 256KB read followed by more 256KB reads of blocks in logical sequence to the first
A 64MB read
A series of random 8KB writes followed by 256KB sequential reads of the same data that was just written
Random 8KB overwrites
Random 32KB reads and writes
Combinations of the above in a single thread
Combinations of the above in multiple threads

…this could go on.

As you can see, there’s a large variety of I/O types, and true multi-host I/O is almost never of a single type. Virtualization further mixes up the I/O patterns, too.

Now here comes the biggest point (if you can remember one thing from this post, this should be it):

No storage system can do the same maximum number of IOPS irrespective of I/O type, latency and size.

Let’s re-iterate:

It is impossible for a storage system to sustain the same peak IOPS number when presented with different I/O types and latency requirements.

Another way to see the limitation…

Here’s a gross oversimplification that might help prove the point that the type and size of operation you do matters when it comes to IOPS: a system that can do a million 512-byte IOPS can’t necessarily do a million 256K IOPS. The IOPS vs I/O size relationship is not a linear one, but there is a correlation.

Imagine a bucket, or a shotshell, or whatever container you wish.

Imagine in this container you have either:

A few large balls or…
Many tiny balls

The bucket ultimately contains about the same volume of stuff either way, and it is the major limiting factor. Clearly, you can’t completely fill that same container with the same number of large balls as you can with small balls.

They kinda look like shotshells, don’t they?

Now imagine the little spheres being forcibly evacuated rapidly out of one end… which takes us to…

Latency matters

So, we’ve established that not all IOPS are the same – but what is of far more significance is latency as it relates to the IOPS.

If you want to read no further – never accept an IOPS number that doesn’t come with latency figures, in addition to the I/O sizes and read/write percentages.

Simply speaking, latency is a measure of how long it takes for a single I/O request to complete, from the application’s viewpoint.

In general, when it comes to data storage, high latency is just about the least desirable trait, right up there with poor reliability.

Databases especially are very sensitive with respect to latency – DBs make several kinds of requests that need to be acknowledged quickly (ideally in under 10ms, and writes especially in well under 5ms). In particular, the redo log writes need to be acknowledged almost instantaneously for a heavy-write DB – well under 1ms is preferable.

High sustained latency in a mission-critical app can have a nasty compounding effect – if a DB can’t write to its redo log fast enough for a single write, everything stalls until that write can complete, then moves on. However, if it constantly can’t write to its redo log fast enough, the user experience will be unacceptable as requests get piled up – the DB may be a back-end to a very busy web front-end for doing Internet sales, for example. A delay in the DB will make the web front-end also delay, and the company could well lose thousands of customers and millions of dollars while the delay is happening. Some companies could also face penalties if they cannot meet certain SLAs.

On the other hand, applications doing sequential, throughput-driven I/O (like backup or archival) are nowhere near as sensitive to latency (and typically don’t need high IOPS anyway, but rather need high MB/s).

It follows that not all I/O sizes and I/O operations are subject to the same latency requirements.

Here’s an example from an Oracle DB – a system doing about 15,000 IOPS at 25ms latency. Doing more IOPS would be nice but the DB needs the latency to go a lot lower in order to see significantly improved performance – notice the increased IO waits and latency, and that the top event causing the system to wait is I/O:

[Image: AWR example]

Now compare to this system (the data is in a different format, but you’ll get the point):

Notice that, in this case, the system is waiting primarily for CPU, not storage.

A significant amount of I/O wait is a good way to determine if storage is an issue (there can be other latencies outside the storage of course – CPU and network are a couple of usual suspects). Even with good latencies, if you see a lot of I/O waits it means that the application would like faster speeds from the storage system.

But this post is not meant to be a DB sizing class. Here’s the important bit that I think is confusing a lot of people and is allowing vendors to get away with unrealistic performance numbers:

It is possible (but not desirable) to have high IOPS and high latency simultaneously.

How? Here’s a, once again, oversimplified example:

Imagine 2 different cars, both with a top speed of 150mph.

Car #1 takes 50 seconds to reach 150mph
Car #2 takes 200 seconds to reach 150mph

The maximum speed of the two cars is identical.

Does anyone have any doubt as to which car is actually faster? Car #1 indeed feels about 4 times faster than Car #2, even though they both hit the exact same top speed in the end.

Let’s take it an important step further, keeping the car analogy since it’s very relatable to most people (but mostly because I like cars):

Car #1 has a maximum speed of 120mph and takes 30 seconds to hit 120mph
Car #2 has a maximum speed of 180mph, takes 50 seconds to hit 120mph, and takes 200 seconds to hit 180mph

In this example, Car #2 actually has a much higher top speed than Car #1. Many people, looking at just the top speed, might conclude it’s the faster car.

However, Car #1 reaches its top speed (120mph) far faster than Car #2 reaches that same speed (120mph).

Car #2 continues to accelerate (and, eventually, overtakes Car #1), but takes an inordinately long amount of time to hit its top speed of 180mph.

Again – which car do you think would feel faster to its driver?

You know – the feeling of pushing the gas pedal and the car immediately responding with extra speed that can be felt? Without a large delay in that happening?

Which car would get more real-world chances of reaching high speeds in a timely fashion? For instance, overtaking someone quickly and safely?

Which is why car-specific workload benchmarks like the quarter mile were devised: How many seconds does it take to traverse a quarter mile (the workload), and what is the speed once the quarter mile has been reached?

(I fully expect fellow geeks to break out the slide rules and try to prove the numbers wrong, probably factoring in gearing, wind and rolling resistance – it’s just an example to illustrate the difference between throughput and latency, I had no specific cars in mind… really).
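
To put the same point in storage terms, here is a rough sketch using Little’s Law, a general queueing result (not something specific to any array): the average number of I/Os in flight equals IOPS multiplied by average latency. It assumes steady-state averages, and the function names and sample numbers are illustrative only.

```python
def outstanding_ios(iops: float, latency_ms: float) -> float:
    """Little's Law: average I/Os in flight = IOPS x average latency (in seconds)."""
    return iops * latency_ms / 1000


def iops_at(concurrency: float, latency_ms: float) -> float:
    """The same relationship rearranged: IOPS = I/Os in flight / latency (in seconds)."""
    return concurrency * 1000 / latency_ms


# The same IOPS number at two very different latencies.
for latency_ms in (1, 20):
    in_flight = outstanding_ios(250_000, latency_ms)
    print(f"250,000 IOPS at {latency_ms} ms needs ~{in_flight:,.0f} I/Os in flight")

# A single thread issuing one I/O at a time at 5 ms per I/O.
print(f"1 I/O in flight at 5 ms: {iops_at(1, 5):,.0f} IOPS")
```

In other words, a system can post a big IOPS number at poor latency as long as enough I/O is queued up in parallel, which is exactly why the IOPS figure alone tells you nothing about how responsive the storage feels.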

As the icing on the complexity cake, latency measurement results can seem very different depending on various things:

Are you measuring from the host or the array? If from the host, understand that this value will include the entire I/O stack, including all your storage networking latency.
What is the time resolution of measurement? The results can vary hugely depending on whether you are measuring latency every few milliseconds or averaging latency over a whole hour. If ultra-granular, you may see latency microbursts that are fairly common and nothing to worry about. If not granular enough, you won’t see enough detail. I recommend no more than 5 minutes for a measuring interval. 1 minute is better if you can get it.
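
As a rough illustration of the time-resolution point, the sketch below averages the same latency samples over different intervals. The data is entirely made up (one sample per second for an hour, a 2 ms baseline and a two-minute stall at 40 ms), and the helper name is hypothetical.

```python
from statistics import mean


def interval_averages(samples_ms, interval):
    """Average consecutive, non-overlapping groups of `interval` samples."""
    return [mean(samples_ms[i:i + interval])
            for i in range(0, len(samples_ms), interval)]


# Synthetic data: 3600 one-second samples at 2 ms, with a 2-minute stall at 40 ms.
samples = [2.0] * 3600
samples[1740:1860] = [40.0] * 120

hourly = interval_averages(samples, 3600)   # a single value for the hour
five_min = interval_averages(samples, 300)  # twelve values
one_min = interval_averages(samples, 60)    # sixty values

print(f"1-hour average:         {hourly[0]:.2f} ms")
print(f"worst 5-minute average: {max(five_min):.2f} ms")
print(f"worst 1-minute average: {max(one_min):.2f} ms")
```

With this made-up data, the hourly average comes out around 3.3 ms and looks healthy, the worst 5-minute average is 9.6 ms, and only the 1-minute view shows the full 40 ms stall.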

And, finally, some more storage-related examples…

Some vendor claims… and the fine print explaining the more plausible scenario beneath each claim:

“Mr. Customer, our box can do a million IOPS!”

…512-byte ones, sequentially out of cache.

“Mr. Customer, our box can do a quarter million random 4K IOPS – and not from cache!”

…at 50ms latency and 100% reads.

“Mr. Customer, our box can do a quarter million 8K IOPS, not from cache, at 20ms latency!”

…but only if you have 1000 threads going in parallel.

“Mr. Customer, our box can do a hundred thousand 4K IOPS, at under 20ms latency!”

…but only if you have a single host hitting the storage so the array doesn’t get confused by different I/O from other hosts.

Notice how none of these claims are talking about writes or working set sizes… or the configuration required to support the claim.

What to look for when someone is making a grandiose IOPS claim

Audited validation and a specific workload to be measured against (that includes latency as a metric) both help. I’ll pick on HDS since they habitually show crazy numbers in marketing literature.

For example, from their website:

[Image: HDS USP IOPS]

It’s pretty much the textbook case of unqualified IOPS claims. No information as to the I/O size, reads vs writes, sequential or random, what type of medium the IOPS are coming from, or, of course, the latency…

However, that very same box almost makes 270,000 SPC-1 IOPS with good latency in the audited SPC-1 benchmark:

Last I checked, 270,000 was almost 15 times less than 4,000,000. Don’t get me wrong, almost 270,000 low-latency IOPS is a great SPC-1 result, but it’s not 4 million SPC-1 IOPS.

Check my previous article on SPC-1 and how to read the results here. And if a vendor is not posting results for a platform – ask why.

Where are the IOPS coming from?

So, when you hear those big numbers, where are they really coming from? Are they just fictitious? Not necessarily. So far, here are just a few of the ways I’ve seen vendors claim IOPS prowess:

What the controller will theoretically do given unlimited back-end resources.
What the controller will do purely from cache (a LOT of the later v1 SPC-1 submissions game the results that way, using systems with gigantic cache and small amounts of test data).
What a controller that can compress data will do with all zero data.
What the controller will do assuming the data is at the FC port buffers (“huh?” is the right reaction, only one three-letter vendor ever did this so at least it’s not a widespread practice).
What the controller will do given the configuration actually being proposed driving a very specific application workload with a specified latency threshold and real data.

The figures provided by the approaches above are all real, in the context of how the test was done by each vendor and how they define “IOPS”. However, of the (non-exhaustive) options above, which one do you think is the more realistic when it comes to dealing with real application data?

What if someone proves to you a big IOPS number at a PoC or demo?

Proof-of-Concept engagements or demos are great ways to prove performance claims.

But, as with everything, garbage in – garbage out.

If someone shows you a benchmark tool doing crazy IOPS, use the information in this post to help you at least find out what the exact configuration of the benchmark is. What’s the block size, is it random, sequential, a mix, how many hosts are doing I/O, etc. Is the config being short-stroked? Is it coming all out of cache?

Benchmark tools can make for a good demo, but that doesn’t mean the combined I/O of all your applications follows the same parameters, nor does it mean the few servers hitting the storage at the demo are representative of your server farm with 100x the number of servers. Testing with as close to your application workload as possible is preferred. Don’t assume you can extrapolate – systems don’t always scale linearly.

Factors affecting storage system performance

In real life, you typically won’t have a single host pumping I/O into a storage array. More likely, you will have many hosts doing I/O in parallel. Here are just some of the factors that can affect storage system performance in a major way:

Amount of I/O concurrency/threads.
Controller, CPU, memory, interlink counts, speeds and types.
A lot of random writes. This is the big one, since, depending on RAID level, the back-end I/O overhead could be anywhere from 2 I/Os (RAID 10) to 6 I/Os (RAID6) per write, unless some advanced form of write management is employed.
Uniform latency requirements – certain systems will exhibit latency spikes from time to time, even if they’re SSD-based (sometimes especially if they’re SSD-based).
A lot of writes to the same logical disk area. This, even with autotiering systems or giant caches, still results in tremendous load on a rather limited set of disks (whether they be spinning or SSD).
The storage type used and the amount – different types of media have very different performance characteristics, even within the same family (the performance between SSDs can vary wildly, for example).
CDP tools for local protection – sometimes this can result in 3x the I/O to the back-end for the writes.
Copy on First Write snapshot algorithms with heavy write workloads.
Misalignment.
Heavy use of space efficiency techniques such as compression and deduplication.
Heavy reliance on autotiering (resulting in the use of too few disks and/or too many slow disks in an attempt to save costs).
Insufficient cache with respect to the working set coupled with inefficient cache algorithms, too-large cache block size and poor utilization.
Shallow port queue depths.
Inability to properly deal with different kinds of I/O from more than a few hosts.
Inability to recognize per-stream patterns (for example, multiple parallel table scans in a Database).
Inability to intelligently prefetch data.

What you can do to get a solution that will work…

You should work with your storage vendor to figure out, at a minimum, the items in the following list, and, after you’ve done so, go through the sizing with them and see the sizing tools being used in front of you. (You can also refer to this guide).

Applications being used and size of each (and, ideally, performance logs from each app)
Number of servers
Desired backup and replication methods
Random read and write I/O size per app
Sequential read and write I/O size per app
The percentages of read vs write for each app and each I/O type
The working set (amount of data “touched”) per app
Whether features such as thin provisioning, pools, CDP, autotiering, compression, dedupe, snapshots and replication will be utilized, and what overhead they add to the performance
The RAID type (R10 has an impact of 2 I/Os per random write, R5 4 I/Os, R6 6 I/Os – is that being factored?)
The impact of all those things on the overall headroom and performance of the array.
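
As a back-of-the-envelope sketch of the RAID item in the list above, the snippet below applies the per-write penalties quoted there (2, 4 and 6 back-end I/Os per random write) to a front-end workload. It deliberately assumes purely random I/O with no cache hits, no full-stripe writes and no advanced write management, so treat it as a worst-case illustration rather than a sizing tool. The function name and workload numbers are hypothetical.

```python
# Back-end I/Os per random front-end write, as quoted in the list above.
WRITE_PENALTY = {"RAID10": 2, "RAID5": 4, "RAID6": 6}


def backend_iops(frontend_iops: float, read_pct: int, raid: str) -> float:
    """Estimate back-end disk IOPS for a random workload, ignoring cache effects."""
    reads = frontend_iops * read_pct / 100
    writes = frontend_iops * (100 - read_pct) / 100
    return reads + writes * WRITE_PENALTY[raid]


# Illustrative workload: 10,000 front-end IOPS, 70% reads / 30% writes.
for raid in WRITE_PENALTY:
    print(f"{raid}: {backend_iops(10_000, 70, raid):,.0f} back-end IOPS")
```

For this example the back end has to absorb 13,000, 19,000 or 25,000 IOPS depending on the RAID type, even though the hosts only ever see 10,000.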

If your vendor is unwilling or unable to do this type of work, or, especially, if they tell you it doesn’t matter and that their box will deliver umpteen billion IOPS – well, at least now you know better 🙂

58 Replies to “An explanation of IOPS and latency”

Jim Grayson says:

July 27, 2012 at 3:57 pm

Thanks – lots to read, wish it were shorter 🙂
Mike Riley says:

July 27, 2012 at 4:18 pm

A real-world example of why latency matters. Before coming to NetApp, I worked at a bank card company (later bought by JPMC). For our credit card authorization system, the number of IOPS quoted by the vendor were irrelevant. Our application IOPS number was helpful to the storage vendors but latency was king. If the storage latency was >4ms, our fraud and abandonment numbers went through the roof. The storage had to respond in 4ms or less in order for us to run the transaction through our fraud detection systems and respond to the user standing at the cash register before they got frustrated and switched to a different card. Basically, if storage couldn’t respond in time, we had enough lost revenue due to either fraud or abandonment to pay for an entirely new storage farm in about an hour. Nothing personal to the storage vendors but we had them all on speed-dial. You could either do it or you couldn’t. One year we literally had vendors stacked on the loading dock – one was on their way out; one was on their way in; all were on a right-of-return. The business literally depended on latency.
sl0n says:

August 7, 2012 at 7:32 am

Another example where latency, as Mike says, is king: mysql master-slave replication. Not providing too many details here and simplifying – you can do writes in multiple threads on a master, however when the same writes get replicated to a slave they have to be applied sequentially. And therefore it’s a single-thread process .. at this point all that really matters is latency. It’s 8Gb SAN fabric in our case and the numbers themselves are very good, e.g. most of the time our latency is ~ 1ms, however for some db’s it’s “too high” and storage-attached slaves are not keeping up with replication.
Jay says:

August 10, 2012 at 12:47 am

Well i understand your oversimplification for the latency, but i can’t quite get the curve to a real storage system.

I think that if a storage can provide high IOPS, then it automatically can also answer very very fast to a read/write command (and that means good latency).

NetApp-techies in TR-3808, page 6 write something similar:
http://www.netapp.com/us/library/technical-reports/tr-3808.html
“Overall average latencies for all test cases closely tracked the performance differences as measured in
IOPS between the protocols. This is expected as all test cases were run for the same time duration and,
in general, higher numbers of IOPS map directly to lower overall average latencies.”

So in the end, latency depends on high IOPS, doesn’t it? (i mean a specific I/O workload that simulates a specific application, e.g. VMware)
1. Dimitris says:
  
  August 10, 2012 at 5:01 am
  
  Hi Jay,
  
  It’s not quite that simple. There are storage systems out there that advertise high IOPS – but the latency is not advertised. If you push the vendor for the number, you may discover the latency for the high IOPS is an unusable high number.
  
  BTW – there is a relationship between IOPS and latency as you note, but it’s definitely not linear, and it depends on each type of system (some scale more linearly than others). So, a certain system might be able to provide good latency and linear scaling up to 300000 IOPS, then the latency might jump 3x to go to 400000 IOPS.
  
  If you only knew the latency at the 400000 IOPS point and tried to estimate the latency at 100000 IOPS, your number would be wrong due to the nonlinear scaling.
  
  So, my point is that using just the IOPS number without the latency is useless in every single case.
  
  More useful are IOPS vs latency curves like you see in the SPC-1 benchmark, then you know how the system responds as IOPS go up.
  
  In addition, some systems respond to certain types of IOPS more efficiently than others.
  
  For example, a system may provide excellent latency at high 512-byte read IOPS, but the latency will take a nosedive when doing a blend of 4K reads and writes.
  
  D
  1. Nikolay says:
    
    October 10, 2012 at 8:00 am
    
    Hi, Dimitris!
    
    First of all I’d like to thank you for your blog – it’s one of the best on the Internet about storage systems aspects!
    Regarding this particular undoubtedly fundamental and very useful post – one thing is still unclear to me.
    IOPS (as per Wikipedia) stands for Input/Output Operations _Per Second_ so why is the relationship between IOPS and latency not linear, if with more operations per second (it doesn’t matter how big or random/sequential these IOPS are at the moment, let’s say 4K random within a single volume and aggregate for example) the system will get busier overall (CPU, iSCSI/FC ports, internal HD cache and so on) and, therefore, latency will increase (like network ping reply times increase when a link gets more utilized)? Please correct my logic if I’m wrong 🙂
    
    Thanks in advance!
    1. Dimitris says:
      
      October 12, 2012 at 1:26 am
      
      Hi Nikolay, and thanks for the kind words.
      
      Latency is relatively linear up to a point, then it goes up very quickly.
      
      Why?
      
      Several reasons. The easiest way to explain it is that not everything in the storage system runs out of steam at the same time.
      
      There is of course a ton of proof that latency doesn’t increase linearly with IOPS. Look at pretty much every SPC-1 submission and you see the same thing.
      
      Unless I misunderstood the question.
      
      Thx
      
      D
Jay says:

August 13, 2012 at 6:22 am

Thanks Dimitris,

now it is clear 🙂
Livio LZSPP Doidao says:

December 20, 2012 at 1:52 pm

What does IOPS stand for?
1. Christopher Waltham says:
  
  February 6, 2013 at 10:18 pm
  
  Livio, it stands for: Inputs & Outputs Per Second. Hope this helps!
Some Guy Named Jay says:

January 15, 2013 at 8:34 am

As a SAN filled with spinning disks fills up with data, won’t the latency increase as the sectors being written are further from the faster outer part of the disk? How does this play into the performance calculation?
1. Dimitris says:
  
  January 15, 2013 at 8:42 am
  
  Seek time will increase, yes. But it depends on the storage system being used. For example, NetApp’s Data ONTAP has an option (wafl.optimize_write_once off) that will randomize writes to the disk surface in order to provide geometric fairness for all workloads (the default is on but for most systems I prefer off).
  
  With other systems, you can’t really control that, so yes, as the system fills up, it will get slower (which is also ONTAP’s default mode of operation).
  
  NetApp turns the option off for the SPC-1 tests – nobody can accuse us of disk short-stroking 🙂 (well, they can and they do, but they’re easily proven wrong after I point out that the option was off for the test).
  
  D
  1. Some Guy Named Jay says:
    
    January 15, 2013 at 9:59 am
    
    Interesting. Have you tested this system with a Flash Pool yet? Will you? What would you expect the results to do?
Gen says:

January 28, 2013 at 1:16 pm

Thanks! this is a great set of knowledge.
I just don’t understand one thing.
IOPS is a number of operations done in one second.
Latency is a time spent on doing one operation. Isn’t it?
So let’s say I have 5ms latency. Doesn’t it indicate that in one second I would do 1000ms / 5ms = 200 operations?
I know that this is not right because my storage just did 2984 IOs in one second with latency 5ms (cache hit+miss). Where this extra 2,4k ios are coming from!? One second is one second right?
1. Dimitris says:
  
  January 28, 2013 at 2:56 pm
  
  There is a lot of parallelism in disk arrays, plus a lot of cache-related cleverness, plus other methods for increasing IOPS with low latency. For instance, NetApp systems running ONTAP code will convert random writes to sequential ones, making writes more efficient.
  
  For a single-threaded workload on a single disk with zero cache, your calculation would be more right.
  1. Gen says:
    
    February 1, 2013 at 6:01 pm
    
    Thanks! Now it makes all sense to me. We have over here many threads (10 hosts + nSeries head) so we must have a lot of parallelism.
    Thanks again!
  2. Peter Jacobs says:
    
    July 10, 2014 at 8:53 pm
    
    That’s only true on a fresh system. Once the system gets past a certain percentage full it no longer applies. Once you get above 50% you’re sure to start to see latency climb since it’s harder for the head of the drive to find an open space.
nm says:

February 15, 2013 at 10:37 am

Hi Recovery Monkey,
In the start of your post you mention 15,000 IOPs then show a screen shot from an awr report that shows physical reads at 14,676 per second. Correct me if I am wrong I’ve always thought that IOPs from an AWR were calculated using the following statistics from awr:

physical read total IO requests + physical write total IO requests (assuming 10.2 or above)

It’s my understanding that the load profile that shows physical reads – shows them in units of o/s blocks.? From the reference manual of 11.2:
.
“Total number of data blocks read from disk. This value can be greater than the value of “physical reads direct” plus “physical reads cache” as reads into process private buffers also included in this statistic.”
.
I’m always confused on this cause I see people using different statistics for this?
1. Dimitris says:
  
  February 15, 2013 at 11:48 am
  
  Hi NM,
  
  Physical reads in AWR is a measure of the number of Oracle blocks read using read(2) system calls.
  
  For typical random-access workloads each one of those is a read operation. If there are a lot of sequential read requests then many blocks may be read in one read operation.
  
  So that statistic is an upper bound on the number of read IOPS.
  
  The physical reads and writes only account for physical I/O related to data blocks, and does not include redo and archiving activity, nor any “non-database” I/O activity like RMAN, LOB, etc…
  
  Since those physical reads could be of varying sizes, you also have to check the Instance stats for “physical read total bytes”. That way you can tie the read operations per second. Just look at the middle column since that’s the per second number.
  
  D
Rod says:

February 21, 2013 at 6:00 am

Thanks for a most informative and entertaining post. Cutting through the marketing hype and buzz word jungle like a sword of clarity 🙂
Tom Elliott says:

March 14, 2013 at 5:20 pm

What a great post! Very informative. Thanks for sharing it.
Niall Byrne says:

April 13, 2013 at 1:14 am

Great article. Cuts through the BS I read in a lot of technical articles. Explains IOPs and Latency beautifully and in a clear manner. Highly recommended.
Adrian says:

September 10, 2013 at 9:27 am

Love this article, well done and thank you on behalf of many people who wanted to understand the depths of iops and latency.
Mahmoud says:

October 29, 2013 at 8:27 pm

Thanks so much for this very useful article, but I wish you have introduced the difference between IOPS and MB/s…is it possible to give a brief explanation, or just recommend any other article about it….Thanks again 🙂
Simon says:

November 27, 2013 at 4:35 pm

WOW! Thank you, Dimitris!

This is the greatest article I have ever read regarding storage I/O etc.
As a DBA, I have seen tons of event 833 error messages recorded by MS SQL Server on Windows, and some ocfs2 fencing of 1 minute latency delay on Linux boxes.
Those problems are apparently caused by disk latency, aren’t they?
Hoaviet says:

February 22, 2014 at 1:51 am

Hi, so I was reading this and it said http://www.storagereview.com/samsung_840_evo_ssd_review

That ssd has 98,000 IOPS and a max latency of 0.87 for 1tb, is this good?
1. Randy says:
  
  December 29, 2014 at 8:12 pm
  
  Yes. I have one in my desktop, and it’s great. But it is consumer grade and not for write-heavy applications. Even so, I’ll get many years at the rate I’m writing. Below is a sample test and some disk counters from during the test – they show the latency is pretty darn good. (SQL Server instances are running on the drive at the same time, so it’s not an isolated test.)
  
  It’s a lot easier to get such performance from one drive than to create a SAN that provides such performance for 100 drives. I always expect my desktop to easily outperform any system built off the SAN – even if it’s the latest and greatest SAN.
  
  Hey vendor, how many drives of type “fill in the blank” (e.g. 1 TB EVOs) can I simulate with your SAN at the same time? What minimum metrics can I expect from any drive no matter what? Can I run CrystalDiskMark on all of the disks at the same time? And, btw, how many days does our over-worked SAN admin have to work to get that metric for these drives?
  
  ———————————————————————–
  CrystalDiskMark 3.0.3 x64 (C) 2007-2013 hiyohiyo
  Crystal Dew World : http://crystalmark.info/
  ———————————————————————–
  * MB/s = 1,000,000 byte/s [SATA/300 = 300,000,000 byte/s]
  
  Sequential Read : 518.926 MB/s
  Sequential Write : 491.751 MB/s
  Random Read 512KB : 466.305 MB/s
  Random Write 512KB : 478.286 MB/s
  Random Read 4KB (QD=1) : 45.566 MB/s [ 11124.4 IOPS]
  Random Write 4KB (QD=1) : 127.164 MB/s [ 31046.0 IOPS]
  Random Read 4KB (QD=32) : 362.285 MB/s [ 88448.5 IOPS]
  Random Write 4KB (QD=32) : 335.330 MB/s [ 81867.7 IOPS]
  
  Test : 1000 MB [D: 73.3% (614.6/838.4 GB)] (x5)
  Date : 2014/12/29 15:53:31
  OS : Windows 7 Professional SP1 [6.1 Build 7601] (x64)
  
  Avg. Disk Queue Length: 15.650321, 24.188959, 49.667465, 14.084006, 13.317783
  Avg. Disk sec/Read: 0.000207, 0.000289, 0.000285, 0.000233, 0.000000
  Avg. Disk sec/Write: 0.002113, 0.000000, 0.001119, 0.000192, 0.000181
  Disk Read Bytes/sec: 306364089.826151, 342972819.952190, 182800181.749099, 3260.157804, 0.000000
  Disk Reads/sec: 74795.920368, 83707.463238, 44587.389011, 0.397968, 0.000000
  Disk Write Bytes/sec: 69624570.829117, 0.000000, 135471826.901399, 302652267.032678, 300871930.072978
  Disk Writes/sec: 68.457038, 0.000000, 33010.259770, 73472.012545, 73449.527361
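
The counters above tie IOPS, latency and queue depth together through Little's law: the average number of outstanding I/Os equals the I/O rate multiplied by the time each I/O spends in the system. The sketch below checks that relationship against the samples in the comment. It assumes that the first value listed for each counter belongs to the same sampling interval, the second value to the next interval, and so on; the comment does not state this explicitly.

```python
# Five perfmon samples copied from the comment above. Assumption: the i-th
# value listed for each counter belongs to the same sampling interval.
queue_length = [15.650321, 24.188959, 49.667465, 14.084006, 13.317783]
sec_per_read = [0.000207, 0.000289, 0.000285, 0.000233, 0.000000]
sec_per_write = [0.002113, 0.000000, 0.001119, 0.000192, 0.000181]
reads_per_sec = [74795.920368, 83707.463238, 44587.389011, 0.397968, 0.000000]
writes_per_sec = [68.457038, 0.000000, 33010.259770, 73472.012545, 73449.527361]


def estimated_queue_length(reads, read_latency, writes, write_latency):
    """Little's law: outstanding I/Os = I/O rate x time per I/O."""
    return reads * read_latency + writes * write_latency


estimates = [
    estimated_queue_length(r, rl, w, wl)
    for r, rl, w, wl in zip(reads_per_sec, sec_per_read, writes_per_sec, sec_per_write)
]

for i, (estimate, observed) in enumerate(zip(estimates, queue_length), start=1):
    print(f"sample {i}: estimated {estimate:.2f}, reported {observed:.2f}")
```

With these numbers the estimate lands within a few hundredths of the reported queue length in every sample. Read the other way round, it shows why a high IOPS figure alone is not informative: the same rate can come from a short queue at low latency or a deep queue at high latency.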
rsr72 says:

April 9, 2014 at 9:26 am

Nice article, thank you!
Sam @ Xcentech says:

April 27, 2014 at 12:32 am

Thank you for the post. I’m glad that you took the time to break down what can quickly become a daunting subject. I look forward to seeing your other posts once I get the time to read them! 😀

—Sam
Guido says:

May 9, 2014 at 3:10 pm

This makes a lot of sense. I’m not a storage guy, but whenever the storage dudes mentioned IOPs, I always thought that they needed to add more info to this term to be meaningful. It’s kind of like using RPM to measure how fast your car is moving. Well, it depends….
crookedm says:

June 5, 2014 at 3:23 pm

Storage guys quoting IOPS oh dear, latency is king. Exchange completion times even more so. Sometimes I’m convinced people in application teams think arrays are solely dedicated to their apps and storage colleagues tell them IOPS to make them not blame the storage. Let’s not get started on queue depths either!
Sreenath Gupta says:

July 9, 2014 at 4:55 am

Hello my friend, I think I have reached you at last. I am in very big trouble with my new Dell R820 server with Hyper-V 2012 installed on it. I am facing high latency issues with all the VMs as well as with the host machine. For a few hours the server works well, then the latency issue returns; when I restart the server the problem disappears, and it comes back again after some time. I have raised tickets with Dell and Microsoft, and neither of them could solve my issue. We are using the server for virtualization and the hardware configuration is 64 GB RAM/Xeon processor quad core 2 sockets total 32 threads/1 x 3 SAS 6 Gb 7k RPM. Kindly advise whether this is due to an HDD issue.
1. Peter Jacobs says:
  
  July 10, 2014 at 9:13 pm
  
  Absolutely a disk issue. Depending on the RPM of the drives, it would give you 450 IOPS at most.
  1. Peter Jacobs says:
    
    July 10, 2014 at 9:14 pm
    
    I missed the RPM. It would give you 210 IOPS at most :-).
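
Ballpark figures like the ones in this exchange can be roughly reproduced from drive mechanics: a random I/O on a spinning disk costs about one average seek plus half a rotation. The sketch below is a rule of thumb only. The seek time and the reading of the configuration as three 7,200 RPM drives are assumptions for illustration, not details confirmed in the thread, and the estimate ignores RAID write penalties, caching and queuing.

```python
def hdd_random_iops(rpm, avg_seek_ms):
    """Rule-of-thumb random IOPS for a single spinning disk."""
    half_rotation_ms = 0.5 * 60_000 / rpm  # average rotational latency
    return 1000 / (avg_seek_ms + half_rotation_ms)


# Assumed values for illustration only (not taken from the thread).
DRIVES = 3
ASSUMED_SEEK_MS = 8.5

per_drive = hdd_random_iops(rpm=7200, avg_seek_ms=ASSUMED_SEEK_MS)
print(f"per drive: {per_drive:.0f} IOPS, {DRIVES} drives: {DRIVES * per_drive:.0f} IOPS")
```

Under these assumptions the total comes out at a couple of hundred IOPS, the same order of magnitude as the estimate in the reply above, and far below what a host running several VMs can ask for.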
alexey says:

December 10, 2014 at 7:03 am

Please note that the SPC-1 HDS benchmark is for the USP array, whereas the 4 million IOPS figure is for the VSP, which is a totally different and more modern storage array.
1. Dimitris says:
  
  December 15, 2014 at 3:21 pm
  
  Good catch – I put in the updated VSP result. Just 70,000 more IOPS so the point is still valid – HDS claims 4 MILLION IOPS for the VSP, but makes almost 270,000 SPC-1 IOPS. Nowhere near.
rouncer says:

October 8, 2015 at 3:52 am

Only guys that have an APPLICATION for this new style of random access will even benefit from it. Do you really think your average user has a use for a GTX 980 video card? Doubtful, if he's just running games on it. However, if it's a guy doing realtime raytracing, he's actually a real user. That guy saying IOPS wasn't as important as latency for him was due to him actually having an application for the hardware.
IOPS are important to my application because I'm running searching algorithms. IOPS is also damn important in spatial division applications, and any form of virtual addressing also requires IOPS as a leading factor in the performance of the software.
1. Dimitris says:
  
  October 8, 2015 at 8:20 am
  
  Everyone has different requirements and, broadly speaking, there are applications that need low latency, and applications that need high throughput.
  
  If latency is largely unimportant to you, then your application seems to be the latter. That’s all.
  
  However, most people running DBs, especially in the finance industry, need a combination of low latency per operation and high number of operations.
  
  Getting high IOPS without any regard to latency is actually very easy for most systems. Check my SPC-1 articles where I compare various storage systems, you will see that some systems that can achieve huge IOPS can only do so at very high latencies.
  
  Thx
  
  D
Oskar Arg says:

April 11, 2016 at 8:47 am

Thanks for the article and apologies for this late question.

I suspect there is an IO performance issue caused by very intermittent latency problems.

The subsystem vendor shows figures which demonstrate that the average IOPS is very good.

What standard Unix tools can be used to determine the real latency on an IO subsystem?
trebor says:

July 1, 2016 at 1:08 pm

“I had no specific cars in mind… really” … *cough* … Bugatti Veyron … *cough* … 😉
tommisan says:

August 10, 2016 at 7:22 am

This post was really interesting. thanks a lot!
pinkesh says:

September 19, 2016 at 7:00 am

Which tool is best for measuring both latency and IOPS of a disk?
1. Dimitris says:
  
  January 30, 2017 at 8:18 pm
  
  vdbench is good, as is fio.
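
For a rough first look at both numbers before setting up either tool, per-operation latency can be timed directly. The sketch below is not a substitute for vdbench or fio: it is single-threaded (queue depth 1), the reads go through the operating system's page cache, and `os.pread` is only available on Unix-like systems. The function name and the temporary test file are illustrative assumptions. It reports the 99th percentile next to the mean because an average alone can hide intermittent latency spikes.

```python
import os
import random
import statistics
import tempfile
import time


def measure_random_reads(path, n_ops=2000, block_size=4096):
    """Time n_ops random block reads and return IOPS plus latency statistics."""
    blocks = os.path.getsize(path) // block_size
    if blocks < 1:
        raise ValueError("file is smaller than one block")
    latencies = []
    fd = os.open(path, os.O_RDONLY)
    try:
        start = time.perf_counter()
        for _ in range(n_ops):
            offset = random.randrange(blocks) * block_size
            t0 = time.perf_counter()
            os.pread(fd, block_size, offset)
            latencies.append(time.perf_counter() - t0)
        elapsed = time.perf_counter() - start
    finally:
        os.close(fd)
    latencies.sort()
    return {
        "iops": n_ops / elapsed,
        "mean_ms": 1000 * statistics.fmean(latencies),
        "p99_ms": 1000 * latencies[int(0.99 * (n_ops - 1))],
        "max_ms": 1000 * latencies[-1],
    }


if __name__ == "__main__":
    # Small throwaway file; it will be served from cache, so the numbers
    # only demonstrate the method, not the performance of the disk.
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(os.urandom(4 * 1024 * 1024))
    try:
        print(measure_random_reads(tmp.name))
    finally:
        os.unlink(tmp.name)
```

To say anything about the disk itself, the test file would need to be much larger than RAM or the cache would need to be bypassed, which is exactly the kind of detail the dedicated tools handle.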
pinkesh says:

September 19, 2016 at 11:23 pm

Hi,
I am testing my system with a SAS drive.
Suppose I want to use my system in a data center: can you suggest which test cases I have to perform to test my CPU with a SAS drive?
Adam Barton says:

February 17, 2017 at 11:10 pm

Please note that the SPC-1 HDS benchmark is for the USP array, whereas the 4 million IOPS figure is for the VSP, which is a totally different and more modern storage array.
1. Dimitris says:
  
  March 31, 2017 at 11:05 am
  
  Hi Adam, the screenshot is right: HDS used to claim 4 million IOPS on the USP. Look at the text.
Agha says:

May 15, 2017 at 7:28 am

An eye opener post we found here.
Thanx for your thoughts
Kirk Brocas says:

December 31, 2018 at 3:56 pm

Great post! I was actually thinking of Oracle databases as I started reading 🙂 I always used to have to shoot the storage guys down on IOPS and RAID 5.
1. Kirk Brocas says:
  
  December 31, 2018 at 4:01 pm
  
  Tim Hall also has a good post
  https://oracle-base.com/articles/misc/oracle-and-raid
Gongbola says:

March 23, 2020 at 12:27 am

Great Post
خرد says:

April 5, 2021 at 2:11 am

Hello dear

It was really helpful for – thank you

I came here last week looking for something I found – thank you

Comments are closed.