The Micron 9400 NVMe SSD is the top PCIe Gen4 SSD for AI storage

According to their website, MLCommons was started in 2018 “…to accelerate machine learning innovation and increase its positive impact on society...” Today, MLCommons maintains and develops 6 different AI benchmark suites and is developing open datasets to support future state-of-the-art AI workload model development. The MLPerf Storage Benchmark Suite is the latest addition to the benchmark collection.

As a member of the MLCommons Storage Working Group, I’ve helped develop AI benchmark rules and processes to help ensure that benchmark results are meaningful to researchers, customers, and vendors alike, and we’ve just published the first round of submissions including results for the Micron 9400 SSD.

But why do we need a new AI benchmark utility that’s specific to AI workloads?

Characterizing the storage workload for AI Training systems faces two unique challenges that the MLPerf Storage Benchmark Suite aims to address – the cost of AI accelerators and the small size of available datasets.

The first is obvious, AI accelerators can be expensive, complex compute systems and most storage vendors won’t have enough AI systems available just to analyze their products’ scalability in storage solutions.

The second issue is that the openly-available datasets are small compared to what is commonly used in AI industry. Whereas the datasets available to MLCommons and its participants may get as large as 150 Gigabytes, datasets used in production are frequently 10s to 100s of Terabytes. Modern servers can easily have 1 to 2 Terabytes of DRAM which has the effect of caching the small benchmark datasets in system memory after the first training epoch then executing subsequent runs from that in-DRAM data. But production datasets would not see the same behavior due to their size.

MLPerf Storage addresses the first issue by emulating the accelerators in standard CPU-based servers. At the low level, MLPerf Storage is using the same AI workload frameworks as the commonly-used workloads (pytorch, tensorflow, etc.) but MLPerf bypasses the compute portion of the platform with a “sleep time” that is found experimentally by running the real workload on systems with the actual AI accelerators.

Comparisons of the emulated accelerators and real accelerators show that AI workloads are extremely similar.

MLPerf Storage addresses the second issue by creating datasets that are similar to actual, production datasets but replicated to be much larger. The AI benchmark supports various data storage technologies like filesystems and object storage as well as multiple data types like serialized numpy arrays, TFRecord files, HDF5 files, and more.

In addition to solving these problems, in a previous blog post with John Mazzie, we showed that the AI workload for training is more complex than many expect – the AI workload is both bursty and latency sensitive.

The MLPerf Storage Benchmark Suite is a great way to exercise AI storage systems in a way that represents real training workloads without requiring expensive AI accelerators while also supporting dataset sizes representative of real-world datasets.

Now we’re proud to announce that the Micron 9400 NVMe SSD supports 17x accelerators in the 3D Medical Imaging benchmark (Unet3D). This translates to 41 samples per second or 6.1 GB/s of IO throughput.

Armed with this AI benchmark that’s easy to run and representative of real training environments, the Micron Data Center Workloads Engineering team will be presenting data across storage devices and solutions so that we can all better understand how to tune and design AI storage to increase accelerator utilization.

To find out more on how the Micron 9400 NVMe SSD can power your business, connect with our Sales Support team today.

Micron 9400 NVMe SSD

SMTS Systems Performance Engineer

Wes Vaske

Wes Vaske is a Senior Member of Technical Staff (SMTS) and Systems Performance Engineer at Micron Technology. With a strong background in storage solutions and AI infrastructure, Wes plays a pivotal role in advancing Micron’s capabilities in data intelligence and machine learning. He is known for his expertise in benchmarking AI training systems and optimizing storage performance to meet the demands of next-generation GPUs. Before joining Micron, Wes was a Systems Engineer at Dell. He holds a Bachelor’s degree from Iowa State University.

Products overview

Search for, filter and download Micron data sheets

Market & Industries overview

AI data center

Partners overview

Learn about and enroll in Micron's Technology Enablement Program (TEP)

Sales & Support overview

Contact Micron's sales support

About overview

Investor Relations overview

Visit Micron's Investor Relations site

Recent Search

The Micron 9400 NVMe SSD is the top PCIe Gen4 for AI storage

Wes Vaske

Related blogs