20x Faster Time to First Token: The HPE Alletra X10000 Edge for AI

In March 2026, HPE, NVIDIA, Kamiwaza and Signal65 published a paper (check here and here) showing  about a 20x acceleration for both time-to-first-token (TTFT) and token generation rate, using an HPE Alletra X10000 to store the KV cache. The storage system used S3 over RDMA to achieve this (and is in fact the first NVIDIA-certified object storage system).

The test has, as of the date of this writing, the most complete benchmark disclosure of all the KV Cache competitive tests I could find. It also consists of an extremely hard workload with high concurrency.

I will explain why all this is important, why it’s different from competitor numbers, plus provide some insights about what this means regarding overall system efficiency.

Because the goal isn’t just to keep GPUs busy. It’s to keep them busy generating new stuff, not recalculating old stuff.

The benefits with this solution are numerous:

  • Far more workload becomes possible but also…
  • One could approach it as a lot less infrastructure is needed due to far more efficient use of the hardware, which means…
  • Lower power and rackspace requirements, which all leads to…
  • Lower Watt/token and lower $/token.
Continue reading “20x Faster Time to First Token: The HPE Alletra X10000 Edge for AI”

Ransomware Detection in the HPE Alletra B10000 – Successfully Recover

With the R5 release of the code for the HPE Alletra MP B10000, among many other nice improvements (for example NAS and a nice new CSI driver), everyone now gets heavy-duty ransomware detection.

We have invented some new and unique ways for detecting modern ransomware that encrypts data in advanced ways (for example intermittent encryption), but most importantly, ways to detect encryption that doesn’t even look like encryption (which defeats other detection methods that rely on fixed entropy detection thresholds). See explainer here for more details on why this is crucial.

The whole point of doing this is early detection.

Continue reading “Ransomware Detection in the HPE Alletra B10000 – Successfully Recover”

The Architectural Benefits of HPE Alletra MP – Plus R4 Coolness

When we first released the new HPE Alletra MP platforms, I wrote a few articles going over the benefits and how the flexible new hardware platform manifests into different “personalities” for high end block and file solutions.

This time I want to take a deeper dive into the architectural benefits of our approach and how the new R4 software for Alletra MP Block enables certain things no other vendor can come close to – plus give a taste of what may be possible in the future given the amazing flexibility of the underlying architecture (it’s a blog, I can’t provide roadmaps here).

I will cover things like fractional multi-dimensional scaling (that allows things impossible with other vendors like adding a single controller node without needing to add capacity) but also resiliency in the face of simultaneous failures that would cripple all other storage systems I’m aware of. It’s not meant to be a comprehensive coverage of everything, but hopefully enough to give you a taste.

Let’s go!

Continue reading “The Architectural Benefits of HPE Alletra MP – Plus R4 Coolness”

HPE Ransomware Detection and Recovery in Zerto 10: Sophistication that Works

Ransomware seems to be at the forefront of many discussions today, and for good reason: The ransomware gangs make a ton of money by causing massive problems to businesses that are in turn losing billions because of this – but most importantly, losing time.

So eventually, like for anything that’s a problem, people tried to find solutions.

The challenge becomes finding what solutions truly address the problem in a realistic way instead of being mostly marketing in order to show a vendor isn’t behind in this area.

Some of you may remember the awesome Chrysler ads with Ricardo Montalban talking about “rich Corinthian Leather”. There is no such thing, the leather came from New Jersey. Corinth in Greece was never known for its prowess in leather anything – but the name sounded cool and different, so marketing went with it, as is their idiom.   I’ll explain how HPE’s Zerto ransomware detection & recovery is truly useful in both detecting modern ransomware and rapidly recovering with a tight RPO. I’ll also show which types of protection are more like Corinthian Leather 🙂

A good example of Corinthian Leather: “Immutable Snapshots”. Practically every serious storage system from the major vendors has this technology, which mostly means locking snaps so that even if ransomware has infected the backup system (and therefore has the permissions to delete snaps, which is the least of the many things ransomware will try and do), the storage system won’t allow the deletion to happen.

Techniques like locking snapshots are, at best, a supplemental form of defense. Some ransomware indeed tries to delete snaps before the hackers demand the ransom – but they have already been encrypting your data for months, so your snaps are also infected…

So if you can’t detect, with accuracy, when encryption started happening, you have no defense and no safe recovery point.

To summarize: Aside from prevention, what’s most useful if you have been infected is:

  • Real-time detection but also…
  • …the ability to detect modern kinds of ransomware that fool methods like standard Shannon entropy detection (for example, encryption that results in compressible data) but also…
  • …the ability to very quickly recover, and with a minimal loss of data (tight RPO and RTO) – not in hours/days but seconds/minutes. Time is money and all that.

Let’s get started:

Continue reading “HPE Ransomware Detection and Recovery in Zerto 10: Sophistication that Works”

HPE GreenLake for File Storage

A critical part of the recent April 4th, 2023 announcements from HPE Storage was the scale-out HPE GreenLake for File Storage.

For the foundational piece explaining the common hardware between the various offerings please go here. For the Block storage piece, here.

The new HPE File offering is based on the HPE Alletra Storage MP hardware, and uses a common management interface for both File and Block, providing a seamless, centralized, multiprotocol management experience.

For the people that like looking at boxes, a small one would look like this:

A Small HPE GreenLake for File Storage System – Compute Separate from Capacity
Continue reading “HPE GreenLake for File Storage”