Home Big Data Object Storage a ‘Complete Cop Out,’ Hammerspace CEO Says. ‘You All Obtained Duped’

Object Storage a ‘Complete Cop Out,’ Hammerspace CEO Says. ‘You All Obtained Duped’

Object Storage a ‘Complete Cop Out,’ Hammerspace CEO Says. ‘You All Obtained Duped’


(NAS CREATIVES/Shutterstock)

The mass adoption of object storage methods like Amazon S3 might look like a prime achievement of the large knowledge period, since we received primarily limitless storage accessible by means of REST instructions. However to Hammerspace CEO David Flynn, object storage is a “whole cop out” that perpetuates knowledge orchestration complications. “That was the cut price you made with the satan,” Flynn mentioned on the latest HPC + AI on Wall Avenue occasion.

The trade standardized on object storage as a result of the market discovered it too troublesome to do the appropriate factor: turning normal NFS right into a distributed file system able to dealing with cloud scale, Flynn mentioned September 27 throughout his keynote handle at Tabor Communications’ 2023 HPC + AI on Wall Avenue convention in New York Metropolis.

“That is all about fixing the evil that file methods weren’t in a position to do cloud-scale,” Flynn mentioned. “Let’s face it: all the motive Amazon created S3–Tremendous Easy Storage–was as a result of file methods are so rattling laborious to make and even tougher to make them scale that they dumbed it down and mentioned ‘Let’s simply use REST interfaces from consumer area and never attempt to get that top bar of getting the OS know learn how to truly mount it and devour knowledge from it.’

“Object storage was all the time a cop out, and also you guys all received duped into considering you need to rewrite all your purposes to make use of it simply to get to cloud,” he continued. “And it’s as a result of they didn’t do the laborious work of creating an actual file system that OSes natively know learn how to use, and that’s nonetheless thought-about the traditional knowledge. ‘Cloud means rewrite every thing to make use of object storage and speak to persistence from consumer area.’

Hammerspace CEO David Flynn at HPC + AI on Wall Avenue September 27, 2023

“That could be a whole cop out,” Flynn mentioned. “It’s throwing the child out with the bathtub water. However they needed to do it. There was no different method to host the a number of tenants and the large quantity of scale that was wanted within the cloud. However that was the cut price you made with the satan to get to the cloud as you rewrite that stuff.”

These have been sturdy phrases, however with a long time of expertise within the enterprise IT and HPC markets, Flynn has discovered a factor or two about excessive efficiency storage. Earlier than co-founding flash storage startup Fusion-io, which was acquired by SanDisk for $1.1 billion 2014, Flynn helped construct among the world’s largest supercomputer at Linux Networx. These travels confirmed Flynn a major hole exists between the enterprise storage market and the HPC neighborhood, which he proposes to assist fill along with his knowledge orchestration startup, Hammerspace.

AI’s Good Storm

The appearance of AI has created a “excellent storm” of wants that may exhibit that the item storage compromise is now not adequate, Flynn mentioned.

Everyone now wants HPC capabilities to coach AI fashions, however distributed file methods are the one method to effectively handle I/O and preserve GPU nodes saturated. The present established order with object storage, which requires customers to rewrite their purposes from native NFS to make use of HTTP instructions (i.e. REST) over the community, doesn’t minimize it.

Information I/O stays a bottleneck to AI

Whereas NFS is the defacto normal within the enterprise NAS enterprise and is natively supported in Linux, NFS has had its share of false begins. The group behind NFS tried to handle the statefulness situation with NFS 4.0, however ended up butchering the file system, Flynn mentioned.

“NFS 4 took the entire sins of NFS 3–the statelessness and all of that–and tried to treatment it by including statefulness so that you could cache stuff and get it extra environment friendly,” he mentioned. “Nevertheless it was retrofitted in each the purchasers and the servers. And also you ended up with the entire overheads and evils of statelessness mixed with the overheads and evils of statefulness.”

For instance, having a program create a file after which write it so one other program on the identical laptop learn it, took 5 spherical journeys serial to the filer in NFS 3. With NFS, that jumped to fifteen serial spherical journeys, Flynn mentioned.

“They sort of tousled once they launched all of the statefulness as a result of it wasn’t effectively tuned to essentially be exploited by the purchasers and servers collectively, and it ended up simply being huge overhead,” Flynn mentioned. “We nonetheless pay the value in the present day. You wish to go to NFS 4 to perhaps get higher safety or different issues, and but you find yourself paying an enormous value in a efficiency perspective. Principally, NFS has from inception all the time sort of sucked.”

After he left Fusion-io, Flynn vowed to discover a method to treatment this example. He teamed up with Trond Myklebust, the maintainer and lead developer of the Linux kernel NFS consumer, and co-founded a brand new enterprise Hammerspace. Led by Myklebust, a Hammerspace co-founder and its CTO, the staff took what was successfully an instructional challenge to develop a parallel model of NFS led by Los Alamos Nationwide Lab and turned it into an enterprise-ready product.

“We launched the NFS 4.2 spec. That got here from my staff right here at Hammerspace,” Flynn mentioned. “Because of Trond, we have been in a position to trick it out to really actually work effectively, particularly with our NFS 4.2 parallel NFS server.”

A World Storage Abstraction

Growing a parallel model of NFS was essential to Hammerspace, however it was however one step on its general journey to the last word purpose: creating a brand new storage abstraction that eliminates the limitation of bodily storage and places knowledge again within the driver’s seat.

As an information orchestration layer, Hammerspace primarily connects any storage infrastructure, whether or not it speaks NFS, object storage, and even block storage, to an software. Whether or not it’s a NAS system from NetApp, Dell EMC, Qumulo, or Huge–whether or not it’s an Amazon S3 bucket or a Linux server configured as a storage node–Hammerspace can take all that knowledge and make it seem to an software to be sitting on native storage, even when the info is sitting on the opposite facet of the world.

Simply as characters in Japanese cartoons can pull a mallet out of skinny air and whack their opponents within the head, Hammerspace turns into the skinny air (or the only world namespace) out of which you’ll pull any piece of knowledge (simply don’t hit your opponents within the head with it).

The important thing to this seeming magic trick is unified metadata, Flynn mentioned.

“Hammerspace solves the seeming paradox of how are you going to have knowledge in every single place and anyplace you want it, with out ever having copied it?” Flynn mentioned. “That’s not as a result of it’s not native and accessed at native entry efficiency. It’s as a result of the metadata is unified. So whereas the info is bodily distributed, the metadata is logically unified.”

Pulling the metadata out of the storage layer into a brand new layer that transcends these different layers addresses a litany of knowledge administration ache factors, Flynn mentioned. Information migration turns into a factor of the previous when you’ll be able to hook up Hammerspace to an current community file system and immediately begin accessing it by means of an software beforehand linked to a special storage repository, Flynn mentioned.

Hammerspace implements a world knowledge abstraction layer atop bodily storage

“When was the final time you had to consider having your knowledge once you transfer from one cellphone to the following, or once you go out of your cellphone to your laptop computer or pill?” he mentioned. “Our client knowledge already lives in a Hammerspace and the iOS platform, the Android platform have mainly orchestrated your expertise and your knowledge for you. That’s what we’re speaking about doing right here, however for the petabytes to exabytes scale unstructured knowledge that’s behind every thing.”

The headache of sustaining a excessive availability server configuration will probably be a factor of the previous when Hammerspace is making certain that knowledge from one node could be accessed from one other node. “You possibly can do away with all of those completely different types of copy, like your knowledge copy sync packages, the info migration duties, once you’re going from one system to a different,” Flynn mentioned.

In the long run, Flynn hopes that Hammerspace’s notion of knowledge orchestration flips the script on fashionable concepts of knowledge administration. As an alternative letting knowledge storage outline what knowledge is and what it means to us, the info orchestration layer defines the info as soon as, and makes the place it’s saved a mere implementation element.

“Information orchestration differs from knowledge administration in a quite simple manner: Information administration is what you do from the surface of the info presentation layer,” Flynn mentioned. “The true evil is the truth that the file system is embedded within the storage system or service. The info presentation layer being inside that storage system signifies that the info is de facto nothing however a mirage that’s being rendered by that storage. And if you happen to put it in numerous storage, it’s by definition completely different knowledge, as a result of its very existence is an artifact of the storage system or service presenting it.”

It appears very pure to say that your knowledge exists in your NAS filer or your S3 bucket, as a result of that’s what possesses the metadata and presents the info to you, Flynn mentioned. However that’s truly getting it backward, as a result of knowledge, by definition, is a better stage abstraction than storage, which is simply infrastructure. And therein lies the important thing factor that Hammerspace permits.

“Information doesn’t exist aside from as rendered by storage,” Flynn mentioned. “That appears very pure to say, however it’s the other way up. It signifies that the infrastructure is in cost and the platform layer depends upon it. However with Hammerspace, that modifications since you pull the entire metadata out of the storage layer into one thing that may transcend any of these storage methods. So now you will have the place knowledge can sit appropriately as the upper stage factor that you simply concentrate on, and the place the info truly lives, on which storage, can change.”

You possibly can view Flynn’s complete HPC and AI on Wall Avenue presentation by registering at www.hpcaiwallstreet.com.

Associated Objects:

Hammerspace Raises $56M to Reimagine Information Orchestration

Three Methods to Join the Dots in a Decentralized Huge Information World

Hammerspace Hits the Market with World Parallel File System





Please enter your comment!
Please enter your name here