What are you looking for ?
Infinidat
Articles_top

Stellus Emerges From Stealth With Data Platform That Drives Massive Data Access Throughputs

Introduces Key-Value over Fabrics technology to accelerate performance for unstructured data.

Stellus Technologies announced its Data Platform, based onKey-Value over Fabrics technology, that can efficiently accelerate by 10x the storing and accessing of the world’s unstructured data.

The platform delivers groundbreaking read and write performance for file and object-based data, necessary for the most demanding data-intensive workloads and applications such as cognitive AI, machine learning, high-speed microscopy, genomic sequencing, and data streaming applications.

“The explosive pace of data creation is driving the need for massive storage and processing of structured and unstructured Big Data in cloud-native environments,” said Young Sohn, president and chief strategy officer, Samsung Electronics and chairman of Stellus. “The introduction of this innovative new-generation file system by Stellus will unleash the full performance of industry-leading memory and storage created by Samsung, and deliver the flexibility and scalability of cloud-native architectures for customers and partners.”

The Data Platform is a software as a service-based data system that leverages cloud, c ore and edge infrastructures to enable hyperscale throughput for digital enterprises. Key-Value over Fabrics technology can make unstructured data access faster and more resource efficient than other data storage architectures available today. It allows businesses to capture, store and process increasingly large data sets from an increasing number of data sources, which is critical to power new parallel data analysis workloads.

“We see significant demand from customers looking for the unique combination of Key-Value over Fabrics technology, high-throughput Key-Value Stores and algorithmic data locality,” said Jeff Treuhaft, CEO at Stellus. “Stellus Data Platform turns storage investments into a massively parallel ‘data computer’ that consistently delivers fast parallel data access
performance that scales efficiently for core, edge or cloud-native infrastructures.”

Unlike traditional file systems for unstructured data that support scale-up or scale-out storage strategies, the Stellus Data Platform allows for scale-through performance and efficiency. It uses Key-Value over Fabrics technology to scale-through the hybrid cloud, core and edge infrastructures struggling to keep unstructured data flowing quickly and efficiently enough to the most critical workloads.

ESG conducted a technical evaluation to validate the consistent performance of the scale-through architecture, which can be found here (video and whitepaper).

“Stellus gives companies a choice of throughput or capacity or both, all within the same key-value-based platform. It’s no longer about choosing to scale-up or scale-out. Stellus’s ‘Scale-Through’ technology provides digital enterprises with a clear and unprecedented path to efficiently and effectively manage unstructured data at massive throughput and scale. Stellus brings hyperscale capabilities to every digital enterprise, eliminating the requirement to compromise,” said Steve Duplessie, ESG founder and senior analyst.

Comments

Company's profile

Company

Location
HQ  in San Jose, CA, offices in Irvine and India
 
Date founded
2016
 
Financial funding
$70 million in series A with investor Samsung owner of 100% of the start-up
 
Main executives 
  • TreuhaftJeff Treuhaft, president and CEO, held executive product management, marketing, and corporate development positions at Fusion-io, Zetta, Verisign, and Netscape.

 

 

  • Bala Ganeshan, PhD - CTO and VP engineering, held the CTO roles at EMC and QLogic. Additionally, he has held research roles as assistant professor at Virginia Tech and architect role at AT&T and Delta Air Lines.
  • Ken Grohe, CRO and SVP, was president of SignNow, SVP and GM at Barracuda, CRO at Virident (WDC acquired) and spent 25 years at EMC, culminating in running the flash division.
Number of employees
100+
 
Technology
The start-up created  new file system - taking a hyperscale approach to managing the increasing amounts of unstructured data. Stellus Data Platform, based on the industry’s first Key-Value over Fabrics technology, can accelerate by 10x the storing and accessing of the world’s unstructured data. The platform delivers fast read and write performance for file and object-based data, necessary for the most demanding data-intensive workloads and applications such as cognitive AI, machine learning, high-speed microscopy, genomic sequencing, and data streaming applications. 
 
Product
The platform is a software-as-a-service-based data system that leverages cloud, core and edge infrastructures to enable hyperscale throughput for digital enterprises. Key-Value over Fabrics technology makes unstructured data access faster and more resource efficient than other data storage architectures. It allows businesses to capture, store and process increasingly large data sets from an increasing number of data sources, which is critical to power new parallel data analysis workloads.
 
Release date
February 5, 2020
 
Partners/distributors/ISV/OEMs
Logicalis, Scalelogic, Colfax International, Thompson Partners, ServIQ, StorExel, Procurri, CDI, and IT Creations, as well as Spectra Logic, Arista and Nvidia
 
Customers
Including UCLA, National Pediatric Cancer Foundation, and ALT Systems
 
Applications
The product is designed for demanding data-intensive workloads and applications, such as high-speed microscopy, genomic sequencing, machine learning, cognitive AI, and data streaming for cloud, edge and core infrastructures.
 
Target market
Organizations looking for better ways to manage growing amounts of data will benefit from Stellus, the platform is suited for organizations that deal with large amounts of unstructured data and rely heavily on that data to make strategic business decisions -- including M&E, life sciences and industrial IoT verticals.
 
Competitors
NetApp and EMC Isilon


Our comments:

Stellus claims to play in high-performance data systems with a memory native file system delivering a new offering in file storage. Interestingly it received a significant investment of Samsung illustrating a form of system development try for the South-Korean company.

Via the launch of the company, we had access to the presentation and we're surprised by the file system listing the company promotes indicating a strange view of the market. It mixes disk file systems like ZFS and WAFL with parallel ones like GPFS and Lustre and OneFS as a distributed one (but not parallel). It also creates confusion as some of them are external file system (Lustre, GPFS, etc.), we mean used by applications servers directly and other internal (OneFS) i.e not seen and touched by these machines, in that case EMC Isilon exposes NAS protocols (NFS and/or SMB on top). Also, several high performance file systems (external and internal in our terminology) are missing such WekaIO WekaFS, Panasas PanFS, VAST Data 'Transaction File System' or Qumulo QFS, and this slide doesn't give us a good feeling. As Stellus promotes Key-Values, it would have been good to mention Pure Storage FlashBlade with its KV layer embedded below its NFS service. The comment on the right of the slide is incredible as WekaIO, Qumulo and VAST Data designed their file system with flash in mind and not HDD. Why this miss?

Stellus with their Data Platform (SDP) promotes a shared-everything architecture with Data Managers (DM) - their access node or compute server (1U) - able to access to Key Value Stores (KVS) nodes - their storage server (1U) - via NVMe-oF. DMs are stateless and the minimum configuration is 2 DMs coupled to 2 KVS nodes for a capacity of 369TB raw with 8TB NF1 NVMe SSD (we suppose). The maximum capacity is 1.4PB raw. A KVS node provides 32 SSDs for a native capacity of 256TB but 184TB effective and each SSD manages 4 KVS so in total 128 KVS per node. It shows an interesting disaggregated model than tends to grow on the market. Also the design offers a high degree of parallelism between DMs and KVS nodes and even to several KVS on each KVS node.

The web site and presentation don't mention access methods, try to find NFS for instance, on the web site. We understand that SDP supports NFS, SMB and S3 and we didn't get confirmation of a Posix client.

For data protection, SPD uses erasure coding model, a good fit when file system and file size are bigs.

In term of performance, SDP delivers above founders expectations with 80GB/s of throughput coupling an interesting more than 40GB/s of reads and writes at the same time. This performance is delivered with 4 Data Managers and the SDP440 model. We understand that performance direction is bandwidth and not IO/s, at least today, as only throughput numbers are given, meaning that the company targets some vertical use cases like M&E, oil and gas, life sciences, etc., and some HPC related applications with large file sizes and large I/Os. The system seems to be pretty linear as DMs are added (SPD 220 - 420 - 820) with a decoupling of performance and capacity (SPD 280 - 480 - 880), the first number representing the number of DMs and the two other the number of KV Store to offer the capacity.

In the middle, between DMs and KV Store, Stellus supports a NVMe-oF via Ethernet switch interconnect.

To illustrate such performance need, the company targets 2 use cases in M&E and life sciences but again the interface between applications servers and DMs are not mentioned. As a file storage vendor it could be NAS protocols such NFS and/or SMB but also a Posix client.

As Kinetic in the past and several other vendors' developments, we anticipate that Samsung will offer KV drives in the future able to connect directly in such Stellus architecture. We remember drives with embedded RocksDB or LevelDB.

Read also:
Stealthy Start-Up Stellus Technologies in SSDs for Servers
100% subsidiary of Samsung
May 5, 2016 | Press Release
A newcomer Stellus Technologies
August 23, 2019

Articles_bottom
AIC
ATTO
OPEN-E