
Samsung CXL Solutions: CMM-H (CXL Memory Module-Hybrid) Device

Features the firm's high-performance DRAM, coupled with NAND flash and a CXL Type 3 interface.

From Samsung Semiconductor

As AI and machine learning solutions continue to be deployed across data center infrastructures, it is important to balance compute, memory, and storage resources so that they best support language model processing performance while keeping the cost of those resources under control.

Compute Express Link (CXL) Memory Module-Hybrid (CMM-H)

In May 2021, Samsung announced the development of the industry’s first Compute Express Link (CXL) Memory Module-DRAM (CMM-D). CMM-D addresses server-bound memory capacity limitations by supporting memory expansion and pooling. The CMM-D device is currently sampling to customers.

The next product in the CMM family portfolio is the Compute Express Link (CXL) Memory Module-Hybrid (CMM-H), which was first introduced at FMS’22 as the Memory-Semantic SSD. Hybrid means that the CMM-H device mixes media types, specifically DRAM and NAND flash. The CMM-H controller manages the DRAM and NAND resources to support two use cases: (1) DRAM persistence and (2) memory tiering. In both, the host processor addresses the CMM-H device as a single memory space, intelligently integrated with the host DRAM. Applications currently being developed on the CMM-H platform include (1) memory persistence for in-memory databases, (2) tiered memory for data analytics and AI inference models, and (3) memory optimization to improve memory utilization for better TCO across data center infrastructures.

[Figure: Samsung CMM-H architecture scheme]

What Is CMM-H?
The CMM-H device features Samsung’s high-performance DRAM, coupled with NAND flash, and a CXL Type 3 (1) interface.

These technological characteristics are combined to offer a cost-effective memory expansion device. The motivation behind CMM-H is to combine NAND flash capacity with the CXL load/store memory interface. It appears to the host as large NUMA nodes within the existing Linux kernel framework and integrates with applications without requiring modification. For applications that prioritize capacity, TCO, and throughput over random access latency, such as in-memory databases and AI inferencing of large language models, CMM-H is a suitable design choice. As a side benefit, it comes with built-in data persistence to minimize downtime during data recovery.
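
As a rough illustration of what "presented as NUMA nodes" means on the host side, the sketch below lists NUMA nodes from Linux sysfs and flags nodes that have memory but no CPUs. On Linux, CXL-attached memory that is onlined as system RAM typically shows up as such a CPU-less node. Whether a particular CMM-H device is enumerated this way is an assumption here, and the function name `list_numa_nodes` is purely illustrative.

```python
import glob
import os
import re


def list_numa_nodes(sysfs_root="/sys/devices/system/node"):
    """Return one dict per NUMA node found under sysfs_root, sorted by node id."""
    nodes = []
    for node_dir in glob.glob(os.path.join(sysfs_root, "node[0-9]*")):
        match = re.search(r"node(\d+)$", node_dir)
        if not match:
            continue
        node_id = int(match.group(1))

        # cpulist is empty for memory-only nodes.
        try:
            with open(os.path.join(node_dir, "cpulist")) as f:
                cpulist = f.read().strip()
        except OSError:
            cpulist = ""

        # meminfo contains a line such as "Node 1 MemTotal:  263168000 kB".
        mem_total_kb = None
        try:
            with open(os.path.join(node_dir, "meminfo")) as f:
                for line in f:
                    m = re.search(r"MemTotal:\s+(\d+)\s+kB", line)
                    if m:
                        mem_total_kb = int(m.group(1))
                        break
        except OSError:
            pass

        nodes.append({
            "node": node_id,
            "cpulist": cpulist,
            "mem_total_kb": mem_total_kb,
            # Heuristic only: memory present, no CPUs attached.
            "cpuless": cpulist == "" and bool(mem_total_kb),
        })
    return sorted(nodes, key=lambda n: n["node"])


if __name__ == "__main__":
    for n in list_numa_nodes():
        kind = "memory-only (possibly CXL)" if n["cpuless"] else "CPU + memory"
        print(f"node{n['node']}: {n['mem_total_kb']} kB, {kind}")
```

A CPU-less node is only a hint: other memory-only configurations produce the same signature, so the result should be cross-checked against the platform's own device listing.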


CMM-H features and benefits
Traditionally, adding memory capacity and bandwidth to a system involves increasing the number of native CPU memory channels. But adding memory channels to a CPU increases engineering complexity and drives up cost. A CXL Type 3 memory expansion device provides a flexible way to increase both memory capacity and memory bandwidth without increasing the number of primary CPU memory channels.

CMM-H tiered memory feature
The tiered memory model offers an architectural answer to the problem of keeping memory in step with rapidly evolving processor and accelerator speeds. By positioning frequently accessed data closer to the processing units, it expands effective memory capacity while improving cost efficiency. In other words, keeping data in memory close to where it is processed enables faster data processing, lower power requirements, and reduced TCO.

CMM-H can be used to expand the available memory in two ways. First, CMM-H can be used in the same tier as DRAM in the memory hierarchy. Alternatively, CMM-H can be used one tier below the main memory (DRAM) as a swap space. The goal of CMM-H tiered memory is a CXL-based memory module that combines a small amount of DRAM with a large capacity of NAND. Because CMM-H uses NAND on the back end, its persistent memory mode provides large-capacity, non-volatile memory at an affordable cost. Such CMM-H persistent memory solutions can target Intel Optane as well as NVDIMM customers.
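
The tiering idea above, keeping the most frequently accessed data in the faster tier, can be sketched with a few lines of Python. This is a toy placement policy under simple assumptions (static access counts, fixed-size pages, a fast tier measured in pages); it is not how the Linux kernel or the CMM-H controller decides placement, and the names `place_pages` and `fast_tier_hit_ratio` are hypothetical.

```python
def place_pages(access_counts, fast_tier_pages):
    """Split pages into (fast, slow) sets, hottest pages first.

    access_counts: dict mapping page id -> number of accesses observed.
    fast_tier_pages: how many pages fit in the fast tier.
    """
    # Sort by descending access count; break ties by page id for determinism.
    ranked = sorted(access_counts, key=lambda p: (-access_counts[p], p))
    fast = set(ranked[:fast_tier_pages])
    slow = set(ranked[fast_tier_pages:])
    return fast, slow


def fast_tier_hit_ratio(access_counts, fast):
    """Fraction of all accesses that land in the fast tier."""
    total = sum(access_counts.values())
    if total == 0:
        return 0.0
    return sum(access_counts[p] for p in fast) / total


if __name__ == "__main__":
    # Illustrative, made-up access counts with a skewed distribution.
    counts = {"a": 90, "b": 5, "c": 3, "d": 2}
    fast, slow = place_pages(counts, fast_tier_pages=1)
    print(sorted(fast), sorted(slow), fast_tier_hit_ratio(counts, fast))
```

With a skewed access pattern like the one in the example, a fast tier holding a quarter of the pages serves most of the accesses, which is the intuition behind pairing a small DRAM tier with a large NAND tier.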

CMM-H device memory cache feature
A key element of CMM-H is its built-in DRAM cache, designed to mitigate the long latency associated with NAND flash. A CMM-H device performs the device cache function in an application-agnostic manner. It also provides a facility by which applications or workloads that are aware of the device can give it hints to improve overall performance: the Host Hints module provides an API through which host software and applications can optionally send heatmap hints to the device to improve device cache performance. The CXL.mem protocol also provides 64-byte cache-line access granularity, much finer than the block granularity of a conventional SSD, which can benefit AI applications.

CMM-H persistent memory feature
The CMM-H device supports an NVM type, in other words, a CXL-based, large-capacity persistent memory (PMEM) solution. In PMEM mode, the CMM-H device supports two options: (1) Global Persistent Flush (GPF) and (2) Sudden Power Loss (SPL). The device supports the full CXL GPF protocol: when it receives a GPF message, it immediately starts flushing data to the backend SSD. When the device detects a sudden power loss, it immediately flushes all device cache data to the backend SSD.
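
The DRAM cache and the flush-on-GPF behavior described above can be pictured with a toy model: a small write-back cache of 64-byte lines in front of a large backing store, with a `flush()` that writes every dirty line back. Everything in this sketch is an assumption made for illustration: the LRU eviction policy, whole-line stores, and the `hint_hot` method, which only stands in for the idea of a host hint and is not the Host Hints API.

```python
from collections import OrderedDict

LINE_SIZE = 64  # bytes; the CXL.mem access granularity mentioned in the text


class HybridDeviceModel:
    """Toy model: small write-back DRAM cache in front of a NAND store."""

    def __init__(self, cache_lines):
        if cache_lines < 1:
            raise ValueError("cache must hold at least one line")
        self.cache_lines = cache_lines
        self.cache = OrderedDict()  # line address -> (data, dirty flag)
        self.nand = {}              # line address -> data (persistent)
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _line(addr):
        return addr - (addr % LINE_SIZE)

    def _evict_if_full(self):
        # Assumed policy: evict least recently used; write back if dirty.
        while len(self.cache) >= self.cache_lines:
            victim, (data, dirty) = self.cache.popitem(last=False)
            if dirty:
                self.nand[victim] = data

    def _fill(self, line):
        self._evict_if_full()
        self.cache[line] = (self.nand.get(line, bytes(LINE_SIZE)), False)

    def load(self, addr):
        line = self._line(addr)
        if line in self.cache:
            self.hits += 1
            self.cache.move_to_end(line)
        else:
            self.misses += 1
            self._fill(line)
        return self.cache[line][0]

    def store(self, addr, data):
        # Simplification: stores always write one full 64-byte line.
        if len(data) != LINE_SIZE:
            raise ValueError("store expects exactly one full line")
        line = self._line(addr)
        if line in self.cache:
            self.hits += 1
        else:
            self.misses += 1
            self._evict_if_full()
        self.cache[line] = (data, True)
        self.cache.move_to_end(line)

    def hint_hot(self, addr):
        """Hypothetical hint: bring a line into the cache ahead of use."""
        line = self._line(addr)
        if line not in self.cache:
            self._fill(line)
        self.cache.move_to_end(line)

    def flush(self):
        """Model of a GPF or power-loss flush: persist all dirty lines."""
        flushed = 0
        for line, (data, dirty) in list(self.cache.items()):
            if dirty:
                self.nand[line] = data
                self.cache[line] = (data, False)
                flushed += 1
        return flushed
```

In this model, data written with `store()` lives only in the cache until it is evicted or `flush()` runs, which is why a flush triggered by GPF or by power-loss detection is what makes the contents persistent.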

CMM-H memory pooling and switching
The CXL 2.0 specification used for CMM-H also supports single-level switching and memory pooling. Memory pooling increases overall system efficiency by allowing memory resources to be allocated and deallocated dynamically. It also reduces stranded memory, that is, memory installed in one server that sits unused while other servers run short, a common problem in server systems.
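
To make the pooling idea concrete, here is a minimal sketch of a shared pool from which hosts borrow capacity and return it when finished. It models only the bookkeeping. The CXL switch, fabric management, and the actual CMM-H interfaces are not represented, and the class and method names are hypothetical.

```python
class MemoryPool:
    """Toy shared pool: hosts borrow capacity on demand and return it later."""

    def __init__(self, capacity_gb):
        self.capacity_gb = capacity_gb
        self.allocated = {}  # host name -> GB currently held

    @property
    def free_gb(self):
        return self.capacity_gb - sum(self.allocated.values())

    def allocate(self, host, size_gb):
        """Grant size_gb to host if available; return True on success."""
        if size_gb <= 0:
            raise ValueError("size_gb must be positive")
        if size_gb > self.free_gb:
            return False
        self.allocated[host] = self.allocated.get(host, 0) + size_gb
        return True

    def release(self, host, size_gb):
        """Return size_gb from host back to the pool."""
        held = self.allocated.get(host, 0)
        if size_gb <= 0 or size_gb > held:
            raise ValueError("host does not hold that much memory")
        remaining = held - size_gb
        if remaining:
            self.allocated[host] = remaining
        else:
            del self.allocated[host]


if __name__ == "__main__":
    pool = MemoryPool(capacity_gb=512)
    print(pool.allocate("host-a", 384))  # host-a takes most of the pool
    print(pool.allocate("host-b", 256))  # refused: not enough free capacity
    pool.release("host-a", 256)          # host-a's peak demand has passed
    print(pool.allocate("host-b", 256))  # the same capacity now serves host-b
    print(pool.free_gb)
```

The point of the example is that the 256 GB released by one host is immediately usable by another, whereas the same capacity installed as local DRAM in the first server would have remained stranded there.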


Samsung’s Memory Module solutions are forging the next frontier for AI, ML, and LLM processing. Taking more of the data processing and placing it in and around the memory modules is reshaping the way computing will be done in this new AI era.

(1) CXL defines three device types. Type 1 devices are caching devices such as accelerators and SmartNICs. Type 2 devices are accelerators such as GPUs and FPGAs that have their own attached memory, such as DDR or HBM. Type 3 devices are memory expansion devices that allow host processors to access CXL device memory cache-coherently through CXL.mem transactions.
