How did the H20 141G version come about?

The rise of NVIDIA H20 141GB graphics card and HBM3e technology innovation.
Core content:
1. The reasons behind the surge in market demand for H20 141GB graphics cards
2. Analysis of H20 graphics card performance characteristics and market competitiveness
3. HBM3e technology development and its impact on video memory capacity improvement
HBM3e
The first HBM product was launched in 2014. HBM1 is the first to use 3D stacking technology, with a bandwidth of 128GB/s and a capacity of 1GB per stack. It uses unique vertical stacking technology and TSV (Through Silicon Via) technology to closely connect multiple DRAM units and form efficient interconnections with GPUs or CPUs, thus building a large-capacity, high-bandwidth DDR combination array.
Since then, HBM technology has gone through multiple generations of development, with significant improvements in capacity, bandwidth, and data transfer rate. For example, SK Hynix has successfully developed multiple generations of products such as HBM2E, HBM3, and HBM3E; Micron launched HBM2 and HBM2E products, and in 2023, it skipped HBM3 and directly launched HBM3E products; Samsung launched HBM2E, HBM3 and other products, and plans to release the first HBM3E product in 2024. HBM3E further improves bandwidth, latency, energy efficiency, and capacity based on HBM3.
The reason why the video memory capacity reaches 141GB
HBM3e uses more advanced 3D packaging technology and vertical stacking process. By increasing the number of stacked DRAM chip layers, higher storage density is achieved in the same physical space. For example, the storage capacity is increased from 8 layers of HBM2 to 12 layers of HBM3. HBM3e may further optimize the stacking technology on this basis, so that the capacity of a single chip can be increased, thereby achieving a significant increase in total capacity.
Taking H200 as an example, its 141GB HBM3e video memory may be achieved through a combination of multiple high-capacity HBM3e chips. It is speculated that H200 uses 6 24GB HBM3e stacks to form a physical capacity of 144GB of memory, but NVIDIA has reserved part of the capacity for production reasons and finally provides users with 141GB of available video memory. In addition, the memory architecture inside the GPU has also been optimized, including the design of data channels and improvements to the memory controller, so that large-capacity video memory can be efficiently managed and utilized.