High-performance compute nodes for deep learning in medical imaging and light propagation modeling (DL-Platform)

General information

Description
High-performance compute nodes for deep learning in medical imaging and light propagation modeling
Acquisition year
2022 and 2025
Co-financier
Package22 (Acquisition year: 2022)
Slovenian Research and Innovation Agency (ARIS)
Co-financing extent: 50%
Public procurement portal: 005761/2022
 
Package23 (Acquisition year: 2025)
Slovenian Research and Innovation Agency (ARIS)
Acquisition value 92,567.50 EUR
Co-financing extent: 50%

Parallel computing system

The new equipment will be part of the program group Biomedical image and signal analysis in the Laboratory of Imaging Technologies (LIT), and will enable new and in-depth research in the field of analysis of large amounts of medical data. For example, to analyze the database UK BioBank , which includes 150 TB of medical data for more than 50 thousand people, such as electronic medical records, MRI imaging examinations of the head and heart, etc. With new equipment and the most advanced deep learning techniques, we develop and evaluate diagnostic and prognostic predictive models for a wide variety of medical conditions and pathologies, both based on MRI images and unstructured data. We focused on predictive models for neurological, cardiovascular and musculoskeletal diseases and various types of cancer.
An alternative to new equipment is the co-use of domestic supercomputer networks, such as SLING and services of cloud service providers abroad (Amazon AWS, Google, Oracle, etc.). However, the transfer and storage of medical data to a remote computer center or cloud is neither secure nor economically justified due to security risks in the management of personal data and contractual restrictions, as well as due to the large amount of data.
Deep machine learning methods are computationally intensive and demonstrate capabilities that are approximately directly proportional to the amount of free parameters (e.g. the number of layers of the neural network) and the amount of processed training data. The analysis of multidimensional structured data such as 3D or 4D medical images is particularly demanding from the computational and memory perspective. Massive parallel computer systems are used for this purpose. Such systems are effective for the analysis of 3D or 4D medical images only if they are associated with large amounts of working memory, since the convergence of learning predictive models from such images is critically determined by the selected number of model layers (depth) and the number of samples in the package (batch size).

Important components

  1. One Supermicro AS-2124GQ server unit with components (Package22)

    1. two AMD EPYC™ 7453 computational processing units, each with 28 cores

    2. 512 GB of working memory PC4-25600 DRR4 ECC

    3. two NVMe SSD disks with a capacity of 3.84 TB on the PCIe 4.0 x4 bus

    4. four graphics processing units NVIDIA® A100 SXM4 GPU 80GB connected by NVLink bus

    5. two Intel X550-T2 Ethernet network connectors with a nominal bandwidth of 10 GB/s

    6. two redundant power supplies, each 2200W

  2. Powerful server with optical network interface (Package23)

    1. two AMD EPYC 9355 compute processing units, each with 32 cores

    2. 1 TB of DDR5-6400 memory

    3. two NVMe SSD disks with a capacity of 15.36 TB on the PCIe 5.0 bus

    4. NVIDIA H200 NVL graphics processing unit with 141 GB of memory

    5. two Broadcom 57508 QSFP56 optical network connectors with a nominal bandwidth of 100 Gb/s

    6. two Intel X550-T2 Ethernet network ports with a nominal bandwidth of 10 Gb/s

    7. four power supplies, each 2600 W

  3. Powerful network infrastructure for connection to existing NAS system and servers (Package23)

    1. backbone network switch with 32 QSFP56 100Gb ports - FS S8550-32C

    2. router with two QSFP56 100Gb and 12 SFP28 25Gb ports - CCR2216-1G-12XS-2XQ

    3. access switches with 48 10 Gb Ethernet ports and 8 QSFP56 100 Gb ports

Access to equipment

Access to the equipment is possible only by prior arrangement and at a time when the equipment is not occupied by conducting ongoing research.

Price list

The cost of using the system depends on the complexity of preparation. The informative price of using the equipment with the operator is 200 EUR/h.