Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance This Dell EMC technical white paper discusses performance benchmarking results and analysis for Simulia Abaqus on the Dell EMC Ready Bundle for HPC. Martin Feyereisen Joshua Weage Dell EMC HPC Innovation Lab January 2018 A Dell EMC Technical Whitepaper Revisions Date Description January 2018 Initial release The information in this publication is provided “as is.” Dell Inc. makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any software described in this publication requires an applicable software license. Copyright © 2018 Dell Inc. or its subsidiaries. All Rights Reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be the property of their respective owners. Published in the USA [1/31/2018] [Technical Whitepaper] Dell believes the information in this document is accurate as of its publication date. The information is subject to change without notice. 2 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 Table of contents Revisions............................................................................................................................................................................. 2 1 Introduction ................................................................................................................................................................... 4 2 System Building Blocks ................................................................................................................................................ 5 2.1 Infrastructure Servers ......................................................................................................................................... 5 2.2 Explicit Building Blocks ....................................................................................................................................... 6 2.3 Implicit Building Blocks ....................................................................................................................................... 6 2.4 Dell EMC NSS-HA Storage ................................................................................................................................ 7 2.5 Dell EMC IEEL Storage ...................................................................................................................................... 8 2.6 System Networks ................................................................................................................................................ 8 2.7 Cluster Software ................................................................................................................................................. 9 2.8 Services and Support ......................................................................................................................................... 9 3 Reference System ...................................................................................................................................................... 10 4 Simulia Abaqus Performance ..................................................................................................................................... 12 5 Conclusion .................................................................................................................................................................. 17 References ........................................................................................................................................................................ 18 3 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 1 Introduction This technical white paper describes the performance of for Dassault Systѐmes’ Simulia Abaqus on the Dell EMC Ready Bundle for HPC Digital Manufacturing, which was designed and configured specifically for Digital Manufacturing workloads, where Computer Aided Engineering (CAE) applications are critical for virtual product development. The Dell EMC Ready Bundle for HPC Digital Manufacturing uses a flexible building block approach, where individual building blocks can be combined to build HPC systems which are ideal for customer specific work-loads and use cases. The individual building blocks are configured to provide good performance for specific application types and workloads common to this industry. The architecture of the Dell EMC Ready Bundle for HPC Digital Manufacturing and a description of the building blocks are presented in Section 2. Section 3 describes the system configuration, software and application versions, and the benchmark test cases that were used to measure and analyze the performance of the Dell EMC HPC Ready Bundle for HPC Digital Manufacturing. Section 4 quantifies the capabilities of the system and presents benchmark performance for Simulia Abaqus. 4 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 2 System Building Blocks The Dell EMC Ready Bundle for HPC Digital Manufacturing is assembled by using preconfigured building blocks. The available building blocks are infrastructure servers, storage, networking, and application specific compute building blocks. These building blocks are preconfigured to provide good performance for typical applications and workloads within the manufacturing domain. The building block architecture allows for a customizable HPC system based on specific end-user requirements, while still making use of standardized, domain-specific building blocks. This section describes the available building blocks along with the rationale of the recommended system configurations. 2.1 Infrastructure Servers The infrastructure servers are used to administer the system and provide user access. They are not typically involved in computation or storage, but they provide services that are critical to the overall HPC system. Typically these servers are the master nodes and the login nodes. For small sized clusters, a single physical server can provide these functions. The infrastructure server can also be used for storage, by using NFS, in which case it must be configured with additional disk drives or an external storage array. One master node is mandatory and is required to deploy and manage the system. If high-availability (HA) functionality is required, two master nodes are necessary. Login nodes are optional and one login server per 30-100 users is recommended. A recommended base configuration for infrastructure servers is: Dell EMC PowerEdge R640 server Dual Intel® Xeon® Bronze 3106 processors 192 GB of memory, 12 x 16GB 2667 MT/s DIMMS PERC H330 RAID controller 1 x 800GB Mixed-use SATA SSD Dell EMC iDRAC9 Enterprise 2 x 750 W power supply units (PSUs) Mellanox EDR InfiniBandTM (optional) The recommended base configuration for the infrastructure server is described here. The PowerEdge R640 server is suited for this role. A cluster will have only a small number of infrastructure servers; therefore, density is not a concern, but manageability is more important. The Intel Xeon 3106 processor, with 8 cores per socket, is sufficient for this role. 192 GB of memory provided by 12x16 GB DIMMs provides sufficient memory capacity, with minimal cost per GB, while also providing good memory bandwidth. These servers are not expected to perform much I/O, so a single Mixed-use SATA SSD should be sufficient for the operating system. For small systems (four nodes or less), an Ethernet network may provide sufficient performance. For most other systems, EDR InfiniBand is likely to be the data interconnect of choice, which provides a high throughput, low latency fabric for node-node communications, or access to a Dell EMC NFS Storage Solution (NSS) or Dell EMC Intel Enterprise Edition for Lustre (IEEL) storage solution. 5 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 2.2 Explicit Building Blocks Explicit Building Block (EBB) servers are typically used for Computational Fluid Dynamics (CFD) and explicit Finite Element Analysis (FEA) solvers such as Simulia Abaqus Explicit, LSTC LS-DYNA, ANSYS Fluent®, Siemens’ STAR-CCM+®, and others. These software applications typically scale well across many processor cores and multiple servers. The memory capacity requirements are typically modest and these solvers perform minimal disk I/O while solving. In most HPC systems used for Digital Manufacturing, the large majority of servers are EBBs. The recommended configuration for EBBs is: Dell EMC PowerEdge C6420 server Dual Intel® Xeon® Gold 6142 processors 192 GB of memory, 12 x 16GB 2667 MT/s DIMMS PERC H330 RAID controller 2 x 480GB Mixed-use SATA SSD in RAID 0 Dell EMC iDRAC9 Enterprise 2 x 1600 W power supply units per chassis Mellanox EDR InfiniBandTM (optional) The recommended configuration for the EBB servers is described here. Because the largest percentage of servers in the majority of systems will be EBB servers, a dense solution is important; therefore, the PowerEdge C6420 server is selected. The Intel Xeon Gold 6142 processor is a 16-core CPU with a base frequency of 2.6 GHz and a maximum all-core turbo frequency of 3.3 GHz. 32 cores per server provides a dense compute solution, with good memory bandwidth per core, and a power of two quantity of cores. The maximum all-core turbo frequency is important because EBB applications are typically CPU bound. This CPU model provides the best balance of CPU cores and core speed. 192 GB of memory using 12x16GB DIMMs provides sufficient memory capacity, with minimal cost per GB, while also providing good memory bandwidth. Relevant applications typically perform limited I/O while solving; therefore, the system is configured with two disks in RAID 0 using the PERC H330 RAID controller, which leaves the PCIe slot available for an EDR InfiniBand HCA. The compute nodes do not require extensive out-of-band (OOB) management capabilities; therefore, an iDRAC9 Express is sufficient. For small systems (four nodes or less), an Ethernet network may provide sufficient performance. For most other systems, EDR InfiniBand is likely to be the data interconnect of choice, which provides a high throughput, low latency fabric for node-node communications, or access to an NSS or IEEL storage solution. 2.3 Implicit Building Blocks Implicit Building Block (IBB) servers are typically used for implicit FEA solvers such as Simulia Abaqus, MSC Nastran, ANSYS MechanicalTM, and others. These applications typically have large memory requirements and do not scale to as many cores as the EBB applications. File system I/O performance can also have a significant effect on application performance. 6 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 The recommended configuration for IBBs is: Dell EMC PowerEdge R640 server Dual Intel® Xeon® Gold 6136 processors 384 GB of memory, 24 x 16GB 2667 MT/s DIMMS PERC H740P RAID controller 4 x 480GB Mixed-use SATA SSD in RAID 0 Dell EMC iDRAC9 Express 2 x 750 W power supply units (PSUs) Mellanox EDR InfiniBandTM (optional) The recommended configuration for the IBB servers is described here. Typically, a smaller percentage of the system will be comprised of IBB servers. Because of the memory and drive recommendations explained here, a 1U PowerEdge R640 server is a good choice. The Intel Xeon Gold 6136 processor is a twelve-core CPU with a base frequency of 3.0 GHz and a max all-core turbo frequency of 3.6 GHz. A memory configuration of 24 x 16 GB DIMMs is used to provide the larger memory capacities needed for these applications. While 384GB is typically sufficient for most CAE workloads, customers expecting to handle very large production jobs should consider increasing the memory capacity to 768GB. IBB applications often have large file system I/O requirements and four Mixed-use SATA SSD’s in RAID 0 are used to provide fast local I/O. The compute nodes do not require extensive OOB management capabilities; therefore, an iDRAC9 Express is recommended. InfiniBand is not typically necessary for IBBs because most uses cases only require running applications on a single IBB; however, an InfiniBand HCA can be added to enable multi-server analysis or to access an NSS or IEEL storage solution. 2.4 Dell EMC NSS-HA Storage The Dell EMC NFS Storage Solution (NSS) provides a tuned NFS storage option that can be used as primary storage for user home directories and application data. The current version of NSS is NSS7.0-HA with options of 240 TB or 480 TB raw disk space. NSS is an optional component and a cluster can be configured without NSS. NSS-HA is a high performance computing network file system (NFS) storage solution from Dell EMC, optimized for performance, availability, resilience, and data reliability. The best practices used to implement this solution result in better throughput compared to non-optimized systems. A high availability (HA) setup, with an active-passive pair of servers, provides a reliable and available storage service to the compute nodes. The HA unit consists of a pair of Dell EMC PowerEdge R730 servers. A Dell EMC PowerVault MD3460 dense storage enclosure provides 240 TB of storage for the file system with 60 x 4 TB, 7.2K near-line SAS drives. This unit can be extended with a PowerVault MD3060e to provide an additional 240 TB of disk space for the 480 TB solution. Each of the PowerVault arrays is configured with 6 virtual disks (VDs). Each VD consists of 10 hard drives configured in RAID6 (8+2). The NFS server nodes are directly attached to the dense storage enclosures by using 12 Gbps SAS connections. NSS7.0-HA provides two network connectivity options for the compute cluster: EDR InfiniBand and 10 Gigabit Ethernet. The active and passive NFS servers run Red Hat Enterprise Linux (RHEL) 7.2 with Red Hat’s Scalable File System (XFS) and Red Hat Cluster Suite to implement the HA feature. 7 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 2.5 Dell EMC IEEL Storage Dell EMC IEEL storage is an Intel Enterprise Edition for Lustre (IEEL) based storage solution consisting of a management station, Lustre metadata servers, Lustre object storage servers, and the associated backend storage. The management station provides end-to-end management and monitoring for the entire Lustre storage system. The Dell EMC IEEL storage solution provides a parallel file system with options of 480 TB or 960 TB raw storage disk space. This solution is typically used for scratch space for larger clusters. Figure 1 Overview of the Dell EMC IEEL Components and Connectivity 2.6 System Networks Most HPC systems are configured with two networks—an administration network and a high-speed/low- latency switched fabric. The administration network is typically Gigabit Ethernet that connects to the onboard LOM/NDC of every server in the cluster. This network is used for provisioning, management and administration. On the EBB and IBB servers, this network will also be used for IPMI hardware management. For infrastructure and storage servers, the iDRAC Enterprise ports may be connected to this network for OOB server management. The heartbeat ports for NSS-HA and IEEL Ethernet management ports may also be connected to this network. The management network typically uses the Dell Networking S3048-ON Ethernet switch. If there is more than one switch in the system, multiple switches will be stacked with 10 Gigabit Ethernet stacking cables. A high-speed/low-latency fabric is recommended for clusters with more than four servers. The current recommendation is an EDR InfiniBand fabric. The fabric will typically be assembled using Mellanox SB7890 36-port EDR InfiniBand switches. The number of switches required depends on the size of the cluster and the blocking ratio of the fabric. 8 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 2.7 Cluster Software The Cluster Software is used to install and monitor the system’s compute servers. Bright Cluster Manager (BCM) is the recommended cluster software. 2.8 Services and Support The Dell EMC Ready Bundle for HPC Digital Manufacturing is available with full hardware support and deployment services, including NSS-HA and IEEL deployment services. 9 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018 3 Reference System The reference system was assembled in the Dell EMC HPC Innovation Lab using the building blocks described in Section 2. The building blocks used for the reference system are listed in Table 1. Table 1. Reference System Configuration Building Block Quantity Infrastructure Server 1 Explicit Building Block 8 with EDR InfiniBand Implicit Building Block 1 Dell Networking S3048-ON 1 Ethernet Switch Mellanox SB7790 EDR 1 InfiniBand Switch The BIOS configuration options used for the reference system are listed in Table 2. Table 2. BIOS Configuration Options BIOS Option Setting Logical Processor Disabled Virtualization Technology Disabled System Profile Performance Optimized 10 Dell EMC Ready Bundle for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance | January 31, 2018
Description: