Several years ago, Appsbroker, Google Cloud and Intel collaborated to build an HPC capability that lets customers showcase and run the largest compute solutions on Google’s public cloud. We work with customers who have the most demanding workloads across Media and Entertainment, Health and Life Sciences, Manufacturing, and Financial Services.
A recurring concern for our customers is whether cloud-hosted servers can perform as well as bare metal. Google showcased its performance on ultramem instances in late 2018 with the STAC-M3 benchmark, which focuses on time-series databases and market tick data:
Can cloud instances perform better than bare metal?
At Appsbroker, we wanted to highlight how the latest 2nd Gen Intel Xeon Scalable processors perform on Google Cloud, using a Financial Services benchmark that has become increasingly relevant in recent years: the STAC-A2 benchmark for derivative risk.
We collaborated with Intel and their STAC-A2 experts to deploy their STAC-A2 pack, which is built on the Intel oneAPI compilers and libraries and can leverage the performance of the latest generation of Intel CPUs. In particular, this means:
Stack under test:
For a deep dive into the benchmark, please visit STAC to access the report.
We chose the Google Cloud c2 compute-optimised instances to ensure the best possible performance. These Compute Engine VMs can achieve a sustained all-core turbo frequency of 3.8 GHz and provide full NUMA transparency into the underlying server architecture.
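As a side note, the NUMA layout a c2 VM exposes can be verified directly from the guest. The short Python sketch below is a generic Linux check, not part of the STAC-A2 pack; it simply lists the NUMA nodes and the CPUs attached to each:

```python
# Generic Linux check: list the NUMA nodes and CPUs the VM exposes via sysfs.
from pathlib import Path

def numa_topology():
    nodes = sorted(Path("/sys/devices/system/node").glob("node[0-9]*"))
    for node in nodes:
        # Each node directory has a 'cpulist' file, e.g. "0-29" or "0-14,30-44".
        cpulist = (node / "cpulist").read_text().strip()
        print(f"{node.name}: CPUs {cpulist}")

if __name__ == "__main__":
    numa_topology()
```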
STAC-A2 is a series of benchmarks that scale across a multi-node cluster, so performance is ultimately limited by the latency and bandwidth of the network interconnect. Following Google Cloud’s best practices for tightly coupled HPC workloads, the 10-node cluster was deployed with a Compact Placement Policy, which co-locates the nodes on nearby physical infrastructure and minimises the latency between them.
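For illustration, a compact placement policy can be created programmatically before the nodes are launched. The sketch below uses the google-cloud-compute Python client; the project, region and policy names are placeholders, not the configuration from the audited run:

```python
# Sketch: create a compact (collocated) placement policy for a 10-node cluster.
from google.cloud import compute_v1

PROJECT = "example-project"   # placeholder project
REGION = "europe-west4"       # placeholder region

client = compute_v1.ResourcePoliciesClient()
policy = compute_v1.ResourcePolicy(
    name="stac-a2-compact",   # placeholder policy name
    group_placement_policy=compute_v1.ResourcePolicyGroupPlacementPolicy(
        collocation="COLLOCATED",  # keep the VMs physically close together
        vm_count=10,               # size of the benchmark cluster
    ),
)
client.insert(
    project=PROJECT,
    region=REGION,
    resource_policy_resource=policy,
).result()  # wait for the policy to be created
```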
Appsbroker automated both the per-node software installation and the deployment of the 10-node cluster on Google Cloud using Infrastructure as Code techniques. This means a powerful cluster, capable of breaking records, can be deployed from a repeatable template and be ready to process derivative risk workloads in under five minutes. We strongly believe that risk workloads, which are spiky by nature and depend on market trading conditions, will benefit from the dynamic deployment patterns that Google Cloud makes available to financial services organisations.
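The sketch below gives a flavour of what such a repeatable template can look like, again using the google-cloud-compute Python client. The machine type matches the c2 instances described above, but the project, zone, image and node names are illustrative placeholders rather than our actual deployment code:

```python
# Sketch: launch a 10-node c2 cluster attached to the compact placement policy.
from google.cloud import compute_v1

PROJECT = "example-project"          # placeholder project / zone
ZONE = "europe-west4-a"
REGION = "europe-west4"
POLICY = "stac-a2-compact"           # the placement policy created earlier
NODES = 10

def build_instance(name: str) -> compute_v1.Instance:
    boot_disk = compute_v1.AttachedDisk(
        boot=True,
        auto_delete=True,
        initialize_params=compute_v1.AttachedDiskInitializeParams(
            # Placeholder image; in practice this would be a pre-baked image
            # with the benchmark software already installed.
            source_image="projects/debian-cloud/global/images/family/debian-12",
            disk_size_gb=100,
        ),
    )
    nic = compute_v1.NetworkInterface(network="global/networks/default")
    return compute_v1.Instance(
        name=name,
        machine_type=f"zones/{ZONE}/machineTypes/c2-standard-60",
        disks=[boot_disk],
        network_interfaces=[nic],
        resource_policies=[
            f"projects/{PROJECT}/regions/{REGION}/resourcePolicies/{POLICY}"
        ],
    )

def main():
    client = compute_v1.InstancesClient()
    operations = [
        client.insert(
            project=PROJECT,
            zone=ZONE,
            instance_resource=build_instance(f"stac-a2-node-{i:02d}"),
        )
        for i in range(NODES)
    ]
    for op in operations:
        op.result()  # wait until every node has been created

if __name__ == "__main__":
    main()
```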
Cost is a key consideration when using services in the cloud. The configuration described above achieved 12,398 options per dollar for burst workloads. With a one-year commitment locked in for these resources, this figure increased to 19,693 options per dollar.
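To show how the metric behaves, the following sketch computes options per dollar from cluster run time and per-node pricing. Every number in it is a hypothetical placeholder rather than one of the audited figures quoted above; it only illustrates why a committed-use discount raises the options-per-dollar result:

```python
# Hypothetical illustration of the "options per dollar" metric; all values
# below are placeholders, not the audited STAC-A2 figures.
NODES = 10
PRICE_PER_NODE_HOUR = 3.00    # placeholder on-demand $/hour per node
COMMIT_DISCOUNT = 0.37        # placeholder 1-year committed-use discount

def options_per_dollar(options_processed: int, runtime_hours: float,
                       price_per_node_hour: float) -> float:
    """Options priced per dollar of compute spent across the whole cluster."""
    total_cost = NODES * price_per_node_hour * runtime_hours
    return options_processed / total_cost

burst = options_per_dollar(1_000_000, 2.0, PRICE_PER_NODE_HOUR)
committed = options_per_dollar(1_000_000, 2.0,
                               PRICE_PER_NODE_HOUR * (1 - COMMIT_DISCOUNT))
print(f"burst: {burst:,.0f} options/$, 1-year commit: {committed:,.0f} options/$")
```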
We achieved two records in the STAC-A2 benchmark and were the first to achieve this benchmark in the public cloud. The two records were:
Thank you to Intel and Google Cloud for collaborating on and supporting this benchmark. We look forward to rerunning it later in the year when the latest Ice Lake (3rd Gen Intel Xeon Scalable) processors are available on Google Cloud. For reduced inter-node latency, we will also take advantage of the new 100 Gbps Tier-1 networking now available on Google Cloud for compute-optimised VMs.
Authors: Geoff Newell (Technical Director), David Young (Technical Lead - HPC)
Referenced Links: