Amazon Announces Graviton2 SoC Along With New AWS Instances: 64-Core Arm With Large Performance Uplifts
by Andrei Frumusanu on December 3, 2019 12:30 PM EST
We only recently reported on the story that Amazon are designing a custom server SoC based on Arm’s Neoverse N1 CPU platforms, only for Amazon to now officially announce the new Graviton2 processor as well as AWS instances based on the new hardware.
The new Graviton2 SoC is a custom design by Amazon’s own in-house silicon design teams and is a successor to the first-generation Graviton chip. The new chip quadruples the core count from 16 to 64 cores and employs Arm’s newest Neoverse N1 cores. Amazon is using the highest performance configuration available, with 1MB of L2 cache per core. All 64 cores are connected by a mesh fabric supporting 2TB/s of aggregate bandwidth, and the chip integrates 32MB of L3 cache.
Amazon claims the new Graviton2 chip can deliver up to 7x higher performance than the first-generation A1 instances in total across all cores, up to 2x the performance per core, and up to 5x faster memory access than its predecessor. The chip comes in at a massive 30B transistors on a 7nm manufacturing node. If Amazon is using high-density libraries similar to those of mobile chips (they have no reason to use HPC libraries), then I would estimate the chip to fall around 300-350mm² if I were forced to put out a figure.
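As a quick sanity check on the figures above, the sketch below works out the average transistor density implied by the 300-350mm² estimate, and the per-core uplift that a 7x total gain would require on 4x the cores. The function names are illustrative, the die area is only the estimate given above, and the "up to" figures may come from different workloads, so the per-core number assumes perfect scaling across cores.

```python
# Figures quoted in the article; the die area is the author's own estimate.
TRANSISTORS = 30e9
AREA_ESTIMATE_MM2 = (300.0, 350.0)
GEN1_CORES, GEN2_CORES = 16, 64
TOTAL_UPLIFT = 7.0  # "up to 7x" across all cores


def density_mtr_per_mm2(transistors: float, area_mm2: float) -> float:
    """Average density in millions of transistors per square millimetre."""
    return transistors / area_mm2 / 1e6


def implied_per_core_uplift(total_uplift: float, old_cores: int, new_cores: int) -> float:
    """Per-core uplift needed to reach total_uplift, assuming perfect core scaling."""
    return total_uplift / (new_cores / old_cores)


if __name__ == "__main__":
    for area in AREA_ESTIMATE_MM2:
        density = density_mtr_per_mm2(TRANSISTORS, area)
        print(f"{area:.0f} mm2 -> {density:.1f} MTr/mm2")
    per_core = implied_per_core_uplift(TOTAL_UPLIFT, GEN1_CORES, GEN2_CORES)
    print(f"implied per-core uplift: {per_core:.2f}x")
```

Under these assumptions the estimate corresponds to an average of roughly 86-100 million transistors per mm², and the 7x total figure implies about 1.75x per core, which sits within the "up to 2x" per-core claim.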
The memory subsystem of the new chip is supported by 8 DDR4-3200 channels with support for hardware AES256 memory encryption. Peripherals of the system are supported by 64 PCIe4 lanes.
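To put the I/O figures in perspective, here is a rough calculation of the theoretical peak bandwidth they imply. It assumes the standard 64-bit data bus per DDR4 channel and the standard PCIe 4.0 signalling rate of 16 GT/s per lane with 128b/130b encoding; none of these parameters are stated by Amazon, and real-world throughput will be lower than these peaks.

```python
def ddr_peak_gb_s(channels: int, mega_transfers_per_s: float, bus_width_bits: int = 64) -> float:
    """Theoretical peak DRAM bandwidth in GB/s (assumes a 64-bit bus per channel)."""
    bytes_per_transfer = bus_width_bits / 8
    return channels * mega_transfers_per_s * 1e6 * bytes_per_transfer / 1e9


def pcie4_peak_gb_s(lanes: int, giga_transfers_per_s: float = 16.0,
                    encoding_efficiency: float = 128 / 130) -> float:
    """Theoretical peak PCIe 4.0 bandwidth in GB/s, per direction."""
    # One transfer carries one bit per lane; divide by 8 to convert bits to bytes.
    return lanes * giga_transfers_per_s * encoding_efficiency / 8


if __name__ == "__main__":
    print(f"8x DDR4-3200: {ddr_peak_gb_s(8, 3200):.1f} GB/s")
    print(f"64x PCIe4 lanes: {pcie4_peak_gb_s(64):.1f} GB/s per direction")
```

With those assumptions, eight DDR4-3200 channels come to about 204.8GB/s of peak memory bandwidth, and 64 PCIe4 lanes to about 126GB/s in each direction.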
Amazon also detailed its new 6th generation instances powered by the new processor: M6g, R6g and C6g. They come in various configurations up to the full 64 cores of the chip and up to 512GB of RAM for the memory optimised instance variants. The instances also offer 25Gbps “enhanced networking” connectivity, as well as 18Gbps of bandwidth to EBS (Elastic Block Store).
Amazon is also making some very impressive benchmark comparisons against its fifth-generation instances, which are powered by Intel Xeon Platinum 8175 processors running at up to 2.5GHz:
All of these performance enhancements come together to give these new instances a significant performance benefit over the 5th generation (M5, C5, R5) of EC2 instances. Our initial benchmarks show the following per-vCPU performance improvements over the M5 instances:
- SPECjvm® 2008: +43% (estimated)
- SPEC CPU® 2017 integer: +44% (estimated)
- SPEC CPU 2017 floating point: +24% (estimated)
- HTTPS load balancing with Nginx: +24%
- Memcached: +43% performance, at lower latency
- X.264 video encoding: +26%
- EDA simulation with Cadence Xcellium: +54%
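One way to summarise the list above in a single number is a geometric mean of the per-vCPU ratios, which is the usual way to average speedups. This is only an illustrative summary of Amazon's quoted figures, not a number Amazon published, and it weights all seven workloads equally.

```python
import math

# Per-vCPU improvements over M5, in percent, as quoted by Amazon.
UPLIFT_PERCENT = {
    "SPECjvm 2008": 43,
    "SPEC CPU 2017 integer": 44,
    "SPEC CPU 2017 floating point": 24,
    "HTTPS load balancing with Nginx": 24,
    "Memcached": 43,
    "X.264 video encoding": 26,
    "EDA simulation with Cadence Xcellium": 54,
}


def geometric_mean_ratio(percentages) -> float:
    """Geometric mean of speedup ratios, given uplifts in percent."""
    ratios = [1 + p / 100 for p in percentages]
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


if __name__ == "__main__":
    mean_ratio = geometric_mean_ratio(UPLIFT_PERCENT.values())
    print(f"geometric mean uplift: +{(mean_ratio - 1) * 100:.1f}%")
```

For the seven figures quoted, the geometric mean works out to roughly +36% per vCPU.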
Amazon is making M6g instances with the new Graviton2 processor available in preview for non-production workloads, with a wider rollout expected in 2020.
The announcement is a big win for Amazon and especially for Arm’s endeavours in the server space as they try to surpass the value that the x86 incumbents are able to offer. Amazon states that the new 6g instances are able to offer 40% higher performance/$ than the existing x86 5th generation platforms, which represents some drastic cost savings for the company and its customers.
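To translate the performance/$ claim into a cost figure, the small sketch below computes how much cheaper a fixed amount of work becomes when performance per dollar rises by a given fraction. The function name is illustrative, and the result assumes the 40% figure applies uniformly to the workload in question.

```python
def cost_saving_for_same_work(perf_per_dollar_gain: float) -> float:
    """Fractional cost reduction for a fixed workload, given a perf/$ gain (0.40 = +40%)."""
    # Cost for fixed work scales with the inverse of performance per dollar.
    return 1 - 1 / (1 + perf_per_dollar_gain)


if __name__ == "__main__":
    saving = cost_saving_for_same_work(0.40)
    print(f"cost saving for the same work: {saving * 100:.1f}%")
```

A 40% gain in performance/$ therefore corresponds to about a 28.6% lower bill for the same work, rather than a 40% lower one.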
Related Reading:
- AWS Designing a 32-Core Arm Neoverse N1 CPU for Cloud Servers
- Arm Announces Neoverse N1 & E1 Platforms & CPUs: Enabling A Huge Jump In Infrastructure Performance
- GIGABYTE's Cavium ThunderX2 Systems: 1U R181-T90 and 2U R281-T91
- Assessing Cavium's ThunderX2: The Arm Server Dream Realized At Last
- GIGABYTE's ThunderXStation with Dual Cavium ThunderX2 Arm SoCs
- Investigating Cavium's ThunderX: The First Arm Server SoC With Ambition
- Marvell Completes Acquisition of Cavium, Gets CPU, Networking & Security Assets
- Amazon AWS Offers Another AMD EPYC-Powered Instance: T3a
- Amazon Offers More EPYC: M5ad & R5ad Instances
41 Comments
Raqia - Tuesday, December 3, 2019
I assume those are multi-core figures being quoted against the M5?
Andrei Frumusanu - Tuesday, December 3, 2019
Correct.
Raqia - Tuesday, December 3, 2019
So they're comparing a 24 core x86 Xeon to a 64 core Neoverse implementation.
PeachNCream - Tuesday, December 3, 2019
But it can tow the rear drive only Xeon uphill with an only slightly obvious rolling start so it's clearly better until Intel requests Amazon send a system over to them for an "apples-to-apples" compairson.
andrewaggb - Tuesday, December 3, 2019
Unless I'm misunderstanding something it sounds like it'll have worse perf/$ than epyc and not be x64.
shompa - Tuesday, December 3, 2019
Not being X64 is great. Why use fake 64bit extensions that need a 32bit CPU core to work when you can use real 64bit and remove the whole 32bit CPU block and save energy and die space.
scineram - Wednesday, December 4, 2019
No.
kallinteris - Wednesday, December 4, 2019
what do you mean by " use fake 64bit extensions that need a 32bit CPU core to work when you can use real 64bit"
all modern x86 programs are compiled for 64 bit anyway
vanilla_gorilla - Tuesday, December 3, 2019
And probably at lower cost and power usage.
SarahKerrigan - Tuesday, December 3, 2019
How do you figure? It says "per vCPU." A vCPU is a single thread.