Why Amazon EC2 P4 Instances?
Amazon Elastic Compute Cloud (Amazon EC2) P4d instances deliver high performance for machine learning (ML) training and high performance computing (HPC) applications in the cloud. P4d instances are powered by NVIDIA A100 Tensor Core GPUs and deliver industry-leading high throughput and low-latency networking. These instances support 400 Gbps instance networking. P4d instances provide up to 60% lower cost to train ML models, including an average of 2.5x better performance for deep learning models compared to previous-generation P3 and P3dn instances.
P4d instances are deployed in hyperscale clusters called Amazon EC2 UltraClusters that comprise high performance compute, networking, and storage in the cloud. Each EC2 UltraCluster is one of the most powerful supercomputers in the world, helping you run your most complex multinode ML training and distributed HPC workloads. You can easily scale from a few to thousands of NVIDIA A100 GPUs in the EC2 UltraClusters based on your ML or HPC project needs.
Researchers, data scientists, and developers can use P4d instances to train ML models for use cases such as natural language processing, object detection and classification, and recommendation engines. They can also use them to run HPC applications such as pharmaceutical discovery, seismic analysis, and financial modeling. Unlike on-premises systems, P4d instances give you access to virtually unlimited compute and storage capacity, let you scale your infrastructure based on business needs, and let you spin up a multinode ML training job or a tightly coupled distributed HPC application in minutes, without any setup or maintenance costs.
Customer testimonials
Here are some examples of how customers and partners have achieved their business goals with Amazon EC2 P4 instances.
Toyota Research Institute (TRI)

TRI-AD

GE Healthcare

HEAVY.AI

Zenotech Ltd.

Aon

Rad AI

Product details
| Instance Size | vCPUs | Instance Memory (GiB) | A100 GPUs | GPU Memory | Network Bandwidth (Gbps) | GPUDirect RDMA | GPU Peer to Peer | Instance Storage (GB) | EBS Bandwidth (Gbps) | On-Demand Price/hr | 1-yr Reserved Instance Effective Hourly * | 3-yr Reserved Instance Effective Hourly * |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| p4d.24xlarge | 96 | 1152 | 8 | 320 GB HBM2 | 400 ENA and EFA | Yes | 600 GB/s NVSwitch | 8 x 1000 NVMe SSD | 19 | $32.77 | $19.22 | $11.57 |
| p4de.24xlarge | 96 | 1152 | 8 | 640 GB HBM2e | 400 ENA and EFA | Yes | 600 GB/s NVSwitch | 8 x 1000 NVMe SSD | 19 | $40.96 | $24.01 | $14.46 |
* Prices shown are for Linux/Unix in the US East (N. Virginia) AWS Region and rounded to the nearest cent. For full pricing details, see Amazon EC2 Pricing.
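To make the pricing figures easier to compare, the following is a minimal sketch that derives a per-GPU hourly price, the Reserved Instance discount relative to On-Demand, and a rough job cost. The prices are copied from the table above; the function names, the `PRICES` dictionary, and the example job size are illustrative assumptions, not part of any AWS API, and actual charges depend on Region, operating system, and current pricing.

```python
# Hourly prices (USD per instance) copied from the product details table:
# Linux/Unix, US East (N. Virginia), rounded to the nearest cent.
PRICES = {
    "p4d.24xlarge": {"on_demand": 32.77, "ri_1yr": 19.22, "ri_3yr": 11.57},
    "p4de.24xlarge": {"on_demand": 40.96, "ri_1yr": 24.01, "ri_3yr": 14.46},
}

# Both instance sizes in the table carry 8 A100 GPUs.
GPUS_PER_INSTANCE = 8


def per_gpu_hourly(instance: str, plan: str) -> float:
    """Hourly price of one GPU, splitting the instance price evenly."""
    return PRICES[instance][plan] / GPUS_PER_INSTANCE


def discount_vs_on_demand(instance: str, plan: str) -> float:
    """Fractional saving of a plan relative to the On-Demand price."""
    prices = PRICES[instance]
    return 1.0 - prices[plan] / prices["on_demand"]


def job_cost(instance: str, plan: str, instances: int, hours: float) -> float:
    """Rough compute cost of a job (ignores storage, data transfer, etc.)."""
    return PRICES[instance][plan] * instances * hours


if __name__ == "__main__":
    for name in PRICES:
        for plan in ("on_demand", "ri_1yr", "ri_3yr"):
            print(
                f"{name:14s} {plan:9s} "
                f"per GPU/hr: ${per_gpu_hourly(name, plan):.2f}  "
                f"saving: {discount_vs_on_demand(name, plan):.0%}"
            )
    # Hypothetical job: 4 instances (32 GPUs) for 10 hours at On-Demand rates.
    print(f"Example job: ${job_cost('p4d.24xlarge', 'on_demand', 4, 10):.2f}")
```

Splitting the instance price evenly across GPUs is only a convenience for comparison; instances are billed as a whole, so a partially used instance costs the same as a fully used one.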
P4d instances are available in the US East (N. Virginia and Ohio), US West (Oregon), Asia Pacific (Seoul and Tokyo), and Europe (Frankfurt and Ireland) Regions. P4de instances are available in the US East (N. Virginia) and US West (Oregon) Regions.
Customers can purchase P4d and P4de instances as On-Demand Instances, Reserved Instances, Spot Instances, Dedicated Hosts, or as part of a Savings Plan.
Getting started with P4d instances for ML
Getting started with P4d instances for HPC
P4d instances are ideal for running engineering simulations, computational finance, seismic analysis, molecular modeling, genomics, rendering, and other GPU-based HPC workloads. HPC applications often require high network performance, fast storage, large amounts of memory, high compute capabilities, or all of these. P4d instances support Elastic Fabric Adapter (EFA), which enables HPC applications that use the Message Passing Interface (MPI) to scale to thousands of GPUs. AWS Batch and AWS ParallelCluster help HPC developers quickly build and scale distributed HPC applications.