What Local Storage Solution Should I Use for a Project
Compute Engine offers several types of storage options for your instances. Each of the following storage options has unique price and performance characteristics:
- Zonal persistent disk: Efficient, reliable block storage.
- Regional persistent disk: Regional block storage replicated in two zones.
- Local SSD: High performance, transient, local block storage.
- Cloud Storage buckets: Affordable object storage.
- Filestore: High performance file storage for Google Cloud users.
If you are not sure which option to use, the most common solution is to add a persistent disk to your instance.
Introduction
By default, each Compute Engine instance has a single boot persistent disk (PD) that contains the operating system. When your apps require additional storage space, you can add one or more additional storage options to your instance. For cost comparisons, see disk pricing.
Zonal standard PD | Regional standard PD | Zonal balanced PD | Regional balanced PD | Zonal SSD PD | Regional SSD PD | Zonal extreme PD | Local SSDs | Cloud Storage buckets | |
---|---|---|---|---|---|---|---|---|---|
Storage type | Efficient and reliable block storage | Efficient and reliable block storage with synchronous replication across two zones in a region | Cost-effective and reliable block storage | Cost-effective and reliable block storage with synchronous replication across two zones in a region | Fast and reliable block storage | Fast and reliable block storage with synchronous replication across two zones in a region | Highest performance persistent block storage option | High performance local block storage | Affordable object storage |
Minimum capacity per disk | 10 GB | 200 GB | 10 GB | 10 GB | 10 GB | 10 GB | 500 GB | 375 GB | n/a |
Maximum capacity per disk | 64 TB | 64 TB | 64 TB | 64 TB | 64 TB | 64 TB | 64 TB | 375 GB | n/a |
Capacity increment | 1 GB | 1 GB | 1 GB | 1 GB | 1 GB | 1 GB | 1 GB | 375 GB | n/a |
Maximum capacity per instance | 257 TB* | 257 TB* | 257 TB* | 257 TB* | 257 TB* | 257 TB* | 257 TB* | 9 TB | Almost infinite |
Scope of access | Zone | Zone | Zone | Zone | Zone | Zone | Zone | Instance | Global |
Data redundancy | Zonal | Multi-zonal | Zonal | Multi-zonal | Zonal | Multi-zonal | Zonal | None | Regional, dual-regional or multi-regional |
Encryption at rest | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Custom encryption keys | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes |
How-to | Add a standard persistent disk | Add a regional standard persistent disk | Add a balanced persistent disk | Add a regional balanced persistent disk | Add an SSD persistent disk | Add a regional SSD persistent disk | Add an extreme persistent disk | Add a local SSD | Connect a bucket |
In addition to the storage options that Google Cloud provides, you can deploy alternative storage solutions on your instances.
- Create a file server or distributed file system on Compute Engine to use as a network file system with NFSv3 and SMB3 capabilities.
- Mount a RAM disk within instance memory to create a block storage volume with high throughput and low latency.
Block storage resources have different performance characteristics. Consider your storage size and performance requirements to help you determine the correct block storage type for your instances.
Zonal standard PD | Regional standard PD | Zonal balanced PD | Regional balanced PD | Zonal SSD PD | Regional SSD PD | Zonal extreme PD | Local SSD (SCSI) | Local SSD (NVMe) | |
---|---|---|---|---|---|---|---|---|---|
Maximum sustained IOPS | |||||||||
Read IOPS per GB | 0.75 | 0.75 | 6 | 6 | 30 | 30 | – | – | – |
Write IOPS per GB | 1.5 | 1.5 | 6 | 6 | 30 | 30 | – | – | – |
Read IOPS per instance | 7,500* | 7,500* | 15,000–80,000* | 15,000–60,000* | 15,000–100,000* | 15,000–60,000* | 15,000–120,000* | 900,000 | 2,400,000 |
Write IOPS per instance | 15,000* | 15,000* | 15,000–80,000* | 15,000–30,000* | 15,000–100,000* | 15,000–30,000* | 15,000–120,000* | 800,000 | 1,200,000 |
Maximum sustained throughput (MB/s) | |||||||||
Read throughput per GB | 0.12 | 0.12 | 0.28 | 0.28 | 0.48 | 0.48 | – | – | – |
Write throughput per GB | 0.12 | 0.12 | 0.28 | 0.28 | 0.48 | 0.48 | – | – | – |
Read throughput per instance | 240–1,200* | 240–1,200* | 240–1,200* | 240–1,200* | 240–1,200* | 240–1,200* | 240–2,200** | 9,360 | 9,360 |
Write throughput per instance | 76–400** | 38–200** | 240–1,200* | 120–600* | 240–1,200* | 120–600* | 240–2,200** | 4,680 | 4,680 |
* Persistent disk IOPS and throughput performance depends on disk size, instance vCPU count, and I/O block size, among other factors.
Attaching a disk to multiple virtual machine instances in read-only mode does not affect aggregate performance or cost. Each machine gets a share of the per-disk performance limit.Persistent disks created in multi-writer mode have specific IOPS and throughput limits. See performance of persistent disks in multi-writer mode for details.
Persistent disks
Persistent disks are durable network storage devices that your instances can access like physical disks in a desktop or a server. The data on each persistent disk is distributed across several physical disks. Compute Engine manages the physical disks and the data distribution for you to ensure redundancy and optimal performance.
Persistent disks are located independently from your virtual machine (VM) instances, so you can detach or move persistent disks to keep your data even after you delete your instances. Persistent disk performance scales automatically with size, so you can resize your existing persistent disks or add more persistent disks to an instance to meet your performance and storage space requirements.
Add a persistent disk to your instance when you need reliable and affordable storage with consistent performance characteristics.
Add a persistent disk to your instance
Disk types
When you configure a persistent disk, you can select one of the following disk types.
- Standard persistent disks (
pd-standard
) are backed by standard hard disk drives (HDD). - Balanced persistent disks (
pd-balanced
) are backed by solid-state drives (SSD). They are an alternative to SSD persistent disks that balance performance and cost. - SSD persistent disks (
pd-ssd
) are backed by solid-state drives (SSD). - Extreme persistent disks (
pd-extreme
) are backed by solid-state drives (SSD). With consistently high performance for both random access workloads and bulk throughput, extreme persistent disks are designed for high-end database workloads. Unlike other disk types, you can provision your desired IOPS. For more information, see Extreme persistent disks.
If you create a disk in the Cloud Console, the default disk type is pd-balanced
. If you create a disk using the gcloud
tool or the Compute Engine API, the default disk type is pd-standard
.
The following table shows disk type and machine type support for zonal persistent disks.
Disk type | Supported machine types |
---|---|
pd-standard | All machine types |
pd-balanced | All machine types |
pd-ssd | All machine types |
pd-extreme | n2-standard with 64 or more vCPUs, m1-megamem-96, m2-ultramem-208, m2-ultramem-416 |
The following table shows disk type and machine type support for regional persistent disks:
Disk type | Supported machine types |
---|---|
pd-standard | N1, N2, N2D, E2 |
pd-balanced | N1, N2, N2D, E2 |
pd-ssd | N1, N2, N2D, E2 |
Durability
Disk durability represents the probability of data loss, by design, for a typical disk in a typical year, using a set of assumptions about hardware failures, the likelihood of catastrophic events, isolation practices and engineering processes in Google data centers, and the internal encodings used by each disk type. Persistent disk data loss events are extremely rare and have historically been the result of coordinated hardware failures, software bugs, or a combination of the two. Google also takes many steps to mitigate the industry-wide risk of silent data corruption. Human error by a Google Cloud customer, such as when a customer accidentally deletes a disk, is outside the scope of persistent disk durability.
There is a very small risk of data loss occurring with a regional persistent disk due to its internal data encodings and replication. Regional persistent disks provide twice as many replicas as zonal persistent disks, with their replicas distributed between two zones in the same region, so they provide high availability and can be used for disaster recovery if an entire data center is lost and cannot be recovered (although that has never happened). The additional replicas in a second zone can be accessed immediately if a primary zone becomes unavailable during a long outage.
Note that durability is in the aggregate for each disk type, and does not represent a financially-backed service level agreement (SLA).
The table below shows durability for each disk type's design. 99.999% durability means that with 1,000 disks, you would likely go a hundred years without losing a single one.
Zonal standard persistent disk | Zonal balanced persistent disk | Zonal SSD persistent disk | Zonal extreme persistent disk | Regional standard persistent disk | Regional balanced persistent disk | Regional SSD persistent disk |
---|---|---|---|---|---|---|
Better than 99.99% | Better than 99.999% | Better than 99.999% | Better than 99.9999% | Better than 99.999% | Better than 99.9999% | Better than 99.9999% |
Zonal persistent disks
Ease of use
Compute Engine handles most disk management tasks for you so that you do not need to deal with partitioning, redundant disk arrays, or subvolume management. Generally, you don't need to create larger logical volumes, but you can extend your secondary attached persistent disk capacity to 257 TB per instance and apply these practices to your persistent disks if you want. You can save time and get the best performance if you format your persistent disks with a single file system and no partition tables.
If you need to separate your data into multiple unique volumes, create additional disks rather than dividing your existing disks into multiple partitions.
When you require additional space on your persistent disks, resize your disks and resize your single file system rather than repartitioning and formatting.
Performance
Persistent disk performance is predictable and scales linearly with provisioned capacity until the limits for an instance's provisioned vCPUs are reached. For more information about performance scaling limits and optimization, see Block storage performance.
Standard persistent disks are efficient and economical for handling sequential read/write operations, but they aren't optimized to handle high rates of random input/output operations per second (IOPS). If your apps require high rates of random IOPS, use SSD or extreme persistent disks. SSD persistent disks are designed for single-digit millisecond latencies. Observed latency is application specific.
Compute Engine optimizes performance and scaling on persistent disks automatically. You don't need to stripe multiple disks together or pre-warm disks to get the best performance. When you need more disk space or better performance, resize your disks and possibly add more vCPUs to add more storage space, throughput, and IOPS. Persistent disk performance is based on the total persistent disk capacity attached to an instance and the number of vCPUs that the instance has.
For boot devices, you can reduce costs by using a standard persistent disk. Small, 10 GB persistent disks can work for basic boot and package management use cases. However, to ensure consistent performance for more general use of the boot device, use a balanced persistent disk as your boot disk.
Each persistent disk write operation contributes to the cumulative network egress traffic for your instance. This means that persistent disk write operations are capped by the network egress cap for your instance.
Reliability
Persistent disks have built-in redundancy to protect your data against equipment failure and to ensure data availability through datacenter maintenance events. Checksums are calculated for all persistent disk operations, so we can ensure that what you read is what you wrote.
Additionally, you can create snapshots of persistent disks to protect against data loss due to user error. Snapshots are incremental, and take only minutes to create even if you snapshot disks that are attached to running instances.
Multi-writer mode
You can attach an SSD persistent disk in multi-writer mode to up to two N2 VMs simultaneously so that both VMs can read and write to the disk. Persistent disks in multi-writer mode provide a shared block storage capability and present an infrastructural foundation for building distributed Network File System (NFS) and similar highly available services. However, persistent disks with multi-writer mode require specialized file systems such as GlusterFS or GFS2. Many file systems such as EXT4, XFS, and NTFS are not designed to be used with shared block storage. For more information about the best practices when sharing persistent disks between VMs, see Best practices. If you require a fully managed file storage, you can mount a Filestore file share on your Compute Engine VMs.
To enable multi-writer mode for new persistent disks, create a new persistent disk and specify the --multi-writer
flag in the gcloud
tool or the multiWriter
property in the Compute Engine API. For more information, see Sharing persistent disks between VMs.
Persistent disk encryption
Compute Engine automatically encrypts your data before it travels outside of your instance to persistent disk storage space. Each persistent disk remains encrypted either with system-defined keys or with customer-supplied keys. Google distributes persistent disk data across multiple physical disks in a manner that users do not control.
When you delete a persistent disk, Google discards the cipher keys, rendering the data irretrievable. This process is irreversible.
If you want to control the encryption keys that are used to encrypt your data, create your disks with your own encryption keys.
Restrictions
-
You cannot attach a persistent disk to an instance in another project.
-
You can attach a balanced persistent disk to at most 10 VM instances in read-only mode.
-
For custom machine types or predefined machine types with a minimum of 1 vCPU, you can attach up to 128 persistent disks.
-
Each persistent disk can be up to 64 TB in size, so there is no need to manage arrays of disks to create large logical volumes. Each instance can attach only a limited amount of total persistent disk space and a limited number of individual persistent disks. Predefined machine types and custom machine types have the same persistent disk limits.
-
Most instances can have up to 128 persistent disks and up to 257 TB of total persistent disk space attached. Total persistent disk space for an instance includes the size of the boot persistent disk.
-
Shared-core machine types are limited to 16 persistent disks and 3 TB of total persistent disk space.
-
Creating logical volumes larger than 64 TB might require special consideration. For more information see larger logical volume performance.
Regional persistent disks
Regional persistent disks have storage qualities that are similar to zonal persistent disks. However, regional persistent disks provide durable storage and replication of data between two zones in the same region.
If you are designing robust systems or high availability services on Compute Engine, use regional persistent disks combined with other best practices such as backing up your data using snapshots. Regional persistent disks are also designed to work with regional managed instance groups.
In the unlikely event of a zonal outage, you can usually failover your workload running on regional persistent disks to another zone by using the --force-attach
flag. The --force-attach
flag lets you attach the regional persistent disk to a standby VM instance even if the disk can't be detached from the original VM due to its unavailability. To learn more, see Regional persistent disk failover. You cannot force attach a zonal persistent disk to an instance.
Performance
Regional persistent disks are designed for workloads that require a lower Recovery Point Objective (RPO) and Recovery Time Objective (RTO) compared to using persistent disk snapshots.
Regional persistent disks are an option when write performance is less critical than data redundancy across multiple zones.
Like zonal persistent disks, regional persistent disks can achieve greater IOPS and throughput performance on instances with a greater number of vCPUs. For more information about this and other limitations, see SSD persistent disk performance limits.
When you need more disk space or better performance, you can resize your regional disks to add more storage space, throughput, and IOPS.
Reliability
Compute Engine replicates data of your regional persistent disk to the zones you selected when you created your disks. The data of each replica is spread across multiple physical machines within the zone to ensure redundancy.
Similar to zonal persistent disks, you can create snapshots of persistent disks to protect against data loss due to user error. Snapshots are incremental, and take only minutes to create even if you snapshot disks that are attached to running instances.
Restrictions
- You can't use a regional persistent disk with a memory-optimized, compute-optimized, or accelerator-optimized machine type VM.
- You cannot use regional persistent disks as boot disks.
- You can create a regional persistent disk from a snapshot but not an image.
- The minimum size of a regional standard persistent disk is 200 GB.
- When resizing a regional persistent disk, you can only increase its size.
- Regional persistent disks perform differently from zonal persistent disks. For more information, see Block storage performance.
Local SSDs
Local SSDs are physically attached to the server that hosts your VM instance. Local SSDs have higher throughput and lower latency than standard persistent disks or SSD persistent disks. The data that you store on a local SSD persists only until the instance is stopped or deleted. Each local SSD is 375 GB in size, but you can attach a maximum of 24 local SSD partitions for a total of 9 TB per instance.
Create an instance with Local SSDs when you need a fast scratch disk or cache and don't want to use instance memory.
Create an instance with Local SSDs
Performance
Local SSDs are designed to offer very high IOPS and low latency. Unlike persistent disks, you must manage the striping on local SSDs yourself. Combine multiple local SSD partitions into a single logical volume to achieve the best local SSD performance per instance, or format local SSD partitions individually.
Local SSD performance depends on which interface you select. Local SSDs are available through both SCSI and NVMe interfaces.
The following table provides an overview of local SSD capacity and estimated performance using NVMe. To reach maximum performance limits with an N1 machine type, use 32 or more vCPUs. To reach maximum performance limits on an N2 and N2D machine type, use 24 or more vCPUs.
Storage space | Partitions | IOPS | Throughput (MB/s) | ||||
---|---|---|---|---|---|---|---|
Read | Write | Read | Write | ||||
3 TB | 8 | 680,000 | 360,000 | 2,650 | 1,400 | ||
6 TB | 16 | 1,600,000 | 800,000 | 6,240 | 3,120 | ||
9 TB | 24 | 2,400,000 | 1,200,000 | 9,360 | 4,680 |
For more information, see Local SSD performance and Optimizing Local SSD performance.
Local SSD encryption
Compute Engine automatically encrypts your data when it is written to local SSD storage space. You can't use customer-supplied encryption keys with local SSDs.
Data persistence on Local SSDs
Please read Local SSD data persistence to learn what events preserve your Local SSD data and what events can cause your Local SSD data to be unrecoverable.
General limitations
-
You can create an instance with 16 or 24 local SSD partitions for 6 TB or 9 TB of local SSD space, respectively. This is available on instances with N1, N2, N2D, and custom machine types. To reach the maximum IOPS limits, use a VM instance with 32 or more vCPUs.
-
Instances with shared-core machine types can't attach any local SSD partitions.
-
You can't attach Local SSDs to E2, N2A, T2D and M2 machine types.
Local SSDs and machine types
You can attach Local SSDs to most machine types available on Compute Engine, unless otherwise noted. However, there are constraints around how many local SSDs you can attach based on each machine type:
N1 machine types | Number of local SSD partitions allowed per VM instance |
---|---|
All N1 machine types | 1 to 8, 16, or 24 |
N2 machine types | |
Machine types with 2 to 10 vCPUs, inclusive | 1, 2, 4, 8, 16, or 24 |
Machine types with 12 to 20 vCPUs, inclusive | 2, 4, 8, 16, or 24 |
Machine types with 22 to 40 vCPUs, inclusive | 4, 8, 16, or 24 |
Machine types with 42 to 80 vCPUs, inclusive | 8, 16, or 24 |
N2D machine types | |
Machine types with 2 to 16 vCPUs, inclusive | 1, 2, 4, 8, 16, or 24 |
Machine types with 32 or 48 vCPUs | 2, 4, 8, 16, or 24 |
Machine types with 64 or 80 vCPUs | 4, 8, 16, or 24 |
Machine types with 96 to 224 vCPUs, inclusive | 8, 16, or 24 |
C2 machine types | |
Machine types with 4 or 8 vCPUs | 1, 2, 4, or 8 |
Machine types with 16 vCPUs | 2, 4, or 8 |
Machine types with 30 vCPUs | 4 or 8 |
Machine types with 60 vCPUs | 8 |
A2 machine types | |
a2-highgpu-1g | 1, 2, 4, or 8 |
a2-highgpu-2g | 2, 4, or 8 |
a2-highgpu-4g | 4 or 8 |
a2-highgpu-8g or a2-megagpu-16g | 8 |
M1 machine types | |
m1-ultramem-40 | Not available |
m1-ultramem-80 | Not available |
m1-megamem-96 | 1 to 8 |
m1-ultramem-160 | Not available |
E2, N2A, T2D and M2 machine types | These machine types don't support local SSD drives. |
Local SSDs and preemptible VM instances
You can start a preemptible VM instance with a local SSD and Compute Engine charges you discounted spot prices for the local SSD usage . Local SSDs attached to preemptible instances work like normal local SSDs, retain the same data persistence characteristics, and remain attached for the life of the instance.
Compute Engine doesn't charge you for local SSDs if their instances are preempted in the first minute after they start running.
For more information about local SSDs, see Adding local SSDs.
Reserving Local SSDs with committed use discounts
To reserve Local SSD resources in a specific zone, see Reserving zonal resources. Reservations are required for committed-use pricing for Local SSDs.
Cloud Storage buckets
Cloud Storage buckets are the most flexible, scalable, and durable storage option for your VM instances. If your apps don't require the lower latency of Persistent Disks and Local SSDs, you can store your data in a Cloud Storage bucket.
Connect your instance to a Cloud Storage bucket when latency and throughput aren't a priority and when you must share data easily between multiple instances or zones.
Connect your instance to a Cloud Storage bucket
Performance
The performance of Cloud Storage buckets depends on the storage class that you select and the location of the bucket relative to your instance.
The standard storage class used in the same location as your instance gives performance that is comparable to persistent disks but with higher latency and less consistent throughput characteristics. The standard storage class used in a multiregional location stores your data redundantly across at least two regions within a larger multiregional location.
Nearline and coldline storage classes are primarily for long-term data archival. Unlike the standard storage class, these archival classes have minimum storage durations and read charges. Consequently, they are best for long-term storage of data that is accessed infrequently.
Reliability
All Cloud Storage buckets have built-in redundancy to protect your data against equipment failure and to ensure data availability through datacenter maintenance events. Checksums are calculated for all Cloud Storage operations to help ensure that what you read is what you wrote.
Flexibility
Unlike persistent disks, Cloud Storage buckets aren't restricted to the zone where your instance is located. Additionally, you can read and write data to a bucket from multiple instances simultaneously. For example, you can configure instances in multiple zones to read and write data in the same bucket rather than replicate the data to persistent disks in multiple zones.
Furthermore, you can mount a Cloud Storage bucket to your instance as a file system. Mounted buckets function similarly to a persistent disk when you read or write files. However, Cloud Storage buckets are object stores that don't have the same write constraints as a POSIX file system and can't be used as boot disks. Your instance can write data to a file and overwrite critical data from other instances that are also writing data to the storage object simultaneously.
Cloud Storage encryption
Compute Engine automatically encrypts your data before it travels outside of your instance to Cloud Storage buckets. You don't need to encrypt files on your instances before you write them to a bucket.
Just like persistent disks, you can encrypt buckets with your own encryption keys.
What's next
- Add a persistent disk to your instance.
- Add a regional persistent disk to your instance.
- Create an instance with Local SSDs.
- Connect your instance to a Cloud Storage bucket.
- Create a file server or distributed file system.
- Review the quotas for disks.
- Mount a RAM disk on your instance.
If you're new to Google Cloud, create an account to evaluate how Compute Engine performs in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.
Try Compute Engine free
What Local Storage Solution Should I Use for a Project
Source: https://cloud.google.com/compute/docs/disks/
0 Response to "What Local Storage Solution Should I Use for a Project"
Post a Comment