Skip to main content

GCP GKE

When creating a GCP GKE engine, AIchor provisions and manages a Google Kubernetes Engine cluster in the specified GCP project. The cluster lifecycle (upgrades, patches) is handled by AIchor.

Prerequisites

  • A GCP project must exist and be accessible to the service account used for provisioning.
  • A GCP service account must be created by the GCP administrator and granted the permissions required for AIchor to manage cluster resources.
  • The target GCP region must be decided in advance. All GCP regions are supported.
  • Optional: decide whether Google Cloud Filestore should be enabled for file storage.

Steps

  1. In the AIchor UI, open Engines from the left menu and click Add Engine.
  2. Select In The Cloud, then GCP, then GKE, then Create.
  3. Fill in the form fields described below and click Submit.

GCP GKE create form GCP GKE create form

Form fields

FieldRequiredDescription
Engine NameYesLowercase alphanumeric characters and hyphens only. Must start with a letter. Maximum 128 characters.
EcosystemNoTag passed to infrastructure-as-code tooling. Only required for specific organisations on InstaDeep recommendation.
GCP RegionYesDropdown of all available GCP regions. Select the region where the cluster will be provisioned.
GCP Project IDYesThe identifier of the GCP project that will host the cluster.
GCP Service AccountYesService account created by the GCP administrator to grant AIchor access to the cloud provider account.
CIDR RangeNoIP address range used for VPC creation.
FilestoreNoCheckbox. When enabled, Google Cloud Filestore is activated for file storage on the engine.

Engine resources

The Engine Resources section defines the compute capacity available to the engine.

FieldRequiredDescription
Max CPUYes (minimum 12)Maximum number of CPU cores the engine can allocate across all workloads.
Max MemoryYes (minimum 48)Maximum memory in GB the engine can allocate.
GPUsNoOne or more GPU entries. Each entry specifies a GPU type (from the GCP catalogue) and a minimum and maximum count.
TPUsNoOne or more TPU entries. Each entry specifies a TPU type (v4, v5e, v5p, or v6e/Trillium) and a minimum and maximum count.

Supported GPU types:

GPU typeIdentifier
NVIDIA Tesla T4t4
NVIDIA Tesla P4tesla-p4
NVIDIA Tesla P100tesla-p100
NVIDIA Tesla V100v100
NVIDIA A100a100
NVIDIA L4l4

Supported TPU types:

TPU typeIdentifier
TPU v4v4
TPU v5ev5e
TPU v5pv5p
TPU v6e (Trillium)v6e

After submission

After the form is submitted, AIchor begins provisioning the GKE cluster. Once provisioning is complete, the engine status is displayed as READY on the engine management page.