This legacy version of AI Platform Training is deprecated and will no longer be available on Google Cloud after January 31, 2025. Migrate your resources to Vertex AI custom training to get new machine learning features that are unavailable in AI Platform.
Stay organized with collections
Save and categorize content based on your preferences.
This page provides detailed reference information about arguments you submit to
AI Platform Training when running a training job using the built-in
BERT algorithm.
Versioning
The built-in BERT algorithm uses TensorFlow 2.3.
Data format arguments
The following arguments are used for data formatting:
Arguments
Details
train_dataset_path
Cloud Storage path to a TFRecord file.
Required Type: String
eval_dataset_path
Cloud Storage path to a TFRecord file. Must have the same format as training_data_path.
Required Type: String
job-dir
Cloud Storage path where model, checkpoints and other training
artifacts reside. The following directories are created here:
model: This contains the trained model
Will also contain model training checkpoints
Required Type: String
Hyperparameters
Hyperparameter
Details
BASIC PARAMETERS
input_meta_data_path
Google Cloud Storage path to an input metadata schema file.
Required Type: String
bert_config_file
Google Cloud Storage path where the BERT config file is stored.
Required Type: String
initial_checkpoint
Starting checkpoint for fine-tuning (usually a pre-trained BERT model.)
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-05 UTC."],[[["This page details the arguments required for running a training job using the built-in BERT algorithm on AI Platform Training, which utilizes TensorFlow 2.3."],["Training and evaluation data must be provided as TFRecord files through the `train_dataset_path` and `eval_dataset_path` arguments, respectively, along with the `job-dir` argument to define the storage for training artifacts."],["The BERT algorithm requires several arguments to be provided for basic parameters, such as `input_meta_data_path`, `bert_config_file`, `initial_checkpoint`, `mode`, and `num_train_epochs`."],["Advanced hyperparameters like `train_batch_size`, `eval_batch_size`, `steps_per_loop`, `learning_rate`, `scale_loss`, and `use_keras_compile_fit` allow for fine-tuning the training process, and each one has a default value."],["This is a beta feature and thus it is subject to the \"Pre-GA Offerings Terms\", and it's available \"as is\" with potentially limited support, in addition to the fact that more information about the availability of the product can be found in the \"launch stage descriptions\" page."]]],[]]