Pricing for legacy models

Prices are listed in US Dollars (USD).

This page covers pricing for legacy models on Vertex AI. The models in a legacy model family are no longer updated with new stable versions. For details, see Legacy model information.

PaLM and Codey models

Generative AI on Vertex AI charges by every 1,000 characters of input (prompt) and every 1,000 characters of output (response). Characters are counted by UTF-8 code points and whitespace is excluded from the count. During the Preview stage, charges are 100% discounted. Prediction requests that lead to filtered responses are charged for the input only. At the end of each billing cycle, fractions of one cent ($0.01) are rounded to one cent.

Model Type Region Price per 1,000 characters
PaLM 2 for Text (Text Bison) Input Global
  • Online requests: $0.00025
  • Batch requests: $0.00020
Output Global
  • Online requests: $0.0005
  • Batch requests: $0.0004
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
Reinforcement Learning from Human Feedback us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
PaLM 2 for Text 32k (Text Bison 32k) Input Global
  • Online requests: $0.00025
  • Batch requests: $0.00020
Output Global
  • Online requests: $0.0005
  • Batch requests: $0.0004
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
PaLM 2 for Text
(Text Unicorn)
Input Global
  • Online requests: $0.0025
  • Batch requests: $0.0020
Output Global
  • Online requests: $0.0075
  • Batch requests: $0.0060
PaLM 2 for Chat (Chat Bison) Input Global
  • Online requests: $0.00025
Output Global
  • Online requests: $0.0005
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
Reinforcement Learning from Human Feedback us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
PaLM 2 for Chat 32k (Chat Bison 32k) Input Global
  • Online requests: $0.00025*
Output Global
  • Online requests: $0.0005*
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
Codey for Code Generation Input Global
  • Online requests: $0.00025
  • Batch requests: $0.00020
Output Global
  • Online requests: $0.0005
  • Batch requests: $0.0004
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
Codey for Code Generation 32k Input Global
  • Online requests: $0.00025
Output Global
  • Online requests: $0.0005
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
Codey for Code Chat Input Global
  • Online requests: $0.00025
Output Global
  • Online requests: $0.0005
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing
Codey for Code Chat 32k Input Global
  • Online requests: $0.00025
Output Global
  • Online requests: $0.0005
Supervised Tuning us-central1
europe-west4
$ per node hour Vertex AI custom training pricing

Prices are listed in US Dollars (USD).

Example cost calculation

If a user sends five separate requests to the PaLM Text Bison model, and each request has a 200-character input and 400-character output, the total charge is calculated as follows:

Input cost:
200 input characters x 5 prompts = 1,000 total input characters;
1,000 total input characters x ($0.00025 / 1000) = $0.00025 input cost.

Output cost:
400 output characters x 5 prompts = 2,000 total output characters;
2,000 total output characters x ($0.0005 / 1000) = $0.001 output cost.

Total cost:
$0.00025 input cost + $0.001 output cost = $0.00125 total cost.