Prices are listed in US Dollars (USD).
This page covers pricing for legacy models on Vertex AI. The models in a legacy model family are no longer updated with new stable versions. For details, see Legacy model information.
PaLM and Codey models
Generative AI on Vertex AI charges per 1,000 characters of input (prompt) and per 1,000 characters of output (response). Characters are counted as UTF-8 code points, and whitespace is excluded from the count. During the Preview stage, charges are 100% discounted. Prediction requests that lead to filtered responses are charged for the input only. At the end of each billing cycle, fractions of one cent ($0.01) are rounded to one cent.
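The counting rule above can be approximated locally to estimate how many characters a prompt or response would be billed for. This is a minimal sketch, not the service's actual metering code. The function name `billable_characters` is hypothetical, and it assumes that "whitespace" means any character for which Python's `str.isspace()` returns true. The exact whitespace set used for billing is not stated here.

```python
def billable_characters(text: str) -> int:
    """Estimate billable characters: code points, excluding whitespace.

    Assumption: whitespace is whatever str.isspace() reports; the set
    used by the billing system may differ.
    """
    # A Python str is a sequence of Unicode code points, so iterating it
    # counts code points rather than UTF-8 bytes.
    return sum(1 for ch in text if not ch.isspace())


if __name__ == "__main__":
    prompt = "Summarize the following text:\n  Vertex AI pricing"
    print(billable_characters(prompt))
```

Counting code points rather than bytes matters for non-ASCII text: an accented letter such as "é" is one code point even though it takes two bytes in UTF-8.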
Model | Type | Region | Price per 1,000 characters |
---|---|---|---|
PaLM 2 for Text (Text Bison) | Input | Global | |
PaLM 2 for Text (Text Bison) | Output | Global | |
PaLM 2 for Text (Text Bison) | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
PaLM 2 for Text (Text Bison) | Reinforcement Learning from Human Feedback | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
PaLM 2 for Text 32k (Text Bison 32k) | Input | Global | |
PaLM 2 for Text 32k (Text Bison 32k) | Output | Global | |
PaLM 2 for Text 32k (Text Bison 32k) | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
PaLM 2 for Text (Text Unicorn) | Input | Global | |
PaLM 2 for Text (Text Unicorn) | Output | Global | |
PaLM 2 for Chat (Chat Bison) | Input | Global | |
PaLM 2 for Chat (Chat Bison) | Output | Global | |
PaLM 2 for Chat (Chat Bison) | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
PaLM 2 for Chat (Chat Bison) | Reinforcement Learning from Human Feedback | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
PaLM 2 for Chat 32k (Chat Bison 32k) | Input | Global | |
PaLM 2 for Chat 32k (Chat Bison 32k) | Output | Global | |
PaLM 2 for Chat 32k (Chat Bison 32k) | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
Codey for Code Generation | Input | Global | |
Codey for Code Generation | Output | Global | |
Codey for Code Generation | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
Codey for Code Generation 32k | Input | Global | |
Codey for Code Generation 32k | Output | Global | |
Codey for Code Generation 32k | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
Codey for Code Chat | Input | Global | |
Codey for Code Chat | Output | Global | |
Codey for Code Chat | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
Codey for Code Chat 32k | Input | Global | |
Codey for Code Chat 32k | Output | Global | |
Codey for Code Chat 32k | Supervised Tuning | us-central1, europe-west4 | $ per node hour (Vertex AI custom training pricing) |
Example cost calculation
If a user sends five separate requests to the PaLM 2 for Text (Text Bison) model, and each request has a 200-character input and a 400-character output, the total charge is calculated as follows:
Input cost:
200 input characters × 5 prompts = 1,000 total input characters;
1,000 total input characters × ($0.00025 / 1,000) = $0.00025 input cost.
Output cost:
400 output characters × 5 prompts = 2,000 total output characters;
2,000 total output characters × ($0.0005 / 1,000) = $0.001 output cost.
Total cost:
$0.00025 input cost + $0.001 output cost = $0.00125 total cost.
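The same arithmetic can be expressed as a short script, which is convenient for estimating other request mixes. This is an illustrative sketch only. The function name `estimate_cost` and the `(input_chars, output_chars, filtered)` tuple layout are hypothetical, and the two rates are the ones used in the worked example above. It also applies the rule that filtered responses are charged for input only, but it does not model end-of-cycle rounding to one cent.

```python
from decimal import Decimal

# Rates per 1,000 characters, taken from the worked example above.
INPUT_PRICE_PER_1K = Decimal("0.00025")
OUTPUT_PRICE_PER_1K = Decimal("0.0005")


def estimate_cost(requests,
                  input_price=INPUT_PRICE_PER_1K,
                  output_price=OUTPUT_PRICE_PER_1K):
    """Return (input_cost, output_cost, total_cost) in USD.

    requests: iterable of (input_chars, output_chars, filtered) tuples
    (a hypothetical layout chosen for this sketch). Character counts are
    assumed to already exclude whitespace.
    """
    requests = list(requests)
    total_input_chars = sum(r[0] for r in requests)
    # Requests with filtered responses are charged for the input only.
    total_output_chars = sum(r[1] for r in requests if not r[2])
    input_cost = total_input_chars * input_price / 1000
    output_cost = total_output_chars * output_price / 1000
    return input_cost, output_cost, input_cost + output_cost


if __name__ == "__main__":
    # Five requests, each with 200 input and 400 output characters.
    five_requests = [(200, 400, False)] * 5
    input_cost, output_cost, total = estimate_cost(five_requests)
    print(f"input=${input_cost} output=${output_cost} total=${total}")
```

`Decimal` is used instead of `float` so that small per-character amounts add up exactly rather than accumulating binary rounding error.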