Claude Code Google Vertex Ai
Prerequisites
Before configuring Claude Code with Vertex AI, ensure you have:
- A Google Cloud Platform (GCP) account with billing enabled
- A GCP project with Vertex AI API enabled
- Access to desired Claude models (e.g., Claude Sonnet 4.5)
- Google Cloud SDK (
gcloud) installed and configured - Quota allocated in desired GCP region
Region Configuration
Claude Code can be used with both Vertex AI global and regional endpoints.
Note: Vertex AI may not support the Claude Code default models on all regions. You may need to switch to a supported region or model.
Note: Vertex AI may not support the Claude Code default models on global endpoints. You may need to switch to a regional endpoint or supported model.
Setup
1. Enable Vertex AI API
Enable the Vertex AI API in your GCP project:
# Set your project ID
gcloud config set project YOUR-PROJECT-ID
# Enable Vertex AI API
gcloud services enable aiplatform.googleapis.com
2. Request model access
Request access to Claude models in Vertex AI:
- Navigate to the Vertex AI Model Garden
- Search for "Claude" models
- Request access to desired Claude models (e.g., Claude Sonnet 4.5)
- Wait for approval (may take 24-48 hours)
3. Configure GCP credentials
Claude Code uses standard Google Cloud authentication.
For more information, see Google Cloud authentication documentation.
Note: When authenticating, Claude Code will automatically use the project ID from the
ANTHROPIC_VERTEX_PROJECT_IDenvironment variable. To override this, set one of these environment variables:GCLOUD_PROJECT,GOOGLE_CLOUD_PROJECT, orGOOGLE_APPLICATION_CREDENTIALS.
4. Configure Claude Code
Set the following environment variables:
# Enable Vertex AI integration
export CLAUDE_CODE_USE_VERTEX=1
export CLOUD_ML_REGION=global
export ANTHROPIC_VERTEX_PROJECT_ID=YOUR-PROJECT-ID
# Optional: Disable prompt caching if needed
export DISABLE_PROMPT_CACHING=1
# When CLOUD_ML_REGION=global, override region for unsupported models
export VERTEX_REGION_CLAUDE_3_5_HAIKU=us-east5
# Optional: Override regions for other specific models
export VERTEX_REGION_CLAUDE_3_5_SONNET=us-east5
export VERTEX_REGION_CLAUDE_3_7_SONNET=us-east5
export VERTEX_REGION_CLAUDE_4_0_OPUS=europe-west1
export VERTEX_REGION_CLAUDE_4_0_SONNET=us-east5
export VERTEX_REGION_CLAUDE_4_1_OPUS=europe-west1
Note: Prompt caching is automatically supported when you specify the
cache_controlephemeral flag. To disable it, setDISABLE_PROMPT_CACHING=1. For heightened rate limits, contact Google Cloud support.
Note: When using Vertex AI, the
/loginand/logoutcommands are disabled since authentication is handled through Google Cloud credentials.
5. Model configuration
Claude Code uses these default models for Vertex AI:
| Model type | Default value |
|---|---|
| Primary model | claude-sonnet-4-5@20250929 |
| Small/fast model | claude-haiku-4-5@20251001 |
Note: For Vertex AI users, Claude Code will not automatically upgrade from Haiku 3.5 to Haiku 4.5. To manually switch to a newer Haiku model, set the
ANTHROPIC_DEFAULT_HAIKU_MODELenvironment variable to the full model name (e.g.,claude-haiku-4-5@20251001).
To customize models:
export ANTHROPIC_MODEL='claude-opus-4-1@20250805'
export ANTHROPIC_SMALL_FAST_MODEL='claude-haiku-4-5@20251001'
IAM configuration
Assign the required IAM permissions:
The roles/aiplatform.user role includes the required permissions:
aiplatform.endpoints.predict- Required for model invocation and token counting
For more restrictive permissions, create a custom role with only the permissions above.
For details, see Vertex IAM documentation.
Note: We recommend creating a dedicated GCP project for Claude Code to simplify cost tracking and access control.
1M token context window
Claude Sonnet 4 and Sonnet 4.5 support the 1M token context window on Vertex AI.
Note: The 1M token context window is currently in beta. To use the extended context window, include the
context-1m-2025-08-07beta header in your Vertex AI requests.
Troubleshooting
If you encounter quota issues:
- Check current quotas or request quota increase through Cloud Console
If you encounter "model not found" 404 errors:
- Confirm model is Enabled in Model Garden
- Verify you have access to the specified region
- If using
CLOUD_ML_REGION=global, check that your models support global endpoints in Model Garden under "Supported features". For models that don't support global endpoints, either:- Specify a supported model via
ANTHROPIC_MODELorANTHROPIC_SMALL_FAST_MODEL, or - Set a regional endpoint using
VERTEX_REGION_<MODEL_NAME>environment variables
- Specify a supported model via
If you encounter 429 errors:
- For regional endpoints, ensure the primary model and small/fast model are supported in your selected region
- Consider switching to
CLOUD_ML_REGION=globalfor better availability