Terraform configurations for Vertex AI platform and Agent Engine
Installation
Open Claude Code and run this command:
/plugin install jeremy-vertex-terraform@claude-code-plugins-plus
Use --global to install for all projects, or --project for current project only.
What It Does
This plugin provides Terraform modules for deploying Vertex AI services including Model Garden foundation models, Gemini API endpoints, vector search for RAG applications, ML pipelines, and production model serving infrastructure.
Key Infrastructure Components:
- `google_vertex_ai_endpoint` for model serving
- `google_vertex_ai_deployed_model` for model versions
- `google_vertex_ai_index` for vector search
- `google_vertex_ai_index_endpoint` for similarity search
- `google_vertex_ai_featurestore` for feature management
- Cloud Storage for model artifacts
- BigQuery for ML model training
Features
✅ Model Garden Deployment: Foundation models (Gemini, PaLM, Claude, Llama)
✅ Gemini API Endpoints: Dedicated endpoints with rate limiting
✅ Vector Search: ScaNN-based similarity search for RAG
✅ ML Pipelines: Kubeflow Pipelines for training workflows
✅ Model Serving: Production endpoints with auto-scaling
✅ Batch Predictions: Large-scale inference jobs
✅ Feature Store: Centralized feature management
✅ Monitoring: Model performance tracking and drift detection
Skills (1)
Execute — use when provisioning Vertex AI infrastructure with Terraform.
How It Works
Natural Language Activation
"Create Terraform for Gemini endpoint deployment"
"Deploy vector search for RAG application"
"Set up Vertex AI Pipeline for model training"
"Create Feature Store for ML features"
"Deploy custom model to Vertex AI endpoint"
Use Cases
Gemini API Deployment
"Create Terraform for Gemini 2.0 Flash endpoint"
"Deploy Gemini Pro with auto-scaling"
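A minimal sketch of the endpoint piece, using the google provider's `google_vertex_ai_endpoint` resource; the endpoint name and region are placeholders, and the plugin's generated modules will add project/network settings on top of this:

```hcl
# Hypothetical example: a dedicated Vertex AI endpoint for serving Gemini.
resource "google_vertex_ai_endpoint" "gemini" {
  name         = "gemini-endpoint"   # placeholder name
  display_name = "gemini-endpoint"
  location     = "us-central1"       # placeholder region
}
```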
Vector Search for RAG
"Set up vector search infrastructure for RAG application"
"Deploy embeddings index with 768 dimensions"
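The 768-dimension embeddings index above can be sketched with the google provider's `google_vertex_ai_index` resource; the display name, region, and GCS URI are placeholders, and the tree-AH tuning values are illustrative, not recommendations:

```hcl
# Hypothetical example: a tree-AH (ScaNN) vector index for RAG retrieval.
resource "google_vertex_ai_index" "embeddings" {
  display_name = "rag-embeddings"    # placeholder name
  region       = "us-central1"       # placeholder region
  metadata {
    contents_delta_uri = "gs://example-bucket/embeddings/"  # placeholder bucket
    config {
      dimensions                  = 768
      approximate_neighbors_count = 150
      distance_measure_type       = "DOT_PRODUCT_DISTANCE"
      algorithm_config {
        tree_ah_config {
          leaf_node_embedding_count = 500
        }
      }
    }
  }
}
```

Queries go through a separate `google_vertex_ai_index_endpoint` to which this index is deployed.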
Custom Model Serving
"Deploy custom scikit-learn model to Vertex AI"
"Create endpoint for TensorFlow model with GPU"
Batch Predictions
"Set up batch prediction job for large dataset"
"Deploy batch inference with T4 GPUs"
Feature Store
"Create Feature Store for user features"
"Deploy feature serving for real-time predictions"
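A minimal sketch of the Feature Store piece, using the google provider's `google_vertex_ai_featurestore` resource; the store name, region, and node count are placeholders:

```hcl
# Hypothetical example: a feature store with online serving for
# real-time predictions. Entity types and features are defined
# separately (google_vertex_ai_featurestore_entitytype).
resource "google_vertex_ai_featurestore" "users" {
  name   = "user_features"   # placeholder name
  region = "us-central1"     # placeholder region
  online_serving_config {
    fixed_node_count = 1     # illustrative sizing
  }
}
```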