How to integrate Google cloud vision MCP with Claude Code

Manage your Google cloud vision directly from Claude Code with zero worries about OAuth hassles, API-breaking issues, or reliability and security concerns. You can do this in two different ways: Via Composio Connect - Direct and easiest approach Via Composio SDK - Programmatic approach with more control

Google cloud vision logoGoogle cloud vision
Api Key

Google Cloud Vision API adds advanced image analysis—like labeling, OCR, and detection—to apps. It helps you extract structured data and insights from images at scale.

29 Tools

Introduction

Manage your Google cloud vision directly from Claude Code with zero worries about OAuth hassles, API-breaking issues, or reliability and security concerns.

You can do this in two different ways:

  1. Via Composio Connect - Direct and easiest approach
  2. Via Composio SDK - Programmatic approach with more control

Also integrate Google cloud vision with

Why use Composio?

  • Only one MCP URL to connect multiple apps with Claude Code with zero auth hassles.
  • Programmatic tool calling allows LLMs to write its code in a remote workbench to handle complex tool chaining. Reduces to-and-fro with LLMs for frequent tool calling.
  • Handling Large tool responses out of LLM context to minimize context rot.
  • Dynamic just-in-time access to 20,000 tools across 1000+ other Apps for cross-app workflows. It loads the tools you need, so LLMs aren't overwhelmed by tools you don't need.

Connecting Google cloud vision to Claude Code using Composio

1. Add the Composio MCP to Claude

Terminal

2. Start Claude Code

bash
claude

3. Open your MCP list

bash
/mcp

4. Select Composio and click on Authenticate

Select Composio and click Authenticate

5. This will redirect you to the Composio OAuth page. Complete the flow by authorizing Composio and you're all set.

Composio OAuth authorization page
Composio authorization complete
Ask Claude to connect to your account and authenticate via the link

What is the Google cloud vision MCP server, and what's possible with it?

The Google cloud vision MCP server is an implementation of the Model Context Protocol that connects your AI agent and assistants like Claude, Cursor, etc directly to your Google Cloud Vision account. It provides structured and secure access to your image analysis resources, so your agent can perform actions like registering products, managing reference images, listing endpoints, and automating large-scale image operations on your behalf.

  • Product and reference image management: Easily create new products and add reference images for visual search, enabling your agent to organize and expand your vision datasets effortlessly.
  • Bulk import and product set operations: Let your agent import large numbers of reference images into product sets from Cloud Storage CSV files, streamlining dataset curation at scale.
  • Automated product cleanup and deletion: Direct your agent to purge unused or orphan products from your project, keeping your cloud resources tidy without manual effort.
  • Location and endpoint discovery: Quickly list available Vision AI service locations and existing IndexEndpoints, making it easy for your agent to select optimal regions and manage deployment targets.
  • Vision API operation tracking: Retrieve and review ongoing or past Vision API operations, so your agent can monitor processing jobs and ensure workflow transparency.

Connecting Google cloud vision via Composio SDK

Composio SDK is the underlying tech that powers Rube. It's a universal gateway that does everything Rube does but with much more programmatic control. You can programmatically generate an MCP URL with the app you need (here Google cloud vision) for even more tool search precision. It's secure and reliable.

How the Composio SDK works

The Composio SDK follows a three-phase workflow:

  1. Discovery: Searches for tools matching your task and returns relevant toolkits with their details.
  2. Authentication: Checks for active connections. If missing, creates an auth config and returns a connection URL via Auth Link.
  3. Execution: Executes the action using the authenticated connection.

Step-by-step Guide

Step by step10 STEPS
1

Prerequisites

Before starting, make sure you have:
  • Claude Pro, Max, or API billing enabled Anthropic account
  • Composio API Key
  • A Google cloud vision account
  • Basic knowledge of Python or TypeScript
2

Install Claude Code

bash
# macOS, Linux, WSL
curl -fsSL https://claude.ai/install.sh | bash

# Windows PowerShell
irm https://claude.ai/install.ps1 | iex

# Windows CMD
curl -fsSL https://claude.ai/install.cmd -o install.cmd && install.cmd && del install.cmd

To install Claude Code, use one of the following methods based on your operating system:

3

Set up Claude Code

bash
cd your-project-folder
claude

Open a terminal, go to your project folder, and start Claude Code:

  • Claude Code will open in your terminal
  • Follow the prompts to sign in with your Anthropic account
  • Complete the authentication flow
  • Once authenticated, you can start using Claude Code
Claude Code initial setup showing sign-in prompt
Claude Code terminal after successful login
4

Set up environment variables

bash
COMPOSIO_API_KEY=your_composio_api_key_here
USER_ID=your_user_id_here

Create a .env file in your project root with the following variables:

  • COMPOSIO_API_KEY authenticates with Composio (get it from Composio dashboard)
  • USER_ID identifies the user for session management (use any unique identifier)
5

Install Composio library

npm install @composio/core dotenv

Install the Composio TypeScript library to create MCP sessions.

  • @composio/core provides the core Composio functionality
  • dotenv loads environment variables from your .env file
6

Generate Composio MCP URL

import 'dotenv/config';
import { Composio } from '@composio/core';

const { COMPOSIO_API_KEY, USER_ID } = process.env;

if (!COMPOSIO_API_KEY || !USER_ID) {
  throw new Error('COMPOSIO_API_KEY and USER_ID required in .env');
}

const composioClient = new Composio({ apiKey: COMPOSIO_API_KEY });

const composioSession = await composioClient.create(USER_ID, {
  toolkits: ['google_cloud_vision'],
});

const composioMcpUrl = composioSession?.mcp.url;

console.log(`MCP URL: ${composioMcpUrl}`);
console.log(`\nUse this command to add to Claude Code:`);
console.log(`claude mcp add --transport http google_cloud_vision-composio "${composioMcpUrl}" --headers "X-API-Key:${COMPOSIO_API_KEY}"`);

Create a script to generate a Composio MCP URL for Google cloud vision. This URL will be used to connect Claude Code to Google cloud vision.

What's happening

  • We import the Composio client and load environment variables
  • Create a Composio instance with your API key
  • Call create() to create a Tool Router session for Google cloud vision
  • The returned mcp.url is the MCP server URL that Claude Code will use
  • The script prints this URL so you can copy it
7

Run the script and copy the MCP URL

node --loader ts-node/esm generate_mcp_url.ts
# or if using tsx
tsx generate_mcp_url.ts

Run your TypeScript script to generate the MCP URL.

  • The script connects to Composio and creates a Tool Router session
  • It prints the MCP URL and the exact command you need to run
  • Copy the entire claude mcp add command from the output
8

Add Google cloud vision MCP to Claude Code

bash
claude mcp add --transport http google_cloud_vision-composio "YOUR_MCP_URL_HERE" --headers "X-API-Key:YOUR_COMPOSIO_API_KEY"

# Then restart Claude Code
exit
claude

In your terminal, add the MCP server using the command from the previous step. The command format is:

  • claude mcp add registers a new MCP server with Claude Code
  • --transport http specifies that this is an HTTP-based MCP server
  • The server name (google_cloud_vision-composio) is how you'll reference it
  • The URL points to your Composio Tool Router session
  • --headers includes your Composio API key for authentication

After running the command, close the current Claude Code session and start a new one for the changes to take effect.

9

Verify the installation

bash
claude mcp list

Check that your Google cloud vision MCP server is properly configured.

  • This command lists all MCP servers registered with Claude Code
  • You should see your google_cloud_vision-composio entry in the list
  • This confirms that Claude Code can now access Google cloud vision tools

If everything is wired up, you should see your google_cloud_vision-composio entry listed:

Claude Code MCP list showing the toolkit MCP server
10

Authenticate Google cloud vision

The first time you try to use Google cloud vision tools, you'll be prompted to authenticate.

  • Claude Code will detect that you need to authenticate with Google cloud vision
  • It will show you an authentication link
  • Open the link in your browser (or copy/paste it)
  • Complete the Google cloud vision authorization flow
  • Return to the terminal and start using Google cloud vision through Claude Code

Once authenticated, you can ask Claude Code to perform Google cloud vision operations in natural language. For example:

  • "Bulk import product images from GCS CSV"
  • "List all Vision AI service locations"
  • "Create a new product for image recognition"

Complete Code

Here's the complete code to get you started with Google cloud vision and Claude Code:

import 'dotenv/config';
import { Composio } from '@composio/core';

const { COMPOSIO_API_KEY, USER_ID } = process.env;

if (!COMPOSIO_API_KEY || !USER_ID) {
  throw new Error('COMPOSIO_API_KEY and USER_ID required in .env');
}

const composioClient = new Composio({ apiKey: COMPOSIO_API_KEY });

const composioSession = await composioClient.create(USER_ID, {
  toolkits: ['google_cloud_vision'],
});

const composioMcpUrl = composioSession?.mcp.url;

console.log(`MCP URL: ${composioMcpUrl}`);
console.log(`\nUse this command to add to Claude Code:`);
console.log(`claude mcp add --transport http google_cloud_vision-composio "${composioMcpUrl}" --headers "X-API-Key:${COMPOSIO_API_KEY}"`);

Conclusion

You've successfully integrated Google cloud vision with Claude Code using Composio's MCP server. Now you can interact with Google cloud vision directly from your terminal using natural language commands.

Key features of this setup:

  • Terminal-native experience without switching contexts
  • Natural language commands for Google cloud vision operations
  • Secure authentication through Composio's managed MCP
  • Tool Router for dynamic tool discovery and execution

Next steps:

  • Try asking Claude Code to perform various Google cloud vision operations
  • Add more toolkits to your Tool Router session for multi-app workflows
  • Integrate this setup into your development workflow for increased productivity

You can extend this by adding more toolkits, implementing custom workflows, or building automation scripts that leverage Claude Code's capabilities.

TOOLS

Supported Tools

Every Google cloud vision action and event your agent gets out of the box.

Annotate Files with Vision API

Tool to perform image detection and annotation for batch files in Google Cloud Vision.

Async Batch Annotate Files

Tool to run asynchronous image detection and annotation for a list of generic files (PDF, TIFF, GIF).

Annotate Images

Run image detection and annotation for a batch of images using Google Cloud Vision API.

Annotate Images Async Batch

Tool to run asynchronous image detection and annotation for a batch of images.

Annotate Location Images

Tool to run image detection and annotation for a batch of images scoped to a specific project and location.

Create Vision Product

Creates a new Product resource in Google Cloud Vision Product Search.

Create Product Set

Creates a new ProductSet resource in Google Cloud Vision Product Search.

Create ReferenceImage

Tool to create a ReferenceImage under a product.

Delete Product

Permanently deletes a Product and its associated reference images from Google Cloud Vision API.

Get Product

Tool to get information associated with a Product.

Get Product Set

Tool to get a ProductSet.

Import Product Sets

Asynchronously imports product sets and reference images from a CSV file stored in Google Cloud Storage.

List Vision AI IndexEndpoints

Lists IndexEndpoints in Vertex AI Vision for a given project and location.

List Locations

Tool to list available Vision AI service locations for a project.

List Vision API Operations

Tool to list operations that match the specified filter.

Purge Products

Tool to asynchronously delete products in a ProductSet or orphan products.

Update Product

Tool to update a Product's mutable fields: displayName, description, and productLabels.

Update Product Set

Tool to update a ProductSet resource.

Add Product to ProductSet

Add a Product to a ProductSet in Google Cloud Vision Product Search.

Cancel Vision Operation

Starts asynchronous cancellation of a long-running Vision API operation.

Delete Vision API Operation

Tool to delete a long-running Vision API operation.

Delete Product Set

Tool to permanently delete a ProductSet.

Delete Reference Image

Permanently removes a reference image from a product in Google Cloud Vision Product Search.

Get Vision API Operation

Retrieves the latest state of a long-running Vision API operation.

Get Reference Image

Tool to get information associated with a ReferenceImage.

List Products in ProductSet

Tool to list Products in a specified ProductSet.

List Projects

List Google Cloud projects accessible to the authenticated user via Cloud Resource Manager API.

List Reference Images

Tool to list reference images for a product.

Remove Product from ProductSet

Removes a Product from a specified ProductSet in Google Cloud Vision API.

FAQ

Frequently asked questions

With a standalone Google cloud vision MCP server, the agents and LLMs can only access a fixed set of Google cloud vision tools tied to that server. However, with the Composio Tool Router, agents can dynamically load tools from Google cloud vision and many other apps based on the task at hand, all through a single MCP endpoint.

Yes, you can. Claude Code fully supports MCP integration. You get structured tool calling, message history handling, and model orchestration while Tool Router takes care of discovering and serving the right Google cloud vision tools.

Yes, absolutely. You can configure which Google cloud vision scopes and actions are allowed when connecting your account to Composio. You can also bring your own OAuth credentials or API configuration so you keep full control over what the agent can do.

All sensitive data such as tokens, keys, and configuration is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Google cloud vision data and credentials are handled as safely as possible.

Start with Google cloud vision.It takes 30 seconds.

Managed auth, hosted MCP servers, and every Google cloud vision tool your agent needs.Free to start.

Start building