Google cloud vision MCP for AI Agents

Securely connect your AI agents and chatbots (Claude, ChatGPT, Cursor, etc) with Google cloud vision MCP or direct API to analyze images, detect faces and landmarks, extract text via OCR, and moderate content through natural language.

Google cloud vision logoGoogle cloud vision
Api Key

Google Cloud Vision API adds advanced image analysis—like labeling, OCR, and detection—to apps. It helps you extract structured data and insights from images at scale.

29 Tools

Try Google cloud vision now

Type what you want done — sign in and watch it run live in the Tool Router playground.

TOOL ROUTER PLAYGROUND
Google cloud vision
Try asking
TOOLS

Supported Tools

Every Google cloud vision action and event your agent gets out of the box.

Annotate Files with Vision API

Tool to perform image detection and annotation for batch files in Google Cloud Vision.

Async Batch Annotate Files

Tool to run asynchronous image detection and annotation for a list of generic files (PDF, TIFF, GIF).

Annotate Images

Run image detection and annotation for a batch of images using Google Cloud Vision API.

Annotate Images Async Batch

Tool to run asynchronous image detection and annotation for a batch of images.

Annotate Location Images

Tool to run image detection and annotation for a batch of images scoped to a specific project and location.

Create Vision Product

Creates a new Product resource in Google Cloud Vision Product Search.

Create Product Set

Creates a new ProductSet resource in Google Cloud Vision Product Search.

Create ReferenceImage

Tool to create a ReferenceImage under a product.

Delete Product

Permanently deletes a Product and its associated reference images from Google Cloud Vision API.

Get Product

Tool to get information associated with a Product.

Get Product Set

Tool to get a ProductSet.

Import Product Sets

Asynchronously imports product sets and reference images from a CSV file stored in Google Cloud Storage.

List Vision AI IndexEndpoints

Lists IndexEndpoints in Vertex AI Vision for a given project and location.

List Locations

Tool to list available Vision AI service locations for a project.

List Vision API Operations

Tool to list operations that match the specified filter.

Purge Products

Tool to asynchronously delete products in a ProductSet or orphan products.

Update Product

Tool to update a Product's mutable fields: displayName, description, and productLabels.

Update Product Set

Tool to update a ProductSet resource.

Add Product to ProductSet

Add a Product to a ProductSet in Google Cloud Vision Product Search.

Cancel Vision Operation

Starts asynchronous cancellation of a long-running Vision API operation.

Delete Vision API Operation

Tool to delete a long-running Vision API operation.

Delete Product Set

Tool to permanently delete a ProductSet.

Delete Reference Image

Permanently removes a reference image from a product in Google Cloud Vision Product Search.

Get Vision API Operation

Retrieves the latest state of a long-running Vision API operation.

Get Reference Image

Tool to get information associated with a ReferenceImage.

List Products in ProductSet

Tool to list Products in a specified ProductSet.

List Projects

List Google Cloud projects accessible to the authenticated user via Cloud Resource Manager API.

List Reference Images

Tool to list reference images for a product.

Remove Product from ProductSet

Removes a Product from a specified ProductSet in Google Cloud Vision API.

SETUP GUIDE

Connect Google cloud vision MCP Tool with your Agent

1

Install Composio

typescript
npm install @composio/core ai @ai-sdk/openai @ai-sdk/mcp
Install the Composio SDK and Claude Agent SDK
2

Create Tool Router Session

typescript
import { Composio } from '@composio/core';

const composio = new Composio({ apiKey: 'your-api-key' });

console.log("Creating Tool Router session...");
const { mcp } = await composio.create('your-user-id');
console.log(`Tool Router session created: ${mcp.url}`);
Initialize the Composio client and create a Tool Router session
3

Connect to AI Agent

typescript
import { openai } from '@ai-sdk/openai';
import { experimental_createMCPClient as createMCPClient } from '@ai-sdk/mcp';
import { generateText, stepCountIs } from 'ai';

const client = await createMCPClient({
  transport: {
    type: 'http',
    url: mcp.url,
    headers: { 'x-api-key': 'your-composio-api-key' }
  }
});

const tools = await client.tools();

const { text } = await generateText({
  model: openai('gpt-4o'),
  tools,
  messages: [{ role: 'user', content: 'Detect text in this image from the provided URL' }],
  stopWhen: stepCountIs( 5 )
});

console.log(`Agent: ${text}`);
Use the MCP server with your AI agent
SETUP GUIDE

Connect Google cloud vision API Tool with your Agent

1

Install Composio

typescript
npm install @composio/openai
Install the Composio SDK
2

Initialize Composio and Create Tool Router Session

typescript
import OpenAI from 'openai';
import { Composio } from '@composio/core';
import { OpenAIResponsesProvider } from '@composio/openai';

const composio = new Composio({
  provider: new OpenAIResponsesProvider(),
});
const openai = new OpenAI({});
const session = await composio.create('your-user-id');
Import and initialize Composio client, then create a Tool Router session
3

Execute Google cloud vision Tools via Tool Router with Your Agent

typescript
const tools = session.tools;
const response = await openai.responses.create({
  model: 'gpt-4.1',
  tools: tools,
  input: [{
    role: 'user',
    content: 'Detect and extract text from an uploaded receipt image using OCR.'
  }],
});
const result = await composio.provider.handleToolCalls(
  'your-user-id',
  response.output
);
console.log(result);
Get tools from Tool Router session and execute Google cloud vision actions with your Agent

Why Use Composio?

AI Native Google cloud vision Integration

  • Supports both Google cloud vision MCP and direct API based integrations
  • Structured, LLM-friendly schemas for reliable tool execution
  • Rich coverage for reading, writing, and querying your Google cloud vision data

Managed Auth

  • Built-in OAuth handling with automatic token refresh and rotation
  • Central place to manage, scope, and revoke Google cloud vision access
  • Per user and per environment credentials instead of hard-coded keys

Agent Optimized Design

  • Tools are tuned using real error and success rates to improve reliability over time
  • Comprehensive execution logs so you always know what ran, when, and on whose behalf

Enterprise Grade Security

  • Fine-grained RBAC so you control which agents and users can access Google cloud vision
  • Scoped, least privilege access to Google cloud vision resources
  • Full audit trail of agent actions to support review and compliance
FAQ

Frequently asked questions

Yes, Google cloud vision requires you to configure your own API key credentials. Once set up, Composio handles secure credential storage and API request handling for you.

Yes! Composio's Tool Router enables agents to use multiple toolkits. Learn more.

Composio is SOC 2 and ISO 27001 compliant with all data encrypted in transit and at rest. Learn more.

Composio maintains and updates all toolkit integrations automatically, so your agents always work with the latest API versions.

Start with Google cloud vision.It takes 30 seconds.

Managed auth, hosted MCP servers, and every Google cloud vision tool your agent needs.Free to start.

Start building