$ man image-caption-localize
/image-caption-localize
PRICE / CALL
$0.04
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
composeCATEGORY
uncategorized
STATUS
● live
NAME
image-caption-localize — captions an image and translates the caption into any of 100+ languages in one call
SYNOPSIS
POST https://x402.agentutility.ai/image-caption-localize
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }↳ first call →
402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.DESCRIPTION
Captions an image and translates the caption into any of 100+ languages in one call. Composite: one call runs describe-image + translate. A vision LLM writes a single-sentence caption, then it is translated into the target language. Returns the source caption, translated caption, and per-component telemetry. Use it for translated image captions, multilingual alt text, or vision caption plus translation.
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| image_url | string | Public http(s) URL of the image to caption. Max 2048 chars. | required |
| target_language | string | Target language name or ISO code (e.g. 'Japanese', 'ja'). Max 60 chars. | required |
OUTPUT — response shape
| field | type | description |
|---|---|---|
| image_url | string | — |
| caption | string | — |
| target_language | string | — |
| translated_caption | string | — |
| composed_of | string | — |
| components | string | — |
| degraded | string | — |
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/image-caption-localize \
-H 'Content-Type: application/json' \
-d '{ }'first response =
402 Payment Required with payment requirements; sign + retry with X-PAYMENT.EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster npx -y @agentutility/mcp-<cluster> # Required: EVM private key with USDC on Base export X402_PRIVATE_KEY=0x... # Then call the image-caption-localize tool from your MCP-aware agent.
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
- tags
- composeimagecaptionlocalizeimage-caption-localize
- methods
- POST
- cluster
- compose
- price
- $0.04 USDC per call
ADJACENT — other endpoints in compose
| endpoint | description | price |
|---|---|---|
| article-brief | Analyzes a news article from its URL into a summary, named entities, and sentiment in one call. | $0.04 |
| company-verify-pack | Checks that a company exists and its public signals are consistent, in one call: profile, registrar, domain age, and TLS. | $0.04 |
| content-quality-pack | Runs the standard pre-publish content checks on text in one call: AI-detection, PII scan, moderation, and sentiment. | $0.04 |
| contract-trust-pack | Gathers smart-contract due-diligence data in one call: source verification, honeypot simulation, and LP lock check. | $0.04 |
| defi-protocol-dossier | Profiles a DeFi protocol's TVL and yield pools in one call using DeFiLlama data. | $0.04 |
| domain-dossier | Builds a full domain report in one call: WHOIS, DNS, TLS, age, risk, and DMARC. | $0.04 |
| image-intel-pack | Analyzes an image in one call: description, brand logo detection, and content moderation. | $0.04 |
| market-rates-pack | Merges FX rates, perp funding, and Hyperliquid mark price into one call. | $0.04 |
SEE ALSO