Skip to content
clusters: prooflayer · edgemarket · edgefinance · synthforge · mediakit · wordmint · webprobe · locale · comppoint
$ man describe-image

/describe-image

agentutility / wordmint / describe-image
PRICE / CALL
$0.02
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
wordmint
CATEGORY
ai
STATUS
live
NAME
describe-image ai image descriptor / vision llm
SYNOPSIS
POST https://x402.agentutility.ai/describe-image
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.
DESCRIPTION

AI image descriptor / vision LLM. Modes: describe, alt_text (accessibility, ≤125 chars), OCR (extract visible text), tags (8-15 keywords), caption (single-sentence). Vision LLM powered.

INPUTrequest schema
propertytypedescriptionreq?
image_urlstringPublic URL of an image.required
modestring'describe' (default), 'alt_text', 'ocr', 'tags', 'caption'.
enum: describe · alt_text · ocr · tags · caption
optional
promptstringOptional custom instruction. Overrides mode.optional
OUTPUTresponse shape
fieldtypedescription
textstringGenerated output for the selected mode: prose description, alt text, extracted OCR text, keyword list, or caption.
modestringMode used to generate the output: describe, alt_text, ocr, tags, or caption.
image_urlstringURL of the source image that was analyzed by the vision LLM.
modelstringVision LLM model name that produced the description (e.g. claude-haiku-4-5).
EXAMPLEStwo ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/describe-image \
  -H 'Content-Type: application/json' \
  -d '{ }'
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# MCP packages on npm under
# @agentutility/mcp-*  (one per cluster)
#
# Catalog + install:
# https://mcp.agentutility.ai
#
# Or call describe-image directly over HTTP — see above.
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
tags
imagevisionocralt-textcaptionaillm
env
VENICE_API_KEY
methods
POST
cluster
wordmint
price
$0.02 USDC per call
ADJACENTother endpoints in wordmint
endpointdescriptionprice
classifyZero-shot text classifier.$0.02
classify-textText classifier / zero-shot classifier / category sorter.$0.02
detect-piiPII detector / data leak scanner.$0.02
email-draftAI email writer / cold outreach / follow-up generator.$0.02
extractNamed entity extractor / NER.$0.02
moderate-contentContent moderation / safety classifier / OpenAI-style toxicity API.$0.02
resume-scorerAI resume scorer / ATS keyword analyzer.$0.02
rewrite-toneTone rewriter / paraphraser / writing style changer.$0.02
SEE ALSO
agentutility · wordmint · x402 · mcp · llms.txt · registry.json · bazaar.x402.org