Image Generation Agent by Apoth3osis

Name: Image Generation Agent
Brand: Apoth3osis
SKU: 6a054f5c90a57115271c1316
Price: 0.08 USD
Availability: InStock

Image Generation Agent

Model

Available ActionsEach successful request consumes credits as outlined below.

generate_budget_image^8crgenerate_image_0_5k^10crgenerate_image_1k^15crgenerate_image_2k^25crgenerate_image_4k^40cr

Details

AI image generator powered by Google Gemini 3 Flash Image and "Nano Banana". Create photorealistic product photography, marketing creative, social graphics, app icons, concept art, hero images, and brand assets from a single text prompt — or edit an existing image by passing up to four reference photos for style, subject, or scene guidance. Choose your output tier: a low-cost budget draft for ideation, or crisp 0.5K, 1K, 2K, and 4K final renders for print, ads, e-commerce, and presentation use. Supports 14 aspect ratios including 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 8:1 banner formats. Every generated image is auto-saved to AgentPMT File Manager with a 7-day signed download URL, file_id, width, height, MIME type, and size bytes — ready to drop into chat, hand off to another tool, or pull into a workflow. Built for designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand visuals without leaving the conversation.

Workflows Using This Tool

2 / 3

Workflow

Saves ~3 hr

Human-Voice AI Blog Writer: Research, Write, and Illustrate SEO Articles from Your Content Calendar

+3 more tools

Turn a topic or a content-calendar spreadsheet into a publish-ready, fact-checked blog article written in a natural human voice. This AI blog writing workflow picks the next due topic from your Google Sheet (or takes one directly), researches it across live news and authoritative web sources, builds a sourced fact sheet and SEO outline, then drafts the full long-form article with a human-style writing agent that writes only from verified facts. Every draft runs through an automated writing quality check that catches robotic, banned AI phrases and rewrites them until the copy passes. A custom hero image is generated to match the story, the finished article is assembled into a formatted Google Doc with a sources section, the run is logged back to your content calendar, and the doc link lands in your inbox. Ideal for content marketing teams, SEO agencies, founders, newsletters, and solo bloggers who want an AI blog post generator and content automation pipeline that delivers consistent, on-brand, long-form SEO content without the research grind or the telltale AI voice.

Use Cases

AI image generation, Nano Banana image creation, Google Gemini image API, text-to-image, image editing with reference photos, product photography mockups, hero banner generation, social media graphics for Instagram and TikTok and LinkedIn, e-commerce product visuals, concept art, app and product icon design, marketing campaign creative, ad creative generation, brand asset production, style transfer with reference images, photoshoot replacement, background swapping, ultra-wide 21:9 banner generation, 4K poster rendering, multi-aspect-ratio variants for omnichannel campaigns, AI-generated stock imagery, illustration generation, storyboard panels, presentation graphics, content marketing visuals, blog header images, YouTube thumbnails, podcast cover art, book cover generation, packaging mockups, real estate listing renders, automated visual content pipelines for AI agents

Dynamic MCP Setup

Connect once through AgentPMT Dynamic MCP, then use approved tools from the same agent connection.

30 Second Setup

STDIO connector for Claude Code, Codex, Cursor, Zed, and other LLMs that require STDIO or custom connections.

npm install -g @agentpmt/mcp-routeragentpmt-setup

Hosted Streamable HTTPS

MCP endpoint for browser-based apps like ChatGPT, Claude, Grok, or any time you want a streamable connection with no local install.

https://api.agentpmt.com/mcp

Config Example

Use the hosted endpoint directly in clients that support remote MCP. Store your Bearer token in the client config or secret field.

Full connection guide

{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}

Need client videos, organization controls, audit details, and the full feature overview?

More About Dynamic MCP

Actions(5)

generate_budget_image^8cr5 params(1 required)

Create or edit a lower-cost image from a prompt and optional reference images. Use for drafts, previews, and standard 1024px-class outputs.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:

1:12:33:23:44:34:55:49:1616:921:9

reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object

filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_0_5k^10cr5 params(1 required)

Create or edit a high-efficiency 0.5K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:

1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9

reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object

filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_1k^15cr5 params(1 required)

Create or edit a 1K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:

1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9

reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object

filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_2k^25cr5 params(1 required)

Create or edit a 2K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:

1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9

reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object

filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_4k^40cr5 params(1 required)

Create or edit a 4K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:

1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9

reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object

filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

curl -X POST "https://api.agentpmt.com/products/purchase" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer ********" \
  -d '{
    "product_id": "6a054f5c90a57115271c1316",
    "parameters": {
      "action": "generate_budget_image",
      "prompt": "example_prompt"
    }
  }'

import requests
import json

url = "https://api.agentpmt.com/products/purchase"

headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer ********"
}

data = {
    "product_id": "6a054f5c90a57115271c1316",
    "parameters": {
        "action": "generate_budget_image",
        "prompt": "example_prompt"
    }
}

response = requests.post(url, headers=headers, json=data)
print(response.status_code)
print(response.json())

const url = "https://api.agentpmt.com/products/purchase";

const headers = {
  "Content-Type": "application/json",
  "Authorization": "Bearer ********"
};

const data = {
  product_id: "6a054f5c90a57115271c1316",
  parameters: {
    "action": "generate_budget_image",
    "prompt": "example_prompt"
  }
};

fetch(url, {
  method: "POST",
  headers,
  body: JSON.stringify(data)
})
  .then(response => response.json())
  .then(data => console.log(data))
  .catch(error => console.error("Error:", error));

const axios = require('axios');

const url = "https://api.agentpmt.com/products/purchase";

const headers = {
  "Content-Type": "application/json",
  "Authorization": "Bearer ********"
};

const data = {
  product_id: "6a054f5c90a57115271c1316",
  parameters: {
    "action": "generate_budget_image",
    "prompt": "example_prompt"
  }
};

axios.post(url, data, { headers })
  .then(response => {
    console.log(response.status);
    console.log(response.data);
  })
  .catch(error => {
    console.error("Error:", error.message);
  });

Login to view your API and budget keys. The example above uses placeholder values. Sign in to see personalized code with your bearer token.

Autonomous agents can access this tool through AgentAddress credit balances or direct x402 payments. Use the Autonomous Agent API reference for endpoint shapes after choosing the access pattern below.

Recommended

Credit-Based Access Using AgentAddress

AgentAddress is preferred when an autonomous agent needs persistent file access, stored platform state, or maximum tool use ability across repeated calls.

Open Credit-Based Access Using AgentAddress

Direct x402 Payment

Use direct x402 for independent one-off tool calls that do not require shared files or stored platform state.

Accepted public payments

Stablecoin: USDC
Chains: Base, Arbitrum, Optimism, Polygon, and Avalanche

Direct x402 payments are not enabled for this product; use AgentAddress credit access instead.

Product Skill Package

This product has a published Agent Skill package. Install it when an autonomous agent needs product-specific operating instructions in its local skill registry.

Download SKILL.md View package source OpenClaw listing

OpenClaw install

Copied to clipboard

skills.sh install

Copied to clipboard

Usage Instructions

Usage guidance provided directly by the developer for this product.

AI Image Creator

Create images from prompts, or edit an image by adding reference images. Generated images are saved to File Manager and returned with a signed URL that users can open or download directly, plus file_id, MIME type, size, width, and height.

Choosing an action

generate_budget_image: Lower-cost drafts, previews, simple assets, or 1024px-class output.
generate_image_0_5k: Small high-efficiency output.
generate_image_1k: Standard final images.
generate_image_2k: Higher-resolution social, presentation, or product assets.
generate_image_4k: Highest-resolution final assets.

Inputs

prompt is required for every generation action.
aspect_ratio defaults to 1:1.
reference_images is optional and accepts up to 4 images.
Reference image formats: PNG, JPEG, or WebP.
Reference image sources:
- {"source_kind":"file_id","file_id":"<file-id>"}
- {"source_kind":"url","url":"https://example.com/image.png"}
- {"source_kind":"base64","base64_data":"<base64>","mime_type":"image/png"}
filename is optional. The final extension is inferred from the generated image.
expiration_days defaults to 7 and must be from 1 to 7.
Generated image bytes are not returned inline. Use the signed URL to open or download the file.

Only use reference images you have the right to use.

Aspect ratios

generate_budget_image supports: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9.

High-resolution actions support: 1:1, 1:4, 1:8, 2:3, 3:2, 3:4, 4:1, 4:3, 4:5, 5:4, 8:1, 9:16, 16:9, 21:9.

Examples

Budget text-to-image:

{
  "action": "generate_budget_image",
  "prompt": "A clean blue circle icon on a white background, flat vector style",
  "aspect_ratio": "1:1"
}

High-resolution text-to-image:

{
  "action": "generate_image_2k",
  "prompt": "A polished product mockup of a reusable water bottle on a white studio sweep",
  "aspect_ratio": "16:9",
  "filename": "water-bottle-product-mockup"
}

Reference-image edit from File Manager:

{
  "action": "generate_image_1k",
  "prompt": "Keep the subject and pose. Replace the background with a bright minimalist studio scene.",
  "reference_images": [
    {
      "source_kind": "file_id",
      "file_id": "<file-manager-file-id>"
    }
  ],
  "aspect_ratio": "3:4"
}

Output

Successful calls return:

{
  "success": true,
  "action": "generate_image_1k",
  "tier": "1k",
  "model_tier": "standard",
  "aspect_ratio": "1:1",
  "image_size": "1K",
  "reference_image_count": 0,
  "images": [
    {
      "file_id": "...",
      "filename": "...png",
      "signed_url": "https://...",
      "signed_url_expires_in": 604800,
      "content_type": "image/png",
      "size_bytes": 123456,
      "width": 1024,
      "height": 1024
    }
  ],
  "text_parts": [],
  "usage": {
    "input_tokens": 0,
    "output_tokens": 0,
    "total_tokens": 0
  }
}

About this Product

AI image generation powered by Nano Banana (Google Gemini 3 Flash Image)

Turn a text prompt into polished, photorealistic visuals in seconds. The Image Generation Agent runs on Google's Gemini 3 Flash Image model — the "Nano Banana" family — to create marketing creative, product photography, social graphics, icons, concept art, and brand assets, or to edit an existing image with reference photos. Generate by hand or wire it into your agents and workflows for on-demand visuals without leaving the conversation.

What you can create

Photorealistic product photography and e-commerce mockups.
Marketing and ad creative, hero banners, and campaign visuals.
Social graphics for Instagram, TikTok, LinkedIn, and YouTube thumbnails.
App and product icons, concept art, illustrations, and storyboard panels.
Brand assets, packaging mockups, blog headers, and presentation graphics.

Generate or edit

Text-to-image

Describe what you want and get a finished image — composition, lighting, and style follow your prompt.

Reference-image editing

Attach up to four reference photos to guide subject, style, or scene. Keep the same product across new backgrounds, restyle a shot, swap a setting, or carry a consistent look through an entire campaign — the subject stays locked from one render to the next.

Pick your resolution

Budget draft — fast, low-cost ideation and previews.
0.5K & 1K — efficient standard finals for web and social.
2K — crisp social, presentation, and product assets.
4K — highest-resolution renders for print, ads, and large-format use.

14 aspect ratios, including ultra-wide banners

Render square, portrait, landscape, and cinematic formats — 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 4:1 and 8:1 banners — so one prompt can fuel an omnichannel campaign.

Ready for your workflow

Every image is auto-saved to AgentPMT File Manager and returned with a download link plus its file_id, width, height, MIME type, and size — ready to drop into chat, hand off to another tool, or pull into an automated pipeline.

Who it's for

Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution.

Frequently Asked Questions

How do I connect this tool to an external agent?

You can install the local MCP server by opening a terminal and running:

Install commands

npm install -g @agentpmt/mcp-router
agentpmt-setup

This will connect you to local agents like Claude Code, Windsurf, Grok Build, Cursor, etc.

Alternatively you can connect to the hosted version with this config block, no installation required:

Hosted MCP config

{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}

View MCP Connection Instructions for more details.

How does an external agent use this tool?

After the external agent is connected to an Agent Group that can use this tool, paste this prompt into the agent:

Agent prompt

Use the AgentPMT-Tool-Search-and-Execution tool. First call action 'get_instructions' so you know how to use the tool search interface. Then call action 'get_schema' with tool_id 6a054f5c90a57115271c1316 ("Image Generation Agent"). After reading the schema and any returned instructions, tell me what this tool can do, we are going to be using it

The agent should fetch the tool schema first, collect the required parameters for your request, and then call the tool through AgentPMT.

Can I edit an existing image or keep a product consistent across scenes?

Yes. Attach up to four reference images to guide subject, style, or scene. Reference-image editing keeps the same subject across new backgrounds, so you can place the same product in different settings or carry one look through an entire campaign.

What aspect ratios are supported?

14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, and 4:5, plus ultra-wide 4:1 and 8:1 banner formats — so a single prompt can produce variants for an omnichannel campaign.

What is the Image Generation Agent?

An AI image generator powered by Nano Banana (Google Gemini 3 Flash Image). Create photorealistic product photos, marketing creative, social graphics, icons, concept art, and brand assets from a text prompt, or edit an existing image with reference photos — directly in chat, agents, and workflows.

What resolutions can I generate?

Choose the tier that fits the job: a low-cost budget draft for ideation, efficient 0.5K and 1K finals, crisp 2K for social and product assets, and 4K for print, ads, and large-format use.

Where do my images go after they're generated?

Every image is auto-saved to AgentPMT File Manager and returned with a download link, file_id, width, height, MIME type, and size — ready to drop into chat, hand to another tool, or use in an automated workflow.

Who is it built for?

Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution without leaving the conversation.

Image Generation Agent

Available ActionsEach successful request consumes credits as outlined below.

Details

Workflows Using This Tool

Human-Voice AI Blog Writer: Research, Write, and Illustrate SEO Articles from Your Content Calendar

Use Cases

Dynamic MCP Setup

30 Second Setup

Hosted Streamable HTTPS

Config Example

About this Product

AI image generation powered by Nano Banana (Google Gemini 3 Flash Image)

What you can create

Generate or edit

Text-to-image

Reference-image editing

Pick your resolution

14 aspect ratios, including ultra-wide banners

Ready for your workflow

Who it's for

Frequently Asked Questions

How do I connect this tool to an external agent?

How does an external agent use this tool?

Can I edit an existing image or keep a product consistent across scenes?

What aspect ratios are supported?

What is the Image Generation Agent?

What resolutions can I generate?

Where do my images go after they're generated?

Who is it built for?

Related Content

Creative Workflow Automation: Image Generation Agent

Looking for help integrating AI into your business? Set up a free consultation.