

Image Generation Agent
Model
Available Actions
Each successful request consumes credits as outlined below.
- generate_budget_image: 8 cr
- generate_image_0_5k: 10 cr
- generate_image_1k: 15 cr
- generate_image_2k: 25 cr
- generate_image_4k: 40 cr
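To make the credit pricing concrete, here is a minimal Python sketch that estimates the cost of a planned batch. The action names and credit values are copied from the list above; the `estimate_credits` helper and the example plan are illustrative and not part of AgentPMT.

```python
# Credit cost per successful request, copied from the Available Actions list.
CREDITS_PER_ACTION = {
    "generate_budget_image": 8,
    "generate_image_0_5k": 10,
    "generate_image_1k": 15,
    "generate_image_2k": 25,
    "generate_image_4k": 40,
}


def estimate_credits(planned_calls):
    """Total credits for a plan of {action_name: successful_request_count}.

    Hypothetical helper for budgeting; it does not call any AgentPMT API.
    """
    total = 0
    for action, count in planned_calls.items():
        if action not in CREDITS_PER_ACTION:
            raise ValueError(f"Unknown action: {action}")
        if count < 0:
            raise ValueError(f"Request count must be non-negative: {count}")
        total += CREDITS_PER_ACTION[action] * count
    return total


# Example plan: six budget drafts for ideation, then two 4K finals.
plan = {"generate_budget_image": 6, "generate_image_4k": 2}
print(estimate_credits(plan))  # 6*8 + 2*40 = 128
```

Drafting on the budget tier and rendering only the chosen concepts at 4K keeps most of the spend on final assets.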
Details
AI image generator powered by Google Gemini 3 Flash Image ("Nano Banana"). Create photorealistic product photography, marketing creative, social graphics, app icons, concept art, hero images, and brand assets from a single text prompt, or edit an existing image by passing up to four reference photos for style, subject, or scene guidance. Choose your output tier: a low-cost budget draft for ideation, or crisp 0.5K, 1K, 2K, and 4K final renders for print, ads, e-commerce, and presentation use. Supports 14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 4:1 and 8:1 banner formats. Every generated image is auto-saved to AgentPMT File Manager and returned with a 7-day signed download URL, file_id, width, height, MIME type, and size in bytes, ready to drop into chat, hand off to another tool, or pull into a workflow. Built for designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand visuals without leaving the conversation.
Workflows Using This Tool
- X (Twitter) Thought Leadership Engine: Human-Voice AI Posts from Industry News with Approval and Auto-Publish
- Human-Voice AI Blog Writer: Research, Write, and Illustrate SEO Articles from Your Content Calendar
- Pipedrive Personalized Direct Mail Engine: AI-Designed Greeting Cards to Any CRM Segment
Workflow
Saves ~3 hr
Turn a topic or a content-calendar spreadsheet into a publish-ready, fact-checked blog article written in a natural human voice. This AI blog writing workflow picks the next due topic from your Google Sheet (or takes one directly), researches it across live news and authoritative web sources, builds a sourced fact sheet and SEO outline, then drafts the full long-form article with a human-style writing agent that writes only from verified facts. Every draft runs through an automated writing quality check that catches robotic, banned AI phrases and rewrites them until the copy passes. A custom hero image is generated to match the story, the finished article is assembled into a formatted Google Doc with a sources section, the run is logged back to your content calendar, and the doc link lands in your inbox. Ideal for content marketing teams, SEO agencies, founders, newsletters, and solo bloggers who want an AI blog post generator and content automation pipeline that delivers consistent, on-brand, long-form SEO content without the research grind or the telltale AI voice.
Use Cases
AI image generation, Nano Banana image creation, Google Gemini image API, text-to-image, image editing with reference photos, product photography mockups, hero banner generation, social media graphics for Instagram and TikTok and LinkedIn, e-commerce product visuals, concept art, app and product icon design, marketing campaign creative, ad creative generation, brand asset production, style transfer with reference images, photoshoot replacement, background swapping, ultra-wide 21:9 banner generation, 4K poster rendering, multi-aspect-ratio variants for omnichannel campaigns, AI-generated stock imagery, illustration generation, storyboard panels, presentation graphics, content marketing visuals, blog header images, YouTube thumbnails, podcast cover art, book cover generation, packaging mockups, real estate listing renders, automated visual content pipelines for AI agents
Dynamic MCP Setup
Connect once through AgentPMT Dynamic MCP, then use approved tools from the same agent connection.
30 Second Setup
STDIO connector for Claude Code, Codex, Cursor, Zed, and other local agents that require STDIO or custom connections.
npm install -g @agentpmt/mcp-router
agentpmt-setup
Hosted Streamable HTTPS
MCP endpoint for browser-based apps like ChatGPT, Claude, and Grok, or for any time you want a streamable connection with no local install.
https://api.agentpmt.com/mcp
Config Example
Use the hosted endpoint directly in clients that support remote MCP. Store your Bearer token in the client config or secret field.
{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}
Need client videos, organization controls, audit details, and the full feature overview?
More About Dynamic MCP
About this Product
AI image generation powered by Nano Banana (Google Gemini 3 Flash Image)
Turn a text prompt into polished, photorealistic visuals in seconds. The Image Generation Agent runs on Google's Gemini 3 Flash Image model — the "Nano Banana" family — to create marketing creative, product photography, social graphics, icons, concept art, and brand assets, or to edit an existing image with reference photos. Generate by hand or wire it into your agents and workflows for on-demand visuals without leaving the conversation.
What you can create
- Photorealistic product photography and e-commerce mockups.
- Marketing and ad creative, hero banners, and campaign visuals.
- Social graphics for Instagram, TikTok, LinkedIn, and YouTube thumbnails.
- App and product icons, concept art, illustrations, and storyboard panels.
- Brand assets, packaging mockups, blog headers, and presentation graphics.
Generate or edit
Text-to-image
Describe what you want and get a finished image — composition, lighting, and style follow your prompt.
Reference-image editing
Attach up to four reference photos to guide subject, style, or scene. Keep the same product across new backgrounds, restyle a shot, swap a setting, or carry a consistent look through an entire campaign — the subject stays locked from one render to the next.
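The four-reference limit is easy to enforce before a request is sent. The Python sketch below shows one way to do that. The function `build_request` and the field names `prompt` and `reference_images` are hypothetical, chosen for illustration; the real parameter names come from the tool schema.

```python
from typing import List, Optional

MAX_REFERENCE_IMAGES = 4  # limit stated in the text above


def build_request(prompt: str, reference_images: Optional[List[str]] = None) -> dict:
    """Build an illustrative request payload.

    Field names are assumptions, not the tool's documented schema.
    With no references this is a text-to-image request; with one to
    four references it is a reference-image edit.
    """
    prompt = prompt.strip()
    if not prompt:
        raise ValueError("prompt must not be empty")
    refs = list(reference_images or [])
    if len(refs) > MAX_REFERENCE_IMAGES:
        raise ValueError(
            f"At most {MAX_REFERENCE_IMAGES} reference images are allowed, got {len(refs)}"
        )
    payload = {"prompt": prompt}
    if refs:
        payload["reference_images"] = refs
    return payload


# Text-to-image: prompt only.
print(build_request("A ceramic mug on a marble counter, soft morning light"))

# Reference-image editing: same product, new background.
print(build_request("Place this mug on a picnic table at sunset", ["mug_front.png"]))
```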
Pick your resolution
- Budget draft — fast, low-cost ideation and previews.
- 0.5K & 1K — efficient standard finals for web and social.
- 2K — crisp social, presentation, and product assets.
- 4K — highest-resolution renders for print, ads, and large-format use.
14 aspect ratios, including ultra-wide banners
Render square, portrait, landscape, and cinematic formats — 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 4:1 and 8:1 banners — so one prompt can fuel an omnichannel campaign.
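When a layout slot has fixed pixel dimensions, it helps to pick the nearest supported ratio before generating. The Python sketch below does this for the seven ratios named above; the remaining seven of the 14 are not listed on this page, so they are left out. The helper `closest_named_ratio` is hypothetical and is not part of the tool.

```python
import math

# Only the ratios named in the text; the full set of 14 is not listed here.
NAMED_RATIOS = ["1:1", "16:9", "9:16", "21:9", "4:5", "4:1", "8:1"]


def closest_named_ratio(width: int, height: int) -> str:
    """Return the named ratio closest to width:height.

    Distance is measured on a log scale so that wide and tall
    mismatches are treated symmetrically.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = math.log(width / height)

    def distance(label: str) -> float:
        w, h = label.split(":")
        return abs(math.log(int(w) / int(h)) - target)

    return min(NAMED_RATIOS, key=distance)


print(closest_named_ratio(1080, 1350))  # 4:5
print(closest_named_ratio(1920, 1080))  # 16:9
print(closest_named_ratio(1200, 300))   # 4:1
```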
Ready for your workflow
Every image is auto-saved to AgentPMT File Manager and returned with a download link plus its file_id, width, height, MIME type, and size — ready to drop into chat, hand off to another tool, or pull into an automated pipeline.
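A downstream tool can model the returned metadata as a small record. The Python sketch below is one way to do that, under stated assumptions: the attribute names (`download_url`, `mime_type`, `size_bytes`, `created_at`) are illustrative rather than the documented response keys, and the 7-day signed-URL window from the Details section is assumed to start when the image is created.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# The Details section describes a 7-day signed download URL.
SIGNED_URL_LIFETIME = timedelta(days=7)


@dataclass(frozen=True)
class GeneratedImage:
    """Illustrative record of the metadata returned with each image."""

    file_id: str
    download_url: str
    width: int
    height: int
    mime_type: str
    size_bytes: int
    created_at: datetime  # hypothetical field; assumed start of the URL window

    def url_expired(self, now: datetime) -> bool:
        """True once the assumed 7-day window has elapsed."""
        return now >= self.created_at + SIGNED_URL_LIFETIME


created = datetime(2025, 1, 1, tzinfo=timezone.utc)
image = GeneratedImage(
    file_id="file_123",  # placeholder value
    download_url="https://example.com/signed-url",  # placeholder value
    width=1024,
    height=1024,
    mime_type="image/png",
    size_bytes=1_500_000,
    created_at=created,
)
print(image.url_expired(created + timedelta(days=6)))  # False
print(image.url_expired(created + timedelta(days=7)))  # True
```

Pipelines that run longer than the URL lifetime should store the file_id rather than the download link, on the assumption that the file itself stays in File Manager after the link lapses.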
Who it's for
Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution.
Frequently Asked Questions
How do I connect this tool to an external agent?
You can install the local MCP server by opening a terminal and running:
Install commands
npm install -g @agentpmt/mcp-router
agentpmt-setup
This will connect you to local agents like Claude Code, Windsurf, Grok Build, Cursor, etc.
Alternatively, you can connect to the hosted version with this config block; no installation is required:
Hosted MCP config
{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}
View MCP Connection Instructions for more details.
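If you would rather not paste the Bearer token into a file by hand, the config can be generated from an environment variable. This Python sketch reproduces the hosted config block above. The variable name `AGENTPMT_BEARER_TOKEN` simply reuses the placeholder from the config and is an assumption, as is the helper name `hosted_mcp_config`.

```python
import json
import os


def hosted_mcp_config(token: str) -> dict:
    """Return the hosted MCP config shown above with the token filled in."""
    # The metadata header is itself a JSON string, so it is encoded separately.
    metadata = json.dumps(
        {"client": "generic-mcp", "platform": "remote"}, separators=(",", ":")
    )
    return {
        "mcpServers": {
            "agentpmt": {
                "type": "streamable-http",
                "url": "https://api.agentpmt.com/mcp",
                "headers": {
                    "Authorization": f"Bearer {token}",
                    "x-instance-metadata": metadata,
                },
            }
        }
    }


# Assumed environment variable name; falls back to the literal placeholder.
token = os.environ.get("AGENTPMT_BEARER_TOKEN", "<AGENTPMT_BEARER_TOKEN>")
print(json.dumps(hosted_mcp_config(token), indent=2))
```

Where the output should be written depends on the client, so the sketch only prints it.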
How does an external agent use this tool?
After the external agent is connected to an Agent Group that can use this tool, paste this prompt into the agent:
Agent prompt
Use the AgentPMT-Tool-Search-and-Execution tool. First call action 'get_instructions' so you know how to use the tool search interface. Then call action 'get_schema' with tool_id 6a054f5c90a57115271c1316 ("Image Generation Agent"). After reading the schema and any returned instructions, tell me what this tool can do; we are going to be using it.
The agent should fetch the tool schema first, collect the required parameters for your request, and then call the tool through AgentPMT.
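For readers curious what that sequence might look like on the wire, the Python sketch below builds the two discovery requests as MCP `tools/call` messages (JSON-RPC 2.0). The tool name, action names, and tool_id come from the prompt above. The argument keys `action` and `tool_id` are assumptions based on the prompt's wording; the authoritative shape is whatever `get_instructions` and `get_schema` return.

```python
import json

TOOL_NAME = "AgentPMT-Tool-Search-and-Execution"
TOOL_ID = "6a054f5c90a57115271c1316"  # "Image Generation Agent"


def tool_call(request_id: int, arguments: dict) -> dict:
    """Wrap arguments in an MCP tools/call JSON-RPC 2.0 request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": TOOL_NAME, "arguments": arguments},
    }


def discovery_requests() -> list:
    """The two calls an agent makes before using the tool, in order.

    Argument keys are assumed, not taken from a published schema.
    """
    return [
        tool_call(1, {"action": "get_instructions"}),
        tool_call(2, {"action": "get_schema", "tool_id": TOOL_ID}),
    ]


for request in discovery_requests():
    print(json.dumps(request))
```

Only after reading the schema should the agent assemble the actual generation call, since required parameters are defined there.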
Can I edit an existing image or keep a product consistent across scenes?
Yes. Attach up to four reference images to guide subject, style, or scene. Reference-image editing keeps the same subject across new backgrounds, so you can place the same product in different settings or carry one look through an entire campaign.
What aspect ratios are supported?
14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, and 4:5, plus ultra-wide 4:1 and 8:1 banner formats — so a single prompt can produce variants for an omnichannel campaign.
What is the Image Generation Agent?
An AI image generator powered by Nano Banana (Google Gemini 3 Flash Image). Create photorealistic product photos, marketing creative, social graphics, icons, concept art, and brand assets from a text prompt, or edit an existing image with reference photos — directly in chat, agents, and workflows.
What resolutions can I generate?
Choose the tier that fits the job: a low-cost budget draft for ideation, efficient 0.5K and 1K finals, crisp 2K for social and product assets, and 4K for print, ads, and large-format use.
Where do my images go after they're generated?
Every image is auto-saved to AgentPMT File Manager and returned with a download link, file_id, width, height, MIME type, and size — ready to drop into chat, hand to another tool, or use in an automated workflow.
Who is it built for?
Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution without leaving the conversation.


