freshcrate
Skin:/
Home > MCP Servers > banana-claude

banana-claude

AI image generation skill for Claude Code -- Creative Director powered by Gemini

Why this rank:Strong adoptionRelease freshnessHealthy release cadence

Description

AI image generation skill for Claude Code -- Creative Director powered by Gemini

README

Banana Claude

Banana Claude

AI image generation skill for Claude Code where Claude acts as Creative Director using Google's Gemini Nano Banana models.

Unlike simple API wrappers, Claude interprets your intent, selects domain expertise, constructs optimized prompts using Google's official 5-component formula, and orchestrates Gemini for the best possible results.

Claude Code Skill Version License: MIT

Blog: See banana-claude in action

Table of Contents

Installation

Plugin Install (Recommended)

Add the marketplace and install:

/plugin marketplace add AgriciDaniel/banana-claude
/plugin install banana-claude@banana-claude-marketplace

Or test locally:

git clone --depth 1 https://github.com/AgriciDaniel/banana-claude.git
claude --plugin-dir ./banana-claude
Standalone Install (without plugin system)
git clone --depth 1 https://github.com/AgriciDaniel/banana-claude.git
bash banana-claude/install.sh

One-liner (curl):

curl -fsSL https://raw.githubusercontent.com/AgriciDaniel/banana-claude/main/install.sh | bash

With MCP Setup:

git clone --depth 1 https://github.com/AgriciDaniel/banana-claude.git
cd banana-claude
./install.sh --with-mcp YOUR_API_KEY

Get a free API key at Google AI Studio.

Quick Start

# Start Claude Code
claude

# Generate an image
/banana generate "a hero image for a coffee shop website"

# Edit an existing image
/banana edit ~/photo.png "remove the background"

# Multi-turn creative session
/banana chat

# Browse 2,500+ prompt database
/banana inspire

Claude will ask about your brand, select the right domain mode (Cinema, Product, Portrait, Editorial, UI, Logo, Landscape, Infographic, Abstract), construct a detailed prompt with lighting and composition, set the right aspect ratio, and generate.

Banana Claude in action

Commands

Command Description
/banana Interactive -- Claude detects intent and guides you
/banana generate <idea> Full Creative Director pipeline
/banana edit <path> <instructions> Intelligent image editing
/banana chat Multi-turn visual session (maintains consistency)
/banana inspire [category] Browse 2,500+ prompt database
/banana batch <idea> [N] Generate N variations (default: 3)
/banana setup Configure MCP and API key
/banana preset [list|create|show|delete] Manage brand/style presets
/banana cost [summary|today|estimate] View cost tracking and estimates

How It Works

Creative Director Pipeline

What Makes This Different

  • Intent Analysis -- Understands what you actually need (blog header? app icon? product shot?)
  • Domain Expertise -- Selects the right creative lens (Cinema, Product, Portrait, Editorial, UI, Logo, Landscape, Infographic, Abstract)
  • 5-Component Prompt Formula -- Constructs prompts with Subject + Action + Location/Context + Composition + Style (includes lighting)
  • Prompt Adaptation -- Translates patterns from a 2,500+ curated prompt database to Gemini's natural language format
  • Post-Processing -- Crops, removes backgrounds, converts formats, resizes for platforms
  • Batch Variations -- Generates N variations rotating different components
  • Session Consistency -- Maintains character/style across multi-turn conversations
  • 4K Resolution Output -- Up to 4096ร—4096 with imageSize control
  • 14 Aspect Ratios -- Including ultra-wide 21:9 for cinematic compositions

The 5-Component Prompt Formula

Prompt Formula

Instead of sending "a cat in space" to Gemini, Claude constructs:

A medium shot of a tabby cat floating weightlessly inside the cupola module of the International Space Station, paws outstretched toward a floating droplet of water, Earth visible through the circular windows behind. Soft directional light from the windows illuminates the cat's fur with a blue-white rim light, while the interior has warm amber instrument panel glow. Captured with a Canon EOS R5, 35mm f/2.0 lens, slight barrel distortion emphasizing the curved module interior. In the style of a National Geographic cover story on the ISS, with the sharp documentary clarity of NASA mission photography.

Components used: Subject (tabby cat, physical detail) โ†’ Action (floating, paw gesture) โ†’ Location/Context (ISS cupola, Earth visible) โ†’ Composition (medium shot, curved framing) โ†’ Style (Canon R5, National Geographic documentary, directional window light + amber instruments)

Domain Modes

Domain Modes

Mode Best For Example
Cinema Dramatic, storytelling "A noir detective scene in a rain-soaked alley"
Product E-commerce, packshots "Photograph my handmade candle for Etsy"
Portrait People, characters "A cyberpunk character portrait for my game"
Editorial Fashion, lifestyle "Vogue-style fashion shot for my brand"
UI/Web Icons, illustrations "A set of onboarding illustrations"
Logo Branding, identity "A minimalist logo for a tech startup"
Landscape Backgrounds, wallpapers "A misty mountain sunrise for my desktop"
Infographic Data, diagrams "Visualize our Q1 sales growth"
Abstract Generative art, textures "Voronoi tessellation in neon gradients"

Models

Model ID Notes
Flash 3.1 (default) gemini-3.1-flash-image-preview Fastest, newest, 14 aspect ratios, up to 4K
Flash 2.5 gemini-2.5-flash-image Stable fallback

Architecture

banana-claude/                         # Claude Code Plugin
โ”œโ”€โ”€ .claude-plugin/
โ”‚   โ”œโ”€โ”€ plugin.json                    # Plugin manifest
โ”‚   โ””โ”€โ”€ marketplace.json               # Marketplace catalog
โ”œโ”€โ”€ skills/banana/                     # Main skill
โ”‚   โ”œโ”€โ”€ SKILL.md                       # Creative Director orchestration (v1.4)
โ”‚   โ”œโ”€โ”€ references/
โ”‚   โ”‚   โ”œโ”€โ”€ prompt-engineering.md      # 5-component formula, banned keywords, safety rephrase
โ”‚   โ”‚   โ”œโ”€โ”€ gemini-models.md           # Model specs, rate limits, capabilities
โ”‚   โ”‚   โ”œโ”€โ”€ mcp-tools.md              # MCP tool parameters and responses
โ”‚   โ”‚   โ”œโ”€โ”€ post-processing.md        # ImageMagick/FFmpeg pipelines, green screen
โ”‚   โ”‚   โ”œโ”€โ”€ cost-tracking.md          # Pricing table, usage guide
โ”‚   โ”‚   โ””โ”€โ”€ presets.md                # Brand preset schema and examples
โ”‚   โ””โ”€โ”€ scripts/
โ”‚       โ”œโ”€โ”€ setup_mcp.py              # Configure MCP in Claude Code
โ”‚       โ”œโ”€โ”€ validate_setup.py         # Verify installation
โ”‚       โ”œโ”€โ”€ generate.py               # Direct API fallback -- generation
โ”‚       โ”œโ”€โ”€ edit.py                   # Direct API fallback -- editing
โ”‚       โ”œโ”€โ”€ cost_tracker.py           # Cost logging and summaries
โ”‚       โ”œโ”€โ”€ presets.py                # Brand/style preset management
โ”‚       โ””โ”€โ”€ batch.py                  # CSV batch workflow parser
โ””โ”€โ”€ agents/
    โ””โ”€โ”€ brief-constructor.md           # Subagent for prompt construction

Requirements

  • Claude Code
  • Node.js 18+ (for npx)
  • Google AI API key (free tier: ~5-15 RPM / ~20-500 RPD, cut ~92% Dec 2025)
  • ImageMagick (optional, for post-processing)

Uninstall

Plugin:

/plugin uninstall banana-claude@banana-claude-marketplace

Standalone:

bash banana-claude/install.sh --uninstall

Contributing

Contributions welcome! Please open an issue or submit a pull request.

License

MIT License -- see LICENSE for details.


Built for Claude Code by @AgriciDaniel


Author

Built by Agrici Daniel - AI Workflow Architect.

Release History

VersionChangesUrgencyDate
v1.4.1## What's Changed Restructured as an official Claude Code plugin for Anthropic marketplace submission. ### Changed - Restructured as official Claude Code plugin (`.claude-plugin/plugin.json` manifest) - Added marketplace catalog (`.claude-plugin/marketplace.json`) for distribution via `/plugin marketplace add` - Moved `banana/` to `skills/banana/` (standard plugin layout) - Moved `.claude/agents/` to `agents/` (standard plugin layout) - Plugin install is now the primary installation method - UMedium3/27/2026
v1.4.0## What's New in v1.4.0 ### Breaking Changes - **5-Component Prompt Formula** replaces the old 6-component system -- based on Google's official validation - **Banned Keywords** -- "8K", "masterpiece", "ultra-realistic", "high resolution" now degrade output. Use prestigious context anchors instead - **Default resolution** changed from 1K to 2K in fallback scripts - **Dead model removed** -- `gemini-3-pro-image-preview` shut down March 9, 2026 ### Added - Banned Keywords section with Stable DiffLow3/19/2026
v2.1.0## What's New ### New Gemini API Features - **4K resolution output** via `imageSize` parameter (512, 1K, 2K, 4K) - **5 new aspect ratios**: 2:3, 3:2, 4:5, 5:4, 21:9 (14 total) - **Thinking levels**: minimal/low/medium/high for complex compositions - **Search grounding**: Google Search integration for real-world visual references - **Multi-image input**: Up to 14 reference images for character/style consistency - **Image-only output mode**: `responseModalities: ["IMAGE"]` - **Safety filter docs*Low3/13/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

comfy-pilot๐Ÿค– Create and modify workflows effortlessly with ComfyUI's AI assistant, enabling natural conversations with agents like Claude and Gemini.main@2026-06-08
nix-aiYour AI coding toolkit, declared in Nix โ€” Claude, Gemini, Copilot, 15+ MCP servers, one flakev1.73.2
notebooklm-pyProvide full Python API access to NotebookLM features, including advanced functions beyond the web interface, via CLI and AI agent integration.main@2026-06-05
Auto-claude-code-research-in-sleepARIS โš”๏ธ (Auto-Research-In-Sleep) โ€” Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in โ€” works wiv0.4.16
mcp-tidy๐Ÿงน Simplify your MCP servers with mcp-tidy, clearing server bloat to enhance performance and improve tool selection in Claude Code.main@2026-06-07

More from AgriciDaniel

claude-adsComprehensive paid advertising audit & optimization skill for Claude Code. 225+ checks across Google, Meta, YouTube, LinkedIn, TikTok, Microsoft & Apple Search Ads with weighted scoring, parallel agen

More in MCP Servers

bifrostFastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 ยตs overhead at 5k RPS.
claude-code-plugins-plus-skills423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
npcpyThe python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
oxidetermAll-in-one terminal workspace โ€” local shells, SSH, SFTP, remote IDE, AI agent, and file manager in a single native binary. Built with Tauri 2 and pure Rust SSH (no OpenSSL). Smart reconnect, MCP, RAG,