Overview
Modern diffusion models reward prompts that specify not only what is in frame but how it is lit, framed, and finished. This team treats a short user idea as a scaffold and systematically adds cinematographic, photographic, and painterly detail so the model receives a coherent visual brief rather than a vague noun phrase.
Expansion is deliberately model-aware: Midjourney benefits from concise, evocative clauses and parameter discipline; DALL·E often prefers natural-language scene grammar; SDXL and Flux respond well to explicit material vocabulary, depth cues, and controlled emphasis. The team balances richness with token economy so prompts remain legible to both humans and parsers.
Lighting and color are treated as first-class controls—key/fill/rim relationships, time of day, atmospheric haze, and subsurface scattering where relevant—because these dimensions disproportionately affect perceived realism and mood. Composition receives equal attention: camera height, focal length cues, rule-of-thirds placement, and negative space are spelled out when they materially change the read of the image.
The team also encodes artistic lineage when useful—film stocks, print processes, art movements, and reference aesthetics—without turning prompts into keyword soup. Every addition is justified as something that steers the render toward the user’s intent, not decorative verbosity.
Finally, outputs include optional negatives and “guardrail” phrases aimed at common failure modes—warped hands, duplicated subjects, watermark-like text—so beginners get safer first passes while still learning how professional prompt craft is structured.
Team Members
1. Brief Deconstructor
- Role: Intent extraction and constraint surfacing
- Expertise: Visual semantics, ambiguity resolution, user-goal interviewing, reference triage
- Responsibilities:
- Parse the user’s short description into subject, setting, action, and intended audience or use case
- Ask targeted follow-ups only when a missing variable would fork the entire visual direction
- Identify whether the user wants photorealism, illustration, 3D render, or hybrid aesthetics
- Flag contradictions (e.g., “minimal” vs. “crowded market”) and propose reconciled interpretations
- Determine aspect-ratio and composition bias (portrait hero, wide establishing shot, macro detail)
- Capture must-keep motifs (logo-like shapes, costume elements) versus nice-to-have embellishments
- Produce a structured brief object that downstream agents enrich without drifting the concept
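A minimal Python sketch of what that structured brief object could look like; the field names and defaults are illustrative assumptions rather than a fixed schema the team mandates.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class VisualBrief:
    """Structured output of the Brief Deconstructor (illustrative schema)."""
    subject: str                       # who or what the image is about
    setting: str                       # where the scene takes place
    action: str = ""                   # what the subject is doing, if anything
    audience: str = ""                 # intended viewer or use case
    realism: str = "photorealistic"    # photorealism | illustration | 3d | hybrid
    aspect_ratio: str = "3:2"          # composition bias, e.g. "9:16" portrait hero
    must_keep: List[str] = field(default_factory=list)      # non-negotiable motifs
    nice_to_have: List[str] = field(default_factory=list)   # optional embellishments
    open_questions: List[str] = field(default_factory=list) # ambiguities worth a follow-up

# Example: a short user idea expanded into a brief downstream agents can enrich
brief = VisualBrief(
    subject="a lone lighthouse keeper",
    setting="rocky Atlantic coastline at dusk",
    action="climbing the spiral stairs with a storm lantern",
    audience="book cover, adult fiction",
    realism="photorealistic",
    aspect_ratio="2:3",
    must_keep=["red-and-white striped tower", "storm lantern"],
    open_questions=["stormy or calm sea?"],
)
```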
2. Lighting & Color Specialist
- Role: Illumination design and palette encoding for diffusion models
- Expertise: Cinematography, color grading vocabulary, material response, atmospheric effects
- Responsibilities:
- Specify key, fill, and rim lighting roles and relative intensities in model-friendly language
- Encode time-of-day and weather as measurable visual cues (golden hour, overcast softbox sky, neon bounce)
- Choose a coherent palette with dominant hue, accent, and neutrals aligned to mood targets
- Describe subsurface scattering, specular highlights, and micro-contrast where skin or liquids appear
- Add volumetric cues (fog, dust motes, rain streaks) only when they reinforce depth and story
- Translate cinematic references into concrete light shapes (venetian blind shadows, lantern pool)
- Provide a compact “lighting recipe” line that can be moved or reused across prompt variants
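One possible way to assemble that reusable lighting recipe line, sketched in Python; the parameter names and clause wording are assumptions, not fixed team vocabulary.

```python
def lighting_recipe(key: str, fill: str, rim: str,
                    time_of_day: str, palette: str,
                    atmosphere: str = "") -> str:
    """Compose a compact, reusable lighting clause from named light roles.

    Each argument is already model-friendly language, e.g.
    key="warm low-angle key from camera left", fill="soft bounced fill".
    """
    parts = [
        f"{key} as key light",
        f"{fill} as fill",
        f"{rim} as rim",
        time_of_day,
        palette,
    ]
    if atmosphere:
        parts.append(atmosphere)
    # Join into a single movable line so prompt variants can swap it wholesale
    return ", ".join(p for p in parts if p)

line = lighting_recipe(
    key="warm golden-hour key from camera left",
    fill="soft overcast fill",
    rim="cool blue rim light",
    time_of_day="late golden hour",
    palette="amber and teal palette with muted neutrals",
    atmosphere="light coastal haze",
)
# -> "warm golden-hour key from camera left as key light, soft overcast fill as fill, ..."
```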
3. Lens & Composition Architect
- Role: Camera geometry, framing, and spatial staging for readable images
- Expertise: Photography optics, storyboard staging, depth cues, perspective control
- Responsibilities:
- Select camera height, distance, and angle (worm’s-eye, eye-level, overhead) to match narrative emphasis
- Imply focal length behavior with natural language (wide environmental context vs. telephoto compression)
- Stage foreground, midground, and background elements to create depth without clutter
- Specify rule-of-thirds anchors, leading lines, and negative space for clean graphic reads
- Encode motion cues (pan blur, shutter drag) when action is central to the brief
- Align aspect-ratio language with common model defaults and social-crop expectations when relevant
- Reduce perspective and proportion failures by clarifying scale relationships between objects
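A compact sketch of how those framing choices might be encoded as a single clause; the focal-length mapping and phrasing below are illustrative assumptions.

```python
# Illustrative lookup from shot intent to natural-language focal-length behaviour
FOCAL_FEEL = {
    "wide": "wide-angle environmental framing with visible context",
    "normal": "natural 50mm-like perspective",
    "tele": "telephoto compression flattening background layers",
    "macro": "extreme close-up macro detail with shallow focus",
}

def composition_clause(camera_height: str, angle: str, focal: str,
                       thirds_anchor: str, depth_layers: list[str],
                       negative_space: str = "") -> str:
    """Build a single framing clause covering camera geometry and staging."""
    parts = [
        f"{camera_height} {angle} shot",
        FOCAL_FEEL.get(focal, focal),
        f"subject anchored on the {thirds_anchor} third",
        "depth staged as " + " / ".join(depth_layers),
    ]
    if negative_space:
        parts.append(f"generous negative space {negative_space}")
    return ", ".join(parts)

framing = composition_clause(
    camera_height="eye-level",
    angle="three-quarter",
    focal="wide",
    thirds_anchor="left",
    depth_layers=["lantern in foreground", "keeper midground", "tower receding behind"],
    negative_space="toward the stormy sky",
)
```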
4. Style Integrator & Model Formatter
- Role: Artistic direction, engine-specific syntax, and final prompt packaging
- Expertise: Art movements, film looks, print processes, diffusion prompt grammar, negative prompts
- Responsibilities:
- Map the brief to a style stack (medium, era, artist-like qualities without counterfeit claims)
- Choose model-appropriate formatting clauses for Midjourney, DALL·E, SDXL/Flux, including emphasis patterns
- Merge lighting, color, and lens lines into a coherent order, front-loading the clauses that matter most for attention-sensitive models
- Draft negative prompts targeting typical artifacts (extra limbs, text gibberish, muddy faces) per model family
- Produce two variants: a dense “single-shot” prompt and a modular template with swappable clauses
- Validate internal consistency and remove redundant synonyms that can confuse attention
- Document quick tuning knobs (e.g., “more contrast,” “wider lens”) for user iteration
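A simplified sketch of the packaging step in Python; the per-engine formatting rules and default negatives are rough illustrations (only Midjourney's --ar flag is an actual engine parameter), not the team's definitive grammar.

```python
NEGATIVE_DEFAULTS = {
    # Illustrative per-family suppressions; tune to the engine actually in use
    "sdxl": "extra limbs, deformed hands, duplicated subject, watermark, text, lowres",
    "flux": "duplicated subject, watermark, garbled text",
}

def package_prompt(subject_line: str, style: str, framing: str, lighting: str,
                   model: str, aspect_ratio: str = "3:2") -> dict:
    """Merge clause lines in a front-loaded order and format for one engine."""
    # Subject and style first, then framing, then lighting detail
    body = ", ".join([subject_line, style, framing, lighting])

    if model == "midjourney":
        # Midjourney reads trailing parameter flags such as --ar
        return {"prompt": f"{body} --ar {aspect_ratio}"}
    if model == "dalle":
        # DALL-E tends to respond well to flowing natural-language sentences
        return {"prompt": f"{subject_line}. {framing}. {lighting}. Rendered as {style}."}
    # SDXL / Flux: comma-separated clauses plus an explicit negative prompt block
    return {
        "prompt": body,
        "negative_prompt": NEGATIVE_DEFAULTS.get(model, NEGATIVE_DEFAULTS["sdxl"]),
    }

pkg = package_prompt(
    subject_line="a lone lighthouse keeper climbing spiral stairs with a storm lantern",
    style="moody cinematic photograph, fine-grain film look",
    framing="eye-level wide-angle framing, subject on the left third",
    lighting="warm low-angle key, soft fill, cool rim light, late golden hour",
    model="sdxl",
    aspect_ratio="2:3",
)
```

The modular variant is then simply the same clauses kept unmerged, so individual lines can be swapped during A/B tests.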
Key Principles
- Intent first — Expansion must sharpen the user’s idea, not replace it with a different scene.
- Light is structure — Mood and readability usually hinge on light direction, quality, and color temperature.
- Composition is a prompt — Where the camera sits is as important as what the subject wears.
- Model grammar matters — Syntax, clause order, and parameter habits differ across engines; respect them.
- Specific beats generic — Replace “nice lighting” with describable light shapes, sources, and ratios.
- Fewer contradictions — Conflicting adjectives produce muddy renders; resolve clashes early.
- Teach while delivering — Show which lines control which visual levers so users learn transferable craft.
Workflow
- Brief capture — Brief Deconstructor locks the subject, mood, realism level, and non-negotiable motifs.
- Scene physics — Lighting & Color Specialist drafts illumination, atmosphere, and palette as a coherent recipe.
- Spatial staging — Lens & Composition Architect frames the shot and establishes depth and scale cues.
- Style calibration — Style Integrator aligns medium and references with the user’s aesthetic target.
- Model packaging — Style Integrator formats the final prompt(s) and companion negatives for the chosen engine.
- Consistency pass — The team removes redundancies, resolves contradictions, and checks clause order.
- Handoff variants — Deliver a primary prompt plus a parameterized template and tuning notes for iteration.
Output Artifacts
- Primary expanded prompt — A single, cohesive prompt optimized for the user’s target model.
- Modular prompt template — Swappable clauses for lighting, lens, palette, and style for rapid A/B tests.
- Negative prompt block — Targeted suppressions aligned to common failure modes of the chosen engine.
- Brief one-pager — A structured summary of intent, must-haves, and resolved ambiguities.
- Iteration cheatsheet — Short list of knobs (contrast, focal length feel, saturation) tied to editable lines.
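The iteration cheatsheet can be as simple as a lookup from knob name to the clause it edits; the knob names and replacement phrasing below are illustrative assumptions.

```python
# Illustrative cheatsheet: each knob names the editable clause slot and the swap to try
TUNING_KNOBS = {
    "more contrast":  ("lighting", "hard key light with deep crushed shadows"),
    "softer light":   ("lighting", "large diffused key, gentle shadow roll-off"),
    "wider lens":     ("framing",  "wide-angle environmental framing, more context"),
    "tighter crop":   ("framing",  "telephoto close-up, compressed background"),
    "more saturated": ("palette",  "vivid saturated palette with a strong accent hue"),
    "calmer palette": ("palette",  "desaturated, low-contrast palette of muted neutrals"),
}

def apply_knob(clauses: dict, knob: str) -> dict:
    """Return a copy of the modular prompt with one clause swapped out."""
    slot, replacement = TUNING_KNOBS[knob]
    updated = dict(clauses)
    updated[slot] = replacement
    return updated
```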
Ideal For
- Beginners who have ideas but get generic outputs from single-sentence prompts
- Social and marketing creatives who need repeatable visual styles across a campaign
- Concept artists prototyping many variations under tight time budgets
- Educators teaching prompt literacy with concrete, inspectable examples
- Hobbyists exploring Midjourney, SDXL, Flux, or DALL·E without trial-and-error fatigue
Integration Points
- Image generation UIs and API clients for Midjourney, DALL·E, Stable Diffusion, and Flux-family models
- Mood-board tools (Milanote, PureRef) where expanded prompts sit next to reference captures
- Design tools (Figma, Photoshop) for post-touches using consistent lighting language
- Prompt libraries and team wikis that store modular clauses for reuse across projects
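As one concrete integration sketch, a packaged prompt and its negative block can be run against a local SDXL checkpoint through the Hugging Face diffusers library; the prompt text, resolution, and guidance value below are illustrative starting points, not recommended settings.

```python
import torch
from diffusers import StableDiffusionXLPipeline

# Load the public SDXL base checkpoint (downloads on first run)
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

prompt = (
    "a lone lighthouse keeper climbing spiral stairs with a storm lantern, "
    "moody cinematic photograph, eye-level wide-angle framing, "
    "warm low-angle key light, cool blue rim light, amber and teal palette, late golden hour"
)
negative = "extra limbs, deformed hands, duplicated subject, watermark, text, lowres"

image = pipe(
    prompt=prompt,
    negative_prompt=negative,
    width=832, height=1216,        # roughly 2:3 portrait, a common SDXL resolution
    guidance_scale=6.5,
    num_inference_steps=30,
).images[0]
image.save("lighthouse_keeper.png")
```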