AI Prompt Assistant

Delphin Video Prompt Chat

Chat with AI to craft the perfect video generation prompts. Get expert suggestions, refine your ideas, and explore curated prompt examples below.

Sign in is required to start a prompt chat session. Sign in to continue.

How can I help you today?

Ask me anything about video generation prompts. I can help you write, refine, or translate your ideas into effective prompts.

Press Enter to send, Shift+Enter for new line. Free for all paid subscribers.

Prompt Examples

Ocean Sunset Timelapse

A breathtaking timelapse of a golden sunset over the Pacific Ocean, waves gently crashing on a rocky shoreline, warm amber and pink hues reflecting on the water surface, cinematic 4K quality, smooth camera slowly panning right

NatureTimelapseCinematic

Cyberpunk City Night

A neon-lit cyberpunk city street at night, rain-soaked pavement reflecting holographic advertisements, flying cars passing overhead, a lone figure walking with an umbrella, Blade Runner aesthetic, moody blue and purple lighting

Sci-FiUrbanAtmospheric

Coffee Pour in Slow Motion

Extreme slow-motion close-up of espresso being poured into a ceramic cup, rich crema forming on the surface, steam rising elegantly, warm studio lighting with a soft bokeh background, product advertisement style

ProductSlow MotionClose-up

Astronaut on Mars

An astronaut walking across the rust-red Martian landscape, dust swirling around their boots, Earth visible as a tiny blue dot in the orange sky, dramatic long shadows from the low sun, NASA-style documentary cinematography, wide establishing shot

Sci-FiSpaceDocumentary

Prompt Writing Tips

  • 1.Be specific — describe lighting, camera angle, mood, and style
  • 2.Set the motion — specify camera movement (pan, zoom, tracking shot)
  • 3.Reference styles — mention film directors, genres, or visual aesthetics
  • 4.Include details — textures, colors, atmosphere, and time of day matter
  • 5.Keep it focused — one clear scene per prompt works best

Prompt Guides

Delphin prompt guides and workflow notes

Everything that used to live on the separate guides pages is now collected below the chat experience, so you can refine prompts and read the supporting guidance in one place.

What Is OpenAI Images 2.0? ChatGPT Images 2.0 Explained

Understand what OpenAI Images 2.0 actually refers to, how it maps to ChatGPT Images 2.0 and gpt-image-2, and why creators are paying attention.

Step-by-step workflow

1. Separate the product name from the model name

Think of ChatGPT Images 2.0 as the public product label, then map the actual developer surface to the official Images API and gpt-image-2.

2. Judge it by workflow, not by hype

Look at whether you need stronger typography, cleaner compositional control, and a reliable edit loop rather than only chasing a viral release.

3. Match the tool to the output type

It is especially useful when your team needs marketing creatives, concept art, UI mockups, product visuals, or reference-based edits that still fit a production workflow.

What Images 2.0 actually refers to

The phrase OpenAI Images 2.0 is a convenient shorthand, but the official naming is split across product and API surfaces. On the product side, OpenAI announced ChatGPT Images 2.0 on April 21, 2026. On the developer side, the current model and documentation point to gpt-image-2 through the official image generation and edit APIs.

That distinction matters because teams often confuse a viral ChatGPT feature launch with API availability. In this case, the good news is that the two are connected. The product release has a real developer story behind it rather than being limited to the ChatGPT interface.

Why creators and product teams are paying attention

The biggest reason is not just image quality in the abstract. It is that the system is trying to solve more practical image problems: directed composition, sharper text handling, cleaner brand-style iteration, and a more useful edit loop when you already have inputs.

For many teams, that changes the economics of image production. Instead of using one tool for ideation, another for edits, and a third for final cleanup, OpenAI is making a stronger case for a single image workflow that can start rough and become production-grade through iteration.

Text and layout fidelity

This matters for posters, social creatives, product packaging, UI mockups, and any image where words inside the frame are part of the actual deliverable rather than a happy accident.

Editability instead of one-shot prompting

OpenAI's image docs now frame edits as a first-class workflow. That is important because most real teams are not generating from zero every time. They are revising an existing asset, a brand image, or a draft creative.

A cleaner deployment story

Once a model has an official API path, it becomes much easier to justify in a real product. Billing, auth, moderation, rate limits, and server-side routing all become easier to reason about than an interface-only release.

Where Images 2.0 fits best right now

Images 2.0 is strongest when the output has to survive contact with the real world. If the image is going into an ad account, product detail page, investor deck, pitch mockup, or on-site creative workflow, precision and editability matter more than raw novelty.

What teams should verify before committing

Even when the official story is strong, production adoption still needs a checklist. The right question is not whether the model is impressive. The right question is whether your organization can deploy it safely, predictably, and economically.

FAQ

Is Images 2.0 the same thing as gpt-image-2?

Not exactly. Images 2.0 is the public shorthand for the current OpenAI image release, while gpt-image-2 is the API-facing model name documented for developers.

Can I use Images 2.0 outside ChatGPT?

Yes. OpenAI's current developer docs describe official image generation and image edit workflows, so the capability is not limited to the ChatGPT interface.

Why is this release more useful than a normal image-model announcement?

Because it pairs a consumer launch with a real API path. That makes it immediately more relevant for websites, creator tools, and internal production systems.

Related resources

Nano Banana vs OpenAI Images 2.0

Compare Nano Banana and OpenAI Images 2.0 across naming, workflow, editing depth, and developer fit so you can choose the right image stack.

Step-by-step workflow

1. Clarify the naming before comparing output

Treat Nano Banana as the Google-side image experience and Images 2.0 as OpenAI's current image release so you are not mixing product branding with API labels.

2. Choose based on your bottleneck

If you mostly need faster ideation, your answer may differ from a team that needs typography, revisions, and shipping-ready creative assets.

3. Compare the ecosystems, not only the samples

API maturity, edit flow, platform fit, and how the model plugs into your current stack often matter more than one viral side-by-side image.

This is partly a naming problem

A lot of confusion in this comparison comes from vocabulary. On the Google side, many people say Nano Banana when they mean the newer Gemini image-generation experience. On the OpenAI side, many people say Images 2.0 when they mean the current ChatGPT release plus the gpt-image-2 developer path.

Once you clear that up, the comparison becomes much more useful. You are not really comparing two isolated screenshots. You are comparing two image ecosystems, each with a different strength profile and a different integration story.

Where Nano Banana tends to win

Nano Banana often makes the strongest case when the team values fast ideation, playful prompt iteration, and a workflow that already lives inside the broader Gemini stack. If the main job is to generate many loose options quickly and decide later, that kind of speed can matter more than surgical control.

Where Images 2.0 tends to win

OpenAI Images 2.0 makes a stronger case when precision is the point. If your image has to include better text handling, fit a brand direction, or survive several rounds of reference-based edits, OpenAI's official image stack currently has the cleaner story.

Text inside the frame

This is where a lot of practical product work either succeeds or falls apart. The more your team cares about poster copy, packaging labels, signage, or UI compositions, the more valuable stronger text fidelity becomes.

Edit loops and references

OpenAI's current docs clearly support image edits, which matters for brand teams and product teams that rarely start from zero. They usually start from an existing mockup, shot, or campaign draft and need controlled revision.

Official API fit

If you want to wire image generation directly into a website, product dashboard, or internal creative tool, OpenAI now offers a clearer official route. That reduces ambiguity for engineering teams making production decisions.

Which one should a product team choose

Choose Nano Banana when your biggest constraint is exploration speed. Choose Images 2.0 when your biggest constraint is output control. A lot of teams will even use both at different stages: Google-side ideation earlier, OpenAI-side finishing later.

FAQ

Is Nano Banana an official public API model name?

Not in the same clean way that gpt-image-2 is documented by OpenAI. In practice, people use Nano Banana as a Google-side product or community label while developers work through Gemini image generation documentation.

Which model is better for typography and layout-sensitive images?

Images 2.0 is usually the safer choice when text fidelity and controlled layout matter more than raw ideation speed.

Should I choose based on a single benchmark image?

No. Integration quality, edit support, consistency, and ecosystem fit are usually more important than one attractive sample.

Related resources

DeepSeek vs OpenAI Images 2.0

Compare DeepSeek and OpenAI Images 2.0 the right way, then use a curated set of DeepSeek resources to decide whether you need open research or a managed image API.

Step-by-step workflow

1. Decide whether you need an open ecosystem or a managed API

This is the first filter. DeepSeek is more compelling when you want open research and flexibility, while OpenAI is more compelling when you want a production-ready image service.

2. Separate hosted products from research assets

Do not compare a polished API product directly against a research repo without accounting for deployment work, tooling, and operational overhead.

3. Use the best of both where it makes sense

Many teams can use DeepSeek for reasoning, prompt planning, or open experimentation, then use Images 2.0 for the final managed image-generation layer.

These are not direct one-to-one products

If you frame the comparison as DeepSeek versus OpenAI image quality, you risk flattening two very different ecosystems. DeepSeek's official API docs currently emphasize chat and reasoning models. OpenAI's current story, by contrast, clearly includes an official image API surface with generation and edit flows.

That means the real comparison is open research plus flexibility on one side versus managed deployment plus official image workflows on the other. Which one is better depends far more on your engineering and product constraints than on abstract model identity.

Where DeepSeek is genuinely strong

DeepSeek is especially attractive when you value open assets, inspectability, and the ability to experiment outside a tightly managed vendor boundary. That is why the Janus family matters. It gives technical teams a way to study, adapt, and self-direct parts of the multimodal stack rather than only consume an opaque hosted endpoint.

Where Images 2.0 is stronger today

Images 2.0 is stronger when the question is not what can be researched, but what can be shipped. If your team needs a website to call an official image API, store jobs, manage credits, edit reference images, and keep the whole system maintainable, OpenAI currently offers the cleaner production story.

DeepSeek resources worth reading

If you are evaluating DeepSeek seriously, do not stop at social posts or reposted benchmark screenshots. Start with the official resources below so you can distinguish hosted API capability from open-source research capability.

For the official hosted API

Use DeepSeek's official API docs first. That is where you can verify what the company is actively exposing as a managed platform rather than what the community assumes it supports.

For image and multimodal research

Use the official DeepSeek GitHub organization for Janus, Janus-Pro, and JanusFlow-style resources. Those repos are the clearest signal that DeepSeek's image direction is strongest in open multimodal research and experimentation.

A practical combined workflow

For many teams, the best answer is not either-or. Use DeepSeek where reasoning, planning, and open experimentation help you think better. Use OpenAI Images 2.0 where you need a managed image layer that can plug into a website or production creative tool with less friction.

FAQ

Does DeepSeek have an official hosted image API like OpenAI Images 2.0?

As of April 22, 2026, DeepSeek's official hosted API is easier to verify for chat and reasoning. Its image-generation story is more visible through official open-source projects such as Janus rather than a directly comparable managed image API surface.

Is DeepSeek still useful if I mainly need image generation?

Yes, especially if you care about open multimodal research, self-hosting paths, or pairing reasoning with image workflows. It is just a different kind of value than a managed image API.

What is the simplest production choice for a website today?

If the goal is a managed, official image API that you can wire into a site quickly, Images 2.0 currently has the clearer path.

Related resources

OpenAI Images 2.0 API Integration Guide

Learn how to integrate OpenAI Images 2.0 into a website, what is officially supported today, and how to judge production feasibility before you ship.

Step-by-step workflow

1. Use a server-side adapter, never a browser-side key

Your site should call your own backend route first, then your server should call OpenAI so authentication, credits, moderation, and audit logic stay under your control.

2. Handle prompt-only and edit flows separately

Prompt-only requests should go through image generation, while reference-image requests should route into an image edit flow with stronger input preservation.

3. Prepare an operational fallback plan

Even with an official API, you still need cost controls, rate limits, and a backup provider strategy for high-volume or outage-sensitive products.

Short answer: yes, official integration is viable

OpenAI's current documentation supports official image generation and image edits. In practical terms, that means you can build a website flow where users submit prompts or reference images to your app, your server calls OpenAI, and your product stores the resulting assets or job records just like any other media workflow.

This matters because a lot of image launches are either demo-only or consumer-only. Images 2.0 is different in that the official developer story exists right now.

How the integration should be structured

The safest integration pattern is straightforward: the browser sends requests to your own API, your API validates the request and user, then your server calls OpenAI. That keeps your OpenAI key off the client and gives you a place to enforce product rules before a request reaches the model.

Prompt-only generation

Use this when the user starts from text. Normalize the prompt, map your UI size options to OpenAI-supported output sizes, send the request to the official generation endpoint, then return the resulting image URLs or base64 assets to the frontend.

Reference-image edit flow

Use this when the user uploads an existing visual. Fetch or persist the reference image server-side, then forward it through the official image edit route so the result stays tied to the user's actual input rather than starting from scratch.

Storage, billing, and moderation

Store generation jobs in your database, track credits before you call OpenAI, and decide how much moderation or prompt filtering belongs at your app layer versus the provider layer. This is where a thin adapter architecture helps a lot.

Why this codebase can support it cleanly

This codebase already uses an image adapter registry, server-side API routes, job persistence, and credit accounting. That means the official OpenAI path is not a ground-up rebuild. It is mostly an adapter decision plus a few provider-specific mapping rules.

Current feasibility checklist

Before shipping, a product team should answer a few operational questions. These are the difference between a cool internal prototype and a reliable public feature.

When to keep a fallback provider

Even when the official path is the preferred one, keeping a fallback is still a healthy engineering choice. It gives you negotiating room on cost, lets you serve less demanding use cases with cheaper providers, and protects the product if one provider has an outage or access issue.

FAQ

Can my website call OpenAI directly from the browser?

It should not. Keep the OpenAI key on your server and let the browser talk only to your own backend route.

Do I need a brand-new architecture to add OpenAI image support?

Not if your product already has a provider adapter layer. In this codebase, it fits naturally into the existing server-side image generation flow.

Is image editing officially supported or only raw generation?

OpenAI's current documentation supports both generation and edit workflows, which makes the integration far more practical for real products.

Should I still keep another provider connected?

Usually yes. A fallback provider helps with resilience, cost control, and capacity planning even when OpenAI is your preferred path.

Related resources

How to Write Delphin Video Prompts

Learn a practical Delphin-style prompt-writing process for AI video scenes, better camera direction, and cleaner visual storytelling.

Step-by-step workflow

1. Define the visual goal

Start with the subject, action, and setting so the model knows what the shot is trying to show before style details are layered on.

2. Add camera and motion cues

Translate your idea into framing, movement, and pacing language only where it improves the shot rather than making it noisier.

3. Refine style and atmosphere

Use mood, lighting, and texture language to shape the result, then remove any conflicting directions that blur the scene.

What separates a good prompt from a generic one

A generic prompt leaves the model too much room to invent details that may not fit your goal. A better prompt makes the visual target obvious early, then adds style, motion, and emotional texture in a deliberate order.

How to structure a Delphin-style scene prompt

Think in layers. Start with who or what is in frame. Then describe the action. After that, shape the camera behavior, visual style, and emotional tone.

Subject and scene anchor

Lead with the main subject and location so the frame has a stable center of gravity.

Action and pacing

Describe what changes inside the shot, not just what exists in the shot.

Camera and mood language

Use cinematic wording to shape how the viewer should feel, but keep it tied to the visual event.

FAQ

Should I write prompts like prose or like instructions?

The best prompts usually sit between the two. They should read naturally, but still give clear visual instructions about subject, action, camera, and mood.

Why do text-only prompts often feel generic?

They usually skip specificity. When the subject, action, and scene objective are vague, the model fills the gap with average-looking choices.

Related resources

How to Use Delphin Image Prompts

Learn how to write stronger Delphin-style image prompts with clearer composition, style control, and better use of references.

Step-by-step workflow

1. Describe the core subject clearly

Name the subject, environment, and intended style in the opening phrase so the model gets a reliable frame for the image.

2. Add composition and lighting direction

Use composition cues, camera distance, and lighting language to guide how the image should feel rather than only what it contains.

3. Use references when the visual target is specific

Reference images help lock the composition or mood when your target is more exact than a text-only prompt can easily convey.

How image prompts differ from video prompts

Image prompts need more pressure on composition and still-frame detail. You are not describing a sequence of action, but rather a single visual outcome that should feel intentional and complete.

Prompt elements that matter most for image quality

The strongest image prompts balance clarity with restraint. Too few details can feel generic. Too many details can create noisy or contradictory results.

FAQ

Should I use style words at the beginning of an image prompt?

Usually it is better to start with the actual subject and scene, then layer style language once the visual anchor is clear.

When are reference images most useful?

They are most useful when you care about composition, pose, mood, or visual direction that is hard to describe precisely with text alone.

Related resources

Text to Video vs Image to Video

Compare text-to-video and image-to-video workflows so you can choose the right Delphin-style process for your next project.

Step-by-step workflow

1. Decide whether you already have a visual anchor

If you already have a still image or product render, image-to-video often gives better continuity than text-only prompting.

2. Choose based on creative uncertainty

If the concept is still abstract and you need ideation, text-to-video is often the better starting point.

3. Match the workflow to the deliverable

Pick the method that best fits the asset you already have and the amount of control you need over composition or motion.

When text-to-video is the better choice

Text-to-video works best when you are still writing the scene into existence. It is ideal for rough ideation, storyboard drafting, and translating scripts or marketing copy into visual moments.

When image-to-video is the better choice

Image-to-video is stronger when you already know what the frame should look like. It tends to preserve brand direction and composition more consistently because the model starts from a real visual anchor.

FAQ

Which workflow gives more visual control?

Image-to-video usually offers more visual continuity when you already have a strong still reference. Text-to-video gives more freedom when the scene is still being invented.

Which workflow is better for prompt writing practice?

Text-to-video is the better environment for improving prompt-writing skill because the written description has to carry more of the scene.

Related resources

How to Make Product Demo Videos with AI

Learn how to turn product images, scripts, and features into clear demo videos using a Delphin-style AI workflow.

Step-by-step workflow

1. Map the product story

Decide what the viewer needs to understand first, second, and third so the demo follows a clear sequence rather than a random feature list.

2. Collect the right inputs

Use screenshots, product renders, campaign stills, and short text scenes to give the model enough material for accurate visual direction.

3. Convert features into visual moments

Write prompts and transitions around actions, user benefits, and interface focus instead of only listing technical capabilities.

Why product demos benefit from AI workflows

Traditional product demos can be slow to storyboard and expensive to revise. AI workflows help teams test angles quickly, especially when the story depends on product screens, concept renders, or feature mockups.

What makes a product demo prompt effective

The prompt should describe what part of the product the viewer is noticing, what changes on screen, and what emotional tone the demo should communicate. Clarity beats hype in this context.

FAQ

Should a product demo start with text-to-video or image-to-video?

If you already have product visuals, image-to-video often creates a smoother starting point. If you are still shaping the narrative, text-to-video can help you plan the story first.

What assets help product demo generation the most?

Screenshots, polished product stills, UI mockups, and concise scene prompts are usually the most useful assets.

Related resources