
Sorry, I Want Precise Image Generation, Not a Random Gacha Pull

Posted by qimuai in AI tools


Tips for getting the best image generation and editing in the Gemini app

Source: https://blog.google/products/gemini/image-generation-prompting-tips/


General summary

Gemini now offers state-of-the-art image generation and editing within the Gemini app, AI Studio, and Vertex AI. You can use specific prompts to achieve consistent characters, precise edits, and blended images. Try using subject, composition, action, location, style, and editing instructions in your prompts to get the best results.

Summaries were generated by Google AI. Generative AI is experimental.

Our latest update to image generation and editing in Gemini includes advancements in character consistency, conversational editing and more. Here are some tips to get the most out of it.


Earlier today, we launched a state-of-the-art image generation and editing model, available in the Gemini app, AI Studio and Vertex AI. This update introduces significant advancements in character consistency; precise, conversational editing; and the ability to combine photos into a completely new creation. To help you get the most out of this update, here are some tips for writing more effective prompts for image generation and editing in Gemini.

Key capabilities of image generation in Gemini

Before you dive in, it’s helpful to familiarize yourself with what’s been improved in Gemini, so you can consider which use cases to try with it:

  1. Consistent character design. Preserve a character or object's appearance across multiple generations and edits.
  2. Creative composition. Blend disparate elements, subjects and styles from multiple concepts into a single, unified image.
  3. Local edits. Make precise edits to specific parts of an image using simple language.
  4. Design and appearance adaptation. Apply a style, texture or design from one concept to another.
  5. Logic and reasoning. Use real-world understanding to generate complex scenes or predict the next step in a sequence.

6 elements of constructing effective prompts

You can get great results with Gemini from simple one- or two-sentence inputs. However, to achieve the best results and unlock more nuanced creative control, consider including the following six elements in your prompt: subject, composition, action, location, style and editing instructions.
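The six elements named in the summary (subject, composition, action, location, style and editing instructions) can be kept explicit by assembling a prompt from named fields. The sketch below is only an illustration: `ImagePrompt` and `build` are hypothetical names, not part of any Gemini API, and the comma-joined output format is an assumption rather than a recommended template.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class ImagePrompt:
    """Hypothetical container for the six prompt elements."""

    subject: str
    composition: Optional[str] = None
    action: Optional[str] = None
    location: Optional[str] = None
    style: Optional[str] = None
    editing_instructions: Optional[str] = None

    def build(self) -> str:
        """Join the non-empty elements, in field order, into one prompt string."""
        if not self.subject.strip():
            raise ValueError("subject is required")
        values = (getattr(self, f.name) for f in fields(self))
        parts = [v.strip().rstrip(".") for v in values if v and v.strip()]
        return ", ".join(parts) + "."


prompt = ImagePrompt(
    subject="A red fox",
    composition="wide shot",
    action="leaping over a fallen log",
    location="in a snowy forest",
    style="watercolor illustration",
).build()
print(prompt)
```

Elements left as `None` are simply skipped, so the same helper covers both a short one-line prompt and a fully specified one.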

Prompting examples: A showcase of creative techniques

Different prompting strategies can unlock everything from photorealistic edits to fantastical new worlds. Here are five techniques to try, each with a key example.

1. Preserve characters’ appearances.

Gemini can maintain the likeness of a person or character across different poses, lighting and environments, and even apply the same character to new styles and surfaces. Here’s an example of how one character can be used across multiple prompts in the same session:

[Image: example of one character reused across multiple prompts in the same session]

By establishing a clearly defined character with specific details in the first prompt, you can use follow-up prompts to place that same character in entirely new contexts. Here, Gemini preserves key features of the character like facial features, distinctive appearance and clothing.
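The pattern described here, a detailed first prompt followed by follow-ups that reuse the character, can be sketched as a list of prompts for a single session. This is a minimal illustration only: `character_session` is a hypothetical helper, the prompt wording is an assumption, and nothing is sent to Gemini.

```python
from typing import List

# Assumed reminder text, based on the features the article says are preserved.
KEEP = "Keep the facial features, distinctive appearance and clothing the same."


def character_session(character: str, contexts: List[str]) -> List[str]:
    """Return one defining prompt, then one follow-up prompt per new context."""
    prompts = [f"Create an image of {character}."]
    prompts += [f"Show the same character {context}. {KEEP}" for context in contexts]
    return prompts


prompts = character_session(
    "a young woman with short silver hair, round glasses and a yellow raincoat",
    ["riding a bicycle through a market", "as a claymation figure on a desk"],
)
for p in prompts:
    print(p)
```

The follow-ups deliberately say "the same character" instead of repeating the description, since the article's technique relies on staying within one session.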

2. Make targeted transformations with precision.

With updated image editing capabilities, you can make quick, highly precise edits to your photos. This is perfect for everything from product mockups to perfecting personal pictures. Here’s an example:

[Image: example of a local edit made with a conversational command]

This showcases Gemini’s strength in local edits. By using direct, conversational commands, you can modify specific elements within the image without needing complex software or re-generating the entire scene.
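A direct edit command can be phrased as "what to change" plus "what to leave alone". The helper below is a hypothetical sketch of that phrasing. `local_edit` is not a Gemini function, and the closing "keep everything else unchanged" sentence is an assumption about wording, not something the article prescribes.

```python
def local_edit(target: str, change: str, keep_rest: bool = True) -> str:
    """Build a conversational instruction that modifies one element of an image."""
    if not target.strip() or not change.strip():
        raise ValueError("both target and change are required")
    instruction = f"Change {target.strip()} to {change.strip()}."
    if keep_rest:
        # Assumed phrasing that asks for the rest of the scene to stay as it is.
        instruction += " Keep everything else in the image unchanged."
    return instruction


print(local_edit("the sofa", "a dark green velvet sofa"))
```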

3. Blend concepts with creative composition.

Try fusing two or more ideas into a single, striking image. Prompt Gemini to create two images, and then combine their subjects and environments in imaginative ways:

[Image: example of two generated images blended into one composition]

4. Adapt and apply new styles.

Completely change the mood and aesthetic of an image by applying a new style, color palette or texture, all while keeping the original subject intact.

[Image: example of a motorcycle re-rendered in a new artistic style]

With “style transfer,” Gemini understands the core subject (the motorcycle) and its form, then re-renders it entirely in the requested artistic style. This can be used for design inspiration, artistic exploration, and more.
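A style-transfer request names the subject to keep and at least one of the three things the article says can change: style, color palette or texture. The sketch below assumes a hypothetical `restyle` helper and illustrative wording. It only builds the prompt text.

```python
from typing import Optional


def restyle(
    subject: str,
    style: Optional[str] = None,
    palette: Optional[str] = None,
    texture: Optional[str] = None,
) -> str:
    """Build a prompt that changes the look of an image but keeps its subject."""
    changes = []
    if style:
        changes.append(f"in the style of {style}")
    if palette:
        changes.append(f"with a {palette} color palette")
    if texture:
        changes.append(f"with a {texture} texture")
    if not changes:
        raise ValueError("give at least one of style, palette or texture")
    return f"Re-render {subject} " + ", ".join(changes) + ". Keep the subject and its form intact."


print(restyle("the motorcycle", style="a woodblock print", palette="muted teal"))
```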

5. Use logic and reasoning for complex generation.

Give Gemini a simple concept and let its reasoning capabilities build out the details. This is useful for creating content that requires an understanding of real-world relationships or processes.

[Image: example of a person carefully balancing a cake, followed by a generated image of what happens after tripping]

The model can use its logic and reasoning capabilities to predict what comes next. It understands the context and physics of the first image — a person carefully balancing a cake — and can then simulate the plausible consequences of an action like tripping, resulting in a dynamic and context-aware new image.

A note on current limitations

As we continue to develop and fine-tune our models, there are still areas in need of improvement:

We’re actively working to improve these areas and appreciate your creativity as we build the next generation of image tools together.

The creative possibilities are ripe for your picking — we can’t wait to see what you come up with!

With special thanks to the Greenfield team of senior staff generative engineers for their creative contributions.
