Amazon's Titan image generation model received an upgrade

Amazon recently shared that the Amazon Titan Image Generator v2 model is generally available on Amazon Bedrock. Titan Image Generator v2 lets users guide generations with reference images, edit existing visual content, remove backgrounds, create variations of an image, and fine-tune the model to preserve brand or subject consistency. Titan v2 keeps its predecessor's features and adds several new ones that make these capabilities possible: image conditioning, color palette guidance, background removal, and subject consistency.

The image conditioning capability lets users provide a reference image and indicate the visual characteristics the model should focus on, including edges, outlines, structural elements, and segmentation maps. The model supports canny edge and segmentation-based image conditioning, which provide different levels of granular control. Canny edge image conditioning extracts the most prominent edges of the picture, so it is better for applications that require preserving the whole structure of an image as a guide for a new generation. Segmentation allows the specification of an area or objects within a picture, thus allowing for finer control of the result.
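As a rough illustration of how a caller might choose between the two conditioning modes, the sketch below builds a JSON request body for an image-conditioned generation. The field names (`taskType`, `textToImageParams`, `conditionImage`, `controlMode`, `controlStrength`) reflect the Bedrock request schema for Titan Image Generator v2 as best understood here and should be checked against the current documentation. The helper name `build_conditioning_request` and the default strength of 0.7 are hypothetical.

```python
import base64
import json

# Control modes as named in the Bedrock request schema (assumed; verify against the docs).
CONTROL_MODES = {"CANNY_EDGE", "SEGMENTATION"}


def build_conditioning_request(prompt, image_bytes,
                               control_mode="CANNY_EDGE",
                               control_strength=0.7):
    """Build a JSON body for an image-conditioned request (hypothetical helper)."""
    if control_mode not in CONTROL_MODES:
        raise ValueError(f"control_mode must be one of {sorted(CONTROL_MODES)}")
    if not 0.0 <= control_strength <= 1.0:
        raise ValueError("control_strength must be between 0 and 1")
    body = {
        "taskType": "TEXT_IMAGE",
        "textToImageParams": {
            "text": prompt,
            # The reference image travels as a base64-encoded string.
            "conditionImage": base64.b64encode(image_bytes).decode("ascii"),
            # CANNY_EDGE follows the overall structure of the reference;
            # SEGMENTATION follows its regions and objects.
            "controlMode": control_mode,
            "controlStrength": control_strength,
        },
        "imageGenerationConfig": {"numberOfImages": 1},
    }
    return json.dumps(body)
```

The resulting string could then be passed as the `body` argument of the boto3 `bedrock-runtime` client's `invoke_model` call, with the model ID assumed to be `amazon.titan-image-generator-v2:0`.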

Color palette conditioning means the model can process text prompts that include a specific color palette provided in hex color codes and generate images color-conditioned on the supplied palette. This enables, for instance, the generation of visual material that consistently follows a brand's color palette guidelines. Titan v2's segmentation capabilities are also behind the model's background removal feature, as these capabilities can intelligently detect objects in the foreground and cleanly isolate them even when they overlap with other objects in the image.
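To make the hex-code palette and the background removal task more concrete, here is a minimal sketch that builds request bodies for both. The task types and field names (`COLOR_GUIDED_GENERATION`, `colorGuidedGenerationParams`, `colors`, `BACKGROUND_REMOVAL`, `backgroundRemovalParams`) follow the Bedrock request schema as best understood here and are worth verifying against the documentation. The helper names and the strict six-digit hex check are assumptions made for illustration.

```python
import base64
import json
import re

# Accept only six-digit hex codes such as "#FF9900" (an illustrative restriction).
HEX_COLOR = re.compile(r"#[0-9a-fA-F]{6}")


def build_color_guided_request(prompt, hex_colors):
    """Build a JSON body for a palette-conditioned generation (hypothetical helper)."""
    invalid = [c for c in hex_colors if not HEX_COLOR.fullmatch(c)]
    if invalid:
        raise ValueError(f"not valid hex color codes: {invalid}")
    body = {
        "taskType": "COLOR_GUIDED_GENERATION",
        "colorGuidedGenerationParams": {
            "text": prompt,
            "colors": list(hex_colors),
        },
        "imageGenerationConfig": {"numberOfImages": 1},
    }
    return json.dumps(body)


def build_background_removal_request(image_bytes):
    """Build a JSON body for a background removal task (hypothetical helper)."""
    body = {
        "taskType": "BACKGROUND_REMOVAL",
        "backgroundRemovalParams": {
            # The input image travels as a base64-encoded string.
            "image": base64.b64encode(image_bytes).decode("ascii"),
        },
    }
    return json.dumps(body)
```

A brand team might, for example, call `build_color_guided_request("a storefront banner", ["#232F3E", "#FF9900"])` so that every generated asset is conditioned on the same two colors.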

With the subject consistency feature, users provide reference images of the target object so Titan v2 can learn its characteristics. Then, once the model is fine-tuned on the subject, users only need to provide a text prompt referencing the target object, and the model will provide generated images containing a consistent depiction of the target, thus unlocking a series of possibilities for marketing, advertising, and visual storytelling.

Amazon Titan Image Generator v2 is the latest in a parade of visual media generation models to emerge over the last several weeks, including Black Forest Labs' FLUX.1, Stable Video 4D, and Hedra's Character-1.
