Those that only change the color of the animal, such as “animal colored pink,” are less reliable, but show that DALL♾ is sometimes capable of segmenting the animal from the background. Other transformations, such as “animal with sunglasses” and “animal wearing a bow tie,” require placing the accessory on the correct part of the animal’s body. This works less reliably, and for several of the photos, DALL♾ only generates plausible completions in one or two instances. The transformation “animal in extreme close-up view” requires DALL♾ to recognize the breed of the animal in the photo, and render it up close with the appropriate details.
The most straightforward ones, such as “photo colored pink” and “photo reflected upside-down,” also tend to be the most reliable, although the photo is often not copied or reflected exactly. We find that DALL♾ is able to apply several kinds of image transformations to photos of animals, with varying degrees of reliability.