OpenAI’s new AI image generator pushes the limits in detail and prompt fidelity

Voyager · 1 year ago

OpenAI’s new AI image generator pushes the limits in detail and prompt fidelity

@[email protected] · 1 year ago

I wish more people realised this. It’s much harder to create very specific images with the current image generation tools than most people seem to think, which is creating an inaccurate view of the technology in the public eye.

The generator will create something inspired by the prompt it is given, but it can be very hard to make it match the output the prompt writer imagines when writing the prompt. There are various tools that can refine and narrow the generator’s output, to try and control things like posing, composition, style etc and to redraw details. But even then it’s often pot luck as to the output. The generated images aren’t necessarily bad, just not what was wanted.

I think the comparison to stock photo images is apt, current image generators are great for creating themed but somewhat generic images. The tools are going to continue to advance, and they are useful in for some applications already. But they are still a long way off from truly replacing human artistry.

@[email protected] · 1 year ago

The crux with that argument is that the artists is the only one that cares about specific output, meanwhile the art consumer doesn’t. When somebody plays a game or watch a movie, they don’t know what to expect, that’s part of the fun, they just care about it being good. So as long as the output is good enough for the consumer, whatever the artists thinks about it, really doesn’t matter, assuming they still have a job to begin with.