Tried to generate a realistic-looking Jessica Rabbit. Any ideas on how I could improve this?
Prompt info:
- Positive Prompt:
Jessica Rabbit in a red sequin dress and red high heels sitting facing the camera and slightly leaning back, slim waist, (big boobs:1.1), (deep cleavage:1.5), smiling, god rays, volumetric lighting, photorealistic, 8k, hdr
- Negative prompt:
bad anatomy, bad hands, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, worst face, three crus, extra crus, fused crus, worst feet, three feet, fused feet, fused thigh, three thigh, fused thigh, extra thigh, worst thigh, missing fingers, extra fingers, ugly fingers, long fingers, horn, realistic photo, extra eyes, huge eyes, 2girl, amputation, disconnected limbs
- Steps: 35
- Sampler: DPM++ 2S a Karras
- CFG scale: 7
- Seed: 2155243983
- Size: 768x1024
- Model hash: 255f68ed9a
- Model: dynavisionXLAllInOneStylized_beta0411Bakedvae
- Clip skip: 2
Better than the original.
Thanks!
Nice! Are there any particular things you feel need improving? I’d be pretty proud of myself if I made this. I guess as far as photorealism goes, you’re probably pretty restricted by the model you’re using.
I haven’t really used SDXL much myself. Do you use an interface like ComfyUI?
The face is a little flat and plastic feeling, so it would be nice to get a more natural skin texture. I would also like to bring out a nicer smile and zoom out a bit to show more of her legs without the legs fusing; I was fighting with that a fair amount. A couple I generated with other models had her wearing stiletto heels that made the whole figure gorgeous, but there were some other parts that broke the realism.
As far as tooling goes, I’ve not had a chance to use ComfyUI yet, and just used AUTOMATIC1111 on my M2 MacBook Pro.
“The face is a little flat and plastic feeling”
It’s worth considering switching models - dynavision tends to go that way with faces. You might also have some luck with prompt engineering, e.g. removing ‘photorealistic’. It sounds counterintuitive, but I’ve definitely seen that term make things look more “CGI”, which makes some sense: you don’t normally call a real photo “photorealistic”. I sometimes try things like “portrait photography” instead. For better skin texture, adding a refiner sometimes helps as well.
I also noticed you listed your resolution as 768x1024. That’s not one of the “recommended” SDXL resolutions ( https://stablediffusionxl.com/sdxl-resolutions-and-aspect-ratios/ ); you might have better luck with 896x1152, the portrait version of 1152x896. It’s not likely to make a substantial difference to the results you’ve gotten so far, though.
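To make the resolution suggestion concrete, here is a minimal sketch that picks the closest entry from a list of SDXL resolutions by aspect ratio. The list below is the commonly cited set and is an assumption on my part (check it against the linked page); `nearest_sdxl_resolution` is a hypothetical helper, not part of any tool.

```python
# Commonly cited SDXL resolutions as (width, height).
# Assumption: verify this list against the page linked above.
SDXL_RESOLUTIONS = [
    (1024, 1024),
    (1152, 896), (896, 1152),
    (1216, 832), (832, 1216),
    (1344, 768), (768, 1344),
    (1536, 640), (640, 1536),
]


def nearest_sdxl_resolution(width, height):
    """Return the listed resolution whose aspect ratio is closest to width/height."""
    target = width / height
    return min(SDXL_RESOLUTIONS, key=lambda wh: abs(wh[0] / wh[1] - target))


# The portrait size from the settings in this thread.
print(nearest_sdxl_resolution(768, 1024))  # (896, 1152)
```

For the 768x1024 portrait size used here, the closest listed size is 896x1152, which is 1152x896 turned on its side.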
The real win with what you have so far will probably be changing techniques. A common one is to get your image as close as possible with txt2img and then, to get a better face, send the image to img2img inpainting. Paint out the face and you can get much more specific with the facial generation on top of the previous image, changing models, LoRAs, etc. if needed.
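If you would rather script the inpainting step than click through the UI, something like the following might work as a starting point. It only builds the request body; the field names are the ones I believe the AUTOMATIC1111 img2img API accepts, so treat them as assumptions and check the `/docs` page of your own install. `build_inpaint_payload` and its default values are hypothetical.

```python
import base64


def build_inpaint_payload(image_bytes, mask_bytes, prompt, negative_prompt="",
                          denoising_strength=0.4, seed=-1):
    """Build a JSON-serializable request body for repainting a masked face."""
    def b64(data):
        return base64.b64encode(data).decode("ascii")

    return {
        "init_images": [b64(image_bytes)],   # the txt2img result
        "mask": b64(mask_bytes),             # white = repaint, black = keep
        "prompt": prompt,                    # face-specific prompt
        "negative_prompt": negative_prompt,
        "denoising_strength": denoising_strength,  # lower stays closer to the original
        "inpainting_fill": 1,                # assumed: 1 = start from original pixels
        "inpaint_full_res": True,            # assumed: the "only masked" mode
        "inpaint_full_res_padding": 32,
        "mask_blur": 4,
        "steps": 35,
        "cfg_scale": 7,
        "seed": seed,
    }


payload = build_inpaint_payload(b"<png bytes>", b"<mask bytes>",
                                "portrait photography, natural skin texture")
print(sorted(payload))
```

With the web UI started with the `--api` flag, the body would be sent as JSON in a POST to `/sdapi/v1/img2img` on the local server. Switching the model for the face pass is a separate setting and is not shown here.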
You say you were going for a realistic-looking Jessica Rabbit, but you’ve got “realistic photo” as a negative prompt. Is there a reason for that?
Seems they didn’t want photorealism, but wanted something more realistic than a flat 2D drawing.
Not really. I was looking for something like a real person, even though her proportions (like her cartoon form) would be largely impossible. Just trying to have some fun with what I can do in SD.
Actually, you have a good point. I copied the list of negative prompts from somewhere and didn’t go through the list to remove that. Tried again and got marginally better results, even when adding a couple of additional prompts and a LoRA:
- Positive:
Jessica Rabbit in a red sequin dress, sitting facing the camera and slightly leaning back, slim waist, (big boobs:1.1), (deep cleavage:1.5), smiling, perfect skin texture, god rays, volumetric lighting, (photorealistic:1.5), 8k, hdr,
- Negative:
bad anatomy, bad hands, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, worst face, three crus, extra crus, fused crus, worst feet, three feet, fused feet, fused thigh, three thigh, fused thigh, extra thigh, worst thigh, missing fingers, extra fingers, ugly fingers, long fingers, horn, extra eyes, huge eyes, 2girl, amputation, disconnected limbs
- Steps: 35
- Sampler: DPM++ 2S a Karras
- CFG scale: 7
- Seed: 2155243983
- Size: 768x1024
- Model hash: 255f68ed9a
- Model: dynavisionXLAllInOneStylized_beta0411Bakedvae
- Clip skip: 2
- Lora hashes: “add-detail-xl: 9c783c8ce46c”
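Since the negative prompt was copied from elsewhere, a quick script can help audit it. This is a minimal sketch in plain Python that splits a comma-separated prompt, drops repeated terms (the list above has “fused thigh” twice), and removes terms that work against the goal. `clean_terms` and `drop_terms` are hypothetical helpers, and the short prompt is an excerpt for illustration.

```python
def clean_terms(prompt):
    """Split a comma-separated prompt, dropping blanks and repeats but keeping order."""
    seen, terms = set(), []
    for raw in prompt.split(","):
        term = raw.strip()
        key = term.lower()
        if term and key not in seen:
            seen.add(key)
            terms.append(term)
    return terms


def drop_terms(terms, unwanted):
    """Remove terms that contradict what the positive prompt is asking for."""
    unwanted = {u.lower() for u in unwanted}
    return [t for t in terms if t.lower() not in unwanted]


# Excerpt of the copied negative prompt, including the duplicate and the conflict.
negative = "bad hands, fused thigh, three thigh, fused thigh, realistic photo, extra eyes"
cleaned = drop_terms(clean_terms(negative), ["realistic photo"])
print(", ".join(cleaned))
# bad hands, fused thigh, three thigh, extra eyes
```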
Looks pretty good. I feel like one of JR’s distinctive characteristics is more prominent eye shadow; this eye make-up looks too light to be JR.
True, there is a bit of the “smokey eye shadow” going on and - I’ll admit - I didn’t notice it the first time. I’ll have to see what I can do with trying to get it to generate an image with the purple shadow she had in the movie. I’ve tried for the last few minutes but it seems to throw other things off.
Ok, I think this one looks a little better. The eye shadow isn’t super vibrant, but it’s close.
- Positive:
Jessica Rabbit in a red sequin dress, sitting facing the camera and slightly leaning back, slim waist, (big boobs:1.5), (deep cleavage:1.5), smiling, (vibrant purple eye shadow), perfect skin texture, white chair, god rays, volumetric lighting, (photorealistic:1.5), 8k, hdr,
- Negative:
bad anatomy, bad hands, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, worst face, three crus, extra crus, fused crus, worst feet, three feet, fused feet, fused thigh, three thigh, fused thigh, extra thigh, worst thigh, missing fingers, extra fingers, ugly fingers, long fingers, horn, extra eyes, huge eyes, 2girl, amputation, disconnected limbs
- Steps: 35
- Sampler: DPM++ 2S a Karras
- CFG scale: 7
- Seed: 2155243983
- Size: 768x1024
- Model hash: 255f68ed9a
- Model: dynavisionXLAllInOneStylized_beta0411Bakedvae
- Clip skip: 2
- Lora hashes: “add-detail-xl: 9c783c8ce46c”
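The prompts in this thread use AUTOMATIC1111’s attention syntax, where `(text:1.5)` sets an explicit weight and bare `(text)` multiplies attention by 1.1. As a rough aid for seeing which terms are competing, here is a minimal sketch that lists each term with its weight. It only handles the simple, non-nested forms used above, and `term_weights` is a hypothetical helper rather than the web UI’s own parser.

```python
import re

# Matches "(text:1.5)" or "(text)". Nested or escaped parentheses are not handled.
_WEIGHTED = re.compile(r"^\((?P<text>[^():]+)(?::(?P<weight>\d+(?:\.\d+)?))?\)$")


def term_weights(prompt):
    """Return (term, weight) pairs for a comma-separated prompt."""
    result = []
    for raw in prompt.split(","):
        term = raw.strip()
        if not term:
            continue
        match = _WEIGHTED.match(term)
        if match is None:
            result.append((term, 1.0))        # plain term, default weight
        elif match.group("weight") is None:
            result.append((match.group("text").strip(), 1.1))  # bare parentheses
        else:
            result.append((match.group("text").strip(), float(match.group("weight"))))
    return result


# Excerpt of the positive prompt above.
example = "smiling, (vibrant purple eye shadow), (photorealistic:1.5), 8k, hdr,"
for text, weight in term_weights(example):
    print(weight, text)
```

In the last prompt the eye shadow sits at 1.1 while other terms sit at 1.5, which may be part of why it comes out less vibrant than intended.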