*Example image prompts are in InvokeAI format, not A1111 format. If you don't use InvokeAI, don't copy/paste my prompts without first adjusting the syntax.
The Animated version is less heavily anime and thus more general-purpose. The Anime version is more focused on anime, with some extra specialization in chaos and scenery and much less tendency towards the uncanny valley.
USAGE TIPS, BEST PRACTICES, AND SETTINGS (Anime-V2 version):
After more practice with the model, the negative prompt I use most is bad-artist, negative_hand, (low quality, worst quality:1.3) in A1111 syntax, or bad-artist, negative_hand, (low quality, worst quality)1.3 in InvokeAI syntax. It doesn't work perfectly for everything, mind you, though it's a good start. Some prompts work best without negative embeddings and with low quality, worst quality at 1.4 weight instead.
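As a rough illustration of the two syntaxes above, here is a minimal Python sketch that builds the same negative prompt for either UI. The function names (weighted_group, negative_prompt) and the "a1111" / "invokeai" labels are made up for this example and are not part of either UI; the only thing taken from the tips above is where the weight goes relative to the parentheses.

```python
def weighted_group(terms, weight, ui):
    """Wrap a list of terms in one numerically weighted group.

    Hypothetical helper: the placement of the weight follows the two
    examples given in the usage tips, nothing more.
    """
    inner = ", ".join(terms)
    if ui == "a1111":
        # A1111 puts the weight inside the parentheses, after a colon.
        return f"({inner}:{weight})"
    if ui == "invokeai":
        # InvokeAI puts the weight right after the closing parenthesis.
        return f"({inner}){weight}"
    raise ValueError(f"unknown ui: {ui!r}")


def negative_prompt(embeddings, quality_terms, weight, ui):
    """Join embedding names and the weighted quality group with commas."""
    parts = list(embeddings) + [weighted_group(quality_terms, weight, ui)]
    return ", ".join(parts)


embeddings = ["bad-artist", "negative_hand"]
quality = ["low quality", "worst quality"]

print(negative_prompt(embeddings, quality, 1.3, "a1111"))
# bad-artist, negative_hand, (low quality, worst quality:1.3)
print(negative_prompt(embeddings, quality, 1.3, "invokeai"))
# bad-artist, negative_hand, (low quality, worst quality)1.3
```

Passing an empty embeddings list and a weight of 1.4 gives the embedding-free variant mentioned above.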
The recommended negative embeddings are negative_hand, plus one of bad-artist (the base version), <neg-anime>, or <neg-sketch-2>. negative_hand doesn't interfere with the model and has an okay success rate. bad-artist has less effect on model imagination than <neg-anime> or <neg-sketch-2> and provides brighter, more colorful results, though it may not increase detail as much. <neg-anime> and <neg-sketch-2> are the opposite: more impact on imagination, yet more detail and less change in composition. <neg-anime> seems to lean towards darker images in Anime-V2. Luna's KHFB and AuroraNegative are also safe to use, though I don't use them personally.
Other negative embeddings (e.g. bad-artist-anime, EasyNegative, bad_prompt_version2) are generally not recommended, as they impact art style and model expressiveness and have little, if any, actual benefit.
With or without any negative embeddings (negative_hand aside), using (low quality, worst quality) with a 1.3 or 1.4 weight does the job for most prompts if you get any subpar definition or color.
Does not respond very well to LoRAs trained mainly on 2.5D, 3D, or photorealistic-style images, such as dogu_cat's it's Skyler White Yo guy LoRA.
Upscaling via img2img / High-Res Optimization in InvokeAI is ideal, especially at a strength of around 0.47-0.55.
Example images were generated without CLIP skip. CLIP skip 2 has an interesting effect, though it does reduce stylization.
This model uses the Waifu Diffusion 1.4 VAE with the pixel layer modified for -6% contrast. img2img at low strength may reduce quality unless you swap the VAE.
USAGE TIPS, BEST PRACTICES, AND SETTINGS (Animated version):
negative_hand and bad-artist are the only negative embeddings I can comfortably recommend for this model, and even those aren't required. <neg-anime> and <neg-sketch-2> are also great, but tend towards a semi-realistic style in this model, so be wary of the uncanny valley if you use them.
To liven up color and lighting (when not using a negative embedding), I suggest putting desaturated and pixelated in your negative prompt with 2 or 3 units of down-weighting (that means [[[desaturated]]], [[[pixelated]]] in the A1111 UI, or desaturated---, pixelated--- in InvokeAI).
To lean output more towards an anime style rather than semi-realistic, include anime in your prompt. Put realistic in your negative prompt to lean further towards 2D, or near the front of your positive prompt to lean more 3D.
Example images were generated in InvokeAI, so you'll have to use your UI's weighting syntax (which means the + and - in my prompts likely won't do anything for you unless you're using InvokeAI).
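If you want to port prompts by script rather than by hand, the mapping described in these tips (each trailing + becomes one set of parentheses, each trailing - becomes one set of square brackets, and (terms)1.3 becomes (terms:1.3)) can be sketched in Python as below. This is only a sketch under those assumptions: the function names are hypothetical, it ignores every other InvokeAI prompt feature, and the two UIs may not apply exactly the same multiplier per mark, so treat converted prompts as approximate.

```python
import re

# Trailing run of "+" or "-" marks, e.g. "anime++" or "desaturated---".
_SUFFIX = re.compile(r"^(.*?)(\++|-+)$")
# Numeric InvokeAI weight, e.g. "(low quality, worst quality)1.3".
_NUMERIC = re.compile(r"^\((.+)\)(\d+(?:\.\d+)?)$")


def split_top_level(prompt):
    """Split on commas that are not inside parentheses."""
    parts, depth, current = [], 0, []
    for ch in prompt:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth = max(depth - 1, 0)
        if ch == "," and depth == 0:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


def convert_term(term):
    """Convert one InvokeAI-style term to A1111-style weighting."""
    term = term.strip()
    numeric = _NUMERIC.match(term)
    if numeric:
        return f"({numeric.group(1)}:{numeric.group(2)})"
    suffix = _SUFFIX.match(term)
    if suffix and suffix.group(1):
        base, marks = suffix.group(1), suffix.group(2)
        # InvokeAI wraps multi-word phrases in parentheses before the marks.
        if base.startswith("(") and base.endswith(")"):
            base = base[1:-1]
        count = len(marks)
        if marks[0] == "+":
            return "(" * count + base + ")" * count
        return "[" * count + base + "]" * count
    return term  # unweighted terms (including names like bad-artist) pass through


def invoke_to_a1111(prompt):
    return ", ".join(convert_term(t) for t in split_top_level(prompt))


print(invoke_to_a1111("desaturated---, pixelated---"))
# [[[desaturated]]], [[[pixelated]]]
print(invoke_to_a1111("bad-artist, negative_hand, (low quality, worst quality)1.3"))
# bad-artist, negative_hand, (low quality, worst quality:1.3)
```

Hyphens inside a name are left alone because only a run of marks at the very end of a term counts as weighting.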
This model will be great for:
People who want an anime aesthetic that is different from other SD 1.5 models
People who are more distant, casual enjoyers of anime, who might find this model more welcoming than one that leans heavily towards modern anime art only
People who like more classic anime styles, à la Voltron and Ghost in the Shell
People who like generating in 2D and semi-real styles and liked 526Mix, who will probably enjoy the better lighting and touch of wackiness in this version
The "Animated" model is a straightforward mix of 526Mix-V1.3.5 and Nerfgun3's Macaron Mix, the latter having had the noise offset added at a 0.70 multiplier. The merge is a weighted sum with Macaron Mix at a 0.3 multiplier. I always like Nerfgun3's art and embeddings, so I felt I could trust that model to be fairly in line with my own creative desires and expectations.
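For anyone unfamiliar with weighted-sum merging, the idea can be sketched in a few lines of Python. This is a toy illustration, not the actual merge script: real checkpoints hold tensors rather than short lists, the key names below are invented, and reading "0.3 multiplier with Macaron Mix" as 70% of 526Mix plus 30% of Macaron Mix is my assumption. The noise offset step is not modeled.

```python
def weighted_sum(model_a, model_b, alpha):
    """Blend two weight dictionaries as (1 - alpha) * A + alpha * B.

    Keys missing from model_b are copied from model_a unchanged.
    """
    merged = {}
    for key, a_values in model_a.items():
        if key not in model_b:
            merged[key] = list(a_values)
            continue
        b_values = model_b[key]
        merged[key] = [
            (1.0 - alpha) * a + alpha * b
            for a, b in zip(a_values, b_values)
        ]
    return merged


# Toy stand-ins for the two checkpoints (hypothetical keys and values).
base_mix = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
macaron = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}

merged = weighted_sum(base_mix, macaron, alpha=0.3)
print(merged)
```

With alpha=0.3, every merged value sits 30% of the way from the first model's value to the second's.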
As always, I suggest going to the source models for the full experience, and Nerfgun3's Macaron Mix and Newartmodel4 aren't exceptions here.
Example images were generated in InvokeAI with the model converted to Diffusers format, hires fix on (0.45 strength, works like img2img), and the DDIM sampler (unless listed otherwise). This means that unless you use InvokeAI, you likely won't be able to recreate my images exactly. Just learn from the prompts and modify the weighting as needed for the UI you use (if you use the A1111 UI, each plus sign (+) is equal to one set of parentheses).