Version 1 series are incremental. Major versions changes are their own entities in this sequence. For instance, v1 was an invisible device, a vortex of sorts and v2 is through a hand-held cell phone. V2 is unlikely to produce any of the specific content available in v1 series.
The v2 download file includes a 3, which is the major training rendition, and the reason this doesn't match the version is because the training rendition 2 was useless, it produced nothing usable after 20k steps. I had to manually generate new images, again, what were not too disparate from each other in order for it to converse. These manually created images took me 3 days to compose and was a serious PITA, using Illustrious, Z-Image, Qwen edit and Flux 2 Klein in order to help me composite something useful.
