OmniGen2 Released: Unified Image Understanding & Generation Model

The VectorSpaceLab team has officially released OmniGen2, a multimodal image generation model. Unlike its predecessor, OmniGen v1, OmniGen2 uses two separate decoding pathways for the text and image modalities, with unshared parameters and a decoupled image tokenizer. According to the team, this design yields significant performance improvements in image editing.

OmniGen2 has four core capabilities and is particularly strong at image editing:

Natural Language Instruction-Guided Image Editing

Text-to-Image Generation

In-Context Generation

Visual Understanding