[Feature] Nucleus-Image MoE 17B-2B #1421

@JohnLoveJoy

Description

Feature Summary

Sparse MoE efficiency: 17B total parameters with only ~2B active per forward pass, enabling high-quality generation at a fraction of the inference cost of a comparably sized dense model.
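
As a quick illustration of where that efficiency comes from, below is a minimal PyTorch sketch of a top-k routed MoE feed-forward layer: each token is dispatched to only a few of the layer's experts, so most parameters sit idle on any given forward pass. The class name, dimensions, and top_k value are illustrative assumptions, not Nucleus-Image's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoELayer(nn.Module):
    """Illustrative top-k routed mixture-of-experts feed-forward layer.

    Each token is routed to top_k of num_experts expert MLPs, so only a
    small fraction of the layer's parameters is used per token.
    """

    def __init__(self, d_model: int, d_ff: int, num_experts: int = 64, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model). Score every expert, keep the top_k per token.
        gates = F.softmax(self.router(x), dim=-1)         # (tokens, num_experts)
        top_w, top_idx = gates.topk(self.top_k, dim=-1)   # (tokens, top_k)
        top_w = top_w / top_w.sum(dim=-1, keepdim=True)   # renormalize kept gates

        out = torch.zeros_like(x)
        for k in range(self.top_k):
            # Run each selected expert only on the tokens routed to it.
            for e in top_idx[:, k].unique().tolist():
                mask = top_idx[:, k] == e
                out[mask] += top_w[mask, k].unsqueeze(-1) * self.experts[e](x[mask])
        return out

layer = SparseMoELayer(d_model=512, d_ff=2048, num_experts=64, top_k=2)
print(layer(torch.randn(16, 512)).shape)  # torch.Size([16, 512])
```

With 64 routed experts per layer (per the description below) and an assumed top_k of 2, each token touches roughly 2/64 of a layer's expert parameters, which is the mechanism behind a 17B-total / ~2B-active split.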

Detailed Description

https://huggingface.co/NucleusAI/Nucleus-Image
https://github.com/WithNucleusAI/Nucleus-Image
https://storage.googleapis.com/nucleus_image_v1/Nucleus-Image-Technical-Report.pdf

"Nucleus-Image is a text-to-image generation model built on a sparse mixture-of-experts (MoE) diffusion transformer architecture. It scales to 17B total parameters across 64 routed experts per layer while activating only ~2B parameters per forward pass, establishing a new Pareto frontier in quality-versus-efficiency. Nucleus-Image matches or exceeds leading models including Qwen-Image, GPT Image 1, Seedream 3.0, and Imagen4 on GenEval, DPG-Bench, and OneIG-Bench. This is a base model released without any post-training optimization (no DPO, no reinforcement learning, no human preference tuning). All reported results reflect pre-training performance only. We release the full model weights, training code, and dataset, making Nucleus-Image the first fully open-source MoE diffusion model at this quality tier."

Alternatives you considered

No response

Additional context

No response

Labels: enhancement (New feature or request)
