A General Reasoning Modular Core for Image Generation
Unified Thinker bridges the reasoning-execution gap with modular upgrades.
Figure 1: Think-then-Execute. Structured planning for precise synthesis.
We decouple the reasoning process into a dedicated Thinker module. This allows the model to generate structured plans before the Generator begins the pixel-level synthesis.
@misc{zhou2026unifiedthinkergeneralreasoning,
title={Unified Thinker: A General Reasoning Modular Core for Image Generation},
author={Sashuai Zhou and Qiang Zhou and Jijin Hu and Hanqing Yang and Yue Cao and Junpeng Ma and Yinchao Ma and Jun Song and Tiezheng Ge and Cheng Yu and Bo Zheng and Zhou Zhao},
year={2026},
eprint={2601.03127},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2601.03127},
}