UEval: A Benchmark for Unified Multimodal Generation

Bo Li, Yida Yin, Wenhao Chai, Xingyu Fu*, Zhuang Liu*

Princeton University

(* indicates co-advising)

What is UEval?

UEval comprises 1,000 expert-curated prompts that require both images and text in the model outputs, sourced from 8 diverse real-world domains.

teaser

Full-Leaderboard

view the full leaderboard ↗

view UEval problems

submit your results

Submit your results by opening an issue in our GitHub.

BibTeX

@article{xxx,
    title    = {UEval: A Benchmark for Unified Multimodal Generation},
    author   = {xx},
    year     = {2025},
    journal  = {}
}

Website template modified from https://www.tbench.ai/.