Skip to content
HN On Hacker News ↗

Efficient and Training-Free Single-Image Diffusion Models

▲ 52 points 0 comments by yorwba 2w ago HN discussion ↗

Pangram verdict · v3.3

We believe that this document is fully human-written

0 %

AI likelihood · overall

Human
100% human-written 0% AI-generated
SEGMENTS · HUMAN 1 of 1
SEGMENTS · AI 0 of 1
WORD COUNT 246
PEAK AI % 0% · §1
Analyzed
Jun 7
backend: pangram/v3.3
Segments scanned
1 windows
avg 246 words each
Distribution
100 / 0%
human / AI fraction
Verdict
Human
Pangram v3.3

Article text · 246 words · 1 segments analyzed

Human AI-generated
§1 Human · 0%

View PDF HTML (experimental) Abstract:We consider the problem of generating images whose internal structure -- defined by the distribution of patches across multiple scales -- matches that of a single reference image. Recent approaches address this problem by training a diffusion model on a single image. But even in this setting, training is computationally expensive and requires hours of optimization. Instead, we model the image using a dataset of its patches at different scales. As this dataset is finite and the dimensionality of its patches is small, the score function for a noisy patch can be computed tractably using an optimal, closed-form denoiser, eliminating the need for neural network training. We integrate this patch-based denoiser into an efficient, training-free image diffusion model, and we describe how our method connects to classical patch-based image restoration techniques. Our approach achieves state-of-the-art generation quality and diversity compared to trained single-image diffusion models, and we demonstrate applications, including unconditional image generation, text-guided stylization, image symmetrization, and retargeting. Further, we show that our approach is compatible with latent space diffusion, and we show multiple additional acceleration techniques to achieve megapixel single-image generation in one second, and gigapixel generation in minutes.

Comments: CVPR 2026; Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG) Cite as: arXiv:2606.04299 [cs.CV]   (or arXiv:2606.04299v1 [cs.CV] for this version)   https://doi.org/10.48550/arXiv.2606.04299 arXiv-issued DOI via DataCite (pending registration) Submission history From: Haojun Qiu [view email] [v1] Wed, 3 Jun 2026 00:05:36 UTC (45,344 KB)