brendt.wohlberg.net
HomePublications
› Publications
› Software

Cite Details

Weijie Gan, Qiuchen Zhai, Michael T. McCann, Cristina Garcia-Cardona, Ulugbek S. Kamilov and Brendt Wohlberg, "PtychoDV: Vision Transformer-Based Deep Unrolling Network for Ptychographic Image Reconstruction", IEEE Open Journal of Signal Processing, vol. 5, doi:10.1109/OJSP.2024.3375276, pp. 539--547, Mar 2024

Abstract

Ptychography is an imaging technique that captures multiple overlapping snapshots of a sample, illuminated coherently by a moving localized probe. The image recovery from ptychographic data is generally achieved via an iterative algorithm that solves a nonlinear phase retrieval problem derived from measured diffraction patterns. However, these iterative approaches have high computational cost. In this paper, we introduce PtychoDV, a novel deep model-based network designed for efficient, high- quality ptychographic image reconstruction. PtychoDV comprises a vision transformer that generates an initial image from the set of raw measurements, taking into consideration their mutual correlations. This is followed by a deep unrolling network that refines the initial image using learnable convolutional priors and the ptychography measurement model. Experimental results on simulated data demonstrate that PtychoDV is capable of outperforming existing deep learning methods for this problem, and significantly reduces computational cost compared to iterative methodologies, while maintaining competitive performance.

BibTeX Entry

@article{gan-2024-ptychodv,
author = {Weijie Gan and Qiuchen Zhai and Michael T. McCann and Cristina Garcia-Cardona and Ulugbek S. Kamilov and Brendt Wohlberg},
title = {{PtychoDV}: Vision Transformer-Based Deep Unrolling Network for Ptychographic Image Reconstruction},
year = {2024},
month = Mar,
journal = {IEEE Open Journal of Signal Processing},
volume = {5},
doi = {10.1109/OJSP.2024.3375276},
pages = {539--547}
}