Logo del repository
  1. Home
 
Opzioni

Deep Learning Based Efficient Single Image Super Resolution

KHAN, ASIF HUSSAIN
2025-07-01
  • doctoral thesis

Abstract
Blind image super-resolution (Blind-SR) involves recovering a high-resolution (HR) image from its low-resolution (LR) counterpart under unknown degradation conditions. Existing approaches often rely on explicit degradation estimators that require ground-truth information about the degradation kernel, which is challenging to obtain in real-world scenarios. Implicit degradation estimators offer an alternative but typically suffer from a performance gap compared to explicit methods, particularly in computational efficiency and accuracy. In our first study, we addressed these challenges by designing a lightweight end-to-end framework for Blind-SR. This method integrates a deep convolutional neural network (CNN)-based Estimator module to implicitly estimate the blur kernel and a super-resolution residual convolutional generative adversarial network (Super Resolver) to reconstruct the HR image. The proposed model employs a novel loss formulation and achieves competitive performance on benchmark datasets, with a computational efficiency advantage—12× fewer parameters compared to state-of-the-art methods—making it suitable for devices with limited computational capacity. Building on this foundation, our second study introduced an enhanced approach to implicit blind-SR by developing a novel loss component that allows the implicit learning of degradation kernels without ground-truth supervision. We also designed a learnable Wiener filter module that efficiently performs deconvolution in the Fourier domain via a closed-form solution and a transformer-based refinement module to reconstruct the final HR image. Our model IDENet achieved significant performance improvements, outperforming existing implicit methods by 3dB PSNR and 8.5% SSIM on average while narrowing the gap with explicit methods to only 0.6dB PSNR and 0.5% SSIM. Remarkably, these results were obtained with 33% and 71% fewer parameters than state-of-the-art implicit and explicit methods, respectively. In our final study, we further refined the implicit blind-SR framework by introducing a degradation-conditioned prompt-learning module. This module leverages the estimated kernel to focus on discriminative contextual features, improving the reconstruction process. Our model, named PL-IDENet, demonstrated significant gains over state-of-the-art methods, achieving more than 0.4dB and 1.3% PSNR and SSIM improvements over the best implicit methods and 1.4dB and 4.8% over the best explicit methods. These results were achieved while maintaining a significantly lower computational complexity, with 25% and 68% fewer parameters than the best implicit and explicit methods, respectively. Together, these studies contribute to the field of blind image super-resolution by offering lightweight, effective, and scalable solutions that bridge the performance gap between implicit and explicit degradation estimators, making them practical for real-world deployment.
Archivio
https://hdl.handle.net/11390/1308666
https://ricerca.unityfvg.it/handle/11390/1308666
Diritti
open access
Soggetti
  • Super Resolution

  • Prompt Learning

  • Implicit Degradation

  • Kernel Estimator

  • Deep Learning

  • Settore INF/01 - Info...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback