TY - JOUR
T1 - Zooming Into Clarity
T2 - Image Denoising Through Innovative Autoencoder Architectures
AU - Mohammadi, Khatereh
AU - Islam, Ashhadul
AU - Belhaouari, Samir Brahim
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024/7/8
Y1 - 2024/7/8
N2 - In today's era of increasing data complexity and pervasive noise, robust techniques for data processing, reconstruction, and denoising are crucial. Autoencoders, known for their adaptability in unsupervised learning, offer a strategic solution to these challenges. This research introduces two novel autoencoder architectures, ResoFocus and FragmentumZoom, complemented by a refined loss function. The ResoFocus Autoencoder is designed to optimize and denoise upsized images, while the FragmentumZoom Autoencoder targets small image segments to enhance denoising tasks. Empirical evaluations on diverse datasets, including CelebA and Autism Face data, demonstrate significant improvements in the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM). For the CelebA dataset, the ResoFocus 4X autoencoder achieved PSNR values of 31.578, with a peak increase of 1.9 dB compared to the baseline Simple autoencoder and SSIM values of 0.898 at 1000 epochs. In the Autism Face dataset, the FragmentumZoom 4X autoencoder recorded PSNR values of 30.951, with an increase of 1.289 dB over the baseline, and SSIM values of 0.906. These results underscore the efficacy of these novel architectures in improving image quality and advancing image processing techniques, particularly in scenarios requiring heightened resolution and noise reduction.
AB - In today's era of increasing data complexity and pervasive noise, robust techniques for data processing, reconstruction, and denoising are crucial. Autoencoders, known for their adaptability in unsupervised learning, offer a strategic solution to these challenges. This research introduces two novel autoencoder architectures, ResoFocus and FragmentumZoom, complemented by a refined loss function. The ResoFocus Autoencoder is designed to optimize and denoise upsized images, while the FragmentumZoom Autoencoder targets small image segments to enhance denoising tasks. Empirical evaluations on diverse datasets, including CelebA and Autism Face data, demonstrate significant improvements in the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM). For the CelebA dataset, the ResoFocus 4X autoencoder achieved PSNR values of 31.578, with a peak increase of 1.9 dB compared to the baseline Simple autoencoder and SSIM values of 0.898 at 1000 epochs. In the Autism Face dataset, the FragmentumZoom 4X autoencoder recorded PSNR values of 30.951, with an increase of 1.289 dB over the baseline, and SSIM values of 0.906. These results underscore the efficacy of these novel architectures in improving image quality and advancing image processing techniques, particularly in scenarios requiring heightened resolution and noise reduction.
KW - Autoencoders
KW - and high-resolution imaging enhancement
KW - architecture
KW - data reconstruction
KW - loss function
KW - noise reduction
UR - http://www.scopus.com/inward/record.url?scp=85198314188&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3424972
DO - 10.1109/ACCESS.2024.3424972
M3 - Article
AN - SCOPUS:85198314188
SN - 2169-3536
VL - 12
SP - 98816
EP - 98834
JO - IEEE Access
JF - IEEE Access
ER -