Date of Award

2023

Document Type

Thesis

Publisher

Edith Cowan University

Degree Name

Doctor of Philosophy

School

School of Science

First Supervisor

Syed Mohammed Shamsul Islam

Second Supervisor

Syed Afaq Ali Shah

Third Supervisor

Asim Iqbal

Fourth Supervisor

Adel Al-Jumaily

Abstract

Advances in machine learning and artificial intelligence are drastically improving our ability to solve tasks ranging from very easy to extremely complex using computational models. Although these models perform very well on the data distribution used for training, their performance tends to degrade when they are presented, during inference, with data drawn from a different distribution. Data bias, within-domain and out-of-domain deviation, and overfitting to specific data are some of the main challenges for learning based models. These challenges are prevalent in imaging datasets, where different image modalities are acquired from a variety of imaging sensors, each with a different intensity distribution. Analysing such imaging data becomes more challenging when the data is multi-vendor and collected at multiple sites, at different time points, and under different protocols. For example, medical data from different sites, natural images under different environmental conditions (e.g. day and night, sunny and cloudy), and data acquired through different imaging modalities are all challenging for learning based models. In contrast, biological brains are much better at handling such unseen circumstances. By taking advantage of the current understanding of biological structures and their functionality, we can aim to improve existing methods and make them more robust against unseen variations.

In this thesis, we investigate the effect of different types of domain shift on deep learning based methods. We analyse the performance of deep learning based models on several computer vision tasks, i.e. image registration, image classification and image segmentation. The aim is to thoroughly probe the limitations of deep learning based models and to investigate how learning based models can be made robust against domain shifts. Our focus is on the specific case in which learning based methods have no access to the possible domain shifts, since in practice it is not possible to know all possible variations in the data during the training phase.

To address the issue of different intensity distributions (within-domain shifts) in medical image data under the image registration paradigm, we investigated the effects of introducing perceptual and structural losses, in comparison with mean square and cross-correlation-based losses, in the training of deep learning based registration models. Image registration is an important computer vision technique that can also be used to precisely monitor disease progression and to analyse large-scale datasets in a high-throughput manner. Deep learning based image registration methods are mainly inspired by optical flow-based backbone architectures with the addition of spatial transformer networks. The optical flow algorithm assumes a certain constraint on the pixel values in consecutive frames. We argue that this assumption is violated when the intensity distributions differ, which in turn affects the performance of registration methods. We addressed the specific case of non-rigid registration of brain MRI images. By adding perceptual and structural losses, we observe that the models become more robust to changes in intensity.
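The sensitivity of intensity-based similarity terms described above can be illustrated with a minimal sketch. It is not the loss formulation used in the thesis; the function names `mse` and `global_ncc`, the synthetic image, and the gain and offset values are assumptions chosen for illustration. The sketch compares a fixed image with a perfectly aligned copy whose intensities have been linearly remapped: the mean squared error reports a mismatch even though no spatial misalignment exists, whereas a global normalised cross-correlation does not.

```python
import numpy as np

def mse(a, b):
    """Mean squared intensity difference between two images."""
    return float(np.mean((a - b) ** 2))

def global_ncc(a, b, eps=1e-8):
    """Global normalised cross-correlation (1.0 means perfectly correlated)."""
    a0 = a - a.mean()
    b0 = b - b.mean()
    denom = np.sqrt((a0 ** 2).sum() * (b0 ** 2).sum()) + eps
    return float((a0 * b0).sum() / denom)

# Synthetic stand-in for a fixed image (hypothetical data, not from the thesis).
rng = np.random.default_rng(0)
fixed = rng.random((64, 64))

# Same anatomy, same alignment, but a different intensity distribution
# (assumed linear remapping: gain 0.5, offset 0.2).
moving = 0.5 * fixed + 0.2

mse_shift = mse(fixed, moving)         # non-zero despite perfect alignment
ncc_shift = global_ncc(fixed, moving)  # stays at 1.0 up to rounding

print(f"MSE: {mse_shift:.4f}  NCC: {ncc_shift:.6f}")
```

A linear remapping is the simplest possible intensity shift; the correlation term is invariant to it by construction, so the sketch only shows why a purely intensity-matching term can mislead a registration model, not how perceptual or structural losses behave under more general shifts.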

We then explored the effects of local pixel contrast, extracted by modelling a module of the human visual system, on salient region detection in dynamic natural scenes under different illumination conditions. Based on the clear effectiveness of adding such a bio-inspired approach to existing methods on the natural imaging dataset, we proposed a novel bio-inspired layer (NeDev) for deep neural networks that can greatly enhance robustness and tolerance against out-of-domain intensity distributions in medical as well as natural image datasets. This layer transforms the input image into a common image space that is computed from local pixel variance. We benchmark the performance of our approach on different datasets to show the efficacy of the proposed layer. Finally, we provide an application tool for the community that supports labelling, active learning, segmentation and registration tasks on medical imaging datasets with a set of trained models.
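As a rough illustration of what a representation based on local pixel variance looks like, the following sketch computes a per-pixel variance map over a small sliding window. It is not the NeDev layer itself, whose details are not given here; the function name `local_variance`, the 3x3 window, and the reflect padding are assumptions made for this example.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def local_variance(img, k=3):
    """Variance of each pixel's k x k neighbourhood (k odd), same shape as img."""
    pad = k // 2
    # Reflect padding (an assumption) keeps the output the same size as the input.
    padded = np.pad(img, pad, mode="reflect")
    # Shape (H, W, k, k): one k x k window per output pixel.
    windows = sliding_window_view(padded, (k, k))
    return windows.var(axis=(-2, -1))

# Hypothetical test image and a globally brightened copy of it.
rng = np.random.default_rng(0)
img = rng.random((32, 32))
brighter = img + 0.3

var_map = local_variance(img)
var_map_brighter = local_variance(brighter)

# A constant intensity offset leaves the local variance map unchanged.
print(np.allclose(var_map, var_map_brighter))
```

The sketch only demonstrates invariance to an additive offset; a multiplicative gain would scale the variance map, so any normalisation needed for other kinds of intensity shift is outside what this example shows.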

This study provides a thorough analysis of the effects of different types of domain shift on deep learning based methods by investigating their performance on major computer vision tasks, i.e. image registration, image classification and image segmentation. The findings of this study, drawn from these different pathways, lead to the conclusion that structural and local pixel deviation cues are effective as a defence against within-domain and out-of-domain shifts.

Related Publications

Mahmood, H., Iqbal, A., & Islam, S. M. S. (2020, November-December). Exploring intensity invariance in deep neural networks for brain image registration [Paper presentation]. 2020 Digital Image Computing: Techniques and Applications (DICTA), Melbourne, Australia. https://doi.org/10.1109/DICTA51227.2020.9363409

https://ro.ecu.edu.au/ecuworkspost2013/9987/

Mahmood, H., Islam, S. M. S., Hill, J., & Tay, G. (2021, November-December). Rapid segmentation of thoracic organs using u-net architecture [Paper presentation]. 2021 Digital Image Computing: Techniques and Applications (DICTA), Gold Coast, Australia. https://doi.org/10.1109/DICTA52665.2021.9647312

https://ro.ecu.edu.au/ecuworkspost2013/11811/

Mahmood, H., Islam, S. M. S., Gilani, S. O., & Ayaz, Y. (2018, December). Dynamic Saliency Model Inspired by Middle Temporal Visual Area: A Spatio-Temporal Perspective. In 2018 Digital Image Computing: Techniques and Applications (DICTA) (pp. 1-8). IEEE. https://doi.org/10.1109/DICTA.2018.8615806

https://ro.ecu.edu.au/ecuworkspost2013/5700/

Islam, S. M., Mahmood, H., Al-Jumaily, A. A., & Claxton, S. (2018, December). Deep Learning of Facial Depth Maps for Obstructive Sleep Apnea Prediction. In 2018 International Conference on Machine Learning and Data Engineering (iCMLDE) (pp. 154-157). IEEE. https://doi.org/10.1109/iCMLDE.2018.00036

https://ro.ecu.edu.au/ecuworkspost2013/5698/

Access Note

Access to this thesis is embargoed until 3rd February 2026

Recommended Citation

Mahmood, H. (2023). Domain shift robustness in deep learning models. Edith Cowan University. Retrieved from https://ro.ecu.edu.au/theses/2621

Theses: Doctorates and Masters

Domain shift robustness in deep learning models
