Addressing Data Heterogeneity in Image-Based Computer-Aided Detection and Diagnosis Methods | TU Wien

Information

Publication Type: PhD-Thesis
Workgroup(s)/Project(s): not specified
Date: 2024
TU Wien Library:
Open Access: yes
First Supervisor: Eduard Gröller
Pages: 163
Keywords: Medical Image Analysis, Image Processing, Data Heterogeneity, Computer-Aided Detection and Diagnosis, Mammography, Spine Labeling, Machine Learning, Deep Learning

Abstract

The acquisition of medical imaging data is inevitable for screening, diagnosis, planning of surgery or therapy, or monitoring of diseases. In clinical practice, the data is assessed by medical experts, which can be a very time-consuming task. Hence, for decades a lot of research effort has been dedicated to the automated analysis of medical imaging data and to the question of how Computer-Aided Detection and Diagnosis algorithms can assist the tasks mentioned above. However, one of the biggest challenges in this regard is the highly heterogeneous nature of medical imaging data. The acquisition of data from different imaging modalities, like X-ray or Magnetic Resonance Imaging (MRI), changes of acquisition parameters, and the use of different scanners results in diverse data. The varying spatial resolution as well as the high dimensionality of the data pose additional challenges to the development of automated solutions. In this thesis, we investigate different machine learning-based methods to address the analysis of heterogeneous medical imaging data, such as multi-parametric, multi-modal, multi-center, or multi-view data. We present three different pipeline approaches that follow generalization- and fusion- based approaches and demonstrate their applicability on diverse public datasets. Our contributions target two selected use cases in radiology: the semantic labeling of the spine in MRI data and the analysis of mammograms. In semi- and fully-automated spine labeling in MRI data, we are confronted with the problem that MRI data does not exhibit a standardized intensity scale, which results in a large variety of different image contrasts. To overcome this problem for semantic spine labeling, we propose an iterative labeling pipeline that employs Entropy-Optimized Texture Models (ETMs). The application of trained ETMs allows us to apply our models to a wide range of different MRI data. This is in contrast to various related works that develop methods for specific MRI image sequences and protocols. For the analysis of mammography screening data, not only one but four X-ray images from different fields of view are available that form a study of a patient. In addition to this multi-view data, we deal with multi-scale information at various levels, e.g., on a patient, image, or lesion level. To utilize and combine this information efficiently, we develop several deep learning-based models that aim for a specific task important in examining mammograms, such as the localization of abnormalities. For a comprehensive prediction on a patient level, we propose to fuse predictions and features from the individual models to increase performance, which is in contrast to standard ensembling techniques. The results in this thesis demonstrate that considering the different aspects of heterogeneous medical imaging data is inevitable to improve both generalization and predictive capabilities of Computer-Aided Detection and Diagnosis methods.

Additional Files and Images

thesis

Weblinks

BibTeX

@phdthesis{wimmer-2024-adh,
  title =      "Addressing Data Heterogeneity in Image-Based Computer-Aided
               Detection and Diagnosis Methods",
  author =     "Maria Wimmer",
  year =       "2024",
  abstract =   "The acquisition of medical imaging data is inevitable for
               screening, diagnosis, planning of surgery or therapy, or
               monitoring of diseases. In clinical practice, the data is
               assessed by medical experts, which can be a very
               time-consuming task. Hence, for decades a lot of research
               effort has been dedicated to the automated analysis of
               medical imaging data and to the question of how
               Computer-Aided Detection and Diagnosis algorithms can assist
               the tasks mentioned above. However, one of the biggest
               challenges in this regard is the highly heterogeneous nature
               of medical imaging data. The acquisition of data from
               different imaging modalities, like X-ray or Magnetic
               Resonance Imaging (MRI), changes of acquisition parameters,
               and the use of different scanners results in diverse data.
               The varying spatial resolution as well as the high
               dimensionality of the data pose additional challenges to the
               development of automated solutions. In this thesis, we
               investigate different machine learning-based methods to
               address the analysis of heterogeneous medical imaging data,
               such as multi-parametric, multi-modal, multi-center, or
               multi-view data. We present three different pipeline
               approaches that follow generalization- and fusion- based
               approaches and demonstrate their applicability on diverse
               public datasets. Our contributions target two selected use
               cases in radiology: the semantic labeling of the spine in
               MRI data and the analysis of mammograms. In semi- and
               fully-automated spine labeling in MRI data, we are
               confronted with the problem that MRI data does not exhibit a
               standardized intensity scale, which results in a large
               variety of different image contrasts. To overcome this
               problem for semantic spine labeling, we propose an iterative
               labeling pipeline that employs Entropy-Optimized Texture
               Models (ETMs). The application of trained ETMs allows us to
               apply our models to a wide range of different MRI data. This
               is in contrast to various related works that develop methods
               for specific MRI image sequences and protocols. For the
               analysis of mammography screening data, not only one but
               four X-ray images from different fields of view are
               available that form a study of a patient. In addition to
               this multi-view data, we deal with multi-scale information
               at various levels, e.g., on a patient, image, or lesion
               level. To utilize and combine this information efficiently,
               we develop several deep learning-based models that aim for a
               specific task important in examining mammograms, such as the
               localization of abnormalities. For a comprehensive
               prediction on a patient level, we propose to fuse
               predictions and features from the individual models to
               increase performance, which is in contrast to standard
               ensembling techniques. The results in this thesis
               demonstrate that considering the different aspects of
               heterogeneous medical imaging data is inevitable to improve
               both generalization and predictive capabilities of
               Computer-Aided Detection and Diagnosis methods.",
  pages =      "163",
  address =    "Favoritenstrasse 9-11/E193-02, A-1040 Vienna, Austria",
  school =     "Research Unit of Computer Graphics, Institute of Visual
               Computing and Human-Centered Technology, Faculty of
               Informatics, TU Wien ",
  keywords =   "Medical Image Analysis, Image Processing, Data
               Heterogeneity, Computer-Aided Detection and Diagnosis,
               Mammography, Spine Labeling, Machine Learning, Deep Learning",
  URL =        "https://www.cg.tuwien.ac.at/research/publications/2024/wimmer-2024-adh/",
}