Information
- Publication Type: Bachelor Thesis
- Workgroup(s)/Project(s):
- Date: August 2019
- Date (Start): 20. January 2019
- Date (End): 16. August 2019
- Matrikelnummer: 08330864
- First Supervisor: Michael Wimmer
Abstract
In this thesis, we developed an image classification model with improving classification performance over a training phase. The model is using a pre-trained convolutional neuronal network (CNN) for feature extraction and a k-means algorithm for clustering. Performance optimization is realized by optimized weight factors for the extracted feature values. The optimization of the weight factors is calculated iteratively during a training phase. The measure of the weight factor adoption in a training step is related to the ground-truth dependent clustering contribution of the newly added image feature. We see as an advantage of our approach that the optimization requires no internal changes of the applied feature extraction and clustering algorithms, hence pre-trained models or closed-source implementations can be used. As a further advantage, we see the step-wise transparency of the performance development during the training phase for each newly added image as opposed to batch-based training for CNNs. This enables dynamic control of the training phase by the user. Another advantage is the small number of parameters to be optimized, which results in reduced processing time. A further advantage is the classification performance of our model that outperforms the reference model without feature weight optimization. In the course of our work, we developed a Python application that implements our model and provides a user-friendly interface. It allows easy set-up of test cases and provides graphics and tables for a comprehensive evaluation on process steps level. We consider this application as a starting point for future work.Additional Files and Images
Weblinks
No further information available.BibTeX
@bachelorsthesis{Gruber2019,
title = "Extended Image Classification",
author = "Horst Gruber",
year = "2019",
abstract = "In this thesis, we developed an image classification model
with improving classification performance over a training
phase. The model is using a pre-trained convolutional
neuronal network (CNN) for feature extraction and a k-means
algorithm for clustering. Performance optimization is
realized by optimized weight factors for the extracted
feature values. The optimization of the weight factors is
calculated iteratively during a training phase. The measure
of the weight factor adoption in a training step is related
to the ground-truth dependent clustering contribution of the
newly added image feature. We see as an advantage of our
approach that the optimization requires no internal changes
of the applied feature extraction and clustering algorithms,
hence pre-trained models or closed-source implementations
can be used. As a further advantage, we see the step-wise
transparency of the performance development during the
training phase for each newly added image as opposed to
batch-based training for CNNs. This enables dynamic control
of the training phase by the user. Another advantage is the
small number of parameters to be optimized, which results in
reduced processing time. A further advantage is the
classification performance of our model that outperforms the
reference model without feature weight optimization. In the
course of our work, we developed a Python application that
implements our model and provides a user-friendly interface.
It allows easy set-up of test cases and provides graphics
and tables for a comprehensive evaluation on process steps
level. We consider this application as a starting point for
future work.",
month = aug,
address = "Favoritenstrasse 9-11/E193-02, A-1040 Vienna, Austria",
school = "Research Unit of Computer Graphics, Institute of Visual
Computing and Human-Centered Technology, Faculty of
Informatics, TU Wien ",
URL = "https://www.cg.tuwien.ac.at/research/publications/2019/Gruber2019/",
}