TY - JOUR
T1 - Ensembles of deep learning models and transfer learning for ear recognition
AU - Alshazly, Hammam
AU - Linse, Christoph
AU - Barth, Erhardt
AU - Martinetz, Thomas
N1 - Funding Information:
Acknowledgments: The authors acknowledge the financial support by Land Schleswig-Holstein within the funding program Open Access Publikationsfonds. Hammam Alshazly is supported by the German Academic Exchange Service (DAAD) and the Ministry of Higher Education and Scientific Research (MHESR) in Egypt. Christoph Linse is supported by the Mittelstand 4.0-Kompetenzzentrum Kiel project.
Publisher Copyright:
© 2019 by the authors. Licensee MDPI, Basel, Switzerland.
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2019/10/1
Y1 - 2019/10/1
N2 - The recognition performance of visual recognition systems is highly dependent on extracting and representing the discriminative characteristics of image data. Convolutional neural networks (CNNs) have shown unprecedented success in a variety of visual recognition tasks due to their capability to provide in-depth representations exploiting visual image features of appearance, color, and texture. This paper presents a novel system for ear recognition based on ensembles of deep CNN-based models and more specifically the Visual Geometry Group (VGG)-like network architectures for extracting discriminative deep features from ear images. We began by training different networks of increasing depth on ear images with random weight initialization. Then, we examined pretrained models as feature extractors as well as fine-tuning them on ear images. After that, we built ensembles of the best models to further improve the recognition performance. We evaluated the proposed ensembles through identification experiments using ear images acquired under controlled and uncontrolled conditions from mathematical analysis of images (AMI), AMI cropped (AMIC) (introduced here), and West Pomeranian University of Technology (WPUT) ear datasets. The experimental results indicate that our ensembles of models yield the best performance with significant improvements over the recently published results. Moreover, we provide visual explanations of the learned features by highlighting the relevant image regions utilized by the models for making decisions or predictions.
AB - The recognition performance of visual recognition systems is highly dependent on extracting and representing the discriminative characteristics of image data. Convolutional neural networks (CNNs) have shown unprecedented success in a variety of visual recognition tasks due to their capability to provide in-depth representations exploiting visual image features of appearance, color, and texture. This paper presents a novel system for ear recognition based on ensembles of deep CNN-based models and more specifically the Visual Geometry Group (VGG)-like network architectures for extracting discriminative deep features from ear images. We began by training different networks of increasing depth on ear images with random weight initialization. Then, we examined pretrained models as feature extractors as well as fine-tuning them on ear images. After that, we built ensembles of the best models to further improve the recognition performance. We evaluated the proposed ensembles through identification experiments using ear images acquired under controlled and uncontrolled conditions from mathematical analysis of images (AMI), AMI cropped (AMIC) (introduced here), and West Pomeranian University of Technology (WPUT) ear datasets. The experimental results indicate that our ensembles of models yield the best performance with significant improvements over the recently published results. Moreover, we provide visual explanations of the learned features by highlighting the relevant image regions utilized by the models for making decisions or predictions.
UR - http://www.scopus.com/inward/record.url?scp=85072653152&partnerID=8YFLogxK
U2 - 10.3390/s19194139
DO - 10.3390/s19194139
M3 - Journal articles
C2 - 31554303
AN - SCOPUS:85072653152
SN - 1424-8220
VL - 19
JO - Sensors (Switzerland)
JF - Sensors (Switzerland)
IS - 19
M1 - 4139
ER -