TY - GEN
T1 - From Simulation to Reality
T2 - 10th IEEE International Conference on Intelligent Systems, IS 2020
AU - Bird, Jordan J.
AU - Faria, Diego R.
AU - Ekart, Aniko
AU - Ayrosa, Pedro P.S.
PY - 2020/9/18
Y1 - 2020/9/18
N2 - In this work, we show that both fine-tune learning and cross-domain sim-to-real transfer learning from virtual to real-world environments improve the starting and final scene classification abilities of a computer vision model. A 6-class computer vision problem of scene classification is presented from both videogame environments and photographs of the real world, where both datasets have the same classes. 12 networks are trained from 2, 4, 8, , 4096 hidden interpretation neurons following a fine-tuned VGG16 Convolutional Neural Network for a dataset of virtual data gathered from the Unity game engine and for a photographic dataset gathered from an online image search engine. 12 Transfer Learning networks are then benchmarked using the trained networks on virtual data as a starting weight distribution for a neural network to classify the real-world dataset. Results show that all of the transfer networks have a higher starting accuracy pre-training, with the best showing an improvement of +48.34% image classification ability and an average increase of +38.33% for the starting abilities of all hyperparameter sets benchmarked. Of the 12 experiments, nine transfer experiments showed an improvement over non-transfer learning, two showed a slightly lower ability, and one did not change. The best accuracy overall was obtained by a transfer learning model with a layer of 64 interpretation neurons scoring 89.16% compared to the non-transfer counterpart of 88.27%. An average increase of +7.15% was observed over all experiments. The main finding is that not only can a higher final classification accuracy be achieved, but strong classification abilities prior to any training whatsoever are also encountered when transferring knowledge from simulation to real-world data, proving useful domain knowledge transfer between the datasets
AB - In this work, we show that both fine-tune learning and cross-domain sim-to-real transfer learning from virtual to real-world environments improve the starting and final scene classification abilities of a computer vision model. A 6-class computer vision problem of scene classification is presented from both videogame environments and photographs of the real world, where both datasets have the same classes. 12 networks are trained from 2, 4, 8, , 4096 hidden interpretation neurons following a fine-tuned VGG16 Convolutional Neural Network for a dataset of virtual data gathered from the Unity game engine and for a photographic dataset gathered from an online image search engine. 12 Transfer Learning networks are then benchmarked using the trained networks on virtual data as a starting weight distribution for a neural network to classify the real-world dataset. Results show that all of the transfer networks have a higher starting accuracy pre-training, with the best showing an improvement of +48.34% image classification ability and an average increase of +38.33% for the starting abilities of all hyperparameter sets benchmarked. Of the 12 experiments, nine transfer experiments showed an improvement over non-transfer learning, two showed a slightly lower ability, and one did not change. The best accuracy overall was obtained by a transfer learning model with a layer of 64 interpretation neurons scoring 89.16% compared to the non-transfer counterpart of 88.27%. An average increase of +7.15% was observed over all experiments. The main finding is that not only can a higher final classification accuracy be achieved, but strong classification abilities prior to any training whatsoever are also encountered when transferring knowledge from simulation to real-world data, proving useful domain knowledge transfer between the datasets
KW - Autonomous Perception
KW - Computer Vision
KW - Deep Learning
KW - Environment Recognition
KW - Scene Classification
KW - Sim-to-real
KW - Transfer Learning
UR - http://www.scopus.com/inward/record.url?scp=85092719936&partnerID=8YFLogxK
UR - https://ieeexplore.ieee.org/document/9199968
U2 - 10.1109/IS48319.2020.9199968
DO - 10.1109/IS48319.2020.9199968
M3 - Conference publication
AN - SCOPUS:85092719936
T3 - 2020 IEEE 10th International Conference on Intelligent Systems, IS 2020 - Proceedings
SP - 619
EP - 625
BT - 2020 IEEE 10th International Conference on Intelligent Systems, IS 2020 - Proceedings
A2 - Sgurev, Vassil
A2 - Jotsov, Vladimir
A2 - Kruse, Rudolf
A2 - Hadjiski, Mincho
PB - IEEE
Y2 - 28 August 2020 through 30 August 2020
ER -