TY - GEN
T1 - YOLOv3-Based Human Activity Recognition as Viewed from a Moving High-Altitude Aerial Camera
AU - Mmereki, Wazha
AU - Jamisola, Rodrigo S.
AU - Mpoeleng, Dimane
AU - Petso, Tinao
N1 - Funding Information:
ACKNOLEDGEMENT The authors would like to acknowledge the funding support on this work from the Botswana International University of Science and Technology (BIUST) Drones Project with project number P00015.
Publisher Copyright:
© 2021 IEEE.
PY - 2021/2/4
Y1 - 2021/2/4
N2 - This paper presents a method to classify human activities as normal or suspicious using YOLOv3 to automatically process video footages taken from a high altitude moving aerial camera, such as the one attached to a drone. We consider four human activities namely, jogging, walking, fighting, and chasing. Objects generally appear much smaller, with less visible features, when viewed from high altitudes. The reduced visible features make automatic human activity detection from ground surveillance cameras not applicable to the high altitude case. Through transfer learning, we modified existing pre-trained YOLOv3 convolutional neural networks (CNN's) and retrained with our own high aerial human action dataset. By so doing, we were able to customize YOLOv3 to detect, localize, and recognize aerial human activities in real-time as normal or suspicious. The proposed approach achieves a promising average precision accuracy of 82.30%, and average F1 score of 88.10% on classifying high aerial human activities. We demonstrated that YOLOv3 is a powerful approach and relatively fast for the recognition and localization of human subjects as seen from above.
AB - This paper presents a method to classify human activities as normal or suspicious using YOLOv3 to automatically process video footages taken from a high altitude moving aerial camera, such as the one attached to a drone. We consider four human activities namely, jogging, walking, fighting, and chasing. Objects generally appear much smaller, with less visible features, when viewed from high altitudes. The reduced visible features make automatic human activity detection from ground surveillance cameras not applicable to the high altitude case. Through transfer learning, we modified existing pre-trained YOLOv3 convolutional neural networks (CNN's) and retrained with our own high aerial human action dataset. By so doing, we were able to customize YOLOv3 to detect, localize, and recognize aerial human activities in real-time as normal or suspicious. The proposed approach achieves a promising average precision accuracy of 82.30%, and average F1 score of 88.10% on classifying high aerial human activities. We demonstrated that YOLOv3 is a powerful approach and relatively fast for the recognition and localization of human subjects as seen from above.
UR - http://www.scopus.com/inward/record.url?scp=85103740058&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103740058&partnerID=8YFLogxK
U2 - 10.1109/ICARA51699.2021.9376435
DO - 10.1109/ICARA51699.2021.9376435
M3 - Conference contribution
AN - SCOPUS:85103740058
T3 - 2021 International Conference on Automation, Robotics and Applications, ICARA 2021
SP - 241
EP - 246
BT - 2021 International Conference on Automation, Robotics and Applications, ICARA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 International Conference on Automation, Robotics and Applications, ICARA 2021
Y2 - 4 February 2021 through 6 February 2021
ER -