Content oriented 3D-CNN sequence learning architecture for academic activities recognition using a realistic CAD dataset

Sedik, A., Marey, M. & Mostafa, H. An adaptive fatigue detection system based on 3D CNNs and ensemble models. Symmetry 15(6), 1274 (2023).

Shafik, W., Matinkhah, S. M. & Shokoor, F. Cybersecurity in unmanned aerial vehicles: A review. Int. J. Smart Sens. Intell. Syst. 16(1) (2023).

Melhim, L. K. B., Jemmali, M., Boulila, W., Alazab, M., Rani, S., Campbell, H. & Amdouni, H. Leveraging drone-assisted surveillance for effective forest conservation: A case study in Australia’s Daintree Rainforest. IEEE Internet of Things J. (2024).

Kakadiya, R., Lemos, R., Mangalan, S., Pillai, M. & Nikam, S. AI based automatic robbery/theft detection using smart surveillance in banks. In 2019 3rd international conference on electronics, communication and aerospace technology (ICECA), pp. 201–204 (2019).

Park, J. H., Song, K. & Kim, Y.-S. A kidnapping detection using human pose estimation in intelligent video surveillance systems. J. Korea Soc. Comput. Inform. 23, 9–16 (2018).

Hattersley-Gray, R. 2021 Video Surveillance Deep Dive Survey. Campus Safety Magazine (2021). Available: https://www.campussafetymagazine.com/download/2021-video-surveillance-deep-dive-survey/

Tran, D., Bourdev, L., Fergus, R., Torresani, L. & Paluri, M. Learning spatiotemporal features with 3d convolutional networks. In Proceedings of the IEEE international conference on computer vision, pp. 4489–4497 (2015).

Sultana, F., Sufian, A. & Dutta, P. Advancements in image classification using convolutional neural network. In 2018 Fourth international conference on research in computational intelligence and communication networks (ICRCICN), pp. 122–129 (2018).

Ramesh, M. & Mahesh, K. A performance analysis of pre-trained neural network and design of CNN for sports video classification. In 2020 international conference on communication and signal processing (ICCSP), pp. 0213–0216 (2020).

Ribani, R. & Marengoni, M. A survey of transfer learning for convolutional neural networks. In 2019 32nd SIBGRAPI conference on graphics, patterns and images tutorials (SIBGRAPI-T), pp. 47–57 (2019).

Graves, A. Long short-term memory. In Supervised sequence labelling with recurrent neural networks, pp. 37–45 (Springer, 2012).

Li, X., Wang, Y., Zhou, Z. & Qiao, Y. Smallbignet: Integrating core and contextual views for video classification. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 1092–1101 (2020).

Rangasamy, K., As’ari, M. A., Rahmad, N. A. & Ghazali, N. F. Hockey activity recognition using pre-trained deep learning model. ICT Express 6, 170–174 (2020).

Keçeli, A. & Kaya, A. Violent activity detection with transfer learning method. Electron. Lett. 53, 1047–1048 (2017).

Ribeiro, E., Uhl, A., Wimmer, G. & Häfner, M. Transfer learning for colonic polyp classification using off-the-shelf CNN features. In International workshop on computer-assisted and robotic endoscopy, pp. 1–13 (2016).

Shi, Z. et al. A deep CNN based transfer learning method for false positive reduction. Multimedia Tools Appl. 78, 1017–1033 (2019).

Khalifa, N. E. M., Loey, M., Taha, M. H. N. & Mohamed, H. N. E. T. Deep transfer learning models for medical diabetic retinopathy detection. Acta Informatica Medica 27, 327 (2019).

Patrini, I. et al. Transfer learning for informative-frame selection in laryngoscopic videos through learned features. Medical Biol. Eng. Comput. 58, 1225–1238 (2020).

Huang, X., He, P., Rangarajan, A. & Ranka, S. Intelligent intersection: two-stream convolutional networks for real-time near-accident detection in traffic video. ACM Trans. Spatial Algorithms Syst. (TSAS) 6, 1–28 (2020).

Hammerla, N. Y., Halloran, S. & Plötz, T. Deep, convolutional, and recurrent models for human activity recognition using wearables. arXiv preprint arXiv:1604.08880 (2016).

Ordóñez, F. J. & Roggen, D. Deep convolutional and lstm recurrent neural networks for multimodal wearable activity recognition. Sensors 16, 115 (2016).

Chen, Y., Zhong, K., Zhang, J., Sun, Q. & Zhao, X. LSTM networks for mobile human activity recognition. In Proceedings of the 2016 international conference on artificial intelligence: Technologies and applications, Bangkok, Thailand, pp. 24–25 (2016).

Tang, J., Shu, X., Yan, R. & Zhang, L. Coherence constrained graph LSTM for group activity recognition. IEEE Trans. Pattern Anal. Mach. Intell. (2019).

Shu, X., Zhang, L., Sun, Y. & Tang, J. Host–parasite: Graph LSTM-in-LSTM for group activity recognition. IEEE Trans. Neural Netw. Learn. Syst. 32, 663–674 (2020).

Sarma, N., Chakraborty, S. & Banerjee, D. S. Activity recognition through feature learning and annotations using LSTM. In 2019 11th international conference on communication systems & networks (COMSNETS), pp. 444–447 (2019).

Wu, Y., Zheng, B. & Zhao, Y. Dynamic gesture recognition based on LSTM-CNN. In 2018 Chinese automation congress (CAC), pp. 2446–2450 (2018).

Ullah, W. et al. CNN features with bi-directional LSTM for real-time anomaly detection in surveillance networks. Multimedia Tools Appl. 80, 16979–16995 (2021).

Tello-Leal, E., Roa, J., Rubiolo, M. & Ramirez-Alcocer, U. M. Predicting activities in business processes with LSTM recurrent neural networks. In 2018 ITU Kaleidoscope: Machine learning for a 5G Future (ITU K), pp. 1–7 (2018).

Song, B., Fan, C., Wu, Y. & Sun, J. Data prediction for public events in professional domains based on improved RNN-LSTM. J. Phys.: Conf. Ser., p. 012007 (2018).

Singh, U., Determe, J.-F., Horlin, F. & De Doncker, P. Crowd forecasting based on wifi sensors and lstm neural networks. IEEE Trans. Instrum. Meas. 69, 6121–6131 (2020).

Ji, S., Xu, W., Yang, M. & Yu, K. 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35, 221–231 (2012).

Sun, L., Jia, K., Chan, T.-H., Fang, Y., Wang, G. & Yan, S. DL-SFA: Deeply-learned slow feature analysis for action recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2625–2632 (2014).

Poleg, Y., Ephrat, A., Peleg, S. & Arora, C. Compact cnn for indexing egocentric videos. In 2016 IEEE winter conference on applications of computer vision (WACV), pp. 1–9 (2016).

Weyers, P., Schiebener, D. & Kummert, A. Action and object interaction recognition for driver activity classification. In 2019 IEEE intelligent transportation systems conference (ITSC), pp. 4336–4341 (2019).

Ullah, F. U. M., Ullah, A., Muhammad, K., Haq, I. U. & Baik, S. W. Violence detection using spatiotemporal features with 3D convolutional neural network. Sensors 19, 2472 (2019).

Molchanov, P., Yang, X., Gupta, S., Kim, K., Tyree, S. & Kautz, J. Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4207–4215 (2016).

Sanford, R., Gorji, S., Hafemann, L. G., Pourbabaee, B. & Javan, M. Group activity detection from trajectory and video data in soccer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp. 898–899 (2020).

Hwang, S., Ahn, D., Park, H. & Park, T. Maximizing accuracy of fall detection and alert systems based on 3D convolutional neural network. In 2017 IEEE/ACM second international conference on internet-of-things design and implementation (IoTDI), pp. 343–344 (2017).

Wang, Y. et al. An edge 3D CNN accelerator for low-power activity recognition. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 40, 918–930 (2020).

Shambharkar, P. G., Thakur, P., Imadoddin, S., Chauhan, S. & Doja, M. Genre classification of movie trailers using 3D convolutional neural networks. In 2020 4th international conference on intelligent computing and control systems (ICICCS), pp. 850–858 (2020).

Haddad, J., Lézoray, O. & Hamel, P. 3d-cnn for facial emotion recognition in videos. In International symposium on visual computing, pp. 298–309 (2020).

Tanberk, S., Kilimci, Z. H., Tükel, D. B., Uysal, M. & Akyokuş, S. A hybrid deep model using deep learning and dense optical flow approaches for human activity recognition. IEEE Access 8, 19799–19809 (2020).

Yao, L. & Qian, Y. Dt-3dresnet-lstm: An architecture for temporal activity recognition in videos. In Pacific Rim Conference on Multimedia, pp. 622–632 (2018).

Ercolano, G. & Rossi, S. Combining CNN and LSTM for activity of daily living recognition with a 3D matrix skeleton representation. Intel. Serv. Robot. 14, 175–185 (2021).

Deep, S. & Zheng, X. Hybrid model featuring CNN and LSTM architecture for human activity recognition on smartphone sensor data. In 2019 20th international conference on parallel and distributed computing, applications and technologies (PDCAT), pp. 259–264 (2019).

Verma, K. K. & Singh, B. M. Deep multi-model fusion for human activity recognition using evolutionary algorithms. Int. J. Interact. Multimedia Artif. Intell. 7 (2021).

Younesi, A. et al. A comprehensive survey of convolutions in deep learning: Applications, challenges, and future trends. IEEE Access 12, 41180–41218 (2024).

Lasri, I., Solh, A. R. & El Belkacemi, M. Facial emotion recognition of students using convolutional neural network. In 2019 third international conference on intelligent computing in data sciences (ICDS), pp. 1–6 (2019).

Pabba, C. & Kumar, P. An intelligent system for monitoring students’ engagement in large classroom teaching through facial expression recognition. Expert Systems, p. e12839 (2021).

Ashwin, T. & Reddy, G. R. M. Automatic detection of students’ affective states in classroom environment using hybrid convolutional neural networks. Educat. Inf. Technol. 25, 1387–1415 (2020).

Gupta, S. K., Ashwin, T. & Guddeti, R. M. R. Students’ affective content analysis in smart classroom environment using deep learning techniques. Multimedia Tools Appl. 78, 25321–25348 (2019).

Ashwin, T. & Guddeti, R. M. R. Impact of inquiry interventions on students in e-learning and classroom environments using affective computing framework. User Model. User-Adap. Inter. 30, 759–801 (2020).

Wang, D., Fu, R. & Luo, Z. Classroom attendance auto-management based on deep learning. Adv. Soc. Sci. Educat. Hum. Res. 123 (2017).

Wasim, M., Ahmed, I., Ahmad, J. & Hassan, M. M. A novel deep learning based automated academic activities recognition in cyber-physical systems. IEEE Access 9, 63718–63728 (2021).

Wang, Y., Li, Y., Song, Y. & Rong, X. The influence of the activation function in a convolution neural network model of facial expression recognition. Appl. Sci. 10, 1897 (2020).

Chollet, F. Keras: Python’s deep learning library. Available:

link