Human Action Recognition (HAR) Classification Using MediaPipe and Long Short-Term Memory (LSTM)

Ichsan Arsyi Putra; Oky Dwi Nurhayati; Dania Eridani

doi:10.14710/teknik.v43i2.46439

DOI: https://doi.org/10.14710/teknik.v43i2.46439

Human Action Recognition (HAR) Classification Using MediaPipe and Long Short-Term Memory (LSTM)

*Ichsan Arsyi Putra

- Departemen Teknik Komputer, Fakultas Teknik, Universitas Diponegoro, Jl. Prof. Soedarto, S.H., Tembalang, Semarang, Indonesia 50275, Indonesia

Oky Dwi Nurhayati

- Departemen Teknik Komputer, Fakultas Teknik, Universitas Diponegoro, Jl. Prof. Soedarto, S.H., Tembalang, Semarang, Indonesia 50275, Indonesia

Dania Eridani

- Departemen Teknik Komputer, Fakultas Teknik, Universitas Diponegoro, Jl. Prof. Soedarto, S.H., Tembalang, Semarang, Indonesia 50275, Indonesia

Citation Format:

Abstract

Human Action Recognition is an important research topic in Machine Learning and Computer Vision domains. One of the proposed methods is a combination of MediaPipe library and Long Short-Term Memory concerning the testing accuracy and training duration as indicators to evaluate the model performance. This research tried to adapt proposed LSTM models to implement HAR with image features extracted by MediaPipe library. There would be a comparison between LSTM models based on their testing accuracy and training duration. This research was conducted under OSEMN methods (Obtain, Scrub, Explore, Model, and iNterpret). The dataset was preprocessed Weizmann dataset with data preprocessing and data augmentation implementations. Video features extracted by MediaPipe: Pose was used in training and validation processes on neural network models focusing on Long Short-Term Memory layers. The processes were finished by model performance evaluation based on confusion matrices interpretation and calculations of accuracy, error rate, precision, recall, and F1score. This research yielded seven LSTM model variants with the highest testing accuracy at 82% taking 10 minutes and 50 seconds of training duration.

Fulltext View|Download

Keywords: Classification; Deep Learning; Human Action Recognition; MediaPipe; Long Short-Term Memory

Article Metrics:

Article Info

Section: Articles

Language : EN

In Vol. 43, No. 2 (2022): August 2022