Comparison of CNN and CNN-LSTM Performance in Facial Expression Classification Based on FER2013 Dataset

Main Article Content

Putu Ananda Adi Savitri
Agus Aan Jiwa Permana
Ni Putu Novita Puspa Dewi

Abstract

Although facial expression recognition (FER) using deep learning has received increasing attention in prior studies, research specifically addressing the comparative effectiveness of sequential modeling on static image data remains limited. This study aims to evaluate and compare the performance of a pure Convolutional Neural Network (CNN) model and a hybrid CNN–Long Short-Term Memory (CNN-LSTM) model in classifying seven basic facial expressions using the static FER2013 dataset. A quantitative experimental approach with a comparative study design was employed, utilizing the publicly available FER2013 dataset and two custom deep learning architectures. Data were obtained from FER2013 and model performance was evaluated using accuracy, precision, recall, F1-score, and AUC-ROC metrics. The findings indicate that the pure CNN model significantly outperformed the CNN-LSTM model, achieving a testing accuracy of 63.25% compared to 46.82% for the hybrid model; the CNN provided strong discrimination for visually distinct classes but continued to struggle with visually similar expressions. These results contribute to the theoretical development of deep learning architecture selection and expand understanding of the application of sequence models to static data. The study concludes that data characteristics (static versus temporal) play a crucial role in determining model effectiveness, and that for static datasets such as FER2013, a pure CNN constitutes the more appropriate choice. The implications of this research include theoretical contributions to the growing literature on deep learning-based FER and practical recommendations for developers to prioritize CNN architectures for non-temporal image classification tasks, while also highlighting opportunities for future research on transfer learning and attention mechanisms to better capture subtle expression nuances.

Keywords:
Share Article:

Citation Metrics:

Scopus




Downloads

Download data is not yet available.

Article Details

How to Cite
Savitri, P. A., Permana, A. A., & Puspa Dewi, N. P. (2026). Comparison of CNN and CNN-LSTM Performance in Facial Expression Classification Based on FER2013 Dataset. Asian Journal of Science, Technology, Engineering, and Art, 4(1), 1-18. https://doi.org/10.58578/ajstea.v4i1.8252
Author Biographies

Agus Aan Jiwa Permana, Universitas Pendidikan Ganesha, Indonesia

 

     

Ni Putu Novita Puspa Dewi, Universitas Pendidikan Ganesha, Indonesia

 

     

References

Alfandi, M., & Sihite, A. M. H. (2022). Penerapan Metode CNN-LSTM dalam Memprediksi Hujan pada Wilayah Medan. KOMIK (Konferensi Nasional Teknologi Informasi dan Komputer), 6(1), 490–499. https://ejurnal.stmik-budidarma.ac.id/index.php/komik/article/view/5713

Altiarika, E., & Sari, W. P. (2023). Pengembangan Deteksi Realtime untuk Bahasa Isyarat Indonesia dengan Menggunakan Metode Deep Learning Long Short Term Memory dan Convolutional Neural Network. Jurnal Teknologi Informatika dan Komputer, 9(1), 1–13.

Athanasiadou, E., Geradts, Z., & van Eijk, E. (2018). Camera recognition with deep learning. Forensic Sciences Research, 3(3), 210–218. https://doi.org/10.1080/20961790.2018.1485198

Baek, S. S., Pyo, J., & Chun, J. A. (2020). Prediction of water level and water quality using a CNN-LSTM combined deep learning approach. Water, 12(12), 3399. https://doi.org/10.3390/w12123399

Dewi, N., & Ismawan, F. (2021). Implementasi Deep Learning Menggunakan CNN untuk Sistem Pengenalan Wajah. Faktor Exacta, 14(1), 34–43. https://doi.org/10.30998/faktorexacta.v14i1.8989

Ekawati, I., Putra, F. N. R., Sumadyo, M., & Whidhiasih, R. N. (2024). Deteksi Emosi Menggunakan Convolutional Neural Network Berdasarkan Ekspresi Wajah. Journal of Students' Research in Computer Science, 5(1), 73–82. https://doi.org/10.31599/h0kayy31

Ekman, P., & Friesen, W. V. (1978). Facial action coding system. Consulting Psychologists Press.

Fadilla, M. A., Setiawan, H., & Ramadhan, M. (2023). Implementasi Metode Convolutional Neural Network (CNN) pada Sistem Deteksi Emosi dari Ekspresi Wajah Manusia dengan Aplikasi Android sebagai Antarmuka Pengguna. Jurnal Ilmu Komputer dan Sistem Informasi (JIKSI), 9(4), 126–138.

Hermanto, D. T., Setyanto, A., & Luthfi, E. T. (2021). Algoritma LSTM-CNN untuk Binary Klasifikasi dengan Word2vec pada Media Online. Creative Information Technology Journal, 8(1), 64. https://doi.org/10.24076/citec.2021v8i1.264

Ihsan, M., Niswatin, R. K., & Swanjaya, D. (2021). Deteksi Ekspresi Wajah Menggunakan Tensorflow. Joutica, 6(1), 428. https://doi.org/10.30736/jti.v6i1.554

Indraswari, R., Herulambang, W., & Rokhana, R. (2022). Deteksi Penyakit Mata pada Citra Fundus Menggunakan Convolutional Neural Network (CNN). Techno.Com, 21(2), 378–389. https://doi.org/10.33633/tc.v21i2.6162

Johnston, M. (2014). Secondary data analysis: A method of which the time has come. Qualitative and Quantitative Methods in Libraries, 3, 619–626.

Lesmana, A. M., Fadhillah, R. P., & Rozikin, C. (2022). Identifikasi Penyakit pada Citra Daun Kentang Menggunakan Convolutional Neural Network (CNN). Jurnal Sains dan Informatika, 8(1), 21–30. https://doi.org/10.34128/jsi.v8i1.377

Li, J., Yap, M. H., Cheng, W.-H., See, J., Hong, X., Li, X., & Wang, S.-J. (2022). FME ’22: 2nd workshop on facial micro-expression: Advanced techniques for multi-modal facial expression analysis. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 7397–7399). Association for Computing Machinery. https://doi.org/10.1145/3503161.3554777

Montaha, S., Azam, S., Rafid, A. K. M. R. H., Hasan, M. Z., Karim, A., & Islam, A. (2022). TimeDistributed-CNN-LSTM: A hybrid approach combining CNN and LSTM to classify brain tumor on 3D MRI scans performing ablation study. IEEE Access, 10, 60039–60059. https://doi.org/10.1109/ACCESS.2022.3179577

Mulyana, D. I., Sufriman, A., & Yel, M. B. (2023). Implementasi Deteksi Emosional pada Wajah Menggunakan Deep Learning–YOLOv5. JUTECH: Journal Education and Technology, 4(1), 12–22. https://doi.org/10.31932/jutech.v4i1.2174

Musa, P., Anam, W. K., Musa, S. B., Aryunani, W., Senjaya, R., & Sularsih, P. (2023). Pembelajaran Mendalam Pengklasifikasi Ekspresi Wajah Manusia dengan Model Arsitektur Xception pada Metode Convolutional Neural Network. Rekayasa, 16(1), 65–73. https://doi.org/10.21107/rekayasa.v16i1.16974

Pangestu, A. R., Basuki Rahmat, & Fetty Tri Anggraeny. (2020). Implementasi Algoritma CNN untuk Klasifikasi Citra Lahan dan Perhitungan Luas. Jurnal Informatika dan Sistem Informasi (JIFoSI), 1(1), 166–174.

Primawati, A., Sitanggang, I. S., Annisa, A., & Astuti, D. A. (2023). Perbandingan Kinerja LSTM dan Prophet untuk Prediksi Deret Waktu (Studi Kasus Produksi Susu Sapi Harian). Jurnal Edukasi dan Penelitian Informatika (JEPIN), 9(3), 428. https://doi.org/10.26418/jp.v9i3.72031

Sabrina Salsabila. (2025). Teman AI: Peran Character AI dalam Kehidupan Sosial Generasi Z. Jurnal Yudistira: Publikasi Riset Ilmu Pendidikan dan Bahasa, 3(2), 42–49. https://doi.org/10.61132/yudistira.v3i2.1626

Saputri, A. P., Taqwa, A., & Soim, S. (2022). Analisis Deteksi Objek Citra Digital Menggunakan Algoritma YOLO dan CNN dengan Arsitektur RepVGG pada Sistem Pendeteksian dan Pengenalan Ekspresi Wajah. Syntax Literate: Jurnal Ilmiah Indonesia, 7(9), 13069–13080. https://doi.org/10.36418/syntax-literate.v7i9.9393

Satriawan, P. R., Ferdinand, G. M., I Nyoman Putra Satya Natha, I Gst Ayu Pradnya Saci Devi Sastrawan, Marti, N. W., & Ni Putu Novita Puspa Dewi. (2024). Evaluasi dan Perbandingan Algoritma Klasifikasi dalam Analisis Penggunaan Lahan dengan Teknologi Remote Sensing: Sebuah Kajian Sistematik. INSERT: Information System and Emerging Technology Journal, 5(2), 97–109.

Sedana, N. M. K., Wijaya, I. N. S. W., & Arthana, I. K. R. (2024). Analisis Sentimen Berbahasa Inggris dengan Metode LSTM Studi Kasus Berita Online Pariwisata Bali. Jurnal Teknologi Informasi dan Ilmu Komputer, 11(6), 1325–1334.

Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001) (Vol. 1, pp. I-511–I-518). IEEE Computer Society. https://doi.org/10.1109/CVPR.2001.990517

Zhang, F., Bajwa, U. I., & Ahmed, M. U. (2021). Deception detection in videos using the facial action coding system [Preprint]. arXiv. https://doi.org/10.48550/arXiv.2105.13659


Find the perfect home for your research! If this journal isn't the right fit, don't worry—we offer a wide range of journals covering diverse fields of study. Explore our other journals to discover the ideal platform for your work and maximize its impact. Browse now and take the next step in publishing your research:

| HOME | Yasin | AlSys | Anwarul | Masaliq | Arzusin | Tsaqofah | Ahkam | AlDyas | Mikailalsys | Edumalsys | Alsystech | AJSTEA | AJECEE | AJISD | IJHESS | IJEMT | IJECS | MJMS | MJAEI | AMJSAI | AJBMBR | AJSTM | AJCMPR | AJMSPHR | KIJST | KIJEIT | KIJAHRS |