Home Articles Abstract
Research Article

Optimization and Performance Evaluation of Transformer Models in Complex Fall Scenario Videos

Kim, Junseok1 · Lee, Saerom1 · Park, Jonghwa1

1 Kyungpook National University

Published: January 2025 · Vol. 54 No. 6 · pp. 1485-1510

DOI: https://doi.org/10.17287/kmr.2025.54.6.1485

Full Text

Abstract

As the global population ages, falls represent a significant health risk for the elderly. This study aims to propose a high-performance, end-to-end fall detection model designed to serve as a core component for practical, vision-based monitoring systems in real-world environments. We introduce an optimized Transformer-based architecture that detects falls directly from raw RGB video streams, thereby obviating the need for extensive data pre-processing or wearable sensors. The model's generalizability and effectiveness were rigorously evaluated using the AIHUB dataset, which encompasses diverse scenarios, including varied locations and the use of assistive devices. The proposed model achieved an accuracy of 96.5% and an F1-score of 93.2%, demonstrating robust performance even under challenging conditions. The implications of this work are threefold. First, the system can be deployed on existing camera infrastructure, offering a scalable and cost-effective solution for continuous monitoring. Second, by enabling automated monitoring in residential and care facilities, it has the potential to reduce caregiving costs and address service gaps. Third, the non-intrusive nature of the system preserves the privacy and autonomy of individuals. This research contributes significantly to the development of technology-driven safety nets for vulnerable populations and offers practical considerations for senior welfare policies and the design of smart care infrastructure.
Keywords: 노인 안전낙상 감지트랜스포머 아키텍처유니포머영상 분석