Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Vision-based Deep Learning Approach for Human Fall Detection
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology. (Beijer Laboratory for Artificial Intelligence Research, Uppsala University)
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

 Falls can affect individuals of all age groups, leading to severe health problems, including serious injuries and death. Vision-based deep learning Fall Detection Systems (s) (FDS) have shown high accuracy and reliability in detecting falls.

This study investigates the application of a large foundational pre-trained Vision Transformer (ViT), specifically Video Masked Autoencoder with dual masking (VideoMAE V2), which was initially trained on a general human action dataset, for the downstream task of fall recognition using transfer learning. The objective is to evaluate the model’s generalization capabilities against state-of-the-art models and assess its performance in real-world scenarios. Additionally, we explore the impact of fine-tuning different components of the model’s architecture on its performance.

We test the model on the simulated benchmark dataset fall detection dataset University of Rzeszow Fall Detection (URFD) and on the Real-world Fall Dataset (RFDS) dataset, which includes unsimulated real-world scenarios. Under the given experimental settings, The model achieved 94% accuracy in identifying falls and non-falls. In real-world scenarios, it identified 97% of falls but only 5% of Actions/Activities of Daily Living (ADL) scenarios, resulting in an overall accuracy of 51%.

These results highlight the need for diverse real-world fall datasets to improve the robustness of FDS.

Place, publisher, year, edition, pages
2024. , p. 69
Series
IT ; IT mDA24029
Keywords [en]
Humanfall detection, computer vision, deep neural networks, action recognition, vision transformers.
National Category
Natural Sciences Engineering and Technology
Identifiers
URN: urn:nbn:se:uu:diva-539723OAI: oai:DiVA.org:uu-539723DiVA, id: diva2:1903209
Educational program
Master's Programme in Data Science
Presentation
2024-08-30, 13:15 (English)
Supervisors
Examiners
Available from: 2024-10-03 Created: 2024-10-03 Last updated: 2024-10-03Bibliographically approved

Open Access in DiVA

OlaKarrar_thesis_2024_30HP(14573 kB)2360 downloads
File information
File name FULLTEXT01.pdfFile size 14573 kBChecksum SHA-512
572b0a570a1c27dab93fa5fd3a550596710d39e40810ad8748042668418672eeef7bfabdfe60cec64ae6987d01b03952de3011571af967e2624b0e7d3e74affb
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Natural SciencesEngineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 2361 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 665 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf