Transformer And Knowledge Distillation-Based Baggage Threat Detection

Khan, Saad Mazhar

DSpace Home
→
E-Theses
→
CEME
→
Computer Software Engineering
→
MS
→
View Item

Transformer And Knowledge Distillation-Based Baggage Threat Detection

Khan, Saad Mazhar

URI: http://10.250.8.41:8080/xmlui/handle/123456789/51858

Date: 2025

Abstract:

Effective detection of threats in baggage screening is one of the most important aspects of security, especially in highly trafficked places such as airports, malls, and other public venues. Automated threat detection systems have advantages but are still hampered by very high false alarm rates, inability to handle complex or occluded threat objects, and computational inefficiencies that hinder real-time applications. Additionally, conventional machine learning and deep learning approaches are often faced with the difficulties of catastrophic forgetting, which can severely curtail their capacity to adapt and remain effective while incrementally learning from new datasets.Considering the significant gaps mentioned above, the thrust of this research is to establish a thoroughly competent, very accurate, and computationally efficient automated threat detection system based on transformer-based architecture (SegFormer) with an incremental learning and knowledge distillation approach. Therefore, the objectives of this research are:developing a transformer-based optimized knowledge-distillation approach,reliable detection of critical security threats such as guns,explosives, and hazardous materials, public X-ray image databases having been used for training and testing, and evaluating system performance and real-life usability with full-fledged simulated baggage environments.To enable the model to adapt to new threats using incremental learning. The proposed approach operates with SegFormer, which is a hierarchical transformer capable of effectively capturing global contextual information and thus significantly outperforming simpler architectures, such as Encoder-Decoder and Convolutional Transformer (CST). Incremental learning coupled with a novel knowledge distillation-based loss function allows the model to assimilate newly generated information sequentially, without suffering from catastrophic forgetting, thus maintaining the learned representations for earlier threats while being capable of efficiently adapting to newer ones. Impressive improvement in segmentation efficacy was proved through a rigorous evaluation on three pertinent and well-known X-ray datasets-SIXRAY, GDXRAY, and PIDRAY-while the intersection over union score attained 0.854, 0.6831, and 0.805, respectively. Valuable insights into the model’s interpretability were gained through the application of Grad-CAM and t-distributed stochastic neighbor embedding (tSNE) visualization techniques, which clearly delineated accurate threat localization and feature clustering enhancement through incremental learning and knowledge distillation. This research anchors the foundations laid in automated luggage threat detection, demonstrating much higher levels of accuracy, adaptability, and computational efficiency for near-real-time security applications. This kind of work has practical implications for bolstering security and saving lives, quickly and accurately identifying threats. Areas for further exploration indicated include optimization for real-time deployment, better interpret-ability, hybrid modeling approaches, continuous adaptation to new threats, and greater generalization due to larger training sets.