AI for Biometric Identification

Back to Verticals page

In Biometrics, AI has revolutionized identity verification and security, enhancing accuracy from facial recognition to fingerprint analysis. We explore the cutting edge research where AI reshapes secure identity verification.

a. Face Alignment and Recognition

In this study, we introduce a novel approach dubbed as `Multi-Scale Channel Attention Network' with adaptive feature fusion, designed for face recognition. This network integrates attention modules to select informative regions and employs an adaptive feature fusion strategy to enhance the discriminative power of features, particularly in scenarios with inconsistent scales. Our method is specifically designed to guide a spatial transformer network for face alignment in challenging, uncontrolled scenarios such as low-resolution images and occlusions. This design makes the approach suitable for end-to-end alignment of feature maps learned from an unaligned training set.

Related Publications

b Human Face Super-resolution on Real-world Poor-quality Surveillance Video Footage

Most super-resolution (SR) methods do not use real ground-truth high-resolution (HR) and low-resolution (LR) image pairs; instead, a vast majority of methods use synthetic LR images generated from the HR images. Their approaches yield excellent performance on synthetic datasets, but on real-world poor quality surveillance video footage, they suffer from performance degradation. A promising alternative is to apply recent advances in style transfer for unpaired datasets, but state-of-the-art work along these lines has used LR images and HR images from completely different datasets, introducing more variation between the HR and LR domains than necessary. In this study, we propose methods that overcome both of these shortcomings, applying unpaired style transfer learning methods to face SR but using HR and LR datasets that share important properties. The key is to acquire roughly paired training data from a high-quality main-stream and a lower-quality sub-stream of the same IP camera. Based on this principle, we have constructed four datasets comprising more than 400 people, with 1–15 weakly aligned real HR–LR pairs for each subject. We adopt a cycle generative adversarial networks (Cycle GANs) approach that produces impressive super-resolved images for low-quality test images never seen during training. Our approach to face SR makes possible many real-world applications requiring the extraction of high-quality face images from low-resolution video streams such as those produced by security cameras. Developers of diverse applications such as face recognition, 3D face reconstruction, face alignment, face parsing, human–computer interaction, remote sensing, and access control will benefit from the methods introduced in this work.

Related Publications

c. Statistical Descriptors for Face Recognition and Object Classification

In this work, we aim to develop novel multi-order statistical descriptors designed for efficient object classification and face recognition from image sets or videos. The methodology involves representing each gallery set with a global second- order statistic that captures correlated global variations and set structure. A lightweight descriptor is constructed using Cholesky decomposition. To enhance representation power, the descriptor is enriched with the first-order statistic of the gallery set. Dimensionality reduction is achieved by projecting the descriptor into a low- dimensional discriminant subspace while preserving discrimination power. This comprehensive approach provides an effective solution for object classification and face recognition, particularly in the context of image sets or videos.

Related Publications

d. Face Recognition using Pyramid Vision Transformer

In this project a novel pyramid vision transformer is proposed to learn discriminative multi-scale facial representations for face recognition and verification. Face spatial reduction attention and dimensionality reduction layers are employed to make the feature maps compact, thus reducing the computations. An improved patch embedding algorithm is also proposed to exploit the benefits of CNNs in ViTs (e.g., shared weights, local context, and receptive fields) to model lower-level edges to higher-level semantic primitives. The proposed method is evaluated on seven benchmark datasets and compared with ten existing state-of-the-art methods, including CNNs, pure ViTs, and Convolutional ViTs. Despite fewer parameters, FPVT has demonstrated excellent performance over the compared methods. Project page is available at https://khawar-islam.github.io/fpvt/

Related Publications

e. Face Recognition using Hyper Spectral and Multi Spectral Images

Hyperspectral imaging offers new opportunities for face recognition via improved discrimination along the spectral dimension. However, it poses new challenges, including low signal-to-noise ratio, inter-band misalignment, and high data dimensionality. Due to these challenges, the literature on hyperspectral face recognition is not only sparse but is limited to ad hoc dimensionality reduction techniques and lacks comprehensive evaluation. We propose a hyperspectral face recognition algorithm using a spatio-spectral covariance for band fusion and partial least square regression for classification. We also extend 13 existing face recognition techniques, for the first time, to perform hyperspectral face recognition. We formulate hyperspectral face recognition as an image-set classification problem and evaluate the performance of seven state-of-the-art image-set classification techniques. We also test six state-of-the-art grayscale and RGB (color) face recognition algorithms after applying fusion techniques on hyperspectral images. Comparison with the 13 extended and five existing hyperspectral face recognition techniques on three standard data sets show that the proposed algorithm outperforms all by a significant margin. Finally, we perform band selection experiments to find the most discriminative bands in the visible and near infrared response spectrum.

Related Publications

f. Semi-Supervised Spectral Clustering for Objects and Face Classification

We study image set classification algorithms based on unsupervised clustering of labeled training and unlabeled test data where labels are only used in the stopping criterion. The probability distribution of each class over the set of clusters is used to define a true set-based similarity measure. To this end, we propose an iterative sparse spectral clustering algorithm. In each iteration, a proximity matrix is efficiently recomputed to better represent the local subspace structure. Initial clusters capture the global data structure and finer clusters at the later stages capture the subtle class differences not visible at the global scale. Image sets are compactly represented with multiple Grassmannian manifolds which are subsequently embedded in Euclidean space with the proposed spectral clustering algorithm. We also propose an efficient eigenvector solver which not only reduces the computational cost of spectral clustering by many folds but also improves the clustering quality and final classification results.

Related Publications

g. Sparse Linear Discriminant Analysis for Face Classification

No single universal image set representation can efficiently encode all types of image-set variations. In the absence of expensive validation data, automatically ranking representations with respect to performance is a challenging task. We propose a sparse kernel learning algorithm for automatic selection and integration of the most discriminative subset of kernels derived from different image set representations. By optimizing a sparse linear discriminant analysis criterion, we learn a unified kernel from the linear combination of the best kernels only. Kernel discriminant analysis is then performed on the unified kernel. Experiments on four standard datasets show that the proposed algorithm outperforms current state- of-the-art image set classification and kernel learning algorithms

Related Publications

h. Palmprint Identification Using an Ensemble of Sparse Representations

The work will delve into an enhancement to palmprint identification using sparse representation for classification (SRC). SRC's performance is often affected by limited training samples per class and intra-class variations in palmprint images. The proposed method introduces an ensemble of sparse representations through discriminative dictionaries, addressing SRC's limitations. Ensemble learning is applied via random subspace sampling over 2D-PCA space, preserving image structure. Discriminative dictionaries are obtained by minimizing and maximizing intra-class and inter-class variations using 2D-LDA. Extensive experiments on two palmprint datasets demonstrate promising results compared to state-of-the-art methods, highlighting the effectiveness of the proposed technique.

Related Publications

i. Person Identification using Eyes and Periocular Region

The study focuses on periocular region-based person identification using videos and images. It addresses intentional and unintentional facial occlusions, treating periocular recognition in videos as an image-set classification challenge. The proposed two-stage inverse error weighted fusion algorithm demonstrates superior performance compared to single-stage fusion. The study emphasizes the feasibility of image-set-based only eye-region based biometrics for practical applications, showcasing enhanced security system resilience against hacking.

Related Publications