Fusion Method with Mean-discrete Algorithm in Feature level for Identical twins Identification

Bayan Omar Mohammed*

Department of Information Technology, College of Science and Technology, University of Human Development, Sulaymaniyah, KRG, Iraq.

Corresponding author’s e-mail: Bayan Omar Mohammed, Department of Information Technology, College of Science and Technology, University of Human Development, Sulaymaniyah, KRG, Iraq. E-mail: bayan.omar@uhd.edu.iq

Received: 28-10-2020 Accepted: 15-12-2020 Published: 27-12-2020

DOI: 10.21928/uhdjst.v4n2y2020.pp141-150

ABSTRACT

The study on twins is an important form of study in the forensic and biometrics field as twins share similar genetic traits. A biometric is one of the common types of pattern recognition which acquires biometric data from a person. From these data, a feature is established and extracted where these features can be used to identify individual. Exiting works in biometric identification concentrate on unimodal biometric identification. The high similarity in a pair of twin’s biometric may lead to miss performance. Hence, due to their great accurateness, multimodal biometric systems have become more favored than unimodal biometric systems in identical twins identification. However, these systems are highly complex. We proposed Mean-Discrete feature based fusion algorithm for Kurdish handwriting and fingerprint for identical twins detection. Its viability and advantage over the unimodal biometric systems are highlighted. This paper employed 800 images from 50 pairs of identical twins from Kurdistan Region to carry out the experiment.

Index Terms: Mean-discrete Algorithm, Multi-biometric, Feature Based Fusion, Twins Identification, Kurdish Handwriting-fingerprint

1. INTRODUCTION

Biometric-based identification and systems of verification are among the integral technologies today [1-3], and their applications are rather common in access controls (e.g., building entry and computers), electronic commerce (elimination of fake transactions), as well as illegal immigration’s reduction [4]. Notably, biometric identification system for identical twins poses greater challenge as opposed to that which identifies non-twins due to the considerable resemblance that twin individuals have with each other [4,5]. For this reason, researchers of pattern recognition and computer vision have shown their interest in identifying the biometric of a twin. Furthermore, due to its high level of accurateness, in certain situation, this method was the one which could recognize a specific person’s biometric pattern from a collection of individuals [6-8].

The unimodal biometric identification systems for identical twins are now significantly more accurate and reliable [9], and in this respect, good performance can be seen in a number of traits. Still, technological issues remain to be addressed. Among the available unimodal biometric systems include Wonder Ears (Identification of Identical Twins based on images of ear) [10], 3D face recognition (Identification of Identical Twins based on faces) [11], double trouble (Identification of Identical Twins based on Face Recognition) [12], analysis of facial marks [5], and aside from identification of identical twins based on handwriting individuality [13]; these systems have all been scrutinized. Nonetheless, it should be noted that these studies were all considering the physiological aspects [14] implying the unlikely occurrence of change.

Sharing one zygote causes identical twins to have similar genetic composition, increasing the difficulty in identifying them [15]. More than one biometric trait would thus be used in identifying these twins, making multimodal biometric system suitable due to the use of both physical and behavioral traits in its application [16]. Multimodal biometric system combines many biometric traits from multitude of sources. In the context of enrolment, the use of this system enables the enrolment of users even though they have no identified biometric identifier. Such ability becomes a solution to the problems of enrolment, and these abilities therefore prove the universality of multimodal biometric system. To affirm sound performance in a twin’s biometric identification, good and thorough features have to be used as input to a classifier. Accordingly, in a multi-biometric system, an individual is represented by various features. For this reason, a single feature extracted directly for the multi-biometric itself is not representative of a unique feature for a twin biometric.

There are countless of representations in multi-biometric systems, which have led to the presence of vast variance between features for one individual. Somehow, there is small variance in the context of twins’ comparison, making it necessary to engage one more process. This is to enable the unique features to be represented from the pool of multi-biometric features. During the process, many representations obtained from multi-biometric representations of a twin are merged and converted into a uni-representation. The merging and conversion are done before the execution of the identification task. As a result, the level of variance in the data between the twins is decreased. However, the majority of past researches were focusing on the discrete feature extraction methods of each twin’s biometric. Hence, this study presents the application of Mean-Discrete feature based fusion algorithm to combine these features with twins’ Kurdish handwriting-fingerprint.

2. RESEARCH FRAME WORK

Features extracted from a feature extraction method comprise an ensemble of global features. In the context of this study, the features, which are usually classed individually, are representatives of the twin’s Kurdish handwriting-fingerprint particularly, with respect to word and shape. Furthermore, the individual classification of features allows the identification of an individual within a twin or a group of individuals. Accordingly, this study presents the feature-based fusion that the performance of identification in the arena of twin biometric identification can be improved.

For the purpose, twin’s exclusive individual features would be needed, whereas the extracted features are often in multi-representations. For this reason, individual features for each individual within a set of twins are employed together. Arguably, such usage will increase the performance of twin identification. These are called a Mean-Discrete feature vector and this method is used following the process of feature extraction. Mean-discrete feature vector carries the generalized features of global features possessed by individuals. In the model of twin identification, the features are generalized before the classification task. This generates better outcome. Relevantly, the framework proposed in this study is shown in summarized form in Fig. 1.

Fig. 1. Proposed framework for twin multi-biometric.

3. FEATURE EXTRACTION

Aspect United Moment Invariant (AUMI) allows the extraction of global features from the region and boundary (word or shape) in a separate and continuous manner to represent an individual [17]. Here, the fusion embedded scaling factor of aspect is created [18] into the United Moment Invariant [19], as shown in Fig. 2.

Fig. 2. Aspect united moment invariant structure by Mohammed and Shamsuddin [14], Hu [20].

This instantly combines the capacities of these two functions of moment into the proposed AUMI. The [19] United Moment Invariant has an association with the geometrical representation that considers the normalized central moment equations of Geometric Moment Invariant (GMI) [20] and the boundary representation of Improved Moment Invariant (IMI) [21]. Finally, [17] AUMI comprises eight features with the construction of the [19] United Moment Invariants (UMI), as shown below:

As f_i denotes large values, the natural logarithm is employed. As such, below is obtained for i = 1 to 7; q_i←log10 f_i.

The features of the AUMI satisfy the individuality of the concept of the twin’s handwriting-fingerprint [22,23], and the outcomes demonstrate a lower intra-class value for the variance between features for the mean absolute error (MAE) in comparison to the value for the inter-class. This is the reason why the features of the AUMI were explored and employed in the domain of twin biometric identification in this study. The AUMI presents the striking individual features in the extracted invariant feature. In the context of a twin’s biometric identification, getting features that denote the twin’s handwriting-fingerprint from numerous writing styles and shapes are the main purpose [23,24]. The AUMI is primarily concerned with obtaining the twin’s handwriting-fingerprint’s unique features. The purpose of employing algorithms is to extract individual features. Eventually, such use accurately reflects the handwriting-fingerprint of the twin. Conversely, for a multi-biometric representing an individual belonging to a twin, the directly extracted individual features do not directly represent the unique features of a twin biometric. As such, before the measurement task, additional processes should be included. This study proposes use of the Mean- Discrete feature based fusion algorithm before the identification task.

4. PROPOSED MEAN-DISCRETE FEATURE-BASED FUSION

Due to the performance of the collective process on the task of identification, Mean-Discrete Algorithm is dubbed as global combination as well. Furthermore, due to its reliance on each attribute for each feature within the dataset, the Mean-Discrete Algorithm also becomes part of the global characteristic class. Such reliance allows the calculation of Mean-Discrete value for each attribute for each multi-biometric of twin. The application of Mean-Discrete feature vector leads to improved representation of data for the twin’s Kurdish handwriting-fingerprint’s individuality. Accordingly, the Mean-Discrete algorithm methodologies are presented in this section. Mean-Discrete algorithm is used at the feature level and it entails a blend of twins’ multiple features for the concluding decision making. In the task of feature extraction for the multi-biometric of each twin, AUMI is used in producing the columns of eight feature vector, and for the process of Mean-Discrete, eight features are generated from the twin’s Kurdish handwriting-fingerprint. For this reason, it is not impossible to keep the initial amount of invariant feature vector columns within the moment function that is utilized within the feature extraction task. Accordingly, the following section provides the elaboration of the process of Mean-Discrete feature-based fusion, and Fig. 3 details the flowcharts of the Mean-Discrete algorithm proposed in this study.

Fig. 3. Flowchart of Proposed Mean-Discrete Algorithm.

The feature value of Mean-Discrete is computed alongside both biometrics for a twin. This is in line with the class of the individual due to the fact that within a twin set, every individual has distinctive style of writing and shape of fingerprint. Hence, there is individuality of handwriting-fingerprint in twin identification. Using the individual’s class, computation is made to the feature value of Mean-Discrete. The application of this method affirms the protection of the characteristic’s uniqueness or individuality. In addition, the supervised method demonstrates its aptness for twin’s handwriting-fingerprint particularly with respect to individuality. This is because this method maintains the individual features of each twin. In this context, if the two twins have a close or indistinguishable feature vector, then the two classes will be regarded as possessing indistinguishable or nearly indistinguishable Mean-Discrete feature. For this reason, the Mean-Discrete feature will be identical or nearly identical as well.

Moreover, the obtained information and the characteristics of each twin will not be altered by the algorithm proposed. Rather, it denotes the initially extracted multi-biometric feature vector within the Mean-Discrete feature vector of a uni-representation. The process of Mean-Discrete will provide a clear elucidation on the linkages between features, while the characteristics of the features remain the same. For this reason, Mean-Discrete algorithm is appropriate while also fulfilling the individuality of the twin’s handwriting-fingerprint in the context of twin identification. Fig. 4 presents the terms as well as the Mean-Discrete process.

Fig. 4. Mean-discrete (uni-representation) features.

The process of Mean-Discrete feature-based fusion entails the labeling of the person’s class, and the conversion of the multi-representation features into uni-representation features. In the line of Mean-Discrete, first concatenate twins Kurdish handwriting-fingerprint then the intervals are computed using the minimum (Fe_min) and maximum (Fe_max) feature vectors (ifv) for an individual. The line of Mean-Discrete line entails a line of invariant feature vectors. This line begins from the minimum (Fe_min) invariant feature vector value and finishes with the maximum (Fe_min) invariant feature vector value for an individual twin. An interval encompasses the average of the line of Mean-Discrete apportioned by the number of columns within the invariant feature vector. The following is the calculation of the width (wd) of an interval:

where

Fe_min: Minimum value of invariant feature vector for a person in a twin.

Fe_max: Maximum value of invariant feature vector for a person in a twin.

f: Number of columns in the invariant features vector.

The width describes the interval’s cut points. It also computes the value of representation value. The “f” value denotes eight for the number of columns of the feature from AUMI. Meanwhile, the cut point denotes the intervals’ divider (iv) within the line of Mean-Discrete. For invariant feature vectors with similar interval, they have the exact value of representation. For each interval, the value of representation (rv) comprises the average of an interval computed using the following formula: . The value of representation value for all intervals (1-8) denotes the invariant feature vector that is in the following range: if MHF ≥ 1b and if MHF ≤ ub. Meanwhile, mean features (MHF) for each twin are computed using the invariant feature vectors of the twin’s handwriting (H_ij) and fingerprint (F_ij).

Eight features were created in this study, which represents the number of columns for the number of features of the AUMI applied for the twin’s multi-biometric. These features are called the Mean-Discrete feature vector. This vector symbolizes the individuality of the twin’s handwriting-fingerprint to the person. Tables 1 and 2 exemplify the transformation of the twin’s multi-biometric feature vector into the Mean-Discrete feature vector.

TABLE 1 Real data for handwriting and fingerprint for twins

TABLE 2 Mean handwriting and fingerprint for Twin a1

Tables 1 and 2 exhibit a total of eight columns. These columns signify the eight columns of the invariant feature vectors in the AUMI. The Mean-Discrete process is then performed using these data. Fig. 5 shows the Mean-Discrete process for twin number a1.

Fig. 5. From histogram for Max matching score where true matches are on range [0.0109, 99.7900] and non-match score is on range [0.0090, 798.9000].

Table 3 shows the Mean-Discrete feature vector produced from the Mean-Discrete algorithm process. The Mean-Discrete features vector includes the generalized features of a twin individual. These features clearly represent the general individual features for each individual in a twin.

TABLE 3 Example of mean- discrete feature for twins

5. EXPERIMENT AND RESULTS

Mean-Discrete algorithm generates more accurate results of identification task results, and for this reason, it is of value to the context of this study. Within the context of multi-representation analysis, the individuality of the twin’s handwriting-fingerprint is this study’s focal point. The twin’s multi-biometric identification has been improved using the Score Histogram which is based on false Non-Match Rate (FNMR) and False Match Rate (FMR), and the improved aspect is its individuality. An example of the performance of identification with the current accuracy in twin identification task shows the feasibility of Mean-Discrete data in generating better performance. In other words, the prospect of attaining better level of individuality of a twin’s Kurdish handwriting-fingerprint with Mean-Discrete feature based fusion data utilization is proven in this work. Comparison was made between this study’s outcomes of the uni-representation analysis with the Mean-Discrete feature based fusion data and those from the analysis of multi-representation.

5.1. Matching Performance with Mean-discrete Algorithm

In this study, the matching performance is evaluated using Score Histogram which is based on false Non-Match Rate (FNMR) and False Match Rate (FMR). The number of multiple biometric such as the true and non-true match datasets, taken into consideration for the experimental purpose, is also included in the section.

In this study, the datasets used for the experimental purposes are summarized in Table 4:

TABLE 4 Size and characteristic of the multimodal datasets obtained from Kurdistan Reign used in experimental

5.1.1. Score histogram

Here, the histogram illustrates the comparative probability distribution of score for both matches and non-matches over the full range of possible score. This histogram is computed using the relative equations in (10, 11).

Fig. 6 thought 14 shows the significant difference for Mean-Discrete, Min, Max, Produced, Weight-Sum, Sum-Produced, Handwriting, Fingerprint, Sum and Concatenate methods and the following Figs. 6-14 shows the characteristics of the whole methods with matched score distribution after the matching process, respectively. Based on the scores distribution, it can be observed that the match score are lower than [48.5700, 99.7900, 32.7900, 8.5708, 44.1100, 99.7901, 86.9926 , 141.9000, and 103] and non-match score are higher than [0.0013, 0.0090, 0.0000299, 0.0105, 0.0122, 0.0061, 0.0020, 0.0154, and 4.2534], respectively. Thus, if a score is higher than [ 48.5700, 99.7900, 32.7900, 8.5708, 44.1100, 99.7901, 86.9926, 141.9000, and 103], it is definitely represented as false or non-match score with zero probability of error and if a score is lower than [0.0013, 0.0090, 0.0000299, 0.0105, 0.0122, 0.0061, 0.0020, 0.0154, and 4.2534 ], it is represented as true match score with zero probability of error. However, if the score value falls between scores [0.0013 , 0.0090, 0.0000299, 0.0105 , 0.0122, 0.0061, 0.0020, 0.0154, and 4.2534 ] and [48.5700, 99.7900, 32.7900, 8.5708 , 44.1100, 99.7901, 86.9926, 141.9000, and 103], it is known as undermined score because both true and false match score are in this range value [0.0013, 0.0090, 0.0000299, 0.0105, 0.0122, 0.0061, 0.0020, 0.0154, and 4.2534], [48.5700, 99.7900, 32.7900, 8.5708, 44.1100, 99.7901, 86.9926, 141.9000, and 103], respectively . This is where the case of miss-classification or probability of fusion error occurs. Fig. 6 thought 14 shows the histogram of the Min, Max, Produced, Weight-Sum, Sum-Produced, handwriting, fingerprint, Sum, and concatenate, respectively.

Fig. 6. From histogram for Min matching score where true matches are on range [0.0060, 48.5700] and non-match score is on range [0.0013, 105.9000].

Fig. 7. From histogram for product matching score where true matches are on range [0.0000119, 32.7900] and non-match score is on range [0.0000299, 30844].

Fig. 8. From histogram for weight-sum matching score where true matches are on range [0.0140 , 8.5708] and non-match score is on range [0.0105, 250.2000].

Fig. 9. From histogram for sum-product matching score where true matches are on range [0.0171, 44.1100] and non-match score is on range [0.0122, 31233].

Fig. 10. From histogram for handwriting matching score where true matches are on range [0.0099, 99.7901] and non-match score is on range [0.0061, 154.0403].

Fig. 11. From histogram for fingerprint matching score where true matches are on range [0.0060, 86.9926] and non-match score is on range [0.0020, 798.9393].

Fig. 12. From histogram for sum matching score where true matches are on range [0.0170, 141.9000] and non-match score is on range [0.0154, 419.6000].

Fig. 13. From histogram for concatenate matching score where true matches are on range [3.5249, 103] and non-match score is on range [4.2534, 197.6095].

Fig. 14. Score histogram for Mean-Discrete matching score where true matches are on range [0.0123, 5.7095] and non-matches scores are on range [0.0043, 234, 5054].

Fig. 14 shows the histogram of the Mean-Discrete feature matching score. The true Mean-Discrete matching score is lower than 5.7095, while the non-matching Mean-Discrete score is higher than 0.0043. Interestingly, from the figure, it can be seen that they have a very slightly little overlap between the true match and non-match scores [0.0043, 5.7095]. Thus, this technique can minimize the issues of miss-classification and probability of fusion error.

REFERENCES

[1]. Y. Koda, T. Higuchi and A. K. Jain. “Advances in Capturing Child Fingerprints:A High Resolution CMOS Image Sensor with SLDR Method“. 2016 International Conference of the Biometrics Special Interest Group IEEE, pp. 1-4, 2016.

[2]. S. Karahan, M. Kılınc and H. K. Ekenel. “How Image Degradations Affect Deep CNN-based Face Recognition“?IEEE Conference Publications, pp. 1-5, 2016.

[3]. S. Easwaramoorthy, F. Sophia and A. Prathik. “Biometric Authentication Using Finger Nails“. International Conference on Emerging Trends in Engineering, Technology and Science, IEEE Conference Publications, pp. 1-6, 2016.

[4]. H, Behravan and K. Faez. “Introducing a New Multimodal Database from Twins'Biometric Traits“. IEEE Conference Publications, IEEE, pp. 1-6, 2013.

[5]. N. Srinivas, G. Aggarwal, P. J. Flynn and R. W. V. Bruegge. “Analysis of facial marks to distinguish between identical twins“. IEEE Transactions on Information Forensics and Security, vol. 7, pp. 1536-1550, 2012.

[6]. C. Kauba, A. UhlWavelab, E. Piciucco, E. Maiorana and P. Campisi. “Advanced Variants of Feature Level Fusion for Finger Vein Recognition“. IEEE Conference Publications, pp. 1-7, 2016.

[7]. N. Nain, B. M. Deepak, D. Kumar, M. Baswal and B. Gautham. “Optimized minutiae-based fingerprint matching“. Lecture Notes in Engineering and Computer Science, vol. 2170, pp. 682-687, 2008.

[8]. W. Y. Leng and S. M. Shamsuddin. “Fingerprint identification using discretization technique“. International Journal of Computer, Electrical, Automation, Control and Information Engineering, vol. 6, pp. 240-248, 2012.

[9]. J. Weber-Lehmann, E. Schilling, G. Gradl, D. C. Richter, J. Wiehler and R. Burkhard. “Finding the needle in the haystack:Differentiating ''identical''twins in paternity testing and forensics by ultra-deep next generation sequencing“. Forensic Science International:Genetics, vol. 9, pp. 42-46, 2014.

[10]. H. Nejati, L. Zhang, T. Sim E. Martinez-Marroquin and G. Dong. “Wonder Ears:Identification of Identical Twins from Ear Images.“Proceedings of the 21^st International Conference on Pattern Recognition, Nov. 11-15, IEEE Xplorev Press, Tsukuba, Japan, pp. 1201-1204, 2012.

[11]. V. Vipin, K. W. Bowyer, P. J. Flynn, D. Huang, L. Chen, M. Hansen, O. Ocegueda, S. K. Shah and I. A. Kakadiaris. “Twins 3D Face Recognition Challenge“. Proceedings of the International Joint Conference on Biometrics, Oct. 11-13, IEEE Xplore Press, Washington, DC, USA, pp. 1-7, 2011.

[12]. J. R. Paone, P. J. Flynn, P. J. Philips, K. W. Bowyer, R. W. V. Bruegge, P. J. Grodher, G. W. Quinn, P. T. Pruitt and J. M. Grant. “Double trouble:Differentiating identical twins by face recognition“. IEEE Transactions on Information Forensics and Security, vol. 9, pp. 285-295, 2014.

[13]. S. N. Srihari, S. H. Cha, H. Arora and S. Lee. “Individuality of handwriting“. Journal of Forensic Sciences, vol. 47, pp. 1-17, 2002.

[14]. B. O. Mohammed and S. M. Shamsuddin. “Improvement in twins handwriting identification with invariants discretization“. EURASIP Journal on Advances in Signal Processing, vol. 48, pp. 1-19, 2012.

[15]. A. S. Al-Waisy, R. Qahwaji, S. Ipson, S. Al-Fahdawi, T. A. M. Nagem. “A multi-biometric iris recognition system based on a deep learning approach“. Pattern Analysis and Applications, vol. 21, pp. 783-802, 2018.

[16]. B. L. Priya, M. P. Rani. “Authentication of Identical Twins Using Tri Modal Matching“. World Congress on Computing and Communication Technologies, IEEE, 2017.

[17]. K. M. Azah, S. M. Shamsuddin and A. Abrahamz. “Improvement of authorship invarianceness for individuality representation in writer identification“. Neural Network World, vol. 3, pp. 371-387, 2010.

[18]. P. Feng P. and M. Kean. “A new set of moment invariants for handwritten numeral recognition“. IEEE International Conference of Image Processing, vol. 1, pp. 154-158, 1994.

[19]. S. Yinan, L. Weijun and W. Yuechao. “United Moment Invariants for Shape Discrimination“. Proceedings of the IEEE International Conference on Robotics, Intelligent Systems and Signal Processing, Oct. 8-13, IEEE Xplore Press, Changsha, Hunan, China, pp. 88-93, 2003.

[20]. M. K. Hu. “Visual pattern recognition by moment invariants“. IEEE Transactions on Information Theory, vol. 8, pp. 179-187, 1962.

[21]. C. C. Chen. “Improved moment invariants for shape discrimination“. Pattern Recognition, vol. 26, pp. 683-686, 1993.

[22]. B. O. Mohammed and S. M. Shamsuddin. “Twins multimodal biometric identification system with aspect united moment invariant“. Journal of Theoretical and Applied Information Technology, vol. 95, 2895, 2017.

[23]. B. O. Mohammed and S. M. Shamsuddin. “A multimodal biometric system using global features for identical twins identification“. Journal of Computer Science, vol. 14, pp. 92-107, 2018.

[24]. B. O. Mohammed and S. M. Shamsuddin. “Feature discretization for individuality representation in twins handwritten identification“. Journal of Computer Science, vol. 7, pp. 1080-1087, 2011.