You are here

Ardışıl ileri yönlü öznitelik seçim algoritmasında etkin özniteliklerin belirlenmesi

Determining the effective features in sequential forward feature selection algorithm

Journal Name:

Publication Year:

Author NameUniversity of Author
Abstract (2. Language): 
In recent years, pattern recognition and machine learning has become a very active research area due to the applicability in various kind of subjects, including brain computer interface, commercial and financial approaches. Furthermore, it is worthwhile mentioning that there is no definite technique to solve any classification problem. Thus, it can be helpful to get the most discriminative features using feature selection algorithm. Such approaches are generally performed in four steps: 1- preprocessing, 2- feature extraction, 3- feature selection and 4- classification/regression. Among these steps, feature extraction and feature selection are very vital for representing input signals in a reduced feature space and for identifying discriminative information in order to propose fast and high performance application. Because of there are many kind of feature extraction method, in some cases hundreds of features might be calculated for identifying input signals. However, enormous number of features might reduce the decision performance and speed which are very crucial two parameters in pattern recognition and machine learning approaches. In order to eliminate those disadvantages and reduce the number of features some techniques have been proposed. Those techniques are used by machine learning community for selecting the most suitable feature subset among all the extracted feature sets in order to increase the performance of the proposed model. The sequential forward feature selection (SFFS) and the sequential backward feature selection (SBFS) algorithms are very widely used feature selection techniques in literature. In this study, we extracted Continuous Wavelet Transform based features from BCI Competition 2005 Data Set I. Afterwards, among the extracted features, the most stable and effective features were selected by SFFS and SBFS techniques. BCI Competition 2005 Data Set I includes electrocorticogram (ECoG) based brain computer interface signals which was taken from an epilepsy subject on two different days with about one week of delay. In the both sessions the ECoG signals were recorded while subject was asked to imagine of either the left small finger or the tongue movement. The signals were acquired with an 8x8 ECoG platinum electrode grid (totally from 64 points) which was placed on the contralateral (right) motor cortex. All recordings were performed with a sampling rate of 1 kHz (acquired 3000 samples per channel for every trial). Additionally, BCI Competition 2005 Data Set I consist of 278 training trials (139 trials for finger movements, 139 trials for tongue movements) and 100 test trials which were recorded in the first session and the second session, respectively. Each trial’s duration was 3 seconds. In this paper, SFFS and SBFS algorithms tested for determining of effective features after feature extraction procedure. Afterwards, they compared in terms of classification accuracy and speed. While those methods determine the effective features from training data set using cross validation method, the sub-training data sets are selected randomly. So that, different features might be selected for every running of those methods. Thus, selecting different features are influenced the test performance of the model positively/negatively, as well. Moreover, in this paper a method is proposed to overcome this randomly selection disadvantage. In order to show the robustness of the proposed method the SFFS and SBFS algorithms were run 1000 times in the training stage. Afterwards the features, which were selected more than the determining threshold level, were selected as effective features. Moreover, SFFS and SBFS algorithms were compared in terms of the speed and classification accuracy. The obtained results showed that, SFFS is approximately 40 times faster than SBFS and SFFS provides more than 22% classification accuracy.
Abstract (Original Language): 
Bu çalışmada, örüntü tanıma ve makine öğrenmesi uygulamalarında öznitelik çıkarma işleminden sonra etkin özniteliklerin belirlenmesi için kullanılan yöntemlerden; ardışıl ileri yönlü öznitelik seçme (AİYÖS) ve ardışıl geri yönlü öznitelik seçme (AGYÖS) algoritmaları sınıflandırma doğruluğu ve hız bakımından karşılaştırılmıştır. Bu yöntemler, eğitim kümesinden çapraz doğrulama yöntemi ile en yüksek doğrulama başarısını veren öznitelikleri belirlerken, alt eğitim kümeleri rastgele seçilir. Bundan ötürü bu yöntemlerin her koşulmasında farklı öznitelikler sonuç olarak seçilebilmektedir. Dolayısıyla farklı özniteliklerin seçimi ise önerilecek modelin test performansını olumlu/olumsuz etkilemektedir. Bu çalışmada bu rastgele seçimin dezavantajını ortadan kaldırmak için bir yöntem önerilmiştir. Önerilen yöntemin kararlılığını göstermek amacıyla eğitim aşamasında AİYÖS ve AGYÖS algoritmaları 1000 defa koşturulmakta ve belirlenen eşik değerden fazla sayıda seçilen öznitelikler etkin öznitelikler olarak belirlenmektedir. Elde edilen sonuçlara göre; AİYÖS algoritmasının AGYÖS’e göre yaklaşık 40 kat daha hızlı olduğu ve %22 daha fazla sınıflandırma doğruluğu sağladığı görülmüştür.
495
501

REFERENCES

References: 

Aydemir, Ö., (2013). Imlecin Iki Boyutlu
Hareketinin Hayali Sırasında Kaydedilmiş EEG
İşaretlerinin Karar Ağaç Yapısı Esaslı
Sınıflandırılması, Doktora Tezi, Karadeniz
Teknik Üniversitesi Fen Bilimleri Enstitüsü.
Aydemir, O. ve Kayikcioglu, T., (2011). Wavelet
transform based classification of invasive brain
computer interface data, Radioengineering, 20:1,
31-38.
Aydemir, O., Kayikcioglu, T., (2010). Classifying
ECoG based mental tasks using wavelet
transform features”, In 33rd International
Conference on Telecommunications and Signal
Processing. Baden near Vienna (Austria), p. 103-
107.
de la Fraga, L. G., Coello, C. A. C., (2011). A
review of applications of evolutionary algorithms
in pattern recognition In Pattern Recognition,
Machine Intelligence and Biometrics Springer
Berlin Heidelberg, 3-28.
Gan, J. Q., Hasan, B. A. S., ve Tsui, C. S. L., (2014).
A filter-dominating hybrid sequential forward
floating search method for feature subset
selection in high-dimensional space, International
Journal of Machine Learning and Cybernetics,
5(3), 413-423.
Jain, A. K., Duin, R. P., Mao, J., (2000). Statistical
pattern recognition: A review, Pattern Analysis
and Machine Intelligence, 22:1, 4-37.
Kayikcioglu, T., Aydemir, O., (2010). A polynomial
fitting and k-NN based approach for improving
classification of motor imagery BCI data, Pattern
Recognition Letters, 31:11, 1207-1215.
Lal, T. N., Hinterberger, T., Widman, G., Schröder,
M., Hill, N. J., Rosenstiel, W., Schölkopf, B.,
(2004). Methods towards invasive human brain
computer interfaces, In Advances in neural
information processing systems, 737-744.
Li, L., Yu, S., Xiao, W., Li, Y., Li, M., Huang, L.,
Yang, H., (2014). Prediction of bacterial protein
subcellular localization by incorporating various
features into Chou's PseAAC and a backward
feature selection approach”, Biochimie, 104, 100-
107.
Mao, K. Z., (2004). Orthogonal forward selection
and backward elimination algorithms for feature
subset selection. Systems, IEEE Transactions on
Man, and Cybernetics, Part B: Cybernetics,
34(1), 629-634.
500
Ö. Aydemir
Reif, M., ve Shafait, F., (2014). Efficient feature size
reduction via predictive forward selection,
Pattern Recognition, 47(4), 1664-1673.
Xian, G., Xian, G., (2013). Speaker-independent
Recognition by Using Mel Frequency Cepstrum
Coefficient and Multi-dimensional Space Bionic
Pattern Recognition”, Advances in Information
Sciences and Service Sciences, 5:1, 221.
Ververidis, D., Kotropoulos, C., (2008). Fast and
accurate sequential floating forward feature
selection with the Bayes classifier applied to
speech emotion recognition, Signal Processing,
88(12), 2956-2970.
Yavuz E., Aydemir, Ö., (2016). EEG Tabanlı Beyin
Bilgisayar Arayüzü İşaretlerinin Etkin Zaman
Dilimlerinden Çıkarılmış Özniteliklerle
Sınıflandırma Doğruluklarının Artırılması, 20.
Biyomedikal Mühendisliği Ulusal Toplantısı, 3-5
Kasım 2016.

Thank you for copying data from http://www.arastirmax.com