An efficient way of using wrappers in big data classification

dc.contributor.author: Fajila, M.N.F.
dc.date.accessioned: 2018-12-07T08:41:29Z
dc.date.available: 2018-12-07T08:41:29Z
dc.date.issued: 2017-11-28
dc.description.abstract: Data volumes are growing dramatically over time, and the value of that data drives scientists to find patterns that allow high-dimensional data to be used efficiently. Dimensionality reduction is an essential technique in data science when handling big data. Although new techniques are continually being introduced, applying the right technique at the right stage remains challenging. One such technique is the wrapper method for machine learning. Feature selection plays a major role in the classification of big data. A feature can be more informative in the presence of another feature, so no feature should be removed without being assessed. Wrappers evaluate the possible combinations of feature subsets and return the most informative subset, the one that classifies the data with higher accuracy. Compared to filters, however, wrappers are much slower and consume a huge amount of time when applied to big data. Therefore, in the proposed approach, the wrapper is applied after a filter in order to reduce the computational cost. The approach uses a gain ratio filter followed by a classifier subset evaluator, the wrapper for feature subset selection. The proposed technique is validated and evaluated on two high-dimensional microarray data sets: a lung cancer data set and a breast cancer data set. It achieved 97.10% accuracy (with only two misclassifications) on the lung cancer data set and 78.78% accuracy on the breast cancer data set. The results thus show that the proposed approach is highly efficient in terms of both accuracy and computational time.
dc.identifier.isbn: 9789556271232
dc.identifier.uri: http://ir.lib.seu.ac.lk/handle/123456789/3279
dc.language.iso: en_US
dc.publisher: Faculty of Applied Science, South Eastern University of Sri Lanka
dc.subject: Big data
dc.subject: Classification
dc.subject: Dimensionality reduction
dc.subject: Microarray
dc.subject: Wrapper
dc.title: An efficient way of using wrappers in big data classification
dc.type: Article
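
The filter-then-wrapper idea described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the data are synthetic, the gain ratio is computed on a simple median split of each feature, the wrapper uses a greedy forward search rather than an exhaustive one, and the classifier is a nearest-centroid model scored by leave-one-out accuracy. All function names and parameters (`keep`, `max_features`) are hypothetical.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of discrete labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def gain_ratio(values, labels):
    """Gain ratio of one numeric feature, discretized at its median (assumption)."""
    median = sorted(values)[len(values) // 2]
    bins = [v >= median for v in values]
    split_info = entropy(bins)
    if split_info == 0:  # constant feature carries no information
        return 0.0
    conditional = 0.0
    for b in set(bins):
        subset = [lab for x, lab in zip(bins, labels) if x == b]
        conditional += len(subset) / len(labels) * entropy(subset)
    return (entropy(labels) - conditional) / split_info


def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a nearest-centroid classifier on `features`."""
    correct = 0
    for i in range(len(X)):
        sums, counts = {}, Counter()
        for j, (row, label) in enumerate(zip(X, y)):
            if j == i:
                continue  # hold out sample i
            acc = sums.setdefault(label, [0.0] * len(features))
            for k, f in enumerate(features):
                acc[k] += row[f]
            counts[label] += 1

        def dist(label):
            return sum((X[i][f] - sums[label][k] / counts[label]) ** 2
                       for k, f in enumerate(features))

        if min(sums, key=dist) == y[i]:
            correct += 1
    return correct / len(X)


def forward_wrapper(X, y, candidates, max_features):
    """Greedy forward selection: add the feature that most improves accuracy."""
    selected, best = [], 0.0
    while len(selected) < max_features:
        scored = [(loo_accuracy(X, y, selected + [f]), f)
                  for f in candidates if f not in selected]
        if not scored:
            break
        score, feature = max(scored)
        if score <= best:  # stop when no candidate improves the subset
            break
        selected.append(feature)
        best = score
    return selected, best


def filter_then_wrap(X, y, keep=10, max_features=3):
    """Rank all features by gain ratio, then run the wrapper on the top `keep`."""
    ranked = sorted(range(len(X[0])),
                    key=lambda f: gain_ratio([row[f] for row in X], y),
                    reverse=True)
    return forward_wrapper(X, y, ranked[:keep], max_features)


# Synthetic stand-in for a high-dimensional data set: 40 samples, 50 features.
random.seed(0)
y = [i % 2 for i in range(40)]
X = [[random.gauss(0, 1) for _ in range(50)] for _ in y]
for row, label in zip(X, y):
    row[3] += 4 * label  # features 3 and 7 are made informative on purpose
    row[7] += 4 * label

selected, accuracy = filter_then_wrap(X, y)
print("selected features:", selected, "LOO accuracy:", accuracy)
```

The filter step costs one pass per feature, so the expensive wrapper only has to search among the `keep` top-ranked features instead of the full feature set, which is the saving in computational time the abstract refers to.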

Files

Original bundle

Name: ASRS 2017 03....pdf
Size: 9.35 KB
Format: Adobe Portable Document Format
