<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JCC</journal-id><journal-title-group><journal-title>Journal of Computer and Communications</journal-title></journal-title-group><issn pub-type="epub">2327-5219</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jcc.2015.311029</article-id><article-id pub-id-type="publisher-id">JCC-61320</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Semi Advised SVM with Adaptive Differential Evolution Based Feature Selection for Skin Cancer Diagnosis
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ammara</surname><given-names>Masood</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Adel</surname><given-names>Al-Jumaily</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>School of Electrical, Mechanical, Mechatronic Systems, University of Technology, Sydney, Australia</addr-line></aff><pub-date pub-type="epub"><day>19</day><month>11</month><year>2015</year></pub-date><volume>03</volume><issue>11</issue><fpage>184</fpage><lpage>190</lpage><history><date date-type="received"><day>October</day>	<month>2015</month>	</date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
   Automated diagnosis of skin cancer is an important area of research that had different automated learning methods proposed so far. However, models based on insufficient labeled training data can badly influence the diagnosis results if there is no advising and semi supervising capability in the model to add unlabeled data in the training set to get sufficient information. This paper proposes a semi-advised support vector machine based classification algorithm that can be trained using labeled data together with abundant unlabeled data. Adaptive differential evolution based algorithm is used for feature selection. For experimental analysis two type of skin cancer datasets are used, one is based on digital dermoscopic images and other is based on histopathological images. The proposed model provided quite convincing results on both the datasets, when compared with respective state-of-the art methods used for feature selection and classification phase. 
 
</p></abstract><kwd-group><kwd>Classification</kwd><kwd> Feature Selection</kwd><kwd> Skin Cancer</kwd><kwd> Support Vector Machine</kwd><kwd> Differential Evolution</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Malignant melanoma is one of the most dangerous forms of skin cancer. Melanoma cases are recorded in big numbers over the last few decades [<xref ref-type="bibr" rid="scirp.61320-ref1">1</xref>]. In 2014, in Australia an estimated 128,000 new cases of cancer were diagnosed and the number may rise to 150,000 by 2020 [<xref ref-type="bibr" rid="scirp.61320-ref2">2</xref>].</p><p>Traditionally, in the skin cancer diagnosis process, dermoscopic images are used by dermatologists while pathologists use histopathological images of biopsy samples taken from patients and examine them using microscope. However, all the analysis and judgments depend on personal experience and expertise and often lead to considerable variability [<xref ref-type="bibr" rid="scirp.61320-ref3">3</xref>]. The biggest challenges in developing automated diagnostic tools is selection of distinguishing quantitative features and development of an efficient classification algorithm that can be trained using both labeled and unlabeled data due to the limited availability of related skin cancer datasets.</p><p>Due to the complex nature of skin cancer images specially the histopathological images [<xref ref-type="bibr" rid="scirp.61320-ref4">4</xref>], it is suggested to have a good variety of differentiating features to begin with, and then use an efficient feature selection method for removing redundant features for reducing amount of data for classifier learning.</p><p>Searching for the optimal feature subset, which can result in best training as well as testing performance, is a challenging task. Various studies show that DE has outperformed many other optimization algorithms in terms of robustness over common benchmark problems and real world applications [<xref ref-type="bibr" rid="scirp.61320-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.61320-ref6">6</xref>]. This paper proposes a new adaptive differential evolution algorithm based search strategy for feature selection that showed promising results for skin cancer diagnosis both for dermatological as well as histopathological image datasets. It should be noted that the control parameters and learning strategies involved in DE are highly dependent on the problem under consideration and need to be adjusted more adaptively [<xref ref-type="bibr" rid="scirp.61320-ref7">7</xref>]. Thus, one of the research objectives was to adaptively adjust the related control parameter and select the feature subset simultaneously for the problem under consideration, without degrading the classification accuracy.</p><p>On the other hand, for the evaluation of selected feature sets, there are various classification/learning methods proposed in literature [<xref ref-type="bibr" rid="scirp.61320-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.61320-ref9">9</xref>]. However, as we discussed in [<xref ref-type="bibr" rid="scirp.61320-ref10">10</xref>] more research is required that should take into consideration the experts’ advice using the labeled data along with the capability of using unlabeled data due to the high costs and time involved in getting labeled datasets. Unfortunately, this is also a case in developing skin cancer diagnosis models as obtaining the labeled datasets from experts is not a trivial task and it cost a lot of money and is quite time consuming. To solve this issue, developing semi supervised learning methods that can use unlabeled data together with labeled data to build better learners, is an important area that needs consideration. Typical semi-supervised methods used for different applications include Expectation-maximization (EM) algorithm with generative mixture models [<xref ref-type="bibr" rid="scirp.61320-ref11">11</xref>], transductive support vector machines [<xref ref-type="bibr" rid="scirp.61320-ref12">12</xref>] and graph-based methods [<xref ref-type="bibr" rid="scirp.61320-ref13">13</xref>] etc. In addition to using unlabeled data for training it is also of utmost importance that learning process should have self-advising and self-correcting capability to deal with the misclassified data to avoid its effect on the diagnostic performance of the learning model. The proposed semi advised support vector machine is meant to deals with both of the issues mentioned above.</p><p>The paper is organized as follows: Section 2 provides the details of the adaptive differential evolution algorithm proposed for feature selection and the semi-advised support vector machine algorithm proposed for classification. Section 3 provides the overview of the experimental model based on the proposed algorithms and presents the experimental results and finally conclusion is given in Section 4.</p></sec><sec id="s2"><title>2. Proposed Algorithms</title><sec id="s2_1"><title>2.1. Adaptive Differential Evolution Based Feature Selection</title><p>Differential evolution (DE) is a population based optimization method, which has attracted an increased attention in the past few years. Although it showed quite promising results in various applications but in complex applications the search performance get highly depended on the mutation strategy, crossover operation and control factors including scale factor (F), Cross over rate (Cr) and population size (NP) [<xref ref-type="bibr" rid="scirp.61320-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.61320-ref14">14</xref>].</p><p>The paper proposes a DE-based feature selection technique with an adaptive approach to make the feature selection process more dynamic to be applied for complex pattern recognition applications like the histopathological image analysis. It will use advised support vector machine explained in following section for evaluation of selected feature subset. The steps of the feature selection procedure are as follows.</p><p>1) Initialize the population of NP individuals Pop<sub>G</sub> = {<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x4.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x5.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x6.png" xlink:type="simple"/></inline-formula>} where <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x7.png" xlink:type="simple"/></inline-formula> = [x1_i<sub>G</sub>, x2_i<sub>G</sub>, x3_i<sub>G</sub>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x8.png" xlink:type="simple"/></inline-formula>, xD_i<sub>G</sub>], with i = [1, 2, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x9.png" xlink:type="simple"/></inline-formula>, NP] where D is the number of parameter to be optimized.</p><p>2) Set the mutation parameter (F) and cross over control parameter (Cr) using the following equations</p><p><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x10.png" xlink:type="simple"/></inline-formula>with <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x11.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x12.png" xlink:type="simple"/></inline-formula> with <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x13.png" xlink:type="simple"/></inline-formula> and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x14.png" xlink:type="simple"/></inline-formula>. Note <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x15.png" xlink:type="simple"/></inline-formula> while Cauchy distribution prevent premature convergence due to its wider tail property. While <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x16.png" xlink:type="simple"/></inline-formula> is the most successful scale factor in the current generation. While <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x17.png" xlink:type="simple"/></inline-formula> and Gaussian distribution is used as opposite to Cauchy distribution its short tail property help in keeping the value of Cr within unity [<xref ref-type="bibr" rid="scirp.61320-ref7">7</xref>] which is also required here. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x18.png" xlink:type="simple"/></inline-formula>is the successful crossover probability in the current generation .</p><p>3) While the termination criterion (maximum number of iterations) is not satisfied</p><p>Do for i = 1 to NP //do for each individual</p><p>a. Perform Mutation: A mutant vector <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x19.png" xlink:type="simple"/></inline-formula> {v1_i<sub>G</sub>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x19.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x20.png" xlink:type="simple"/></inline-formula>, vD_i<sub>G</sub>} is created corresponding to the ith target vector <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x19.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x20.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x21.png" xlink:type="simple"/></inline-formula> by merging three different randomly selected vectors i.e. using the DE/rand/1 Mutation strategy.</p><disp-formula id="scirp.61320-formula124"><label>(1)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x22.png"  xlink:type="simple"/></disp-formula><p>b. Crossover operation : Employ binomial crossover on each of the D variable as follows for building trial vector</p><disp-formula id="scirp.61320-formula125"><label>(2)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x23.png"  xlink:type="simple"/></disp-formula><p>Here j<sub>rand <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x24.png" xlink:type="simple"/></inline-formula></sub> [1, 2, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x24.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x25.png" xlink:type="simple"/></inline-formula>, D] is a randomly selected index to ensure that <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x24.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x25.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x26.png" xlink:type="simple"/></inline-formula> gets at least some component from <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x24.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x25.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x26.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x27.png" xlink:type="simple"/></inline-formula></p><p>c. Evaluate the population with the objective function.</p><p>d. Perform Selection: Evaluate the trial vector <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x28.png" xlink:type="simple"/></inline-formula> with the fitness function f = accuracy of classifier</p><p>If<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x29.png" xlink:type="simple"/></inline-formula>, then <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x29.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x30.png" xlink:type="simple"/></inline-formula></p><p>Else <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x31.png" xlink:type="simple"/></inline-formula> end if end for</p><p>4) Repeat from step 2 - 3 until G<sub>max </sub></p></sec><sec id="s2_2"><title>2.2. Semi Advised SVM</title><p>For using automated learning techniques in any area in order to improve performance requires a proper choice of the learning algorithm and of their statistical validation. Classifier training with insufficient number of labeled data is a well-known hard problem [<xref ref-type="bibr" rid="scirp.61320-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.61320-ref16">16</xref>]. Development of computer aided diagnostic models for skin cancer is a difficult problem given the relative paucity of labeled lesion data and consequently the training data available is not of high quality [<xref ref-type="bibr" rid="scirp.61320-ref17">17</xref>]. The proposed algorithm addresses the skin lesion classification problem with training the classifier using unlabeled data by making efficient use of limited labeled data.</p><p>In this paper, a semi-advising algorithm for SVM is proposed that extracts subsequent knowledge during the training phase using both labeled and sets of unlabeled data that is added in a batch-mode. The effect of misclassified data during the training phase is controlled by generating advice weights [<xref ref-type="bibr" rid="scirp.61320-ref18">18</xref>] based on using misclassified training data. The details of semi-advised SVM algorithm are as follows:</p><p>Given the dataset is divided into two subsets: a labeled data set D<sub>L</sub> = <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x32.png" xlink:type="simple"/></inline-formula> and an unlabeled data set</p><p>D<sub>UL</sub> = <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x33.png" xlink:type="simple"/></inline-formula></p><p>Step 1: The unlabeled data set D<sub>UL</sub> is equally divided into n subsets D<sub>UL1</sub>, D<sub>UL2</sub>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x34.png" xlink:type="simple"/></inline-formula>, D<sub>ULn</sub>, then D<sub>L</sub> is taken as the initial training set T<sub>s</sub> and initialize i = 1, where i denotes the i<sup>th</sup> loop of the algorithm.</p><p>Step 2: SVM classifier is trained using labeled data &amp; classifying hyperplane is found using decision function</p><disp-formula id="scirp.61320-formula126"><label>(3)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x35.png"  xlink:type="simple"/></disp-formula><p>where x<sub>i</sub> is input vector corresponding to the i<sup>th</sup> sample and is labeled by y<sub>i</sub> depending on its class, b is constant and α<sub>i</sub> is the nonnegative Lagrange multiplier that is inconsistence with standard SVM training.</p><p>As the data is comprised of nonlinearly separable cases so kernel based SVM is used to produce non-linear decision functions and radial basis function kernel (RBF)</p><disp-formula id="scirp.61320-formula127"><label>(4)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x36.png"  xlink:type="simple"/></disp-formula><p>is used to make all necessary operations in the input space.</p><p>Step 3: The misclassified data sets (MD) in the training phase is determined using following relationship.</p><disp-formula id="scirp.61320-formula128"><label>(5)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x37.png"  xlink:type="simple"/></disp-formula><p>The MD set can be null, but most experiments showed that the occurrence of misclassified data in training phase is a common occurrence. It must also be noted that any method that tries to get benefit from misclassified data, must also have some control on the impact of outlier data. We observed that when the misclassified data is comprised of resembling samples, the use of misclassified data actually improved the classification accuracy more as it can lead to the variations required in the final separating hyperplane.</p><p>If the MD is null, go to the next step, else compute neighbourhood length (NL) for each member of MD using the following mathematical relation, which is then used during advised weight calculation.</p><disp-formula id="scirp.61320-formula129"><label>(6)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x38.png"  xlink:type="simple"/></disp-formula><p>where x<sub>j</sub>, j = 1, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x39.png" xlink:type="simple"/></inline-formula>, N are the training data that do not belong to the MD set. Here as the training data is mapped to a higher dimension, the distance between x<sub>i</sub> and x<sub>j</sub> is computed according to the following equation with reference to the related RBF kernel.</p><disp-formula id="scirp.61320-formula130"><label>(7)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x40.png"  xlink:type="simple"/></disp-formula><p>Step 4: The labels for data samples in D<sub>UL1</sub> are estimated using current classifier, and then the most confidently classified elements are determined according to the distance between the element and the separating boundary. The criteria is formulated as |x ・ w ? b} ≥ Th, where constant Th &gt; 0 is the distance threshold. If distance between element and separating boundary is larger than Th, we take it as confident element. The most confidently classified elements with their predicted labels are represented as set R and are added with their predicted labels, to training set T<sub>s</sub>, i.e., T<sub>s</sub> = T<sub>s</sub> ∪ R. Remaining elements of D<sub>UL1</sub> are denoted as unlabeled query UL_Q<sub>i</sub>.</p><p>For each sample x<sub>k</sub> from the unlabelled Query set UL_Q<sub>i</sub> advised weight AW(x<sub>k</sub>) is computed using following mathematical relationship. These AWs represent how close data is to the misclassified data from the labelled set.</p><disp-formula id="scirp.61320-formula131"><label>(8)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/61320x41.png"  xlink:type="simple"/></disp-formula><p>The absolute value of the SVM decision values for each x<sub>k</sub> from the unlabeled Query set set are calculated and scaled to [0, 1]. For each x<sub>k</sub> from unlabelled Query set, if (AW (x<sub>k</sub>) &lt; decision value (x<sub>k</sub>) then <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x42.png" xlink:type="simple"/></inline-formula> which is in consistence with normal SVM labelling, otherwise<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x42.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/61320x43.png" xlink:type="simple"/></inline-formula>. After getting labels of UL_Q<sub>i</sub> add UL_Q<sub>i</sub> with predicted labels to T, that is, T = T ∪ Qi.</p><p>Step 5: i = i + 1</p><p>Step 6: If i equals n, terminate; otherwise, go back to Step 2.</p></sec></sec><sec id="s3"><title>3. Experimental Analysis</title><sec id="s3_1"><title>3.1. Overall Learning Model</title><p>The proposed model is presented in <xref ref-type="fig" rid="fig1">Figure 1</xref>. In order avoid the domination of features in greater numeric ranges on the ones with smaller numeric ranges, both the datasets under consideration are linearly scaled to the range [−1, +1] or [0, 1]. The optimization of feature subsets and control parameters is done based on the adaptive differential evolution algorithm explained in the previous section. Using the selected feature sets, the training sets are fed into classification stage where the semi advised SVM classifier explained in the previous section is used. Once the trained model is obtained it is then used for classification of the test data.</p></sec><sec id="s3_2"><title>3.2. Experimental Results</title><p>Two datasets were used in the experiments, dataset 1 is based on dermoscopic images and dataset 2 is based on histopathological images obtained from the biopsy samples of skin cancer patients. Most of the images in the datasets came from Sydney Melanoma Diagnostic Centre. Dataset one comprise of 300 labeled and 500 unlabeled images. While the Dataset 2 consists of 160 images including 60 labeled and 100 unlabeled samples.</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> Proposed learning model</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/61320x44.png"/></fig><p>For testing the effect of feature selection method and the number of selected features on the overall performance of the model, the performance of the proposed feature selection method is also compared with the ones based on well-established binary Genetic Algorithm BGA [<xref ref-type="bibr" rid="scirp.61320-ref19">19</xref>], Binary PSO (BPSO) [<xref ref-type="bibr" rid="scirp.61320-ref20">20</xref>] and Standard Differential evolution based feature selection [<xref ref-type="bibr" rid="scirp.61320-ref21">21</xref>] (see <xref ref-type="fig" rid="fig2">Figure 2</xref>). All methods were made to start from the same initial population with the population size set to 50 and the evolution process is terminated at the same number of iterations set to 100. The fitness function used for evaluation was the classification accuracy. It can be seen that the proposed method attained comparable or better classification accuracies (using the proposed semi-advised SVM classifier) as compared to other methods for comparatively lesser number of selected features.</p><p>This shows that if parameter tuning and feature selection is done simultaneously and effect of misclassified data/outliers is minimized, it can improve the classification performance of the learning models. In addition to that, it can also help in minimizing the use of redundant/irrelevant features in the final optimized model, which will make system computationally less complex and will also decrease the chances of having over fitted models.</p><p>10 fold cross validation rule is used to validate the performance of the overall model for both datasets. We also compared classification performance of proposed classification algorithm with SVM and T-SVM. <xref ref-type="fig" rid="fig3">Figure 3</xref> shows average classification error rate of different classifiers with respect to the change in the ratio of labeled and unlabeled data samples used for training phase. In consistent with a lot of finding in different other applications [<xref ref-type="bibr" rid="scirp.61320-ref11">11</xref>] it was observed that by increasing the number of labeled data in the training phase helps in reducing the classification error. The classification error reduced to around 16.5% for Histopathological images and 6% for Dermoscopic images when the learning model used 50% of labeled and 50% of unlabeled data.</p></sec></sec><sec id="s4"><title>4. Conclusion</title><p>This paper presents a novel learning model with adaptive differential evolution based feature selection and semi advised support vector machine based classification. The proposed feature selection method is meant to adaptively adjust the tuning parameter for the differential evolution process and do the feature selection for the corresponding dataset simultaneously. On the other hand, the proposed semi advised SVM is trained using labeled data along with adding sets of unlabeled data to deal with the misclassified data elements and improve the gene-</p><fig-group id="fig2"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> Average classification accuracies vs. feature subset sizes.</title></caption><fig id ="fig2_1"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/61320x45.png"/></fig><fig id ="fig2_2"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/61320x46.png"/></fig></fig-group><fig-group id="fig3"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Classification error rate of different classifiers with increasing percentage of labeled training data.</title></caption><fig id ="fig3_1"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/61320x47.png"/></fig><fig id ="fig3_2"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/61320x48.png"/></fig></fig-group><p>ralization performance of the classifier by using increased amount of training data. Experimental analysis shows that the proposed learning model works well and provides an optimal feature set with higher classification rate when compared with some other popular methods used in literature. The efficient use of unlabeled data with the aid of limited labeled dataset helped in obtaining better generalization of the model over the test data and obtained accuracy of around 94% for dermoscopic images and 86.5% for histopathological images.</p></sec><sec id="s5"><title>Cite this paper</title><p>Ammara Masood,Adel Al-Jumaily, (2015) Semi Advised SVM with Adaptive Differential Evolution Based Feature Selection for Skin Cancer Diagnosis. Journal of Computer and Communications,03,184-190. doi: 10.4236/jcc.2015.311029</p></sec></body><back><ref-list><title>References</title><ref id="scirp.61320-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Siegel, R., Naishadham, D. and Jemal, A. (2012) CA: A Cancer Journal for Clinicians. Cancer Statistics, 62, 10-29.</mixed-citation></ref><ref id="scirp.61320-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Australian Institute of Health and Welfare (2014) In ACIM (Australian Cancer Incidence and Mortality) Books.</mixed-citation></ref><ref id="scirp.61320-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Preti, M., et al. (2000) Inter-Observer Variation in Histopathological Diagnosis and Grading of Vulvar Intraepithelial Neoplasia: Results of an European Collaborative Study. BJOG: An International Journal of Obstetrics &amp; Gynaecology, 107, 594-599. http://dx.doi.org/10.1111/j.1471-0528.2000.tb13298.x</mixed-citation></ref><ref id="scirp.61320-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Verhaegen, P.D., van Zuijlen, P.P., Pennings, N.M., van Marle, J., Niessen, F.B., van der Horst, C.M. and Middelkoop, E. (2009) Differences in Collagen Architecture between Keloid, Hypertrophic Scar, Normotrophic Scar, and Normal Skin: An Objective Histo-pathological Analysis. Wound Repair and Regeneration, 17, 649-656.  
http://dx.doi.org/10.1111/j.1524-475X.2009.00533.x</mixed-citation></ref><ref id="scirp.61320-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Khushaba, R.N., Ahmed, A.-A. and Al-Jumaily, A. (2011) Feature Subset Selection Using Differential Evolution and a Statistical Repair Mechanism. Expert Systems with Appli-cations, 38, 11515-11526.  
http://dx.doi.org/10.1016/j.eswa.2011.03.028</mixed-citation></ref><ref id="scirp.61320-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Islam, S.M.D., Ghosh, S., Roy, S., Suganthan, S. and Nagaratnam, P. (2012) An Adaptive Differential Evolution Algorithm with Novel Mutation and Crossover Strategies for Global Numerical Optimization. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 42, 482-500. http://dx.doi.org/10.1109/TSMCB.2011.2167966</mixed-citation></ref><ref id="scirp.61320-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Skrovseth, S.O., et al. (2010) A Computer Aided Diagnostic System for Malignant Melanomas. 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL).  
http://dx.doi.org/10.1109/isabel.2010.5702825</mixed-citation></ref><ref id="scirp.61320-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Ruiz, D., et al. (2011) A Decision Support System for the Diagnosis of Melanoma: A Comparative Approach. Expert Systems with Applications, 38, 15217-15223. http://dx.doi.org/10.1016/j.eswa.2011.05.079</mixed-citation></ref><ref id="scirp.61320-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Rahman, M.M., Bhattacharya, P. and Desai, B.C. (2008) A Multiple Expert-Based Melanoma Recognition System for Dermoscopic Images of Pigmented Skin Lesions. 8th IEEE International Conference on BioInformatics and BioEngineering, BIBE.</mixed-citation></ref><ref id="scirp.61320-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Masood, A. and Ali Al-Jumaily, A. (2013) Computer Aided Diagnostic Support System for Skin Cancer: A Review of Techniques and Algorithms. International Journal of Biomedical Imaging, 22. http://dx.doi.org/10.1155/2013/323268</mixed-citation></ref><ref id="scirp.61320-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Mitchell, K.N. (2000) Text Classification from Labeled and Unlabeled Documents Using EM. Machine Learning, 39, 103-134. http://dx.doi.org/10.1023/A:1007692713085</mixed-citation></ref><ref id="scirp.61320-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Collobert, R., Sinz, F., Weston, J. and Bottou, L. (2006) Large Scale Transductive SVMs. Journal of Machine Learning Research, 7, 1687-1712.</mixed-citation></ref><ref id="scirp.61320-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Blum, A. and Chawla, S. (2001) Learning from Labeled and Unlabeled Data Using Graph Mincuts. Intl. Conference on Machine Learning.</mixed-citation></ref><ref id="scirp.61320-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Kumar, P.P. and Millie, A (2010) Self Adaptive Differential Evolution Algorithm for Global Optimization. Swarm, Evolutionary, and Memetic Computing, 103-110. http://dx.doi.org/10.1007/978-3-642-17563-3_13</mixed-citation></ref><ref id="scirp.61320-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Zhou, Z.-H., Zhan, D.-C. and Yang, Q. (1999) Semi-Supervised Learning with Very Few Labeled Training Examples. Proceedings of the National Conference on Artificial Intelligence, Menlo Park, Cambridge, London.</mixed-citation></ref><ref id="scirp.61320-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Li, K., Luo, X. and Jin, M. (2010) Semi-Supervised Learning for SVM-KNN. Journal of Computers, 5, 671-678. 
http://dx.doi.org/10.4304/jcp.5.5.671-678</mixed-citation></ref><ref id="scirp.61320-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Masood, A., Al-Jumaily, A. and Anam, K. (2015) Self-Supervised Learning Model for Skin Cancer Diagnosis. 7th International IEEE/EMBS Conference on Neural Engineering (NER), 1012-1015.  
http://dx.doi.org/10.1109/ner.2015.7146798</mixed-citation></ref><ref id="scirp.61320-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Masood, A., Al-Jumaily, A. and Anam, K. (2014) Texture Analysis Based Automated Decision Support System for Classification of Skin Cancer Using SA-SVM. Neural Information Processing, Springer International Publishing, 101- 109. http://dx.doi.org/10.1007/978-3-319-12640-1_13</mixed-citation></ref><ref id="scirp.61320-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Haupt, R.L. and Haupt, S.E. (2004) Practical Genetic Algorithms. John Wiley &amp; Sons.</mixed-citation></ref><ref id="scirp.61320-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Firpi, H.A.G. and Erik, D. (2004) Swarmed Feature Selection. 33rd Applied Imagery Pattern Recognition Workshop. 
http://dx.doi.org/10.1109/AIPR.2004.41</mixed-citation></ref><ref id="scirp.61320-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Khushaba, R.N., Al-Ani, A. and Al-Jumaily, A. (2008) Differential Evolution Based Feature Subset Selection. 19th International Conference on Pattern Recognition. http://dx.doi.org/10.1109/icpr.2008.4761255</mixed-citation></ref></ref-list></back></article>