<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OJS</journal-id><journal-title-group><journal-title>Open Journal of Statistics</journal-title></journal-title-group><issn pub-type="epub">2161-718X</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ojs.2017.71003</article-id><article-id pub-id-type="publisher-id">OJS-74080</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  RobCoP: A Matlab Package for Robust CoPlot Analysis
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Yasemin</surname><given-names>Kayhan Atilgan</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Erdinc</surname><given-names>Levent Atilgan</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Department of Statistics, Hacettepe University, Ankara, Turkey</addr-line></aff><aff id="aff2"><addr-line>Meteksan Defense Ind., Ankara, Turkey</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>ykayhan@hacettepe.edu.tr(YKA)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>09</day><month>02</month><year>2017</year></pub-date><volume>07</volume><issue>01</issue><fpage>23</fpage><lpage>35</lpage><history><date date-type="received"><day>22,</day>	<month>December</month>	<year>2016</year></date><date date-type="rev-recd"><day>10,</day>	<month>February</month>	<year>2017</year>	</date><date date-type="accepted"><day>13,</day>	<month>February</month>	<year>2017</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  The graphical representation method, Robust CoPlot, is a robust variant of the classical CoPlot method. CoPlot is an adaptation of multidimensional scaling (MDS), and is a practical tool for visual inspection and rich interpretation of multivariate data. CoPlot enables presentation of a multidimensional dataset in a two dimensions, in a manner that relations between both variables and observations to be analyzed together. It has also been used as a supplemental tool to cluster analysis, data envelopment analysis (DEA) and outlier detection methods in the literature. However, this method is very sensitive to outliers. When a multidimensional dataset contains outliers, this can lead to undesirable consequences such as the inaccurate representation of the variables. The motivation is to produce Robust CoPlot that is not unduly affected by outliers. In this study, we have presented a new MATLAB package RobCoP for generating robust graphical representation of a multidimensional dataset. This study serves a useful purpose for researchers studying the implementation of Robust CoPlot method by providing a description of the software package RobCoP; it also offers some limited information on the Robust CoPlot analysis itself. The package presented here has enough flexibility to allow a user to select an MDS type and vector correlation method to produce either classical or Robust CoPlot results.
 
</p></abstract><kwd-group><kwd>Robust</kwd><kwd> CoPlot</kwd><kwd> Multidimensional Scaling</kwd><kwd> Multivariate Analysis</kwd><kwd> MATLAB</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>CoPlot method, introduced by [<xref ref-type="bibr" rid="scirp.74080-ref1">1</xref>] , is used as a tool for multi-criteria grouping. It consists of two graphs: the first represents the distribution of p dimensional observations over two-dimensional space, whereas the second shows the relation- ships between variables and observations. The main advantage of this method is that it enables the simultaneous investigation of the relations between the observations and between the variables for a set of data. In contrast to many other multivariate methods that produce composites of variables (such as principal component analysis and cluster and factor analysis), CoPlot uses variables that are derived from the original dataset.</p><p>Among the wide spectrum of graphical techniques for the treatment of multidimensional dataset, CoPlot method has attracted much attention in recent years in a wide range of areas for various purposes. CoPlot is used for geome- trical representation of multi-criteria decision problems [<xref ref-type="bibr" rid="scirp.74080-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.74080-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.74080-ref4">4</xref>] , has been utilized in econometric studies [<xref ref-type="bibr" rid="scirp.74080-ref5">5</xref>] , in energy and environmental modeling [<xref ref-type="bibr" rid="scirp.74080-ref6">6</xref>] , in exploratory data analysis [<xref ref-type="bibr" rid="scirp.74080-ref7">7</xref>] , as an outlier detection tool [<xref ref-type="bibr" rid="scirp.74080-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.74080-ref9">9</xref>] and for presenting DEA graphically [<xref ref-type="bibr" rid="scirp.74080-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.74080-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.74080-ref12">12</xref>] .</p><p>Although it is increasingly popular for applications involving multidimen- sional datasets, CoPlot method is sensitive to the outliers. To obtain reliable results, a graphical representation is needed that accounts for the presence of outliers. If the dataset contains outliers, the representation of the variables may deviate strongly from those obtained from the clean data in CoPlot method. Aim of Robust CoPlot method is to reduce impact of outliers and try to fit the bulk of the data [<xref ref-type="bibr" rid="scirp.74080-ref13">13</xref>] .</p><p>In this paper, we present the RobCoP package for MATLAB [<xref ref-type="bibr" rid="scirp.74080-ref14">14</xref>] , a software package that implements Robust CoPlot. A first objective in developing this package was to provide researchers with a software package that offers both classical and Robust CoPlot analysis for use with MATLAB; to our knowledge, this is the only package currently providing these features. In the existing literature, there is only one comparable software, which is not open source [<xref ref-type="bibr" rid="scirp.74080-ref15">15</xref>] , enabling only the analysis of classical CoPlot. The package is freely available on the website of the Mathworks file exchange. The site https://www.mathworks.com/matlabcentral/fileexchange/61338-robcop--a-matlab-package-for-robust-coplot-analysis contains the basic functions needed to run the analysis and to obtain the corresponding Robust CoPlot results.</p><p>The paper is organized as follows: Section 2 briefly introduces the Robust CoPlot algorithm, and Section 3 gives details about RobCoP written as a set of MATLAB functions. In Section 4, two examples are provided for the application of the package.</p></sec><sec id="s2"><title>2. Methodology of Robust Coplot</title><sec id="s2_1"><title>2.1. Standardization of Data</title><p>The Robust CoPlot method mainly consists of three steps. In order to obtain Robust CoPlot graphs, an MDS embedding of the dataset should be generated. The first step in the algorithm is to obtain standardized data; otherwise, variables measured at different scales do not contribute equally to the analysis [<xref ref-type="bibr" rid="scirp.74080-ref16">16</xref>] . Typical data standardization procedures transform the data to comparable scales by using sample mean and standard deviation. However, these two estimators are very sensitive to outliers, even if only one strong outlier may attract the sample mean and inflate the sample variance. By using median and median absolute deviation (MAD), which are the robust equivalents of these two estimators, possible effects of outliers on the standardization of data are restricted. In Robust CoPlot, the p -dimensional n point data matrix X n &#215; p is transformed into the standardized matrix Z n &#215; p in a robust way as follows:</p><p>z i j = x i j − med ( x j ) MAD ( x j ) (1)</p><p>where z i j is the i -th row and j -th column element of the standardized matrix Z n &#215; p , x j is the j -th column of data matrix X n &#215; p , med ( . ) is the median function, and MAD ( x j ) = 1.4826 med ( | x j − med ( x j ) | ) stands for the median absolute deviation.</p></sec><sec id="s2_2"><title>2.2. Obtaining MDS Embedding</title><p>In the second step, the p -dimensional dataset is mapped onto a two- dimensional space by taking account of the dissimilarity metric obtained from the standardized data matrix. To find a proper embedding of the dataset, metric (classic) or non-metric (ordinary) MDS is used in the literature. Although non-metric MDS (NMDS) can be considered in order to overcome the existence of outliers, Spence and Lewandowsky [<xref ref-type="bibr" rid="scirp.74080-ref17">17</xref>] demonstrated that NMDS may be adversely affected by outliers. The Robust CoPlot method uses the robust MDS (RMDS) proposed by [<xref ref-type="bibr" rid="scirp.74080-ref18">18</xref>] . The main advantage of RMDS is the use of the outlier aware cost function defined as</p><p>f ( O , Y ) = ∑ i &lt; j [ δ i j − d i j ( Y ) − o i j ] 2 + λ ∑ i &lt; j | o i j | (2)</p><p>where δ i j is the dissimilarity metric among i -th and j -th row of the standardized matrix Z n &#215; p , Y n &#215; 2 is the coordinate matrix for two-dimensional space, d i j ( Y ) shows the Euclidean distance between i -th and j -th row of coordinate matrix Y n &#215; 2 , λ &gt; 0 is the parameter that controls the assumed number of outliers, and the i -th row j -th column element of the outlier matrix O is o i j = sgn ( δ i j − d i j ( Y ) ) max ( 0 , | δ i j − d i j ( Y ) | − λ / 2 ) , which repre- sents the outlier variable.</p></sec><sec id="s2_3"><title>2.3. Adding Variable Vectors</title><p>In the last step of the Robust CoPlot method, vectors representing the variables are located on the obtained robust MDS map. Robust CoPlot decides the direction and magnitude of a vector using the median absolute deviation correlation coefficient (MADCC), ρ j , MADCC , given by [<xref ref-type="bibr" rid="scirp.74080-ref19">19</xref>] .</p><p>ρ j , MADCC = MAD 2 ( u j ) − MAD 2 ( k j ) MAD 2 ( u j ) + MAD 2 ( k j ) . (3)</p><p>Here, u j and k j are the robust principal variables given as follows:</p><p>u j = z j − med ( z j ) MAD ( z j ) + ν j − med ( ν j ) MAD ( ν j ) k j = z j − med ( z j ) MAD ( z j ) − ν j − med ( ν j ) MAD ( ν j ) . (4)</p><p>In (4), z j stands for the j -th column of standardized data matrix Z n &#215; p , and ν j represents the projection values of all n points in the MDS map on the j -th variable vector for a specific direction. For each degree of 360 ∘ , the ρ j , MADCC correlation between the actual values of the variable j and their projection on the vector, ν j , is calculated. The direction of the vector is determined so that the calculated ρ j , MADCC value attains maximum.</p></sec></sec><sec id="s3"><title>3. Features of the RobCoP Package</title><p>The RobCoP package contains just one main function, RobustCoPlot(), and many auxiliary functions. RobustCoPlot() has one input argument, InStrct, and one output argument, OutStrct. Each argument is a MATLAB structure with different fields. The RobustCoPlot() function can perform NMDS, RMDS analysis with many options for dissimilarity distance function, data stand- ardization type, and MDS initialization method. In addition to MDS analysis, classical and Robust CoPlot analyses can also be performed. The desired analysis is determined by the field values of the input structure, InStrct.</p><p>To generate an input structure according to the desired type of analysis, <xref ref-type="fig" rid="fig1">Figure 1</xref> can be used for guidance. The InStrct.X field of the input structure should take the data file name. The data file to be processed by RobCoP should be in comma-separated value (CSV) format. The data columns to be analyzed are selected by using InStrct.DataColNums field. This field should be a one-dimensional matrix whose numeric elements indicate the selected columns from the input CSV file. An optional field, InStrct.ColorColumn, is used for colorizing the data points on the obtained MDS graph. This field should be a scalar that selects the column from the CSV file to be used in colorizing the data points. The InStrct.ColorValues field is a one-dimensional numeric matrix whose elements are the values selected from the column pointed by InStrct. ColorColumn. The RobustCoPlot() can colorize up to six different values selected from InStrct.ColorColumn. In other words, the obtained MDS graph can split the data points by using different shapes and colors up to six groups. The RobustCoPlot() can use three different kinds of distance functions for obtaining the dissimilarity matrix to be used in MDS. The InStrct.DisSimDist field is used for selecting “Euclidean”, “Cityblock”, or “Dominance” distance functions for the analysis. The standardization technique of the dataset can also be chosen by using the InStrct.StdType field. The possible values of the field are “Mean” and “Median”. “Mean” selects the sample mean and sample variance for standardization, while “Median” uses the median value instead of the mean as well as the median absolute deviation (MAD) for variance. The starting point for the MDS analysis is determined by using the InStrct.InitMethod field. The</p><p>possible choices are “PCA” for principal component analysis and “Random” for randomly selected starting points. “NMDS” or “RMDS” selection for the MDS analysis is done by using the InStrct.MDSMethod field. If “RMDS” is selected, InStrct.OutlierRatio field should also be defined. The InStrct.OutlierRatio field can take values from ( 0 , 1 ) interval, and represents the assumed outlier ratio for RMDS analysis. The InStrct.DrawGraph is an optional field which can take values “Shepard”, “MDS”, and “CoPlot”. If this field is not defined, the RobustCoPlot() performs the MDS analysis in silence mode and returns the coordinates of the obtained embedding. “Shepard” option draws the Shepard Diagram only, “MDS” draws the MDS graph, and the “CoPlot” option performs CoPlot analysis. To see all of the graphs, the “ALL” value should be used. If the “CoPlot” option is selected for the InStrct.DrawGraph, the vector correlation method for CoPlot should also be selected by using the InStrct.VecCorrMethod field. If “PCC” is selected the representation of vectors is implemented by the Pearson correlation coefficient; if “MADCC” is selected, representation is implemented by the median absolute deviation correlation coefficient.</p><p>The fields of the output structure, OutStrct, vary by the MDS analysis type selected. The following two fields, OutStrct.StressValue and OutStrct.Embedding, are the returned fields regardless of the MDS method selected. The OutStrct. StressValue field returns the Kruskall stress value of the obtained resultant MDS embedding. The Kruskall stress value shows the quality of the obtained two-dimensional mapping of the multivariate data, and a smaller value means good representation. The OutStrct.Embedding field returns the coordinates of the data points found by the selected MDS method. If “RMDS” is selected as the InStrct.MDSMethod, then OutStrct contains an additional field, OutStrct.Outlier, containing non-zero elements showing the distances that are deemed as outliers during the RMDS analysis.</p></sec><sec id="s4"><title>4. Illustrative Examples</title><p>Robust CoPlot method considers all the variables as well as the observations simultaneously to obtain two dimensional map. Correlations among the va- riables, relations among the observations and mutual relationship among the observations and their measuring variables can be seen by a single graphical representation. Besides possible outliers which are located far from the bulk of the data can easily been detected.</p><p>In this section, we present and illustrate the use of the RobCoP package on the dataset frequently used in the DEA analysis to show the economic performance of China’s cities [<xref ref-type="bibr" rid="scirp.74080-ref20">20</xref>] . Step-by-step instructions will be given on how to obtain classic and Robust CoPlot maps. In the dataset, there are six variables for 35 of China’s cities (Decision Making Units/DMU): labor (ILF), working fund (WF), investment (INV), gross industrial output value (GIOV), profit and taxes (P&amp;T), and retail sales (RS). All of the examples given in this section use the same dataset to make comparisons between classical and Robust CoPlots. The first two examples are related to the embedding of the observations into two- dimensions and the following two examples are prepared for CoPlot results.</p><p>The RobustCoPlot() function takes the CSV file as an input dataset. The first line of the input data file should contain the names of the variables, and the number of columns in the file should be equal to the number of variable names. In other words, the input file should not contain any unnamed columns. The first few lines of the CSV file used in the examples are given in <xref ref-type="table" rid="table1">Table 1</xref> for</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> First a few lines of the input CSV file</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >DMU(1),</th><th align="center" valign="middle" >ILF(2),</th><th align="center" valign="middle" >WF(3),</th><th align="center" valign="middle" >INV(4),</th><th align="center" valign="middle" >GIOV(5),</th><th align="center" valign="middle" >P&amp;T(6),</th><th align="center" valign="middle" >RS(7),</th><th align="center" valign="middle" >COLOR(8)</th></tr></thead><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >110.22</td><td align="center" valign="middle" >794,509</td><td align="center" valign="middle" >724,255</td><td align="center" valign="middle" >2,374,342</td><td align="center" valign="middle" >680,119</td><td align="center" valign="middle" >12,790</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >31.34</td><td align="center" valign="middle" >183,319</td><td align="center" valign="middle" >101,556</td><td align="center" valign="middle" >473,369</td><td align="center" valign="middle" >118,062</td><td align="center" valign="middle" >3460</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >18.12</td><td align="center" valign="middle" >99,307</td><td align="center" valign="middle" >83,395</td><td align="center" valign="middle" >255,540</td><td align="center" valign="middle" >50,355</td><td align="center" valign="middle" >2652</td><td align="center" valign="middle" >3</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >46.86</td><td align="center" valign="middle" >304,726</td><td align="center" valign="middle" >173,655</td><td align="center" valign="middle" >734,613</td><td align="center" valign="middle" >150,853</td><td align="center" valign="middle" >4381</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >77.39</td><td align="center" valign="middle" >443,862</td><td align="center" valign="middle" >210,947</td><td align="center" valign="middle" >1,037,584</td><td align="center" valign="middle" >189,878</td><td align="center" valign="middle" >5233</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/3-1240820x51.png" xlink:type="simple"/></inline-formula></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/3-1240820x52.png" xlink:type="simple"/></inline-formula></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/3-1240820x53.png" xlink:type="simple"/></inline-formula></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/3-1240820x54.png" xlink:type="simple"/></inline-formula></td></tr></tbody></table></table-wrap><p>reference. After adding the package to MATLAB path, the following code is used for importing the input data file.</p><disp-formula id="scirp.74080-formula2"><graphic  xlink:href="//html.scirp.org/file/3-1240820x55.png"  xlink:type="simple"/></disp-formula><p>Then, ChineseCities.csv, which has 36 rows representing the name of variables and observations and 8 columns representing the variables and color values, is ready for the analysis.</p><sec id="s4_1"><title>4.1. NMDS and RMDS Analysis</title><p>The RobCoP package supports non-metric MDS analysis, which is used in classic CoPlot analysis, and RMDS, which is used in Robust CoPlot analysis. The first column of ChineseCities.csv file is excluded from the analysis because it contains the observation number. The last column, COLOR, is generated for coloring the resultant MDS embedding in which the numbers are given in a way to sort the profit and taxes (P&amp;T) values at the sixth column of the dataset. The color value assignment is performed according to the defined ranges in <xref ref-type="table" rid="table2">Table 2</xref>. The color column is also omitted from the analysis.</p><disp-formula id="scirp.74080-formula3"><graphic  xlink:href="//html.scirp.org/file/3-1240820x56.png"  xlink:type="simple"/></disp-formula><p>In order to allow comparisons among variables on different scales, RobCoP package standardizes the data. In this example to generate non-metric MDS embedding, “Mean” is selected for standardization type.</p><disp-formula id="scirp.74080-formula4"><graphic  xlink:href="//html.scirp.org/file/3-1240820x57.png"  xlink:type="simple"/></disp-formula><p>The MDS embedding of the dataset requires a set of distances between the observations. Although given example uses city-block distance, various distance metrics can be selected to create distance matrix in the RobCoP package.</p><disp-formula id="scirp.74080-formula5"><graphic  xlink:href="//html.scirp.org/file/3-1240820x58.png"  xlink:type="simple"/></disp-formula><p>For the starting point of the MDS embedding, “PCA” (Torgerson) is selected by using the InStrct.InitMethod field.</p><disp-formula id="scirp.74080-formula6"><graphic  xlink:href="//html.scirp.org/file/3-1240820x59.png"  xlink:type="simple"/></disp-formula><p>To produce non-metric MDS results, following code snippet can be used. To obtain NMDS map, InStrct.DrawGraph field is selected as “MDS”. Similarly, to</p><p>obtain Shepard diagram, it is entered as “Shepard”.</p><disp-formula id="scirp.74080-formula7"><graphic  xlink:href="//html.scirp.org/file/3-1240820x60.png"  xlink:type="simple"/></disp-formula><p>After preparing the input structure, a single command is required to perform analysis.</p><disp-formula id="scirp.74080-formula8"><graphic  xlink:href="//html.scirp.org/file/3-1240820x61.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.74080-formula9"><graphic  xlink:href="//html.scirp.org/file/3-1240820x62.png"  xlink:type="simple"/></disp-formula><p>For the given example, the obtained non-metric MDS embedding of the dataset is shown in <xref ref-type="fig" rid="fig2">Figure 2</xref>. The Shepard diagram of the non-metric MDS analysis is shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>. The Shepard diagram is a scatter plot of the distances between points in the MDS plot against the observed proximities, and ideally the actual proximities versus the predicted proximities fall on a straight line. If the Shepard diagram resembles a step-wise or stair-case function, a degenerate solution may be obtained. The points on the <xref ref-type="fig" rid="fig4">Figure 4</xref> adhere cleanly to a straight line.</p><p>The following code snippet can be used for robust MDS analysis of the same dataset. Only the InStrct.MDSMethod field of the input structure is changed to a</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Color value assignment table according to P&amp;T(6) variable</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >P&amp;T(6) Value Range</th><th align="center" valign="middle" >Assigned Color Code</th></tr></thead><tr><td align="center" valign="middle" >&gt;500,000</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >100,000 - 499,999</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" >0 - 99,999</td><td align="center" valign="middle" >3</td></tr></tbody></table></table-wrap><p>“RMDS” value, and since robust MDS is selected, the InStrct.OutlierRatio value should be given. The outlier ratio for the example is assumed to be 10% [<xref ref-type="bibr" rid="scirp.74080-ref13">13</xref>] . In addition, the output structure also contains an OutStrct.OutlierMatrix field to show which distances are taken as outliers during RMDS analysis. The obtained results are shown in <xref ref-type="fig" rid="fig5">Figure 5</xref> and <xref ref-type="fig" rid="fig4">Figure 4</xref>. Although <xref ref-type="fig" rid="fig2">Figure 2</xref> and <xref ref-type="fig" rid="fig5">Figure 5</xref> seem similar for the given example, as the percentage of outliers in the data</p><p>increases contamination of the predicted proximities in NMDS solution in- creases.</p><disp-formula id="scirp.74080-formula10"><graphic  xlink:href="//html.scirp.org/file/3-1240820x67.png"  xlink:type="simple"/></disp-formula></sec><sec id="s4_2"><title>4.2. Robust CoPlot Analysis</title><p>The maps generated so far are the NMDS and RMDS maps without variables. In this section, a second map, superimposed on the first, consisting of vectors for each variable is generated. The following code snippet provides classical CoPlot analysis. The user needs to know that the data matrix standardization type and computation method of the vector correlation coefficients, InStrct.VecCorr- Method, should be chosen as “Mean” and “PCC” respectively to obtain classical analysis results (see <xref ref-type="fig" rid="fig6">Figure 6</xref>).</p><disp-formula id="scirp.74080-formula11"><graphic  xlink:href="//html.scirp.org/file/3-1240820x68.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.74080-formula12"><graphic  xlink:href="//html.scirp.org/file/3-1240820x69.png"  xlink:type="simple"/></disp-formula><p>The following code snippet enables to draw Robust CoPlot. The data matrix standardization type and the computation method of the vector correlation coefficients have to be specified as “Median” and “MADCC” to obtain robust analysis results (see <xref ref-type="fig" rid="fig7">Figure 7</xref>).</p><disp-formula id="scirp.74080-formula13"><graphic  xlink:href="//html.scirp.org/file/3-1240820x70.png"  xlink:type="simple"/></disp-formula></sec></sec><sec id="s5"><title>5. Conclusion</title><p>In this paper, we present the RobCoP package for performing graphical display method of multivariate data in MATLAB. Our main objective while developing this package was to provide a useful tool for helping the researchers to depict the multivariate data in the presence of outliers. This paper makes an important</p><p>contribution by presenting a new software package that supplies the reader Robust CoPlot analysis as well as robust MDS and classical CoPlot analysis with open source code. Until recently there was no package for robust version of CoPlot analysis and robust MDS. The package presented in this paper addresses these issues. We believe that this package will be used in various areas, especially in applied statistics.</p></sec><sec id="s6"><title>Cite this paper</title><p>Atilgan, Y.K. and Atilgan, E.L. (2017) RobCoP: A Matlab Package for Robust CoPlot Analysis. Open Journal of Statistics, 7, 23-35. https://doi.org/10.4236/ojs.2017.71003</p></sec></body><back><ref-list><title>References</title><ref id="scirp.74080-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Lipshitz, G. and Raveh, A. (1994) Application of the Co-Plot Method in the Study of Socio-Economic Differences between Cities: A Basis for a Differential Development Policy. Urban Studies, 31, 123-135. https://doi.org/10.1080/00420989420080071</mixed-citation></ref><ref id="scirp.74080-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Raveh, A. (2000) Co-Plot: A Graphic Display Method for Geometrical Representations of {MCDM}. European Journal of Operational Research, 125, 670-678.  
https://doi.org/10.1016/S0377-2217(99)00276-3</mixed-citation></ref><ref id="scirp.74080-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Cangur, S., Ercan, I. and Ozkaya, G. (2015) Co-Plot Method: A Research on Tobacco Control in the European Region. Epidemiology, Biostatistics and Public Health, 12, e11480-1–e11480-9.</mixed-citation></ref><ref id="scirp.74080-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Shoval, N. and Raveh, A. (2004) Categorization of Tourist Attractions and the Modeling of Tourist Cities: Based on the Co-Plot Method of Multivariate Analysis. Tourism Management, 25, 741-750. https://doi.org/10.1016/j.tourman.2003.09.005</mixed-citation></ref><ref id="scirp.74080-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Raveh, A. (2000) The Greek Banking System: Reanalysis of Performance. European Journal of Operational Research, 120, 525-534.  
https://doi.org/10.1016/S0377-2217(98)00384-1</mixed-citation></ref><ref id="scirp.74080-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Demirhan, H. and Atilgan, Y.K. (2015) New Horizontal Global Solar Radiation Estimation Models for Turkey Based on Robust Coplot Supported Genetic Programming Technique. Energy Conversion and Management, 106, 1013-1023.  
https://doi.org/10.1016/j.enconman.2015.10.038</mixed-citation></ref><ref id="scirp.74080-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Lipshitz, G. and Raveh, A. (1998) Socio-Economic Differences among Localities: A New Method of Multivariate Analysis. Regional Studies, 32, 747-757.  
https://doi.org/10.1080/00343409850119436</mixed-citation></ref><ref id="scirp.74080-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Mahlberg, B. and Raveh, A. (2012) Co-Plot: A Useful Tool to Detect Outliers in Dea. Social Science Research Network.</mixed-citation></ref><ref id="scirp.74080-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Bravata, D.M., Shojania, K.G., Olkin, I. and Raveh, A. (2008) Coplot: A Tool for Visualizing Multivariate Data in Medicine. Statistics in Medicine, 27, 2234-2247.  
https://doi.org/10.1080/00343409850119436</mixed-citation></ref><ref id="scirp.74080-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Huang, H. and Liao, W. (2012) A Co-Plot-Based Efficiency Measurement to Commercial Banks. Journal of Software, 7, 2247-2251.</mixed-citation></ref><ref id="scirp.74080-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Adler, N. and Raveh, A. (2008) Presenting DEA Graphically. Omega, 36, 715-729.  
https://doi.org/10.1016/j.omega.2006.02.006</mixed-citation></ref><ref id="scirp.74080-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Nath, P., Mukherjee, A. and Pal, M.N. (2001) Identification of Linkage between Strategic Group and Performance of Indian Commercial Banks: A Combined Approach Using DEA and Co-Plot. International Journal of Digital Accounting Research, 1, 125-152. https://doi.org/10.4192/1577-8517-v1_7</mixed-citation></ref><ref id="scirp.74080-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Atilgan, Y.K. (2016) Robust Co-Plot Analysis. Communications in Statistics—Simulation and Computation, 45, 1763-1775.  
https://doi.org/10.1080/03610918.2013.875571</mixed-citation></ref><ref id="scirp.74080-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">The MathWorks Inc. (2016) MATLAB—The Language of Technical Computing, Version 2016a. The MathWorks Inc., Natick.</mixed-citation></ref><ref id="scirp.74080-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Talby, D. (2015) The Visual Co-Plot, Version 5.5.</mixed-citation></ref><ref id="scirp.74080-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Borg, I. and Groenen, P.J. (2005) Modern Multidimensional Scaling: Theory and Applications. Springer Science &amp; Business Media, Berlin.</mixed-citation></ref><ref id="scirp.74080-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Spence, I. and Lewandowsky, S. (1989) Robust Multidimensional Scaling. Psychometrika, 54, 501-513. https://doi.org/10.1007/BF02294632</mixed-citation></ref><ref id="scirp.74080-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Forero, P. and Giannakis, G. (2012) Sparsity-Exploiting Robust Multidimensional Scaling. IEEE Transactions on Signal Processing, 60, 4118-4134.  
https://doi.org/10.1109/TSP.2012.2197617</mixed-citation></ref><ref id="scirp.74080-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Shevlyakov, G. and Smirnov, P. (2011) Robust Estimation of the Correlation Coefficient: An Attempt of Survey. Austrian Journal of Statistics, 40, 147-156.</mixed-citation></ref><ref id="scirp.74080-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Charnes, A., Cooper, W.W. and Li, S. (1989) Using Data Envelopment Analysis to Evaluate Efficiency in the Economic Performance of Chinese Cities. Socio-Economic Planning Sciences, 23, 325-344.  
https://doi.org/10.1016/0038-0121(89)90001-3</mixed-citation></ref></ref-list></back></article>