<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JGIS</journal-id><journal-title-group><journal-title>Journal of Geographic Information System</journal-title></journal-title-group><issn pub-type="epub">2151-1950</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jgis.2012.41006</article-id><article-id pub-id-type="publisher-id">JGIS-17061</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Earth&amp;Environmental Sciences</subject></subj-group></article-categories><title-group><article-title>
 
 
  The Rough Method for Spatial Data Subzone Similarity Measurement
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>eihua</surname><given-names>Liao</given-names></name><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><author-notes><corresp id="cor1">* E-mail:<email>gisliaowh@163.com</email></corresp></author-notes><pub-date pub-type="epub"><day>17</day><month>01</month><year>2012</year></pub-date><volume>04</volume><issue>01</issue><fpage>37</fpage><lpage>45</lpage><history><date date-type="received"><day>September</day>	<month>7,</month>	<year>2011</year></date><date date-type="rev-recd"><day>November</day>	<month>2,</month>	<year>2011</year>	</date><date date-type="accepted"><day>November</day>	<month>28,</month>	<year>2011</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  There are two methods for GIS similarity measurement problem, one is cross-coefficient for GIS attribute similarity measurement, and the other is spatial autocorrelation that is based on spatial location. These methods can not calculate subzone similarity problem based on universal background. The rough measurement based on membership function solved this problem well. In this paper, we used rough sets to measure the similarity of GIS subzone discrete data, and used neighborhood rough sets to calculate continuous data’s upper and lower approximation. We used neighborhood particle to calculate membership function of continuous attribute, then to solve continuous attribute’s subzone similarity measurement problem.
 
</p></abstract><kwd-group><kwd>Subzone; Rough Sets; Neighborhood Rough Sets; Similarity Measurement</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>GIS entity has some spatial relevance in real world. Tober [<xref ref-type="bibr" rid="scirp.17061-ref1">1</xref>] proposed the famous geography first law “The spatial entities are always interrelated, especially, it have more obvious character for closed distance entities”. Cliff [<xref ref-type="bibr" rid="scirp.17061-ref2">2</xref>] put forward spatial autocorrelation concept from this established law, and the concept is the information for a spatial unit having similarity for it’s around units as summarized in Wang [<xref ref-type="bibr" rid="scirp.17061-ref3">3</xref>]. And the spatial autocorrelation has been widely used in many fields, such as regional economy, application ecology, scene analysis, preventive medicine and so on [<xref ref-type="bibr" rid="scirp.17061-ref4">4</xref>]. Anselin [<xref ref-type="bibr" rid="scirp.17061-ref5">5</xref>] found the spatial autocorrelation has two measurement index, that is global index and local index. Global index study the spatial mode for whole region, and it use a single value to reflect the region’s spatial autocorrelation degree. Local index calculate each unit degree of correlation from its neighbor unit for one attribute. And it has widely application domain as a similarity problem tool.</p><p>GIS subzone measurement is actually an uncertainty study problem. Li [<xref ref-type="bibr" rid="scirp.17061-ref6">6</xref>] found the uncertainty problem study works have attracted more and more attention by many study workers since entering the 21st century. There are many mathematic tools for study uncertainty problem, such as fuzzy sets, rough sets, quotient space etc. Rough sets have been widely used in GIS uncertainty study, Pawlak [<xref ref-type="bibr" rid="scirp.17061-ref7">7</xref>] introduced Rough sets theory and discussed in greater detail in Refs [8,9]. It is a technique for dealing with uncertainty and for identifying cause—effect relationships in databases as a form of data mining and database design. It is as summarized in R. Slowinski [<xref ref-type="bibr" rid="scirp.17061-ref10">10</xref>]. Slowinsk found it has also been used for improved information retrieval. Srinivasan [<xref ref-type="bibr" rid="scirp.17061-ref11">11</xref>] and Beaubouef [12,13] found it is also used in uncertainty management in relational databases. Theresa Beaubouef [<xref ref-type="bibr" rid="scirp.17061-ref14">14</xref>] used rough sets to describe the fuzziness, uncertainty, GIS topological relation, 9-intersection model, egg yolk model for GIS entity and GIS data reasoning. The Pawlak rough sets took all the study objects as universe, and used equivalence relation to divide the universe into some exclusive equivalence class, then took it as basic information partial in universe description. For discretionary concept in equivalence space, Hu [<xref ref-type="bibr" rid="scirp.17061-ref15">15</xref>] suggested that Pawlak rough sets took two equivalence class union sets: upper and lower approximation to approach it. But as an effective granular computing model, Pawlak rough sets are suit for dealing with nominal variable and discrete data that because it is based on classic equivalence class and equivalence relation. Then Xie [<xref ref-type="bibr" rid="scirp.17061-ref16">16</xref>] found the researcher took continuous numerical attribute into nominal variable and discrete data with discrete algorithms for rough sets method in processing data. Jensen [<xref ref-type="bibr" rid="scirp.17061-ref17">17</xref>] suggested this transformation inevitably brings the information loss. The compute result is largely rest with the discretization result. To solve this problem, Duboi [<xref ref-type="bibr" rid="scirp.17061-ref18">18</xref>], Hu [<xref ref-type="bibr" rid="scirp.17061-ref19">19</xref>], Yeung [<xref ref-type="bibr" rid="scirp.17061-ref20">20</xref>] et al. [21,22] introduced fuzzy rough sets, similarity relation rough sets model and neighborhood rough sets. Lin [<xref ref-type="bibr" rid="scirp.17061-ref23">23</xref>] put forward neighborhood rough sets model concept, this model took the space neighbor point to granulating universe, and took neighbor as basic information particle, then Lin [<xref ref-type="bibr" rid="scirp.17061-ref23">23</xref>] took it to describe others concepts in approximation space. The math pathfinders have done many study works about rough sets similarity. Wu [<xref ref-type="bibr" rid="scirp.17061-ref24">24</xref>] given three forms of the differences of rough fuzzy set, and discussed their basic properties, they think that the number of conditions for the difference degree of rough fuzzy set should have must be satisfied. Guan [<xref ref-type="bibr" rid="scirp.17061-ref25">25</xref>] defined the concept of rough similarity degree between two rough sets by using fuzzy sets induced by rough sets, and discussed its properties, and compared four kinds of rough similarity degree in the approximation space.</p><p>So it has many studies about similarity measurement for discrete value and continuous value in mathematics. There are two Similarity measurement correlations methods in GIS, one is cross-coefficient, and the other is spatial autocorrelation that based on spatial location. These two methods can not measure similarity of GIS subzone. This paper use rough sets measurement method to measure two subzone’s similarity problem, simultaneously study the subzone’s similarity based on one universe set.</p></sec><sec id="s2"><title>2. Spatial Autocorrelation and Cross-Coefficient</title><sec id="s2_1"><title>2.1. Global Spatial Autocorrelation</title><p>Global spatial autocorrelation is an attribute value description of whole region spatial character. And it estimated global spatial autocorrelation statistic for global Moran’s I and global Geary’s C, to analyze total region spatial correlation and spatial discrepancy. And global Moran’s I is used commonly, it is defined as follows:</p><disp-formula id="scirp.17061-formula128340"><label>(1)</label><graphic position="anchor" xlink:href="6-8401107\75f3d741-cff4-41f1-9494-ff2c469adde7.jpg"  xlink:type="simple"/></disp-formula><p>where x<sub>i</sub> is the observed value for observed spatial cell,</p><p><img src="6-8401107\cd3bf311-756f-4e0b-96e4-34f2abab6e4f.jpg" />is the average value for each observed value, S<sub>0</sub> is the sum of all element spatial weight matrix (W), and it can obtained from the follows formula:</p><disp-formula id="scirp.17061-formula128341"><label>(2)</label><graphic position="anchor" xlink:href="6-8401107\aab9b1bb-b755-46f2-a196-95191086a551.jpg"  xlink:type="simple"/></disp-formula><p><img src="6-8401107\582bbcdc-73b6-4947-820b-464f1dd8c18e.jpg" />is the spatial weighting matrix, and the value of <img src="6-8401107\c24e999f-a10b-459b-9752-66f30d00704c.jpg" /> can obtain from follows formula:</p><disp-formula id="scirp.17061-formula128342"><label>(3)</label><graphic position="anchor" xlink:href="6-8401107\5d6b981b-b531-4f08-b0de-7aa67c71b187.jpg"  xlink:type="simple"/></disp-formula><p>where n is the number for spatial cell. And if the i cell and the j cell are neighborhood, then <img src="6-8401107\16e491ce-ed93-405a-a61f-fbb146c18588.jpg" />= 1, otherwise <img src="6-8401107\bf84f25e-dd72-4c5f-8c34-66908e25a69b.jpg" />= 0. And one cell is neighborhood for itself, namely <img src="6-8401107\5c2b7bf9-6903-4c9d-8589-2782d453fe25.jpg" />= 1. It can use Z test to statistic test its result after computing Moran’s I, it can obtain from follows formula (4):</p><disp-formula id="scirp.17061-formula128343"><label>(4)</label><graphic position="anchor" xlink:href="6-8401107\d50601b6-42a3-4946-8ee5-60ad3ae3ae51.jpg"  xlink:type="simple"/></disp-formula><p>It is frequently took Moran’s I as cross-coefficient, and the value of Moran’s I is between –1 and 1. In given level of significance, when Moran’s I is obviously positive, it indicates each observed value has positive correlation, and higher observed value is cluster to higher observed value, lower observed value is cluster to lower observed value, it presents higher to higher cluster or lower to lower cluster. when Moran’s I is obviously negative, it indicates each observed value has negative correlation, and higher observed value is cluster to lower observed value, it presents dispersed pattern. When the Moran’s I trends to 0, it express that it has no spatial autocorrelation, it is random patterns for spatial observed value.</p><p>Example 1 considering the example seen in <xref ref-type="fig" rid="fig1">Figure 1</xref>, there are nine polygons, the number ID from left to right, top to down is {1, 2, 3,<img src="6-8401107\6f411b47-0636-4b69-9d3c-7b53390b8da9.jpg" /> 9}. The label of each polygon in <xref ref-type="fig" rid="fig1">Figure 1</xref> is the value of each polygon. We can see the value of each polygon is continuous spatial value. Then we can calculate the global Moran’s I value is 0.028508, Z value is 0.636673. So the distribution of <xref ref-type="fig" rid="fig1">Figure 1</xref> is a dispersed and random spatial pattern.</p></sec><sec id="s2_2"><title>2.2. Local Spatial Autocorrelation</title><p>Global Moran’s I is an overall statistic index, and it only illustrated the average degree of region and adjacent region. Local spatial disparities may expand, when the</p><p>whole region express a region’s spatial disparities trend we need to use ESDA local analysis method. Anselin (1994) proposed the local spatial relation index LISA (Local Indicators of Spatial Association), it can show the spatial autocorrelation characteristic for local and each spatial cell. It apportioned global Moran’s I to each region, and the i statistic for each region is:</p><disp-formula id="scirp.17061-formula128344"><label>(5)</label><graphic position="anchor" xlink:href="6-8401107\197830b2-997e-4096-8924-b00ebafc6bdf.jpg"  xlink:type="simple"/></disp-formula><p>where z<sub>i</sub>, z<sub>j</sub> is standardization average value, <img src="6-8401107\318ee1fc-66f8-4c50-bbc3-e18133d82b1c.jpg" />is spatial weighting matrix.</p><p>In given significance level, if I<sub>i</sub> is obviously positive and z<sub>i</sub> is greater than 0, and it indicates that the observed value of position I and neighborhood are relatively higher, it is higher to higher cluster, if it is obviously positive and z<sub>i</sub> is less than 0, and it indicated that the observed value of position I and neighborhood are relatively lower, it is lower to lower cluster, if it is obviously negative and z<sub>i</sub> is greater than 0, and it indicates that the neighborhood value is far lower to position I, it is higher to lower cluster, if it is obviously negative and z<sub>i</sub> is less than 0, and it indicates that the neighborhood value is far higher to position i, it is lower to higher cluster.</p><p>It is weighted average product for observed value of position i and neighborhood. So global Moran’s I and local Moran’s I<sub>i</sub> have follows relation:</p><disp-formula id="scirp.17061-formula128345"><label>(6)</label><graphic position="anchor" xlink:href="6-8401107\1beed7e2-9a77-4f8b-9fe4-1a3fd00906d8.jpg"  xlink:type="simple"/></disp-formula><p>The formal condition of LISA statistic and local Moran’s I<sub>i</sub> is:</p><disp-formula id="scirp.17061-formula128346"><label>(7)</label><graphic position="anchor" xlink:href="6-8401107\23beee5c-2a0a-472b-80e9-9ac7e55ec5ec.jpg"  xlink:type="simple"/></disp-formula><p>We can use Moran scatter plot to describe LISA. All observed value is cross shaft, and all spatial lag value (W<sub>x</sub>) is on ordinate axis. All spatial lag value for each region’s observed value is the weighted average value of neighborhood’s observed value. It concretely defined by standardized spatial weighting matrix. The Moran scatter plot can be divided into four quadrants, it is respectively corresponding to four spatial different region spatial type. The right upper quadrant (HH) is the level for region and its neighborhood are higher, and the spatial disparities degree of both is on the small side. The left upper quadrant (HL) is the region’s level is lower than its neighborhood, and the spatial disparities degree of both are comparatively large. The left lower quadrant (LL) is the spatial level for region and its neighborhood are higher, and the spatial disparities degree of both is on the small side. The right lower quadrant (LH) is the region’s spatial level is higher than its neighborhood, and the spatial disparities degree of both is comparatively large.</p><p>Example 2 we can compute local Moran’s I of <xref ref-type="fig" rid="fig2">Figure 2</xref>, then we can obtain local autocorrelation map, that can be seen in <xref ref-type="fig" rid="fig2">Figure 2</xref>. 1) is low and high cluster, 2) is high and high cluster, 3 is high and low cluster in <xref ref-type="fig" rid="fig2">Figure 2</xref>.</p><p>We can obviously obtain some properties of spatial autocorrelation as below:</p><p>1) Patial autocorrelation can only compute continuous attribute value, and can not compute discrete categorical data.</p><p>2) Spatial autocorrelation can only compute similarity problem of the whole or each unit’s element, and it can not compute for the similarity between subzones that are composed of several units in whole region.</p></sec><sec id="s2_3"><title>2.3. Cross-Coefficient</title><p>The cross-coefficient r is frequently used to measure linear correlation dimension of two variables in statistics, when <img src="6-8401107\016f4b6a-3432-416e-b81e-9804212ef24d.jpg" /> is not all zero and y<sub>i</sub> is not all zero, the formula of cross-coefficient can obtain from follows formula (8):</p><disp-formula id="scirp.17061-formula128347"><label>(8)</label><graphic position="anchor" xlink:href="6-8401107\423208c9-7313-46b5-ba22-1e5646365f87.jpg"  xlink:type="simple"/></disp-formula><p>where r is the cross-coefficient of variables y and x. <img src="6-8401107\b696f3ed-74b9-487a-93a4-1a20311c4cbc.jpg" />and <img src="6-8401107\80dac80b-e2de-475d-801b-a2c86c3f6fcb.jpg" /> are respectively to the average value of order <img src="6-8401107\2fd3b147-7b44-45df-b2fd-a5fa9489e5c5.jpg" /> and<img src="6-8401107\3128023d-4f5b-45df-a593-8f467cd42d14.jpg" />. We can obviously obtain some properties of cross-coefficient as below:</p><p>1) Cross-coefficient can only compute continuous attribute value, and can not compute discrete categorical data.</p><p>2) The length order of <img src="6-8401107\2eb9304e-5555-4689-a45b-c07c06171061.jpg" /> and<img src="6-8401107\9de176d8-57b4-47a6-b161-fecf681447d3.jpg" /> must be the same, if not, it can not compute it.</p></sec></sec><sec id="s3"><title>3. Introduction of Rough Sets Theory</title><sec id="s3_1"><title>3.1. Concept of Rough Sets</title><p>Rough sets theory is a mathematical tool for dealing with uncertainty and vague knowledge. And it is a good technique for dealing with uncertainty and fuzzy of GIS data, it is also a good technique for spatial entity relations. There are many references for studying uncertainty and fuzzy of spatial entity, such as Zhang [26,27].</p><p>Definition 1. Given knowledge base K = (U, R), for each subset <img src="6-8401107\4208a2ab-ac13-4267-a70b-7bff000301ff.jpg" />and an equivalence relation<img src="6-8401107\cd080f1c-ff26-4093-ac43-cea0b2fecf76.jpg" />, we can define two subsets as follows:</p><disp-formula id="scirp.17061-formula128348"><label>(9)</label><graphic position="anchor" xlink:href="6-8401107\290605c8-9cea-4c72-8e54-7202d7ef494c.jpg"  xlink:type="simple"/></disp-formula><p>where, <img src="6-8401107\e4eaae40-9ff7-4fe7-a420-141fbf29a38d.jpg" />, <img src="6-8401107\1aeae2bc-b749-4063-969e-113ae85b9fae.jpg" />are respectively called lower approximation and upper approximation of set X. This definition is Pawlak rough sets. If set R is subset of universe, then definable set R is R precise set. If R is a not definable set, then R is a rough set. If it has a polygon object X seen in <xref ref-type="fig" rid="fig3">Figure 3</xref>, if we used Pawlak describe it, the polygon object X is a fuzzy object in <xref ref-type="fig" rid="fig3">Figure 3</xref>. <img src="6-8401107\f8f7bad0-7fc1-4da2-9cc0-bf41d55f4be5.jpg" />is definitely belongs to X, <img src="6-8401107\a4f0a0f3-3f34-417f-baca-ecc4fc7acb8a.jpg" />is possibly belongs to X.</p><p>Example 3. The classification map of Moran’s I can divide into {{1, 3, 7}{2, 4, 5, 8, 9}{6}} according to equivalence class in fig 2. Now it has a subzone X covering {2, 3, 4, 6}, then we can obtain the lower approximation of subzone X is {6}, upper approximation is universe U. All element’s value must be discrete value when we use Pawlak rough sets partition and compute, but GIS object attribute’s value is continuous value in practice, such as slope, population density and so on. Then we should use neighborhood rough sets to compute continuous attribute value.</p></sec><sec id="s3_2"><title>3.2. GIS Spatial Data Distance Measurement</title><p>Geng (2009) suggested We should measure different attribute’s distance in spatial cluster. d<sub>ij</sub><sub> </sub>is the distance of attribute level X<sub>i</sub><sub> </sub>and X<sub>j</sub>. The frequently used distance formulas are Minkowski distance, Mahalanobis distance, Canberra distance [<xref ref-type="bibr" rid="scirp.17061-ref28">28</xref>]. We used Mahalanobis distance to define distance of two examples:</p><disp-formula id="scirp.17061-formula128349"><label>(10)</label><graphic position="anchor" xlink:href="6-8401107\2976b41b-9980-48f4-bb6c-a7a01ec6b5e8.jpg"  xlink:type="simple"/></disp-formula><p>when q = 1，that is Absolute distance:</p><disp-formula id="scirp.17061-formula128350"><label>(11)</label><graphic position="anchor" xlink:href="6-8401107\e70c1f30-c5cd-40c4-9a6e-2be610029a06.jpg"  xlink:type="simple"/></disp-formula><p>when q = 2，that is Euclidean distance:</p><disp-formula id="scirp.17061-formula128351"><label>(12)</label><graphic position="anchor" xlink:href="6-8401107\4d877eb1-9283-47d0-a3fb-736d42648bd2.jpg"  xlink:type="simple"/></disp-formula><p>when<img src="6-8401107\b9ef8144-c82c-4054-8471-42f31c5ee512.jpg" />，that is Chebyshev distance:</p><disp-formula id="scirp.17061-formula128352"><label>(13)</label><graphic position="anchor" xlink:href="6-8401107\7551ce41-c154-4ecc-868d-817e0b828667.jpg"  xlink:type="simple"/></disp-formula><p>Then, we can obviously see diamond is absolute distance, roundness is Euclidean distance, square is Chebyshev distance in <xref ref-type="fig" rid="fig4">Figure 4</xref>.</p><p>Example 4. Now we consider it has a GIS map level that composed of nine basic units in <xref ref-type="fig" rid="fig5">Figure 5</xref>, B and C stand for different attribute. Then it should use absolute distance for measure distance x<sub>1</sub> and x<sub>2</sub> in attribute B, that we can compute d(x<sub>1</sub>, x<sub>2</sub>) = 0.2. It should use Euclidean distance for measure distance x1 and x<sub>2</sub> in attribute B, C, that we can compute d(x<sub>1</sub>, x<sub>2</sub>) = 0.45. We should dispose source data first in practice, for lack of space, the details will not be dealt with here.</p></sec><sec id="s3_3"><title>3.3. GIS Continuous Data Granulation and Neighborhood Rough Sets</title><p>Li [<xref ref-type="bibr" rid="scirp.17061-ref28">28</xref>] suggested granulation and approximation is the basic problem in rough sets and granular computing. Hu [<xref ref-type="bibr" rid="scirp.17061-ref15">15</xref>] found that Pawlak rough sets are based on the equivalence class for discrete value space, and the universe partition from equivalence class can divide into universe space. But for real number space, the attribute value is continuous, such DEM value etc. Obviously, discrete numerical attributes may cause information loss because the degrees of membership of numerical values to discrete values are not considered. Neighborhood structure and order structure are important structure for real number space, so we should work based on neighborhood structure in this paper.</p><sec id="s3_3_1"><title>3.3.1. Neighborhood Granulation</title><p>There are two methods to define neighborhood, one is defined by the numbers of neighborhood, such as classic k-nearest neighbor methods, the other is defined by distance from one measurement central point to boundary. We used the second method in our work.</p><p>Definition 2. Given a N dimension real number space Ω, we call d is a measurement of R<sup>N</sup>, it usually satisfy follows properties:</p><p>1) d(x<sub>1</sub>, x<sub>2</sub>) ≥ 0, d(x<sub>1</sub>, x<sub>2</sub>) = 0, if and only if x<sub>1</sub> = x<sub>2</sub>，<img src="6-8401107\36cc7a92-444e-4928-8c87-6b59d594557b.jpg" />；</p><p>2) d(x<sub>1</sub>, x<sub>2</sub>) = d(x<sub>2</sub>, x<sub>1</sub>),<img src="6-8401107\ba711b17-5817-4400-b41a-86118cd914a0.jpg" />;</p><p>3) d(x<sub>1</sub>, x<sub>3</sub>) ≤ d(x<sub>1</sub>, x<sub>2</sub>) + d(x<sub>2</sub>, x<sub>3</sub>),<img src="6-8401107\bfbd8d74-5658-4332-815f-0eb5537ffd86.jpg" />.</p><p>Then we called (Ω, d) is real number space. And Euclidean distance is a common measurement tool for real number space.</p><p>Definition 3. Given a non-null limited set U{x<sub>1</sub>, x<sub>2</sub>, x<sub>3</sub>, <img src="6-8401107\2d3dda91-42da-4dbf-b960-36105b42c74c.jpg" /><img src="6-8401107\e9da28ad-c52c-4607-ad78-c57cefcfd373.jpg" />x<sub>n</sub>} in real number space, for every object x<sub>i</sub><sub> </sub>in U, then the δ-neighborhood definition is as follows:</p><disp-formula id="scirp.17061-formula128353"><label>(14)</label><graphic position="anchor" xlink:href="6-8401107\1cc2f449-80d6-4b1a-b59d-e633a3830111.jpg"  xlink:type="simple"/></disp-formula><p>where δ &gt; 0, <img src="6-8401107\64bb9533-b733-4460-92c1-bb937e8e11a6.jpg" />is δ neighborhood information granulation from x<sub>i</sub>, it for short called as x<sub>i </sub>neighborhood granulation.</p><p>From the measurement properties, we can get three properties about neighborhood information granulation:</p><p>1)<img src="6-8401107\c4aeb82e-1c22-4549-adcb-589fc83bd3dd.jpg" />, because of<img src="6-8401107\347f6561-f3ff-4ec4-ba20-98f2abd6315f.jpg" />；</p><p>2) <img src="6-8401107\6dc581ca-cc02-450f-bfdb-7f10a1eedc72.jpg" /></p><p>3) <img src="6-8401107\094e89e8-69a0-42ae-819f-11973c533666.jpg" /></p><p>So Given a measurement space (Ω, d) and a non-null limited set U{x<sub>1</sub>, x<sub>2</sub>, x<sub>3</sub>,<img src="6-8401107\693ce219-52a6-4014-b4b9-266aa114dff4.jpg" /> <img src="6-8401107\3c6632e0-e0ff-4016-aa24-74190637cd4a.jpg" />x<sub>n</sub>}, if δ<sub>1 </sub>≤ δ<sub>2</sub>, then we can get these properties:</p><p>1) <img src="6-8401107\3b120bac-bbc7-44d7-b591-ed209e7d218e.jpg" /></p><p>2) <img src="6-8401107\d09f58d2-a41b-4e05-adef-5727fa7351df.jpg" /></p><p>Obviously, neighborhood relations are a kind of similarity relations, which satisfy reﬂexivity and symmetry properties. Neighborhood relations draw the objects together for similarity or indistinguishability in terms of distances and the samples in the same neighborhood granule are close to each other.</p><p>Example 5. Nine polygons are seen in <xref ref-type="fig" rid="fig2">Figure 2</xref>, U = {x<sub>1</sub>, x<sub>2</sub>, x<sub>3</sub>,<img src="6-8401107\e2391f84-d5e8-4c12-a1a1-8b67a026198c.jpg" /> , x<sub>9</sub>}, and B and C are respectively stand for two attribute level value (such as slope, aspect etc), when we choose value in one dimension attribute, we can use absolute distance. We use f (x, b) to express the value in attribute B for example x , then we can get f (x<sub>1</sub>, b) = 1.6，f (x<sub>2</sub>, b) = 1.8,<img src="6-8401107\f64089cb-bdd1-41fc-888b-10f8e6bf8211.jpg" /> , f (x<sub>9</sub>, b) = 2.1. if we assigned the neighborhood threshold is 0.2, because of |f(x<sub>1</sub>, b) – f(x<sub>2</sub>, b)| = 0.2 ≤ 0.2, then</p><p><img src="6-8401107\8fe4368b-7ac2-4441-aca8-8985fc3d2552.jpg" />. In this case, we can get</p><p><img src="6-8401107\79681e50-1c19-4cf3-b35f-3cb6ce695e84.jpg" />,<img src="6-8401107\8b9c11de-46a4-4cc2-b0d5-046af5fd2c6e.jpg" /> ,</p><p><img src="6-8401107\24bed81f-15e9-438f-87d3-ce7d259e6718.jpg" />.</p><p>when we get value in two dimension attribute, we should use Euclidean distance, we used f (x, b) to express the value for attribute B, C for example x, if the neighborhood threshold is 0.3. Then we can compute each polygon’s neighborhood in two dimension space,</p><p><img src="6-8401107\d0335281-c49f-492b-b120-71b7dddea9ce.jpg" />, <img src="6-8401107\5c2bfcf5-3b59-4db9-adcb-53b59475206b.jpg" />,</p><p><img src="6-8401107\b9c055c6-59df-4988-ad41-405cccde0295.jpg" />, <img src="6-8401107\cfec5fa8-12c4-43d7-b011-d032e11a8032.jpg" />,</p><p><img src="6-8401107\397c24a2-b80b-4377-9e5e-12c0c67a08df.jpg" />, <img src="6-8401107\3c561fe6-402e-4240-80f8-534a457fee85.jpg" />,</p><p><img src="6-8401107\b9081f41-7ce1-4e73-a5fa-9458b6c02563.jpg" />,<img src="6-8401107\c665d35d-8e3f-473f-95b1-b08dd124345f.jpg" />.</p><p>If it has many attributes, we can compute the distance for examples, and computed the neighborhood for examples.</p></sec><sec id="s3_3_2"><title>3.3.2. Neighborhood Approximation</title><p>Definition 4. Given a set of objects U{x<sub>1</sub>, x<sub>2</sub>, x<sub>3</sub>,<img src="6-8401107\cf1b8b2f-a812-45f3-8116-aa010afc4f18.jpg" /> ,x<sub>n</sub>} and a neighborhood relation R, called D = {U, R} is a neighborhood approximation space [<xref ref-type="bibr" rid="scirp.17061-ref29">29</xref>].</p><p>Definition 5. Given D = {U, R} and X ⊆ U. For any X ⊆ U, two subsets of objects, it is called lower and upper approximations of X in D= {U, R}, that are defined as follows:</p><disp-formula id="scirp.17061-formula128354"><label>(15)</label><graphic position="anchor" xlink:href="6-8401107\21cc8cbe-190f-42b7-9bfd-1534533ab95a.jpg"  xlink:type="simple"/></disp-formula><p>Obviously,<img src="6-8401107\67180031-d5a2-4590-abd0-251d4b954eec.jpg" />.The positive region of X<img src="6-8401107\05065ee3-8b3e-41db-8128-3953bb48659d.jpg" />, negative region of X<img src="6-8401107\5187ceb7-8b6f-4f7c-b963-274c070d045e.jpg" /> and boundary region of X in the approximation space are defined as follows:</p><disp-formula id="scirp.17061-formula128355"><label>(16)</label><graphic position="anchor" xlink:href="6-8401107\4322e160-0065-4506-af5c-5ee37a779c99.jpg"  xlink:type="simple"/></disp-formula><p>A sample in the decision system belongs to either the positive region or the boundary region of decision. Therefore, the neighborhood model divides the samples into two subsets: positive region and boundary region. Positive region is the set of samples which can be classified into one of the decision classes without uncertainty, while boundary region is the set of samples which can not be determinately classified. Intuitively, the samples in boundary region are easy to be misclassified. In data acquirement and preprocessing, one usually tries to find a feature space in which the classification task has the least boundary region. It is as summarized in Zhang [<xref ref-type="bibr" rid="scirp.17061-ref26">26</xref>].</p><p>Example 6. We given two sets X = {x<sub>1</sub>, x<sub>2</sub>, x<sub>3</sub>, x<sub>5</sub>, x<sub>7</sub>} and Y={x<sub>2</sub>, x<sub>4</sub>, x<sub>6</sub>} in <xref ref-type="fig" rid="fig5">Figure 5</xref>, one sets stand for a group continuous value. Then we can get pos (X) ={x<sub>1</sub>, x<sub>2</sub>, x<sub>5</sub>}, pos (Y) ={x<sub>6</sub>}, accordingly, we can get the negative region and boundary region for two sets.</p><p>Then we can get a map that shown binary classification in a 2-D numerical space in <xref ref-type="fig" rid="fig6">Figure 6</xref>, it took it as the first example with “&#215;” label, took it as the second example with “+” label. So we can see x<sub>1</sub> is belongs to the lower approximations of the first example, x<sub>3</sub> is belongs to the lower approximations of the second example because of its neighborhood are from the second number, x<sub>2</sub> is boundary example because of its neighborhood is belongs to the first example and the second example too. The definition is according to our intuitive recognition ： 识别；再认；认可；采认for classification problem in real world.</p></sec></sec></sec><sec id="s4"><title>4. Rough Measurement Concept</title><p>Definition 7. U is universe, R is equivalence relation of U, <img src="6-8401107\4f4f2e73-9022-4f33-bc24-2faf56989a52.jpg" />, the rough membership for element <img src="6-8401107\304500f3-4772-46d8-b67e-6ba9bf139796.jpg" /> of set A [<xref ref-type="bibr" rid="scirp.17061-ref30">30</xref>], that are defined as follows:</p><disp-formula id="scirp.17061-formula128356"><label>(17)</label><graphic position="anchor" xlink:href="6-8401107\57c4b69d-f824-4e56-bf01-ee51edab6d06.jpg"  xlink:type="simple"/></disp-formula><p>The rough membership of x in A is equal to rough membership for fuzzy set x in equivalence class <img src="6-8401107\945e1fed-ff32-42db-abad-d2189b6f9717.jpg" /> that weakly contains to A. So we can understand rough membership as a coefficient, it describe inaccuracy for <img src="6-8401107\338bfda6-4174-4feb-80e4-94a5d8bc3930.jpg" /> in A.</p><p>The formula (17) is defined for GIS discrete value by Pawlak rough sets membership, but for a continuous value, we can not get equivalence class easily, and we can get this membership from Definition 8.</p><p>Definition 8. For GIS continuous value, we use neighborhood rough sets definition for continuous value membership, we defined as follows:</p><disp-formula id="scirp.17061-formula128357"><label>(18)</label><graphic position="anchor" xlink:href="6-8401107\2a2d4812-cfa6-4550-b921-210632dc26b3.jpg"  xlink:type="simple"/></disp-formula><p>The rough membership of x in A is equal to rough membership for neighborhood information granulation <img src="6-8401107\7c0c8759-552d-4703-971f-5c04f1d516d4.jpg" /> in equivalence class <img src="6-8401107\79617b30-cdd6-4144-b90e-9ba5bcb1cd63.jpg" /> that weakly contains to A.</p><p>Definition 9. U is universe, R is equivalence relation of U, <img src="6-8401107\43a5f750-0e97-423d-83a1-c32efda00b37.jpg" />, then a fuzzy set can get from A and R, via:</p><disp-formula id="scirp.17061-formula128358"><label>(19)</label><graphic position="anchor" xlink:href="6-8401107\38ebe646-70b6-41aa-ad79-4a9e94872b72.jpg"  xlink:type="simple"/></disp-formula><p>Definition 10. Given universe<img src="6-8401107\9bf29981-46de-4472-890c-3c1e8c0352fd.jpg" />, R is equivalence relation of U, A and B are two rough sets of universe U, <img src="6-8401107\42a9b9ea-0368-4566-8dde-8829f05e47ff.jpg" />, the rough membership about A, B in equivalence relation R is separately <img src="6-8401107\f6690bf0-c199-4485-ada7-2541fc7e9fd4.jpg" /> and <img src="6-8401107\d5d47c8d-2bbc-4fbf-9432-3573f6f0a1ed.jpg" /> (i = 1, 2,<img src="6-8401107\af05c535-7c48-4619-ac9b-ad6468dded10.jpg" /> , n), we can get the membership of A and B in equivalence relation R is separately<img src="6-8401107\ce435184-1559-4d0c-a740-e0c3dd6ff604.jpg" />, that defined as follows:</p><disp-formula id="scirp.17061-formula128359"><label>(20)</label><graphic position="anchor" xlink:href="6-8401107\3c2c46c6-37b3-4b61-a505-3880d62013b0.jpg"  xlink:type="simple"/></disp-formula><p>Then the similarity of set A and B can get from follows formula: [<xref ref-type="bibr" rid="scirp.17061-ref31">31</xref>].</p><disp-formula id="scirp.17061-formula128360"><label>(21)</label><graphic position="anchor" xlink:href="6-8401107\2820df75-7b31-4248-8c5b-5919e73c0c71.jpg"  xlink:type="simple"/></disp-formula><p>We used the formula from Shi [<xref ref-type="bibr" rid="scirp.17061-ref32">32</xref>], it is the similarity formula, defined as follows:</p><disp-formula id="scirp.17061-formula128361"><label>(22)</label><graphic position="anchor" xlink:href="6-8401107\a1ea70c7-24da-4827-9ce5-525956da30d1.jpg"  xlink:type="simple"/></disp-formula><p>Obviously, the higher the similarity of set A and B has, the bigger value<img src="6-8401107\c6be5d8c-ce8a-49b4-b81e-80f9dc15f440.jpg" /> has, vice versa. And it satisfied these properties:</p><p>1)<img src="6-8401107\df8f4d45-4bc3-4af1-ad05-608d1a7df597.jpg" />;</p><p>2)<img src="6-8401107\9531b2a6-f1cb-4a10-b960-f888d4981688.jpg" />;</p><p>3)<img src="6-8401107\16b8f6c9-ab51-47f1-8fe5-036437b44336.jpg" />if and only if<img src="6-8401107\f7f7b9aa-0958-4bc9-ae37-b77af2ca1208.jpg" />, one value is at least 0 for <img src="6-8401107\2de55375-a7ee-40be-973f-972afb2142e5.jpg" />and<img src="6-8401107\8745f885-4817-4adb-83a5-d7e17295471c.jpg" />, and set A and B can not be null at the same time.</p></sec><sec id="s5"><title>5. Case Study</title><p>Considering the example seen in <xref ref-type="fig" rid="fig7">Figure 7</xref>, it has 100 polygons, the number from left to right, top to down is {1, 2, 3,<img src="6-8401107\f76f7dfa-3541-4de9-9cfd-306900c7234c.jpg" /> 100}. Now we have three subzone covering polygons in <xref ref-type="fig" rid="fig7">Figure 7</xref>, that is A, B, C, each subzone covered 16 unit polygons, how to measure these three subzone’s similarity, from membership formula, we can get.</p><p><img src="6-8401107\ff6b34ba-4068-42c5-85df-21f417c826c7.jpg" /></p><p><img src="6-8401107\35294a2a-839a-43dd-8c8b-c271ac1ee328.jpg" /></p><p><img src="6-8401107\1aadec34-23bc-4cce-bdb3-090a91e3d0c3.jpg" /></p><p>Then the similarity of subzone A and B is:</p><p><img src="6-8401107\b6d543ea-3f1b-4c38-ba0a-bc3867819608.jpg" /></p><p>In a similar way, <img src="6-8401107\48e95a54-c95c-47f6-82b6-598a8e83042e.jpg" />,<img src="6-8401107\8375d184-b9f4-4df7-a0b1-e3d78916f2a8.jpg" />. So the similarity for A and B is less than the similarity of A and C, the similarity for B and C is less than the similarity of A and C.</p><p>Considering the example seen in <xref ref-type="fig" rid="fig8">Figure 8</xref>, it has 100 polygons, the number from left to right, top to down is {1, 2, 3,<img src="6-8401107\84f8b806-fe8e-478a-b1a7-bfb5db0dd33a.jpg" /> , 100}. Now we randomly evaluate to every polygon’s continuous value (1 - 100), specific value seen in <xref ref-type="fig" rid="fig8">Figure 8</xref>, we have three subzone covering polygons in <xref ref-type="fig" rid="fig8">Figure 8</xref>, that is A, B, C, each subzone covered 16 unit polygons, how to measure these three subzone’s similarity for continuous value .</p><p>For continuous value in <xref ref-type="fig" rid="fig8">Figure 8</xref>, we used absolute distance formula because it only has one attribute, we give threshold δ = 10 for neighborhood granulation. Then we can get each polygon’s distance from others in turns, and get each polygon’s neighborhood information granulation. Such as, the neighborhood information granulation of polygon 1 is {1, 10, 27, 28, 34, 50, 51, 65, 68, 75, 94, 98, 99, 100}, the rough membership for subzone A is 1/14, the rough membership for subzone B is 2/14, the rough membership for subzone C is 2/14. From continuous value membership formula, we can get</p><p>Then the similarity of subzone A and B is:</p><p><img src="6-8401107\957b3621-adc7-41d0-8534-f020d87ef8b4.jpg" /></p><p>In a similar way, <img src="6-8401107\c5c062b9-8598-448e-9139-b431f8bf4452.jpg" />,<img src="6-8401107\db33911d-6a02-480a-9c04-1c7452b6f3d4.jpg" />. So the similarity for A and C is less than the similarity of A and B, the similarity for B and C is less than the similarity of A and C.</p><p>If used spatial autocorrelation to measure the subzone similarity for above case, we can find it can not measure <xref ref-type="fig" rid="fig7">Figure 7</xref>, because the value is discrete. And the spatial autocorrelation can only compute continuous attribute value, it can not compute for the similarity between subzones that are composed of several units in whole region. The cross-coefficient can not measure <xref ref-type="fig" rid="fig7">Figure 7</xref> too, because the value is discrete. And if the subzone in map is not equal length for continuous value, it can not measure similarity too. The rough measurement based on membership function solved this problem well.</p></sec><sec id="s6"><title>6. Conclusion and Future Work</title><p>This paper used rough membership measure similarity problem for different subzone. Because Moran’s I can only measure universe or each unit’s spatial autocorrelation, it can not measure subzone, so our method can compute GIS subzone similarity based on universe. And for continuous value, we used distance function and neighborhood rough sets to divide continuous value’s upper and lower approximation and classification problem, then we put forward a rough membership function based on neighborhood information granulation. At last, we used rough similarity measurement formula to measure GIS subzone similarity problem, this method can provide a new direction for GIS point group or others’ object group similarity measurement. Our future work should study object group similarity based on different distribution, and for similarity problem based on rough entropy.</p></sec><sec id="s7"><title>7. Acknowledgements</title><p>The author would like to thank the project sponsored by the scientific research foundation of GuangXi University (Grant No.XTZ110584).</p></sec><sec id="s8"><title>REFERENCES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.17061-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">W. R. Tobler, “A Computer Movie Simulating Urban Growth in the Detroit Region,” Economic Geography, Vol. 46, No.2, 1970, pp. 234-240. doi:10.2307/143141</mixed-citation></ref><ref id="scirp.17061-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">A. D. Cliff and J. K. Ord, “Spatial Autocorrelation,” Pion, London, 1973.</mixed-citation></ref><ref id="scirp.17061-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">J. F. Wang, L. F. Li and Y. Ge, “A Theoretic Framework for Spatial Analysis,” Acta Geographica Sinica, Vol. 55 No.1, 2000, pp. 92-103.</mixed-citation></ref><ref id="scirp.17061-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">F. Chen and D. S. Du, “Application of the Integration of Spatial Statistical Analysis with GIS to the Analysis of Regional Economy,” Geomatics and Information Science of Wuhan University, Vol. 27 No. 4, 2002, pp. 391-396.</mixed-citation></ref><ref id="scirp.17061-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">L. Anselin, “Local Indicators of Spatial Association: LI- SA,” Geographical Analysis, Vol. 27, No. 2, 1995, pp. 93-115. doi:10.1111/j.1538-4632.1995.tb00338.x</mixed-citation></ref><ref id="scirp.17061-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">D. Y. Li and C. Y. Liu, “Artificial Intelligence with Uncertainty,” Journal of Software, Vol. 15, No. 11, 2004, pp. 1583-1594.</mixed-citation></ref><ref id="scirp.17061-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Z. Pawlak, ”Rough Sets,” International Journal of Computer and Information Sciences, Vol. 11, 1982, pp. 341- 356.</mixed-citation></ref><ref id="scirp.17061-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Z. Pawlak, “Rough Sets Theoretical Aspects of Reasoning about Data,” Kluwer Academic Publishers, Dordre- cht, 1991.</mixed-citation></ref><ref id="scirp.17061-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Z. Pawlak, “Rough Set Theory and Its Application to Data Analysis,” Cybernetics and Systems, Vol. 29, No. 9, 1998, pp. 661-668. doi:10.1080/019697298125470</mixed-citation></ref><ref id="scirp.17061-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">R. Slowinski, “A generalization of the in Discernibility Relation for Rough Sets Analysis of Quantitative Information,” Decisions in Economics and Finance, Vol. 15, No. 1, 1992, pp. 65-78. doi:10.1007/BF02086527</mixed-citation></ref><ref id="scirp.17061-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">P. Srinivasan, “The Importance of Rough Approximations for Information Retrieval,” International Journal of Man-Machine Studies, Vol. 34, No. 5, 1991, pp. 657-671. 
doi:10.1016/0020-7373(91)90017-2</mixed-citation></ref><ref id="scirp.17061-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">T. Beaubouef, F. Petry and B. Buckles, “Extension of the Relational Database and Its Algebra with Rough Sets Techniques,” Computational Intelligence, Vol. 11, No. 2, 1995, pp. 233-245. 
doi:10.1111/j.1467-8640.1995.tb00030.x</mixed-citation></ref><ref id="scirp.17061-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">X. B. Yang, D. J. Yu and J. Y. Yang, “Dominance-Based Rough Set Approach to Incomplete Interval-Valued Information System,” Data &amp; Knowledge Engineering, Vol. 68, No. 11, 2009, pp. 1331-1347. 
doi:10.1016/j.datak.2009.07.007</mixed-citation></ref><ref id="scirp.17061-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">T. Beaubouef, F. E. Petry and R. Ladner, “Spatial Data Methods and Vague Regions: A Rough Sets Approach,” Applied Soft Computing, Vol. 7, No. 1, 2007, pp. 425-440. 
doi:10.1016/j.asoc.2004.11.003</mixed-citation></ref><ref id="scirp.17061-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Q. H. Hu, D. R. Yu and Z. X. Xie, “Numerical Attribute Reduction Based on Neighborhood Granulation and Rough Approximation,” Journal of Software, Vol. 19, No. 3, 2008, pp. 640-649. doi:10.3724/SP.J.1001.2008.00640</mixed-citation></ref><ref id="scirp.17061-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">H. Xie, H. Z. Cheng and D. X. Niu, “Discretization of Continuous Attributes in Rough Sets Theory Based on Information Entropy,” Chinese Journal of Computers, Vol. 28, No. 9, 2005, pp. 1570-1574.</mixed-citation></ref><ref id="scirp.17061-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">R. Jensen and Q. Shen, “Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough-Based Approaches,” IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 12, 2004, pp. 1457-1471. 
doi:10.1109/TKDE.2004.96</mixed-citation></ref><ref id="scirp.17061-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">D. Dubois and H. Prade, “Rough Fuzzy Sets and Fuzzy Rough Sets,” International Journal of General Systems, Vol. 17, No. 2, 1990, pp. 191-209. 
doi:10.1080/03081079008935107</mixed-citation></ref><ref id="scirp.17061-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Q. H. Hu, D. R. Yu and Z. X. Xie, “Fuzzy Probabilistic Approximation Spaces and Their Information Measures,” IEEE Transactions on Fuzzy Systems, Vol. 14, No. 2, 2006, pp. 191-201. doi:10.1109/TFUZZ.2005.864086</mixed-citation></ref><ref id="scirp.17061-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">D. S. Yeung, D. G. Chen, et al., “On the Generalization of Fuzzy Rough Sets,” IEEE Transactions on Fuzzy Systems, Vol. 13, No. 3, 2005, pp. 343-361. 
doi:10.1109/TFUZZ.2004.841734</mixed-citation></ref><ref id="scirp.17061-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Q. H. Hu, D. R. Yu and Z. X. Xie, “Information-Pr- eserving Hybrid Data Reduction Based on Fuzzy Rough Techniques,” Pattern Recognition Letters, Vol. 27, No. 5, 2006, pp. 414-423. doi:10.1016/j.patrec.2005.09.004</mixed-citation></ref><ref id="scirp.17061-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">R. Slowinski and D. Vanderpooten, “A Generalized Definition of Rough Approximations Based on Similarity,” IEEE Transactions on Knowledge and Data Engineering, Vol. 12, No. 2, 2000, pp. 331-336. 
doi:10.1109/69.842271</mixed-citation></ref><ref id="scirp.17061-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">T. Y. Lin, “Data Mining and Machine Oriented Modeling: A Granular Computing Approach,” Applied Intelligence, Vol. 13, No. 2, 2000, pp. 113-124. 
doi:10.1023/A:1008384328214</mixed-citation></ref><ref id="scirp.17061-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">R. M. Wu and X. H. Zhang, “A Research on the Difference Measures of Rough Fuzzy Sets,” Journal of Southwest University for Nationalities (Natural Science Edition), Vol. 35, No. 6, 2009, pp. 1139-1142.</mixed-citation></ref><ref id="scirp.17061-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Y. Y. Guan and H. K. Wang, “Measures of Rough Similarity between Sets,” Fuzzy Systems and Mathematics, Vol. 20, No. 1, 2006, pp. 134-139.</mixed-citation></ref><ref id="scirp.17061-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">W. X. Zhang, W. Z. Wu and J. Y. Liang, “Rough Set Theory and Method,” Science Press, Beijing, 2005.</mixed-citation></ref><ref id="scirp.17061-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">X. P. Geng, X. C. Du and P. Hu, “Spatial Clustering Method Based on Raster Distance Transform for Extended Objects,” Acta Geodaetica et Cartographica Sinica, Vol. 38, No. 2, 2009, pp. 162-168.</mixed-citation></ref><ref id="scirp.17061-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">X. F. Li and J. Li, “Data Mining and Knowledge Discovery,” Higher Education Press, Beijing, 2003.</mixed-citation></ref><ref id="scirp.17061-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple">Y. Zhou, H. Lin and Y. B. Cui, “The Study under Rough Relation and It’s Neighbor Relation,” Computer Science, Vol. 31, No. 10A, 2004, pp. 61-63.</mixed-citation></ref><ref id="scirp.17061-ref30"><label>30</label><mixed-citation publication-type="other" xlink:type="simple">F. C. Liu, “Similarity Measure and Similarity Direction between Fuzzy Rough Sets,” Computer Engineering and Applications, Vol. 35, 2005, pp. 63-66.</mixed-citation></ref><ref id="scirp.17061-ref31"><label>31</label><mixed-citation publication-type="other" xlink:type="simple">H. K. Wang, Y. Y. Guan and K. Q. Shi, “Measure of Similarity between Rough sets and Its Application,” Computer Engineering and Applications, Vol. 31, 2004, pp. 39-40.</mixed-citation></ref><ref id="scirp.17061-ref32"><label>32</label><mixed-citation publication-type="other" xlink:type="simple">Z. H. Shi and Y. P. Lian, “Measure of Similarity between Rough Sets Based on Inclusion,” Research of Mathematic Teaching-Learning, Vol. 2, 2008, pp. 53-54.</mixed-citation></ref></ref-list></back></article>