<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AM</journal-id><journal-title-group><journal-title>Applied Mathematics</journal-title></journal-title-group><issn pub-type="epub">2152-7385</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/am.2014.521318</article-id><article-id pub-id-type="publisher-id">AM-52221</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject><subject> Engineering</subject><subject> Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  The Influence Function of the Correlation Indexes in a Two-by-Two Table
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Girone</surname><given-names>Giovanni</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Manca</surname><given-names>Fabio</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Marin</surname><given-names>Claudia</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>University of Bari “Aldo Moro”, Bari, Italy</addr-line></aff><author-notes><corresp id="cor1">* E-mail: <email>giovanni.girone@uniba.it</email> (GG); <email>fabio.manca@uniba.it</email> (FM); <email>claudia.marin@uniba.it</email> (CM)</corresp></author-notes><pub-date pub-type="epub"><day>01</day><month>12</month><year>2014</year></pub-date><volume>05</volume><issue>21</issue><fpage>3411</fpage><lpage>3420</lpage><history><date date-type="received"><day>5</day><month>October</month><year>2014</year></date><date date-type="rev-recd"><day>28</day><month>October</month><year>2014</year></date><date date-type="accepted"><day>11</day><month>November</month><year>2014</year></date></history><permissions><copyright-statement>&#169; Copyright 2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  In this paper we examine five indexes of a two-by-two table (Yule’s two indexes, the chi square, the odds ratio and an elementary index) that estimate the correlation coefficient 
  <em>ρ</em> of a bivariate Bernoulli distribution. We derive compact expressions for their influence functions, which quantify the effect of an infinitesimal contamination of the probability of any pair of attributes of a bivariate random variable distributed according to that model. We prove that the only unbiased index is the chi square. To determine which indexes are less sensitive to contamination, we obtain expressions for three summary measures of the influence function: the maximum contamination (gross sensitivity error), the mean square deviation and the variance. Because not all five indexes are unbiased, these results do not allow a definitive assessment of their overall optimality; nevertheless, they make it possible to gauge the overall size of the effect of contamination on the estimation of the parameter
  <em>ρ</em> of the bivariate Bernoulli distribution.
 
</p></abstract><kwd-group><kwd>Two-by-Two Table</kwd><kwd>Influence Function</kwd><kwd>Correlation Indexes</kwd><kwd>Gross Sensitivity Error</kwd><kwd>Mean Square Deviation</kwd><kwd>Asymptotic Variance</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>In this paper we analyze how a minimal contamination of the bivariate Bernoulli distribution influences the values of the indexes measuring association in a two-by-two table, in the setting where these indexes are used to estimate the correlation parameter of that distribution.</p></sec><sec id="s2"><title>2. Bivariate Bernoulli Model</title><p>Suppose that two dichotomous variables, denoted by X and Y, are of interest in a population. Each variable takes the value 1 or 0 according to whether the corresponding attribute is present or absent. The corresponding theoretical model is the bivariate Bernoulli distribution [<xref ref-type="bibr" rid="scirp.52221-ref1">1</xref>], shown in <xref ref-type="table" rid="table1">Table 1</xref>.</p><p>The mean values of the two variables are</p><disp-formula id="scirp.52221-formula182"><label>, (1)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x6.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.52221-formula183"><label>. (2)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x7.png"  xlink:type="simple"/></disp-formula><p>The variances of the two variables are</p><disp-formula id="scirp.52221-formula184"><label>, (3)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x8.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.52221-formula185"><label>. 
(4)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x9.png"  xlink:type="simple"/></disp-formula><p>The covariance between the two variables is</p><disp-formula id="scirp.52221-formula186"><label>. (5)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x10.png"  xlink:type="simple"/></disp-formula><p>The correlation coefficient is</p><disp-formula id="scirp.52221-formula187"><label>(6)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x11.png"  xlink:type="simple"/></disp-formula></sec><sec id="s3"><title>3. Properties of the Correlation Parameter Estimation</title><p>Several indexes, suggested by various authors (Yule, Quetelet and others), are available for estimating the above-mentioned correlation coefficient from a sample. We refer to such indexes as R<sub>1</sub>, R<sub>2</sub>, etc. For these indexes R<sub>h</sub>, all ranging between −1 and +1, we must take into account unbiasedness, i.e.</p><disp-formula id="scirp.52221-formula188"><label>(7)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x12.png"  xlink:type="simple"/></disp-formula><p>efficiency, i.e.</p><disp-formula id="scirp.52221-formula189"><label>, (8)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x13.png"  xlink:type="simple"/></disp-formula><p>and limited sensitivity to small modifications of the model.</p><p>With regard to this last fundamental property, Hampel [<xref ref-type="bibr" rid="scirp.52221-ref2">2</xref>] suggested in 1974 the influence function as a tool for evaluating the effect that a minimal contamination of the model has on the value of an index. 
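In generic notation (a sketch in standard robust-statistics symbols that are assumptions here, not the article's own: T is the index viewed as a functional of the distribution, F the model, δ<sub>x</sub> a unit mass at the point x and ε the contamination weight), this function is usually written as<disp-formula><tex-math>\mathrm{IF}(x; T, F) = \lim_{\varepsilon \to 0^{+}} \frac{T\bigl((1-\varepsilon)F + \varepsilon\,\delta_{x}\bigr) - T(F)}{\varepsilon}</tex-math></disp-formula>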
In our case the model is the bivariate Bernoulli distribution, the parameter is the correlation coefficient <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x14.png" xlink:type="simple"/></inline-formula> and the indexes are those proposed by various authors over time.</p><p>The influence function <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x15.png" xlink:type="simple"/></inline-formula> of the index R is given by</p><disp-formula id="scirp.52221-formula190"><label>, (9)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x16.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x17.png" xlink:type="simple"/></inline-formula> is the index computed for the contaminated bivariate Bernoulli distribution <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x18.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x19.png" xlink:type="simple"/></inline-formula> is the index computed for the non-contaminated bivariate Bernoulli distribution and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x20.png" xlink:type="simple"/></inline-formula> is the weight of the contamination.</p><table-wrap id="table1"><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title>The bivariate Bernoulli distribution</title></caption><table><thead><tr><th align="center" valign="middle" rowspan="2">Attributes of variable X</th><th align="center" valign="middle" colspan="2">Attributes of variable Y</th><th align="center" valign="middle" rowspan="2">Total</th></tr><tr><th align="center" valign="middle">0</th><th align="center" valign="middle">1</th></tr></thead><tbody><tr><td align="center" valign="middle">0</td><td align="center" valign="middle">α</td><td align="center" valign="middle">β</td><td align="center" valign="middle">α + β</td></tr><tr><td align="center" valign="middle">1</td><td align="center" valign="middle">γ</td><td align="center" valign="middle">δ</td><td align="center" valign="middle">γ + δ</td></tr><tr><td align="center" valign="middle">Total</td><td align="center" valign="middle">α + γ</td><td align="center" valign="middle">β + δ</td><td align="center" valign="middle">1</td></tr></tbody></table></table-wrap><p>Such a function measures the effect of an infinitesimal contamination of the model on the value of the correlation index [<xref ref-type="bibr" rid="scirp.52221-ref3">3</xref>]. From now on we denote by a, b, c and d the empirical frequencies of the four cells of the two-by-two table obtained for a sample of n units.</p></sec><sec id="s4"><title>4. Influence Function of the Correlation Indexes</title><sec id="s4_1"><title>4.1. C Index</title><p>Let us first consider the elementary index given by</p><disp-formula id="scirp.52221-formula191"><label>(10)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x21.png"  xlink:type="simple"/></disp-formula><p>A contamination in the cell (0,0) leads to the influence function value</p><disp-formula id="scirp.52221-formula192"><label>(11)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x22.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,1) leads to the influence function value</p><disp-formula id="scirp.52221-formula193"><label>(12)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x23.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating concordance increases the value of the C index by a quantity proportional to the sum of the frequencies of the two discordance cells.</p><p>A contamination in 
the cell (0,1) leads to the influence function value</p><disp-formula id="scirp.52221-formula194"><label>(13)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x24.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,0) leads to the influence function value</p><disp-formula id="scirp.52221-formula195"><label>(14)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x25.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating discordance decreases the value of the C index by a quantity proportional to the sum of the frequencies of the two concordance cells.</p><p>In short, the influence function can be written as</p><disp-formula id="scirp.52221-formula196"><label>, (15)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x26.png"  xlink:type="simple"/></disp-formula><p>in which v, for a concordance cell, is equal to the sum of the discordance frequencies and, for a discordance cell, is equal to the sum of the concordance frequencies with the sign changed. In other words, the influence of a contamination in a concordance cell is directly proportional to the sum of the discordance frequencies, and that of a contamination in a discordance cell is directly proportional to the sum of the concordance frequencies; it is positive for the concordance cells and negative for the discordance cells [<xref ref-type="bibr" rid="scirp.52221-ref4">4</xref>].</p></sec><sec id="s4_2"><title>4.2. 
Yule’s Q Index</title><p>Let us now consider Yule’s 1900 index [<xref ref-type="bibr" rid="scirp.52221-ref5">5</xref>], given by</p><disp-formula id="scirp.52221-formula197"><label>(16)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x27.png"  xlink:type="simple"/></disp-formula><p>A contamination in the cell (0,0) leads to the following value of the influence function</p><disp-formula id="scirp.52221-formula198"><label>(17)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x28.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,1) leads to the value given by</p><disp-formula id="scirp.52221-formula199"><label>(18)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x29.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating concordance increases the value of the index Q by a quantity proportional to the product of the frequencies of the three non-contaminated cells.</p><p>On the other hand, a contamination in the cell (0,1) leads to the value of the influence function given by</p><disp-formula id="scirp.52221-formula200"><label>(19)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x30.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,0) leads to the following value of the influence function</p><disp-formula id="scirp.52221-formula201"><label>(20)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x31.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating discordance decreases the value of the index Q by a quantity proportional to the product of the frequencies of the three non-contaminated cells.</p><p>In short, the influence function can be written as</p><disp-formula 
id="scirp.52221-formula202"><label>(21)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x32.png"  xlink:type="simple"/></disp-formula><p>in which v is the frequency of the contaminated cell, taken with positive sign if the cell is a or d and with negative sign if it is b or c. In other words, the influence of the contamination is inversely proportional to the frequency of the contaminated cell; it is positive for the concordance cells and negative for the discordance cells.</p></sec><sec id="s4_3"><title>4.3. Yule’s Y Index</title><p>Let us now consider the other index, proposed by Yule [<xref ref-type="bibr" rid="scirp.52221-ref6">6</xref>] in 1912,</p><disp-formula id="scirp.52221-formula203"><label>. (22)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x33.png"  xlink:type="simple"/></disp-formula><p>A contamination in the cell (0,0) leads to the following value of the influence function</p><disp-formula id="scirp.52221-formula204"><label>(23)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x34.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,1) leads to the value of the influence function given by</p><disp-formula id="scirp.52221-formula205"><label>(24)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x35.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating concordance increases the value of the index Y by a quantity proportional to the square root of the product of the frequencies of the three non-contaminated cells divided by the square root of the frequency of the contaminated cell.</p><p>A contamination in the cell (0,1) leads to the value of the influence function given by</p><disp-formula id="scirp.52221-formula206"><label>(25)</label><graphic position="anchor" 
xlink:href="http://html.scirp.org/file/12-7402479x36.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,0) leads to the following value of the influence function</p><disp-formula id="scirp.52221-formula207"><label>(26)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x37.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating discordance decreases the value of the index Y by a quantity proportional to the square root of the product of the frequencies of the three non-contaminated cells divided by the square root of the frequency of the contaminated cell.</p><p>In short, the influence function can be written as</p><disp-formula id="scirp.52221-formula208"><label>(27)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x38.png"  xlink:type="simple"/></disp-formula><p>in which v is the frequency of the contaminated cell, taken with positive sign if the cell is a or d and with negative sign if it is b or c. In other words, the influence of the contamination is inversely proportional to the frequency of the contaminated cell; it is positive for the concordance cells and negative for the discordance cells.</p></sec><sec id="s4_4"><title>4.4. 
The Chi Square Index</title><p>Let us examine the chi square index</p><disp-formula id="scirp.52221-formula209"><label>(28)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x39.png"  xlink:type="simple"/></disp-formula><p>A contamination in the cell (0,0) leads to the influence function value</p><disp-formula id="scirp.52221-formula210"><label>(29)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x40.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,1) leads to the influence function value</p><disp-formula id="scirp.52221-formula211"><label>(30)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x41.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating concordance increases the value of the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x42.png" xlink:type="simple"/></inline-formula> by a quantity proportional to the product of the two sums obtained by adding the frequency of the other concordance cell to each of the frequencies of the discordance cells.</p><p>A contamination in the cell (0,1) leads to the influence function value</p><disp-formula id="scirp.52221-formula212"><label>(31)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x43.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,0) leads to the influence function value</p><disp-formula id="scirp.52221-formula213"><label>(32)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x44.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating discordance decreases the value of the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x45.png" xlink:type="simple"/></inline-formula> by a quantity proportional to the product of the two sums obtained by adding the frequency of the other discordance cell to each of the frequencies of the concordance cells.</p><p>It is not possible to give a single expression of the influence function, as is done for the other indexes, because the expressions for contaminated concordance cells differ from those for contaminated discordance cells.</p></sec><sec id="s4_5"><title>4.5. Odds Ratio, θ</title><p>Let us now examine the odds ratio index</p><disp-formula id="scirp.52221-formula214"><label>(33)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x46.png"  xlink:type="simple"/></disp-formula><p>A contamination in the cell (0,0) leads to the influence function value</p><disp-formula id="scirp.52221-formula215"><label>, (34)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x47.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,1) leads to the influence function value</p><disp-formula id="scirp.52221-formula216"><label>(35)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x48.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating concordance increases the value of the index <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x49.png" xlink:type="simple"/></inline-formula> by a quantity that is proportional to the frequency of the other concordance cell and inversely proportional to the product of the frequencies of the discordance cells.</p><p>A contamination in the cell (0,1) leads to the influence function value</p><disp-formula id="scirp.52221-formula217"><label>(36)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x50.png"  xlink:type="simple"/></disp-formula><p>while a contamination in the cell (1,0) leads to the influence function value</p><disp-formula id="scirp.52221-formula218"><label>(37)</label><graphic position="anchor" 
xlink:href="http://html.scirp.org/file/12-7402479x51.png"  xlink:type="simple"/></disp-formula><p>That is, a contamination in one of the two cells indicating discordance decreases the value of the index <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x52.png" xlink:type="simple"/></inline-formula> by a quantity that is proportional to the product of the frequencies of the concordance cells and inversely proportional to the square of the frequency of the contaminated cell multiplied by the frequency of the other discordance cell.</p><p>In short, the influence function can be written as</p><disp-formula id="scirp.52221-formula219"><label>(38)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x53.png"  xlink:type="simple"/></disp-formula><p>in which v is equal to the frequency of the contaminated cell, taken with positive sign for a contamination in a concordance cell and with negative sign for a contamination in a discordance cell.</p></sec></sec><sec id="s5"><title>5. Unbiasedness of the Indexes</title><p>Recall that an index <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x54.png" xlink:type="simple"/></inline-formula> is unbiased if</p><disp-formula id="scirp.52221-formula220"><label>(39)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x55.png"  xlink:type="simple"/></disp-formula><p>Let us now examine the unbiasedness of each index.</p><sec id="s5_1"><title>5.1. 
The Chi Square Index</title><p>The index</p><disp-formula id="scirp.52221-formula221"><label>(40)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x56.png"  xlink:type="simple"/></disp-formula><p>has the mean</p><disp-formula id="scirp.52221-formula222"><label>(41)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x57.png"  xlink:type="simple"/></disp-formula><p>and therefore it is unbiased.</p></sec><sec id="s5_2"><title>5.2. Other Indexes</title><p>Indexes Q, Y, θ e C are biased.</p><p>It has to be said that 3 of the considered indexes (Q, Y and θ) are functionally related, as it is shown below:</p><p><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula>for <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula> 
for<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic 
xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x64.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x64.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x65.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" 
xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x64.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x65.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x66.png" xlink:type="simple"/></inline-formula>, for <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x64.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x65.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x66.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x67.png" xlink:type="simple"/></inline-formula> and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic 
xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x64.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x65.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x66.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x67.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x68.png" xlink:type="simple"/></inline-formula>, for<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x60.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x61.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x62.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x63.png" 
xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x64.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x65.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x66.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x67.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x68.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x69.png" xlink:type="simple"/></inline-formula>.</p><p>The other two indexes (C and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x70.png" xlink:type="simple"/></inline-formula>) cannot be expressed as functions of each other or of the above-mentioned ones. The five indexes estimate functions of the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x70.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x71.png" xlink:type="simple"/></inline-formula> parameter. More precisely, four of these indexes (C, Q, Y and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x70.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x71.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x72.png" xlink:type="simple"/></inline-formula>) are estimators of increasing functions of this parameter; in particular, at the points −1, 0 and +1 these functions coincide with the argument. 
So the index <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x70.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x71.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x72.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/12-7402479x73.png" xlink:type="simple"/></inline-formula> can easily be led back to the two Yule’s indexes, thus recovering their characteristics.</p></sec></sec><sec id="s6"><title>6. Influences of the Indexes</title><p>Since the effects of contamination in the various cells balance one another, it is necessary to evaluate their overall influence regardless of the sign. This can be done by considering the maximum of the absolute values of the influence, their mean absolute deviation, or their variance [<xref ref-type="bibr" rid="scirp.52221-ref7">7</xref>].</p><sec id="s6_1"><title>6.1. Maximum of the Absolute Values of the Influence Function (Gross-Error Sensitivity)</title><sec id="s6_1_1"><title>6.1.1. C Index</title><p>As, regardless of the sign, the influence function is equal to</p><disp-formula id="scirp.52221-formula223"><label>(42)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x74.png"  xlink:type="simple"/></disp-formula><p>the maximum of the influence function is therefore</p><disp-formula id="scirp.52221-formula224"><label>(43)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x75.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_1_2"><title>6.1.2. 
Yule’s Q Index</title><p>As, regardless of the sign, the influence function is equal to</p><disp-formula id="scirp.52221-formula225"><label>(44)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x76.png"  xlink:type="simple"/></disp-formula><p>in which v is one of the four frequencies of the table, the maximum of the influence function is obtained for min(v); it is therefore</p><disp-formula id="scirp.52221-formula226"><label>(45)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x77.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_1_3"><title>6.1.3. Yule’s Y Index</title><p>As, regardless of the sign, the influence function is equal to</p><disp-formula id="scirp.52221-formula227"><label>(46)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x78.png"  xlink:type="simple"/></disp-formula><p>in which v is one of the four frequencies of the table, the maximum of the influence function is obtained for min(v); it is therefore</p><disp-formula id="scirp.52221-formula228"><label>(47)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x79.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_1_4"><title>6.1.4. Chi Square Index</title><p>An empirical analysis shows that the maximum absolute value of the influence function is attained at the minimum frequency. 
Thus,</p><disp-formula id="scirp.52221-formula229"><label>(48)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x80.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.52221-formula230"><label>(49)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x81.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.52221-formula231"><label>(50)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x82.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.52221-formula232"><label>(51)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x83.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_1_5"><title>6.1.5. Odds Ratio</title><p>As, regardless of the sign, the influence function is equal to</p><disp-formula id="scirp.52221-formula233"><label>(52)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x84.png"  xlink:type="simple"/></disp-formula><p>in which v is one of the four frequencies of the table, the maximum of the influence function is obtained for min(v); it is therefore</p><disp-formula id="scirp.52221-formula234"><label>(53)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x85.png"  xlink:type="simple"/></disp-formula></sec></sec><sec id="s6_2"><title>6.2. Variability of Influence Functions: Mean Absolute Deviation</title><sec id="s6_2_1"><title>6.2.1. C Index</title><p>A few algebraic steps allow us to obtain</p><disp-formula id="scirp.52221-formula235"><label>(54)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x86.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_2_2"><title>6.2.2. 
Yule’s Q Index</title><disp-formula id="scirp.52221-formula236"><label>(55)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x87.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_2_3"><title>6.2.3. Yule’s Y Index</title><disp-formula id="scirp.52221-formula237"><label>(56)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x88.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_2_4"><title>6.2.4. Chi Square Index</title><disp-formula id="scirp.52221-formula238"><label>(57)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x89.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_2_5"><title>6.2.5. Odds Ratio</title><disp-formula id="scirp.52221-formula239"><label>(58)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x90.png"  xlink:type="simple"/></disp-formula><p>It can be seen that, for all indexes, the mean absolute deviation is symmetric in the concordant frequencies and in the discordant frequencies.</p></sec></sec><sec id="s6_3"><title>6.3. Variability of Influence Functions: Asymptotic Variance (A.S.V.)</title><sec id="s6_3_1"><title>6.3.1. C Index</title><p>Let us consider the asymptotic variance of the indexes. A few algebraic steps lead us to the following expression</p><disp-formula id="scirp.52221-formula240"><label>(59)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x91.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_3_2"><title>6.3.2. Yule’s Q Index</title><disp-formula id="scirp.52221-formula241"><label>(60)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x92.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_3_3"><title>6.3.3. 
Yule’s Y Index</title><disp-formula id="scirp.52221-formula242"><label>(61)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x93.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_3_4"><title>6.3.4. Chi Square Index</title><disp-formula id="scirp.52221-formula243"><label>(62)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x94.png"  xlink:type="simple"/></disp-formula></sec><sec id="s6_3_5"><title>6.3.5. Odds Ratio</title><disp-formula id="scirp.52221-formula244"><label>(63)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/12-7402479x95.png"  xlink:type="simple"/></disp-formula><p>It can be seen that the asymptotic variance is likewise a symmetric function of the concordant and discordant frequencies.</p></sec></sec></sec><sec id="s7"><title>7. Example</title><p>Let us consider a practical example in which 1071 persons are classified according to two dichotomous characteristics, “does he/she smoke?” and “is he/she suffering from bronchitis?”, both with a yes or no response (see <xref ref-type="table" rid="table2">Table 2</xref>).</p><p>Of the 1071 cases, 135 smoke and have bronchitis and 547 neither smoke nor have bronchitis.</p><p>Among the four indexes whose values range between −1 and +1, the least sensitive to contamination are the C and chi square indexes, whereas the most sensitive are Yule’s indexes Q and Y. 
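The ordering just described can be checked numerically on the data of Table 2. The following Python sketch is only illustrative: it assumes the usual textbook definitions of Yule’s Q and Y, of the odds ratio and of the signed chi square index (taken here as the phi coefficient), and it approximates the influence of each cell by a finite difference with a small contamination eps instead of using the closed-form expressions given in Section 6. The function names, the cell labels a, b, c, d and the value of eps are assumptions and not the authors’ notation; the C index is omitted because its definition is not restated in this section.<preformat><![CDATA[
```python
import math

# Cell probabilities are ordered (a, b, c, d):
# a = smoke / bronchitis,    b = smoke / no bronchitis,
# c = no smoke / bronchitis, d = no smoke / no bronchitis.

def yule_q(p):
    a, b, c, d = p
    return (a * d - b * c) / (a * d + b * c)

def yule_y(p):
    a, b, c, d = p
    s, t = math.sqrt(a * d), math.sqrt(b * c)
    return (s - t) / (s + t)

def phi(p):
    # Assumed form of the chi square index: signed square root of chi^2 / n.
    a, b, c, d = p
    return (a * d - b * c) / math.sqrt((a + b) * (c + d) * (a + c) * (b + d))

def odds_ratio(p):
    a, b, c, d = p
    return (a * d) / (b * c)

def influence(index, p, eps=1e-6):
    """Finite-difference influence of a point contamination in each cell."""
    base = index(p)
    values = []
    for k in range(4):
        # Mix the model with a unit mass on cell k: (1 - eps) * p + eps * e_k.
        q = [(1 - eps) * x for x in p]
        q[k] += eps
        values.append((index(q) - base) / eps)
    return values

counts = [135, 287, 102, 547]  # cells of Table 2
n = sum(counts)
p = [x / n for x in counts]

indexes = {"Q": yule_q, "Y": yule_y, "phi": phi, "odds ratio": odds_ratio}
ges = {}  # maximum absolute influence over the four cells, per index
for name, index in indexes.items():
    infl = influence(index, p)
    ges[name] = max(abs(v) for v in infl)
    print(f"{name}: value = {index(p):.4f}, max |influence| = {ges[name]:.3f}")
```
]]></preformat>Under these assumptions the sketch prints, for each index, its value on Table 2 and the largest absolute influence over the four cells, so that the indexes can be compared directly. 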
The greater sensitivity of the odds ratio is due to the fact that this index measures a function of the correlation of the model that ranges from 1 to &#8734;.</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title>Smoke versus bronchitis</title></caption><table><thead><tr><th align="center" valign="middle" rowspan="2">Smoke</th><th align="center" valign="middle" colspan="2">Bronchitis</th><th align="center" valign="middle" rowspan="2">Total</th></tr><tr><th align="center" valign="middle">Yes</th><th align="center" valign="middle">No</th></tr></thead><tbody><tr><td align="center" valign="middle" >Yes</td><td align="center" valign="middle" >135</td><td align="center" valign="middle" >287</td><td align="center" valign="middle" >422</td></tr><tr><td align="center" valign="middle" >No</td><td align="center" valign="middle" >102</td><td align="center" valign="middle" >547</td><td align="center" valign="middle" >649</td></tr><tr><td align="center" valign="middle" >Total</td><td align="center" valign="middle" >237</td><td align="center" valign="middle" >834</td><td align="center" valign="middle" >1071</td></tr></tbody></table></table-wrap><p>Source: Survey at the University Hospital of Bari, Department of Pulmonology.</p></sec><sec id="s8"><title>8. Conclusions</title><p>In this paper we analyzed the indexes of a two-by-two table that allow the estimation of the correlation coefficient ρ in the bivariate Bernoulli model. More precisely, we considered the two Yule’s indexes, the chi square, the odds ratio and a further elementary index. 
We obtained, for these indexes, compact expressions of the influence functions, which quantify the effect of an infinitesimal contamination of the probability of any pair of attributes of the bivariate random variable distributed according to the above-mentioned model.</p><p>In order to determine which indexes are less sensitive to contamination, we obtained the expressions of three summary measures of the influence function, specifically the maximum contamination (gross-error sensitivity), the mean absolute deviation and the variance. Although these expressions do not allow a definitive assessment of the overall optimality of the five indexes considered, since not all of them are unbiased, they nevertheless make it possible to appreciate the overall magnitude of the effect of contamination on the estimation of the parameter ρ of the bivariate Bernoulli model.</p></sec><sec id="s9"><title>NOTES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.52221-ref1"><label>1</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Barnard</surname><given-names>G.A.</given-names></name>, <etal>et al</etal>. (<year>1981</year>) <article-title>Two by Two (2 × 2) Tables</article-title>. <source>Encyclopedia of Statistical Sciences</source>, <volume>9</volume>, <fpage>367</fpage>-<lpage>372</lpage>.</mixed-citation></ref><ref id="scirp.52221-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Hampel, F.R. (1974) The Influence Curve and Its Role in Robust Estimation. Journal of the American Statistical Association, 69, 383-393. http://dx.doi.org/10.1080/01621459.1974.10482962</mixed-citation></ref><ref id="scirp.52221-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Kendall, M.G. and Stuart, A. (1977) The Advanced Theory of Statistics. Vol. 2, C. 
Griffin, London, 566-571.</mixed-citation></ref><ref id="scirp.52221-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Pearson, K. (1904) On the Theory of Contingency and Its Relation to Association and Normal Correlation. Biometric Series, Drapers’ Co. Memoirs, London.</mixed-citation></ref><ref id="scirp.52221-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Yule, G.U. (1900) On the Association of Attributes in Statistics. Philosophical Transactions of the Royal Society of London, Series A, 194, 257. 
http://dx.doi.org/10.1098/rsta.1900.0019</mixed-citation></ref><ref id="scirp.52221-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Yule, G.U. (1912) On the Methods of Measuring Association between Two Attributes. Journal of the Royal Statistical Society, 75, 579. http://dx.doi.org/10.2307/2340126</mixed-citation></ref><ref id="scirp.52221-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Yule, G.U. and Kendall, M.G. (1958) An Introduction to the Theory of Statistics. C. Griffin, London, 271-272.</mixed-citation></ref></ref-list></back></article>