<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OALibJ</journal-id><journal-title-group><journal-title>Open Access Library Journal</journal-title></journal-title-group><issn pub-type="epub">2333-9705</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/oalib.1107183</article-id><article-id pub-id-type="publisher-id">OALibJ-107466</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject><subject> Business&amp;Economics</subject><subject> Chemistry&amp;Materials Science</subject><subject> Computer Science&amp;Communications</subject><subject> Earth&amp;Environmental Sciences</subject><subject> Engineering</subject><subject> Medicine&amp;Healthcare</subject><subject> Physics&amp;Mathematics</subject><subject> Social Sciences&amp;Humanities</subject></subj-group></article-categories><title-group><article-title>
 
 
  The Behaviour of the Dispersion Matrix of the Information Matrix Test under the Wrong Logistic Regression Model
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Nuri</surname><given-names>H. Salem Badi</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Faculty of Science, Statistical Department, University of Benghazi, Benghazi, Libya</addr-line></aff><pub-date pub-type="epub"><day>01</day><month>02</month><year>2021</year></pub-date><volume>08</volume><issue>02</issue><fpage>1</fpage><lpage>12</lpage><history><date date-type="received"><day>26,</day>	<month>January</month>	<year>2021</year></date><date date-type="rev-recd"><day>23,</day>	<month>February</month>	<year>2021</year>	</date><date date-type="accepted"><day>26,</day>	<month>February</month>	<year>2021</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  
    The Information Matrix Tests (IMT) considers as one of the important global goodness of fit test. The IMT provides a unified framework for specification goodness of fit tests for a wide variety of distribution, multivariate or univariate, discrete or continuous. Many researchers discussed the IMT in cases of the outcome covariate is a continuous variable which reported it has reasonable behaviour. This article considers using IMT as a goodness of fit test for the logistic regression mode, to investigate the behaviour of this statistic under the wrong model. Moreover, we are interested to examine the behaviour of the dispersion matrix under wrong logistic model and compute alternative formula of variance, empirical variance of IMT and examine it by simulation. 
  
 
</p></abstract><kwd-group><kwd>Logistic Regression Model</kwd><kwd> Goodness of Fit Test</kwd><kwd> Information Matrix Test</kwd><kwd> Estimation of Parameters</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The IMT is a test for general misspecification, produced by [<xref ref-type="bibr" rid="scirp.107466-ref1">1</xref>] who pointed out that the properties of the Maximum likelihood estimator and the information matrix can be exploited to yield a family of useful tests for model mis-specification. The idea of the IMT is to compare two different estimators of the information matrix to assess model fit. The IMT is based on the information matrix equality that obtains when the model specification is correct. This equality implies the asymptotic equivalence of the Hessian and the score forms of Fisher’s information matrix [<xref ref-type="bibr" rid="scirp.107466-ref2">2</xref>]. As [<xref ref-type="bibr" rid="scirp.107466-ref1">1</xref>], points out, the IMT is designed to detect the failure of this equality and the failure implies the model misspecification. [<xref ref-type="bibr" rid="scirp.107466-ref3">3</xref>] discussed the information matrix test and showed that it is useful with binary data models. Many researchers, [<xref ref-type="bibr" rid="scirp.107466-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.107466-ref5">5</xref>] and [<xref ref-type="bibr" rid="scirp.107466-ref6">6</xref>] pointed out the behaviour of the asymptotic distribution of IMT statistic and dispersion matrix. The idea of the</p><p>information matrix test is to compare E ( − ∂ 2 l ∂ θ ∂ θ T ) and E ( ∂ l ∂ θ ∂ l ∂ θ T ) , as these</p><p>differ when the model is mis-specified but not when the model is correct. [<xref ref-type="bibr" rid="scirp.107466-ref7">7</xref>], pointed out, can be estimated the covariance matrix of IMT, dependent upon the IMT of [<xref ref-type="bibr" rid="scirp.107466-ref1">1</xref>], which can be estimated without the computation of analytic third derivatives of the density function. [<xref ref-type="bibr" rid="scirp.107466-ref4">4</xref>], discussed that, the IMT is sensitive to non-normality. Moreover, he proposed a simple computation procedure which employs the Outer Product of the Gradient (OPG) covariance matrix estimator of IMT statistic. However, [<xref ref-type="bibr" rid="scirp.107466-ref5">5</xref>] argue that, such a procedure maybe give unreliable inferences, related to the stochastic nature of the covariance matrix estimator which uses high sample moments to estimate high population moments. [<xref ref-type="bibr" rid="scirp.107466-ref6">6</xref>] purposed a simple calculation procedure for the test statistic, for general binary data models, which employs the ML covariance matrix estimator instead the OPG estimator. Moreover, [<xref ref-type="bibr" rid="scirp.107466-ref8">8</xref>], computed and examined IMT and found it had good power for logistic model.</p><p>Basic Idea of the IMT</p><p>Let us consider the density function f ( x i , θ ) for individual observation and the data are independent, identically distribution so we have</p><p>∫ f ( x | θ ) d x = 1</p><p>and we consider l ( θ ) = log f ( x , θ ) to be the logarithm of a density function of x dependent upon p parameters θ , so the log-likelihood function in this case is</p><p>l n ( θ ) = ∑ i = 1 n log f ( x i , θ )</p><p>Now, as we defined the idea of the IMT to compare two different matrix of expected the first and second partial derivatives of the l n ( θ ) , we have</p><p>∂ l ∂ θ = ∫ ∂ f ( x | θ ) ∂ θ d x = ∫ ∂ log f ( x | θ ) ∂ θ f ( x | θ ) d x = E ( ∂ log ( f ( x | θ ) ) ∂ θ ) = 0 (1)</p><p>So, according to the ML method, we have</p><p>E ( ∂ l ∂ θ ) = 0.</p><p>Differentiating (1) again we get</p><p>0 = ∫ ∂ 2 log f ( x | θ ) ∂ θ ∂ θ T f ( x | θ ) d x + ∫ ∂ log f ( x | θ ) ∂ θ ∂ log f ( x | θ ) ∂ θ T f ( x | θ ) d x (2)</p><p>So</p><p>E ( ∂ 2 l ∂ θ ∂ θ T ) + E ( ∂ l ∂ θ ∂ l ∂ θ T ) = 0. (3)</p><p>When the model is mis-specified, the above quantity will be not necessarily equal zero.</p><p>Asymptotic Distribution of θ ^</p><p>The asymptotic distribution of estimated parameters and the behaviour of the MLE under the wrong model discussed by [<xref ref-type="bibr" rid="scirp.107466-ref9">9</xref>] and more investigated considered by [<xref ref-type="bibr" rid="scirp.107466-ref10">10</xref>]. [<xref ref-type="bibr" rid="scirp.107466-ref11">11</xref>], pointed out the estimation the parameters of a given regression model. In the limit for each value of the parameter vector θ ,</p><p>n − 1 l n ( θ ) → ∫ g ( Y ) log f ( Y | θ ) d Y = E ( log f ( Y | θ ) )</p><p>where g ( Y ) denoted to the true model and f ( Y | θ ) is the fitted model. Also, consider the Kullback-Leibler divergence (KL) from the true to the approximating model conditional on X, under the wrong model. In this case θ ^ → θ * , where θ * is the least false value (LF). Note that the least false value θ * minimizes the KL divergence, because the derivative of the KL is</p><p>E ( ∂ log f ( Y , θ ) ∂ θ ) = ∫ g ( Y ) ∂ log f ( Y , θ ) ∂ θ d Y = 0.</p><p>Also, if we need define</p><p>J = − E ( ∂ 2 l ∂ θ ∂ θ T )</p><p>and</p><p>K = var ( ∂ log f ( Y , θ ) ∂ θ ) = E ( ∂ l ∂ θ ∂ l ∂ θ T )</p><p>these matrixes are identical when g ( Y ) = ∂ log f ( Y , θ ) ∂ θ for all Y. As explained in [<xref ref-type="bibr" rid="scirp.107466-ref11">11</xref>], the distribution of the θ ^ , in this case from the central limit theorem there is convergence in distribution</p><p>n U &#175; n → U ′ ~ N p ( 0, K )</p><p>where, U &#175; = n − 1 ∑ i = 1 n     u ( Y i , θ * ) , which is leads to</p><p>n ( θ ^ − θ * ) → J − 1 U ′ ~ N p ( 0, J − 1 K J − 1 ) .</p><p>So, we can say, the asymptotic MLE distribution under the null hypotheses H<sub>0</sub>, in this case</p><p>n θ ^ ~ N ( θ 0 , J − 1 )</p><p>where, θ 0 is the true value. And the asymptotic distribution of θ ^ under alternative hypotheses H<sub>1</sub> is</p><p>n θ ^ ~ N ( θ * , J − 1 K J − 1 )</p><p>So, that is meaning ( J = K ) if and only if when fitted the correct model (i.e. under H<sub>0</sub>).</p></sec><sec id="s2"><title>2. The IMT under Missing Covariates for Logistic Regression Model</title><p>In this part, we apply the procedure of the IMT statistic under missing covariates for a logistic regression model. If X i is a p-dimensional vector of covariates draw from normal distribution and Y i is binary with</p><p>P ( Y i = 1 | X i ) = expit ( α + β T X i ) . (4)</p><p>In the following we treat the simple case where the fitted model is</p><p>P ( Y i = 1 | X i ) = expit ( α + β 1 X 1 i ) (5)</p><p>for a scalar X 1 and that the true model has</p><p>P ( Y i = 1 | X i ) = expit ( α + β 1 X 1 i + β 2 X 2 i ) , (6)</p><p>where X 2 is also a scalar. We have the log-likelihood function contribution for the i<sup>th</sup> element ( Y i , X i ) is</p><p>l ( Y i , X i ) = Y i ( α + β T X i ) − log ( 1 + exp ( α + β T X i ) ) (7)</p><p>and so,</p><p>∂ l i ∂ α = Y i − π i ;   ∂ l i ∂ β 1 = ( Y i − π i ) X 1 i</p><p>and note that we only consider fitting the model with X 1 , even if the true model also includes X 2 (i.e. β 2 ≠ 0 ). From this we get:</p><p>∂ 2 l i ∂ θ ∂ θ T = [ − π i ( 1 − π i ) − π i ( 1 − π i ) X i − π i ( 1 − π i ) X i − π i ( 1 − π i ) X i 2 ]</p><p>Also,</p><p>∂ l i ∂ θ ∂ l i ∂ θ T = [ ( Y i − π i ) 2 ( Y i − π i ) 2 X i ( Y i − π i ) 2 X i ( Y i − π i ) 2 X i 2 ]</p><p>using,</p><p>( Y i − π i ) 2 − π i ( 1 − π i ) = ( Y i − π i ) ( 1 − 2 π i ) ,</p><p>as Y i 2 is Y i , and so we get that</p><p>d g ( y i , θ ) = ( Y i − π i ) ( 1 − 2 π i ) [ 1 X i X i 2 ] . (8)</p></sec><sec id="s3"><title>3. An Alternative Formulae of Variance</title><p>In this part we are interested to find a formulae of the variance of d statistic, even when the model is mis-specified. To perform the IMT we need to find the mean and variance of</p><p>T = 1 n ∑ i = 1 n     d g i</p><p>Under H<sub>0</sub> E ( d g i ) = 0 , and so the IMT could be written as</p><p>T T var ( T ) − 1 T</p><p>which will have a χ 2 -distribution on rank ( var ( T ) ) d.f. as T is asymptotically Normal. However, the test statistic has to be evaluated at the MLE θ ^ and this introduces a complication. The MLE θ ^ is the solution to</p><p>S = 1 n ∇ l = 1 n ∑ i = 1 n     ∇ l i = 1 n ∑ i = 1 n ( y i − π i ) [ 1 X i ] = 0.</p><p>The expression for T is</p><p>T = 1 n ∑ i = 1 n ( y i − π i ) ( 1 − 2 π i ) [ 1 x i x i 2 ]</p><p>and this is clearly going to be highly correlation with S. Therefore, the appropriate variance for the IMT is var ( T | S = 0 ) . As T and S are sums of independent elements, the Central limit Theorem implies that ( T , S ) T is asymptotically Normal and so we can use</p><p>var ( T | S = 0 ) = var ( T ) − cov ( T , S ) var ( S ) − 1 cov ( T , S ) T . (9)</p><p>To work out var ( T | S = 0 ) , so, in this case we can write</p><p>var ( T ) = var ( [ d g 1 + d g 2 + ⋯ + d g n ] / n ) = var ( d g 1 ) ,</p><p>and similarly</p><p>var ( S ) = var ( ∇ l 1 ) , cov ( T , S ) = cov ( d g 1 , ∇ l 1 ) .</p><sec id="s3_1"><title>3.1. The Variance of IMT under Missing Covariates for Logistic Regression Model</title><p>We now need to find expressions for var ( d g 1 ) , var ( ∇ l 1 ) and cov ( d g 1 , ∇ l 1 )</p><p>We already have that</p><p>d g = ( y i − π i ) ( 1 − 2 π i ) [ 1 x i x i 2 ]</p><p>and</p><p>∇ l i = ( y i − π i ) [ 1 x i ]</p><p>so, the variance is</p><p>var ( d g ) = E ( d g d g T ) − E ( d g ) E ( d g T ) (10)</p><p>and we have</p><p>d g d g T = ( y − π ) 2 ( 1 − 2 π ) 2 [ 1 x i x i 2 x i x i 2 x i 3 x i 2 x i 3 x i 4 ] (11)</p><p>taking expectation E Y | X we obtain</p><p>E ( d g 1 ) = E X [ ( π t − π ) ( 1 − 2 π ) [ 1 x i x i 2 ] ] (12)</p><p>and,</p><p>E ( d g 1 d g 1 T ) = E X [ ( π t ( 1 − 2 π ) + π 2 ) ( 1 − 2 π ) 2 [ 1 X X 2 X X 2 X 3 X 2 X 3 X 4 ] ] . (13)</p><p>Now we need to compute cov ( d g , ∇ l ) . In fact E ( ∇ l ) = 0 , not only if the model is correct but also when evaluated at the least false value θ * (under wrong model), so in this case</p><p>cov ( d g 1 , ∇ l 1 ) = E ( d g ∇ l ) T .</p><p>and we have</p><p>d g 1 ∇ l 1 T = ( y − π ) ( 1 − 2 π ) [ 1 x i x i 2 ] ( y − π ) [ 1 x i ] = ( y − π ) 2 ( 1 − 2 π ) [ 1 x i x i x i 2 x i 2 x i 3 ]</p><p>then,</p><p>E ( d g 1 ∇ l 1 T ) = E X [ ( π t ( 1 − 2 π ) + π 2 ) ( 1 − 2 π ) [ 1 X X X 2 X 2 X 3 ] ] . (14)</p><p>Now we will work out var ( ∇ l ) , as before, since E ( ∇ l ) = 0 , so</p><p>var ( ∇ l 1 ) = E ( ∇ l ∇ l T ) = E X E Y | X [ ( Y − π ) 2 ( Y − π ) 2 X ( Y − π ) 2 X ( Y − π ) 2 X 2 ]</p><p>and note that</p><p>E Y | X ( Y − π ) 2 = E Y | X ( Y ( 1 − 2 π ) + π 2 ) = π t ( 1 − 2 π ) + π 2 ,</p><p>where, π t is E ( Y ) under the true model. So,</p><p>E ( ∇ l ∇ l T ) = E X [ π t ( 1 − 2 π ) + π 2 ( π t ( 1 − 2 π ) + π 2 ) X ( π t ( 1 − 2 π ) + π 2 ) X ( π t ( 1 − 2 π ) + π 2 ) X 2 ] . (15)</p><p>Hence, the required variance (9)</p><p>E ( d g d g T ) − E ( d g ) E ( d g T ) − E ( d g ∇ l T ) E ( ∇ l ∇ l T ) − 1 E ( ( ∇ l ) d g T ) (16)</p><p>and we have expressions for each component from (12), (13), (14) and (15) We need to evaluate these components by simulation.</p></sec><sec id="s3_2"><title>3.2. The Dispersion Matrix under Wrong Model</title><p>In fact, may be some elements of the covariance matrix of the IMT are linear combinations of others leading to singularity of the estimated covariance matrix, this point discussed by [<xref ref-type="bibr" rid="scirp.107466-ref1">1</xref>] and [<xref ref-type="bibr" rid="scirp.107466-ref12">12</xref>]. We are interested to compute the var ( T | S = 0 ) , even when the wrong model has been fitted. We will compute each of the components of this variance separately. We see from Section 3.1 that we need to evaluate, e.g.</p><p>E ( d ) = E X ( ( π t − π ) ( 1 − 2 π ) [ 1 X X 2 ] )</p><p>and also,</p><p>E ( d d T ) = E X ( [ π t ( 1 − 2 π ) + π 2 ] ( 1 − 2 π ) 2 [ 1 X X 2 X X 2 X 3 X 2 X 3 X 4 ] ) .</p><p>This cannot be done analytically so we simulate 5000 values of X and replace the E ( d ) by the mean of these 5000 values. In evaluating π t we use the values of the parameters α t , β 1 t and β 2 t . What do we use for π ? We need to evaluate π ( α , β 1 ) at the least false values α * and β 1 * for α and β 1 . So, e.g, the first element of E ( d ) is found by simulation from</p><p>E X [ ( expit ( α t + β t 1 X 1 + β t 2 X 2 ) − expit ( α * + β 1 * X 1 ) ) ( 1 − 2 expit ( α * + β 1 * X 1 ) ) ]</p><p>where,</p><p>α * = α t + β t 2 ( μ 2 − ρ μ 1 ) 1 + k 2 β t 2 2 σ 2 ( 1 − ρ 2 ) , (17)</p><p>β 1 * = β t 1 + ρ β t 2 1 + k 2 β t 2 2 σ 2 ( 1 − ρ 2 ) (18)</p><p>and X draw from bivariate normal distribution with μ = ( μ 1 , μ 2 ) , and σ 1 2 = σ 2 2 . The formulae of the least false values α * and β * has been discussed and calculated by [<xref ref-type="bibr" rid="scirp.107466-ref10">10</xref>].</p></sec></sec><sec id="s4"><title>4. Empirical Variance of IMT</title><p>The expression in (16) is the variance V of d at θ ^ but we need an estimate, V ^ . If we have a sample { ( y i , x i 1 ) | i = 1, ⋯ , n } how can we estimate V consistently? One candidate would be to compute</p><p>d i = ( y i − π ^ i ) ( 1 − 2 π ^ i ) [ 1 x i x i 2 ] ,   i = 1 , ⋯ , n</p><p>and</p><p>∇ l i = ( y i − π ^ i ) [ 1 x i ] ,   i = 1 , ⋯ , n</p><p>where, π ^ i is the fitted value from the model with just x 1 . Now compute</p><p>W ^ n = 1 n ∑ i = 1 n     d i d i T − ( 1 n ∑ i = 1 n     d i ) ( 1 n ∑ i = 1 n     d i T )</p><p>and</p><p>B ^ n = 1 n ∑ i = 1 n ( y i − π ^ ) 2 [ 1 x i x i x i 2 ] ,</p><p>C ^ n = 1 n ∑ i = 1 n ( y i − π ^ ) 2 ( 1 − 2 π ^ i ) [ 1 x i x i x i 2 x i 2 x i 3 ]</p><p>Then use</p><p>V ^ = W ^ n − C ^ n B ^ n − 1 C ^ n T (19)</p><p>as an estimate of V, we will assess this by simulation.</p></sec><sec id="s5"><title>5. Simulation Study</title><p>This simulation examines the correctness of the form of the dispersion matrix V in (16) and (19). To achieve the aim of this simulation, we will consider a logistic regression model which has two covariates draw from bivariate normal distribution with mean zero and covariance matrix Σ as:</p><p>π t = expit ( α t + β t 1 x 1 + β t 2 x 2 )</p><p>and the fitted model is</p><p>π = expit ( α + β 1 x 1 )</p><p>・ Apply in two cases of logistic model,</p><p>・ The fitted is the true logistic model (i.e. β t 2 = 0 )</p><p>・ The fitted model is mis-specified (i.e. β t 2 ≠ 0 ).</p><p>・ Use variance ( σ 1 2 = σ 2 2 = 2 ) and correlation ρ = 0.1 .</p><p>・ We choose some different components of parameters α t , β t 1 and β t 2 to calculate π t .</p><p>・ We compute the least false values α * and β 1 * by formulae to calculate π .</p><p>・ We compute the true variance by simulating d i and take the variance to be var ( n d &#175; ) = V t r .</p><p>・ We compute the theoretical variance var ( d ) = V T at the least false value and calculate E ( d 1 ) and E ( d 1 d 1 T ) as described in section 3.2.</p><p>・ Finally, for each simulation we compute the empirical variance V E and take the mean over the simulations.</p><p>・ We make comparison between the diagonal elements of dispersion matrix V E , V T vs. V t r respectively.</p><p>・ Apply on different sample size n = 500 , 1000 and N = 5000 number of simulations.</p></sec><sec id="s6"><title>6. Results and Discussion</title><p>The results were reported in tables, which show the diagonal elements of the variance matrix: V E denotes the empirical variance, V T denotes the theoretical variance and V t r denotes the true variance. The true parameters appear as α t , β t 1 , and β t 2 ; R n E and R n T denote to the rank of the covariance matrix</p><p>empirical and theoretical respectively. The Ratio R E and R T are V E V t r , V T V t r respectively. S . D ( π t ) denotes the standard deviation over a sample</p><p>where π t is the true model. In our simulation we consider two covariates, so in this case the dispersion matrix of d is a 3 &#215; 3 dimensional matrix.</p><p>Firstly, we consider the results under true logistic model, <xref ref-type="table" rid="table1">Table 1</xref>, shows the results of simulation, which appeared the diagonal elements of matrix V, the empirical version and theoretical form comparing with true variance, which use ρ = 0.1 in case of σ 1 2 = σ 2 2 = 2 by sample size n = 500 . <xref ref-type="table" rid="table2">Table 2</xref>, reported the results by sample size n = 1000 , with equal variance σ 1 2 = σ 2 2 = 2 . We can see clearly, that all diagonal elements appeared small in value in two different cases of sample size. The first element was much closer to zero than of the rest. In almost cases the results appeared reasonable ratio which is meaning the theoretical variance and empirical variance are close to the true value. There are some slightly strange ratio almost in case of sample size n = 500 , the reason may be affected by small value of standard deviation of π t S . D ( π t ) , otherwise the ratio is close to one. In case of sample size n = 1000 , the behaviour of results shows almost the same pattern, with the ratio close to one and that is meaning the formulae of the variance works well. In a few cases with small values of S . D ( π t ) which affected on the ratio where the first two elements were more sensitive. Overall, we have reasonable results to say that, the alternative formulae of variance works well and the two first elements still more sensitive which appeared tend to zero.</p><p>Secondly, we consider the results when the missing covariate logistic model has been fitted. That is meaning when the variance of IMT computed under H<sub>1</sub> and uses the least false values. <xref ref-type="table" rid="table3">Table 3</xref>, shows the results of sample size n = 500 . <xref ref-type="table" rid="table4">Table 4</xref>, shows the results of sample size 1000. In general, the behaviour of ratio</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Simulation results of the variance ( V t r ), ( V E ) and ( V T ) in case of fitted true model, with sample size n = 500 and σ 1 2 = σ 2 2 = 2 </title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="11"  >Diagonal component of variance IMT and Ratio</th></tr></thead><tr><td align="center" valign="middle" >α t</td><td align="center" valign="middle" >β t 1</td><td align="center" valign="middle" >π t</td><td align="center" valign="middle" >S . D ( π t )</td><td align="center" valign="middle" >R n E</td><td align="center" valign="middle" >R n T</td><td align="center" valign="middle" >V E</td><td align="center" valign="middle" >V T</td><td align="center" valign="middle" >V t r</td><td align="center" valign="middle" >R 1</td><td align="center" valign="middle" >R 2</td></tr><tr><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >0.68</td><td align="center" valign="middle" >0.14</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >2.4323e<sup>−04</sup></td><td align="center" valign="middle" >2.4817e<sup>−04</sup></td><td align="center" valign="middle" >2.4979e<sup>−04</sup></td><td align="center" valign="middle" >0.99</td><td align="center" valign="middle" >0.99</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >4.2714e<sup>−02</sup></td><td align="center" valign="middle" >4.6277e<sup>−02</sup></td><td align="center" valign="middle" >4.3906e<sup>−02</sup></td><td align="center" valign="middle" >0.99</td><td align="center" valign="middle" >1.02</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >1.9446e<sup>−01</sup></td><td align="center" valign="middle" >2.1171e<sup>−01</sup></td><td align="center" valign="middle" >2.0659e<sup>−01</sup></td><td align="center" valign="middle" >0.97</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >1.20</td><td align="center" valign="middle" >2.20</td><td align="center" valign="middle" >0.63</td><td align="center" valign="middle" >0.36</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >1.4666e<sup>−03</sup></td><td align="center" valign="middle" >1.6904e<sup>−03</sup></td><td align="center" valign="middle" >1.6126e<sup>−03</sup></td><td align="center" valign="middle" >0.95</td><td align="center" valign="middle" >1.02</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >1.8138e<sup>−02</sup></td><td align="center" valign="middle" >2.0199e<sup>−02</sup></td><td align="center" valign="middle" >1.9872e<sup>−02</sup></td><td align="center" valign="middle" >0.96</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >2.5199e<sup>−02</sup></td><td align="center" valign="middle" >2.9136e<sup>−02</sup></td><td align="center" valign="middle" >2.8595e<sup>−02</sup></td><td align="center" valign="middle" >0.94</td><td align="center" valign="middle" >1.01</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Simulation results of the variance ( V t r ), ( V E ) and ( V T ) in case of fitted true model, with sample size n = 1000 and σ 1 2 = σ 2 2 = 2 </title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="11"  >Diagonal component of variance IMT and Ratio</th></tr></thead><tr><td align="center" valign="middle" >α t</td><td align="center" valign="middle" >β t 1</td><td align="center" valign="middle" >π t</td><td align="center" valign="middle" >S . D ( π t )</td><td align="center" valign="middle" >R n E</td><td align="center" valign="middle" >R n T</td><td align="center" valign="middle" >V E</td><td align="center" valign="middle" >V T</td><td align="center" valign="middle" >V t r</td><td align="center" valign="middle" >R 1</td><td align="center" valign="middle" >R 2</td></tr><tr><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >0.67</td><td align="center" valign="middle" >0.14</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >2.4038e<sup>−04</sup></td><td align="center" valign="middle" >2.4441e<sup>−04</sup></td><td align="center" valign="middle" >2.4915e<sup>−04</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >0.99</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >4.3783e<sup>−02</sup></td><td align="center" valign="middle" >4.4257e<sup>−02</sup></td><td align="center" valign="middle" >4.4706e<sup>−02</sup></td><td align="center" valign="middle" >0.99</td><td align="center" valign="middle" >0.99</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >1.9858e<sup>−01</sup></td><td align="center" valign="middle" >1.9486e<sup>−01</sup></td><td align="center" valign="middle" >1.0478e<sup>−01</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >0.98</td></tr><tr><td align="center" valign="middle" >1.20</td><td align="center" valign="middle" >2.20</td><td align="center" valign="middle" >0.64</td><td align="center" valign="middle" >0.35</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >1.5709<sup>−03</sup></td><td align="center" valign="middle" >1.6876e<sup>−03</sup></td><td align="center" valign="middle" >1.6469e<sup>−03</sup></td><td align="center" valign="middle" >0/98</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >1.9049e<sup>−02</sup></td><td align="center" valign="middle" >2.0225e<sup>−02</sup></td><td align="center" valign="middle" >2.0199e<sup>−02</sup></td><td align="center" valign="middle" >0.97</td><td align="center" valign="middle" >1.00</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >2.6726e<sup>−02</sup></td><td align="center" valign="middle" >2.9664e<sup>−02</sup></td><td align="center" valign="middle" >2.7877e<sup>−02</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >1.03</td></tr></tbody></table></table-wrap><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> Simulation results of the variance ( V t r ), ( V E ) and ( V T ) in case of fitted missing covariates model, with sample size n = 500 and σ 1 2 = σ 2 2 = 2 </title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="12"  >Diagonal component of variance IMT and Ratio</th></tr></thead><tr><td align="center" valign="middle" >α t</td><td align="center" valign="middle" >β t 1</td><td align="center" valign="middle" >β t 2</td><td align="center" valign="middle" >π t</td><td align="center" valign="middle" >S . D ( π t )</td><td align="center" valign="middle" >R n E</td><td align="center" valign="middle" >R n T</td><td align="center" valign="middle" >V E</td><td align="center" valign="middle" >V T</td><td align="center" valign="middle" >V t r</td><td align="center" valign="middle" >R 1</td><td align="center" valign="middle" >R 2</td></tr><tr><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >0.4</td><td align="center" valign="middle" >0.66</td><td align="center" valign="middle" >0.18</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >2.3400e<sup>−04</sup></td><td align="center" valign="middle" >2.5316e<sup>−04</sup></td><td align="center" valign="middle" >2.3730e<sup>−04</sup></td><td align="center" valign="middle" >0.99</td><td align="center" valign="middle" >1.03</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >4.3984e<sup>−02</sup></td><td align="center" valign="middle" >5.1198e<sup>−02</sup></td><td align="center" valign="middle" >4.5628e<sup>−02</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >1.05</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >1.9280e<sup>−01</sup></td><td align="center" valign="middle" >2.3559e<sup>−01</sup></td><td align="center" valign="middle" >2.0038e<sup>−01</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >1.08</td></tr><tr><td align="center" valign="middle" >1.20</td><td align="center" valign="middle" >2.20</td><td align="center" valign="middle" >0.8</td><td align="center" valign="middle" >0.62</td><td align="center" valign="middle" >0.36</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >1.2620e<sup>−03</sup></td><td align="center" valign="middle" >1.4252e<sup>−03</sup></td><td align="center" valign="middle" >1.3850e<sup>−03</sup></td><td align="center" valign="middle" >0.95</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >2.1819e<sup>−02</sup></td><td align="center" valign="middle" >2.3741e<sup>−02</sup></td><td align="center" valign="middle" >2.3411e<sup>−02</sup></td><td align="center" valign="middle" >0.97</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >3.2250e<sup>−02</sup></td><td align="center" valign="middle" >3.5432e<sup>−02</sup></td><td align="center" valign="middle" >3.6766e<sup>−02</sup></td><td align="center" valign="middle" >0.94</td><td align="center" valign="middle" >0.98</td></tr></tbody></table></table-wrap><table-wrap id="table4" ><label><xref ref-type="table" rid="table4">Table 4</xref></label><caption><title> Simulation results of the variance ( V t r ), ( V E ) and ( V T ) in case of fitted missing covariates model, with sample size n = 1000 and σ 1 2 = σ 2 2 = 2 </title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="12"  >Diagonal component of variance IMT and Ratio</th></tr></thead><tr><td align="center" valign="middle" >α t</td><td align="center" valign="middle" >β t 1</td><td align="center" valign="middle" >β t 2</td><td align="center" valign="middle" >π t</td><td align="center" valign="middle" >S . D ( π t )</td><td align="center" valign="middle" >R n E</td><td align="center" valign="middle" >R n T</td><td align="center" valign="middle" >V E</td><td align="center" valign="middle" >V T</td><td align="center" valign="middle" >V t r</td><td align="center" valign="middle" >R 1</td><td align="center" valign="middle" >R 2</td></tr><tr><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >0.4</td><td align="center" valign="middle" >0.66</td><td align="center" valign="middle" >0.18</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >2.3253e<sup>−04</sup></td><td align="center" valign="middle" >2.6095e<sup>−04</sup></td><td align="center" valign="middle" >2.2984e<sup>−04</sup></td><td align="center" valign="middle" >1.01</td><td align="center" valign="middle" >1.06</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >4.5129e<sup>−02</sup></td><td align="center" valign="middle" >4.8176e<sup>−02</sup></td><td align="center" valign="middle" >4.4637e<sup>−02</sup></td><td align="center" valign="middle" >1.01</td><td align="center" valign="middle" >1,03</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >1.9837e<sup>−01</sup></td><td align="center" valign="middle" >2.3374e<sup>−01</sup></td><td align="center" valign="middle" >2.0511e<sup>−01</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >1.06</td></tr><tr><td align="center" valign="middle" >1.20</td><td align="center" valign="middle" >2.20</td><td align="center" valign="middle" >0.8</td><td align="center" valign="middle" >0.59</td><td align="center" valign="middle" >0.36</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >1.3404e<sup>−03</sup></td><td align="center" valign="middle" >1.4576e<sup>−03</sup></td><td align="center" valign="middle" >1.4116e<sup>−03</sup></td><td align="center" valign="middle" >0.97</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >2.2914e<sup>−02</sup></td><td align="center" valign="middle" >2.4611e<sup>−02</sup></td><td align="center" valign="middle" >2.3911e<sup>−02</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >1.01</td></tr><tr><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >3.3481e<sup>−02</sup></td><td align="center" valign="middle" >3.7438e<sup>−02</sup></td><td align="center" valign="middle" >3.4621e<sup>−02</sup></td><td align="center" valign="middle" >0.98</td><td align="center" valign="middle" >1.03</td></tr></tbody></table></table-wrap><p>appeared the same behaviour which found in case of β 2 t = 0 , the two cases of different sample size appeared reasonable ratio which is close to one. A few cases shows low ratio, the reason is as discussed before concerning to the small value of S . D ( π t ) .</p></sec><sec id="s7"><title>7. Conclusion</title><p>This paper carried out to investigate the behaviour of IMT and compute the covariance matrix under the wrong logistic regression model. As result, we can see that the alternative formula of the variance appeared reasonable results under the true and missing covariate model. As we computed the final form of the variance of IMT, we can see clearly it is dependent on E ( d ) . As we know, we made some notes on the first two elements of E ( d ) , which may be quite close to zero under true model and use the least false value, the E ( π t − π ) = E ( ( π t − π ) X ) = 0 related to the log likelihood functions. So, these elements leading to singularity of the estimated covariance matrix, and have effect on the behaviour of the dispersion matrix of the IMT.</p></sec><sec id="s8"><title>Acknowledgements</title><p>I am very grateful to Professor J. N. S. Matthews, School of Mathematics and Statistics, Newcastle University for academic supporting and Dr. Hamza M. A. Boauod, (FCOPHTH) (SA), Consultant Ophthalmologist, Eye Department, Klerksdorp Hospital. South Africa for his financial support. Also thank the referees, associate editor and joint editor for their helpful comments and additional references.</p></sec><sec id="s9"><title>Conflicts of Interest</title><p>The author declares no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s10"><title>Cite this paper</title><p>Badi, N.H.S. (2021) The Behaviour of the Dispersion Matrix of the Information Matrix Test under the Wrong Logistic Regression Model e. Open Access Library Journal, 8: e7183. https://doi.org/10.4236/oalib.1107183</p></sec></body><back><ref-list><title>References</title><ref id="scirp.107466-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Hausman, J. A. (1978) Spesification Tests in Econometrics. Econometrica, 46, 1251- 1271. &lt;br /&gt;https://doi.org/10.2307/1913827</mixed-citation></ref><ref id="scirp.107466-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Chesher, A. (1984) Testing for Neglected Heterogeneity. Econometrica, 52, 865-872.  
&lt;br /&gt;https://doi.org/10.2307/1911188</mixed-citation></ref><ref id="scirp.107466-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Newey, W.K. (1984) Maximum Likelihood Specification Testing and Conditional Moment Tests. Econometrica, 53, 1047-1070. https://doi.org/10.2307/1911011</mixed-citation></ref><ref id="scirp.107466-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Davidson, R. and Mackinnon, J.G. (1984) Convenient Specification Tests for Logit and Probit Models. Journal of Econometrics, 25, 241-262.  
https://doi.org/10.1016/0304-4076(84)90001-0</mixed-citation></ref><ref id="scirp.107466-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Orme, C. (1988) The Calculation of the Information Matrix Test for Binary Data Models. The Manchester School, 56, 370-376.  
https://doi.org/10.1111/j.1467-9957.1988.tb01339.x</mixed-citation></ref><ref id="scirp.107466-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Lancaster, T. (1984) Covariance Matrix of the Information Matrix Test. Econometrica, 52, 1051-1053. https://doi.org/10.2307/1911198</mixed-citation></ref><ref id="scirp.107466-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Kuss, O. (2002) Global Goodness-of-Fit Tests in Logistic Regression with Sparse Data. Statistics in Medicine, 21, 3789-3801. https://doi.org/10.1002/sim.1421</mixed-citation></ref><ref id="scirp.107466-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Matthews, J.N.S. and Badi, N.H. (2015) Inconsistent Treatment Estimates from Mis-Specified Logistic Regression Analyses of Randomized Trials. Statistics in Medicine, 34, 2681-2694. &lt;br /&gt;https://doi.org/10.1002/sim.6508</mixed-citation></ref><ref id="scirp.107466-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Badi, N.H.S. (2017) Properties of the Maximum Likelihood Estimates and Bias Reduction for Logistic Regression Model. Open Access Library Journal, 4, e3625.</mixed-citation></ref><ref id="scirp.107466-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Claeskens, G. and Hjort, N.L. (2008) Model Selection and Model Averaging. Cambridge University Press, Cambridge.</mixed-citation></ref><ref id="scirp.107466-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Lin, D.Y. and Wel, L.J. (1991) Goodness-of-Fit Tests for the General Cox Regression Model. Statistica Sinica, 1, 1-17.</mixed-citation></ref><ref id="scirp.107466-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">White, H. (1982) Maximum Likelihood Estimation of Misspecified Models. Econometrica, 50, 1-25. https://doi.org/10.2307/1912526</mixed-citation></ref></ref-list></back></article>