<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">PSYCH</journal-id><journal-title-group><journal-title>Psychology</journal-title></journal-title-group><issn pub-type="epub">2152-7180</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/psych.2014.518200</article-id><article-id pub-id-type="publisher-id">PSYCH-51663</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Social Sciences&amp;Humanities</subject></subj-group></article-categories><title-group><article-title>
 
 
  A Procedure for Diagnostically Modeling Extant Large-Scale Assessment Data: The Case of the Programme for International Student Assessment in Reading
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>insong</surname><given-names>Chen</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Jimmy</surname><given-names>de la Torre</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>Department of Educational Psychology, Rutgers, The State University of New Jersey, New Brunswick, USA</addr-line></aff><aff id="aff1"><addr-line>Department of Psychology, Sun Yat-sen University, Guangzhou, China</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>jinsong.chen@live.com(IC)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>24</day><month>11</month><year>2014</year></pub-date><volume>05</volume><issue>18</issue><fpage>1967</fpage><lpage>1978</lpage><history><date date-type="received"><day>17</day>	<month>September</month>	<year>2014</year></date><date date-type="rev-recd"><day>12</day>	<month>October</month>	<year>2014</year>	</date><date date-type="accepted"><day>8</day>	<month>November</month>	<year>2014</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  Cognitive diagnosis models (CDMs) are psychometric models developed mainly to assess examinees’ specific strengths and weaknesses of a set of skills or attributes within a domain. Recently, several methodological developments have been added to the CDM literature, which include the development of general and reduced CDMs, various absolute and relative fit measures at both the test and item levels, and a general Q-matrix validation procedure. Building on these developments, this research proposes a systematic procedure to diagnostically model extant large-scale assessment data. The procedure can be divided into four phases: construction of initial attributes and Q-matrices, construction of final attributes and Q-matrix, evaluation of reduced CDMs, and crossvalidation of the selected model. Working with language experts, we use data from the PISA 2000 reading assessment to illustrate the procedure.
 
</p></abstract><kwd-group><kwd>CDM</kwd><kwd> Q-Matrix</kwd><kwd> Large-Scale Assessment</kwd><kwd> Fit Measures</kwd><kwd> PISA</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Cognitive diagnosis models (CDMs) are psychometric models developed mainly to assess examinees’ specific strengths and weaknesses, or mastery or nonmastery of a given set of skills or attributes within a domain. Different from conventional unidimensional item response models (IRMs) that rank examinees along a proficiency continuum, CDMs with latent classes are employed for the purpose of diagnosing the presence or absence of multiple fine-grained attributes. In conjunction with an appropriate Q-matrix  (Tatsuoka, 1983) , CDMs can be applied to different assessments for diagnostic purposes, thereby facilitating a more precise measurement of student learning and aiding in the design of better instruction. Recently, several methodological developments have been added to the CDM literature. Among others, these developments include the generalization of highly constrained models like the deterministic inputs, noisy “and” gate (DINA;  Haertel, 1989;   Junker &amp; Sijtsma, 2001 ) model and the deterministic inputs, noisy “or” gate (DINO;  Templin &amp; Henson, 2006 ) model to saturated models such as the log-linear CDM  (Henson, Templin, &amp; Willse, 2009)  and the generalized DINA (G-DINA;  de la Torre, 2011 ) model; different reduced CDMs like the additive CDM (A-CDM;  de la Torre, 2011 ), the linear logistic model (LLM;  Maris, 1999 ), and the reduced reparameterized unified model (R-RUM;  DiBello, Roussos, &amp; Stout, 2007; Hartz, 2002 ); general CDM for expert-defined polytomous attributes  (Chen &amp; de la Torre, 2013) ; various model-data misfit measures  (Chen, de la Torre, &amp; Zhang, 2013; de la Torre &amp; Chen, 2011; de la Torre &amp; Lee, 2010;   Kunina-Habenicht, Rupp, &amp; Wilhelm, 2012)  for absolute or relative fit evaluation on the test or item levels; and a general Q-matrix validation procedure  (de la Torre, 2008; de la Torre &amp; Chiu, 2010) .</p><p>Compared to the methodological developments, however, empirical applications of CDMs are still limited. The usefulness of these developments is best reflected by the breadth and depth of their applications across substantive areas. Although empirical examples were usually provided when developing the above methodologies, they were largely limited to the mathematics domain, particularly the subtraction fraction data by  Tatsuoka (1990) . However, if used in a systematic way, these developments can allow non-diagnostic assessments to be adapted for diagnostic purposes. We found that large-scale assessments like the Programme for International Student Assessment (PISA), Trends in International Mathematics and Science Study (TIMSS), or the National Assessment of Educational Progress (NAEP) can be adapted. Considering the large amount of resources that have been invested in developing the assessments, it would be cost-effective if these assessments can be used for other purposes such as drawing fine-grained inferences about what students can and cannot do.</p><p>In this research, we proposed a systematic procedure of modeling extant large-scale assessments for diagnostic purpose by capitalizing on and integrating recent CDM developments. By working with language experts, we used the PISA reading assessment to demonstrate the procedure in practice. A domain different from mathematics was chosen for a wider application of the developments. We expect that a similar procedure is applicable to other large-scale assessments and/or other subject matters. It is worth noting that the PISA reading domain is somewhat different from conventional reading in that it does not focus on text decoding or literal comprehension  (OECD, 1999, 2006a) . Instead, PISA reading emphasizes the understanding of reading literacy under the context of daily activities. As a result, we found attributes well beyond the scope of the traditional reading domain (e.g., number sense). In light of this, comparisons between this research and those of the diagnostic assessments in the conventional reading domain (e.g.,  Jang, 2009; Lee &amp; Sawaki, 2009; von Davier, 2008 ) should be done with caution. In the rest of this research, we will first summarize the methodological background required for the modeling procedure. After the description of the PISA reading assessments and released items, we will illustrate the four phases of the modeling procedure. The paper concludes with a discussion of the results and some implications of this work.</p></sec><sec id="s2"><title>2. Background</title><sec id="s2_1"><title>2.1. Q-Matrix and Required Attributes</title><p>For any assessment to provide useful diagnostic information, the Q-matrix plays an important role, as it provides the specification of attributes for the items. As in, the Q-matrix describes the relationship between the items and the attributes being measured. Let q<sub>jk</sub> denote the element in row j and column k of a J &#215; K Q-matrix, where J and K represent the numbers of items and attributes, respectively. The element q<sub>jk</sub> is specified to be one if the kth attribute is required to answer item j correctly, and zero otherwise. The jth row of the Q-matrix (i.e., q<sub>j・</sub>) is called</p><p>the jth q-vector, which gives the attribute specification of item j. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x6.png" xlink:type="simple"/></inline-formula>is used to denote the number of required attributes for item j. For notational convenience, let the first <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x7.png" xlink:type="simple"/></inline-formula> attributes be required for item j. The required attributes for item j can be represented by the reduced vector<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x8.png" xlink:type="simple"/></inline-formula>, where<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x9.png" xlink:type="simple"/></inline-formula>. By adopting the concept of required attributes we can simplify the model because the number of attribute vectors to be considered for item j reduces from <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x10.png" xlink:type="simple"/></inline-formula> to<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x11.png" xlink:type="simple"/></inline-formula>. The probability that examinees with reduced vector <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x12.png" xlink:type="simple"/></inline-formula> will answer item j correctly is denoted as<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x13.png" xlink:type="simple"/></inline-formula>.</p></sec><sec id="s2_2"><title>2.2. Saturated and Reduced CDM</title><p>When there is only one required attribute for item j, there is no need to distinguish between the general and re-</p><p>duced CDM, because both will have two different<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x14.png" xlink:type="simple"/></inline-formula>, corresponding to the two reduced vectors. In prac-</p><p>tice however, it is highly likely that each item measures more than one attribute. For a multiple-attribute item j, the saturated form of its item response function (IRF) in the identity link is</p><disp-formula id="scirp.51663-formula705"><graphic  xlink:href="http://html.scirp.org/file/2-6901312x15.png"  xlink:type="simple"/></disp-formula><p>which has <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x16.png" xlink:type="simple"/></inline-formula> item parameters (i.e.,<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x17.png" xlink:type="simple"/></inline-formula>). Using different link functions (e.g., logit or log) we can get differ-</p><p>ent saturated forms that are linear in the parameters, all of which theoretically provide identical model-data fit  (de la Torre, 2011) .</p><p>By constraining the parameters of the saturated forms, we can obtain different reduced CDMs. In this research, five commonly used reduced CDMs were considered: The DINA model has two parameters per item, and assumes incremental probability only when all the required attributes have been simultaneously mastered; the DINO model also has two parameters per item, and assumes incremental probability when at least one required</p><p>attribute has been mastered; the other three CDMs (i.e., A-CDM, LLM, and R-RUM) all have <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x18.png" xlink:type="simple"/></inline-formula> parameters for item j, and assume additive contributions of the parameters to <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-6901312x19.png" xlink:type="simple"/></inline-formula> based on different link functions.</p><p>Specifically, the identity, logit and log link is adopted by the A-CDM, LLM, and R-RUM, respectively. More technical details about the above saturated and reduced CDMs can be found in  de la Torre (2011)  and  de la Torre and Chen (2011) .</p></sec><sec id="s2_3"><title>2.3. Model-Data Fit Evaluation</title><p>For inferences from any CDM applied on the assessment to be valid, it is important to evaluate the model-data fit. Two types of misfit are of major concern under the context of diagnostic assessment: Q-matrix and CDM misspecifications. Based on simulation study from  Chen, de la Torre, and Zhang (2013) , we can separate the fit evaluation process into two steps: evaluating the appropriateness of the Q-matrix based on a saturated CDM, and then evaluating the appropriateness of reduced CDMs given an appropriate Q-matrix. To detect possible Q-ma- trix misspecification, we can evaluate the ρ statistics (residual between the observed and predicted Fisher- transformed correlation of item pairs) and the l statistics (residual between the observed and predicted log-odds ratios of item pairs). In addition to evaluate the whole Q-matrix, we also found that these two statistics can give hints about the problematic q-vectors in case Q-matrix misspecifications exist. Given that an appropriate Q-ma- trix can be identified based on a saturated CDM, the next step is to find appropriate reduced CDMs. Using the above ρ and l statistics, we can similarly evaluate the fit of the reduced CDMs in the absolute sense. The conventional Akaike’s information criterion (AIC;  Akaike, 1974 ) and Bayesian information criterion (BIC;  Schwarzer, 1976 ) can be also used to compare different CDMs. Finally, a likelihood ratio test (LRT) based on χ<sup>2</sup> distribution can be conducted to compare two nested Q-matrices or models.</p></sec><sec id="s2_4"><title>2.4. Systematic Modeling of Large-Scale Assessment</title><p>To model extant assessments for diagnostic purposes, we need to address four critical issues: 1) defining a set of meaningful attributes; 2) constructing an appropriate Q-matrix; 3) obtaining appropriated reduced CDMs; and 4) validation of the fit results. It is understood that the first and second issues cannot be fully separated with extant assessments. Large-scale assessments have an assessment framework that is designed by content experts and has been field-tested. Used with the released items, we can readily construct different initial attributes and Q-ma- trices for evaluation in the first phase. The above absolute (i.e., ρ and l) and relative (i.e., AIC, BIC and LRT) fit measures can be used to ascertain model-data fit. In this phase, if the absolute fit results are generally poor, the relative fit results can give us directions for possible adjustment of the attributes and Q-matrices.</p><p>In the second phase, we can redefine the attributes and change the Q-matrix specifications based on the findings of the initial set of attributes. It is possible to construct different Q-matrices based on different initial attributes we choose. Although the different Q-matrix could be equally appropriate, they would result in different interpretation of the final attributes. Many initial attributes based on the assessment framework are intended to be exclusive in that they are defined so that each item can measure a single attribute only. One way to improve absolute fit is to redefine the attributes so that each item can measure multiple attributes. Another way is to combine some initial attributes based on conceptual understanding. Both ways can also lower the typically high interrelations among the initial attributes. In doing so, the attributes can provide more diagnostic information. After fixing the attribute definitions, we can re-specify the Q-matrix and fine-tune individual q-vectors using item-level indices. This fine-tuning is performed until the Q-matrix is acceptable, say, at a significance level of α = 0.05. In the third phase, the selected Q-matrix can be used to evaluate the appropriateness of reduced CDMs using the similar absolute or relative fit measures.</p><p>Finally, to ensure that the selected Q-matrix and CDMs are applicable beyond the current data, the above fit results need to be validated using different data. For international assessments like PISA, data from a different country or region with similar culture can be used to validate the fit of the Q-matrix and reduced CDMs. For a national assessment like NAEP or when the sample size is sufficient large (e.g., N &gt; 2000), the original dataset can be separated into two subsets, where one subset can be used for the analysis phases, and the other for the validation phase.</p></sec></sec><sec id="s3"><title>3. Data Description</title><p>To provide meaningful diagnostic information using large-scale assessments, released items are needed to construct Q-matrices based on well-defined attributes. We chose PISA 2000 reading assessment because of the large number of released items associated with the assessment  (OECD, 2006b) . PISA reading assessment is an international assessment measuring 15-year-old students’ reading achievement, with a focus on students’ ability to apply what they learned in school to their daily activities  (OECD, 1999, 2006a) . Booklet 8 and 9 of the assessment are adopted, which consist of 26 released items from six independent articles. Among all participating countries and regions, we found the largest number of examinees in the United Kingdom (UK) using these two Booklets, which will be used for the analysis. Examinees from the United States (US) were chosen to validate the fit results, due to the cultural similarity between these two countries. To ensure the adequacy of the diagnostic information, we remove examinees that missed half or more items on the test, resulting in a sample of 2012 and 802 examinees for the UK and the US, respectively. All data and related technical documents are publicly available in the PISA official web site (http://www.oecd.org/pisa/). We converted the partial-credit items to dichotomous items by considering full credit as successful, and the remaining scores as failure. <xref ref-type="table" rid="table1">Table 1</xref> gives a summary of examinees’ responses to the 26 released items for the UK. Response patterns of the US examinees are similar and are omitted to save space.</p><p>A salient feature of the response data was the large percentage of missing responses towards the end of the test. To have a clearer picture, a visualization of <xref ref-type="table" rid="table1">Table 1</xref> is given in <xref ref-type="fig" rid="fig1">Figure 1</xref>. It can be seen that the trends between the missing and failure patterns are remarkably similar towards the end of the test, especially when we exclude multiple-choice items (i.e., Item 19, 20 and 24). Such a pattern suggests that the missing responses near the end are neither ignorable nor uninformative. Additional discussion of the missing responses is given below.</p><sec id="s3_1"><title>3.1. Modeling Phase One: Initial Attribute Definition and Q-Matrices Construction</title><p>An important component of diagnostic modeling is to construct an appropriate Q-matrix based on defined attributes. As a starting point, our language experts employed the five processes (aspects) of reading under the PISA assessment framework  (OECD, 1999: pp. 28-32, 2006a: pp. 48-52)  as initial attributes. According to the framework, these five interrelated processes are necessary for the full understanding of texts and provided guidance for item design. <xref ref-type="table" rid="table2">Table 2</xref> presents their definitions and how they were operationalized during item development. If we treat each process as one attribute, we can obtain a Q-matrix with five attributes and 26 single- attribute items from the released item manual  (OECD, 2006b) , which will be called Q1 (see α<sub>1</sub> - α<sub>5</sub> in <xref ref-type="table" rid="table3">Table 3</xref>).</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> Distribution of item responses</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-6901312x20.png"/></fig><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Summary of examinees’ responses to the 26 items for the UK</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >No.</th><th align="center" valign="middle" >Code</th><th align="center" valign="middle" >Type</th><th align="center" valign="middle" >%M</th><th align="center" valign="middle" >%F</th><th align="center" valign="middle" >%S</th><th align="center" valign="middle" >No.</th><th align="center" valign="middle" >Code</th><th align="center" valign="middle" >Type</th><th align="center" valign="middle" >%M</th><th align="center" valign="middle" >%F</th><th align="center" valign="middle" >%S</th></tr></thead><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >R040Q02</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.30</td><td align="center" valign="middle" >0.68</td><td align="center" valign="middle" >14</td><td align="center" valign="middle" >R088Q05T</td><td align="center" valign="middle" >CM</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.20</td><td align="center" valign="middle" >0.76</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >R040Q03A</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.38</td><td align="center" valign="middle" >0.55</td><td align="center" valign="middle" >15</td><td align="center" valign="middle" >R088Q07</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.26</td><td align="center" valign="middle" >0.71</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >R040Q03B</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.16</td><td align="center" valign="middle" >0.49</td><td align="center" valign="middle" >0.35</td><td align="center" valign="middle" >16</td><td align="center" valign="middle" >R110Q01</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.17</td><td align="center" valign="middle" >0.82</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >R040Q04</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.21</td><td align="center" valign="middle" >0.75</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >R110Q04</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.14</td><td align="center" valign="middle" >0.83</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >R040Q06</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.39</td><td align="center" valign="middle" >0.56</td><td align="center" valign="middle" >18</td><td align="center" valign="middle" >R110Q05</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.08</td><td align="center" valign="middle" >0.17</td><td align="center" valign="middle" >0.74</td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >R077Q02</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.23</td><td align="center" valign="middle" >0.77</td><td align="center" valign="middle" >19</td><td align="center" valign="middle" >R110Q06</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.20</td><td align="center" valign="middle" >0.79</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >R077Q03</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.11</td><td align="center" valign="middle" >0.26</td><td align="center" valign="middle" >0.63</td><td align="center" valign="middle" >20</td><td align="center" valign="middle" >R216Q01</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.24</td><td align="center" valign="middle" >0.74</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >R077Q04</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.37</td><td align="center" valign="middle" >0.62</td><td align="center" valign="middle" >21</td><td align="center" valign="middle" >R216Q02</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.16</td><td align="center" valign="middle" >0.34</td><td align="center" valign="middle" >0.50</td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >R077Q05</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.08</td><td align="center" valign="middle" >0.55</td><td align="center" valign="middle" >0.37</td><td align="center" valign="middle" >22</td><td align="center" valign="middle" >R216Q03T</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.14</td><td align="center" valign="middle" >0.30</td><td align="center" valign="middle" >0.56</td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >R077Q06</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.48</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >23</td><td align="center" valign="middle" >R216Q04</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.19</td><td align="center" valign="middle" >0.41</td><td align="center" valign="middle" >0.39</td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >R088Q01</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.36</td><td align="center" valign="middle" >0.63</td><td align="center" valign="middle" >24</td><td align="center" valign="middle" >R216Q06</td><td align="center" valign="middle" >M</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.29</td><td align="center" valign="middle" >0.68</td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >R088Q03</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.67</td><td align="center" valign="middle" >0.29</td><td align="center" valign="middle" >25</td><td align="center" valign="middle" >R236Q01</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.09</td><td align="center" valign="middle" >0.34</td><td align="center" valign="middle" >0.56</td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >R088Q04T</td><td align="center" valign="middle" >CM</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.83</td><td align="center" valign="middle" >0.15</td><td align="center" valign="middle" >26</td><td align="center" valign="middle" >R236Q02</td><td align="center" valign="middle" >CR</td><td align="center" valign="middle" >0.40</td><td align="center" valign="middle" >0.42</td><td align="center" valign="middle" >0.18</td></tr></tbody></table></table-wrap><p>Notes: M = multiple choice; CR = constructed response; CM = complex multiple choice; %M = missing%; %F = failing%; %S = successful%.</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Definitions of the five reading processes in pisa reading</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >α<sub>1</sub></th><th align="center" valign="middle" >Retrieving information Match information given in the item with identical or synonymous information in the text and use this to find the new information called for, based on requirements or features specified in the item; examinees have to identify essential elements of a item, like characters, place, time, and setting, and then to search for a match that may be literal or synonymous.</th></tr></thead><tr><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >Forming a broad general understanding Consider the text as a whole or in a broad perspective. Items include identifying the main topic, the general purpose, or use of the text, distinguishing between key ideas and minor details, or recognizing the summary of the main theme in a sentence or title.</td></tr><tr><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >Developing an interpretation Develop a more specific or complete understanding of what the examinees have read beyond their initial impressions. Items call for logical understanding, and include comparing and contrasting information, drawing inferences, or listing supporting evidence.</td></tr><tr><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >Reflecting on and evaluating the content of a text Connect information in a text to knowledge from other sources, or assess the claims in the text against examinees’ own knowledge of the world. Items include providing evidence or arguments from outside the text, assessing the relevance or sufficiency of information or evidence, or drawing comparisons with moral or aesthetic rules.</td></tr><tr><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >Reflecting on and evaluating the form of a text Stand apart from the text, consider it objectively and evaluate its quality and appropriateness. Items include determining the utility of a text for a specified purpose, evaluating an author’s use of textual features for specific goal, describing or commenting on author’s use of style, and identifying the author’s purpose and attitude.</td></tr></tbody></table></table-wrap><p>Notes: Summarized from the PISA assessment framework  (OECD, 1999: pp. 28-32, 2006a: pp. 48-52) .</p><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> Eight initial attributes and corresponding item specifications</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  >Item</th><th align="center" valign="middle"  colspan="8"  >Attribute</th><th align="center" valign="middle"  rowspan="2"  >Item</th><th align="center" valign="middle"  colspan="8"  >Attribute</th></tr></thead><tr><td align="center" valign="middle" >α<sub>1</sub></td><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >α<sub>6</sub></td><td align="center" valign="middle" >α<sub>7</sub></td><td align="center" valign="middle" >α<sub>8</sub></td><td align="center" valign="middle" >α<sub>1</sub></td><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >α<sub>6</sub></td><td align="center" valign="middle" >α<sub>7</sub></td><td align="center" valign="middle" >α<sub>8</sub></td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >14</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >15</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >16</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >18</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >19</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >20</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >21</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >22</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >23</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >24</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >25</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >26</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Sum</td><td align="center" valign="middle" >5</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >7</td><td align="center" valign="middle" >15</td><td align="center" valign="middle" >10</td></tr></tbody></table></table-wrap><p>Notes: α<sub>1</sub> - α<sub>5</sub>: see <xref ref-type="table" rid="table2">Table 2</xref>; α<sub>6</sub> = related to test speededness; α<sub>7</sub> = interpreting non-continuous texts; α<sub>8</sub> = number sense.</p><p>In addition to the five processes, the PISA framework defined text format like information sheets, tables, diagrams, charts, and graphs as non-continuous texts  (OECD, 2006a: pp. 47-48) , of which the first three articles (i.e., R040, R077, and R088) were largely consisted. Hence, our language experts defined an additional initial attribute to interpret non-continuous texts. Furthermore, we noticed that the first and third articles were rich in numbers, and defined another initial attribute as number sense. Lastly, as shown in <xref ref-type="table" rid="table1">Table 1</xref> and <xref ref-type="fig" rid="fig1">Figure 1</xref>, both the missing and failure patterns tended to increase towards the end of the test. Taking into account that items towards the end are not necessarily increasingly difficult, it suggests that some examinees simply needed more time to answer those items correctly. Accordingly, our language experts created another attribute related to test speededness, which means that the examinees who master the attribute have the ability to fully consider all items within the test time. Assuming that the examinees finish the items in order, the attribute would be required to answer the last seven items in the last two articles successfully. Altogether, we constructed eight initial attributes with their item specifications shown in <xref ref-type="table" rid="table3">Table 3</xref>.</p><p>Based on these attributes, we created five additional Q-matrices for evaluation, all of which contain Q1 as a subset, as shown in <xref ref-type="table" rid="table4">Table 4</xref>. Conceptually, these six Q-matrices can be divided into three hierarchical layers: 1) Q1 (first five attributes); 2) Q2, Q3, and Q4 (adding one attribute); and 3) Q5 and Q6 (adding two attributes). Note that α<sub>7</sub> and α<sub>8</sub> were not used together in the same Q-matrix because the latter is just a subset of the former with the current set of released items. With these Q-matrices, we can evaluate the model-data fit using the saturated model and corresponding fit statistics as discussed in the Background Section. <xref ref-type="table" rid="table4">Table 4</xref> presents the fit results using these Q-matrices. As expected, none of the Q-matrices can be accepted at 0.01 significant level based on either the r or l statistics. In addition, the number of items with poor fitting values is not small for any Q-ma- trix. However, we can see the direction of improvement from the first to the third layer based on relative fit results (i.e., AIC, BIC). Based on their nested relationships and the LRT, we found that: 1) any of Q2, Q3, or Q4 was significantly better than Q1; 2) Q5 was significantly better than Q2 or Q3; and 3) Q6 was significantly better than Q2 or Q4 (p ≈ 0 in all cases). Based on the above results, we can choose either Q5 or Q6 as a basis to finalize the attribute definitions and Q-matrix specifications in the next phase.</p><table-wrap id="table4" ><label><xref ref-type="table" rid="table4">Table 4</xref></label><caption><title> Model-data fitting results for Q1 to Q6</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Q-matrix</th><th align="center" valign="middle" >Attribute</th><th align="center" valign="middle" >−2LL</th><th align="center" valign="middle" >df</th><th align="center" valign="middle" >AIC</th><th align="center" valign="middle" >BIC</th><th align="center" valign="middle" >Max. z(r)</th><th align="center" valign="middle" >#z(r)</th><th align="center" valign="middle" >Max. z(l)</th><th align="center" valign="middle" >#z(l)</th></tr></thead><tr><td align="center" valign="middle" >Q1</td><td align="center" valign="middle" >α<sub>1</sub> - α<sub>5</sub></td><td align="center" valign="middle" >54935</td><td align="center" valign="middle" >83</td><td align="center" valign="middle" >55101</td><td align="center" valign="middle" >55566</td><td align="center" valign="middle" >10.60</td><td align="center" valign="middle" >23</td><td align="center" valign="middle" >11.20</td><td align="center" valign="middle" >23</td></tr><tr><td align="center" valign="middle" >Q2</td><td align="center" valign="middle" >Q1 + α<sub>6</sub></td><td align="center" valign="middle" >54480</td><td align="center" valign="middle" >129</td><td align="center" valign="middle" >54738</td><td align="center" valign="middle" >55461</td><td align="center" valign="middle" >10.38</td><td align="center" valign="middle" >22</td><td align="center" valign="middle" >10.96</td><td align="center" valign="middle" >20</td></tr><tr><td align="center" valign="middle" >Q3</td><td align="center" valign="middle" >Q1 + α<sub>7</sub></td><td align="center" valign="middle" >54242</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >54532</td><td align="center" valign="middle" >55345</td><td align="center" valign="middle" >9.66</td><td align="center" valign="middle" >20</td><td align="center" valign="middle" >8.82</td><td align="center" valign="middle" >20</td></tr><tr><td align="center" valign="middle" >Q4</td><td align="center" valign="middle" >Q1 + α<sub>8</sub></td><td align="center" valign="middle" >54193</td><td align="center" valign="middle" >135</td><td align="center" valign="middle" >54463</td><td align="center" valign="middle" >55220</td><td align="center" valign="middle" >9.57</td><td align="center" valign="middle" >16</td><td align="center" valign="middle" >8.79</td><td align="center" valign="middle" >17</td></tr><tr><td align="center" valign="middle" >Q5</td><td align="center" valign="middle" >Q1 + α<sub>6</sub> + α<sub>7</sub></td><td align="center" valign="middle" >53686</td><td align="center" valign="middle" >223</td><td align="center" valign="middle" >54132</td><td align="center" valign="middle" >55382</td><td align="center" valign="middle" >6.43</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >6.25</td><td align="center" valign="middle" >9</td></tr><tr><td align="center" valign="middle" >Q6</td><td align="center" valign="middle" >Q1 + α<sub>6</sub> + α<sub>8</sub></td><td align="center" valign="middle" >53665</td><td align="center" valign="middle" >213</td><td align="center" valign="middle" >54091</td><td align="center" valign="middle" >55285</td><td align="center" valign="middle" >5.89</td><td align="center" valign="middle" >8</td><td align="center" valign="middle" >4.71</td><td align="center" valign="middle" >7</td></tr></tbody></table></table-wrap><p>Note: −2LL: −2 &#215; log-likelihood; df: degree of freedom; Max. z(r) &amp; Max. z(l): maximum z score for r and l; #z(r) &amp; #z(l): number of items with Max. z(ρ) and Max. z(l) at p &lt; 0.01; critical z score = 4.17 for α = 0.01 (with the Bonferroni correction of 26 &#215; 25 comparisons).</p></sec><sec id="s3_2"><title>3.2. Modeling Phase Two: Final Attribute Definition and Q-Matrix Construction</title><p>The only difference between Q5 and Q6 is the use of either α<sub>7</sub> or α<sub>8</sub>. Our language experts found that either one can help to construct an appropriate Q-matrix, but will result in different attribute interpretations. In this research we select Q6 because α<sub>8</sub> can help to interpret the final attributes in a more straightforward way. Specifically, we noticed that α<sub>8</sub> can be conceptually incorporated into α<sub>4</sub>, because number sense is exactly one type of external knowledge that is needed to interpret the article. By combining α<sub>4</sub> and α<sub>8</sub> (i.e., α<sub>4</sub> + α<sub>8 </sub>&gt;1) together as new α<sub>4</sub>, we transformed Q6 into a six-attribute Q-matrix Q7. The relative fit for Q7 was worse than that for Q6 based on LRT (p ≈ 0) or the BIC, and the absolute fit for Q7 was even further away from an acceptable value compared with Q6 (<xref ref-type="table" rid="table5">Table 5</xref>). But we did find one improvement: In Q6 the first five attributes were highly interrelated (<xref ref-type="table" rid="table6">Table 6</xref>), which implied that limited diagnostic information on these five attributes can be obtained because mastering any of them suggests a large chance of mastering them all. In Q7, α<sub>4</sub> was separated out from the other four attributes with lower values in the correlation matrix (<xref ref-type="table" rid="table6">Table 6</xref>).</p><p>To improve the absolute fit of Q7, our language experts adjusted the definitions of the first five attributes. We redefined α<sub>4</sub> to focus on number sense only, and extended α<sub>5</sub> to cover both the form and content of a text. Attributes α<sub>1</sub> to α<sub>3</sub> were also adjusted and the revised definitions are given in <xref ref-type="table" rid="table7">Table 7</xref>. One major change was to make the original definitions less exclusive so that the items could measure multiple attributes. With these adjustments, the absolute fit of the adjusted Q-matrix was much closer to the acceptable value. After that, we utilized suggestions from the item-level ρ and l statistics to fine-tune the Q-matrix. We also found that items with large guessing parameters (i.e., P(0)) can provide useful hints to adjust specific q-vectors. Note that we adopted suggestions or hints to change the corresponding q-vectors only if they were consistent with the adjusted definitions. The specifications of the resulting Q-matrix Q8 can be found in <xref ref-type="table" rid="table8">Table 8</xref>.</p><p>The final fit results are given in <xref ref-type="table" rid="table5">Table 5</xref>, which suggest that the model is above a 5% significance level based on either ρ or l. As shown in <xref ref-type="table" rid="table9">Table 9</xref>, the correlations of attribute mastery are more reasonable (0.46 - 0.8). It is interesting to see that α<sub>4</sub> is the most difficult attribute to master (i.e., lowest prevalence), whereas α<sub>1</sub> is the easiest one. Meanwhile, 40% of examinees failed to master α<sub>6</sub> (i.e., need more time to fully consider all items). <xref ref-type="table" rid="table1">Table 1</xref>0 presents the estimates of item parameters based on Q8. The difference in the probabilities of success between examinees who have all the required attributes and those who have none, as in, P(1) - P(0), was at least 0.35 for all items and at least 0.5 for 21 items, with a mean difference of 0.59. These results indicate that the items in the test are relatively diagnostic. Item 13 and 26 are the most difficult items, with a P(1) of 0.4 or lower. It can be seen that the guessing (i.e., P(0)) for some items (i.e., Item 1, 4, 6, 14, 16, 17, and 19) are rather large. It should not be surprising to find out that the large guessing parameters are mostly associated with multiple- choice items, except for 14, which is a complex multiple-choice item.</p></sec><sec id="s3_3"><title>3.3. Modeling Phase Three: Reduced CDM Evaluation</title><p>In this phase we attempted to obtain more interpretable reduced CDMs for the released items based on Q8. Test-level relative and absolute fit results are presented in <xref ref-type="table" rid="table1">Table 1</xref>1. The LLM had both the best relative and absolute fit whereas the DINO model had the worst. It is worth noting that the DINA model, which is the most widely used CDM, was only second to the worst. But none of the reduced CDMs can be accepted for the entire</p><table-wrap id="table5" ><label><xref ref-type="table" rid="table5">Table 5</xref></label><caption><title> Model-data fitting results for Q7 to Q8</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Q-matrix</th><th align="center" valign="middle" >−2LL</th><th align="center" valign="middle" >df</th><th align="center" valign="middle" >AIC</th><th align="center" valign="middle" >BIC</th><th align="center" valign="middle" >Max. z(r)</th><th align="center" valign="middle" >#z(r)</th><th align="center" valign="middle" >Max. z(l)</th><th align="center" valign="middle" >#z(l)</th></tr></thead><tr><td align="center" valign="middle" >Q7</td><td align="center" valign="middle" >53968</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >54258</td><td align="center" valign="middle" >55071</td><td align="center" valign="middle" >8.19</td><td align="center" valign="middle" >19</td><td align="center" valign="middle" >8.75</td><td align="center" valign="middle" >18</td></tr><tr><td align="center" valign="middle" >Q8</td><td align="center" valign="middle" >53252</td><td align="center" valign="middle" >185</td><td align="center" valign="middle" >53622</td><td align="center" valign="middle" >54660</td><td align="center" valign="middle" >3.68</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >3.42</td><td align="center" valign="middle" >0</td></tr></tbody></table></table-wrap><p>Note: critical z score = 3.61, 3.78, and 4.17 for α = 0.1, 0.05, and 0.01, respectively (with the Bonferroni correction).</p><table-wrap id="table6" ><label><xref ref-type="table" rid="table6">Table 6</xref></label><caption><title> Correlations of attribute mastery for Q6 and Q7</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >α<sub>1</sub></th><th align="center" valign="middle" >α<sub>2</sub></th><th align="center" valign="middle" >α<sub>3</sub></th><th align="center" valign="middle" >α<sub>4</sub></th><th align="center" valign="middle" >α<sub>5</sub></th></tr></thead><tr><td align="center" valign="middle" >α<sub>1</sub></td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" >0.95</td></tr><tr><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >0.85</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >0.93</td><td align="center" valign="middle" >0.90</td><td align="center" valign="middle" >0.95</td></tr><tr><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >0.89</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" >0.93</td></tr><tr><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >0.59</td><td align="center" valign="middle" >0.63</td><td align="center" valign="middle" >0.68</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >0.92</td></tr><tr><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >0.90</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" >0.66</td><td align="center" valign="middle" >-</td></tr></tbody></table></table-wrap><p>Notes: Upper diagonal for Q6; lower diagonal for Q7.</p><table-wrap id="table7" ><label><xref ref-type="table" rid="table7">Table 7</xref></label><caption><title> Adjusted definitions for α<sub>1</sub> - α<sub>5</sub></title></caption><table><tbody><thead><tr><th align="center" valign="middle" >α<sub>1</sub></th><th align="center" valign="middle" >Locating information Locate similar information in the text based on requirements or features specified in the item; examinees have to identify essential elements of an item and then to search for a match that may be literal or synonymous.</th></tr></thead><tr><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >Forming a broad general understanding Consider the text as a whole or in a broad perspective. Items include identifying the main topic, the general purpose, or use of the text, distinguishing between key ideas and minor details, or recognizing the summary of the main theme in a sentence or title.</td></tr><tr><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >Developing a logical interpretation Develop a logical understanding of what the examinees have read. Items include comparing and contrasting information, drawing inferences, or listing supporting evidence.</td></tr><tr><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >Evaluating a number-rich text with number sense Connect information in a number-rich text to number sense</td></tr><tr><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >Evaluating the quality or appropriateness of a text Items include determining the utility of a text for a specified purpose, evaluating an author’s use of textual features for specific goal, describing or commenting on author’s use of style, and identifying the author’s purpose and attitude.</td></tr></tbody></table></table-wrap><table-wrap id="table8" ><label><xref ref-type="table" rid="table8">Table 8</xref></label><caption><title> Attribute specifications for Q8</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle"  colspan="6"  >Attribute</th><th align="center" valign="middle" ></th><th align="center" valign="middle"  colspan="6"  >Attribute</th></tr></thead><tr><td align="center" valign="middle" >Item</td><td align="center" valign="middle" >α<sub>1</sub></td><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >α<sub>6</sub></td><td align="center" valign="middle" >Item</td><td align="center" valign="middle" >α<sub>1</sub></td><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >α<sub>6</sub></td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >14</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >15</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >16</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >18</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >19</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >20</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >21</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >22</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >23</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >24</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >25</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >26</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Sum</td><td align="center" valign="middle" >10</td><td align="center" valign="middle" >8</td><td align="center" valign="middle" >14</td><td align="center" valign="middle" >10</td><td align="center" valign="middle" >7</td><td align="center" valign="middle" >7</td></tr></tbody></table></table-wrap><p>Notes: α<sub>1</sub> - α<sub>5</sub>: see <xref ref-type="table" rid="table6">Table 6</xref>; α<sub>6</sub> = related to test speededness; underscored entries are different from Q1.</p><table-wrap id="table9" ><label><xref ref-type="table" rid="table9">Table 9</xref></label><caption><title> Attribute mastery correlations and prevalence for Q8</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >α<sub>1</sub></th><th align="center" valign="middle" >α<sub>2</sub></th><th align="center" valign="middle" >α<sub>3</sub></th><th align="center" valign="middle" >α<sub>4</sub></th><th align="center" valign="middle" >α<sub>5</sub></th><th align="center" valign="middle" >α<sub>6</sub></th><th align="center" valign="middle" >AP</th></tr></thead><tr><td align="center" valign="middle" >α<sub>1</sub></td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.70</td><td align="center" valign="middle" >0.57</td><td align="center" valign="middle" >0.56</td><td align="center" valign="middle" >0.78</td><td align="center" valign="middle" >0.70</td><td align="center" valign="middle" >0.65</td></tr><tr><td align="center" valign="middle" >α<sub>2</sub></td><td align="center" valign="middle" >0.70</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >0.46</td><td align="center" valign="middle" >0.53</td><td align="center" valign="middle" >0.61</td><td align="center" valign="middle" >0.64</td></tr><tr><td align="center" valign="middle" >α<sub>3</sub></td><td align="center" valign="middle" >0.57</td><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.64</td><td align="center" valign="middle" >0.70</td><td align="center" valign="middle" >0.52</td><td align="center" valign="middle" >0.55</td></tr><tr><td align="center" valign="middle" >α<sub>4</sub></td><td align="center" valign="middle" >0.56</td><td align="center" valign="middle" >0.46</td><td align="center" valign="middle" >0.64</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.74</td><td align="center" valign="middle" >0.61</td><td align="center" valign="middle" >0.46</td></tr><tr><td align="center" valign="middle" >α<sub>5</sub></td><td align="center" valign="middle" >0.78</td><td align="center" valign="middle" >0.53</td><td align="center" valign="middle" >0.70</td><td align="center" valign="middle" >0.74</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.72</td><td align="center" valign="middle" >0.54</td></tr><tr><td align="center" valign="middle" >α<sub>6</sub></td><td align="center" valign="middle" >0.70</td><td align="center" valign="middle" >0.61</td><td align="center" valign="middle" >0.52</td><td align="center" valign="middle" >0.61</td><td align="center" valign="middle" >0.72</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.60</td></tr></tbody></table></table-wrap><p>Notes: AP = attribute prevalence.</p><table-wrap id="table10" ><label><xref ref-type="table" rid="table1">Table 1</xref>0</label><caption><title> Items’ probability of success for different reduced attribute vectors</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Item</th><th align="center" valign="middle"  colspan="8"  >Estimate</th><th align="center" valign="middle"  colspan="8"  >Standard error</th></tr></thead><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" >00</td><td align="center" valign="middle" >01</td><td align="center" valign="middle" >10</td><td align="center" valign="middle" >11</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >00</td><td align="center" valign="middle" >01</td><td align="center" valign="middle" >10</td><td align="center" valign="middle" >11</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" >000</td><td align="center" valign="middle" >001</td><td align="center" valign="middle" >010</td><td align="center" valign="middle" >100</td><td align="center" valign="middle" >011</td><td align="center" valign="middle" >101</td><td align="center" valign="middle" >110</td><td align="center" valign="middle" >111</td><td align="center" valign="middle" >000</td><td align="center" valign="middle" >001</td><td align="center" valign="middle" >010</td><td align="center" valign="middle" >100</td><td align="center" valign="middle" >011</td><td align="center" valign="middle" >101</td><td align="center" valign="middle" >110</td><td align="center" valign="middle" >111</td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0.43</td><td align="center" valign="middle" >0.84</td><td align="center" valign="middle" >0.69</td><td align="center" valign="middle" >0.87</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >0.23</td><td align="center" valign="middle" >0.74</td><td align="center" valign="middle" >0.35</td><td align="center" valign="middle" >0.87</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.10</td><td align="center" valign="middle" >0.34</td><td align="center" valign="middle" >0.75</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >0.39</td><td align="center" valign="middle" >0.85</td><td align="center" valign="middle" >0.79</td><td align="center" valign="middle" >0.93</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >0.30</td><td align="center" valign="middle" >0.63</td><td align="center" valign="middle" >0.55</td><td align="center" valign="middle" >0.81</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >0.42</td><td align="center" valign="middle" >0.95</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >0.15</td><td align="center" valign="middle" >0.72</td><td align="center" valign="middle" >0.56</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >0.36</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >0.64</td><td align="center" valign="middle" >0.86</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.34</td><td align="center" valign="middle" >0.28</td><td align="center" valign="middle" >0.59</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >0.24</td><td align="center" valign="middle" >0.29</td><td align="center" valign="middle" >0.46</td><td align="center" valign="middle" >0.72</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >0.25</td><td align="center" valign="middle" >0.59</td><td align="center" valign="middle" >0.60</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.25</td><td align="center" valign="middle" >0.15</td><td align="center" valign="middle" >0.58</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.00</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.37</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >14</td><td align="center" valign="middle" >0.44</td><td align="center" valign="middle" >0.71</td><td align="center" valign="middle" >0.82</td><td align="center" valign="middle" >0.96</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >15</td><td align="center" valign="middle" >0.30</td><td align="center" valign="middle" >0.44</td><td align="center" valign="middle" >0.68</td><td align="center" valign="middle" >0.67</td><td align="center" valign="middle" >0.79</td><td align="center" valign="middle" >0.74</td><td align="center" valign="middle" >0.74</td><td align="center" valign="middle" >0.95</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.10</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.08</td><td align="center" valign="middle" >0.01</td></tr><tr><td align="center" valign="middle" >16</td><td align="center" valign="middle" >0.47</td><td align="center" valign="middle" >0.80</td><td align="center" valign="middle" >0.87</td><td align="center" valign="middle" >0.99</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >17</td><td align="center" valign="middle" >0.46</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" >0.85</td><td align="center" valign="middle" >0.99</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.00</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >18</td><td align="center" valign="middle" >0.22</td><td align="center" valign="middle" >0.78</td><td align="center" valign="middle" >0.82</td><td align="center" valign="middle" >0.97</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >19</td><td align="center" valign="middle" >0.49</td><td align="center" valign="middle" >0.85</td><td align="center" valign="middle" >0.76</td><td align="center" valign="middle" >0.94</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >20</td><td align="center" valign="middle" >0.33</td><td align="center" valign="middle" >0.83</td><td align="center" valign="middle" >0.64</td><td align="center" valign="middle" >0.96</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >21</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.50</td><td align="center" valign="middle" >0.11</td><td align="center" valign="middle" >0.21</td><td align="center" valign="middle" >0.47</td><td align="center" valign="middle" >0.00</td><td align="center" valign="middle" >0.79</td><td align="center" valign="middle" >0.87</td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.08</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.11</td><td align="center" valign="middle" >0.08</td><td align="center" valign="middle" >0.01</td></tr><tr><td align="center" valign="middle" >22</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.22</td><td align="center" valign="middle" >0.54</td><td align="center" valign="middle" >0.46</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.58</td><td align="center" valign="middle" >0.66</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.11</td><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.01</td></tr><tr><td align="center" valign="middle" >23</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.29</td><td align="center" valign="middle" >0.21</td><td align="center" valign="middle" >0.75</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.01</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >24</td><td align="center" valign="middle" >0.33</td><td align="center" valign="middle" >0.76</td><td align="center" valign="middle" >0.37</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >0.39</td><td align="center" valign="middle" >0.84</td><td align="center" valign="middle" >0.36</td><td align="center" valign="middle" >0.94</td><td align="center" valign="middle" >0.03</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.17</td><td align="center" valign="middle" >0.07</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.08</td><td align="center" valign="middle" >0.01</td></tr><tr><td align="center" valign="middle" >25</td><td align="center" valign="middle" >0.12</td><td align="center" valign="middle" >0.13</td><td align="center" valign="middle" >0.46</td><td align="center" valign="middle" >0.41</td><td align="center" valign="middle" >0.51</td><td align="center" valign="middle" >0.56</td><td align="center" valign="middle" >0.77</td><td align="center" valign="middle" >0.85</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.11</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.04</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.02</td></tr><tr><td align="center" valign="middle" >26</td><td align="center" valign="middle" >0.00</td><td align="center" valign="middle" >0.06</td><td align="center" valign="middle" >0.09</td><td align="center" valign="middle" >0.40</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >0.00</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" >0.02</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr></tbody></table></table-wrap><table-wrap id="table11" ><label><xref ref-type="table" rid="table1">Table 1</xref>1</label><caption><title> Test-level reduced model fitting</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >CDM</th><th align="center" valign="middle" >−2LL</th><th align="center" valign="middle" >df</th><th align="center" valign="middle" >AIC</th><th align="center" valign="middle" >BIC</th><th align="center" valign="middle" >Max. z(ρ)</th><th align="center" valign="middle" >#ρ</th><th align="center" valign="middle" >Max. z(l)</th><th align="center" valign="middle" >#l</th></tr></thead><tr><td align="center" valign="middle" >DINA</td><td align="center" valign="middle" >55,159</td><td align="center" valign="middle" >115</td><td align="center" valign="middle" >55,389</td><td align="center" valign="middle" >56,034</td><td align="center" valign="middle" >11.95</td><td align="center" valign="middle" >21</td><td align="center" valign="middle" >9.22</td><td align="center" valign="middle" >23</td></tr><tr><td align="center" valign="middle" >DINO</td><td align="center" valign="middle" >55,870</td><td align="center" valign="middle" >115</td><td align="center" valign="middle" >56,100</td><td align="center" valign="middle" >56,745</td><td align="center" valign="middle" >13.62</td><td align="center" valign="middle" >25</td><td align="center" valign="middle" >14.57</td><td align="center" valign="middle" >25</td></tr><tr><td align="center" valign="middle" >A-CDM</td><td align="center" valign="middle" >53,647</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >53,937</td><td align="center" valign="middle" >54,750</td><td align="center" valign="middle" >7.68</td><td align="center" valign="middle" >12</td><td align="center" valign="middle" >6.47</td><td align="center" valign="middle" >11</td></tr><tr><td align="center" valign="middle" >LLM</td><td align="center" valign="middle" >53,400</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >53,690</td><td align="center" valign="middle" >54,503</td><td align="center" valign="middle" >4.49</td><td align="center" valign="middle" >2</td><td align="center" valign="middle" >4.76</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" >R-RUM</td><td align="center" valign="middle" >53,750</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >54,040</td><td align="center" valign="middle" >54,853</td><td align="center" valign="middle" >8.47</td><td align="center" valign="middle" >6</td><td align="center" valign="middle" >7.15</td><td align="center" valign="middle" >7</td></tr></tbody></table></table-wrap><p>Note: critical z score = 4.17 for α = 0.01 (with the Bonferroni correction).</p><p>test at a 0.01 significance level, although the LLM was relatively close. This implied that the test might consist of items that were appropriate with different reduced CDMs, and possibly a saturated CDM. More specific item- level fit evaluation might be needed to determine appropriate reduced models for each item, which is beyond the scope of this paper.</p></sec><sec id="s3_4"><title>3.4. Modeling Phase Four: Cross-Validation</title><p>In this phase, we cross-validated the fit results of the final Q-matrix and reduced CDMs using different data. Examinees from the US were used, and the fit results based on the final Q-matrix are shown in <xref ref-type="table" rid="table1">Table 1</xref>2. Considering the differences of the sample size and educational system between the UK and US, the results can be considered quite similar. Specifically, for both countries: 1) the final Q-matrix based on the saturated model can be accepted at 0.05 significance level; and 2) the test-level fit of the reduced CDMs based on either relative or absolute fit measures followed a similar ranking (i.e., the LLM is better than the R-RUM or A-CDM, both of which are better than the DINA or DINO model).</p></sec></sec><sec id="s4"><title>4. Discussions</title><p>This research proposed a systematic procedure for modeling extant large-scale assessments for diagnostic purposes. The procedure can be divided into four phases, namely, initial attribute definition and Q-matrices construction, final attribute definition and Q-matrix construction, reduced CDM evaluation, and cross-validation. In the procedure, we adopted and integrated recent methodological developments in cognitive diagnosis modeling, including the development of general and reduced CDMs, various absolute and relative fit measures, and a general Q-matrix validation procedure. The PISA reading assessment was employed to demonstrate the modeling procedure in practice, which resulted in attributes with meaningful definitions and an appropriate Q-matrix. The modeling procedure can be generalized to other large-scale assessments like NAEP or TIMSS and/or other domain areas like mathematics or science, provided that adequate initial attributes and released items can be found. The current dichotomous attributes can be also extended to expert-defined polytomous attributes  (Chen &amp; de la Torre, 2012)  based on the five-level reading processes in the PISA assessment framework  (OECD, 2006a: p. 61)  to provide more useful diagnostic information, given that more item information is available. We have a note of caution in determining the Q-matrix specification: if we only rely on the indices to adjust the q-vectors, it is possible to obtain a Q-matrix which has an acceptable absolute fit but is theoretically incorrect (i.e., the attribute specifications for some items are not consistent with the attribute definitions). Accordingly, it is important for researchers to also exercise subjective judgment and not solely rely on objective criteria to make sure that the changes in the attribute specifications are always consistent with how the attributes have been defined.</p><p>In addition to some practical implications, this research also highlights a few issues that, when adequately addressed, can improve existing CDM methodologies. First, here we estimate the joint distribution of all attributes, which would be cumbersome as the number of attributes becomes large. Finding a more efficient way to estimate the attribute distributions could help us to detect the attributes’ relationship more easily. Second, the number of items is important for CDMs in providing accurate classification of the examinees. To accommodate more released items from large-scale assessments, design issues such as missing data across multiple booklets and varying sampling weights need to be addressed. This would require modifying existing CDM procedures (e.g., calibration, Q-matrix validation) to effectively handle these issues.</p><table-wrap id="table12" ><label><xref ref-type="table" rid="table1">Table 1</xref>2</label><caption><title> Q-matrix and reduced CDMs fitting for the US</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >CDM</th><th align="center" valign="middle" >−2LL</th><th align="center" valign="middle" >df</th><th align="center" valign="middle" >AIC</th><th align="center" valign="middle" >BIC</th><th align="center" valign="middle" >Max. z(ρ)</th><th align="center" valign="middle" >#ρ</th><th align="center" valign="middle" >Max. z(l)</th><th align="center" valign="middle" >#l</th></tr></thead><tr><td align="center" valign="middle" >Saturated</td><td align="center" valign="middle" >21,475</td><td align="center" valign="middle" >185</td><td align="center" valign="middle" >21,845</td><td align="center" valign="middle" >22,712</td><td align="center" valign="middle" >3.27</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >3.43</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >DINA</td><td align="center" valign="middle" >22,448</td><td align="center" valign="middle" >115</td><td align="center" valign="middle" >22,678</td><td align="center" valign="middle" >23,217</td><td align="center" valign="middle" >8.54</td><td align="center" valign="middle" >13</td><td align="center" valign="middle" >6.71</td><td align="center" valign="middle" >14</td></tr><tr><td align="center" valign="middle" >DINO</td><td align="center" valign="middle" >22,461</td><td align="center" valign="middle" >115</td><td align="center" valign="middle" >22,691</td><td align="center" valign="middle" >23,230</td><td align="center" valign="middle" >6.12</td><td align="center" valign="middle" >14</td><td align="center" valign="middle" >6.28</td><td align="center" valign="middle" >16</td></tr><tr><td align="center" valign="middle" >A-CDM</td><td align="center" valign="middle" >21,676</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >21,966</td><td align="center" valign="middle" >22,646</td><td align="center" valign="middle" >5.50</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >4.82</td><td align="center" valign="middle" >3</td></tr><tr><td align="center" valign="middle" >LLM</td><td align="center" valign="middle" >21,597</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >21,887</td><td align="center" valign="middle" >22,567</td><td align="center" valign="middle" >3.68</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >3.94</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >R-RUM</td><td align="center" valign="middle" >21,776</td><td align="center" valign="middle" >145</td><td align="center" valign="middle" >22,066</td><td align="center" valign="middle" >22,746</td><td align="center" valign="middle" >6.60</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >5.63</td><td align="center" valign="middle" >3</td></tr></tbody></table></table-wrap><p>Note: critical z score =3.61, 3.78, and 4.17 for α = 0.1, 0.05, and 0.01, respectively (with the Bonferroni correction).</p><p>To the extent possible, large-scale assessment data should be harnessed to provide information of additional practical value, considering the sizeable amount of resources invested in collecting them. With the recent CDM developments, we are in a better position to utilize extant large-scale data for diagnostic purpose. However, most large-scale assessments are still designed based on unidimensional IRM. Although current CDM developments allow them to be retrofitted more effectively compared to a few years ago, we are cognizant that retrofitting of any CDM to data that were not originally designed to provide diagnostic information will always provide less than ideal results. With the maturation of CDM methodologies, it appears that now is a propitious time to incorporate these methodologies into the development of large-scale assessments so that the assessments can provide optimal diagnostic information.</p></sec><sec id="s5"><title>Acknowledgements</title><p>This research was supported by Sun Yat-sen University Start-Up Grant No. 26000-18801031.</p></sec><sec id="s6"><title>NOTES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.51663-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Akaike, H. (1974). A New Look at the Statistical Identification Model. IEEE Transactions on Automated Control, 19, 716-723. http://dx.doi.org/10.1109/TAC.1974.1100705</mixed-citation></ref><ref id="scirp.51663-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Chen, J., &amp; de la Torre, J. (2012). An Extension of the G-DINA Model for Polytomous Attributes. Paper Presented at the Annual Meeting of American Educational Research Association, Vancouver.</mixed-citation></ref><ref id="scirp.51663-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Chen, J., &amp; de la Torre, J. (2013). A General Cognitive Diagnosis Model for Expert-Defined Polytomous Attributes. Applied Psychological Measurement, 37, 419-437. http://dx.doi.org/10.1177/0146621613479818</mixed-citation></ref><ref id="scirp.51663-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Chen, J., de la Torre, J., &amp; Zhang, Z. (2013). Relative and Absolute Fit Evaluation in Cognitive Diagnosis Modeling. Journal of Educational Measurement, 50, 123-140. http://dx.doi.org/10.1111/j.1745-3984.2012.00185.x</mixed-citation></ref><ref id="scirp.51663-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">de la Torre, J. (2008). An Empirically-Based Method of Q-Matrix Validation for the DINA Model: Development and Applications. Journal of Educational Measurement, 45, 343-362. http://dx.doi.org/10.1111/j.1745-3984.2008.00069.x</mixed-citation></ref><ref id="scirp.51663-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">de la Torre, J. (2011). The Generalized DINA Model Framework. Psychometrika, 76, 179-199. http://dx.doi.org/10.1007/s11336-011-9207-7</mixed-citation></ref><ref id="scirp.51663-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">de la Torre, J., &amp; Chen, J. (2011). Estimating Different Reduced Cognitive Diagnosis Models Using a General Framework. Paper Presented at the Annual Meeting of the National Council on Measurement in Education, New Orleans.</mixed-citation></ref><ref id="scirp.51663-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">de la Torre, J., &amp; Chiu, C.-Y. (2010). A General Method of Empirical Q-Matrix Validation Using the G-DINA Model Discrimination Index. Paper Presented at the Annual Meeting of the National Council on Measurement in Education, Denver.</mixed-citation></ref><ref id="scirp.51663-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">de la Torre, J., &amp; Lee, Y.-S. (2010). Item-Level Comparison of Saturated and Reduced Cognitive Diagnosis Models. Paper Presented at the Annual Meeting of the National Council on Measurement in Education, Denver.</mixed-citation></ref><ref id="scirp.51663-ref10"><label>10</label><mixed-citation publication-type="book" xlink:type="simple">DiBello, L., Roussos, L., &amp; Stout, W. (2007). Cognitive Diagnosis Part I. In C. R. Rao, &amp; S. Sinharay (Eds.), Handbook of Statistics (Vol. 26): Psychometrics (pp. 979-1030). Amsterdam: Elsevier.</mixed-citation></ref><ref id="scirp.51663-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Haertel, E. H. (1989). Using Restricted Latent Class Models to Map the Skill Structure of Achievement Items. Journal of Educational Measurement, 26, 301-321. http://dx.doi.org/10.1111/j.1745-3984.1989.tb00336.x</mixed-citation></ref><ref id="scirp.51663-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Hartz, S. (2002). A Bayesian Framework for the Unified Model for Assessing Cognitive Abilities: Blending Theory with Practicality. Unpublished Doctoral Dissertation, University of Illinois at Urbana-Champaign.</mixed-citation></ref><ref id="scirp.51663-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Henson, R. A., Templin, J., &amp; Willse, J. (2009). Defining a Family of Cognitive Diagnosis Models Using Log-Linear Models with Latent Variables. Psychometrika, 74, 191-210. http://dx.doi.org/10.1007/s11336-008-9089-5</mixed-citation></ref><ref id="scirp.51663-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Jang, E. E. (2009). Cognitive Diagnostics Assessment of L2 Reading Comprehension Ability: Validity Arguments for Fusion Model Application to Langu Edge Assessment. Language Testing, 26, 31-73. http://dx.doi.org/10.1177/0265532208097336</mixed-citation></ref><ref id="scirp.51663-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Junker, B. W., &amp; Sijtsma, K. (2001). Cognitive Assessment Models with Few Assumptions, and Connections with Non-Parametric Item Response Theory. Applied Psychological Measurement, 25, 258-272. http://dx.doi.org/10.1177/01466210122032064</mixed-citation></ref><ref id="scirp.51663-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Kunina-Habenicht, O., Rupp, A. A., &amp; Wilhelm, O. (2012). The Impact of Model Misspecification on Parameter Estimation and Item-Fit Assessment in Log-Linear Diagnostic Classification Models. Journal of Educational Measurement, 49, 59-81. http://dx.doi.org/10.1111/j.1745-3984.2011.00160.x</mixed-citation></ref><ref id="scirp.51663-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Lee, Y., &amp; Sawaki, Y. (2009). Application of Three Cognitive Diagnosis Models to ESL Reading and Listening Assessments. Language Assessment Quarterly, 6, 239-263. http://dx.doi.org/10.1080/15434300903079562</mixed-citation></ref><ref id="scirp.51663-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Maris, E. (1999). Estimating Multiple Classification Latent Class Models. Psychometrika, 64, 187-212. http://dx.doi.org/10.1007/BF02294535</mixed-citation></ref><ref id="scirp.51663-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">OECD (1999). Measuring Student Knowledge and Skills: A New Framework for Assessment. Paris: Organization for Economic Cooperation and Development.</mixed-citation></ref><ref id="scirp.51663-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">OECD (2006a). Assessing Scientific, Reading and Mathematical Literacy: A Framework for PISA 2006. Paris: Organization for Economic Cooperation and Development.</mixed-citation></ref><ref id="scirp.51663-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">OECD (2006b). PISA Released Items: Reading. http://www.oecd.org/pisa/38709396.pdf</mixed-citation></ref><ref id="scirp.51663-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Schwarzer, G. (1976). Estimating the Dimension of a Model. Annals of Statistics, 6, 461-464. http://dx.doi.org/10.1214/aos/1176344136</mixed-citation></ref><ref id="scirp.51663-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Tatsuoka, K. K. (1983). Rule Space: An Approach for Dealing with Misconceptions Based on Item Response Theory. Journal of Educational Measurement, 20, 345-354. http://dx.doi.org/10.1111/j.1745-3984.1983.tb00212.x</mixed-citation></ref><ref id="scirp.51663-ref24"><label>24</label><mixed-citation publication-type="book" xlink:type="simple">Tatsuoka, K. K. (1990). Toward an Integration of Item-Response Theory and Cognitive Error Diagnosis. In N. Frederiksen, R. Glaser, A. Lesgold, &amp; M. Safto (Eds.), Diagnostic Monitoring Skills and Knowledge Acquisition (pp. 453-488). Hillsdale, NJ: Erlbaum.</mixed-citation></ref><ref id="scirp.51663-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Templin, J., &amp; Henson, R. A. (2006). Measurement of Psychological Disorders Using Cognitive Diagnosis Models. Psychological Methods, 11, 287-305. http://dx.doi.org/10.1037/1082-989X.11.3.287</mixed-citation></ref><ref id="scirp.51663-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">von Davier, M. (2008). A General Diagnostic Model Applied to Language Testing Data. British Journal of Mathematical and Statistical Psychology, 61, 287-307. http://dx.doi.org/10.1348/000711007X193957</mixed-citation></ref></ref-list></back></article>