<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OJBM</journal-id><journal-title-group><journal-title>Open Journal of Business and Management</journal-title></journal-title-group><issn pub-type="epub">2329-3284</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ojbm.2017.52022</article-id><article-id pub-id-type="publisher-id">OJBM-75414</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Business&amp;Economics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Research on Personal Credit Assessment Based on Neural Network-Logistic Regression Combination Model
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Yajuan</surname><given-names>Huo</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Huazhou</surname><given-names>Chen</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Jiechao</surname><given-names>Chen</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>College of Science, Guilin University of Technology, Guilin, China</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>hzchengut@foxmail.com(HC)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>13</day><month>04</month><year>2017</year></pub-date><volume>05</volume><issue>02</issue><fpage>244</fpage><lpage>252</lpage><history><date date-type="received"><day>19,</day>	<month>March</month>	<year>2017</year></date><date date-type="rev-recd"><day>9,</day>	<month>April</month>	<year>2017</year>	</date><date date-type="accepted"><day>14,</day>	<month>April</month>	<year>2017</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  With the development of economic globalization and financial liberalization, credit assessment plays an important role in maintaining the normal relationship of social economy. Personal credit assessment requires establishing calibration models with statistic methods. The mono-method-based models are not capable to simultaneously hold the robustness, interpretation and prediction accuracy of the models. In this paper, back-propagation neural network (BPNN) was used to generate a new comprehensive variable for logistic regression (LR) by tuning the number of hidden nodes. The optimal back-propagation neural network-logistic regression combination model (BPNN-LR) was established with 5 input nodes, 7 hidden nodes and 1 output node. The model performance was slightly improved. The prediction accuracy was raised up to 86.33% and 87.96% for the training samples and the test samples, respectively. Results showed that the BPNN-LR model had higher classification accuracy than the LR model. It is concluded that the outcome performance provides technical reference for the corporation’s decision making.
 
</p></abstract><kwd-group><kwd>Personal Credit Assessment</kwd><kwd> Back-Propagation Neural Network</kwd><kwd> Logistic Regression</kwd><kwd> Information Value</kwd><kwd> Weight of Evidence</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Credit maintains the normal relationship of social economy. Personal credit is the basis of the social credit. As a result of the imperfect personal credit system, the confusion of personal credit relationship, fraud, repudiation and other dishonesty behaviors prevail universally. This situation has not only become a huge obstacle to the development of market economy. But also has seriously affected the progress and development of society [<xref ref-type="bibr" rid="scirp.75414-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.75414-ref2">2</xref>] . Credit assessment is used to evaluate credibility for individuals or corporations. It plays a key role in the development of economic globalization and financial liberalization [<xref ref-type="bibr" rid="scirp.75414-ref3">3</xref>] . It is necessary to establish statistic model for predicting personal credibility. Thus the customers can be discriminated as having good credit or having bad credit [<xref ref-type="bibr" rid="scirp.75414-ref4">4</xref>] .</p><p>Statistic methods are widely used for the personal credit assessment. Linear discriminant and logistic regression (LR) are the basic robust and interpretative linear methods. They generate a linear scorecard to evaluate customer’s credit [<xref ref-type="bibr" rid="scirp.75414-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.75414-ref6">6</xref>] [<xref ref-type="bibr" rid="scirp.75414-ref7">7</xref>] . But the linear methods meet the difficulties of extracting the nonlinear relationship from data. Therefore, nonlinear machine learning methods are introduced, such as classification decision tree, support vector machine (SVM), neural network (NN), etc. They are feasible to effectively improve the predictive performance of the model [<xref ref-type="bibr" rid="scirp.75414-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.75414-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.75414-ref10">10</xref>] . However, the mono-method-based models cannot take into consideration all demands of model robustness, prediction accuracy and variables’ interpretation. Therefore, combination optimization of multiple methods has become an important developing for credit assessment [<xref ref-type="bibr" rid="scirp.75414-ref11">11</xref>] . Logistic regression-support vector machine combination (LR-SVM) model could extract significant variables, simplify the model structure, reduce the computational complexity and improve the model efficiency [<xref ref-type="bibr" rid="scirp.75414-ref12">12</xref>] . Neural network combined with multivariable linear regression model could optimize the initial solution of the neural network, thus to shorten the calculation time consumption and to improve the classification accuracy [<xref ref-type="bibr" rid="scirp.75414-ref13">13</xref>] . In this paper, the personal credit assessment models were established by combining back-propagation neural network (BPNN) and logistic regression, to estimate the customers’ behaviors for a communication corporation.</p><p>The records of customers’ behaviors (January-June, 2015) were collected from the transaction data of China Unicom. The numerical variables were transformed using normalization and classification variables are transformed to the binary classification form. The contribution of each variable was evaluated by IV and some of the most contributive valuables were selected for model calibration and prediction. The selected transformed variables were the basic data for model optimization. First, the LR models were established to calculate the probability of being bad credit for each customer, in order to discriminate that this customer has good credit or bad credit. Then, a BPNN model with one hidden layer was used to generate a new comprehensive variable for model optimization by tuning and selecting the number of hidden nodes. Thus, an optimal back-propaga- tion neural network-logistic regression combination model (BPNN-LR) was determined for the evaluation of customer’s personal credit, so that the LR classification accuracy was improved. In our study, Python and Matlab are combined used for sample classification, data transformation and model optimization.</p></sec><sec id="s2"><title>2. Data and Methods</title><sec id="s2_1"><title>2.1. Experimental Data</title><p>A total of 4518 customer behavior records were collected from transaction data of China Unicom. These records from January to June 2015 were scrambled as samples for establishing statistic models. And each sample included 24 variables (as shown in <xref ref-type="table" rid="table1">Table 1</xref>). One of these valuables was used to evaluate customer’s personal credit, by which the customers could be discriminated as having good credit or having bad credit. The customers who had good credit were labeled as “good” customers and the customers who had bad credit were labeled as “bad” customers, so that this valuable was defined as “customer credibility”. The remaining 23 variables described customers’ daily communication behaviors. They were regarded as the characteristic variables in the modeling processes. As the customer credibility recording, there were 2574 actual “good” customers(valued as 0), and 1944 actual “bad” customers (valued as 1). To achieve the modeling and predicting tasks, we randomly divided all of the 4518 samples into two parts at around the ratio of 3:1. One part was used to train the model, containing 3388 samples as the training set, and the other part containing 1130 samples as the test set.</p></sec><sec id="s2_2"><title>2.2. Data Transformation</title><p>Establishing personal credit assessment model was used to predict the value of customer credibility (Y) with 23 characteristic variables, so that the customers could be discriminated as “good” or “bad” customers. The 23 characteristic variables included 16 numerical variables (T) and 7 classification variables (C). It was necessary to normalize the numerical variables, because they were acquired</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> The variables</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="2"  >Classification variable</th><th align="center" valign="middle"  colspan="4"  >Numerical variable</th></tr></thead><tr><td align="center" valign="middle" >Variable</td><td align="center" valign="middle" >Symbol</td><td align="center" valign="middle" >Variable</td><td align="center" valign="middle" >Symbol</td><td align="center" valign="middle" >Variable</td><td align="center" valign="middle" >Symbol</td></tr><tr><td align="center" valign="middle" >Customer credibility</td><td align="center" valign="middle" >Y</td><td align="center" valign="middle" >Times of payment arrearage</td><td align="center" valign="middle" >T1</td><td align="center" valign="middle" >Totalnumber of communicational friends</td><td align="center" valign="middle" >T9</td></tr><tr><td align="center" valign="middle" >IS converged with other service</td><td align="center" valign="middle" >C1</td><td align="center" valign="middle" >Amount of payment arrearage</td><td align="center" valign="middle" >T2</td><td align="center" valign="middle" >Number of intranet communicational friends</td><td align="center" valign="middle" >T10</td></tr><tr><td align="center" valign="middle" >IS real-name registered</td><td align="center" valign="middle" >C2</td><td align="center" valign="middle" >Online duration</td><td align="center" valign="middle" >T3</td><td align="center" valign="middle" >Number of outer communicational friends</td><td align="center" valign="middle" >T11</td></tr><tr><td align="center" valign="middle" >IS bound with bankcards</td><td align="center" valign="middle" >C3</td><td align="center" valign="middle" >Accumulative days of communication</td><td align="center" valign="middle" >T4</td><td align="center" valign="middle" >Accumulation of used data</td><td align="center" valign="middle" >T12</td></tr><tr><td align="center" valign="middle" >IS an attractive number</td><td align="center" valign="middle" >C4</td><td align="center" valign="middle" >Times of accumulative calls</td><td align="center" valign="middle" >T5</td><td align="center" valign="middle" >Monthly average of the on-bill amounts</td><td align="center" valign="middle" >T13</td></tr><tr><td align="center" valign="middle" >IS registered to APP’s</td><td align="center" valign="middle" >C5</td><td align="center" valign="middle" >Monthly average of arrearages</td><td align="center" valign="middle" >T6</td><td align="center" valign="middle" >Monthly average of payments</td><td align="center" valign="middle" >T14</td></tr><tr><td align="center" valign="middle" >IS a grouped number</td><td align="center" valign="middle" >C6</td><td align="center" valign="middle" >Accumulative callduration</td><td align="center" valign="middle" >T7</td><td align="center" valign="middle" >Accumulation of overdue payment arrearages</td><td align="center" valign="middle" >T15</td></tr><tr><td align="center" valign="middle" >On/off line</td><td align="center" valign="middle" >C7</td><td align="center" valign="middle" >Times of being suspended</td><td align="center" valign="middle" >T8</td><td align="center" valign="middle" >Times of international roaming</td><td align="center" valign="middle" >T16</td></tr></tbody></table></table-wrap><p>using different measurement scales and valued in different algebraic ranges. While the seven classification variables had different number of classification target. For example, the values of “IS converged with other service” (C1) included 0, 1, 2 and 3. This valuing diversity was probable to result in high dimensional calculation. In order to reduce model’s complexity, they were transformed to binary classification variable, equaling to either 0 or 1.</p></sec><sec id="s2_3"><title>2.3. Variable Selection</title><p>The contributions of 23 characteristic variables were different. The calibration model could be much complex and the prediction accuracy be affected if all the 23 variables were used for personal credit assessment. Therefore, it was necessary to select characteristic variables according to their contribution. The contributions of variables are quantitatively measured by calculating the Weight of Evidence (WOE) for divided sample groups. WOE is used to measure the difference between variables and between different sample groups [<xref ref-type="bibr" rid="scirp.75414-ref14">14</xref>] . The value of WOE<sub>i</sub> for the i-th group was calculated as follows,</p><p>WOE i = ln ( G i / G B i / B )</p><p>where G represented the “good” customers. B represented the “bad” customers. The Information Value (IV) of each variable is successively defined by weighted sum of the grouping WOE’s,</p><p>I V = ∑ i = 1 n WOE i &#215; ( G i G − B i B )</p><p>A larger IV represent the variable contributes greater to the models [<xref ref-type="bibr" rid="scirp.75414-ref15">15</xref>] .</p></sec><sec id="s2_4"><title>2.4. The Logistic Regression Model</title><p>Logistic regression is a common classification model in machine learning field. The personal credit was estimated by establishing logistic regression models with the selected transformed variables. To achieve the calibrating and predicting tasks, the customer credibility (Y) was defined as the dependent variable. And the selected variables were defined as the independent variables (labeled as X<sub>1</sub>, X<sub>2</sub>, ・・・, X<sub>n</sub>). The logistic regression formula was as follows,</p><p>ln ( P 1 − P ) = β 0 + β 1 X 1 + β 2 X 2 + ⋯ + β n X n</p><p>where, P was the probability of a customer being “bad”, i.e. P = P (Y = 1). The P-value resulting from logistic regression model was used to distinguish “good” customers and “bad” customers.</p></sec><sec id="s2_5"><title>2.5. The BPNN-LR Model</title><p>The BPNN-LR model was to utilize back-propagation neural network to train a new comprehensive valuable for logistic regression. The neural network was formed as having one hidden layer with k hidden neurons. The parameters (the weights) of the whole network should be trained by iteration.</p><p>Step 1, the weights of any neuron between the input and the hidden layer (V<sub>ij</sub>) were preliminarily initialized. The selected characteristic variables X<sub>1</sub>, X<sub>2</sub>, ・・・, X<sub>n</sub> were input and the weighted sum of them was passed to the hidden layer by logistic function. The output of the hidden layer was defined as follows,</p><p>H j = f ( ∑ i = 0 n V i j X i ) , j = 1 , 2 , ⋯ , k</p><p>Step 2, the weights of any neuron between the hidden and the output layer (W<sub>jp</sub>) were initialized. The output of the hidden layer were weighted and summed, then transferred to the output layer. The final output of the neural network was as follows,</p><p>Y p = f ( ∑ j = 0 k W j p H j ) , p = 1 , 2 , ⋯ , m</p><p>where, the logistic function was used as the transfer function f, V<sub>ij</sub> was defined as the weight of the i-th input node and the j-th hidden node and W<sub>jp</sub> was defined as weight of the j-th hidden node and the p-th output. For each weighted sum, the i and j equaling to 0 meant that V<sub>0j</sub> and W<sub>0p</sub> were the alternatives of the thresholds of the neurons when X<sub>0</sub> equaled to 1 and H<sub>0</sub> equaled to 1.</p><p>The output of BP neural network was regarded as a new comprehensive variable added to logistic regression, so that the predictions of customer credibility of all samples were accomplished. According to the predictive results, the optimal number of hidden neurons was selected for the improvement of the BPNN-LR model and the “good/bad” customers identified.</p></sec></sec><sec id="s3"><title>3. Results and Discussions</title><p>A total of 4518 samples were randomly divided into training set and test set at around the ratio of 3:1, containing 3388 training samples and 1130 test samples. Each sample included 24 variables. One of these variables was customer credibility (Y), by which the customers could be discriminated as “good” (valued as 0) or “bad” (valued as 1). We defined the customer credibility as the dependent variable and the 23 characteristic variables as the independent variables to establish LR and BPNN-LR models for personal credit assessment, so that the customers could be distinguished as “good” or “bad”.</p><p>In order to reduce the complexity of the model, it was necessary to select variables according to their contribution (i.e. the IV values). A larger IV represent the variable contributes greater to the models. The variables were sorted in descending order of IV (shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>). Several variables at the top of the IV chart were expected to produce most contribution. For example, “On/off line” (C7) was at the top of the figure, with its IV equaling to 2.77. The first runners-up were “Times of payment arrearage” (T1), “Amount of payment arrearage” (T2), “Online duration” (T3) and “Accumulative days of communication” (T4). Each of them contributed much less than C7. As for dimension reduction, we established the self-adaptive models by tuning the different numbers of variables (from 4 to 10). The prediction accuracy of different number of selected variables were shown in <xref ref-type="fig" rid="fig2">Figure 2</xref>. The prediction accuracy of using 5 variables was slightly better than using four variables, but almost the same as using more-than-five variables. Thus the top five variables with the largest IV values were selected to establish an optimal model. The 5 selected variables were: On/off line (C7), Times of payment arrearage (T1), Amount of payment arrearage (T2), online duration (T3) and Accumulative days of communication (T4).</p><p>The logistic regression method was used to establish the personal credit assessment model to distinguish “good” customers and “bad” customers by the P-value resulting from the model. The training samples with the most contributive characteristic valuables were input to the model to predict the P-values of the test samples. If P was greater than 0.5, the sample was classified as “bad” customers (Y = 1), otherwise it was classified as “good” customers (Y = 0).</p><p>Results of the logistic regression model were shown in <xref ref-type="table" rid="table2">Table 2</xref>. The classification accuracy of the training samples was 86.25% and the classification accuracy of the test samples was 84.87%.</p><p>In order to improve the predictive performance of the logistic regression model, the BPNN method was used to generate a new comprehensive valuable for logistic regression model. And a BPNN-LR model was established. To select the number of BPNN’s hidden neurons (k), we established the models with tuning the number of hidden neurons (from 4 to 10). Initializing the weights between the input and the hidden layer (V<sub>ij</sub>), and between the hidden and the output layer (W<sub>j</sub><sub>1</sub>), the logistic function was used as the transfer function. Iterative training the parameters (the weights) of the whole network, the classification accuracy of the test samples corresponding to different k values were shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>. As shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>, the classification accuracy of the BPNN-LR model was superior to the logistic regression model. And the optimal number of hidden neurons was 7. Therefore, the BP neural network included 7 hidden nodes, 5 input nodes (i.e. The selected 5 most contributive characteristic variables) and 1 output node (the customer credibility). The optimal BPNN-LR model was optimized with the output neuron having the weights of W<sub>01</sub> = −4.14, W<sub>11</sub> = −3.48, W<sub>21</sub> = −0.71, W<sub>31</sub> = 0.34, W<sub>41</sub> = 6.81, W<sub>51</sub> = −3.04, W<sub>61</sub> = −3.37 and W<sub>71</sub> = 5.33. Results of the BPNN-LR model were shown in <xref ref-type="table" rid="table3">Table 3</xref>. The classification accuracy of the training samples was 86.33%. And the classification accuracy of the test samples was 87.96%.</p></sec><sec id="s4"><title>4. Conclusion</title><p>In this paper, the customer transaction data from China Unicom were used to establish personal credit assessment model. These data were from January to</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> The confusion matrix for the binary classification model based on logistic regression</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="2"   rowspan="2"  >Training set</th><th align="center" valign="middle"  colspan="2"  >Prediction</th><th align="center" valign="middle"  rowspan="2"  >Accuracy (%)</th><th align="center" valign="middle"  colspan="2"   rowspan="2"  >Test set</th><th align="center" valign="middle"  colspan="2"  >Prediction</th><th align="center" valign="middle"  rowspan="2"  >Accuracy (%)</th></tr></thead><tr><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle"  rowspan="2"  >Reality</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1784</td><td align="center" valign="middle" >129</td><td align="center" valign="middle" >93.25</td><td align="center" valign="middle"  rowspan="2"  >Reality</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >572</td><td align="center" valign="middle" >89</td><td align="center" valign="middle" >86.53</td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >337</td><td align="center" valign="middle" >1138</td><td align="center" valign="middle" >77.15</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >82</td><td align="center" valign="middle" >387</td><td align="center" valign="middle" >84.00</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Total accuracy (%)</td><td align="center" valign="middle"  colspan="3"  >86.25</td><td align="center" valign="middle"  colspan="2"  >Total accuracy (%)</td><td align="center" valign="middle"  colspan="3"  >84.87</td></tr></tbody></table></table-wrap><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> The confusion matrix for the binary classification model based on BPNN-LR model</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="2"   rowspan="2"  >Training set</th><th align="center" valign="middle"  colspan="2"  >Prediction</th><th align="center" valign="middle"  rowspan="2"  >Accuracy (%)</th><th align="center" valign="middle"  colspan="2"   rowspan="2"  >Test set</th><th align="center" valign="middle"  colspan="2"  >Prediction</th><th align="center" valign="middle"  rowspan="2"  >Accuracy (%)</th></tr></thead><tr><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle"  rowspan="2"  >Reality</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1789</td><td align="center" valign="middle" >124</td><td align="center" valign="middle" >93.52</td><td align="center" valign="middle"  rowspan="2"  >Reality</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >630</td><td align="center" valign="middle" >31</td><td align="center" valign="middle" >95.31</td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >339</td><td align="center" valign="middle" >1136</td><td align="center" valign="middle" >77.02</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >105</td><td align="center" valign="middle" >364</td><td align="center" valign="middle" >77.61</td></tr><tr><td align="center" valign="middle" >Total accuracy (%)</td><td align="center" valign="middle"  colspan="4"  >86.33</td><td align="center" valign="middle" >Total accuracy (%)</td><td align="center" valign="middle"  colspan="4"  >87.96</td></tr></tbody></table></table-wrap><p>June 2015. The LR model and the BPNN-LR model were respectively established to discriminate between “good” customers and “bad” customers. First of all, the numerical variables were normalized and classification variables were transformed to binary classification variables. Then, variables were selected according to their contribution measured by calculating the IV. The selected five variables with the largest IV were used to establish the model and predict. The P-value resulting from the logistic regression model was used to distinguish “good” customers and “bad” customers. The accuracy of the training samples was 86.25%. And the accuracy of the test samples was 84.87%. In order to optimize the model, BPNN method was used to generate a new comprehensive valuable for logistic regression model. And a BPNN-LR model was established. The optimal BPNN included 5 input nodes, 7 hidden nodes and 1 output node. The classification accuracy of the training samples was 86.33%. And the classification accuracy of the test samples was 87.96%. The results showed that the classification accuracy of the BPNN-LR model was higher than the LR model. This indicated that the new comprehensive valuable generated by BPNN was feasible to highly interpret the characteristic valuables. The method of BPNN combined with LR modeling provided technical references to distinguish “good” customers and “bad” customers and provide decision-making basis for a corporation.</p></sec><sec id="s5"><title>Acknowledgements</title><p>This work was supported by National Natural Scientific Foundation of China (61505037) and Natural Scientific Foundations of Guangxi (2016GXNSFBA- 380077, 2015GXNSFBA139259).</p></sec><sec id="s6"><title>Cite this paper</title><p>Huo, Y.J., Chen, H.Z. and Chen, J.C. (2017) Research on Personal Credit Assessment Based on Neural Network-Logistic Regression Combination Model. Open Journal of Business and Management, 5, 244-252. https://doi.org/10.4236/ojbm.2017.52022</p></sec></body><back><ref-list><title>References</title><ref id="scirp.75414-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Fang, K.N., Wu, J. and Zhu, J. (2010) Forecasting of Credit Card Credit Risk Under Asymmetric Information Based on Nonparametric Random Forests. Economic Research Journal, 39, 97-107.</mixed-citation></ref><ref id="scirp.75414-ref2"><label>2</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Liu</surname><given-names> L. </given-names></name>,<etal>et al</etal>. (<year>2007</year>)<article-title>Retail Exposures Credit Scoring Models for Chinese Commercial Banks: Model Specification and its Application</article-title><source> Journal of Finance and Economics</source><volume> 33</volume>,<fpage> 26</fpage>-<lpage>36</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.75414-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Wang, Y., Nie, G. and Shi, Y. (2012) A Research on Customers Default Rate of Commercial Banks in China Based on Credit Scoring Models. Management Review, 24, 80-89.</mixed-citation></ref><ref id="scirp.75414-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Guo, Y., Zhou, W. and Luo, C. (2015) Instance-Based Credit Risk Assessment for Investment Decisions in P2P Lending. European Journal of Operational Research, 24, 417-426.</mixed-citation></ref><ref id="scirp.75414-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Shi, Q. and Jin, Y. (2004) A Comparative Study on The Application of Various Personal Credit Scoring Models in China. Statistical Research, 21, 43-47.</mixed-citation></ref><ref id="scirp.75414-ref6"><label>6</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Miyamoto</surname><given-names> M. </given-names></name>,<etal>et al</etal>. (<year>2014</year>)<article-title>Credit Risk Assessment for a Small Bank by Using a Multinomial Logistic Regression Model</article-title><source> International Journal of Finance &amp; Accounting</source><volume> 3</volume>,<fpage> 327</fpage>-<lpage>334</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.75414-ref7"><label>7</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Menard</surname><given-names> S. </given-names></name>,<etal>et al</etal>. (<year>2002</year>)<article-title>Applied Logistic Regression Analysis</article-title><source> Technometrics</source><volume> 38</volume>,<fpage> 184</fpage>-<lpage>186</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.75414-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Yang, S., Zhu, Q. and Cheng, C. (2013) The Building of the Combined Model for Personal Credit Rating: A Study Based on the Decision Tree-Neural Network. Financial Forum, 27, 57-61.</mixed-citation></ref><ref id="scirp.75414-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Xiao, W. and Fei, Q. (2006) A Study of Personal Credit Scoring Models on Support Vector Machine with Optimal Choice of Kernel Function Parameters. Systems Engineering Theory &amp; Practice, 26, 73-79.</mixed-citation></ref><ref id="scirp.75414-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Galindo, J. and Tamayo, P. (2006) Evaluation of Neural Networks and Data Mining Methods on a Credit Assessment Task for Class Imbalance Problem. Nonlinear Analysis: Real World Applications, 7, 720-747.</mixed-citation></ref><ref id="scirp.75414-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Salehi, M. and Mansoury, A. (2011) An Evaluation of Iranian Banking System Credit Risk: Neural Network And Logistic Regression Approach. International Journal of Physical Sciences, 6, 6082-6090.</mixed-citation></ref><ref id="scirp.75414-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Lu, H., Li, Y. and Hong, W. (2013) Credit Scoring Model Hybridizing Artificial Intelligence With Logistic Regression. Journal of Networks, 8, 253-261.</mixed-citation></ref><ref id="scirp.75414-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Khashei, M. and Bijari, M.S. (2012) A Novel Hybrid Classification Model of Artificial Neural Networks and Multiple Linear Regression Models. Expert Systems with Applications, 39, 2606-2620.</mixed-citation></ref><ref id="scirp.75414-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Zheng, L. (2002) Commercial Bank Credit Risk Management. China Renmin University Press, Beijing.</mixed-citation></ref><ref id="scirp.75414-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Galindo, J. and Tamayo, P. (2000) Credit Risk Assessment Using Statistical and Machine Learning: Basic Methodology and Risk Modeling Applications. Computational Economics, 15, 107-143. https://doi.org/10.1023/A:1008699112516</mixed-citation></ref></ref-list></back></article>