<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JCC</journal-id><journal-title-group><journal-title>Journal of Computer and Communications</journal-title></journal-title-group><issn pub-type="epub">2327-5219</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jcc.2022.106004</article-id><article-id pub-id-type="publisher-id">JCC-117978</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Support Vector Machine Prediction Modeling for Automobile Ownership
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ruidong</surname><given-names>Zhang</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xinguang</surname><given-names>Zhang</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai, China</addr-line></aff><pub-date pub-type="epub"><day>09</day><month>06</month><year>2022</year></pub-date><volume>10</volume><issue>06</issue><fpage>37</fpage><lpage>43</lpage><history><date date-type="received"><day>6,</day>	<month>May</month>	<year>2022</year></date><date date-type="rev-recd"><day>20,</day>	<month>June</month>	<year>2022</year>	</date><date date-type="accepted"><day>23,</day>	<month>June</month>	<year>2022</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  To address some inherent defects of artificial neural networks, such as insufficient generalization performance, local extremum problem, and dimensional catastrophe problem, a support vector machine was proposed and applied to the modeling of automobile ownership prediction. By analyzing the data on automobile ownership and its influencing factors, the learning sample couples for automobile ownership prediction modeling were constructed, and support vector machine (SVM) was used to regression the nonlinear function relation of automobile ownership prediction model, and the established automobile ownership prediction model was used to predict the automobile ownership in different years. To reduce the impact on the accuracy of automobile ownership prediction caused by the large order of magnitude difference between the data of automobile ownership and its influence factors, the normalization method was used to pre-process the automobile ownership and its influence factors, and the inverse normalization was used to process the automobile ownership prediction results. The comparison between the automobile ownership prediction results and the statistical results shows that the automobile ownership prediction model has good generalization performance, and the support vector machine is an effective method to model the automobile ownership prediction.
 
</p></abstract><kwd-group><kwd>Automobile Ownership</kwd><kwd> Prediction</kwd><kwd> Support Vector Machine</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The accurate prediction modeling of China’s automobile ownership can provide a reference for decision-making to deal with a series of urgent social problems such as traffic, environmental pollution, and energy caused by the growth of automobile ownership in China.</p><p>In predictive modeling, the mathematical model combined with computer simulation is mostly used nowadays, and the key to applying this method is to accurately determine the model parameters or functional relationship in the mathematical model.</p><p>Classical system identification methods, including the least-squares, Kalman filtering, etc., were applied in various fields. To decrease the modeling errors and enhance the control behavior in transients and steady-state, Anian Brosch used the recursive least squares forecasting approach [<xref ref-type="bibr" rid="scirp.117978-ref1">1</xref>] to conduct the current control. Shao et al. proposed the direct least-squares method [<xref ref-type="bibr" rid="scirp.117978-ref2">2</xref>] to address collinear data and big data. Fabio Tinazzi et al. described the historical current measurements using the recursive least-square algorithm [<xref ref-type="bibr" rid="scirp.117978-ref3">3</xref>] to predict the currency’s future behavior corresponding to a limited set of voltage vectors. A least-squares [<xref ref-type="bibr" rid="scirp.117978-ref4">4</xref>] predictive model using discrete cosine transform was suggested to perform the prediction of hourly air quality index. The partial least squares regression model [<xref ref-type="bibr" rid="scirp.117978-ref5">5</xref>] was presented to develop the sea ice concentration prediction model. Xing et al. used the Kalman filter [<xref ref-type="bibr" rid="scirp.117978-ref6">6</xref>] based on the fractional-order model to perform the lithium-ion battery state of charge estimation. Mathieu Brunot presented the recursive estimator [<xref ref-type="bibr" rid="scirp.117978-ref7">7</xref>] that was of Kalman type for discrete-time systems characterized with measurement noise modeled from Gaussian uniform mixtures. Martincevic, A. and M. Vasak used the constrained Kalman filter [<xref ref-type="bibr" rid="scirp.117978-ref8">8</xref>] to identify the semi-physical building thermal models. Xiao et al. utilized the improved extended Kalman filter [<xref ref-type="bibr" rid="scirp.117978-ref9">9</xref>] to do the intelligent vehicle localization. The neuron-based Kalman filter [<xref ref-type="bibr" rid="scirp.117978-ref10">10</xref>] was developed to address the problems of identification of actual systems and conventional Kalman filter for empirical parameter estimation.</p><p>However, these classical methods have some inherent defects, such as the dependence on the mathematical model of the research object; the convergence and accuracy of the algorithm depending on the initial value of the parameter estimation, etc., which hinder the further development of traditional system identification methods and make their applications somewhat limited. Artificial neural network [<xref ref-type="bibr" rid="scirp.117978-ref11">11</xref>] makes up for the inherent defects of traditional system identification methods, does not depend on the mathematical model of the research object, has the characteristics of algorithm stability and accuracy with little dependence on the initial value of parameter estimation, and has strong nonlinear mapping ability, which can effectively describe the input-output response characteristic for the nonlinear dynamic system of automobile ownership prediction model. However, the artificial neural network also has insufficient generalization performance, local extremum problems, dimensional catastrophe problems, etc. As a new artificial intelligence method developed based on statistical theory and the principle of structural risk minimization, a support vector machine (SVM) [<xref ref-type="bibr" rid="scirp.117978-ref12">12</xref>] solves the local extremum problem which cannot be avoided in neural network methods and has a better generalization ability and solves the dimensional catastrophe problem skillfully. However, there is little literature on the application of support vector machines to vehicle ownership prediction modeling. The modeling used in this paper is novel, but not innovative.</p><p>The learning samples required for the automobile ownership prediction modeling are obtained from the statistical data on automobile ownership and its impact factors. The nonlinear functional relationship in the automobile ownership prediction model is regressed using a support vector machine, and the established automobile ownership prediction model is used to predict automobile ownership in different years. To reduce the impact on the accuracy of automobile ownership prediction due to the large difference in the order of magnitude between the input and output of learning samples, the statistical data on automobile ownership and its influence factors are normalized and the results of automobile ownership prediction are processed by the inverse normalization method. The comparison between the prediction results and the statistical data shows that the automobile ownership prediction model has high performance in generalization, and SVM is a useful method to construct the automobile ownership prediction model.</p></sec><sec id="s2"><title>2. Research Method</title><p>SVM is mainly used for pattern recognition and function regression. When used for pattern recognition, it is called support vector classification (SVC) and when used for function regression, it is called support vector regression (SVR).</p><p>The algorithm of SVR [<xref ref-type="bibr" rid="scirp.117978-ref12">12</xref>] can be described as follows.</p><p>Step 1: Construct the training set</p><p>T = { ( x 1 , y 1 ) , ⋯ , ( x l , y l ) } ∈ ( χ &#215; y ) l ,   x i ∈ R n ,   y i ∈ R ,   i = 1 , ⋯ , l .</p><p>Step 2: Select the appropriate accuracy parameter ε, penalty parameter C, and kernel function K ( x , x ′ ) .</p><p>Step 3: Solve the following optimization problem.</p><p>min α ( * ) ∈ R 2 l 1 2 ∑ i , j = 1 l ( α i * − α i ) ( α j * − α j ) K ( x i ⋅ x j ) + ε ∑ i = 1 l ( α i * + α i ) − ∑ i = 1 l y i ( α i * − α i ) s .t .     ∑ i = 1 l ( α i − α i * ) = 0 0 ≤ α i ( * ) ≤ C l ,     i = 1 , 2 , ⋯ , l (1)</p><p>The optimal solution α ˜ ( * ) = ( α ˜ 1 , α ˜ 1 * , ⋯ , α ˜ l , α ˜ l * ) T is obtained from Eq. (1).</p><p>Step 4: Construct the decision function.</p><p>f ( x ) = ∑ i = 1 l ( α ˜ i * − α ˜ i ) K ( x i , x ) + b (2)</p><p>Select the α ˜ j or α ˜ k * located in the open interval ( 0 , C l ) . If α ˜ j is chosen, then</p><p>b = y j − ∑ i = 1 l ( α ˜ i * − α ˜ i ) K ( x i , x ) (3)</p><p>If α ˜ k * is chosen, then</p><p>b = y j − ∑ i = 1 l ( α ˜ i * − α ˜ i ) K ( x i , x ) (4)</p><p>In the structural parameters for SVR, the accuracy parameter ε is selected with a certain rule, i.e., as ε increases, the sparsity of the solution becomes larger and the fitting accuracy becomes smaller; instead, the smaller ε is, the higher the fitting accuracy is. When ε decreases to 0, the number of support vectors required for SVR reaches the maximum value, which is almost the same as the number of training samples, affecting the learning efficiency. With a larger value of the penalty parameter C, the more favorable the sample fitting accuracy is but reduces the confidence and generalization performance of the learning machine. Meanwhile, the larger the C value is, the greater the penalty is for the noisy sample, but eliminating too many samples will lead to distortion of the data regression, conversely, a smaller C value may not fit the samples well enough, which likewise causes the generalization performance of the learning machine to be reduced.</p></sec><sec id="s3"><title>3. Results and Discussion</title><p>In this paper, the training sample pairs required for SVR for automobile ownership prediction modeling were constructed by using automobile ownership and its impact factor, and the statistical data on automobile ownership and its impact factor was obtained from the National Bureau of Statistics.</p><p>The difference in the order of magnitude between automobile ownership and its impact factors can seriously affect the accuracy of automobile ownership prediction modeling. Therefore, in the paper, the automobile ownership and its impact factor statistics were first normalized and pre-processed.</p><p>The impact factors affecting the automobile ownership are used as the learning sample inputs, including seven impact indicators such as gross national income, GDP per capita, total import and export, disposable income per urban resident, steel production, road passenger traffic, and total social consumer goods, and the automobile ownership was used as the learning sample output.</p><p>The statistical data on automobile ownership and its impact factors before and after normalization are shown in <xref ref-type="table" rid="table1">Table 1</xref> and <xref ref-type="table" rid="table2">Table 2</xref>.</p><p>In the SVR structural parameters, the penalty parameter C is chosen as 10<sup>5</sup> and the precision parameter ε is chosen as 0. To confirm the generalization performance for the automobile ownership regression function constructed by SVR, a learning sample set is constructed from the automobile ownership and its impact factor data for the period from 1994 to 2004, while a verification sample set is constructed from the automobile ownership and its impact factor data for the period from 2005 to 2012.</p><p>After the inverse normalization of the automobile ownership prediction results, the predicted values of automobile ownership for the period from 2005 to 2012 are obtained, and the comparison of the predicted values and the statistical results is shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>.</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Automobiles ownership and its impact factors on statistical data</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >year</th><th align="center" valign="middle" >1994</th><th align="center" valign="middle" >…</th><th align="center" valign="middle" >2010</th><th align="center" valign="middle" >2011</th><th align="center" valign="middle" >2012</th></tr></thead><tr><td align="center" valign="middle" >Car ownership/10<sup>4</sup> liang</td><td align="center" valign="middle" >942</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >7802</td><td align="center" valign="middle" >9356</td><td align="center" valign="middle" >10,933</td></tr><tr><td align="center" valign="middle" >Gross national income 10<sup>8</sup> yuan</td><td align="center" valign="middle" >48,108</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >399,760</td><td align="center" valign="middle" >468,562</td><td align="center" valign="middle" >516,810</td></tr><tr><td align="center" valign="middle" >Per capita GDP/yuan</td><td align="center" valign="middle" >4044</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >30,015</td><td align="center" valign="middle" >35,198</td><td align="center" valign="middle" >38,459</td></tr><tr><td align="center" valign="middle" >Total import and export/10<sup>8</sup> yuan</td><td align="center" valign="middle" >20,382</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >201,722</td><td align="center" valign="middle" >236,402</td><td align="center" valign="middle" >244,160</td></tr><tr><td align="center" valign="middle" >Per capita disposable income of urban residents/yuan</td><td align="center" valign="middle" >3496</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >19,109</td><td align="center" valign="middle" >21,810</td><td align="center" valign="middle" >24,565</td></tr><tr><td align="center" valign="middle" >Steel output/10<sup>4</sup> tons</td><td align="center" valign="middle" >8428</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >80,277</td><td align="center" valign="middle" >88,620</td><td align="center" valign="middle" >95,578</td></tr><tr><td align="center" valign="middle" >Highway passenger volume/10<sup>4</sup></td><td align="center" valign="middle" >953,940</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >3,052,738</td><td align="center" valign="middle" >3,286,220</td><td align="center" valign="middle" >3,557,010</td></tr><tr><td align="center" valign="middle" >Total retail sales of consumer goods/10<sup>8</sup> yuan</td><td align="center" valign="middle" >18,623</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >156,998</td><td align="center" valign="middle" >183,917</td><td align="center" valign="middle" >210,307</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Automobiles ownership and its impact factors on normalized statistical data</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >year</th><th align="center" valign="middle" >1994</th><th align="center" valign="middle" >…</th><th align="center" valign="middle" >2010</th><th align="center" valign="middle" >2011</th><th align="center" valign="middle" >2012</th></tr></thead><tr><td align="center" valign="middle" >Car ownership/10<sup>4</sup></td><td align="center" valign="middle" >−1.00</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.99</td><td align="center" valign="middle" >−0.99</td><td align="center" valign="middle" >−0.99</td></tr><tr><td align="center" valign="middle" >Gross national income/10<sup>8</sup> yuan</td><td align="center" valign="middle" >−0.90</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.74</td><td align="center" valign="middle" >−0.71</td><td align="center" valign="middle" >−0.71</td></tr><tr><td align="center" valign="middle" >Per capita GDP/yuan</td><td align="center" valign="middle" >−0.99</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.98</td><td align="center" valign="middle" >−0.98</td><td align="center" valign="middle" >−0.98</td></tr><tr><td align="center" valign="middle" >Total import and export/10<sup>8</sup> yuan</td><td align="center" valign="middle" >−0.96</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.87</td><td align="center" valign="middle" >−0.86</td><td align="center" valign="middle" >−0.86</td></tr><tr><td align="center" valign="middle" >Per capita disposable income of urban residents/yuan</td><td align="center" valign="middle" >−0.99</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.99</td><td align="center" valign="middle" >−0.99</td><td align="center" valign="middle" >−0.99</td></tr><tr><td align="center" valign="middle" >Steel output/10<sup>4</sup> tons</td><td align="center" valign="middle" >−0.98</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.95</td><td align="center" valign="middle" >−0.95</td><td align="center" valign="middle" >−0.95</td></tr><tr><td align="center" valign="middle" >Highway passenger volume/10<sup>4</sup></td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >1.00</td></tr><tr><td align="center" valign="middle" >Total retail sales of consumer goods/10<sup>8</sup> yuan</td><td align="center" valign="middle" >−0.96</td><td align="center" valign="middle" >…</td><td align="center" valign="middle" >−0.90</td><td align="center" valign="middle" >−0.89</td><td align="center" valign="middle" >−0.88</td></tr></tbody></table></table-wrap><p>As seen in <xref ref-type="fig" rid="fig1">Figure 1</xref>, the constructed automobile ownership prediction model is very well generalized and SVM presents an attractive tool for the automobile ownership prediction modeling.</p></sec><sec id="s4"><title>4. Conclusion</title><p>In this paper, the SVR based modeling method for automobile ownership prediction was suggested, the learning sample set was constructed using the statistical results of automobile ownership and its influence factors, and the decision function for automobile ownership prediction was developed. The comparison of the automobile ownership prediction results with the statistical results confirms the SVR based automobile ownership prediction modeling method. However, there was no theoretical basis for the selection of penalty parameter C and accuracy parameter ε in SVR, and the arbitrariness in the selection of penalty parameter C and accuracy parameter ε will directly affect the accuracy of automobile ownership prediction modeling. Therefore, the next research work was to use the optimization algorithms including genetic algorithms, etc. for the optimization of penalty parameter C and accuracy parameter ε to improve the accuracy of SVR based automobile ownership prediction modeling.</p></sec><sec id="s5"><title>Acknowledgements</title><p>This work is supported by the National Natural Science Foundation of China (Grant No. 51609132).</p></sec><sec id="s6"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s7"><title>Cite this paper</title><p>Zhang, R.D. and Zhang, X.G. (2022) Support Vector Machine Prediction Modeling for Automobile Ownership. Journal of Computer and Communications, 10, 37-43. https://doi.org/10.4236/jcc.2022.106004</p></sec></body><back><ref-list><title>References</title><ref id="scirp.117978-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Brosch, A., Hanke, S. and Wallscheid, O. (2021) Data-Driven Recursive Least Squares Estimation for Model Predictive Current Control of Permanent Magnet Synchronous Motors . IEEE Transactions on Power Electronics, 36, 2179-2190. https://doi.org/10.1109/TPEL.2020.3006779</mixed-citation></ref><ref id="scirp.117978-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Shao, Z.T., Zhai, Q.Z. and Wu, J. (2021) Data Based Linear Power Flow Model: Investigation of a Least-Squares Based Approximation. IEEE Transactions on Power Systems, 36, 4246-4258. https://doi.org/10.1109/TPWRS.2021.3062359</mixed-citation></ref><ref id="scirp.117978-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Tinazzi, F., Carlet, P.G. and Bolognani, S. (2020) Motor Parameter-Free Predictive Current Control of Synchronous Motors by Recursive Least-Square Self-Commissioning Model. IEEE Transactions on Industrial Electronics, 67, 9093-9100. https://doi.org/10.1109/TIE.2019.2956407</mixed-citation></ref><ref id="scirp.117978-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Yang, Z.C. (2020) DCT-Based Least-Squares Predictive Model for Hourly AQI Fluctuation Forecasting. Journal of Environmental Informatics, 36, 58-69. https://doi.org/10.3808/jei.201800402</mixed-citation></ref><ref id="scirp.117978-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Ye, X.C. and Wu, Z.W. (2021) Seasonal Prediction of Arctic Summer Sea Ice Concentration from a Partial Least Squares Regression Model. Atmosphere, 12, 230-249. https://doi.org/10.3390/atmos12020230</mixed-citation></ref><ref id="scirp.117978-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Xing, L.K., Ling, L.Y. and Gong, B. (2022) State-of-Charge Estimation for Lithium-Ion Batteries Using Kalman Filters Based on Fractional-Order Models. Connection Science, 34, 162-184. https://doi.org/10.1080/09540091.2021.1978930</mixed-citation></ref><ref id="scirp.117978-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Brunot, M. (2020) A Gaussian Uniform Mixture Model for Robust Kalman Filtering. IEEE Transactions on Aerospace and Electronic Systems, 56, 2656-2665. https://doi.org/10.1109/TAES.2019.2953414</mixed-citation></ref><ref id="scirp.117978-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Martincevic, A. and Vasak, M. (2020) Constrained Kalman Filter for Identification of Semiphysical Building Thermal Models. IEEE Transactions on Control Systems Technology, 28, 2697-2704. https://doi.org/10.1109/TCST.2019.2942808</mixed-citation></ref><ref id="scirp.117978-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Xiao, G.C., Song, X.L. and Cao, H.T. (2020) Augmented Extended Kalman Filter with Cooperative Bayesian Filtering and Multi-Models Fusion for Precise Vehicle Localisations. IET Radar, Sonar &amp; Navigation, 14, 1815-1826. https://doi.org/10.1049/iet-rsn.2020.0155</mixed-citation></ref><ref id="scirp.117978-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Bai, Y.T., Wang, X.Y. and Jin, X.B. (2020) A Neuron-Based Kalman Filter with Nonlinear Autoregressive Model . Sensors, 20, 299. https://doi.org/10.3390/s20010299</mixed-citation></ref><ref id="scirp.117978-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, C., Guo, Y. and Li, M. (2021) Review of Development and Application of Artificial Neural Network Models. Computer Engineering and Applications, 57, 57-69.</mixed-citation></ref><ref id="scirp.117978-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, X.-G. and Zou, Z.-J. (2013) Estimation of the Hydrodynamic Coefficients from Captive Model Test Results by Using Support Vector Machines. Ocean Engineering, 73, 25-31. https://doi.org/10.1016/j.oceaneng.2013.07.007</mixed-citation></ref></ref-list></back></article>