<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JTTs</journal-id><journal-title-group><journal-title>Journal of Transportation Technologies</journal-title></journal-title-group><issn pub-type="epub">2160-0473</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jtts.2019.91007</article-id><article-id pub-id-type="publisher-id">JTTs-89917</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Engineering</subject></subj-group></article-categories><title-group><article-title>
 
 
  Case Study of Four Vehicle Reliability Comparison Based on Survival Analysis
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xu</surname><given-names>Pengzhou</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Gao</surname><given-names>Junhui</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>American and European International Study Center, Wuxi, China</addr-line></aff><aff id="aff1"><addr-line>Jiangsu Tianyi High School, Wuxi, China</addr-line></aff><pub-date pub-type="epub"><day>07</day><month>11</month><year>2018</year></pub-date><volume>09</volume><issue>01</issue><fpage>109</fpage><lpage>119</lpage><history><date date-type="received"><day>19</day><month>December</month><year>2018</year></date><date date-type="rev-recd"><day>13</day><month>January</month><year>2019</year></date><date date-type="accepted"><day>16</day><month>January</month><year>2019</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  This paper analyzes how a selected set of vehicles of different makes survive after each change of owner. The data come from GitHub and cover 1996 to 2012. The four cars are the Honda Accord, Mini Cooper, Chevy Cavalier, and Toyota Avalon. The two faults considered are engine system faults and transmission system faults. The paper uses Kaplan-Meier curves for the survival analysis; it also calculates and discusses the self-comparison of each car across four time periods, the four-stage failure rate through median comparison, and the median comparison of fault conditions in all years. We find that all the vehicle types have become more reliable over the years and that Toyota vehicles are more reliable than Honda vehicles.
 
</p></abstract><kwd-group><kwd>Survival Analysis</kwd><kwd> Automobile</kwd><kwd> Reliability</kwd><kwd> Case Study</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Vehicle survival concerns the total time a vehicle works after it is sold to a customer, together with the malfunctions it suffers. Vehicle survival analysis is used in numerous areas, such as vehicle quality assessment. For example, it can anticipate latent problems that might occur in vehicles and so help ensure the safety of the driver and passengers; it is also employed in large-scale vehicle scrappage programs to make the fullest use of vehicles and to maintain their price.</p><p>Furthermore, vehicle survival analysis can be used to estimate a car’s reliability even before customers purchase it, so that money is spent more efficiently. Analysis of specific vehicles also gives car manufacturers an opportunity to improve their products, attracting new consumers and ensuring the loyalty of existing ones.</p><p>Earlier researchers have carried out similar analyses to estimate vehicle performance. Data mining and neural network methods have been used to estimate the reliability of a vehicle [<xref ref-type="bibr" rid="scirp.89917-ref1">1</xref>], with MATLAB 2007 used to evaluate engine performance. Reliability assessments have also been made for race cars: fault tree analysis and finite element analysis were used to estimate the full-car reliability of FSAE race cars [<xref ref-type="bibr" rid="scirp.89917-ref2">2</xref>]. The reliability of diesel engines has been studied as well, by integrating the Weibull model with Dempster-Shafer evidence theory to describe how the failure rate of each engine part is distributed while the engine is still functioning [<xref ref-type="bibr" rid="scirp.89917-ref3">3</xref>]. 
In this paper, we analyze four types of vehicles to compare their overall performance and to determine which car has improved most in reliability over the years.</p></sec><sec id="s2"><title>2. Data Sources</title><p>In this paper, we use data from GitHub (https://github.com/tcrug/car-reliability). The data are in tabular form, with six columns in total. The first column gives the date the vehicle was bought; the second gives the vehicle manufacturer; the third gives the specific model produced by the manufacturer; the fourth lists the total distance traveled by the vehicle when it was examined at its first malfunction; the fifth and sixth give the state of the engine and of the transmission system, respectively. Four vehicle models from four brands are analyzed: Honda, Toyota, MINI, and Chevrolet. To make the data more representative of the varied vehicle market, we deliberately chose manufacturers from three different countries, which stand for different manufacturing standards and styles. The malfunctions are separated into two major categories: engine problems and transmission problems. When the vehicle is examined, the state of each of these two parts is recorded as 0 or 1: 0 means that no problem was found, whereas 1 means that a problem exists in the corresponding part.</p></sec><sec id="s3"><title>3. Related Technology</title><p>In this paper, we mainly employ three kinds of methods: non-parametric, semiparametric, and parametric.</p><sec id="s3_1"><title>3.1. Non-Parametric Analysis</title><p>The Kaplan-Meier estimation graph and the log-rank test are used to analyze the state of each type of vehicle.</p><sec id="s3_1_1"><title>3.1.1. Kaplan-Meier Estimator</title><p>The graph of the Kaplan-Meier estimator descends like a staircase. 
The curve is composed of horizontal and vertical segments that show the probability that an individual survives beyond a given time; this is called the survival function. The estimator is mainly used in medicine to estimate the probability that patients survive under certain circumstances, but in this paper it serves as the main method for evaluating vehicle survival probabilities. Applying this method to vehicles helps promote the production of higher-quality vehicles [<xref ref-type="bibr" rid="scirp.89917-ref4">4</xref>].</p><p>The estimator has the basic form</p><p>$$\hat{S}(t) = \prod_{i:\, t_i \le t} \left(1 - \frac{d_i}{n_i}\right)$$</p><p>Here t<sub>i</sub> is a time at which at least one event happened, d<sub>i</sub> is the number of events that happened at t<sub>i</sub>, and n<sub>i</sub> is the number of individuals known to have survived (not yet had an event or been censored) up to time t<sub>i</sub>. The estimator assumes no parametric form for the survival function, so Kaplan-Meier is included among the non-parametric methods [<xref ref-type="bibr" rid="scirp.89917-ref5">5</xref>]. However, each ratio d<sub>i</sub>/n<sub>i</sub> can be regarded as an estimate of a discrete hazard h<sub>i</sub>, whose value can be obtained by the method of maximum likelihood.</p><p>We write the survival function as</p><p>$$\hat{S}(t) = \prod_{i:\, t_i \le t} \left(1 - h_i\right)$$</p><p>The likelihood function is</p><p>$$L(h_j : j \le i) = \prod_{j=1}^{i} h_j^{d_j} \left(1 - h_j\right)^{n_j - d_j}$$</p><p>To maximize the likelihood, we take its natural logarithm and set the derivative with respect to each h<sub>i</sub> to zero.</p><p>$$\ln(L) = \sum_{j=1}^{i} \left[ d_j \ln(h_j) + (n_j - d_j) \ln(1 - h_j) \right]$$</p><p>$$\frac{\partial \ln(L)}{\partial h_i} = \frac{d_i}{\hat{h}_i} - \frac{n_i - d_i}{1 - \hat{h}_i} = 0 \;\Rightarrow\; \hat{h}_i = \frac{d_i}{n_i}$$</p><p>The Kaplan-Meier estimator is one of the most frequently used methods for survival analysis. It has a comparative advantage in estimating the death rate, which here is the rate of malfunction of each vehicle part. Also, the result is clearer because it is visualized.</p></sec><sec id="s3_1_2"><title>3.1.2. 
Log-Rank Test</title><p>The log-rank test statistic compares estimates of the hazard functions of two groups at each observed event time. It is constructed by calculating the observed and expected numbers of events in one of the groups at each observed event time and then adding these up to obtain an overall summary across all time points at which there is an event [<xref ref-type="bibr" rid="scirp.89917-ref6">6</xref>].</p><p>Let j = 1, ..., J index the distinct times of observed events in either group. For each time j, let N<sub>1j</sub> and N<sub>2j</sub> be the numbers of subjects “at risk” (not yet had an event or been censored) in the two groups at the start of period j, and let N<sub>j</sub> = N<sub>1j</sub> + N<sub>2j</sub>. Let O<sub>ij</sub> be the observed number of events in group i at time j, E<sub>ij</sub> its expected value under the null hypothesis, and V<sub>j</sub> the variance of the difference O<sub>1j</sub> − E<sub>1j</sub> [<xref ref-type="bibr" rid="scirp.89917-ref7">7</xref>].</p><p>$$Z = \frac{\sum_{j=1}^{J} \left(O_{1j} - E_{1j}\right)}{\sqrt{\sum_{j=1}^{J} V_j}} \sim N(0, 1)$$</p><p>The calculated value of Z is compared with the standard normal distribution to determine whether it lies within the acceptance region.</p><p>The log-rank test can detect a difference between two groups with significantly different risks, but it is only a test of significance, so it is not the primary tool in this paper.</p></sec></sec><sec id="s3_2"><title>3.2. Semiparametric Analysis</title><p>Cox Regression Model</p><p>The Cox regression model uses the hazard function h(t, X) = h<sub>0</sub>(t) exp(β<sub>1</sub>X<sub>1</sub> + ⋯ + β<sub>m</sub>X<sub>m</sub>) as an intermediate quantity instead of directly determining the relationship between the causal factors X and the survival function S(t, X) [<xref ref-type="bibr" rid="scirp.89917-ref8">8</xref>]. The main idea of the Cox regression model as a semiparametric method is that the parameters β can be determined without knowing the baseline hazard h<sub>0</sub>(t). There is a prerequisite for using the Cox model: the effect of each factor X must not vary over time. 
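</p><p>As a short worked step (a sketch in the notation above, where X<sub>i</sub> and X<sub>j</sub> are assumed to denote the covariate vectors of two hypothetical subjects and β the coefficient vector), this prerequisite is what makes the ratio of two hazards independent of time, because the baseline hazard cancels:</p><p>$$\frac{h(t, X_i)}{h(t, X_j)} = \frac{h_0(t) \exp\left(\beta^{\top} X_i\right)}{h_0(t) \exp\left(\beta^{\top} X_j\right)} = \exp\left(\beta^{\top} (X_i - X_j)\right)$$</p><p>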
We assess the effect of each factor by determining the relative risk between the exposed group and the non-exposed group [<xref ref-type="bibr" rid="scirp.89917-ref9">9</xref>].</p><p>$$RR = \frac{h(t, X_i)}{h(t, X_j)}$$</p><p>The Cox regression model can take into account multiple factors that affect the survival time of the studied subject.</p></sec><sec id="s3_3"><title>3.3. Parametric Analysis</title><sec id="s3_3_1"><title>3.3.1. Weibull Distribution and Exponential Distribution</title><p>The exponential distribution and the Weibull distribution describe the occurrence of a specific event in a time interval. Their probability density functions are, respectively:</p><p>$$f(x) = \begin{cases} \lambda e^{-\lambda x} \quad (x &gt; 0) \\ 0 \quad (x \le 0) \end{cases} \qquad f(x; \lambda, k) = \begin{cases} \frac{k}{\lambda} \left(\frac{x}{\lambda}\right)^{k-1} e^{-(x/\lambda)^{k}} \quad (x \ge 0) \\ 0 \quad (x &lt; 0) \end{cases}$$</p><p>The Weibull distribution and the exponential distribution are very similar [<xref ref-type="bibr" rid="scirp.89917-ref10">10</xref>]. When k = 1, the Weibull density becomes the same as that of an exponential distribution (with rate 1/λ).</p></sec><sec id="s3_3_2"><title>3.3.2. Parameter Estimation</title><p>The parameters have to be determined in order to draw the probability density function precisely. We mainly use two methods for parameter estimation: point estimation and maximum likelihood estimation [<xref ref-type="bibr" rid="scirp.89917-ref11">11</xref>]. Since the maximum likelihood method is more accurate, we mainly focus on this method to determine the parameters.</p><p>Parametric estimation can be combined with predefined equations and functions to estimate the duration of a project.</p></sec><sec id="s3_3_3"><title>3.3.3. Exponential Regression and Weibull Regression</title><p>To determine whether there is a significant cause-and-effect relationship, we have to carry out a regression test on the outcome of the Weibull and exponential distributions. 
First, we hypothesize that there is no relationship between the factor and the result; then we set up the test formula and plug in the required data from the data set. Next, we determine the rejection region and check whether the value falls within it. Finally, we state whether the hypothesis is accepted or rejected.</p></sec></sec></sec><sec id="s4"><title>4. Methods</title><sec id="s4_1"><title>4.1. Overall Description of Data</title><p>We used SQL to tabulate the distribution of the data for the four cars, listed in <xref ref-type="table" rid="table1">Table 1</xref>. Numbers in the table indicate how many pieces of failure data are available for a particular type of vehicle in the corresponding year. For example, Honda had 393 samples in 1996 and MINI had 129 samples in 2002.</p><p>As we can see from <xref ref-type="table" rid="table1">Table 1</xref>, the data run from 1996 to 2012. Honda and Toyota have data in all years, while Chevrolet lacks data from 2006 to 2012 and MINI lacks data from 1996 to 2001.</p></sec><sec id="s4_2"><title>4.2. Analysis Strategy</title><p>We adopt the non-parametric method to analyze the faults of automobiles. Based on the data distribution, and considering the missing data, we analyze and model the data from two angles. The first angle is to compare the survival curves of the four cars, using the Kaplan-Meier estimator. The second angle is to compare the survival curves of different time periods. We divide the time into four periods: 1996-1999, 2000-2003, 2004-2007, and 2008-2012. 
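</p><p>The period split can be illustrated with a minimal Python sketch that assigns each record to one of the four periods and computes a Kaplan-Meier median mileage per period. It is not the authors’ code: the helper names (period_of, kaplan_meier, median_mileage) and the toy records are hypothetical, and the sketch uses only the standard library rather than a survival-analysis package.</p><preformat>
```python
import math
from collections import defaultdict
from itertools import groupby

# Period boundaries as stated in the text (the last one spans five years).
PERIODS = [(1996, 1999), (2000, 2003), (2004, 2007), (2008, 2012)]


def period_of(year):
    """Return a label such as '1996-1999' for a purchase year, else None."""
    for start, end in PERIODS:
        if year in range(start, end + 1):
            return f"{start}-{end}"
    return None


def kaplan_meier(records):
    """Product-limit estimate from (mileage, failed) pairs.

    failed is 1 for an observed fault and 0 for a censored record.
    Returns the curve as a list of (mileage, survival) steps.
    """
    ordered = sorted(records)
    at_risk = len(ordered)
    survival = 1.0
    steps = []
    for mileage, group in groupby(ordered, key=lambda r: r[0]):
        flags = [failed for _, failed in group]
        faults = sum(flags)
        if faults:
            survival *= 1 - faults / at_risk
            steps.append((mileage, survival))
        at_risk -= len(flags)  # faults and censored records both leave
    return steps


def median_mileage(steps):
    """First mileage where survival is 0.5 or lower; inf if never reached."""
    for mileage, survival in steps:
        if 0.5 >= survival:
            return mileage
    return math.inf


# Hypothetical toy records: (purchase year, mileage, failed flag).
rows = [
    (1996, 100, 1), (1997, 200, 1), (1998, 300, 1), (1999, 400, 0),
    (1999, 500, 0),
    (2008, 100, 1), (2009, 200, 0), (2010, 300, 0), (2011, 400, 0),
    (2012, 500, 0),
]

by_period = defaultdict(list)
for year, mileage, failed in rows:
    by_period[period_of(year)].append((mileage, failed))

median_by_period = {
    label: median_mileage(kaplan_meier(recs))
    for label, recs in by_period.items()
}
print(median_by_period)
```
</preformat><p>With these toy records, the 1996-1999 group passes 50% failures at mileage 300, while the 2008-2012 group never does, so its median is reported as infinity.</p><p>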
The last stage is one year longer than the first three stages, considering that the data for 2012 is relatively small.</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Time distribution of available samples of four cars</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >Chevrolet</th><th align="center" valign="middle" >Honda</th><th align="center" valign="middle" >MINI</th><th align="center" valign="middle" >Toyota</th></tr></thead><tr><td align="center" valign="middle" >1996</td><td align="center" valign="middle" >48</td><td align="center" valign="middle" >393</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >109</td></tr><tr><td align="center" valign="middle" >1997</td><td align="center" valign="middle" >98</td><td align="center" valign="middle" >509</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >139</td></tr><tr><td align="center" valign="middle" >1998</td><td align="center" valign="middle" >116</td><td align="center" valign="middle" >977</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >171</td></tr><tr><td align="center" valign="middle" >1999</td><td align="center" valign="middle" >146</td><td align="center" valign="middle" >1171</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >152</td></tr><tr><td align="center" valign="middle" >2000</td><td align="center" valign="middle" >209</td><td align="center" valign="middle" >1382</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >354</td></tr><tr><td align="center" valign="middle" >2001</td><td align="center" valign="middle" >222</td><td align="center" valign="middle" >1271</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >229</td></tr><tr><td align="center" valign="middle" >2002</td><td align="center" 
valign="middle" >335</td><td align="center" valign="middle" >1419</td><td align="center" valign="middle" >129</td><td align="center" valign="middle" >175</td></tr><tr><td align="center" valign="middle" >2003</td><td align="center" valign="middle" >348</td><td align="center" valign="middle" >1650</td><td align="center" valign="middle" >254</td><td align="center" valign="middle" >135</td></tr><tr><td align="center" valign="middle" >2004</td><td align="center" valign="middle" >354</td><td align="center" valign="middle" >1152</td><td align="center" valign="middle" >239</td><td align="center" valign="middle" >96</td></tr><tr><td align="center" valign="middle" >2005</td><td align="center" valign="middle" >195</td><td align="center" valign="middle" >984</td><td align="center" valign="middle" >337</td><td align="center" valign="middle" >123</td></tr><tr><td align="center" valign="middle" >2006</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >574</td><td align="center" valign="middle" >299</td><td align="center" valign="middle" >203</td></tr><tr><td align="center" valign="middle" >2007</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >570</td><td align="center" valign="middle" >245</td><td align="center" valign="middle" >105</td></tr><tr><td align="center" valign="middle" >2008</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >481</td><td align="center" valign="middle" >194</td><td align="center" valign="middle" >49</td></tr><tr><td align="center" valign="middle" >2009</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >231</td><td align="center" valign="middle" >158</td><td align="center" valign="middle" >9</td></tr><tr><td align="center" valign="middle" >2010</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >206</td><td align="center" valign="middle" >61</td><td align="center" valign="middle" >11</td></tr><tr><td 
align="center" valign="middle" >2011</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >101</td><td align="center" valign="middle" >25</td><td align="center" valign="middle" >11</td></tr><tr><td align="center" valign="middle" >2012</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >47</td><td align="center" valign="middle" >12</td><td align="center" valign="middle" >1</td></tr></tbody></table></table-wrap></sec><sec id="s4_3"><title>4.3. Programming Tools</title><p>The code is written in Python 3.6 [<xref ref-type="bibr" rid="scirp.89917-ref12">12</xref>] with pandas [<xref ref-type="bibr" rid="scirp.89917-ref13">13</xref>], and the survival analysis uses the Kaplan-Meier functions of the third-party package lifelines [<xref ref-type="bibr" rid="scirp.89917-ref14">14</xref>]: KaplanMeierFitter and multivariate_logrank_test.</p></sec></sec><sec id="s5"><title>5. Results</title><sec id="s5_1"><title>5.1. Compare the Survival Curves of Four Cars</title><p>We compare the four types of vehicles over all years using the K-M method; the results for the two kinds of faults are shown in <xref ref-type="fig" rid="fig1">Figure 1</xref> and <xref ref-type="fig" rid="fig2">Figure 2</xref>, respectively.</p><p>From <xref ref-type="fig" rid="fig1">Figure 1</xref>, we find that almost all Honda cars had a fault before reaching 400,000 miles, and that MINI cars had generally failed before 200,000 miles. The other two cars perform significantly better than Honda and MINI.</p><p>From <xref ref-type="fig" rid="fig2">Figure 2</xref>, we find that Toyota’s fault condition is significantly better than that of the other three cars.</p></sec><sec id="s5_2"><title>5.2. 
Compare Survival Curves over Different Time Periods</title><p>The overall K-M comparison across the different time periods, for each of the two faults, is shown in <xref ref-type="fig" rid="fig3">Figure 3</xref> and <xref ref-type="fig" rid="fig4">Figure 4</xref>.</p><p>From <xref ref-type="fig" rid="fig3">Figure 3</xref>, we find that for the first type of failure, 2008-2012 is the period of best reliability (lowest failure rate), followed by 2004-2007, then 1996-1999, and finally 2000-2003. This shows that, overall, reliability improves over time.</p><p>From <xref ref-type="fig" rid="fig4">Figure 4</xref>, we find that for the second type of failure, the failure situation in 2004-2007 is the best, followed by 2008-2012, then 1996-1999, and finally 2000-2003. This again shows that reliability improves over time, but with larger fluctuation than for the first type of failure.</p></sec></sec><sec id="s6"><title>6. Discussion</title><p>In the fifth part of the article, we gave two results: the survival curves of the four cars and the survival curves of the different time periods. In order to compare the reliability of the four cars in more depth, we carry out some further analysis and calculation here.</p><sec id="s6_1"><title>6.1. Self-Comparison of Each Car in Four Time Periods</title><p>Here we select Honda and Toyota and compare the faults of each across the four time periods.</p><p>From <xref ref-type="fig" rid="fig5">Figure 5</xref>, we find that for the second fault of Honda, the two periods 2004-2007 and 2008-2012 are relatively close, and both are better than the other two periods.</p><p>From <xref ref-type="fig" rid="fig6">Figure 6</xref>, we find that for the second fault of Toyota, the two periods 2004-2007 and 2008-2012 are relatively close, and the two periods 2000-2003 and 1996-1999 are likewise close to each other.</p></sec><sec id="s6_2"><title>6.2. 
Four-Stage Failure Rate by Median Comparison</title><p>The median here is the mileage at which 50% of the cars have failed. We compare the medians of the four stages, and the results are shown in <xref ref-type="table" rid="table2">Table 2</xref>.</p><p>From <xref ref-type="table" rid="table2">Table 2</xref>, we find that fault 1 has a median in the two stages 1996-1999 and 2000-2003, while the other two stages do not. This means that in the two periods 2004-2007 and 2008-2012, the proportion of faulty cars never reached 50%, so overall, from 1996 to 2012, the fault 1 situation is improving, although the 2000-2003 median (250,720) is worse than the 1996-1999 median (270,919). The situation of fault 2 is basically similar to that of fault 1, except that in the first two stages the median mileage of fault 2 is significantly higher than that of fault 1.</p></sec><sec id="s6_3"><title>6.3. Compare the Failures of All Years by Median</title><p>Using the same method as in 6.2, we compare the medians of all years, and the calculation results are listed in <xref ref-type="table" rid="table3">Table 3</xref>. Because only Honda and Toyota have data for all years, the other two models are not included.</p><p>In <xref ref-type="table" rid="table3">Table 3</xref>, after 2002 the median cannot be calculated for either of Honda’s two faults, indicating that the reliability of the car has improved since 2002. Over the whole 1996-2012 span, a median can be calculated for Toyota only for fault 1 in 1998 and 2000 and for fault 2 in 2009. 
Overall, Toyota’s reliability is better than Honda’s.</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title>Median mileage comparison of the four stages</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >the first fault</th><th align="center" valign="middle" >the second fault</th></tr></thead><tr><td align="center" valign="middle" >1996-1999</td><td align="center" valign="middle" >270,919</td><td align="center" valign="middle" >329,885</td></tr><tr><td align="center" valign="middle" >2000-2003</td><td align="center" valign="middle" >250,720</td><td align="center" valign="middle" >374,217</td></tr><tr><td align="center" valign="middle" >2004-2007</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2008-2012</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr></tbody></table></table-wrap><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title>Median mileage comparison by year for Honda and Toyota</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  ></th><th align="center" valign="middle"  colspan="2"  >Honda</th><th align="center" valign="middle"  colspan="2"  >Toyota</th></tr></thead><tr><td align="center" valign="middle" >the first fault</td><td align="center" valign="middle" >the second fault</td><td align="center" valign="middle" >the first fault</td><td align="center" valign="middle" >the second fault</td></tr><tr><td align="center" valign="middle" >1996</td><td align="center" valign="middle" >296,188</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >1997</td><td align="center" valign="middle" >255,692</td><td align="center" 
valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >1998</td><td align="center" valign="middle" >260,828</td><td align="center" valign="middle" >329,885</td><td align="center" valign="middle" >287,496</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >1999</td><td align="center" valign="middle" >257,376</td><td align="center" valign="middle" >323,454</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2000</td><td align="center" valign="middle" >237,485</td><td align="center" valign="middle" >301,751</td><td align="center" valign="middle" >324,786</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2001</td><td align="center" valign="middle" >225,687</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2002</td><td align="center" valign="middle" >212,010</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2003</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2004</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2005</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td 
align="center" valign="middle" >2006</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2007</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2008</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2009</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >114,932</td></tr><tr><td align="center" valign="middle" >2010</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2011</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr><tr><td align="center" valign="middle" >2012</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td><td align="center" valign="middle" >inf</td></tr></tbody></table></table-wrap></sec><sec id="s6_4"><title>6.4. Inadequacies of Research</title><p>The most obvious shortcomings of this study are that the data come from a single source and that the data for two of the cars are incomplete: of the 17 years covered, Chevrolet has no data for 2006-2012 and MINI has none for 1996-2001.</p></sec></sec><sec id="s7"><title>7. 
Conclusion</title><p>This paper analyzes the survival of vehicles with respect to engine and transmission faults and compares the reliability of four vehicles from different manufacturers. The research shows that by applying the Kaplan-Meier estimator and the log-rank test, we can see both which car brands have improved and which perform best. A comparative analysis of the four time periods suggests that the entire industry may be getting better. Such data analysis can provide customers with very useful vehicle reliability information for reference at the time of purchase. Survival analysis methods can also be applied to specific parts of a vehicle, such as the parts most commonly damaged, for example a tire or the suspension. This is one of our planned follow-up studies.</p></sec><sec id="s8"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s9"><title>Cite this paper</title><p>Xu, P.Z. and Gao, J.H. (2019) Case Study of Four Vehicle Reliability Comparison Based on Survival Analysis. Journal of Transportation Technologies, 9, 109-119. https://doi.org/10.4236/jtts.2019.91007</p></sec></body><back><ref-list><title>References</title><ref id="scirp.89917-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Meng, S.P., Luo, H.Y. and Li, S. (2007) “Metal Heat Treatment” Automobile Reliability Analysis Method Based on Data Mining.</mixed-citation></ref><ref id="scirp.89917-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Wang, J. (2015) The Reliability Analysis and Application on the FSAE Racing Vehicle. Doctoral Thesis, Hefei University of Technology, Hefei.</mixed-citation></ref><ref id="scirp.89917-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Yang, Z.Q. (2015) Research on Reliability Analysis Data on Diesel Engines. 
Doctoral Thesis, Jiangxi University of Science and Technology, Ganzhou.</mixed-citation></ref><ref id="scirp.89917-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Kaplan, E.L. and Meier, P. (1958) Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association, 457-481.  
https://doi.org/10.1080/01621459.1958.10501452</mixed-citation></ref><ref id="scirp.89917-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Kaplan, E.L. (1983) This Week’s Citation Classic. Current Contents, 24, 14.</mixed-citation></ref><ref id="scirp.89917-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Mantel, N. (1966) Evaluation of Survival Data and Two New Rank Order Statistics Arising in Its Consideration. Cancer Chemotherapy Reports.</mixed-citation></ref><ref id="scirp.89917-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Peto, R. and Peto, J. (1972) Asymptotically Efficient Rank Invariant Test Procedures. Journal of the Royal Statistical Society, Series A, Blackwell Publishing, 135, 185-207. https://doi.org/10.2307/2344317</mixed-citation></ref><ref id="scirp.89917-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Cox, D.R. (1972) Regression Models and Life Tables (with Discussion). Journal of the Royal Statistical Society Series B.  
https://doi.org/10.1111/j.2517-6161.1972.tb00899.x</mixed-citation></ref><ref id="scirp.89917-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Cox, D.R. and Oakes, D. (1984) Analysis of Survival Data. Chapman &amp; Hall, New York.</mixed-citation></ref><ref id="scirp.89917-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Fréchet, M. (1927) Sur la loi de probabilité de l’écart maximum. Annales de la Société Polonaise de Mathematique, Cracovie.</mixed-citation></ref><ref id="scirp.89917-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Johnson, N.L., Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics, 2nd Edition, John Wiley &amp; Sons, New York.</mixed-citation></ref><ref id="scirp.89917-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Python 3.6. https://www.python.org/downloads/release/python-360/</mixed-citation></ref><ref id="scirp.89917-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Pandas. https://pandas.pydata.org/</mixed-citation></ref><ref id="scirp.89917-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Lifelines. https://lifelines.readthedocs.io/en/latest/</mixed-citation></ref></ref-list></back></article>