<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">CN</journal-id><journal-title-group><journal-title>Communications and Network</journal-title></journal-title-group><issn pub-type="epub">1949-2421</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/cn.2012.41009</article-id><article-id pub-id-type="publisher-id">CN-17500</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  A New Energy Efficient Data Gathering Approach in Wireless Sensor Networks
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>afar</surname><given-names>Amiri</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Masoud</surname><given-names>Sabaei</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Bahman</surname><given-names>Soltaninasab</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>Department of Computer and IT Engineering, Amirkabir University of Technology, Tehran, Iran</addr-line></aff><aff id="aff1"><addr-line>Department of Computer and IT Engineering, Professional and Technical University of Tabriz, Iran</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>Jafar_amiri85@yahoo.com(AA)</email>;<email>sabaei@aut.ac.ir(MS)</email>;<email>Bahman_soltaninasab@yahoo.com(BS)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>27</day><month>02</month><year>2012</year></pub-date><volume>04</volume><issue>01</issue><fpage>61</fpage><lpage>72</lpage><history><date date-type="received"><day>November</day>	<month>16,</month>	<year>2010</year></date><date date-type="rev-recd"><day>October</day>	<month>30,</month>	<year>2011</year>	</date><date date-type="accepted"><day>November</day>	<month>15,</month>	<year>2011</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  Data gathering in wireless sensor networks is one of the important operations in these networks. These operations require energy consumption. Due to the limited energy of nodes, the energy productivity should be considered as a key objective in design of sensor networks. Therefore the clustering is a suitable method that used in energy consumption management. For this purpose many methods have been proposed. Between these methods the LEACH algorithm has been attend as a basic method. This algorithm uses distributed clustering method for data gathering and aggregation. The LEACH-C method that is the improvement of LEACH, which performs the clustering in centralized mode. In this method, collecting the energy level of information of every node directly in each period increases the energy cost. Also the phenomenon that is seen by sensor nodes continually change over time. Thereby the information received by nodes is correlated. Sending time correlated data in the network cause to energy dissipation. TINA method and its improvement have been proposed in order to not sending correlated data. These approaches have reported errors. In this paper, the idea of not sending time correlated data of nodes has been considered by using the time series function. Also, a model to estimate the remaining energy of nodes by the base station has been presented. Finally, a method has been proposed to aware the base station from the number of correlated data in each node as the estimation of energy will be more precise. The proposed ideas have been implemented over the LEACH-C protocol. Evaluation results showed that the proposed methods had a better performance in energy consumption and the lifetime of the network in comparison with similar methods.
 
</p></abstract><kwd-group><kwd>Clustering; Sensor Network; Data Correlation; Time Series; Energy Prediction</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Wireless sensor networks are a class of wireless ad-hoc networks. In these networks, sensor nodes collect data from physical environment and after processing sent to the base station (BS1). Thus allow monitoring and control many types of physical parameters. Each sensor node has limited energy and in most applications, replacing energy sources are not possible. So lifetime of sensor nodes is highly dependent on energy stored in their battery. Clustering is a designing method that used for management of wireless sensor networks. In this method, the network is divided into several independent collections that these collections called cluster. So each cluster contains a number of sensor nodes and a cluster head node. Member nodes in a cluster send their data to relative cluster head node. Cluster head node aggregates these data and send to the base station. Therefore, clustering in sensor networks has advantages such as data aggregation support [<xref ref-type="bibr" rid="scirp.17500-ref1">1</xref>], data gathering facilitation [<xref ref-type="bibr" rid="scirp.17500-ref2">2</xref>], organizing a suitable structure for scalable routing [<xref ref-type="bibr" rid="scirp.17500-ref3">3</xref>], and efficient propagation of data in the network [<xref ref-type="bibr" rid="scirp.17500-ref4">4</xref>].</p><p>Data gathering in wireless sensor networks is an important operation in these networks and for this purpose many methods have been proposed. The LEACH2 [<xref ref-type="bibr" rid="scirp.17500-ref5">5</xref>] protocol has been considered as a hierarchical basic method. This method is suitable for monitoring applications. Each node periodically senses the information and sends them. In this algorithm, the clustering method has used for data gathering and aggregation. The cluster and cluster head selected randomly, therefore there is no assurance to select the exact improved number and uniform distribution of cluster head throughout the network. Many improvements in LEACH protocol have been presented. LEACH-C3 method [<xref ref-type="bibr" rid="scirp.17500-ref6">6</xref>] is an example of these improvements. In LEACH-C, the forming of clusters is done using a centralized algorithm by the base station in the starting of each period. Base Station uses the received information from nodes for finding the predetermined number of cluster heads and network configuration within the clusters. This information contains position and energy of nodes. Another improvement to this algorithm is the use of estimation. One of these algorithms is LEACHCE4 [<xref ref-type="bibr" rid="scirp.17500-ref7">7</xref>]. In the proposed technique energy level collected from all nodes in two primary periods but not collected in the other periods. Instead, the average energy of initial periods is used. Considering that the energy estimation in this method is not precise, this clustering scheme is not efficient and suitable. There is some proposed clustering methods that ABCP [<xref ref-type="bibr" rid="scirp.17500-ref8">8</xref>] and ABEE [<xref ref-type="bibr" rid="scirp.17500-ref9">9</xref>] and HMM [10,11] are samples of them.</p><p>Each sensor node is observer of a physical phenomenon. Also physical phenomenon such as temperature and... continuously change in time. So the information provided by sensor nodes is dependent on each other. Some algorithms that based on not sending of correlated data are considered. The TINA<sup>5</sup> [<xref ref-type="bibr" rid="scirp.17500-ref12">12</xref>] algorithm is one of them. In this algorithm the sensor node compares the value of sampled data with previous data, if that be different send it and otherwise goes to sleep mode. The proposed improvement to this algorithm is that sensor node decides to send data with comparing the value of new sample with last reported data [<xref ref-type="bibr" rid="scirp.17500-ref13">13</xref>]. These mentioned algorithms due to error in report, is not suitable. Therefore, a method proposed to increase the accuracy of data reporting. For precise estimation of nodes energy, the base station must be aware of data time correlation. So with existence of data time correlation and using energy estimation of nodes, a method suggested so that the base station can estimate nodes energy precisely. These methods avoid the overhead excess and increase the network lifetime.</p><p>The remaining of this article is organized as follows: In Section 2 related works are reviewed. In Section 3 we introduce correlated data algorithm with the energy predicting technique and the hybrid method. analysis of experiments with existing nodes offered in Section 4, and we finally in Section 5 summarize and discuss the scheme.</p></sec><sec id="s2"><title>2. Related Work</title><sec id="s2_1"><title>2.1. LEACH</title><p>One of the most famous hierarchical routing protocols based on clustering, is the LEACH protocol. In this method, each cluster members send their data to cluster head. The cluster head aggregate this data and send to the BS. So the communication cost is reduced. <xref ref-type="fig" rid="fig1">Figure 1</xref> describes this concept:</p><p>The operation of cluster forming and data transmission in LEACH is done in two phases that these phases shown in Figures 2 and 3:</p><p>Setup phase is the stage of forming cluster and cluster head. At this stage, cluster and cluster head randomly selected. After forming the cluster, cluster head propagate TDMA<sup>6</sup> scheduler to specify the data transfer time to member nodes. Then the steady-state phase started. In the steady-state phase, each member node in cluster send data to the cluster head only in its time slot and at the rest of time pieces for energy conservation goes to sleep mode.</p><p>In this method, the cluster head consumes more energy for receiving, processing and directly sending this data to the BS node. So for increasing the life time of the network it is necessary to replace role of cluster head between network nodes. Many improvements over the LEACH method have been provided that in these improvements firstly, as far as possible the best clustering and cluster head selection is done, secondly possible as possible overhead of the protocol is to be reduced. LEACHC method is an example of these improvements.</p></sec><sec id="s2_2"><title>2.2. LEACH-C</title><p>In LEACH-C, clusters forming in the beginning of every period are done, using the centralized algorithm by the base station. The base station uses received information from nodes that includes energy and node status, uses</p><p>this information during the setup phase for finding predetermined number of cluster heads and network configuration within the clusters. Next classification of nodes in the clusters is done to minimize energy consumption in order to transfer their data to the related cluster head. Results show that LEACH-C overall performance is better than LEACH because of the optimal forming of clusters by the base station. In addition, the number of cluster heads in each period of LEACH-C is equal to the expected optimal value. While in LEACH the number of cluster heads varies in different periods because of lack of global coordination.</p><p>As in LEACH-C at the beginning of every period energy of nodes must be sent to BS, therefore nodes early discharged and the network lifetime reduces. Another improvement on this algorithm is the use of energy estimation. The LEACH-CE method is an example of these methods.</p></sec><sec id="s2_3"><title>2.3. LEACH-CE</title><p>In the LEACH-CE method, the energy level of all nodes collected only in two primary periods and not be collected in other periods. Instead because of knowing information about energy level of nodes, we can calculate energy consumption average for each node by using information of two primary periods. This means that reducing calculated energy level from the energy level of node, causes predicting current energy level of node. The problem of this algorithm is that firstly energy estimation is not done precisely and secondly if nodes have correlated data, while not sending correlated data means that previous data is valid, so this plan of clustering is not suitable and efficient.</p></sec><sec id="s2_4"><title>2.4. TINA</title><p>Phenomenon that observed by sensor nodes, continually change in time. Therefore information received by the nodes is correlated on each other. These cases for physical phenomena that are continuous or repetitive, or in an application that the accuracy is not too important, or in a network that node density in a region is high, have seen more. There are two types of data correlation: 1) spatial correlation; 2) Time correlation.</p><p>In the spatial correlation, aggregation is done within the network by cluster heads. This is one of the proposed methods to reduce energy consumption. So the nodes that have correlated data send them to cluster head and cluster head after aggregating these data send to the base station. This causes to prevent waste of energy. This method has been implemented in LEACH protocol.</p><p>But in the time correlation, each node can have correlated data in successive times. Mohamed and Sharaf proposed the TINA algorithm. The main idea of TINA algorithm was that the sensor nodes send their data only when this data differ with previous data otherwise goes to sleep mode. This algorithm has a reporting error. There is an improvement to this algorithm that presented below.</p></sec><sec id="s2_5"><title>2.5. Improvement of TINA</title><p>In this method, the sensor node decides to send data by comparing the value of newly sampled data with last reported data. However, sensor nodes maintain last reported data. For better understanding, we describe this section with an example. Suppose that a given sensor node that received data are 1.0, 0.95, 1.05, 0.95, 1.05 respectively. A threshold has been considered that data changing to this threshold is not important. The value of this threshold considered equal to 10%. First given data that is equal to 1.0 successfully sent and in the next period 0.95, will not be sent when: <img src="9-6101078\3ad186dc-d39f-46fd-bb18-8e0c55b7f0be.jpg" /></p><p>Otherwise that will be sent. In the third stage <img src="9-6101078\0abc946b-2bb7-410d-8114-30cf3418278a.jpg" />that will not sent and in the fourth stage <img src="9-6101078\53d9bba1-43e8-407f-b75a-cf00b3459cd8.jpg" /> that will not sent. This method is suitable when phenomenon changes have not a lot of swing or any special event in the network is done. But as mentioned previously most of phenomenon change continuously with time. So most of data are in ascending or descending mode in the time slices. Or in an application such as temperature for example in a certain time slot occurs a specific event. So the proposed methods have errors and are not suitable. We offer a method to improve this algorithm and prevent the waste of energy. In addition, the problem of data time correlation is not considered in proposed protocols. Therefore, we will check the time correlation of data in the proposed algorithms.</p></sec></sec><sec id="s3"><title>3. The Proposed Method</title><sec id="s3_1"><title>3.1. Presentation of Methods</title><p>Three ideas are proposed here: 1) the data time correlation; 2) Energy estimation model of nodes and 3) the hybrid method.</p><p>In the data time correlation algorithm, Time Series Forecasting method (TSF7) used to decide sending or not sending of data. Then in time t in the beginning of each period, base station send percentage of error e(t) to all nodes. First data sensed by node and sent. Second and third and fourth data sent based on the improved TINA algorithm. Then the node runs time series function to determine the value of prediction of trend line model, to create trend model. In the next times the sensed data compared with predicted value of trend model, if the difference between these two values exceeds a threshold value, data sent to the given node and the node recalculate prediction function of trend model to update the trend line. Otherwise, the sensor node does not send the sensed data with this insurance that sensed data placed in accuracy range of data. So only some data have to be sent that are very different from the trend line model. This help to prevent energy loss.</p><p>We call the node energy prediction model LEACHCEC8 and describe as follow. For doing the best clustering, that is needed to know energy of the nodes. The estimation method is a method that has low cost and is suitable. We also use the energy estimation method. For this, we divided LEACH-C protocol to three phases. Topology building phase, setup-state and steady-state. In the first phase nodes send their position to the BS. Then BS creates network topology based on these positions. Once the topology was formed in the base station, base station node calculates the distance of nodes to each other. The BS calculates the amount of energy used in each node in the setup-phase, using a simple mathematical model. Then deduct this amount from primary energy and calculates its remaining energy. Finally do the clustering and goes to the steady-state phase. In this phase for each node, the data time correlated algorithm applied according to the following method.</p><p>BS node should be informed of data time correlation in nodes to estimate precisely energy of them. Therefore cluster head create a table that containing list of all members of the cluster. Cluster head registers every node in to the table that have correlated data and do not sent in certain times. In the end of each period, cluster head sends this table with collected data to the base station. This table contains nodes ID and number of times that these nodes not sent data. Base station uses this information for clustering decisions in centralized methods. Ultimately that cause to energy estimation in centralized methods is more carefully while the best clustering is created and the network lifetime increases. So in total lifetime of the network, first phase has done once but setup and steady-state phases done as in LEACH-C.</p></sec><sec id="s3_2"><title>3.2. Process of Proposed Methods</title><sec id="s3_2_1"><title>3.2.1. Linear Prediction Method Using Time Series</title><p>Linear prediction method is a powerful technique to predict time series in an environment changing with time. Suppose that you want to contact an independent variable x and a dependent variable y to specify. If we assume that the true relationship between these variables in a straight line and the value observed for each value of y for every given x is a random variable then we can wrote:</p><disp-formula id="scirp.17500-formula151064"><label>(1)</label><graphic position="anchor" xlink:href="9-6101078\45cf764c-6be1-4b04-8435-4915664a66c2.jpg"  xlink:type="simple"/></disp-formula><p>where in this equation <img src="9-6101078\9b7b57e0-3995-485e-a87a-cc59d72050ac.jpg" /> is the width from the origin and <img src="9-6101078\2583e772-4d88-43b5-bf5f-15f11b08eebd.jpg" /> is the slope of the line that is unidentified fixed values. Observed value y can be described with the following equation where the error ε created because of not conforming real value to the amount of predicted value.</p><disp-formula id="scirp.17500-formula151065"><label>(2)</label><graphic position="anchor" xlink:href="9-6101078\840cfac2-834d-45b2-9661-03e4de99d56b.jpg"  xlink:type="simple"/></disp-formula><p>This pattern is usually named a simple linear regression model. Because that has only one independent variable so that the x independent variables called prediction variable and y called the response variable. Prediction and response variables x and y can be time series in which case we have a time series regression pattern.</p><p>There are several methods to estimate unknown parameters<img src="9-6101078\5eb3593d-2634-49df-8a96-7b09f7e415f9.jpg" />, <img src="9-6101078\bdbaf60e-903a-4b43-af1d-cf5eb47976b3.jpg" />in equation (2) that can be used. One method that a lot used is the least square error method in which the<img src="9-6101078\839822ac-32d0-44f5-8a4e-eebbae06929d.jpg" />, <img src="9-6101078\d2532992-d650-48ae-a5a4-8ad017d3f15f.jpg" />estimates obtained from minimizing sum of squares errors or remaining’s. Suppose we have n observations of (x<sub>1</sub>, y<sub>1</sub>), (x<sub>2</sub>, y<sub>2</sub>) &#183;&#183;&#183; (x<sub>n</sub>, y<sub>n</sub>). A model that is based on these observations is written as follows:</p><disp-formula id="scirp.17500-formula151066"><label>(3)</label><graphic position="anchor" xlink:href="9-6101078\433abb60-6d12-47fc-937e-e8326813d0c0.jpg"  xlink:type="simple"/></disp-formula><p>And the total square error is as the follow:</p><disp-formula id="scirp.17500-formula151067"><label>(4)</label><graphic position="anchor" xlink:href="9-6101078\2b1ee4d4-f297-4fe9-bf75-d6c002a9b909.jpg"  xlink:type="simple"/></disp-formula><p>So the total square error is simply the total squares of deviations observed y<sub>i</sub> and<img src="9-6101078\df8ec477-db81-4533-b7df-722ec5f63a67.jpg" />. The estimated values of <img src="9-6101078\1ff1115f-463d-4f05-a7ba-c797c5a29c25.jpg" /> and <img src="9-6101078\483127eb-aa88-4b90-9a12-5ba0f86f518e.jpg" /> that we call them <img src="9-6101078\61921723-97be-48e3-b20e-126a84c61261.jpg" /> and <img src="9-6101078\e4e16ae6-fe9e-488b-9a74-25f779f6fdd8.jpg" /> that achieved using the least squares method by minimizing <img src="9-6101078\17e43be6-31b2-4a50-a743-dd8d0cd26d24.jpg" /> toward <img src="9-6101078\f66b9fcf-bf17-4bab-a28e-68db2982c98c.jpg" /> and <img src="9-6101078\96a90580-c1bd-4fce-9c41-b18aa3e0d05e.jpg" /> so we can write:</p><p><img src="9-6101078\6d796308-c320-447c-a397-78636cf2a83c.jpg" /></p><p><img src="9-6101078\661e5241-7a35-4e30-b57a-f83c46788a70.jpg" /></p><p>This system of equations called least squares line normal equations system that simplified as the following:</p><p><img src="9-6101078\afcc732b-bbcd-458a-8ac8-bb5ed74bfaf3.jpg" /></p><p><img src="9-6101078\8e31d807-e61b-44bf-94d7-d03735575be8.jpg" /></p><p>By solving this system <img src="9-6101078\c7b58ff4-2ca0-4bfd-aba0-7d9c35ce30ff.jpg" /> and <img src="9-6101078\312ee433-b885-4705-b648-9480878add57.jpg" /> estimates or on the other hand <img src="9-6101078\0f2e6510-357c-4dd3-9f81-f8c0a69e3278.jpg" /> and <img src="9-6101078\c71734ee-4e4f-4461-9347-a0129a6b565d.jpg" /> obtained and so:</p><p><img src="9-6101078\4f27475e-0361-45db-8b4b-e0ac5994a26a.jpg" /></p><p><img src="9-6101078\89f9ff65-87b6-4d59-8255-05f6577d36e8.jpg" /></p><p>where <img src="9-6101078\dd08411b-172d-4a8c-8fdb-d50bb6fcf60d.jpg" /> and<img src="9-6101078\ce72be79-1215-4cb2-8e78-00e83cfc7c93.jpg" />. So the fitted simple linear regression model is as follows:</p><p><img src="9-6101078\c9f033c1-66bf-493e-9f37-0f49602d1746.jpg" /></p><p>For each value of the predicted variable x we can obtain corresponding value predicted response from this equation. The fitted values of <img src="9-6101078\ee38befe-6400-4101-9e21-06f92c5e277c.jpg" /> corresponding to observed values <img src="9-6101078\f2b11416-a2b8-4479-9084-d2e4aee14d84.jpg" /> for every i = 1, 2, &#183;&#183;&#183;, n is the following:</p><p><img src="9-6101078\29155913-606e-4625-99a2-eb8f4063ebd9.jpg" /></p><p>The difference between ith given fitted value to an observed yi value called a residue so that:</p><disp-formula id="scirp.17500-formula151068"><label>(5)</label><graphic position="anchor" xlink:href="9-6101078\f552464e-3f23-4d4f-86ed-71bea29e4c68.jpg"  xlink:type="simple"/></disp-formula><p>If the fitted model regression for data is appropriate, in this case remains do not follow of appropriate form. There is no clear pattern for the remains.</p><p>This method stated by equation (11) that is a recursive method [<xref ref-type="bibr" rid="scirp.17500-ref9">9</xref>]:</p><p>Linear prediction method is a powerful technique for predicting time series in a time-varying environment. This method is expressed in equation (6) and is a recursive method [9,10]:</p><disp-formula id="scirp.17500-formula151069"><label>(6)</label><graphic position="anchor" xlink:href="9-6101078\2e473ed2-2bb0-4484-90b0-680968e9afc8.jpg"  xlink:type="simple"/></disp-formula><p>Estimated value at time t as a linear function of previous values in the times “t − T, t − 2T, &#183;&#183;&#183;, t − mT” has been produced is obtained. In Equation (1) a<sub>1</sub>, a<sub>2</sub>, &#183;&#183;&#183;, a<sub>m</sub> are the linear prediction coefficients, “m” is the model degree, “T” is the sampling time, y(t + T) is the next observation estimation and y(t), y(t − T), …, y(t − mT) are the present and past observations. The prediction error which is the difference between the predicted and the real values (Equation (7)) must be minimized.</p><disp-formula id="scirp.17500-formula151070"><label>(7)</label><graphic position="anchor" xlink:href="9-6101078\97bcd2f5-cfdb-4a8d-a62b-52aad15bec06.jpg"  xlink:type="simple"/></disp-formula><p>In order to estimate the coefficients of linear prediction model we use the least squares error method and rewrite Equation (6) with considering modeling error in equation (8):</p><disp-formula id="scirp.17500-formula151071"><label>(8)</label><graphic position="anchor" xlink:href="9-6101078\7f4aba8c-2fcd-427e-aa2f-5dec0ff0f5be.jpg"  xlink:type="simple"/></disp-formula><p>The error e(t) is generated because of not adopting the linear prediction model to the real value. So to find the coefficients, a<sub>1</sub>, a<sub>2</sub>, &#183;&#183;&#183;, a<sub>m</sub> in Equation (8) we use the sum least squares error and set of linear functions. Presented in Equation (9).</p><disp-formula id="scirp.17500-formula151072"><label>(9)</label><graphic position="anchor" xlink:href="9-6101078\5c55f0df-7b84-406f-8615-13761156dfae.jpg"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.17500-formula151073"><label>(10)</label><graphic position="anchor" xlink:href="9-6101078\d7e1e5a3-ccd1-4f36-bbe5-942096dccca4.jpg"  xlink:type="simple"/></disp-formula><p>Elements in the matrix A are the coefficients which can be found by least squares error method (11):</p><disp-formula id="scirp.17500-formula151074"><label>(11)</label><graphic position="anchor" xlink:href="9-6101078\2db02877-4c14-4acd-80e7-7925bd46ec18.jpg"  xlink:type="simple"/></disp-formula><p>In Equation (11), φ<sup>T</sup>, is the transpose of the matrix φ, and (φ<sup>T</sup>φ) is the inverse of matrix. In practice:</p><p>• If m is chosen larger than is required (i.e. over-estimation of the model order), Equation (11), cannot be solved for any unique set of coefficients, because of some columns in the matrix φ, are not independent of each other. Hence φ<sup>T</sup>φ would be unique and will not have inverse. This means that the system of equations in Equation (8) will have an infinite number of answers for the coefficients. Geometrically speaking, it is like fitting an infinite number of lines to a single point which is not the preferred case.&#160;&#160;&#160;&#160;&#160;</p><p>• If m is chosen less than the required value, the number of independent equations would be more than the number of unknown variables (a<sub>1</sub> – a<sub>m</sub>). Such a system of equations has to be solved for the best approximation of coefficients. The best approximation for coefficients (a<sub>1</sub> – a<sub>m</sub>) is the use of the least squares error method.</p><p>Obviously, that is how much precision, the number of the data sent will increase and vice versa.</p></sec><sec id="s3_2_2"><title>3.2.2. The Algorithm Used to Estimate the Energy by Transferring Topology to the Base Station</title><p>As previously mentioned, LEACH-C protocol divided to three phases: topology building phase, setup-state and steady-state and this algorithm is proposed as following. These three phases is called LEACH-CEC.</p></sec><sec id="s3_2_3"><title>a) Topology building phase</title><p>1) Start of network.</p><p>2) Base station receives position of all nodes that contain x and y.</p><p>3) Base station calculates distance of all nodes with each other’s using the following Formula (12):</p><disp-formula id="scirp.17500-formula151075"><label>(12)</label><graphic position="anchor" xlink:href="9-6101078\16917295-ff78-4182-b83b-57e84a08fd52.jpg"  xlink:type="simple"/></disp-formula><p>4) End of phase.</p></sec><sec id="s3_2_4"><title>b) Setup-state phase</title><p>5) If clustering has changed 5.1) BS calculate energy consumption per node using the Formulae (13) and (14):</p><disp-formula id="scirp.17500-formula151076"><label>(13)</label><graphic position="anchor" xlink:href="9-6101078\4ef7bf8f-1793-4575-b14b-35f864ab720d.jpg"  xlink:type="simple"/></disp-formula><p>and</p><disp-formula id="scirp.17500-formula151077"><label>(14)</label><graphic position="anchor" xlink:href="9-6101078\4807a05f-0399-4137-8a50-1476f58a6c4e.jpg"  xlink:type="simple"/></disp-formula><p>And after computing, place the sum of two variables in to the<img src="9-6101078\2434c759-f61e-4a77-81de-339621b38e25.jpg" />.</p><p>5.2) Base station calculates remaining energy (E<sub>R</sub>) of each node using the Formula (15).</p><disp-formula id="scirp.17500-formula151078"><label>(15)</label><graphic position="anchor" xlink:href="9-6101078\d0e288c5-4ac1-49d6-ac61-df421f86cf5b.jpg"  xlink:type="simple"/></disp-formula><p>5.3) BS audit the information received from CHs, if the nodes name with the number of duplicate data is there, considered in terms of computing nodes remaining energy.</p><p>5.4) If <img src="9-6101078\1d945c00-26b5-44ad-8e2f-0d3ae8a2a277.jpg" /> the node is dead.</p><p>5.5) Otherwise the node is alive and participates in Clustering.</p><p>6) Clustering is formed.</p><p><img src="9-6101078\6aa4f547-72bb-4883-add8-f65501c69196.jpg" />: The amount of consumed energy by node i in time t.</p></sec><sec id="s3_2_5"><title>c) Steady-state phase(Hybrid method)</title><p>7) If node n is a cluster head.</p><p>7.1) Cluster head creates a table containing node name and the number of correlated data.</p><p>7.2) Cluster head collects data from the members of the cluster.</p><p>7.3) If the cluster head has not received data from its member node, registers name of that node in the table and a unit to be added to the number of correlated data. Cluster head sends the table with correlated data to the base station.</p><p>8) Otherwise, that is a member node.</p><p>8.1) If the node is in turn on then the node runs correlated data algorithm in figure 4.</p><p>8.2) Otherwise goes to sleep mode.</p><p><img src="9-6101078\095b0cd5-a07b-44a8-a055-bd5cd17ebded.jpg" />: Strengthening the energy to transferring 1 bit data in distance d.</p><p><img src="9-6101078\ba4f0e76-a944-414e-a4b9-61ad449fa26f.jpg" />: Radio energy of amplifier.</p><p><img src="9-6101078\777b0bf8-6ea3-48a9-b0ea-6bfe3ca3b755.jpg" />: Radio energy of amplifier.</p><p>d: distance between receiver and sender l: length of data package.</p><p><xref ref-type="table" rid="table1">Table 1</xref> should be created by the cluster head in the steady-state phase. To explain this part, consider a cluster with node numbers 2, 3, 5, 6, 7, 14, and 15 are chosen. Assume that the node 7 is cluster head and the rest is member nodes. If nodes 2 and 5 respectively, each have 2 and 1 correlated data, node 7 as cluster head must create <xref ref-type="table" rid="table1">Table 1</xref>. This table must be dynamic and at the end of each period must be sent with correlated data.</p></sec></sec></sec></body><back><ref-list><title>References</title><ref id="scirp.17500-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">S. Bandyopadhyay and E. Coyle, “An Energy Efficient Hierarchical Clustering Algorithm for Wireless Sensor Networks,” Proceedings of IEEE INFOCOM, San Francisco, 30 March 2003. </mixed-citation></ref><ref id="scirp.17500-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple"> 
M. Ali and S. K. Ravula, “Real-Time Support and Energy Efficiency in Wireless Sensor Networks,” School of Information Science, Computer and Electrical Engineering Halmstad University, Halmstad, 2008. </mixed-citation></ref><ref id="scirp.17500-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple"> 
A. A. Abbasi and M. Younis, “A Survey on Clustering Algorithms for Wireless Sensor Networks,” Elsevier B.V, Amsterdam, 2007. </mixed-citation></ref><ref id="scirp.17500-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple"> 
M. Demirbas and H. Ferhatosmanoglu, “Peer-to-Peer Spatial Queries in Sensor Networks,” Proceeding of 3rd IEEE International Conference on Peer-to-Peer Computing (p2p ‘03), Linkopings, 11 August 2003, pp. 32-39. </mixed-citation></ref><ref id="scirp.17500-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple"> 
W. B. Heinzelman, A. P. Chandrakasan and H. Balakrishnan, “An Application-Specific Protocol Architecture for Wireless Microsensor Networks,” IEEE, Vol. 1, No. 4, 2002, pp. 660-670. </mixed-citation></ref><ref id="scirp.17500-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple"> 
S. D. Muruganathan and D. C. F. Ma, “A Centralized Energy-Efficient Routing Protocol for Wireless Sensor Networks,” IEEE, Vol. 43, No. 3, 2005, pp. 8-13. </mixed-citation></ref><ref id="scirp.17500-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple"> 
W. B. Heinzelman, A. P. Chandrakasan and H. Balakrishnan, “Application-Speci_c Protocol Architectures for Wireless Networks,” IEEE Transactions on Wireless Communications, Vol. 1, No. 4, 2002, pp. 1-154.  
doi:10.1109/TWC.2002.804190</mixed-citation></ref><ref id="scirp.17500-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple"> 
X. Wang, J.-J. Ma, S. Wang and D.-W. Bi, “Time Series Forecasting for Energy-efficient Organization of Wireless Sensor Networks,” MDPI, www.mdpi.org/sensors. </mixed-citation></ref><ref id="scirp.17500-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple"> 
W.-P. Chen, J. C. Hou and L. Sha, “Dynamic Clustering for Acoustic Target Tracking in Wireless Sensor Networks,” Proceedings of the 11th IEEE International Conference on Network Protocols, Atlanta, 4-7 November 2003, pp. 284-294. 
doi:10.1109/ICNP.2003.1249778</mixed-citation></ref><ref id="scirp.17500-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple"> 
C.-K. Liang, Y.-J. Huang and J.-D. Lin, “An Energy Efficient Routing Scheme in Wireless Sensor Networks”, 22nd International Conference on Advanced Information Networking and Applications, IEEE, Singapore, 22-25 March 2008. </mixed-citation></ref><ref id="scirp.17500-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple"> 
P. Hu, Z. Zhou, Q. Liu and F. M. Li, “The HMM-Based Modeling for the Energy Level Prediction in Wireless Sensor Networks,” Proceeding of the 2007 2nd IEEE Conference on Industrial Electronics and Applications, Harbin, 23-25 May 2007. </mixed-citation></ref><ref id="scirp.17500-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple"> 
A. S. Mohamed and B. Jonathen, “TINA: A Scheme for Temporal Coherency-Aware In-Network Aggregation,” Proceedings of the 3rd ACM International Workshop on Data Engineering for Wireless and Mobile Access, San Diego, 19 September 2003, pp. 69-76. </mixed-citation></ref><ref id="scirp.17500-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple"> 
X. H. Dai, F. Xia and Z. Wang, “An Energy-Efficient In-network Aggregation Query Algorithm for WSN,” IEEE, Beijing, 30 August 2006, pp. 255-258.</mixed-citation></ref></ref-list></back></article>