<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JCC</journal-id><journal-title-group><journal-title>Journal of Computer and Communications</journal-title></journal-title-group><issn pub-type="epub">2327-5219</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jcc.2019.72004</article-id><article-id pub-id-type="publisher-id">JCC-90612</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Energy-Efficient MTC Data Offloading in Wireless Networks Based on K-Means Grouping Technique
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Juma</surname><given-names>Saidi Ally</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Muhammad</surname><given-names>Asif</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Qingli</surname><given-names>Ma</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Department of Information and Communication Engineering, University of Science and Technology of China,<br/> 
Key Laboratory of Wireless-Optical Communications, Chinese Academy of Sciences, Hefei, China</addr-line></aff><pub-date pub-type="epub"><day>29</day><month>01</month><year>2019</year></pub-date><volume>07</volume><issue>02</issue><fpage>47</fpage><lpage>61</lpage><history><date date-type="received"><day>17,</day>	<month>October</month>	<year>2018</year></date><date date-type="rev-recd"><day>17,</day>	<month>February</month>	<year>2019</year>	</date><date date-type="accepted"><day>20,</day>	<month>February</month>	<year>2019</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  Machine-type communication (MTC) devices provide a broad range of data collection especially on the massive data generated environments such as urban, industrials and event-enabled areas. In dense deployments, the data collected at the closest locations between the MTC devices are spatially correlated. In this paper, we propose a k-means grouping technique to combine all MTC devices based on spatially correlated. The MTC devices collect the data on the event-based area and then transmit to the centralized aggregator for processing and computing. With the limitation of computational resources at the centralized aggregator, some grouped MTC devices data offloaded to the nearby base station collocated with the mobile edge-computing server. As a sensing capability adopted on MTC devices, we use a power exponential function model to compute a correlation coefficient existing between the MTC devices. Based on this framework, we compare the energy consumption when all data processed locally at centralized aggregator or offloaded at mobile edge computing server with optimal solution obtained by the brute force method. Then, the simulation results revealed that the proposed k-means grouping technique reduce the energy consumption at centralized aggregator while satisfying the required completion time.
 
</p></abstract><kwd-group><kwd>Machine-Type Communication</kwd><kwd> Correlation</kwd><kwd> Data Offloading</kwd><kwd> Grouping Technique</kwd><kwd> Differential Entropy</kwd><kwd> Power Exponential Function</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Market trends of Machine-Type Communication (MTC) devices deployments are exponential increases. This is determined by many applications and services discovered such automatic industries monitoring, smart metering, surveillance cameras, environment monitoring and trace devices [<xref ref-type="bibr" rid="scirp.90612-ref1">1</xref>] . Additionally, according to a broadly used report by Cisco VNI in 2016 up to 2020 the traffic generated from MTC devices expected to reach 49 exabytes [<xref ref-type="bibr" rid="scirp.90612-ref2">2</xref>] . MTC usually includes a large number of devices deployed randomly in a hostile and highly dynamic environment. Since the data collected coverage and network coverage (communication range) are normally constant at randomly allocated, a high density of the redundancy devices is used to conserve the preferred level of network coverage in order to accomplish enough data collection. Addition, MTC devices are normally the event-driven systems where various devices try to send data at any event of interest occurred [<xref ref-type="bibr" rid="scirp.90612-ref3">3</xref>] . Since the MTC devices are densely deployed, then the data transmitted from devices to the aggregator (sink) or base station for computational are spatially correlated.</p><p>Therefore, the wasted of the computational resources occurred at CA when the data collected from different MTC devices are processing independently. The significant amount of computational resources can be saved by considering the advantages of existence of a spatial correlation.</p><p>However, the growth of the data traffic collected by MTC devices increase the pressure on the mobile operators specifically for the delay-sensitive applications, which requires a very short time to be processed. Several approaches are proposed to deal with that challenge such as edge computing, data offloading and data caching [<xref ref-type="bibr" rid="scirp.90612-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.90612-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.90612-ref6">6</xref>] , where the data computation and terminal requests move very close to the data source. Besides, it attempts to reduce the pressure on the mobile operators regarding with the limits of data generated from the terminal devices such as MTC devices, but still encounters the problem of data overloading when the amount of data generated are increases. That amount of data required to be analyzed. Recently, the big data analysis implemented in cloud or enterprises centralized data centers to analyze the data generated from terminal devices. Moreover, data aggregation technique proposed in [<xref ref-type="bibr" rid="scirp.90612-ref7">7</xref>] where the network congestion originated from the massive data generated by MTC devices reduced. In addition, the data aggregator act as intermediate nodes on the cellular networks can further help minimize the power consumed and transmission delay by MTC devices while transmitting data to the networks [<xref ref-type="bibr" rid="scirp.90612-ref8">8</xref>] .</p><p>Generally, the MTC devices are deployed to perform specific tasks collectively; the data collected from each device are not completely independent rather correlated. Thus, in such case to avoid the resource wasted for the individual device processing at the aggregator, the CA combines the correlated devices together to form a group [<xref ref-type="bibr" rid="scirp.90612-ref9">9</xref>] . In this paper, we use the centralized aggregator (CA) as a sink or nearby devices computing the data collected from MTC devices in the event-based area. By examining the existence of correlated between the MTC device we propose k-means grouping technique to combine all the spatial correlated devices together. With the limited computational resources at CA, some data will be offloaded to the nearby server allocated to the base station called mobile edge computing server. We investigate the energy consumption at the CA by computing the optimal solution using the brute force method. The main contributions of this paper described on the following:</p><p>• We define the correlation model to compute the correlation coefficient between the MTC devices by considering the device coordinate points on the hyperplane.</p><p>• We propose device k-means grouping technique to groups MTC devices based on spatial correlation around the data collected coverage.</p><p>• We use the differential entropy framework to compute the size of the data in each group.</p><p>• Based on the size of the data we introduce the concepts of the data offloading and we define the optimization problem for minimizing the energy consumption on CA for processing the data collected from MTC devices.</p><p>• We use the brute force method to find the optimal solution for total energy consumption on the centralized aggregator.</p><p>The remaining part of this structured as follows. Related work discussed in Section 2. In Section 3, we discuss the network model that includes the network model together with details of the correlation model and the proposed k-means grouping technique. The Section 4 present the theoretical concepts of computation together with the optimization problem used to minimize the energy consumption and Section 5 discuss the results used to verify our context and lastly we conclude by summarizing the idea presented on whole paper.</p></sec><sec id="s2"><title>2. Related Work</title><p>There are many emerging works exist in MTC communication and wireless networks. In [<xref ref-type="bibr" rid="scirp.90612-ref10">10</xref>] authors proposed the medium access control (MAC) protocol with low latency and energy for hierarchical structured of M2M networks. It permits the effective data originated from the terminal nodes transmitted to a sink node via a cluster head. The cloud based lightweight for mobile core networks investigated by [<xref ref-type="bibr" rid="scirp.90612-ref11">11</xref>] that improve the challenges MTC occurred from system overload and network congestion .Similarly extended access barring (EAB) in the LTE-A studied in [<xref ref-type="bibr" rid="scirp.90612-ref12">12</xref>] to deals with the problem of overloaded that originated from a massive number of MTC devices that compete to access networks in a limited burst. The efficiency of data aggregation for M2M network discussed in [<xref ref-type="bibr" rid="scirp.90612-ref13">13</xref>] with the goal of reducing the total energy expenditure together with coverage probability.</p><p>Conventionally, to describe the concept of computational capability, a mobile cloud computing (MCC) have been widely introduced [<xref ref-type="bibr" rid="scirp.90612-ref14">14</xref>] . However, a cloud allocated far away from terminal devices cannot offer an extensive solution for the low latency services (emergence applications), real-time applications and regular data transmission such as location updates. Therefore, to allow terminal devices communicate directly with cloud will not be feasible and cost-effective because it needs more resources allocation and complex routing protocol design. To deal with these challenges a mobile edge computing technique proposed in [<xref ref-type="bibr" rid="scirp.90612-ref15">15</xref>] . In MEC, the computation resources allocated near to the terminal devices in wireless cellular networks that helps reduce the delay and energy consumption especially for the delay-sensitive applications [<xref ref-type="bibr" rid="scirp.90612-ref16">16</xref>] . Then, the energy consumption and computation defined as jointly optimization problem in [<xref ref-type="bibr" rid="scirp.90612-ref17">17</xref>] . MTC devices use the virtual networks to handle the computing tasks and other tasks are offloaded to the MEC server through wireless cellular networks. A cooperative data aggregation algorithm developed in [<xref ref-type="bibr" rid="scirp.90612-ref18">18</xref>] for MTC data offloading with help of the multiple trusted mobile relays. The result shows the proposed algorithm prolong the lifetime of MTC devices battery.</p><p>Besides the MTC devices have sensing capability, implied the correlation between adopted as in wireless sensor networks. In [<xref ref-type="bibr" rid="scirp.90612-ref19">19</xref>] authors propose clustered aggregation technique to reduce the number of transmission from the sensor by utilizing the existing of the spatial correlation. Similarly the correlation characteristic of the spatial information is taken as advantages to eliminate the network redundancy [<xref ref-type="bibr" rid="scirp.90612-ref20">20</xref>] . Furthermore, the compression between the correlation MTC sources based on distributed source coding discussed in [<xref ref-type="bibr" rid="scirp.90612-ref21">21</xref>] . The three clustering algorithms such as grid dividing alorithm, Weighted Pair Group Method with Arithmetic Mean and K-medoids proposed to balance the tradeoff between compression ratio and decoding delay. In this paper, we study the existence of the correlation when the MTC devices are spatial distributed in the events-based area collects the data and then transmit to the Centralized Aggregator for computation.</p></sec><sec id="s3"><title>3. Network Model and Descriptions</title><p>In this section, we describe the details about the network model together with theoretical part of used correlation model and proposed k-means grouping technique.</p><sec id="s3_1"><title>3.1. Network Model</title><p>We consider the network model having the set of MTC devices D = { d 1 , d 2 , ⋯ , d M } that consists of sensing ability shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>. The MTC devices are randomly distributed on the region to collects the data and then send to nearby Centralized Aggregator (CA) for the computing. Due to the limits of computational resources on the CA can’t save all the data sends from nearby MTC devices, then some required computation data are offloaded to the Base Station that collocated with mobile edge computing server (MEC) for more processing. After the computation to either CA or MEC, the results will be feedback to the corresponding MTC device for future action. From the physical nature of the MTC devices are randomly located on the plane; then the data collected by proximally devices can have higher degree of the correlation. With that case we analysis the presence of MTC devices correlation on the next subsection.</p></sec><sec id="s3_2"><title>3.2. Description of Correlation Model</title><p>The idea of a data correlation in computing is to reduce the size of data collected</p><p>by MTC devices by evaluating the existence of data similarity or data dependency. Since the CA wastes the computational resources for processing the individual device that contains the same data independently, therefore we adapt the data correlation model used on the Wireless Sensor Networks to evaluate the presence of correlation between the spatial correlated MTC devices. The correlation model verified by using the covariance function that decreases with Euclidean distance for 0 at l = ∞ and 1 at l = 0 , where l represents the Euclidean distance between the locations of MTC devices. Additionally, we assume the data collected from the MTC devices denoted as Y = { y 1 , y 2 , ⋯ , y M } have nature of the multivate Gaussian distribution having mean of μ and variance of σ . Hence, the covariance existence between the MTC device d 1 and d 2 calculated as on [<xref ref-type="bibr" rid="scirp.90612-ref22">22</xref>] using the pair coordinated points corresponding to the devices P 1 and P 2 as</p><p>cov ( d i , d j ) = σ i σ j c o r r ( d i , d j ) (1)</p><p>where σ i and σ j denotes to variance of devices d i and d j respectively. We can further improve the expression as</p><p>c o r r ( d i , d j ) = E ( d i , d j ) σ i σ j = K ϕ ( ‖ P i − P j ‖ ) = K ϕ ( l i , j ) (2)</p><p>where K ϕ ( . ) denotes a correlation function with given correlation parameter of ϕ and l i , j represents the existence distance between the device d i and d j . Furthermore, we assume, the value of K ϕ ( . ) related with data collected from MTC device d i and d j as y i and y j respectively. To determine the value of K ϕ ( . ) covariance function models proposed such as Rational Quadratic, Spherical, Power Exponential and Matern based on the structure of the correlation [<xref ref-type="bibr" rid="scirp.90612-ref23">23</xref>] . In this paper, we choose the Power Exponential model to express the correlation function existed between the MTC devices that expressed as</p><p>K ϕ ( l i , j ) = exp { − ( l i , j θ 1 ) θ 2 } (3)</p><p>where θ 1 &gt; 0 and θ 2 ∈ ( 0 , 2 ] represents the control parameters for a given correlation and the smoothness at given random region. For simplicity we use θ 2 = 2 and the correlation coefficient between the device d i and d j in short represented as ξ i , j becomes</p><p>K ϕ ( l i , j ) = ξ i , j = exp { − ( η l i , j 2 ) } (4)</p><p>where η = θ 1 − 2 represents the exponent that control correlation that exist between the devices. Hence, we can determine the correlation matrix of M MTC devices as</p><p>K c o r r = ( ξ 1 , 1 ξ 1 , 2 ⋯ ξ 1 , N ξ 2 , 1 ξ 2 , 2 ⋯ ξ 2 , N ⋮ ⋮ ⋮ ⋮ ⋯ ⋯ ξ i , j ⋯ ⋮ ⋮ ⋮ ⋮ ξ N , 1 ξ N , 2 ⋯ ξ N , N ) (5)</p><p>The above correlation matrix provides overall existed correlation between the devices with the value between 0 and 1. That’s mean when the value of ξ i , j = 0 the devices are located very far not correlated and similarly when ξ i , j = 1 the devices are very near to each other have highly data correlation.</p></sec><sec id="s3_3"><title>3.3. Device Grouping Technique</title><p>In utilizing the limited computational resources at CA, we use the grouping technique to groups all the MTC devices based on the coordinate points. Since all MTC devices in each group assume to be very close, then data collected by the individual device may be identical. Hence, the computation resources reduced on the CA by processing together the MTC in one group in either to the CA or MEC.</p><p>The proposed grouping technique of MTC devices consist of two part; the first we need to compute the distance or similarities existing between the two devices and then grouping the MTC devices using the clustering technique. In this paper, we compute the distance between the two devices according to the equations on the previous section and using the k-means clustering algorithm to group the MTC devices.</p><p>The k-means algorithm frequently used to partition set data automatically into K disjoint clusters or groups as described in [<xref ref-type="bibr" rid="scirp.90612-ref24">24</xref>] . With the given the MTC devices that are randomly distributed, the aim is to group into different clusters which are near to initialized centroids. We randomly deciding the number of groups as K and then we analyze the k-means technique according to Euclidean distance expressed as</p><p>l c ( P i − μ g ) = ‖ P i − μ g ‖ 2 (6)</p><p>where P i represents the coordinate point of device d i and μ g denotes the initialized coordinate point of centroid cluster. Then, the centroid cluster updated iteratively with associated devices as expressed on</p><p>μ g = 1 | Ω g | ∑ j ∈ Ω g P j (7)</p><p>where Ω g is the set of number of the MTC devices contained on group or cluster. The Algorithm 1 explain in details.</p><p>Algorithm 1. K-means grouping technique.</p></sec></sec><sec id="s4"><title>4. Computation and Problem Formulation</title><p>In the section, we describe the theoretical details of computational model and we define the energy consumption optimization problem regardless with the data processing decision which either locally at CA or offloaded to the remote at MEC server.</p><sec id="s4_1"><title>4.1. Computation Model for CA and MEC</title><p>The CA collocated near to the MTC devices that receive all the data collected from them. As we assume the MTC devices are randomly distributed, the CA uses the correlation framework to check whether the correlation between the devices existed. After that the devices grouping occurred. Then, the correlation matrix corresponding to the member of each group determined according to the Equation (5). With the data collected by each MTC device follows a multi-variate Gaussian distribution, then we can use the idea of information entropy to find the size of data on each group. Therefore, we adopt the differential entropy as explained in [<xref ref-type="bibr" rid="scirp.90612-ref25">25</xref>] with assumption that the data collected by MTC devices are quantized with the identical quantization size as Δ = { Δ 1 , Δ 2 , ⋯ , Δ M } . Then, the entropy from the multi-variate Gaussian distribution model expressed as</p><p>h ( Y ) = 1 2 log [ ( 2 π e ) S | K S | ] (8)</p><p>where | K S | represents the correlation matrix of any group with S number MTC devices. Then, the size data in each group based on correlation matrix modeled as entropy H ( Δ g ) corresponding to the data set Y Δ g = { y 1 Δ 1 , y 2 Δ 2 , ⋯ , y S g Δ S g } on each group evaluated as</p><p>H ( Δ g ) = 1 2 log [ ( 2 π e ∏ j = 1 S g ( Δ j ) 2 ) S g | K S g | ] (9)</p><p>where S g represents number of MTC devices presented in the group g. With the above expression, we obtain size of the input data for each group.</p><sec id="s4_1_1"><title>4.1.1. Local Execution Analysis</title><p>In the local computing, we consider the computation time and energy consumption for processing each group. For the computing process requires the computation capability of CA denoted by C l and CPU clock frequency represented as f l [<xref ref-type="bibr" rid="scirp.90612-ref26">26</xref>] . Then the computation time for processing group i expressed as T l i given by</p><p>T l i = C l χ ( i ) f l i (10)</p><p>where χ ( i ) represents the size of the input data on the group i that obtained from Equation (9). Furthermore, the energy consumption per each group i given by</p><p>E l i = κ ( f l i ) τ T l i (11)</p><p>where κ ( f l i ) τ represents the power coefficient at CA and κ is a constant that determined by architecture of capacity of the chip. The parameter τ represent frequency exponent with constant value of τ ≥ 2 . The frequently parameter used is approximate equal to 3 as illustrated by [<xref ref-type="bibr" rid="scirp.90612-ref27">27</xref>] , in this paper we define τ = 3 .</p></sec><sec id="s4_1_2"><title>4.1.2. MEC Execution Analysis</title><p>As we assume the CA has limits of the computational resources, then some groups are offloaded to MEC server for processing. In this scenario the computation time on MEC obtained by considering the transmission time of uplink, downlink and execution time to the MEC. In this paper, we ignore the transmission time of downlink because the data size remained after processing too small [<xref ref-type="bibr" rid="scirp.90612-ref28">28</xref>] . Then, the transmission time of uplink evaluated as</p><p>T t r i = χ ( i ) R (12)</p><p>where R represents the transmission rate between the CA and MEC which obtained as</p><p>R = W log ( 1 + λ P t r ) (13)</p><p>where W denotes the channel bandwidth reserved for CA and P t r is the transmission power of CA consumed to offloads to MEC and λ is the channel gains that normalized by the power of white Gaussian noise. Then, we can evaluate the energy consumption used for transmission as</p><p>E t r i = P t r T t r i (14)</p><p>In addition, we evaluate the computation time at MEC as</p><p>T s i = C s χ ( i ) f s (15)</p><p>where C s and f s represents the CPU capability and clock frequency of the MEC server respectively and χ ( i ) is the size of input data of the group i. Therefore, the total offloading time for CA, T o i obtained as</p><p>T o i = T t r i + T s i (16)</p></sec></sec><sec id="s4_2"><title>4.2. Problem Formulation</title><p>From the Equations obtained from previous section we formulate the optimization problem for total energy consumption at CA by considering the energy consumed when groups processed at CA plus the transmission energy when the groups offloaded to the MEC formulated as</p><p>min z i ∑ i = 1 K ( ( 1 − z i ) E l i + z i E t r i )</p><p>s.t</p><p>C 1 : ∑ i = 1 K ( 1 − z i ) T t r i ≤ T max</p><p>C 2 : ∑ i = 1 K z i T o i ≤ T max</p><p>C 3 : z i ∈ { 0 , 1 } , i = 1 , 2 , ⋯ , K (17)</p><p>where the constraints C1 and C2 represents limits of the completion time for processing the whole group and C3 denotes the decision computing variable for processing on each group either at CA or at MEC. T max is a maximum completion time to process all the MTC devices based on number of groups.</p><p>The optimization problem formulated above is an integer-programming problem, which we can solve using various heuristic approaches such as generic programming, dynamic programming, brute force or exhaustive search method and variable relaxation approaches (linear and semidefinite relaxation programming). Based on analyzing the performance of the proposed grouping technique we present the brute force approach to determine the optimal solution obtained from the finite number of iterations.</p></sec></sec><sec id="s5"><title>5. Performance Analysis and Evaluation</title><p>This section describes performance and results for the proposed scheme. To evaluate the performance of proposed scheme we use the parameters referred in [<xref ref-type="bibr" rid="scirp.90612-ref29">29</xref>] and the devices are randomly distributed at a 150 &#215; 150 cm region. We assume the CA is N900 as smartphone device CPU frequency of 600 &#215; 10<sup>6</sup> Cycle/sec, CPU capability equal to 650 &#215; 10<sup>6</sup> Cycle/bytes, computing power of 0.9 W and the value of κ obtained as</p><p>0.9 ( 600 &#215; 10 6 ) 3 = 4.1667 &#215; 10 − 27 J / cyc</p><p>Then, the value of a transmitted power of CA equal 1.012 W, normalized channel gain equal to 17 dB and the transmission bandwidth is 0.185 MHz. Addition, the CPU frequency and capability of MEC equal to 600 &#215; 10<sup>7</sup> Cycle/sec and 960 &#215; 10<sup>7</sup> Cycle/sec respectively. The values of quantization level range between minimum and maximum of 1/2 (1 bit) and 1/256 (8 bits) respectively with a degree of the correlation equal to 0.05. We evaluate the performance of the system by comparing the energy consumption when the data processed on the three different implementation scenario in terms of a number of MTC devices and the energy consumption comprises of both energy consumption for computation and for transmission. We simply choose the number of group or cluster equal to 4 and 8 to verify our proposed grouping technique.</p><sec id="s5_1"><title>5.1. Comparison Based on Computation Decision</title><p>The energy consumption succeeded by the proposed grouping technique as the number of MTC devices increases as illustrated in <xref ref-type="fig" rid="fig2">Figure 2</xref>(a) and <xref ref-type="fig" rid="fig2">Figure 2</xref>(b). With the different number of a group, the energy consumption obtained from optimal BFM outperforms the local computation (CA) at delay constraints while at the remote computing (MEC) performed better without satisfy the delay constraints. From the two figures as number group increases the energy consumption increases, which influenced by increases of the data size in the case of optimal BFM energy consumption almost 18% of the total energy increases between the two groups. Similarly in <xref ref-type="fig" rid="fig3">Figure 3</xref>(a) and <xref ref-type="fig" rid="fig3">Figure 3</xref>(b) compared the computation time with the number of MTC devices as illustrated the computation time achieved by the proposed technique as the number of MTC devices increases. Shown that, the computation time at the local computing (CA) was much small compared to the other two because the CA is very near to MTC devices. In addition, as the number of a group increases the optimal BFM approaches to remote computing (MEC). This implied that much number of groups are offloaded compared to the groups that processed at the local. We further investigate the tradeoff between energy consumption and computation time as shown in <xref ref-type="fig" rid="fig4">Figure 4</xref>(a) and <xref ref-type="fig" rid="fig4">Figure 4</xref>(b). From, the curves demonstrated that the data processing at the MEC consumed the small amount of energy compared to the CA but it consumes much time compare to processing at CA. With our applications that required the computing time becomes small, then the</p><p>local computing more favorable compared to offloading to remotely computing (MEC). Moreover as illustrated in a small number of groups the size of data processing is much small compared to a larger number of groups. The optimal BFM solution approaches at CA computing for grouping MTC devices into a small number of groups and at larger number of groups it approaches at MEC computing.</p></sec><sec id="s5_2"><title>5.2. Impact of Grouping Technique</title><p><xref ref-type="fig" rid="fig5">Figure 5</xref>(a) and <xref ref-type="fig" rid="fig5">Figure 5</xref>(b) illustrate the benefit of grouping technique based on the number of MTC devices. The result shows that when the number of the group increases the energy consumption and the computation time is increased, which means the size of the data processed increases and then, the optimal BFM solution for grouping technique outperforms compared with to processes un-grouping devices in which the each MTC device considered the individual.</p></sec></sec><sec id="s6"><title>6. Conclusion and Future Work</title><p>In this paper, we investigate the problem of data correlated in MTC devices based on the resource-constrained allocated at Centralized Aggregator for computing and processing. We propose k-means grouping technique to group MTC devices corresponding to a spatial correlation on the event-based area. With combining the MTC devices, we reduced the data redundancy caused by similar data processing and saving the computation resources at CA. Then, we use the differential entropy to measure the size of data contents in each group. Through the extensive simulations, we illustrated the benefits of our proposed grouping technique compared with the individual MTC device computation. Our simulation results indicate that the optimal BFM solutions performance is very close between the computation at CA and MEC in terms of energy consumption and the trade-off in computation time. In the future, we will investigate more grouping techniques together with multiple aggregators.</p></sec><sec id="s7"><title>Acknowledgements</title><p>We thank the Editor and the referee for their comments. This work was partially supported by Natural Science Foundation of China (Grant No. 61461136002), Key Program of National Natural Science Foundation of China (Grant No. 61631018), Fundamental Research Funds for the Central Universities, and Huawei Innovation Research Program.</p></sec><sec id="s8"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s9"><title>Cite this paper</title><p>Ally, J.S., Asif, M. and Ma, Q.L. (2019) Energy-Efficient MTC Data Offloading in Wireless Networks Based on K-Means Grouping Technique. Journal of Computer and Communications, 7, 47-61. https://doi.org/10.4236/jcc.2019.72004</p></sec></body><back><ref-list><title>References</title><ref id="scirp.90612-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Zheng, K., Ou, S., Alonso-Zarate, J., Dohler, M., Liu, F. and Zhu, H., (2014) Challenges of Massive Access in Highly Dense LTE-Advanced Networks with Machine-to-Machine Communications. IEEE Wireless Communications, 21, 12-18. https://doi.org/10.1109/MWC.2014.6845044</mixed-citation></ref><ref id="scirp.90612-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Cisco, V.N.I. (2016) Global Mobile Data Traffic Forecast Update, 2015-2020 White Paper. Document ID: 958959758.</mixed-citation></ref><ref id="scirp.90612-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Zheng, G. and Tang, S. (2011) Spatial Correlation-Based MAC Protocol for Event- Driven Wireless Sensor Networks. Journal of Networks, 6, 121. https://doi.org/10.4304/jnw.6.1.121-128</mixed-citation></ref><ref id="scirp.90612-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Khodashenas, P.S., Ruiz, C., Siddiqui, M.S., Betzler, A. and Riera, J.F. (2017) The Role of Edge Computing in Future 5G Mobile Networks: Concept and Challenges. Cloud and Fog Computing in 5G Mobile Networks: Emerging Advances and Applications, 70, 349.</mixed-citation></ref><ref id="scirp.90612-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Rebecchi, F., De Amorim, M.D., Conan, V., Passarella, A., Bruno, R. and Conti, M. (2015) Data Offloading Techniques in Cellular Networks: A Survey. IEEE Communications Surveys and Tutorials, 17, 580-603. https://doi.org/10.1109/COMST.2014.2369742</mixed-citation></ref><ref id="scirp.90612-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Li, L., Zhao, G. and Blum, R.S. (2018) A Survey of Caching Techniques in Cellular Networks: Research Issues and Challenges in Content Placement and Delivery Strategies. IEEE Communications Surveys &amp; Tutorials, 20, 1710-1732.</mixed-citation></ref><ref id="scirp.90612-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Dawy, Z., Saad, W., Ghosh, A., Andrews, J.G. and Yaacoub, E., (2017) Toward Massive Machine Type Cellular Communications. IEEE Wireless Communications, 24, 120-128. https://doi.org/10.1109/MWC.2016.1500284WC</mixed-citation></ref><ref id="scirp.90612-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Guo, J., Durrani, S., Zhou, X. and Yanikomeroglu, H. (2017) Massive Machine Type Communication with Data Aggregation and Resource Scheduling. IEEE Transactions on Communications, 65, 4012-4026. https://doi.org/10.1109/TCOMM.2017.2710185</mixed-citation></ref><ref id="scirp.90612-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Dai, R. and Akyildiz, I.F. (2009) A Spatial Correlation Model for Visual Information in Wireless Multimedia Sensor Networks. IEEE Transactions on Multimedia, 11, 1148-1159. https://doi.org/10.1109/TMM.2009.2026100</mixed-citation></ref><ref id="scirp.90612-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Park, I., Kim, D. and Har, D. (2015) MAC Achieving Low Latency and Energy Efficiency in Hierarchical M2M Networks with Clustered Nodes. IEEE Sensors Journal, 15, 1657-1661. https://doi.org/10.1109/JSEN.2014.2364055</mixed-citation></ref><ref id="scirp.90612-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Taleb, T., Ksentini, A. and Kobbane, A. (2014) Lightweight Mobile Core Networks for Machine Type Communications. IEEE Access, 2, 1128-1137. https://doi.org/10.1109/ACCESS.2014.2359649</mixed-citation></ref><ref id="scirp.90612-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">3GPP TR 37.868 V11.0.0 (2011) Study on RAN Improvements for Machine-Type Communications.</mixed-citation></ref><ref id="scirp.90612-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Malak, D., Dhillon, H.S. and Andrews, J.G. (2016) Optimizing Data Aggregation for Uplink Machine-to-Machine Communication Networks. IEEE Transactions on Communications, 64, 1274-1290. https://doi.org/10.1109/TCOMM.2016.2517073</mixed-citation></ref><ref id="scirp.90612-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Cai, Y., Yu, F.R. and Bu, S. (2014) Cloud Computing Meets Mobile Wireless Communications in Next Generation Cellular Networks. IEEE Network, 28, 54-59. https://doi.org/10.1109/MNET.2014.6963805</mixed-citation></ref><ref id="scirp.90612-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Etsi, M. (2014) Mobile Edge Computing-Introductory Technical White Paper.</mixed-citation></ref><ref id="scirp.90612-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Sajnani, D.K., Mahesar, A.R., Lakhan, A. and Jamali, I.A. (2018) Latency Aware and Service Delay with Task Scheduling in Mobile Edge Computing. Communications and Network, 10, 127-141. https://doi.org/10.4236/cn.2018.104011</mixed-citation></ref><ref id="scirp.90612-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Li, M. and Hu, S. (2017) Machine-to-Machine (M2M) Communications in Virtualized Cellular Networks with MEC. arXiv:1706.09107</mixed-citation></ref><ref id="scirp.90612-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Salam, T., Rehman, W. and Tao, X. (2017) Cooperative MTC Data Offloading with Trust Transitivity Framework in 5G Networks. 2017 IEEE Global Communications Conference, Singapore, 4-8 December 2017, 1-7. https://doi.org/10.1109/GLOCOM.2017.8255045</mixed-citation></ref><ref id="scirp.90612-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Yoon, S. and Shahabi, C. (2005) Exploiting Spatial Correlation towards an Energy Efficient Clustered Aggregation Technique (CAG) [Wireless Sensor Network Applications]. 2005 IEEE International Conference on Communications, 5, 3307-3313.</mixed-citation></ref><ref id="scirp.90612-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Dogandzic, A. and Qiu, K. (2008) Decentralized Random-Field Estimation for Sensor Networks Using Quantized Spatially Correlated Data and Fusion-Center Feedback. IEEE Transactions on Signal Processing, 56, 6069-6085. https://doi.org/10.1109/TSP.2008.2005753</mixed-citation></ref><ref id="scirp.90612-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Wang, W., Zhu, J., Zhang, S. and Zhou, W. (2018) Tradeoff between Compression Ratio and Decoding Delay of Distributed Source Coding for Uplink Transmissions in Machine-Type Communication. International Journal of Distributed Sensor Networks, 14, 1550147718787109. https://doi.org/10.1177/1550147718787109</mixed-citation></ref><ref id="scirp.90612-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Vuran, M.C. and Akyildiz, I.F. (2006) Spatial Correlation-Based Collaborative Medium access Control in Wireless Sensor Networks. IEEE/ACM Transactions on Networking, 14, 316-329. https://doi.org/10.1109/TNET.2006.872544</mixed-citation></ref><ref id="scirp.90612-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Berger, J.O., De Oliveira, V. and Sansó, B. (2001) Objective Bayesian Analysis of Spatially Correlated Data. Journal of the American Statistical Association, 96, 1361-1374. https://doi.org/10.1198/016214501753382282</mixed-citation></ref><ref id="scirp.90612-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Paek, J. and Ko, J. (2017) K-Means Clustering-Based Data Compression Scheme for Wireless Imaging Sensor Networks. IEEE Systems Journal, 11, 2652-2662. https://doi.org/10.1109/JSYST.2015.2491359</mixed-citation></ref><ref id="scirp.90612-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Cover, T.M. and Thomas, J.A. (2012) Elements of Information Theory. John Wiley &amp; Sons, New York.</mixed-citation></ref><ref id="scirp.90612-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">You, C., Huang, K., Chae, H. and Kim, B.H. (2017) Energy-Efficient Resource Allocation for Mobile-Edge Computation Offloading. IEEE Transactions on Wireless Communications, 16, 1397-1411. https://doi.org/10.1109/TWC.2016.2633522</mixed-citation></ref><ref id="scirp.90612-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">Yu, Y., Zhang, J. and Letaief, K.B. (2016) Joint Subcarrier and CPU Time Allocation for Mobile Edge Computing. 2016 IEEE Global Communications Conference, Washington DC, 4-8 December 2016, 1-6. https://doi.org/10.1109/GLOCOM.2016.7841937</mixed-citation></ref><ref id="scirp.90612-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">Gerards, M.E., Hurink, J.L. and Kuper, J. (2015) On the Interplay between Global DVFS and Scheduling Tasks with Precedence Constraints. IEEE Transactions on Computers, 64, 1742-1754.</mixed-citation></ref><ref id="scirp.90612-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple">Miettinen, A.P. and Nurminen, J.K. (2010) Energy Efficiency of Mobile Clients in Cloud Computing. Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud’10, Boston, 22-25 June 2010, 4.</mixed-citation></ref></ref-list></back></article>