<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JCC</journal-id><journal-title-group><journal-title>Journal of Computer and Communications</journal-title></journal-title-group><issn pub-type="epub">2327-5219</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jcc.2015.39010</article-id><article-id pub-id-type="publisher-id">JCC-59999</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Improved Research on Fuzzy Search over Encrypted Cloud Data Based on Keywords
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>ing</surname><given-names>Zhang</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Jianzhong</surname><given-names>Wang</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>College of Fundamental Education, Sichuan Normal University, Chengdu, China</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>835148187@qq.com(IZ)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>01</day><month>09</month><year>2015</year></pub-date><volume>03</volume><issue>09</issue><fpage>90</fpage><lpage>98</lpage><history><date date-type="received"><day>20</day>	<month>August</month>	<year>2015</year></date><date date-type="rev-recd"><day>accepted</day>	<month>25</month>	<year>September</year>	</date><date date-type="accepted"><day>28</day>	<month>September</month>	<year>2015</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  A search strategy over encrypted cloud data based on keywords has been improved and has presented a method using different strategies on the client and the server to improve the search efficiency in this paper. The client uses the Chinese and English to achieve the synonym construction of the keywords, the establishment of the fuzzy-syllable words and synonyms set of keywords and the implementation of fuzzy search strategy over the encryption of cloud data based on keywords. The server side through the analysis of the user’s query request provides keywords for users to choose and topic words and secondary words are picked out. System will match topic words with historical inquiry in time order, and then the new query result of the request is directly gained. The analysis of the simulation experiment shows that the fuzzy search strategy can make better use of historical results on the basis of privacy protection for the realization of efficient data search, saving the search time and improving the efficiency of search.
 
</p></abstract><kwd-group><kwd>Cloud Data</kwd><kwd> Fuzzy Search</kwd><kwd> Keywords</kwd><kwd> Synonyms</kwd><kwd> Searchable Encryption</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>With the arrival of the era of big data and cloud data, more and more ordinary users and enterprise storage in local data files, cloud storage service with convenient using, cost saving and the other advantages are welcomed by more and more users. But the uses of cloud storage services are also facing some problems. For example, some of the business secrets of the enterprise must be able to protect the privacy to prevent illegal use, so the data is generally encrypted locally and then outsourced to the cloud storage server, which brings a lot of trouble to the data use. Because of the limitation of network bandwidth and local storage capacity, users may not download all data to the local and then decrypt it before using it and how to improve the search efficiency etc. In the face of the huge amount of information decrypted and the need of accurate, efficient and personalized search information and other needs, domestic and foreign researchers have proposed a lot of search methods.</p><p>A fuzzy keyword search method for encryption data in cloud computing is proposed in the literature [<xref ref-type="bibr" rid="scirp.59999-ref1">1</xref>] . In the paper [<xref ref-type="bibr" rid="scirp.59999-ref1">1</xref>] , a new technique based on fuzzy keyword search using the similarity between the key words of the edit distance is proposed. In the paper [<xref ref-type="bibr" rid="scirp.59999-ref2">2</xref>] , a range search method based on the packet-based range is proposed. The architecture, mechanism and model evaluation of cloud computing security are described in the literature [<xref ref-type="bibr" rid="scirp.59999-ref3">3</xref>] . A privacy preserving method based on encryption cloud data is proposed in the literature [<xref ref-type="bibr" rid="scirp.59999-ref4">4</xref>] . The literature [<xref ref-type="bibr" rid="scirp.59999-ref5">5</xref>] presents a hybridization of Searchable Encryption and Attribute Based Encryption techniques. The proposed model supports a personalized and secure multi-user access to outsourced data, presenting high search performance. A secure and effective range query method based on R-tree for the outsourcing database is proposed in the paper [<xref ref-type="bibr" rid="scirp.59999-ref6">6</xref>] . In intelligent search, in the paper [<xref ref-type="bibr" rid="scirp.59999-ref7">7</xref>] a new technology of intelligent text search is proposed, including information retrieval, information extraction and information filtering. In the literature [<xref ref-type="bibr" rid="scirp.59999-ref8">8</xref>] the improved word segmentation algorithm and the improved correlation algorithm are proposed by analyzing the existing mainstream Chinese word segmentation algorithm and Lucene correlation ranking algorithm. A new intelligent cloud search optimization algorithm is designed in the paper [<xref ref-type="bibr" rid="scirp.59999-ref9">9</xref>] .</p><p>A new search model is proposed in the paper [<xref ref-type="bibr" rid="scirp.59999-ref10">10</xref>] . The problem of low efficiency of large data query is improved and an effective search method based on large data is improved in the literature [<xref ref-type="bibr" rid="scirp.59999-ref11">11</xref>] , which can improve the search efficiency. In the paper [<xref ref-type="bibr" rid="scirp.59999-ref12">12</xref>] , a fuzzy search scheme based on key words is proposed, which realizes the search for the Chinese fuzzy tone and synonymous keywords and uses the pseudo random function to protect the private key. In a word, semantic search method and encryption of data search research have made fruitful achievements, but a safe and efficient fuzzy search method based on the key words fuzzy sets has not been designed. Therefore, in this paper, a fuzzy search strategy based on key encrypted cloud data on the basis of fuzzy search strategy over encrypted cloud data based on the keywords is proposed, exploring a method to improve the search efficiency in the cloud storage environment on the client and the server by different search strategies.</p></sec><sec id="s2"><title>2. System Framework and Parameter Definition of the New Search Strategy over Encryption Cloud Data</title><p>New strategy based on the encrypted data search scheme in the client analyses fuzzy tone and polysemy characteristic of Chinese characteristics, uses Chinese and English to realize the synonym construction of keywords and establishes fuzzy tone words and synonym set of keywords, in the implementation of Chinese fuzzy tone and synonymous keywords search and the pseudo random function to protect the private key. When the user’s query request reaches the server side, query request graph analyzed, keywords provided for user to choose, the key words that are used in the end are determined, the topic words and auxiliary words picked up. The topic words and auxiliary words have been matched with the historical inquiry in order, the query result of which is returned to the client.</p><p>System framework of fuzzy search strategy based on the keywords of the encrypted cloud data is shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>. The framework consists of three main entities, which are data owners, users and cloud service providers. Among them, data owners are individuals or enterprise users, who store the collection of data files</p><p><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x5.png" xlink:type="simple"/></inline-formula>in the cloud server. The keywords set associated with the file set F will be pre-defined and expressed as<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x6.png" xlink:type="simple"/></inline-formula>. In order to ensure that sensitive data is not used by unauthorized people, the</p><p>data set F will be encrypted before the outsourcing to the cloud server. Due to the presence of a large number of sound words and synonyms in Chinese, in order to improve the cloud data utilization efficiency and success rate of retrieval, the system provides encryption data fuzzy sound synonyms and fuzzy search function. In the implementation of the service, the data owner will use search request to generate a private key PK to and distribute to the user, such as team members or employees. When the private key distribution is completed, for any input keyword W, in order to search relevant documents collection safely, authorized users use the private key PK and one-way generating function to convert query keywords into an encrypted search request and then submit to the cloud server. The cloud server executes the search without decrypting data and sends data collection FIDW related to a keyword W or W of fuzzy sound or synonym object files searched to searchers.</p><p>In this paper, we use the free cloud platform of the Amazon provided as the experimental platform and use the journal magazine as the search object to carry out the experiment. Through the comparison of this paper scheme</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> New strategy system framework for fuzzy search</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/10-1730270x7.png"/></fig><p>and the method of no using the fuzzy search technology, the validity and practicability of the scheme are verified. When the index this paper scheme is built, with the increase of the number of keywords, the occupancy of CPU and memory gradually increase and the time consumed increases. The time consumed for building an index is shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>. The time consumption of the keywords query is shown in <xref ref-type="fig" rid="fig4">Figure 4</xref>. The success rate of the query is shown in <xref ref-type="fig" rid="fig5">Figure 5</xref>.</p><p>The search scheme still follows the security definitions involved in the traditional symmetric encryption. In addition to search results and search model, other related content connected with the file stored and its index should not be leaked.</p><p>The meaning and representation of the parameters in the system of the new search strategy over encryption cloud data are show in <xref ref-type="table" rid="table1">Table 1</xref>.</p></sec><sec id="s3"><title>3. Implementation Scheme of the New Search Strategy over Encryption Cloud Data</title><p>The goal of fuzzy search is to return all the similar results to the keywords according to the keywords of different users. In this paper, we use the fractional step scheme to reduce the difficulty of fuzzy matching over the encrypted cloud data. First, data owners in the client construct fuzzy key words set, which includes Pinyin keywords, keywords and fuzzy tone, keywords corresponding English words, corresponding index information keywords and file ID table, Chinese and English keywords comparison table and pinyin and fuzzy tone comparison table. Second, based on the fuzzy keyword collection, a safe and efficient fuzzy search method is design, making full use of historical records to improve search efficiency.</p><sec id="s3_1"><title>3.1. Set up Fuzzy Pinyin Keywords Set</title><p>The establishment of keywords set is the precondition for the efficient fuzzy search. Keyword w is given and similar constraints d to generate SP<sub>w</sub><sub>,d </sub>and SY<sub>w</sub>. The following epsilon<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x8.png" xlink:type="simple"/></inline-formula>, meet<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x9.png" xlink:type="simple"/></inline-formula>;<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x10.png" xlink:type="simple"/></inline-formula>, meet<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x11.png" xlink:type="simple"/></inline-formula>. The Chinese characters consist of pinyin initials and finals, as shown in <xref ref-type="table" rid="table2">Table 2</xref>.</p><p>Combination of consonants and vowels with specific rules, specifically, for example, consonant is m, the set of vowels only is {a, o, i, u, ai, ei, ao, ie, an, en, in, ang, eng, ing, ian, iao}, number of vowels fixed .The easiest error in Chinese pinyin spelling is fuzzy sound, such as initials in Alice retroflex retroflex peace, n and l, the anterior nasal and after nasal in the vowels, such as ian, iang , uan and uang. Therefore, the simplest way to set up the fuzzy sound keywords is to list the possible phonetic combinations and then to find out the keywords which are the same as those of the combination.</p><p>Assume d = 2 given by the user, the input pinyin of keywords w is nin, according to the alphabet constitute rules to generate the corresponding fuzzy sound keyword combination<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x12.png" xlink:type="simple"/></inline-formula>, in which <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x13.png" xlink:type="simple"/></inline-formula> pinyin should be included in the set {nen, len, neng, in leng, *en}.</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> The meaning and representation of the system parameters of the new search strategy over encryption cloud data</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Parameter</th><th align="center" valign="middle" >Parameter meaning and representation</th></tr></thead><tr><td align="center" valign="middle" >F</td><td align="center" valign="middle" >A collection of files that are outsourced represents a collection of n data files<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x14.png" xlink:type="simple"/></inline-formula>.</td></tr><tr><td align="center" valign="middle" >W</td><td align="center" valign="middle" >A collection of different keywords that are extracted from the document collection F, representing a collection of p words<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x15.png" xlink:type="simple"/></inline-formula>.</td></tr><tr><td align="center" valign="middle" >I</td><td align="center" valign="middle" >Index for the establishment of a fuzzy keyword search for privacy preserving.</td></tr><tr><td align="center" valign="middle" >Tw</td><td align="center" valign="middle" >After user input search keywords w, the search request is generated by the one-way function, that is Trapdoor.</td></tr><tr><td align="center" valign="middle" >FID<sub>wi</sub></td><td align="center" valign="middle" >The file set F contains an ID collection of documents of keywords w<sub>i</sub> or near w<sub>i</sub>-syllable word or synonym.</td></tr><tr><td align="center" valign="middle" >f(key,・),g(key,・)</td><td align="center" valign="middle" >Pseudo random function (PRF), defined as {0, 1}* &#215; key→ (0, 1).</td></tr><tr><td align="center" valign="middle" >Enc(key,・) Dec(key,・)</td><td align="center" valign="middle" >Symmetric key encryption/decryption function based on semantic security.</td></tr><tr><td align="center" valign="middle" >ED(w<sub>1</sub>, w<sub>2</sub>)</td><td align="center" valign="middle" >A description method of string similarity. For two words w<sub>1</sub> and w<sub>2</sub> in the two to achieve the increase of the minimum number of the required to convert, modify or delete the operation of the characters. For a given word w and integer d, with S<sub>w</sub><sub>,d</sub> representing similar thereto word w’, to meet the<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x16.png" xlink:type="simple"/></inline-formula>.</td></tr><tr><td align="center" valign="middle" >SP<sub>w</sub><sub>,d</sub></td><td align="center" valign="middle" >The fuzzy Pinyin of keywords w corresponding to the keyword set. For the given Chinese keywords w and integer d, the Pinyin PY<sub>w</sub> corresponds to the fuzzy tone set<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x17.png" xlink:type="simple"/></inline-formula>, to meet with the edit distance of keywords w’s Pinyin that is less than d for all similar Pinyin keywords set expressed as<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x18.png" xlink:type="simple"/></inline-formula>. For any<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x19.png" xlink:type="simple"/></inline-formula>, the pinyin of <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x20.png" xlink:type="simple"/></inline-formula> is represented as<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x21.png" xlink:type="simple"/></inline-formula>.</td></tr><tr><td align="center" valign="middle" >SYw</td><td align="center" valign="middle" >A set of keywords synonymous with keywords w. different keywords set<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x22.png" xlink:type="simple"/></inline-formula>. Describing the same thing in a language are converted into another language, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x23.png" xlink:type="simple"/></inline-formula>generally corresponding to the same keywords w<sub>e</sub>, which is called set<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x24.png" xlink:type="simple"/></inline-formula>, as a synonym set for keywords w, among them the<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x25.png" xlink:type="simple"/></inline-formula>, syn() is synonymous conversion function. This paper uses Chinese and English to achieve the conversion.</td></tr><tr><td align="center" valign="middle" >Fuzzy keyword search</td><td align="center" valign="middle" >A collection <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x26.png" xlink:type="simple"/></inline-formula> consisted of n encrypted data files and different predefined keywords set <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x27.png" xlink:type="simple"/></inline-formula> are given, search keywords w and d input and after the implementation of the synonymous keyword search it will return to the ID file set {FIDw<sub>i</sub>,}, in which<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x28.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x29.png" xlink:type="simple"/></inline-formula>or<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x30.png" xlink:type="simple"/></inline-formula>.</td></tr><tr><td align="center" valign="middle" >History</td><td align="center" valign="middle" >The interaction between the user and the cloud server, which is composed of F and a group of keywords, is expressed as<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x31.png" xlink:type="simple"/></inline-formula>.</td></tr><tr><td align="center" valign="middle" >View</td><td align="center" valign="middle" >According to the secret key K, historical records Hq given, cloud servers can only see encrypted history that is the view of Vk (Hq). It includes index I of document collection f, query keywords trapdoor<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x32.png" xlink:type="simple"/></inline-formula>. Among them <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x33.png" xlink:type="simple"/></inline-formula> and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x34.png" xlink:type="simple"/></inline-formula>.<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x35.png" xlink:type="simple"/></inline-formula>, encryption file set C , is expressed as<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x36.png" xlink:type="simple"/></inline-formula>.</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> The composition of Chinese characters</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Phonetic composition</th><th align="center" valign="middle" >Set</th></tr></thead><tr><td align="center" valign="middle" >Initials</td><td align="center" valign="middle" >{b, p, m, f, d, t, n, l, g, k, h, j, q, x, zh, ch, sh, r, z, c, s, y, w}</td></tr><tr><td align="center" valign="middle" >Single vowel</td><td align="center" valign="middle" >{a, o, e, i, u, ii}</td></tr><tr><td align="center" valign="middle" >Compound finals</td><td align="center" valign="middle" >{ ai, ei, ui, ao, ou, iu, ie, iie, er}</td></tr><tr><td align="center" valign="middle" >The anterior nasal vowel</td><td align="center" valign="middle" >{an, en, in, un, iin}</td></tr><tr><td align="center" valign="middle" >After nasal vowel</td><td align="center" valign="middle" >{ang, eng, ing, ong}</td></tr></tbody></table></table-wrap></sec><sec id="s3_2"><title>3.2. Set up a Set of Synonyms</title><p>There are a lot of synonyms or near synonyms in Chinese, and the words in the dictionary cannot be reflected, such as “计算机”, “电脑” and “微机”. These three Chinese words’ meanings are consistent with one another. File F associated with the three words should all returned to the user when users in cloud data search, but how to achieve similar synonym comparison has not a good way. Through the comparison of the expression similarity between Chinese and English in describing the same thing , the differential expression of language is proposed to achieve synonyms conversion method, for example, the English words of three words above are all computer, so if the Chinese keywords w<sub>i</sub> corresponding the same English language translation, so these words are synonymous.</p><p>If user input keywords w, executive function syn(w), system translates w into English words w<sub>e</sub>, then look up Chinese keywords whose English translation in bilingual table is w<sub>e</sub> and return to the set of keywords SY<sub>w</sub>. After generating the corresponding fuzzy tone and synonym set, encrypted function Enc(key) used, the encrypted SP<sub>w</sub><sub>,d</sub> and SY<sub>w</sub> ,together with the encrypted file are sent to the cloud to be saved.</p></sec><sec id="s3_3"><title>3.3. Encryption Using Random Number Algorithm</title><p>The literature [<xref ref-type="bibr" rid="scirp.59999-ref13">13</xref>] design the random tree data structure providing cloud computing environment of privacy protection and the algorithm OPEART (order-preserving encryption based on random tree) is constructed. The algorithm by introducing a random realization of data encryption supports &gt;, &lt;, ≥ any relation operation of encryption data. Security analysis and performance evaluation in Literature [<xref ref-type="bibr" rid="scirp.59999-ref13">13</xref>] show that algorithm OPEART is indistinguishability under distinct and neighboring chosen plaintext attack and can be efficiently implemented on data encryption operations. In this paper, we introduce a secure OPEART algorithm to provide privacy protection in cloud computing environment.</p></sec><sec id="s3_4"><title>3.4. Generate Search Requests</title><p>When the user enters the keyword w, the system performs a fuzzy search and returns the corresponding file ID collection {FIDw<sub>i</sub>}. The formation of a search request generation process and keyword index are similar, namely according to the input of the w and d, fuzzy Pinyin and synonyms generating function to get fuzzy Pinyin keywords set SP<sub>w</sub><sub>,d</sub> and synonym set SY<sub>w</sub>. SP<sub>w</sub><sub>,d</sub>, w and SY<sub>wa</sub> are encrypted to generate search trapdoor, submitting to the cloud server, namely search request generation are completed.</p></sec><sec id="s3_5"><title>3.5. Fuzzy Search Scheme</title><p>In cloud service system, in order to avoid cloud access to sensitive information, establishment of a part of the work such as search index, the trapdoor generation in the client implementation. But executing search in large amounts of data is very consuming in resource, which can be handed to the cloud server to complete. The encryption cloud data fuzzy search implementation algorithm based on the keywords is as follows:</p><p>{</p><p>The data owner randomly selects two numbers a and b as the private key PK.</p><p>Establish index. The index<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x37.png" xlink:type="simple"/></inline-formula>,<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x38.png" xlink:type="simple"/></inline-formula>. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x39.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x40.png" xlink:type="simple"/></inline-formula>, in which<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x41.png" xlink:type="simple"/></inline-formula>, Encryption key<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x41.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x42.png" xlink:type="simple"/></inline-formula>.</p><p>The index table I<sub>1</sub>, I<sub>2</sub> and the encryption of data files to the cloud server storage.</p><p>The user input PK, w and i, the client system to generate SP<sub>w</sub><sub>,d</sub>, SY<sub>w</sub> and generate the trapdoor<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x43.png" xlink:type="simple"/></inline-formula>,<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x43.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x44.png" xlink:type="simple"/></inline-formula>.<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x43.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x44.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x45.png" xlink:type="simple"/></inline-formula>,<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x43.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x44.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x45.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x46.png" xlink:type="simple"/></inline-formula>. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x43.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x44.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x45.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x46.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x47.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x43.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x44.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x45.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x46.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x48.png" xlink:type="simple"/></inline-formula> will be sent to the cloud.</p><p>The cloud server will receive the trapdoor <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x49.png" xlink:type="simple"/></inline-formula> and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x50.png" xlink:type="simple"/></inline-formula>, respectively, with the index I<sub>1</sub> and I<sub>2</sub> are compared to obtain a matching document ID set {FID<sub>wi</sub>}, and send the results to the client.</p><p>The user uses the corresponding <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/10-1730270x51.png" xlink:type="simple"/></inline-formula> to decrypt the file ID and retrieve the required file, called function Dec(key) decryption and then used.</p><p>}</p></sec><sec id="s3_6"><title>3.6. Increasing the Server Side Search Strategy</title><p>When the query request arrives at the server, system will provide similar query keywords for the user to choose and the final query keywords are obtained. The keywords are constructed, subject words and auxiliary words extracted. Subject words and auxiliary words are matched with history query in order. System firstly match the keywords of query request with history query results, after which the shared history query results and a new date for the query results can be combined, and the combination of history query results as a query results. If you do not match, the result is directly the result of the query, and the results are recorded in the history. The basic framework for cloud data search is shown in <xref ref-type="fig" rid="fig2">Figure 2</xref>. The other see references [<xref ref-type="bibr" rid="scirp.59999-ref11">11</xref>] .</p><fig id="fig2"  position="float"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> Basic frameworks for cloud search strategy</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/10-1730270x52.png"/></fig><p>The pseudo code of the cloud search algorithm is as follows:</p><p>{</p><p>The user proposes Query request. Provide similar keyword query for users to choose, users get a final query keywords. Keywords extracted topic keywords and auxiliary words.</p><p>System analysis query keywords, the query topic keywords and auxiliary keywords and historical query keywords match.</p><p>Execute different programs according to the matching results.</p><p>If the result is completely matched that it showed that the new query keywords have previously occurred, such previous queries obtained the query results can be directly to use for this query. That is to say, you can share the same history query results. At the same time, due to historical queries only for a certain period of time before the data query, there may be new data records generated after the history query, so a query still needs to be executed for the new data to get new results. The historical results and the new query results are merged to get the final result of the user.</p><p>If the result is partially matched that it showed that the key words of the user’s new query request are extracted from the topic words and auxiliary words. The part of the theme words appeared before, so before the same query results through the query can be directly into the present query, that is, part of the history query results can be shared. Also due to historical queries only for a certain period of time before the data query, probably after the history query new data records may be generated, so the new data still need execute to obtain the new query results. The historical results and the new query results are merged to obtain the results of the query matching section. Then, in the result of the query matching part, it continues to perform the auxiliary word query and gets the final result of the user.</p><p>If the result is mismatched that it showed that the new query request keywords without historical records can be shared, the need to re-execute all the queries, the results obtained by the user.</p><p>Feedback the user query to the user.</p><p>To realize the update of historical inquiry network.</p><p>}</p></sec></sec><sec id="s4"><title>4. The Experimental Analysis of the New Method over Encrypting the Cloud Data</title><sec id="s4_1"><title>4.1. Security Analysis</title><p>When the user enters the same search request, the encrypted search scheme in the cloud will always return the same search results. Although the cloud server cannot see what it is in the end, in the interaction with the user, it can still create access patterns and search mode. Therefore, the system’s security should ensure that the other contents in addition to the pattern and search requests are leaked. The literature [<xref ref-type="bibr" rid="scirp.59999-ref13">13</xref>] has proved that the fuzzy keyword search scheme is in line with the non adaptive security requirements. Non-adaptive attack model only considers cloud server adversary, the existing literature proves that the adversary cannot choose search requests and previous search results based on trapdoor. This is acceptable, because only the user with an authorized private key can generate a search trapdoor.</p><p>In general, the security of this scheme is reflected in that for two historical records with the same trajectory, the cloud server cannot distinguish the views between them. In other words, the cloud server cannot extract more information from the information in the query, so the program is safe. The literature [<xref ref-type="bibr" rid="scirp.59999-ref13">13</xref>] has proved that the fuzzy keyword search scheme satisfies the non adaptive semantic security.</p></sec><sec id="s4_2"><title>4.2. Time and Space Consumption</title><p>In this paper, the keywords fuzzy search scheme adds the cloud search method on the basis of literature [<xref ref-type="bibr" rid="scirp.59999-ref13">13</xref>] , but time complexity and space consumption are in the same level with the original program. The time complexity of the index construction of the pre-processing stage is related to the number of keywords n. That is O (n). The size of the index is also related to the number of words. That is O (n). In searching, due to the cloud server system support for multiple threads, you can also search the synonyms; fuzzy syllable word retrieval without increasing too much time consumption, so search time complex degree is O (n).</p></sec><sec id="s4_3"><title>4.3. Experimental Comparison</title><p>In this paper, we use the free cloud platform of the Amazon provided as the experimental platform and use the journal magazine as the search object to carry out the experiment. Through the comparison of this paper scheme and the method of no using the fuzzy search technology, the validity and practicability of the scheme are verified. When the index this paper scheme is built, with the increase of the number of keywords, the occupancy of CPU and memory gradually increase and the time consumed increases. The time consumed for building an index is shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>. The time consumption of the keywords query is shown in <xref ref-type="fig" rid="fig4">Figure 4</xref>. The success rate of the query is shown in <xref ref-type="fig" rid="fig5">Figure 5</xref>.</p><fig id="fig3"  position="float"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Time consumption for building an index</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/10-1730270x53.png"/></fig><fig id="fig4"  position="float"><label><xref ref-type="fig" rid="fig4">Figure 4</xref></label><caption><title> Time consumption of query</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/10-1730270x54.png"/></fig><fig id="fig5"  position="float"><label><xref ref-type="fig" rid="fig5">Figure 5</xref></label><caption><title> Keyword search success rat</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/10-1730270x55.png"/></fig><p>From the above experimental results we can see, the storage to achieve storage of the synonyms of keywords and fuzzy speech sounds is appropriately increased through the search scheme, although the preprocessing and search time has increased, but it greatly improves the searching success rate.</p></sec></sec><sec id="s5"><title>5. Conclusion</title><p>In this paper, a new method based on keywords is presented according to the actual demand of the user’s Chinese search in the cloud storage environment. Through the implementation of the scheme of fuzzy keyword search function, the problems of fuzzy sound and synonymous issues existing in the input text and the words that the user wants to find in the Chinese environment are easy to solve. By using the pseudo random function to effectively avoid the information leakage problem in the process of the query, it can ensure the safety. Therefore, this scheme has high security, good practicability and high search success rate.</p></sec><sec id="s6"><title>Acknowledgements</title><p>In this paper, the research was sponsored by the Department of Education Project of Sichuan Province.</p></sec><sec id="s7"><title>Cite this paper</title><p>PingZhang,JianzhongWang, (2015) Improved Research on Fuzzy Search over Encrypted Cloud Data Based on Keywords. Journal of Computer and Communications,03,90-98. doi: 10.4236/jcc.2015.39010</p></sec></body><back><ref-list><title>References</title><ref id="scirp.59999-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Li, J., Wang, Q., Wang, C., Ren, K. and Lou, W.J. (2010) Fuzzy Keyword Search over Encrypted Data in Cloud Computing. Proceedings of IEEE INFOCOM Mini-Conference, San Diego, 15-19 March 2010, 441-445.</mixed-citation></ref><ref id="scirp.59999-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Hore, B., Mehrotra, S., Canim, M., and Kantarcioglu, M. (2012) Secure Multidimensional Range Queries over Outsourced Data. The VLDB Journal, 21, 333-358. http://dx.doi.org/10.1007/s00778-011-0245-7</mixed-citation></ref><ref id="scirp.59999-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Lin, C., Su, W.B., Meng, K., Liu, Q. and Liu, W.D. (2013) Cloud Computing Security: Architecture, Mechanism and Modeling Evaluation. Chinese Journal of Computer, 36, 1765-1784. http://dx.doi.org/10.3724/SP.J.1016.2013.01765</mixed-citation></ref><ref id="scirp.59999-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Cao, N., Wang, C., Li, M., Ren, K. and Lou, W.J. (2011) Privacy-Preserving Multi-Keyword Ranked Search over Encrypted Cloud Data. Proceedings of IEEE INFOCOM 2011, Shanghai, 10-15 April 2011, 829-837.</mixed-citation></ref><ref id="scirp.59999-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Tebibel, T. and Kaci, A. (2015) Parallel Search over Encrypted Data under Attribute Based Encryption on the Cloud Computing. Computers &amp; Security, in press. http://dx.doi.org/10.1016/j.cose.2015.04.007</mixed-citation></ref><ref id="scirp.59999-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Wang, P. and Ravishankar, C.V. (2013) Secure and Efficient Range Queries on Outsourced Databases Using R-Trees. 29th International Conference on Data Engineering (ICDE), Brisbane, 8-12 April 2013, 314-325.</mixed-citation></ref><ref id="scirp.59999-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Wang, Z.Y., Xu,Y.R. and Guo, J. (2012) New Technologies of Intelligent Text Search. Transactions on Intelligent Systems, 7, 40-49.</mixed-citation></ref><ref id="scirp.59999-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Wu, J.M., Han, Y.H. and Ji, D.D. (2013) Research and Design of Search Engine for Digital Works Based on Lucene. Computer Engineering &amp; Science, 35, 166-172.</mixed-citation></ref><ref id="scirp.59999-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Yin, Z. and Cao, J. (2012) Clouds Search Optimization Algorithm with Difference Quotient Information and Its Convergence Analysis. Computer Science, 39, 252-255.</mixed-citation></ref><ref id="scirp.59999-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Zhou, B., Liu, Y.Q., Zhang, M., Jin, Y.J. and Ma, S.P. (2011) Incorporating Web Browsing Activities into Anchor Texts for Web. Information Retrieval, 14, 290-314. http://dx.doi.org/10.1007/s10791-010-9151-7</mixed-citation></ref><ref id="scirp.59999-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, P. and Wang, J.Z. (2014) An Improved Efficient Search Method Based on Big Data. Application Research of Computer, 31, 2331-2333.</mixed-citation></ref><ref id="scirp.59999-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Fang, Z.J., Zhou, S. and Xia, Z.H. (2015) Research on Fuzzy Search over Encrypted Cloud Data Based on Keywords. Computer Science, 42, 136-139.</mixed-citation></ref><ref id="scirp.59999-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Huang, R.W., Gui, X.L., Chen, N.J. and Yao, J. (2015) Encryption Algorithm Supporting Relational Calculations in Cloud Computing. Journal of Software, 26, 1181-1195. (In Chinese) http://www.jos.org.cn/1000-9825/4656.htm</mixed-citation></ref></ref-list></back></article>