<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JCC</journal-id><journal-title-group><journal-title>Journal of Computer and Communications</journal-title></journal-title-group><issn pub-type="epub">2327-5219</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jcc.2023.115010</article-id><article-id pub-id-type="publisher-id">JCC-125234</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Securing User Authentication with Server-Side Voice Verification
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Deepak</surname><given-names>R. Chandran</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sanath</surname><given-names>Kumar</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>S.</surname><given-names>Deepashri</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Emergence Technologies LLC, Willow Grove, PA, USA</addr-line></aff><aff id="aff2"><addr-line>Iris Energy LLC, Edison, NJ, USA</addr-line></aff><pub-date pub-type="epub"><day>10</day><month>05</month><year>2023</year></pub-date><volume>11</volume><issue>05</issue><fpage>137</fpage><lpage>150</lpage><history><date date-type="received"><day>9,</day>	<month>April</month>	<year>2023</year></date><date date-type="rev-recd"><day>27,</day>	<month>May</month>	<year>2023</year>	</date><date date-type="accepted"><day>30,</day>	<month>May</month>	<year>2023</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.
 
</p></abstract><kwd-group><kwd>Voice Authentication</kwd><kwd> Data Security</kwd><kwd> Voice Biometrics</kwd><kwd> Automatic Speech Recognition</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Authentication of users is a critical component of information security, as it ensures that only authorized individuals can access sensitive data and systems [<xref ref-type="bibr" rid="scirp.125234-ref1">1</xref>] . Traditionally, authentication has relied on knowledge-based (passwords) or possession-based (tokens) mechanisms. These methods, however, are increasingly vulnerable to attacks such as phishing and brute force [<xref ref-type="bibr" rid="scirp.125234-ref2">2</xref>] . The use of biometric authentication has emerged as an effective and convenient alternative to traditional authentication methods [<xref ref-type="bibr" rid="scirp.125234-ref3">3</xref>] . The use of voice authentication has gained traction due to its non-invasive nature, ease of use, and widespread availability of microphones in devices [<xref ref-type="bibr" rid="scirp.125234-ref4">4</xref>] . The purpose of voice authentication is to identify and verify a particular individual based on their unique vocal characteristics, thereby making it nearly impossible to impersonate or fabricate [<xref ref-type="bibr" rid="scirp.125234-ref5">5</xref>] . <xref ref-type="fig" rid="fig1">Figure 1</xref> summarizes the traditional and biometric methods in use for user authentication.</p><p>The purpose of this paper is to present an innovative approach to the use of voice biometrics for user authentication. The current literature on user authentication through voice biometrics is reviewed in this paper. The importance of server-side voice authentication and its use cases are discussed. Thereafter, a new framework is proposed for the implementation of the server-side voice authentication system. The advantages of the proposed system are also discussed.</p></sec><sec id="s2"><title>2. Literature Review</title><p>This section captures the existing literature on the user authentication methods. This review does not amount to a systematic and exhaustive review of the literature on the subject [<xref ref-type="bibr" rid="scirp.125234-ref6">6</xref>] . This paper adopts the narrative approach to the review of literature [<xref ref-type="bibr" rid="scirp.125234-ref7">7</xref>] for exploring the current knowledge that informs the author in the ongoing research in the field of user authentication. The findings of the review are utilized in preparing and presenting the innovative approach to implementing a server-side voice verification for enhanced user authentication.</p><sec id="s2_1"><title>2.1. Existing Solutions for User Authentication</title><p>The traditional methods used for user authentication suffered from many issues and inefficiencies. These issues included having to correctly remember multiple passwords and user ids and susceptibility to duplication and misuse [<xref ref-type="bibr" rid="scirp.125234-ref8">8</xref>] . Biometric based authentication systems came up in response to the issues faced by the traditional methods. Face recognition, fingerprints, iris recognition, and hand</p><p>geometry were some of the biometrics adopted for user verification.</p><p>Biometrics like face and fingerprint are capable of providing strong security at the device level. However, these biometrics are not suitable for all the situations. There are occasions when hand-free access to their devices would be beneficial to the users. Accessing a locked device while driving a vehicle is an apt example. Methods like PIN or use of biometrics like face and fingerprint may not be suitable for accessing a device while one is driving a vehicle. Attempting to access a device by using the touchscreen or placing the face in front of a camera could be both inconvenient and dangerous under such circumstances. Many other similar situations also exist where a user would find hand-free access to their devices beneficial and convenient [<xref ref-type="bibr" rid="scirp.125234-ref9">9</xref>] . <xref ref-type="table" rid="table1">Table 1</xref> contains a comparison of traditional authentication methods and the voice authentication method.</p><p>An alternative that can address the above drawbacks of other authentication systems is voice authentication. Voice authentication allows the user to access</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> A comparison of traditional authentication methods and voice authentication</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Metrics</th><th align="center" valign="middle" >Traditional authentication</th><th align="center" valign="middle" >Voice authentication</th></tr></thead><tr><td align="center" valign="middle" >Authentication Factor</td><td align="center" valign="middle" >Single-factor authentication</td><td align="center" valign="middle" >Single or multi-factor authentication</td></tr><tr><td align="center" valign="middle" >Usability (Ease of use)</td><td align="center" valign="middle" >Moderate Easy to use but not always user-friendly</td><td align="center" valign="middle" >High Easy to use and user-friendly</td></tr><tr><td align="center" valign="middle" >Security</td><td align="center" valign="middle" >Low Vulnerable to password cracking or object theft</td><td align="center" valign="middle" >High Difficult to fake or replicate voice characteristics, protection from voice spoofing attacks</td></tr><tr><td align="center" valign="middle" >Accuracy</td><td align="center" valign="middle" >Low Can be prone to errors or forgotten passwords</td><td align="center" valign="middle" >High Advanced algorithms ensure accuracy. False Acceptance Rate below 1 in 50,000</td></tr><tr><td align="center" valign="middle" >Spoofing Detection</td><td align="center" valign="middle" >Low Passwords or objects can be easily spoofed</td><td align="center" valign="middle" >High Advanced algorithms can detect and protect from voice spoofing attacks</td></tr><tr><td align="center" valign="middle" >Convenience</td><td align="center" valign="middle" >Low Requires remembering and entering passwords, carrying objects or possession</td><td align="center" valign="middle" >High Convenient as it only requires a user’s voice</td></tr><tr><td align="center" valign="middle" >Cost</td><td align="center" valign="middle" >Low Relatively inexpensive to implement</td><td align="center" valign="middle" >Moderate Requires specialized technology and software</td></tr><tr><td align="center" valign="middle" >User Acceptance</td><td align="center" valign="middle" >Low Due to inconvenience and complexity</td><td align="center" valign="middle" >High Due to easy and convenient use</td></tr><tr><td align="center" valign="middle" >Reliability</td><td align="center" valign="middle" >Moderate Can be affected by human errors</td><td align="center" valign="middle" >High Consistent and reliable</td></tr><tr><td align="center" valign="middle" >Privacy</td><td align="center" valign="middle" >Low Can be compromised if the password is shared or the object is lost or stolen</td><td align="center" valign="middle" >High Voice samples can be kept encrypted</td></tr><tr><td align="center" valign="middle" >Adoption</td><td align="center" valign="middle" >Moderate Widely adopted and understood, but becoming increasingly outdated in terms of security</td><td align="center" valign="middle" >High Increasingly adopted, but still not as widely used as traditional methods</td></tr></tbody></table></table-wrap><p>the secured devices without diverting the focus of attention. It only requires speaking a short instruction such as “Hey Device, Play the music.” The added advantage of voice authentication is that it allows the instructions to act in a dual capacity of commands for execution and ensuring security. This dual role is possible because the execution command itself is processed for voice authentication as well.</p><p>Another advantage of voice authentication is that unlike other security methods, it doesn’t require any special sensors or equipment to capture the biometric. Voice authentication can be adopted even with low end devices. There is no hardware costs involved in adopting voice authentication. The existing devices can be easily configured to use voice authentication for security unlock. The voice authentication also provides a suitable alternative for situations where camera or fingerprint scanner might not work due to conditions like inadequate light or wet fingers etc. [<xref ref-type="bibr" rid="scirp.125234-ref9">9</xref>] .</p></sec><sec id="s2_2"><title>2.2. Voice Authentication Process</title><p>A comparison of the voice authentication method with other traditional and biometric methods clearly establishes the advantages of adopting the voice-based system for user authentication. The steps involved in the voice authentication process are detailed in this section.</p><sec id="s2_2_1"><title>2.2.1. Voice Recording and Storage</title><p>The voice authentication process begins with the collection of voice samples from users during the registration and enrolment process. Users are typically asked to provide a set of phrases, ensuring consistency, and reducing environmental influences on the quality of voice samples [<xref ref-type="bibr" rid="scirp.125234-ref10">10</xref>] . <xref ref-type="fig" rid="fig2">Figure 2</xref> contains the process of user registration and enrollment process. The system records and extracts the voice signatures from the samples and creates a unique voiceprint. These voiceprints are then stored securely in the system and used as a reference for future authentication attempts. As a result, it is very important that these voiceprints are maintained confidentially and with integrity to avoid the risk of unauthorized access and fraud [<xref ref-type="bibr" rid="scirp.125234-ref11">11</xref>] .</p></sec><sec id="s2_2_2"><title>2.2.2. Feature Extraction and Comparison</title><p>To get access to a system, users must give a sample of their voice as part of an authentication process. Using advanced signal processing methods and machine learning algorithms [<xref ref-type="bibr" rid="scirp.125234-ref13">13</xref>] , the system then pulls out important features from the voice sample, such as pitch, frequency, and formants. Using the extracted features,</p><p>the voice sample is then compared with the stored reference voiceprints, calculating a similarity score that reflects the degree of correspondence between the two samples [<xref ref-type="bibr" rid="scirp.125234-ref14">14</xref>] .</p></sec><sec id="s2_2_3"><title>2.2.3. Decision-Making Process</title><p>Based on the uploaded voice samples, the system calculates a similarity score, which lets it figure out if a particular input belongs to the claimed user. To distinguish between genuine and imposter attempts, a threshold value is set. Users get access to the system only if the system determines their similarity score as above the set threshold. If the score is lower than the threshold, the system denies access, potentially triggering additional safeguards or alerts [<xref ref-type="bibr" rid="scirp.125234-ref4">4</xref>] . To balance security and usability, threshold values need to be tuned to minimize both false acceptances and false rejections. <xref ref-type="fig" rid="fig3">Figure 3</xref> captures the decision-making process.</p></sec><sec id="s2_2_4"><title>2.2.4. Voice Authentication Scheme</title><p>According to a white paper published by ID R&amp;D [<xref ref-type="bibr" rid="scirp.125234-ref9">9</xref>] , there are two types of methods for authenticating the unique voiceprint of a user. These are text-de- pendent and text-independent methods. The first type of method uses pre-de- fined messages for voice analysis and authentication. The second type of method is text-independent and can use any words or sentences spoken by the user for voice authentication.</p><p>One essential requirement of a robust voice authentication system is an effective protection against voice spoofing attacks. ID R&amp;D [<xref ref-type="bibr" rid="scirp.125234-ref9">9</xref>] identified the spoofing attacks under the following categories:</p><p>Text-to-Speech attacks: Text to speech attack is carried out by generating synthesized voice and using it to overcome the authentication system.</p><p>Voice Conversion attacks: Voice conversion attack uses a software tool for converting a message or phrase in person’s voice into that of another person’s voice to present it to the authentication system.</p><p>Replay attacks: Replays attacks merely record the user’s voice or text and then play it through a speaker to gain access to the device.</p><p>Mixed attacks: Mixed attacks use a combination of one or more of the above methods to attack a voice authentication system.</p><p>To ensure the robustness of a voice authentication system, there must be effective defense against spoofing attacks. There are algorithms available to ensure protection against voice spoofing attacks.</p><p>By combining the authentication system and anti-spoofing algorithms it is possible to attain high accuracy levels. Industry claims that it is feasible to achieve a false acceptance rate of less than 1 is 50,000, spoofing acceptance rate of less than 3%, and false rejection rate at below 10% by adopting a combination of user verification and anti-spoofing methods [<xref ref-type="bibr" rid="scirp.125234-ref9">9</xref>] . Use of Common Deep Neural Network processing allows the authentication systems to extract the identifying features of voices for authenticating the text-dependent, text-independent, and anti-spoofing systems from the same network. <xref ref-type="fig" rid="fig4">Figure 4</xref> contains a scheme of voice authentication.</p></sec></sec><sec id="s2_3"><title>2.3. Importance of Server-Side Verification</title><p>To maintain the integrity and security of authentication systems, server-side verification is essential. A centralized, controlled, and monitored authentication process can be achieved by validating user credentials on the server side, which reduces the risk of unauthorized access [<xref ref-type="bibr" rid="scirp.125234-ref15">15</xref>] . Further, server-side verification allows users to authenticate continuously as well as implement advanced security measures, such as multi-factor authentication and risk-based authentication, allowing for continuous authentication and enhanced security [<xref ref-type="bibr" rid="scirp.125234-ref16">16</xref>] .</p></sec><sec id="s2_4"><title>2.4. Ensuring Authenticated and Validated User Access</title><p>Steps involved in ensuring access to only the authenticated and validated users are detailed in this section.</p><sec id="s2_4_1"><title>2.4.1. Registration and Enrolment</title><p>During enrollment and registration, a user’s account is set up and their biometric data collected for authentication. This process is essential for ensuring the accuracy and security of the authentication system. Reference [<xref ref-type="bibr" rid="scirp.125234-ref17">17</xref>] recommended</p><p>that the registration and enrollment process must involve multiple factors of authentication. These factors could be what user knows (password, PIN), user has (smart card, token), or user is (biometric). This can prevent spoofing or other forms of fraud. It is vital to gather and securely store biometric data to ensure the user’s privacy. According to [<xref ref-type="bibr" rid="scirp.125234-ref18">18</xref>] , an access restricted secure environment is recommended for storing biometric data. Use of secure protocols and channels of communication for any transmission of biometric data is also imperative to ensure the integrity of the data.</p></sec><sec id="s2_4_2"><title>2.4.2. Continuous Authentication and Monitoring</title><p>The continuous authentication and monitoring serve the purpose of continually verifying the user’s identity to ensure that the user remains authorized to access the system or data. This is crucial to preventing unauthorized access or data breaches. Continuous authentication can use multiple authentication factors, including biometric data and location data, to establish and maintain a user’s identity [<xref ref-type="bibr" rid="scirp.125234-ref19">19</xref>] . As a result, spoofing or other forms of fraud may be detected and prevented. The use of machine learning algorithms is also effective in detecting anomalies or suspicious behavior that could indicate a security threat. As described by [<xref ref-type="bibr" rid="scirp.125234-ref19">19</xref>] , machine learning is useful for analyzing the user behavior to identify deviations from normal trends, which can then trigger additional authentication checks or alerts to security personnel.</p></sec><sec id="s2_4_3"><title>2.4.3. Managing False Positives and Negatives</title><p>Achieving accuracy and usability requires managing false positives and negatives in the authentication system. A false positive occurs when an authorized user is denied access, whereas a false negative occurs when an unauthorized user is granted access [<xref ref-type="bibr" rid="scirp.125234-ref20">20</xref>] .</p><p>Adaptive authentication, which emphasizes the balance between security and usability, may be used to manage false positives and negatives [<xref ref-type="bibr" rid="scirp.125234-ref21">21</xref>] . Additionally, feedback mechanisms can improve the accuracy of the authentication system over time. It is possible to improve accuracy by using feedback from users and reduce false positives and negatives by adjusting based on feedback from users [<xref ref-type="bibr" rid="scirp.125234-ref22">22</xref>] .</p></sec><sec id="s2_4_4"><title>2.4.4. Digital Signature Certificate (DSC) for Added Security</title><p>A digital signature certificate (DSC) can be used as another factor of authentication during the enrollment and registration process. A DSC is a digital certificate that contains information about the identity of the signer, which can be verified using public key infrastructure (PKI). By using a DSC, the user can prove their identity and authenticate the information they are providing during the registration process.</p><p>In addition, a DSC ensures the integrity and authenticity of biometric and location data along with timestamp transmitted over a network. The DSC is useful to digitally sign the biometric data, which assures that the data is not tampered with or altered during transmission. Furthermore, a DSC is useful to securely store and protect biometric data that is collected during the registration. The DSC can be used to encrypt and digitally sign the data, which provides confidentiality, integrity, and non-repudiation. Only authorized users with the correct public key can access the biometric data. This ensures the security of the private information of a user.</p><p>Overall, a DSC can be used to increase integrity and security during the enrolment and registration process. A DSC can also secure biometric data and protect the privacy during its transmission.</p></sec></sec><sec id="s2_5"><title>2.5. Use Cases and Applications</title><p>There are many domains and sectors where server-side user authentication can play an effective role. Some of the representative use cases are discussed in this section.</p><p><xref ref-type="fig" rid="fig5">Figure 5</xref> below contains an infographic of the domains and sectors that can potentially make use of server-side voice authentication for enhancing their efficacy and user experience.</p><sec id="s2_5_1"><title>2.5.1. Banking and Financial Services</title><p>Banks and financial services can greatly benefit from voice authentication by improving security and the customer experience. With voice authentication, bank accounts, trading platforms, and other financial services can be protected</p><p>against unauthorized access by replacing password-based systems and enhancing multi-factor authentication [<xref ref-type="bibr" rid="scirp.125234-ref23">23</xref>] . The use of voice authentication can also streamline the customer service experience by providing secure, efficient, and convenient access to telephone-based support [<xref ref-type="bibr" rid="scirp.125234-ref24">24</xref>] .</p></sec><sec id="s2_5_2"><title>2.5.2. Healthcare</title><p>Voice authentication systems are useful in healthcare settings to make sure that only authorized people can have access to protected information of the patients. A healthcare provider can improve patient privacy and comply with relevant data protection regulations by implementing voice authentication in electronic health records (EHRs) and telemedicine platforms, such as those covered by the Health Insurance Portability and Accountability Act (HIPAA) [<xref ref-type="bibr" rid="scirp.125234-ref25">25</xref>] .</p></sec><sec id="s2_5_3"><title>2.5.3. E-Commerce and Retail</title><p>Voice authentication is useful to make e-commerce and retail stores safer. It can also improve the customer experience. The risk of fraudulent transactions can be reduced by the Businesses. They can also protect their systems against identity thefts through voice authentication [<xref ref-type="bibr" rid="scirp.125234-ref26">26</xref>] . This is in addition to making shopping easy and safe. Also, voice authentication makes it easier for chatbots and virtual assistants to help customers in a smooth way [<xref ref-type="bibr" rid="scirp.125234-ref27">27</xref>] .</p></sec><sec id="s2_5_4"><title>2.5.4. Enterprise and Government Services</title><p>Enterprises and governments can utilize voice authentication to protect sensitive data and systems. Data breaches can be prevented, and security enhanced by implementing voice authentication for employee logins and remote access [<xref ref-type="bibr" rid="scirp.125234-ref28">28</xref>] . Additionally, voice authentication can be used by government agencies to verify citizens’ identities when they access online services, including submitting tax returns, filing benefit claims, and registering to vote [<xref ref-type="bibr" rid="scirp.125234-ref29">29</xref>] .</p></sec></sec></sec><sec id="s3"><title>3. Implementing Server-Side Voice Authentication</title><p>A review of the existing literature showed the advantages and use cases of a server-side voice authentication system. To successfully integrate server-side voice authentication into existing infrastructure, the implementation process must be carefully planned and evaluated to ensure compatibility with existing authentication mechanisms. This paper now presents a new framework for implementing a server-side voice authentication system and integrating it with the existing systems. <xref ref-type="fig" rid="fig6">Figure 6</xref> captures a high-level architecture flow diagram for the suggested server-side voice authentication system.</p><sec id="s3_1"><title>3.1. Choosing the Right Voice Authentication Technology</title><p>In choosing voice authentication technology [<xref ref-type="bibr" rid="scirp.125234-ref30">30</xref>] many factors play critical roles. These factors include accuracy, scalability, ease of use, and cost. There are many commercial and open-source solutions in the market, and each has its pros and cons. Organizations need to weigh these options based on their specific needs, compliance needs, and available resources [<xref ref-type="bibr" rid="scirp.125234-ref31">31</xref>] . It is also important to think</p><p>about the technology’s ongoing maintenance and support to make sure that it can keep up with emerging threats.</p></sec><sec id="s3_2"><title>3.2. Integration with Existing Systems</title><p>Before attempting the integration, a comprehensive assessment of the current systems is necessary to identify potential bottlenecks and to determine the parts that need to be upgraded [<xref ref-type="bibr" rid="scirp.125234-ref21">21</xref>] . All relevant stakeholders such as IT administrators, security experts, and end users are to be involved in the process of decision-making to ensure seamless integration and adoption [<xref ref-type="bibr" rid="scirp.125234-ref23">23</xref>] . The steps involved in integration of a server-side voice authentication system with the existing system are captured in <xref ref-type="fig" rid="fig7">Figure 7</xref> below.</p></sec><sec id="s3_3"><title>3.3. Addressing Privacy and Security Concerns</title><p>It is imperative to address privacy and security concerns when implementing server-side voice authentication. Voiceprints should be encrypted and stored securely by organizations to ensure their confidentiality and integrity [<xref ref-type="bibr" rid="scirp.125234-ref11">11</xref>] . A strict access control mechanism and audit trail should be implemented to ensure that only authorized personnel and applications have access to voiceprints [<xref ref-type="bibr" rid="scirp.125234-ref32">32</xref>] . Regulations like the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA) are to be taken into consideration while handling voice data [<xref ref-type="bibr" rid="scirp.125234-ref33">33</xref>] [<xref ref-type="bibr" rid="scirp.125234-ref34">34</xref>] .</p></sec></sec><sec id="s4"><title>4. Conclusion and Future Directions</title><p>Server-side voice authentication can enhance security and user experience in a</p><p>variety of applications and industries. Using advanced technologies, digital certificates, and location data, organizations can implement robust and context- aware authentication systems that effectively protect against unauthorized access and security threats. Research and development are still needed to address privacy concerns, address demographic factors, and counter sophisticated attacks.</p><p>Voice authentication systems can be further strengthened by incorporating continuous authentication and adaptive security measures. Voice authentication technology is always evolving, so organizations need to remain vigilant and proactive in addressing these challenges while ensuring that they work equally across diverse user populations. Hence, server-side voice authentication holds great promise for improving security and user experience in an increasingly connected world. With the adoption of right practices and addressing of the challenges, organizations can provide secure and seamless access to their systems and services through voice authentication.</p></sec><sec id="s5"><title>Conflicts of Interest</title><p>The author declares no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s6"><title>Cite this paper</title><p>Chandran, D.R., Kumar, S. and Deepashri, S. (2023) Securing User Authentication with Server-Side Voice Verification. Journal of Computer and Communications, 11, 137- 150. https://doi.org/10.4236/jcc.2023.115010</p></sec></body><back><ref-list><title>References</title><ref id="scirp.125234-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Juels, A. and Rivest, R.L. (2013) Honeywords: Making Password-Cracking Detectable. Proceedings of the 2013 ACM SIGSAC Conference on Computer &amp; Communications Security, Berlin, 4-8 November 2013, 145-160. https://doi.org/10.1145/2508859.2516671</mixed-citation></ref><ref id="scirp.125234-ref2"><label>2</label><mixed-citation publication-type="book" xlink:type="simple">Bonneau, J., Preibusch, S. and Anderson, R. (2012) Does a Birthday Present Every Eleven Wallets? The Security of Customer-Chosen Banking PINs. In: Keromytis, A.D., Ed., Financial Cryptography and Data Security, Springer, Berlin, 25-40. https://doi.org/10.1007/978-3-642-32946-3_3</mixed-citation></ref><ref id="scirp.125234-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Jain, A.K., Ross, A. and Prabhakar, S. (2004) An Introduction to Biometric Recognition. IEEE Transactions on Circuits and Systems for Video Technology, 1, 4-20. https://doi.org/10.1109/TCSVT.2003.818349</mixed-citation></ref><ref id="scirp.125234-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Reynolds, D.A. and Torres-Carrasquillo, P.A. (2005) Approaches and Applications of Audio Diarization. 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, 23-23 March 2005, 953-956. https://doi.org/10.1109/ICASSP.2005.1416463</mixed-citation></ref><ref id="scirp.125234-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Furui, S. (1997) Recent Advances in Speaker Recognition. Pattern Recognition Letters, 18, 859-872. https://doi.org/10.1016/S0167-8655(97)00073-1</mixed-citation></ref><ref id="scirp.125234-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Massaro, M., Dumay, J. and Guthrie, J. (2016) On the Shoulders of Giants: Undertaking a Structured Literature Review in Accounting. Accounting, Auditing &amp; Accountability Journal, 29, 767-801. https://doi.org/10.1108/AAAJ-01-2015-1939</mixed-citation></ref><ref id="scirp.125234-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Rother, E.T. (2007) Systematic Literature Review X Narrative Review. Acta Paulista de Enfermagem, 20, 5-6. https://doi.org/10.1590/S0103-21002007000200001</mixed-citation></ref><ref id="scirp.125234-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Zheng, Y. and Zhao, S. (2016) A Usable Authentication System Based on Personal Voice Challenge. 2016 International Conference on Advanced Cloud and Big Data (CBD), Chengdu, 13-16 August 2016, 194-199.</mixed-citation></ref><ref id="scirp.125234-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">ID R&amp;D (2021) Voice Biometric Revolution: Why Voice ID Is Now Secure Enough for Device Unlock. https://www.idrnd.ai/wp-content/uploads/2021/02/IDRD-VoiceBiometric-DeviceUnlock-Whitepaper-.pdf</mixed-citation></ref><ref id="scirp.125234-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Reynolds, D.A. (2000) An Overview of Automatic Speaker Recognition Technology. 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Orlando, 13-17 May 2002, 4072-4075.</mixed-citation></ref><ref id="scirp.125234-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Yampolskiy, R.V., Govindaraju, V. and Reznik, L. (2013) Behavior-Based Biometrics: A Survey and Classification. International Journal of Biometrics, 5, 197-221.</mixed-citation></ref><ref id="scirp.125234-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Chandran, D.R. (2022) Use of AI Voice Authentication Technology Instead of Traditional Keypads in Security Devices. Journal of Computer and Communications, 10, 11-21. https://doi.org/10.4236/jcc.2022.106002</mixed-citation></ref><ref id="scirp.125234-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Kinnunen, T. and Li, H. (2010) An Overview of Text-Independent Speaker Recognition: From Features to Supervectors. Speech Communication, 52, 12-40. https://doi.org/10.1016/j.specom.2009.08.009</mixed-citation></ref><ref id="scirp.125234-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Kinnunen, T., Wu, Z.Z., Sedlák, F. and Lee, K.A. (2012) Non-Parametric and Parametric Score Calibration for Biometric Authentication. Pattern Recognition Letters, 33, 387-393.</mixed-citation></ref><ref id="scirp.125234-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Wang, R. and Liu, Y. (2013) Scalable Web-Based User Authentication Using Server-Side Verification. International Journal of Security and Its Applications, 7, 329-342.</mixed-citation></ref><ref id="scirp.125234-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Menezes, A., van Oorschot, P.C. and Vanstone, S.A. (2020) Handbook of Applied Cryptography. CRC Press, Boca Raton.</mixed-citation></ref><ref id="scirp.125234-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Gao, X., Liu, Y., Zhang, Q. and Li, Y. (2020) Biometric Authentication Systems: A Comprehensive Review. IEEE Access, 8, 104420-104445.</mixed-citation></ref><ref id="scirp.125234-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Hu, Y., Zhang, H. and Liu, Y. (2020) Biometric Authentication for Secure Mobile Cloud Computing: A Comprehensive Review. IEEE Access, 8, 97612-97627.</mixed-citation></ref><ref id="scirp.125234-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Shin, D., Lee, J., Lee, S. and Lee, S. (2021) Continuous Authentication Using Machine Learning for Mobile Devices: A Comprehensive Review. IEEE Access, 9, 32813-32831.</mixed-citation></ref><ref id="scirp.125234-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Muckenhirn, H., Magimai-Doss, M. and Marcel, S. (2018) Long-Term Speaker Verification: The Case of Telephone Speech. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, 15-20 April 2018, 5239-5243.</mixed-citation></ref><ref id="scirp.125234-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Ratha, N.K., Connell, J.H. and Bolle, R.M. (2001) Enhancing Security and Privacy in Biometrics-Based Authentication Systems. IBM Systems Journal, 40, 614-634. https://doi.org/10.1147/sj.403.0614</mixed-citation></ref><ref id="scirp.125234-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Li, Y., He, X., Zhao, X. and Yang, X. (2019) Enhancing User Experience of Continuous Authentication with Feedback. IEEE Transactions on Mobile Computing, 18, 310-322.</mixed-citation></ref><ref id="scirp.125234-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Ratha, N.K., Connell, J., Sasse, M.A., Brostoff, S. and Weirich, D. (2001) Transforming the ‘Weakest Link’—A Human/Computer Interaction Approach to Usable and Effective Security. BT Technology Journal, 19, 122-131. https://doi.org/10.1023/A:1011902718709</mixed-citation></ref><ref id="scirp.125234-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Gupta, P., Mehrotra, H. and Verma, A. (2019) Voice Biometric-Based User Authentication in Banking: A Literature Review. Journal of Internet Banking and Commerce, 24, 1-13.</mixed-citation></ref><ref id="scirp.125234-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Sundararajan, M. and Rajamani, V. (2016) A Secure Voice-Based Authentication for Accessing EHRs Using Smart Phones. Journal of Medical Systems, 40, 186.</mixed-citation></ref><ref id="scirp.125234-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">Gupta, P. and Sivakumar, A.I. (2019) Voice Biometrics in E-commerce: Applications, Benefits, and Challenges. Journal of Electronic Commerce in Organizations, 17, 1-17.</mixed-citation></ref><ref id="scirp.125234-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">Nayak, S.K., Sahoo, S. and Mohapatra, A.G. (2018) A Secure E-Commerce Application Using Voice Recognition. 2018 2nd International Conference on Inventive Systems and Control (ICISC), Coimbatore, 19-20 January 2018, 1036-1040.</mixed-citation></ref><ref id="scirp.125234-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">Dey, S., Samanta, D. and Pal, A. (2019) A Reliable Voice-Based Authentication System for Enhancing Enterprise Security. Journal of Ambient Intelligence and Humanized Computing, 10, 3579-3592.</mixed-citation></ref><ref id="scirp.125234-ref29"><label>29</label><mixed-citation publication-type="book" xlink:type="simple">Krishna, Y., Kumar, M.A. and Sekhar, C.C. (2018) Speaker Recognition for E-Governance Security. In: Shrivastava, G., Kumar, P., Gupta, B.B., Bala, S. and Dey, N., Eds., Handbook of Research on Network Forensics and Analysis Techniques, IGI Global, Hershey, 255-277.</mixed-citation></ref><ref id="scirp.125234-ref30"><label>30</label><mixed-citation publication-type="other" xlink:type="simple">Jain, A.K., Nandakumar, K. and Ross, A. (2015) 50 Years of Biometric Research: Accomplishments, Challenges, and Opportunities. Pattern Recognition Letters, 79, 80-105. https://doi.org/10.1016/j.patrec.2015.12.013</mixed-citation></ref><ref id="scirp.125234-ref31"><label>31</label><mixed-citation publication-type="other" xlink:type="simple">Korshunov, P., Ramírez, G.R. and Fierrez, J. (2016) Voice Presentation Attack Detection: Current Progress and Challenges. 2016 IEEE 8th International Conference on Biometrics Theory, Applications, and Systems (BTAS), Niagara Falls, 6-9 September 2016, 1-7.</mixed-citation></ref><ref id="scirp.125234-ref32"><label>32</label><mixed-citation publication-type="other" xlink:type="simple">Zheng, N., Bai, K., Huang, H. and Wang, H. (2011) You Are How You Touch: User Verification on Smartphones via Tapping Behaviors. 2014 IEEE 22nd International Conference on Network Protocols (ICNP), Raleigh, 21-24 October 2014, 221-232. https://doi.org/10.1109/ICNP.2014.43</mixed-citation></ref><ref id="scirp.125234-ref33"><label>33</label><mixed-citation publication-type="other" xlink:type="simple">European Parliament and Council (2016) Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the Protection of Natural Persons about the Processing of Personal Data and the Free Movement of Such Data and Repealing Directive 95/46/EC (General Data Protection Regulation). https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX%3A32016R0679</mixed-citation></ref><ref id="scirp.125234-ref34"><label>34</label><mixed-citation publication-type="other" xlink:type="simple">California Legislature (2018) California Consumer Privacy Act of 2018 [AB-375]. https://leginfo.legislature.ca.gov/faces/billTextClient.xhtml?bill_id=201720180AB375</mixed-citation></ref></ref-list></back></article>