<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OJGen</journal-id><journal-title-group><journal-title>Open Journal of Genetics</journal-title></journal-title-group><issn pub-type="epub">2162-4453</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ojgen.2020.101002</article-id><article-id pub-id-type="publisher-id">OJGen-98646</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject></subj-group></article-categories><title-group><article-title>
 
 
  The Pattern of Occurrence of Cytosine in the Genetic Code Minimizes Deleterious Mutations and Favors Proper Function of the Translational Machinery
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Bin</surname><given-names>Wang</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Department of Chemistry, Marshall University, Huntington, WV, USA</addr-line></aff><pub-date pub-type="epub"><day>20</day><month>12</month><year>2019</year></pub-date><volume>10</volume><issue>01</issue><fpage>8</fpage><lpage>15</lpage><history><date date-type="received"><day>10,</day>	<month>January</month>	<year>2020</year></date><date date-type="rev-recd"><day>29,</day>	<month>February</month>	<year>2020</year>	</date><date date-type="accepted"><day>3,</day>	<month>March</month>	<year>2020</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  The standard genetic code consists of 64 combinations of base triplets made from four different bases. The research aim of this study was to investigate the pattern of occurrence of cytosine in the genetic code. By exploring the base composition and sequence of all 64 codons, the author found some important features based on the instability of cytosine. Because cytosine undergoes spontaneous deamination that converts it into uracil, it is evolutionarily favorable to exclude cytosine from codons critical to the initiation and termination of translation. For amino acids that have one to three synonymous codons (also called synonyms), the frequency of occurrence of C in the first and second positions of their mRNA codons is significantly lower than the frequencies of A, U, and G. For mRNA codons that encode amino acids with four synonyms, the trend of base composition is opposite to those encoding amino acids with one to three synonyms; the instability of C could be inhibited or reduced via formation of hydrogen bonds with a G and/or with a protonated C, and the secondary structure of the resultant mRNA could be adjusted via the multiple synonymous alternates at the third position of their codons to facilitate the translation process. The overall pattern of occurrence for C in the genetic code not only minimizes deleterious mutations and favors proper function of the translational machinery by excluding C from certain positions within codons, but also allows the occurrence of genetic diversity via mutation by including C in less-critical positions.
 
</p></abstract><kwd-group><kwd>Genetic Code</kwd><kwd> Base Triplet</kwd><kwd> Synonyms</kwd><kwd> Cytosine Deamination</kwd><kwd> Translation Mutations</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The standard genetic code is nearly universal, and consists of 64 combinations of base triplets made from four different bases—adenine (A), guanine (G), uracil (U), and cytosine (C). Since 61 of the 64 base triplets are used to encode only 20 amino acids, most amino acids are encoded by more than one codon. The remaining three triplets, called stop codons, designate the termination of translation [<xref ref-type="bibr" rid="scirp.98646-ref1">1</xref>]. To the author’s knowledge, no study has investigated the pattern of occurrence of cytosine in the genetic code; it thus became the objective of this study. The author explored the base composition and sequence of all 64 codons, and inferred some important features in view of the instability of cytosine.</p></sec><sec id="s2"><title>2. Methods</title><p>Since the genetic code is highly degenerate, meaning that most amino acids are encoded by more than one mRNA codon, the author divided the standard genetic codons into two groups: the base triplets encoding amino acids that have one to three synonymous codons (<xref ref-type="table" rid="table1">Table 1</xref>), and those amino acids with four synonymous codons (<xref ref-type="table" rid="table2">Table 2</xref>). Amino acids serine, leucine, and arginine each have six synonymous codons (also called synonyms); they are categorized as two-synonym plus four-synonym occurrences. The author determined the percentage (%) of A, U, G, and C at every position of the base triplet for mRNA codons with one to three synonyms (<xref ref-type="table" rid="table1">Table 1</xref>), and those with four synonyms (<xref ref-type="table" rid="table2">Table 2</xref>), respectively.</p></sec><sec id="s3"><title>3. Results</title><p>The first feature is the absence of cytosine (C) in both the start (AUG, also the only codon for methionine) and stop codons (UAA, UAG, and UGA) of translation. The initiation and termination of translation are critical for protein synthesis; therefore, evolution has resulted in a higher frequency of the more stable A, U, and G to avoid a fatal malfunction in the translation process. Cytosine is also absent from the only codon for the amino acid tryptophan (UGG). The author infers that the absence of cytosine from the codons for methionine and tryptophan, neither of which has an alternate mRNA codon, is the result of evolutionary selection to avoid translation errors due to the spontaneous deamination of cytosine to uracil [<xref ref-type="bibr" rid="scirp.98646-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref4">4</xref>].</p><p>In contrast to the standard genetic code referred to above, mitochondrial genomes contain alternate start codons (e.g., AUA and AUU in humans, and GUG and UUG in prokaryotes). All vertebrate mitochondria use AGA and AGG as translation terminators. Mitochondrial mRNA from vertebrates and microorganisms use UGA to encode tryptophan rather than as a translation terminator,</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Analysis of the base triplets that encode the initiation and termination of translation, and those that encode amino acids with one to three synonymous codons. The genetic codons, and the amino acids encoded and their properties are from Berg et al. (2015) and Harris et al. (2016) [<xref ref-type="bibr" rid="scirp.98646-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref6">6</xref>]</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  >Amino Acid Encoded, Including the Property and Formula of Its Side Chain</th><th align="center" valign="middle"  rowspan="2"  >mRNA Codon</th><th align="center" valign="middle"  colspan="5"  >% of Each Base for mRNA Codons with 1 - 3 Synonyms</th></tr></thead><tr><td align="center" valign="middle" >1<sup>st</sup> Position (Left)</td><td align="center" valign="middle" >2<sup>nd </sup>Position (Middle)</td><td align="center" valign="middle" >3<sup>rd</sup> Position (Right)</td><td align="center" valign="middle" >1<sup>st</sup> and 2<sup>nd</sup> Positions</td><td align="center" valign="middle" >All Three Positions</td></tr><tr><td align="center" valign="middle" >Methionine (Met) Translation Start Codon hydrophobic -CH<sub>2</sub>CH<sub>2</sub>SCH<sub>3</sub></td><td align="center" valign="middle" >AUG</td><td align="center" valign="middle"  rowspan="16"  >% of A 12/32 = 37.5% % of U 12/32 = 37.5% % of G 4/32 = 12.5% % of C 4/32 = 12.5%</td><td align="center" valign="middle"  rowspan="16"  >% of A 16/32 = 50% % of U 8/32 = 25% % of G 8/32 = 25% % of C 0/32 = 0%</td><td align="center" valign="middle"  rowspan="16"  >% of A 8/32 = 25% % of U 8/32 = 25% % of G 8/32 = 25% % of C 8/32 = 25%</td><td align="center" valign="middle"  rowspan="16"  >% of A 28/64 = 43.8% % of U 20/64 = 31.2% % of G 12/64 = 18.8% % of C 4/64 = 6.2%</td><td align="center" valign="middle"  rowspan="16"  >% of A 36/96 = 37.5% % of U 28/96 = 29.2% % of G 20/96 = 20.8% % of C 12/96 = 12.5%</td></tr><tr><td align="center" valign="middle" >Tryptophan (Trp) hydrophobic -CH<sub>2</sub>C<sub>8</sub>H<sub>6</sub>N</td><td align="center" valign="middle" >UGG</td></tr><tr><td align="center" valign="middle" >Lysine (Lys) positively charged − CH 2 CH 2 CH 2 CH 2 NH 3 +</td><td align="center" valign="middle" >AAA AAG</td></tr><tr><td align="center" valign="middle" >Asparagine (Asn) polar -CH<sub>2</sub>CONH<sub>2</sub></td><td align="center" valign="middle" >AAU AAC</td></tr><tr><td align="center" valign="middle" >Arginine (Arg) positively charged − CH 2 CH 2 CH 2 NHC ( NH 2 ) 2 +</td><td align="center" valign="middle" >AGA AGG</td></tr><tr><td align="center" valign="middle" >Serine (Ser) polar -CH<sub>2</sub>OH</td><td align="center" valign="middle" >AGU AGC</td></tr><tr><td align="center" valign="middle" >Tyrosine (Tyr) polar -CH<sub>2</sub>C<sub>6</sub>H<sub>4</sub>OH</td><td align="center" valign="middle" >UAU UAC</td></tr><tr><td align="center" valign="middle" >Leucine (Leu) hydrophobic -CH<sub>2</sub>CH(CH<sub>3</sub>)<sub>2</sub></td><td align="center" valign="middle" >UUA UUG</td></tr><tr><td align="center" valign="middle" >Phenylalanine (Phe) hydrophobic -CH<sub>2</sub>C<sub>6</sub>H<sub>5</sub></td><td align="center" valign="middle" >UUU UUC</td></tr><tr><td align="center" valign="middle" >Cysteine (Cys) polar -CH<sub>2</sub>SH</td><td align="center" valign="middle" >UGU UGC</td></tr><tr><td align="center" valign="middle" >Glutamic Acid (Glu) negatively charged -CH<sub>2</sub>CH<sub>2</sub>COO<sup>−</sup></td><td align="center" valign="middle" >GAA GAG</td></tr><tr><td align="center" valign="middle" >Aspartic Acid (Asp) negatively charged -CH<sub>2</sub>COO<sup>−</sup></td><td align="center" valign="middle" >GAU GAC</td></tr><tr><td align="center" valign="middle" >Glutamine (Gln) polar -CH<sub>2</sub>CH<sub>2</sub>CONH<sub>2</sub></td><td align="center" valign="middle" >CAA CAG</td></tr><tr><td align="center" valign="middle" >Histidine (His) polar/positively charged -CH<sub>2</sub>C<sub>3</sub>H<sub>3</sub>N<sub>2</sub></td><td align="center" valign="middle" >CAU CAC</td></tr><tr><td align="center" valign="middle" >Isoleucine (Ile) hydrophobic -CH(CH<sub>3</sub>)(CH<sub>2</sub>CH<sub>3</sub>)</td><td align="center" valign="middle" >AUA AUU AUC</td></tr><tr><td align="center" valign="middle" >Translation Stop Codon</td><td align="center" valign="middle" >UAA UAG UGA</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Analysis of the base triplets that encode amino acids with four synonymous codons. The genetic codons, and the amino acids encoded and their properties are from Berg et al. (2015) and Harris et al. (2016) [<xref ref-type="bibr" rid="scirp.98646-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref6">6</xref>]</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  >Amino Acid Encoded, Including the Property and Formula of Its Side Chain</th><th align="center" valign="middle"  rowspan="2"  >mRNA Codon</th><th align="center" valign="middle"  colspan="5"  >% of Each Base for mRNA Codons with Four Synonyms</th></tr></thead><tr><td align="center" valign="middle" >1<sup>st</sup> Position (Left)</td><td align="center" valign="middle" >2<sup>nd</sup> Position (Middle)</td><td align="center" valign="middle" >3<sup>rd</sup> Position (Right)</td><td align="center" valign="middle" >1<sup>st</sup> and 2<sup>nd</sup> Positions</td><td align="center" valign="middle" >All Three Positions</td></tr><tr><td align="center" valign="middle" >Threonine (Thr) polar -CHCH<sub>3</sub>OH</td><td align="center" valign="middle" >ACA ACG ACU ACC</td><td align="center" valign="middle"  rowspan="8"  >% of A 4/32 = 12.5% % of U 4/32 = 12.5% % of G 12/32 = 37.5% % of C 12/32 = 37.5%</td><td align="center" valign="middle"  rowspan="8"  >% of A 0/32 = 0% % of U 8/32 = 25% % of G 8/32 = 25% % of C 16/32 = 50%</td><td align="center" valign="middle"  rowspan="8"  >% of A 8/32 = 25% % of U 8/32 = 25% % of G 8/32 = 25% % of C 8/32 = 25%</td><td align="center" valign="middle"  rowspan="8"  >% of A 4/64 = 6.2% % of U 12/64 = 18.8% % of G 20/64 = 31.2% % of C 28/64 = 43.8%</td><td align="center" valign="middle"  rowspan="8"  >% of A 12/96 = 12.5% % of U 20/96 = 20.8% % of G 28/96 = 29.2% % of C 36/96 = 37.5%</td></tr><tr><td align="center" valign="middle" >Serine (Ser) polar -CH<sub>2</sub>OH</td><td align="center" valign="middle" >UCA UCG UCU UCC</td></tr><tr><td align="center" valign="middle" >Valine (Val) hydrophobic -CH(CH<sub>3</sub>)<sub>2</sub></td><td align="center" valign="middle" >GUA GUG GUU GUC</td></tr><tr><td align="center" valign="middle" >Glycine (Gly) hydrophobic -H</td><td align="center" valign="middle" >GGA GGG GGU GGC</td></tr><tr><td align="center" valign="middle" >Alanine (Ala) hydrophobic -CH<sub>3</sub></td><td align="center" valign="middle" >GCA GCG GCU GCC</td></tr><tr><td align="center" valign="middle" >Leucine (Leu) hydrophobic -CH<sub>2</sub>CH(CH<sub>3</sub>)<sub>2</sub></td><td align="center" valign="middle" >CUA CUG CUU CUC</td></tr><tr><td align="center" valign="middle" >Arginine (Arg) positively charged − CH 2 CH 2 CH 2 NHC ( NH 2 ) 2 +</td><td align="center" valign="middle" >CGA CGG CGU CGC</td></tr><tr><td align="center" valign="middle" >Proline (Pro) hydrophobic -CH<sub>2</sub>CH<sub>2</sub>CH<sub>2</sub>-</td><td align="center" valign="middle" >CCA CCG CCU CCC</td></tr></tbody></table></table-wrap><p>and vertebrate mitochondria use AUA for methionine rather than for isoleucine [<xref ref-type="bibr" rid="scirp.98646-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref6">6</xref>]. Again, C is absent from these critical codons. While the author will focus on the nucleic genetic code in the following discussion, it is noted that the pattern of occurrence for cytosine seems to be true for mitochondrial codons as well.</p><p>The right-hand column in <xref ref-type="table" rid="table1">Table 1</xref> (“All Three Positions” column) provides the total base composition, including total number and percentage of A, U, G, and C in the mRNA codons shown. Overall, A and U residues are more abundant than G and C residues in the codons for amino acids with one to three synonyms. Data presented in <xref ref-type="table" rid="table1">Table 1</xref> (“1<sup>st</sup> Position” column) provide the base composition at the 5’/left end of the base triplet of the mRNA codons studied. The frequencies of A and U are 37.5% each, whereas G and C residues are less frequent (12.5% each). At the second/middle base of the mRNA codons studied, the frequencies of A, U, and G are 50%, 25%, and 25%, respectively, as shown in <xref ref-type="table" rid="table1">Table 1</xref> (“2<sup>nd</sup> Position” column). Interestingly, C does not occur at the second position. At the 3’/right end of the base triplet of the mRNA codons studied (<xref ref-type="table" rid="table1">Table 1</xref>, “3<sup>rd</sup> Position” column), there is an equal abundance of A, U, G, and C (25% each).</p><p>For mRNA codons that encode amino acids with four synonyms, the trend of base composition is opposite to those encoding amino acids with one to three synonyms. As shown in <xref ref-type="table" rid="table2">Table 2</xref> (“All Three Positions” column), C and G residues are more abundant than U and A residues for codons encoding amino acids with four synonyms. The frequencies of C and G at the first position of the mRNA codons studied (<xref ref-type="table" rid="table2">Table 2</xref>, “1<sup>st</sup> Position” column) are 37.5% each, whereas the frequencies of U and A are 12.5% each. At the second position of the mRNA codons studied (<xref ref-type="table" rid="table2">Table 2</xref>, “2<sup>nd</sup> Position” column), the frequencies of C, G, and U are 50%, 25%, and 25%, respectively; A does not occur at the second position. At the third position of the mRNA codons studied (<xref ref-type="table" rid="table2">Table 2</xref>, “3<sup>rd</sup> Position” column), there is an equal abundance of A, U, G, and C (25% each).</p></sec><sec id="s4"><title>4. Discussion</title><p>Because cytosine is known to undergo spontaneous deamination into uracil, it is evolutionarily favorable to exclude cytosine from codons critical to the initiation or termination of translation. For amino acids that have one to three synonyms, the frequency of occurrence of C in the first and second positions (the root) of their mRNA codons is significantly lower than the frequencies of occurrence of A, U, and G (see <xref ref-type="table" rid="table1">Table 1</xref>, “1<sup>st</sup> and 2<sup>nd</sup> Positions” column). Furthermore, since the middle position of a base triplet is the most critical location for mRNA codon-tRNA anticodon interaction/binding [<xref ref-type="bibr" rid="scirp.98646-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref11">11</xref>], the complete absence of C from the second position that is observed for base triplets encoding amino acids with one to three synonyms is not surprising.</p><p>In <xref ref-type="table" rid="table1">Table 1</xref>, the only mRNA codons containing C in the root are those encoding histidine (CAU and CAC) and glutamine (CAA and CAG). If spontaneous deamination by hydrolysis occurs, histidine will be converted into tyrosine (UAU and UAC), and glutamine will be converted into a stop codon (UAA and UAG). Since histidine and tyrosine both have polar side chains, in theory, this C-to-U mutation may be less likely to introduce significant changes in a protein’s structure or function. However, histidine is often found in active sites of enzymes because its imidazole ring-containing side chain is able to perform many different roles in catalysis, whereas tyrosine has a phenol-containing side chain [<xref ref-type="bibr" rid="scirp.98646-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref6">6</xref>]. Therefore, the histidine-to-tyrosine mutation may allow for genetic variation. The C-to-U mutation within a glutamine codon would cause translation to stop. Because humans can synthesize enough glutamine, it is the most abundant nonessential amino acid in the human body; further studies are needed to determine the effects of the conversion of a glutamine codon into a stop codon on human health and on genetic diversity, although the loss of a protein is likely to have deleterious effects.</p><p>For amino acids that have four synonyms, the effects of an unstable C on translation mutations may not be as deleterious as for amino acids with fewer synonyms, due to the high percentages of C and G in the root, and to the existence of multiple synonymous alternates at the third position of these codons. Frederico et al. demonstrated that the rate of hydrolytic deamination of cytosine in a double helix was approximately 140-fold slower than in single-stranded DNA at 37˚C [<xref ref-type="bibr" rid="scirp.98646-ref12">12</xref>]; this difference is mainly due to the decreased accessibility of the N3 and C4 positions in a cytosine that is paired to guanine via hydrogen bonds, blocking the attack from water. The mRNA codons encoding amino acids with four synonyms are CG-rich in the root (see <xref ref-type="table" rid="table2">Table 2</xref>, “1<sup>st</sup> and 2<sup>nd</sup> Positions” column), which indicates that they have the potential to inhibit or reduce cytosine deamination by folding upon themselves to form a C&#186;G double helix, and/or to form a hydrogen-bonded C<sup>+</sup>-C i-motif if the RNA sequence is C-rich. (Note: Previous studies have proved the existence of i-motifs under physiological pH [<xref ref-type="bibr" rid="scirp.98646-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref14">14</xref>].) Since CG-rich mRNA regions may form complicated secondary structures that hinder the translation process, producing the same amino acid no matter which of the four mRNA bases is in the third position allows the adjustment of the secondary structure of the resultant mRNA.</p><p><xref ref-type="table" rid="table2">Table 2</xref> shows that no A is present at the second position of base triplets encoding amino acids with four synonyms. Previous studies have indicated that the second base of mRNA codons determines the hydrophobicity of the encoded amino acids: The majority of codons for hydrophilic (polar and/or charged) amino acids have A in the second position; while the majority of codons for hydrophobic amino acids have U in the second position [<xref ref-type="bibr" rid="scirp.98646-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.98646-ref16">16</xref>]. From <xref ref-type="table" rid="table1">Table 1</xref>, we can see that hydrophilic amino acids with one to three synonyms have A or G in the second position of their mRNA codons, while hydrophobic amino acids with one to three synonyms have U or G in their second position. From <xref ref-type="table" rid="table2">Table 2</xref>, we can see that hydrophilic amino acids with four synonyms have C or G in the second position of their mRNA codons, while hydrophobic amino acids have U or C or G in their second position. Since the majority of hydrophilic amino acids have two synonyms, it is reasonable that A is absent from the second position of mRNA codons that encode amino acids with four synonyms.</p></sec><sec id="s5"><title>5. Conclusion</title><p>In summary, for amino acids that have one to three synonyms, the frequency of occurrence of C in the root of their mRNA codons is significantly lower than the frequencies of A, U, and G. For amino acids that have four synonyms, the instability of C may be inhibited or reduced via the formation of hydrogen bonds with a G and/or with a protonated C. In addition, the “new” secondary structure of the resultant mRNA could be adjusted via the multiple synonymous alternates in the codons’ third positions, which could facilitate the translation process. The overall pattern of occurrence for C in the genetic code not only minimizes deleterious mutations and favors proper function of the translational machinery by excluding C from certain positions within codons, but also allows the occurrence of genetic diversity via mutation by including C in less-critical positions. Evolution is an excellent engineer.</p></sec><sec id="s6"><title>Acknowledgements</title><p>This work is supported by the National Science Foundation under Award No. OIA-1458952. Any opinions, findings, and conclusions expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation.</p></sec><sec id="s7"><title>Conflicts of Interest</title><p>The author declares that she has no competing financial interests.</p></sec><sec id="s8"><title>Data Availability Statement</title><p>All data generated and analyzed during this study are included in this published article.</p></sec><sec id="s9"><title>Cite this paper</title><p>Wang, B. (2020) The Pattern of Occurrence of Cytosine in the Genetic Code Minimizes Deleterious Mutations and Favors Proper Function of the Translational Machinery. Open Journal of Genetics, 10, 8-15. https://doi.org/10.4236/ojgen.2020.101002</p></sec></body><back><ref-list><title>References</title><ref id="scirp.98646-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Berg, J.M., Tymoczko, J.L., Gatto Jr., G.J. and Stryer, L. (2015) Biochemistry. 8th Edition, W.H. Freeman &amp; Company, New York, NY.</mixed-citation></ref><ref id="scirp.98646-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Nabel, C.S., Manning, S.A. and Kohli, R.M. (2012) The Curious Chemical Biology of Cytosine: Deamination, Methylation, and Oxidation as Modulators of Genomic Potential. ACS Chemical Biology, 7, 20-30. https://doi.org/10.1021/cb2002895</mixed-citation></ref><ref id="scirp.98646-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Poole, A., Penny, D. and Sjoberg, B.M. (2001) Confounded Cytosine! Tinkering and the Evolution of DNA. Nature Reviews Molecular Cell Biology, 2, 147-151. https://doi.org/10.1038/35052091</mixed-citation></ref><ref id="scirp.98646-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Levy, M. and Miller, S.L. (1998) The Stability of the RNA Bases: Implications for the Origin of Life. Proceedings of the National Academy of Sciences of the United States of America, 95, 7933-7938. https://doi.org/10.1073/pnas.95.14.7933</mixed-citation></ref><ref id="scirp.98646-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Swire, J., Judson, O.P. and Burt, A. (2005) Mitochondrial Genetic Codes Evolve to Match Amino Acid Requirements of Proteins. Journal of Molecular Evolution, 60, 128-139. https://doi.org/10.1007/s00239-004-0077-9</mixed-citation></ref><ref id="scirp.98646-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Harris, D.C. and Lucy, C.A. (2016) Quantitative Chemical Analysis. 9th Edition, W.H. Freeman &amp; Company, New York, NY.</mixed-citation></ref><ref id="scirp.98646-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Lehmann, J. and Libchaber, A. (2008) Degeneracy of the Genetic Code and Stability of the Base Pair at the Second Position of the Anticodon. RNA, 14, 1264-1269.https://doi.org/10.1261/rna.1029808</mixed-citation></ref><ref id="scirp.98646-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Auffinger, P. and Westhof, E. (2001) An Extended Structural Signature for the tRNA Anticodon Loop. RNA, 7, 334-341. https://doi.org/10.1017/S1355838201002382</mixed-citation></ref><ref id="scirp.98646-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Rumer, Y.B. (2016) Translation of “Systematization of Codons in the Genetic Code [I]” by Yu. B. Rumer (1966). Philosophical Transactions of The Royal Society A, 374, Article ID: 20150446. https://doi.org/10.1098/rsta.2015.0446</mixed-citation></ref><ref id="scirp.98646-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Rumer, Y.B. (2016) Translation of “Systematization of Codons in the Genetic Code [II]” by Yu. B. Rumer (1968). Philosophical Transactions of The Royal Society A, 374, Article ID: 20150447. https://doi.org/10.1098/rsta.2015.0447</mixed-citation></ref><ref id="scirp.98646-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Rumer, Y.B. (2016) Translation of “Systematization of Codons in the Genetic Code [III]” by Yu. B. Rumer (1969). Philosophical Transactions of The Royal Society A, 374, Article ID: 20150448. https://doi.org/10.1098/rsta.2015.0448</mixed-citation></ref><ref id="scirp.98646-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Frederico, L.A., Kunkel, T.A. and Shaw, B.R. (1990) A Sensitive Genetic Assay for the Detection of Cytosine Deamination: Determination of Rate Constants and the Activation Energy. Biochemistry, 29, 2532-2537. https://doi.org/10.1021/bi00462a015</mixed-citation></ref><ref id="scirp.98646-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Wright, E.P., Huppert, J.L. and Waller, Z.A.E. (2017) Identification of Multiple Genomic DNA Sequences Which Form I-motif Structures at Neutral pH. Nucleic Acids Research, 45, 2951-2959. https://doi.org/10.1093/nar/gkx090</mixed-citation></ref><ref id="scirp.98646-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Zeraati, M., Langley, D.B., Schofield, P., Moye, A.L., Rouet, R., Hughes, W.E., Bryan, T.M., Dinger, M.E. and Christ, D. (2018) I-motif DNA Structures Are Formed in the Nuclei of Human Cells. Nature Chemistry, 10, 631-637.https://doi.org/10.1038/s41557-018-0046-3</mixed-citation></ref><ref id="scirp.98646-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Chiusano, M.L., Alvarez-Valin, F., Di Giulio, M., D’Onofrio, G., Ammirato, G., Colonna, G. and Bernardi, G. (2000) Second Codon Positions of Genes and the Secondary Structures of Proteins. Relationships and Implications for the Origin of the Genetic Code. Gene, 261, 63-69. https://doi.org/10.1016/S0378-1119(00)00521-7</mixed-citation></ref><ref id="scirp.98646-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Copley, S.D., Smith, E. and Morowitz, H.J. (2005) A Mechanism for the Association of Amino Acids with Their Codons and the Origin of the Genetic Code. Proceedings of the National Academy of Sciences of the United States of America, 102, 4442-4447. https://doi.org/10.1073/pnas.0501049102</mixed-citation></ref></ref-list></back></article>