Katana VentraIP

Proteinogenic amino acid

Proteinogenic amino acids are amino acids that are incorporated biosynthetically into proteins during translation. The word "proteinogenic" means "protein creating". Throughout known life, there are 22 genetically encoded (proteinogenic) amino acids, 20 in the standard genetic code and an additional 2 (selenocysteine and pyrrolysine) that can be incorporated by special translation mechanisms.[1]

In contrast, non-proteinogenic amino acids are amino acids that are either not incorporated into proteins (like GABA, L-DOPA, or triiodothyronine), misincorporated in place of a genetically encoded amino acid, or not produced directly and in isolation by standard cellular machinery (like hydroxyproline). The latter often results from post-translational modification of proteins. Some non-proteinogenic amino acids are incorporated into nonribosomal peptides which are synthesized by non-ribosomal peptide synthetases.


Both eukaryotes and prokaryotes can incorporate selenocysteine into their proteins via a nucleotide sequence known as a SECIS element, which directs the cell to translate a nearby UGA codon as selenocysteine (UGA is normally a stop codon). In some methanogenic prokaryotes, the UAG codon (normally a stop codon) can also be translated to pyrrolysine.[2]


In eukaryotes, there are only 21 proteinogenic amino acids, the 20 of the standard genetic code, plus selenocysteine. Humans can synthesize 12 of these from each other or from other molecules of intermediary metabolism. The other nine must be consumed (usually as their protein derivatives), and so they are called essential amino acids. The essential amino acids are histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine (i.e. H, I, L, K, M, F, T, W, V).[3]


The proteinogenic amino acids have been found to be related to the set of amino acids that can be recognized by ribozyme autoaminoacylation systems.[4] Thus, non-proteinogenic amino acids would have been excluded by the contingent evolutionary success of nucleotide-based life forms. Other reasons have been offered to explain why certain specific non-proteinogenic amino acids are not generally incorporated into proteins; for example, ornithine and homoserine cyclize against the peptide backbone and fragment the protein with relatively short half-lives, while others are toxic because they can be mistakenly incorporated into proteins, such as the arginine analog canavanine.


The evolutionary selection of certain proteinogenic amino acids from the primordial soup has been suggested to be because of their better incorporation into a polypeptide chain as opposed to non-proteinogenic amino acids.[5]

L-Alanine
(Ala / A)

L-Alanine (Ala / A)

L-Arginine
(Arg / R)

L-Arginine (Arg / R)

L-Asparagine
(Asn / N)

L-Asparagine (Asn / N)

L-Aspartic acid
(Asp / D)

L-Aspartic acid (Asp / D)

L-Cysteine
(Cys / C)

L-Cysteine (Cys / C)

L-Glutamic acid
(Glu / E)

L-Glutamic acid (Glu / E)

L-Glutamine
(Gln / Q)

L-Glutamine (Gln / Q)

Glycine
(Gly / G)

Glycine (Gly / G)

L-Histidine
(His / H)

L-Histidine (His / H)

L-Isoleucine
(Ile / I)

L-Isoleucine (Ile / I)

L-Leucine
(Leu / L)

L-Leucine (Leu / L)

L-Lysine
(Lys / K)

L-Lysine (Lys / K)

L-Methionine
(Met / M)

L-Methionine (Met / M)

L-Phenylalanine
(Phe / F)

L-Phenylalanine (Phe / F)

L-Proline
(Pro / P)

L-Proline (Pro / P)

L-Serine
(Ser / S)

L-Serine (Ser / S)

L-Threonine
(Thr / T)

L-Threonine (Thr / T)

L-Tryptophan
(Trp / W)

L-Tryptophan (Trp / W)

L-Tyrosine
(Tyr / Y)

L-Tyrosine (Tyr / Y)

L-Valine
(Val / V)

L-Valine (Val / V)

The following illustrates the structures and abbreviations of the 21 amino acids that are directly encoded for protein synthesis by the genetic code of eukaryotes. The structures given below are standard chemical structures, not the typical zwitterion forms that exist in aqueous solutions.


IUPAC/IUBMB now also recommends standard abbreviations for the following two amino acids:

Glucogenic, with the products having the ability to form by gluconeogenesis

glucose

Ketogenic, with the products not having the ability to form glucose: These products may still be used for or lipid synthesis.

ketogenesis

Amino acids catabolized into both glucogenic and ketogenic products

Glucogenic amino acid

Ketogenic amino acid

The origin of the single-letter code for the amino acids