List of Standard Amino Acids

This entry is from Wikipedia, the leading user-contributed encyclopedia.

 

This list of standard proteinogenic amino acids details the chemical structures and properties of the twenty standard amino acids used in proteins by living cells.

Contents

 

1. Structures
2. Chemical properties
    2.1. General chemical properties
    2.2. Side chain properties
    2.3. Gene expression and biochemistry
    2.4. Remarks
3. References

1. Structures

L-Alanine
(Ala/A)

L-Arginine
(Arg/R)

L-Asparagine
(Asn/N)

L-Aspartic acid
(Asp/D)

L-Cysteine
(Cys/C)

L-Glutamic acid
(Glu/E)

L-Glutamine
(Gln/Q)

Glycine
(Gly/G)

L-Histidine
(His/H)

L-Isoleucine
(Ile/I)

L-Leucine
(Leu/L)

L-Lysine
(Lys/K)

L-Methionine
(Met/M)

L-Phenylalanine
(Phe/F)

L-Proline
(Pro/P)

L-Serine
(Ser/S)

L-Threonine
(Thr/T)

L-Tryptophan
(Trp/W)

L-Tyrosine
(Tyr/Y)

L-Cysteine
(Cys/C)

Structures and symbols of the 20 amino acids which are directly
encoded for protein synthesis by the standard genetic code.

2. Chemical properties

 

Following is a table listing the one letter symbols, the three-letter symbols, and the chemical properties of the side chains of the standard amino acids. The masses listed are based on weighted averages of the elemental isotopes at their natural abundances. Note that forming a peptide bond results in elimination of a molecule of water, so the mass of an amino acid unit within a protein chain is reduced by 18.01524 Da. The one-letter symbol for an undetermined amino acid is X. The three-letter symbol Asx or one-letter symbol B means the amino acid is either asparagine or aspartic acid; Glx or Z means either glutamic acid or glutamine; and Xle or J means either leucine or isoleucine. IUPAC/IUBMB now also recommends that Sec or U refers to selenocysteine, and Pyl or O refers to pyrrolysine.

2.1. General chemical properties

Acid

Abbrev.

Mass (Da)

pI

pK1 (α-COOH)

pK2 (α-+NH3)

Alanine

A

Ala

89.09404

6.01

2.35

9.87

Cysteine

C

Cys

121.15404

5.05

1.92

10.70

Aspartic acid

D

Asp

133.10384

2.85

1.99

9.90

Glutamic acid

E

Glu

147.13074

3.15

2.10

9.47

Phenylalanine

F

Phe

165.19184

5.49

2.20

9.31

Glycine

G

Gly

75.06714

6.06

2.35

9.78

Histidine

H

His

155.15634

7.60

1.80

9.33

Isoleucine

I

Ile

131.17464

6.05

2.32

9.76

Lysine

K

Lys

146.18934

9.60

2.16

9.06

Leucine

L

Leu

131.17464

6.01

2.33

9.74

Methionine

M

Met

149.20784

5.74

2.13

9.28

Asparagine

N

Asn

132.11904

5.41

2.14

8.72

Proline

P

Pro

115.13194

6.30

1.95

10.64

Glutamine

Q

Gln

146.14594

5.65

2.17

9.13

Arginine

R

Arg

174.20274

10.76

1.82

8.99

Serine

S

Ser

105.09344

5.68

2.19

9.21

Threonine

T

Thr

119.12034

5.60

2.09

9.10

Selenocysteine

U

Sec

169.06

Valine

V

Val

117.14784

6.00

2.39

9.74

Tryptophan

W

Trp

204.22844

5.89

2.46

9.41

Tyrosine

Y

Tyr

181.19124

5.64

2.20

9.21

2.2. Side chain properties

Acid

Abbrev.

Side chain

Hydro-
phobic

pKr

Polar

Charged

Small

Tiny

Aromatic
or Aliphatic

van der Waals

Alanine

A

Ala

-CH3

X

X

X

volume

Cysteine

C

Cys

-CH2SH

X

8.18

acidic

X

67

Aspartic acid

D

Asp

-CH2COOH

3.90

X

acidic

X

86

Glutamic acid

E

Glu

-CH2CH2COOH

4.07

X

acidic

91

Phenylalanine

F

Phe

-CH2C6H5

X

Aromatic

109

Glycine

G

Gly

-H

X

X

135

Histidine

H

His

-CH2-C3H3N2

6.04

X

weak basic

Aromatic

48

Isoleucine

I

Ile

-CH(CH3)CH2CH3

X

Aliphatic

118

Lysine

K

Lys

-(CH2)4NH2

10.54

X

basic

124

Leucine

L

Leu

-CH2CH(CH3)2

X

Aliphatic

135

Methionine

M

Met

-CH2CH2SCH3

X

124

Asparagine

N

Asn

-CH2CONH2

X

X

124

Proline

P

Pro

-CH2CH2CH2-

X

X

96

Glutamine

Q

Gln

-CH2CH2CONH2

X

90

Arginine

R

Arg

-(CH2)3NH-C(NH)NH2

12.48

X

basic

114

Serine

S

Ser

-CH2OH

X

X

X

148

Threonine

T

Thr

-CH(OH)CH3

X

weak acidic

X

73

Selenocysteine

U

Sec

-CH2SeH

X

5.73

X

93

Valine

V

Val

-CH(CH3)2

X

X

Aliphatic

 

Tryptophan

W

Trp

-CH2C8H6N

X

Aromatic

105

Tyrosine

Y

Tyr

-CH2-C6H4OH

X

 

X

Aromatic

163

Note: The pKa values of amino acids are typically slightly different when the amino acid is inside a protein. Protein pKa calculations are sometimes used to calculate the change in the pKa value of an amino acid in this situation.

2.3. Gene expression and biochemistry

Amino Acid

Abbrev.

Codon (s)

Occurrence in proteins (%)

Essential in humans

Alanine

A

Ala

GCU, GCC, GCA, GCG

7.8

Cysteine

C

Cys

UGU, UGC

1.9

Aspartic acid

D

Asp

GAU, GAC

5.3

Glutamate

E

Glu

GAA, GAG

6.3

Phenylalanine

F

Phe

UUU, UUC

3.9

X

Glycine

G

Gly

GGU, GGC, GGA, GGG

7.2

Histidine

H

His

CAU, CAC

2.3

Isoleucine

I

Ile

AUU, AUC, AUA

5.3

X

Lysine

K

Lys

AAA, AAG

5.9

X

Leucine

L

Leu

UUA, UUG, CUU, CUC, CUA, CUG

9.1

X

Methionine

M

Met

AUG

2.3

X

Asparagine

N

Asn

AAU, AAC

4.3

Proline

P

Pro

CCU, CCC, CCA, CCG

5.2

Glutamine

Q

Gln

CAA, CAG

4.2

Arginine

R

Arg

CGU, CGC, CGA, CGG, AGA, AGG

5.1

Serine

S

Ser

UCU, UCC, UCA, UCG, AGU, AGC

6.8

Threonine

T

Thr

ACU, ACC, ACA, ACG

5.9

X

Selenocysteine

U

Sec

UGA

Valine

V

Val

GUU, GUC, GUA, GUG

6.6

X

Tryptophan

W

Trp

UGG

1.4

X

Tyrosine

Y

Tyr

UAU, UAC

3.2

Stop codon

Term

UAA, UAG, UGA

Note: The Stop Codon is not an amino acid, but is included for completeness.

2.4. Remarks

Amino Acid

Abbrev.

Remarks

Alanine

A

Ala

Very abundant, very versatile. More stiff than glycine, but small enough to pose only small steric limits for the protein conformation. It behaves fairly neutrally, can be located in both hydrophilic regions on the protein outside and the hydrophobic areas inside.

Cysteine

C

Cys

The sulfur atom binds readily to heavy metal ions. Under oxidizing conditions, two cysteines can join together in a disulfide bond to form the amino acid cystine. When cystines are part of a protein, insulin for example, this stabilises tertiary structure and makes the protein more resistant to denaturation; disulphide bridges are therefore common in proteins that have to function in harsh environments including digestive enzymes (e.g., pepsin and chymotrypsin) and structural proteins (e.g., keratin). Disulphides are also found in peptides too small to hold a stable shape on their own (eg. insulin).

Aspartic acid

D

Asp

Behaves similarly to glutamic acid. Carries a hydrophilic acidic group with strong negative charge. Usually is located on the outer surface of the protein, making it water-soluble. Binds to positively-charged molecules and ions, often used in enzymes to fix the metal ion. When located inside of the protein, aspartate and glutamate are usually paired with arginine and lysine.

Glutamate

E

Glu

Behaves similar to aspartic acid. Has longer, slightly more flexible side chain.

Phenylalanine

F

Phe

Essential for humans. Phenylalanine, tyrosine, and tryptophan contain large rigid aromatic group on the side chain. These are the biggest amino acids. Like isoleucine, leucine and valine, these are hydrophobic and tend to orient towards the interior of the folded protein molecule.

Glycine

G

Gly

Because of the two hydrogen atoms at the α carbon, glycine is not optically active. It is the smallest amino acid, rotates easily, adds flexibility to the protein chain. It is able to fit into the tightest spaces, e.g., the triple helix of collagen. As too much flexibility is usually not desired, as a structural component it is less common than alanine.

Histidine

H

His

In even slightly acidic conditions protonation of the nitrogen occurs, changing the properties of histidine and the polypeptide as a whole. It is used by many proteins as a regulatory mechanism, changing the conformation and behavior of the polypeptide in acidic regions such as the late endosome or lysosome, enforcing conformation change in enzymes. However only a few histidines are needed for this, so it is comparatively scarce.

Isoleucine

I

Ile

Essential for humans. Isoleucine, leucine and valine have large aliphatic hydrophobic side chains. Their molecules are rigid, and their mutual hydrophobic interactions are important for the correct folding of proteins, as these chains tend to be located inside of the protein molecule.

Lysine

K

Lys

Essential for humans. Behaves similarly to arginine. Contains a long flexible side-chain with a positively-charged end. The flexibility of the chain makes lysine and arginine suitable for binding to molecules with many negative charges on their surfaces. E.g., DNA-binding proteins have their active regions rich with arginine and lysine. The strong charge makes these two amino acids prone to be located on the outer hydrophilic surfaces of the proteins; when they are found inside, they are usually paired with a corresponding negatively-charged amino acid, e.g., aspartate or glutamate.

Leucine

L

Leu

Essential for humans. Behaves similar to isoleucine and valine. See isoleucine.

Methionine

M

Met

Essential for humans. Always the first amino acid to be incorporated into a protein; sometimes removed after translation. Like cysteine, contains sulfur, but with a methyl group instead of hydrogen. This methyl group can be activated, and is used in many reactions where a new carbon atom is being added to another molecule.

Asparagine

N

Asn

Similar to aspartic acid. Asn contains an amide group where Asp has a carboxyl.

Proline

P

Pro

Contains an unusual ring to the N-end amine group, which forces the CO-NH amide sequence into a fixed conformation. Can disrupt protein folding structures like α helix or β sheet, forcing the desired kink in the protein chain. Common in collagen, where it often undergoes a posttranslational modification to hydroxyproline. Uncommon elsewhere.

Glutamine

Q

Gln

Similar to glutamic acid. Gln contains an amide group where Glu has a carboxyl. Used in proteins and as a storage for ammonia.

Arginine

R

Arg

Functionally similar to lysine.

Serine

S

Ser

Serine and threonine have a short group ended with a hydroxyl group. Its hydrogen is easy to remove, so serine and threonine often act as hydrogen donors in enzymes. Both are very hydrophilic, therefore the outer regions of soluble proteins tend to be rich with them.

Threonine

T

Thr

Essential for humans. Behaves similarly to serine.

Valine

V

Val

Essential for humans. Behaves similarly to isoleucine and leucine. See isoleucine.

Tryptophan

W

Trp

Essential for humans. Behaves similarly to phenylalanine and tyrosine (see phenylalanine). Precursor of serotonin.

Tyrosine

Y

Tyr

Behaves similarly to phenylalanine and tryptophan (see phenylalanine). Precursor of melanin, epinephrine, and thyroid hormones.

3. References

 

1. Nelson, David L.; Cox, Michael M. (2000). Lehninger Principles of Biochemistry, 3rd ed, Worth Publishers.
2. Kyte, J.; Doolittle, R. F. (1982). “A simple method for displaying the hydropathic character of a protein”. J. Mol. Biol. 157 (1): 105-132.

ChemPep
Close Bitnami banner
Bitnami