Xirius-PEPTIDESANDPROTEINSNOTE8-CHM211.pdf
Xirius AI
This document, "PEPTIDES AND PROTEINS NOTE 8 - CHM211," serves as a comprehensive guide to the fundamental concepts of amino acids, peptides, and proteins, tailored for a CHM211 course. It systematically introduces the building blocks of proteins, their chemical properties, and the various levels of structural organization that dictate their biological functions. The notes begin with a detailed exploration of amino acids, covering their general structure, stereochemistry, and crucial acid-base properties, including the concept of zwitterions and isoelectric point.
The document then progresses to explain how amino acids link together to form peptides through the formation of peptide bonds, detailing the nomenclature and structural characteristics of these oligomers. Finally, it delves into the complex world of proteins, outlining their hierarchical structural organization from primary to quaternary levels, and discussing the forces that stabilize these structures. Key topics such as protein denaturation, which leads to loss of function, and various classification schemes based on shape, composition, and biological function are also thoroughly covered. The overall aim is to provide students with a solid foundation in the chemistry and biology of these essential macromolecules.
MAIN TOPICS AND CONCEPTS
Amino acids are the fundamental monomeric units that constitute peptides and proteins. They are organic compounds characterized by the presence of both an amino group ($-NH_2$) and a carboxyl group ($-COOH$) attached to the same carbon atom, known as the alpha-carbon ($\alpha$-carbon). Also attached to the $\alpha$-carbon are a hydrogen atom and a unique side chain, or R-group, which determines the specific properties of each amino acid.
* General Structure: The general formula for an $\alpha$-amino acid is:
```
R
|
H_2N - C - COOH
|
H
```
where R represents the side chain.
* Classification: Amino acids are classified based on the polarity and charge of their R-groups at physiological pH (around 7.4):
* Nonpolar, Aliphatic R Groups: These have hydrocarbon side chains and are hydrophobic. Examples: Glycine (Gly), Alanine (Ala), Valine (Val), Leucine (Leu), Isoleucine (Ile), Methionine (Met), Proline (Pro).
* Aromatic R Groups: These contain aromatic rings and are relatively nonpolar. Examples: Phenylalanine (Phe), Tyrosine (Tyr), Tryptophan (Trp). Tyrosine and Tryptophan are more polar than Phenylalanine due to hydroxyl and indole groups, respectively.
* Polar, Uncharged R Groups: These have side chains capable of hydrogen bonding but are not ionized at physiological pH. Examples: Serine (Ser), Threonine (Thr), Cysteine (Cys), Asparagine (Asn), Glutamine (Gln). Cysteine is unique due to its thiol group, which can form disulfide bonds.
* Positively Charged (Basic) R Groups: These have side chains containing amino or guanidinium groups that are protonated and positively charged at physiological pH. Examples: Lysine (Lys), Arginine (Arg), Histidine (His). Histidine has a pKa near physiological pH, so it can be uncharged or positively charged.
* Negatively Charged (Acidic) R Groups: These have side chains containing carboxyl groups that are deprotonated and negatively charged at physiological pH. Examples: Aspartate (Asp), Glutamate (Glu).
* Stereochemistry:
* All amino acids except glycine (where R=H) have a chiral $\alpha$-carbon, meaning it is bonded to four different groups.
* This chirality leads to two possible stereoisomers: L- and D-forms.
* In proteins, almost exclusively L-amino acids are found.
* The L/D configuration is determined by comparing the amino acid to L- or D-glyceraldehyde. In a Fischer projection, if the amino group is on the left, it's an L-amino acid; if on the right, it's a D-amino acid.
* Acid-Base Properties:
* Amino acids are amphoteric (ampholytes), meaning they can act as both acids and bases.
* At physiological pH, amino acids exist predominantly as zwitterions (dipolar ions), where the amino group is protonated ($-NH_3^+$) and the carboxyl group is deprotonated ($-COO^-$). The net charge of a zwitterion is zero.
* Isoelectric Point (pI): This is the specific pH at which an amino acid (or peptide/protein) has a net electrical charge of zero. At this pH, the molecule is least soluble and will not migrate in an electric field.
* Titration Curves: The titration of an amino acid reveals its multiple ionizable groups (carboxyl, amino, and sometimes R-group). Each group has a specific pKa value.
* Henderson-Hasselbalch Equation: This equation is used to calculate the pH of a buffer solution or to determine the ratio of conjugate base to weak acid:
$pH = pKa + \log \frac{[A^-]}{[HA]}$
* Calculating pI:
* For amino acids with uncharged R-groups (e.g., Glycine, Alanine): $pI = \frac{pKa_1 + pKa_2}{2}$ (where $pKa_1$ is for the carboxyl group and $pKa_2$ is for the amino group).
* For acidic amino acids (e.g., Aspartate, Glutamate): $pI = \frac{pKa_1 + pKa_R}{2}$ (where $pKa_1$ is for the $\alpha$-carboxyl and $pKa_R$ is for the R-group carboxyl).
* For basic amino acids (e.g., Lysine, Arginine): $pI = \frac{pKa_R + pKa_2}{2}$ (where $pKa_R$ is for the R-group amino/guanidinium and $pKa_2$ is for the $\alpha$-amino).
Peptides: Chains of Amino AcidsPeptides are polymers formed by linking amino acids together via peptide bonds.
* Peptide Bond Formation:
* A peptide bond is an amide linkage formed by a condensation reaction (dehydration reaction) between the carboxyl group of one amino acid and the amino group of another amino acid.
* Water is eliminated during this process.
* The peptide bond has partial double-bond character, making it rigid and planar. This restricts rotation around the C-N bond.
* Naming Peptides:
* Peptides are named starting from the amino-terminal (N-terminal) end, which has a free amino group, to the carboxyl-terminal (C-terminal) end, which has a free carboxyl group.
* Amino acid residues within the peptide (except the C-terminal one) are named by changing their -ine or -ate suffix to -yl (e.g., Alanine becomes Alanyl).
* Example: Alanyl-Glycyl-Serine (Ala-Gly-Ser).
* Examples:
* Dipeptide: Two amino acids linked by one peptide bond (e.g., Glycylalanine).
* Tripeptide: Three amino acids linked by two peptide bonds.
* Oligopeptide: A few amino acids (typically 2-20).
* Polypeptide: Many amino acids (typically >20). Proteins are large polypeptides.
Proteins: Complex MacromoleculesProteins are large, complex macromolecules that perform a vast array of functions within living organisms. Their biological activity is intimately linked to their intricate three-dimensional structures.
* Levels of Protein Structure: Proteins exhibit four levels of structural organization:
* Primary Structure: This is the unique, linear sequence of amino acids in a polypeptide chain, determined by covalent peptide bonds. It is the most fundamental level and dictates all higher levels of structure.
* Secondary Structure: Refers to the local, regular folding patterns of the polypeptide backbone, stabilized by hydrogen bonds between the carbonyl oxygen and amide hydrogen atoms of the peptide backbone.
* $\alpha$-Helix: A spiral structure where the polypeptide backbone coils around an imaginary axis. Hydrogen bonds form between the C=O of residue $n$ and the N-H of residue $n+4$. The R-groups project outwards.
* $\beta$-Sheet (Pleated Sheet): Consists of two or more polypeptide strands (beta strands) arranged side-by-side. Hydrogen bonds form between adjacent strands, which can be parallel or antiparallel. The R-groups alternate above and below the plane of the sheet.
* Other less common secondary structures include $\beta$-turns and random coils.
* Tertiary Structure: The overall three-dimensional shape of a single polypeptide chain, resulting from the folding of secondary structures and interactions between the R-groups of amino acids. It is stabilized by various non-covalent interactions and covalent disulfide bonds:
* Hydrophobic interactions: Nonpolar R-groups cluster in the interior, away from water.
* Ionic bonds (salt bridges): Interactions between oppositely charged R-groups (e.g., Lysine and Aspartate).
* Hydrogen bonds: Between polar R-groups, or between polar R-groups and the polypeptide backbone.
* Disulfide bonds: Covalent bonds formed between the thiol groups of two cysteine residues ($-S-S-$). These are strong and contribute significantly to protein stability.
* Van der Waals forces: Weak, transient interactions between all atoms.
* Quaternary Structure: Applies only to proteins composed of two or more polypeptide chains (subunits). It describes the spatial arrangement of these subunits and the interactions between them. The interactions are similar to those in tertiary structure (hydrophobic, ionic, hydrogen bonds, disulfide bonds). Example: Hemoglobin, which has four subunits.
* Denaturation of Proteins:
* Denaturation is the process by which a protein loses its specific three-dimensional structure (secondary, tertiary, and quaternary) without breaking the primary peptide bonds.
* This loss of native structure typically leads to a loss of biological activity.
* Causes of Denaturation:
* Heat: Increases kinetic energy, disrupting weak non-covalent bonds.
* Extreme pH changes: Alters the ionization states of R-groups, disrupting ionic bonds and hydrogen bonds.
* Organic solvents: Interferes with hydrophobic interactions.
* Heavy metal ions: Bind to sulfhydryl groups and disrupt disulfide bonds.
* Detergents: Disrupt hydrophobic interactions.
* Mechanical agitation: Can unfold proteins.
* Renaturation: In some cases, if the denaturing agent is removed, a denatured protein can refold into its native, active conformation, demonstrating that the primary sequence contains all the information needed for proper folding.
* Classification of Proteins: Proteins can be classified based on various criteria:
* Based on Shape:
* Fibrous Proteins: Elongated, insoluble in water, typically structural roles. Examples: Collagen (connective tissue), Keratin (hair, nails), Myosin (muscle).
* Globular Proteins: Compact, spherical, generally soluble in water, typically functional roles (enzymes, transporters). Examples: Hemoglobin, Enzymes (e.g., Amylase), Antibodies.
* Based on Composition:
* Simple Proteins: Composed entirely of amino acid residues. Example: Albumin.
* Conjugated Proteins: Composed of amino acid residues and one or more non-amino acid components called prosthetic groups.
* Nucleoproteins: Protein + nucleic acid (e.g., Ribosomes).
* Glycoproteins: Protein + carbohydrate (e.g., Antibodies, cell surface receptors).
* Lipoproteins: Protein + lipid (e.g., HDL, LDL).
* Metalloproteins: Protein + metal ion (e.g., Hemoglobin - iron, Carbonic anhydrase - zinc).
* Phosphoproteins: Protein + phosphate group (e.g., Casein).
* Chromoproteins: Protein + pigment (e.g., Hemoglobin, Cytochromes).
* Based on Function:
* Catalytic Proteins (Enzymes): Accelerate biochemical reactions. Example: Amylase, DNA polymerase.
* Transport Proteins: Bind and carry specific molecules. Example: Hemoglobin (oxygen), Albumin (fatty acids).
* Storage Proteins: Store nutrients or ions. Example: Ferritin (iron), Ovalbumin (egg white).
* Structural Proteins: Provide support and shape. Example: Collagen, Keratin.
* Hormonal Proteins: Act as chemical messengers. Example: Insulin, Growth hormone.
* Protective/Defense Proteins: Involved in immune response or protection. Example: Antibodies (immunoglobulins), Fibrinogen (blood clotting).
* Contractile/Motile Proteins: Involved in movement. Example: Actin, Myosin.
* Regulatory Proteins: Control gene expression or cell processes. Example: Transcription factors.
KEY DEFINITIONS AND TERMS
* Amino Acid: An organic molecule containing both an amino group ($-NH_2$) and a carboxyl group ($-COOH$), typically attached to the same $\alpha$-carbon, along with a unique side chain (R-group). They are the monomeric units of peptides and proteins.
* Zwitterion: A dipolar ion that has both a positive and a negative charge, but a net charge of zero. Amino acids exist as zwitterions at physiological pH, with a protonated amino group ($-NH_3^+$) and a deprotonated carboxyl group ($-COO^-$).
* Isoelectric Point (pI): The specific pH at which a molecule (like an amino acid, peptide, or protein) has a net electrical charge of zero. At this pH, the molecule is least soluble and does not migrate in an electric field.
* Peptide Bond: A covalent amide linkage formed by a condensation reaction between the carboxyl group of one amino acid and the amino group of another, with the elimination of a water molecule. It forms the backbone of peptides and proteins.
* N-terminal (Amino-terminal) End: The end of a polypeptide chain that has a free amino group ($-NH_3^+$). By convention, peptide sequences are written from the N-terminal to the C-terminal end.
* C-terminal (Carboxyl-terminal) End: The end of a polypeptide chain that has a free carboxyl group ($-COO^-$).
* Primary Structure: The linear sequence of amino acids in a polypeptide chain, determined by covalent peptide bonds. It is the most fundamental level of protein structure.
* Secondary Structure: Localized, regular folding patterns of the polypeptide backbone, primarily stabilized by hydrogen bonds between backbone atoms. Common examples include $\alpha$-helices and $\beta$-sheets.
* Tertiary Structure: The overall three-dimensional shape of a single polypeptide chain, resulting from the folding of secondary structures and interactions (hydrophobic, ionic, hydrogen bonds, disulfide bonds) between amino acid R-groups.
* Quaternary Structure: The spatial arrangement of multiple polypeptide subunits (if present) in a multi-subunit protein, and the interactions between these subunits.
* Denaturation: The process by which a protein loses its native three-dimensional structure (secondary, tertiary, and quaternary) and consequently its biological activity, without breaking the primary peptide bonds. It is typically caused by heat, extreme pH, or chemical agents.
* Prosthetic Group: A non-amino acid component that is tightly bound to a conjugated protein and is essential for its biological activity. Examples include metal ions, carbohydrates, lipids, or heme groups.
IMPORTANT EXAMPLES AND APPLICATIONS
- Glycine (Gly): The simplest amino acid, unique because its R-group is a hydrogen atom, making it achiral. Its small size allows it to fit into tight spaces in protein structures, contributing to flexibility.
- Cysteine (Cys): An amino acid with a thiol (sulfhydryl, -SH) group in its side chain. Two cysteine residues can form a covalent disulfide bond ($-S-S-$) through oxidation, which is crucial for stabilizing the tertiary and quaternary structures of many proteins, particularly extracellular proteins.
- Hemoglobin: A classic example of a protein with quaternary structure. It is a conjugated protein composed of four polypeptide subunits (two $\alpha$ and two $\beta$ chains), each containing a heme prosthetic group with an iron atom. Hemoglobin's function is to transport oxygen in the blood. Its denaturation would lead to a loss of oxygen-carrying capacity.
- Collagen: A prominent example of a fibrous protein and the most abundant protein in mammals. It provides structural support to connective tissues like skin, tendons, and bones. Its structure involves a triple helix, a unique secondary structure, and extensive cross-linking for strength.
- Insulin: A hormonal protein that regulates blood glucose levels. It is a small protein composed of two polypeptide chains linked by disulfide bonds. Its synthesis involves proteolytic cleavage of a larger precursor, demonstrating the importance of post-translational modification.
- Enzymes (e.g., Amylase, Trypsin): These are catalytic proteins that accelerate biochemical reactions. Their specific 3D structure (tertiary and sometimes quaternary) creates an active site that binds substrates and facilitates reactions. Denaturation of an enzyme leads to loss of its catalytic activity.
- Albumin: A simple, globular protein found in blood plasma. It serves as a transport protein for various molecules (e.g., fatty acids, hormones) and contributes significantly to maintaining osmotic pressure.
DETAILED SUMMARY
The provided document for CHM211 offers a thorough introduction to peptides and proteins, starting with their fundamental building blocks: amino acids. It meticulously details the general structure of $\alpha$-amino acids, highlighting the central $\alpha$-carbon, the amino and carboxyl groups, and the defining R-group. A comprehensive classification system for the 20 common amino acids is presented, categorizing them based on the polarity and charge of their R-groups (nonpolar aliphatic, aromatic, polar uncharged, positively charged, and negatively charged). This classification is crucial for understanding how amino acid properties influence protein structure and function. The document also covers the stereochemistry of amino acids, emphasizing the predominance of L-amino acids in biological proteins and the concept of chirality, with glycine being the sole exception.
A significant portion is dedicated to the acid-base properties of amino acids. It explains their amphoteric nature and their existence as zwitterions at physiological pH, where both the amino and carboxyl groups are ionized but the molecule has a net zero charge. The concept of the isoelectric point (pI) is introduced as the pH at which an amino acid or protein has no net charge, along with methods for calculating pI for different types of amino acids. The Henderson-Hasselbalch equation is provided as a tool for understanding the ionization states of these molecules.
Moving beyond individual amino acids, the document explains the formation of peptides. It describes the peptide bond as a covalent amide linkage formed via a condensation reaction between the carboxyl group of one amino acid and the amino group of another, releasing a water molecule. The rigidity and planar nature of the peptide bond due to its partial double-bond character are highlighted. The nomenclature of peptides, from the N-terminal to the C-terminal end, is also detailed.
The culmination of the document is the discussion of proteins, which are large polypeptides. It systematically breaks down the hierarchical organization of protein structure into four levels:
1. Primary structure: The linear sequence of amino acids, determined by covalent peptide bonds, which is the blueprint for all higher structures.
2. Secondary structure: Localized, regular folding patterns of the polypeptide backbone, primarily stabilized by hydrogen bonds. The two most common forms, the $\alpha$-helix and the $\beta$-sheet, are described in detail, including their characteristic hydrogen bonding patterns and R-group orientations.
3. Tertiary structure: The overall three-dimensional folding of a single polypeptide chain, resulting from interactions between the R-groups of amino acids. Various stabilizing forces are explained, including hydrophobic interactions, ionic bonds (salt bridges), hydrogen bonds, and crucially, covalent disulfide bonds formed between cysteine residues.
4. Quaternary structure: Applicable to proteins with multiple polypeptide subunits, describing their spatial arrangement and the non-covalent interactions that hold them together.
A critical concept discussed is protein denaturation, the process by which a protein loses its native 3D structure and, consequently, its biological function, without breaking the primary peptide bonds. The document lists common denaturing agents such as heat, extreme pH, organic solvents, and heavy metals, explaining how they disrupt the weak non-covalent interactions that maintain higher-order structures. The possibility of renaturation for some proteins, where the primary sequence alone dictates proper folding, is also mentioned.
Finally, the document provides a comprehensive classification of proteins based on their shape (fibrous vs. globular), composition (simple vs. conjugated, with various prosthetic groups like carbohydrates, lipids, or metal ions), and diverse biological functions (catalytic, transport, storage, structural, hormonal, protective, contractile, and regulatory). This classification underscores the vast functional diversity of proteins, all stemming from the specific arrangement and interactions of their amino acid building blocks. The summary effectively ties together the concepts, emphasizing that the primary sequence of amino acids is the ultimate determinant of a protein's complex 3D structure and its specific biological role.