Proteins are biomolecules made up of one or more amino acid chains (polypeptide chains), and their shape/structure is very important to their function.
There are four levels of protein structure: primary, secondary, tertiary, and quaternary.
The simplest level of protein structure, primary structure, is the sequence of amino acids in a polypeptide chain. For example, the hormone insulin has two polypeptide chains, A and B, shown in diagram below. Each chain has its own set of amino acids, assembled in a particular order.
The next level of protein structure, secondary structure, refers to local folded structures that form within a polypeptide due to interactions between atoms of the backbone (not R group side chains). The most common types of secondary structures are the α helix and the β pleated sheet. Both types of structures are held in shape by hydrogen bonds, which form between the carbonyl O of one amino acid and the amino H of another.
In an α helix, the carbonyl (C=O) of one amino acid is hydrogen bonded to the amino H (N-H) of an amino acid that is four down the chain. (E.g., the carbonyl of amino acid 1 would form a hydrogen bond to the N-H of amino acid 5.) This pattern of bonding pulls the polypeptide chain into a helical structure that resembles a curled ribbon. The R groups of the amino acids stick outward from the α helix, where they are free to interact.
In a β pleated sheet, two or more segments of a polypeptide chain line up next to each other, forming a sheet-like structure held together by hydrogen bonds. The hydrogen bonds form between carbonyl and amino groups of backbone, while the R groups extend above and below the plane of the sheet. The strands of a β pleated sheet may be parallel, pointing in the same direction (meaning that their N- and C-termini match up), or antiparallel, pointing in opposite directions (meaning that the N-terminus of one strand is positioned next to the C-terminus of the other).
Certain amino acids are more or less likely to be found in α-helices or β pleated sheets. For instance, the amino acid proline is sometimes called a “helix breaker” because its unusual R group (which bonds to the amino group to form a ring) creates a bend in the chain and is not compatible with helix formation. Proline is typically found in bends, unstructured regions between secondary structures. Similarly, amino acids such as tryptophan, tyrosine, and phenylalanine, which have large ring structures in their R groups, are often found in β pleated sheets, perhaps because the β pleated sheet structure provides plenty of space for the side chains.
Many proteins contain both α helices and β pleated sheets, though some contain just one type of secondary structure (or do not form either type).
The overall three-dimensional structure of a polypeptide is called its tertiary structure. The tertiary structure is primarily due to interactions between the R groups of the amino acids that make up the protein.
R group interactions that contribute to tertiary structure include hydrogen bonding, ionic bonding, dipole-dipole interactions, and London dispersion forces – basically, all types of non-covalent bonds. Also important to tertiary structure are hydrophobic interactions, in which amino acids with nonpolar, hydrophobic R groups cluster together on the inside of the protein, leaving hydrophilic amino acids on the outside to interact with surrounding water molecules.
One special type of covalent bond that can contribute to tertiary structure is the disulfide bond. Disulfide bonds are covalent linkages between the sulfur-containing side chains of cysteines, and are much stronger than the other types of bonds that contribute to tertiary structure.