Primary structure The primary structure of a protein, its linear amino-acid sequence, determines its native conformation. The specific amino acid residues and their position in the polypeptide chain are the determining factors for which portions of the protein fold closely together and form its three-dimensional conformation. The amino acid composition is not as important as the sequence. The essential fact of folding, however, remains that the amino acid sequence of each protein contains the information that specifies both the native structure and the pathway to attain that state. This is not to say that nearly identical amino acid sequences always fold similarly. Conformations differ based on environmental factors as well; similar proteins fold differently based on where they are found.
Secondary structure spiral formation displaying hydrogen bonding within the backbone Formation of a
secondary structure is the first step in the folding process that a protein takes to assume its native structure. Characteristic of secondary structure are the structures known as
alpha helices and
beta sheets that fold rapidly because they are stabilized by
intramolecular hydrogen bonds, as was first characterized by
Linus Pauling. Formation of intramolecular hydrogen bonds provides another important contribution to protein stability. α-helices are formed by hydrogen bonding of the
backbone to form a spiral shape (refer to figure on the right).
Quaternary structure Tertiary structure may give way to the formation of quaternary structure in some proteins, which usually involves the "assembly" or "coassembly" of subunits that have already folded; in other words, multiple polypeptide chains could interact to form a fully functional quaternary protein. The folding time scale of an isolated protein depends on its size,
contact order, and
circuit topology. Inside cells, the process of folding often begins
co-translationally, so that the
N-terminus of the protein begins to fold while the
C-terminal portion of the protein is still being
synthesized by the
ribosome; however, a protein molecule may fold spontaneously during or after
biosynthesis. While these
macromolecules may be regarded as "
folding themselves", the process also depends on the
solvent (
water or
lipid bilayer), the concentration of
salts, the
pH, the
temperature, the possible presence of cofactors and of molecular
chaperones. Proteins will have limitations on their folding abilities by the restricted bending angles or conformations that are possible. These allowable angles of protein folding are described with a two-dimensional plot known as the
Ramachandran plot, depicted with psi and phi angles of allowable rotation.
Hydrophobic effect . In the compact fold (to the right), the hydrophobic amino acids (shown as black spheres) collapse toward the center to become shielded from aqueous environment.|left Protein folding must be thermodynamically favorable within a cell in order for it to be a spontaneous reaction. Since it is known that protein folding is a spontaneous reaction, then it must assume a negative
Gibbs free energy value. Gibbs free energy in protein folding is directly related to
enthalpy and
entropy. The hydrophobic effect is the phenomenon in which the hydrophobic chains of a protein collapse into the core of the protein (away from the hydrophilic environment). An ordering of water molecules around a hydrophobic region increases order in a system and therefore contributes a negative change in entropy (less entropy in the system). The water molecules are fixed in these water cages which drives the
hydrophobic collapse, or the inward folding of the hydrophobic groups. The hydrophobic collapse introduces entropy back to the system via the breaking of the water cages which frees the ordered water molecules. The strength of hydrogen bonds depends on their environment; thus, H-bonds enveloped in a hydrophobic core contribute more than H-bonds exposed to the aqueous environment to the stability of the native state. In proteins with globular folds, hydrophobic amino acids tend to be interspersed along the primary sequence, rather than randomly distributed or clustered together. However, proteins that have recently been born
de novo, which tend to be
intrinsically disordered, show the opposite pattern of hydrophobic amino acid clustering along the primary sequence.
Chaperones Molecular chaperones are a class of proteins that aid in the correct folding of other proteins
in vivo. Chaperones exist in all cellular compartments and interact with the polypeptide chain in order to allow the native three-dimensional conformation of the protein to form; however, chaperones themselves are not included in the final structure of the protein they are assisting in. In this way, chaperones do not actually increase the rate of individual steps involved in the folding pathway toward the native structure; instead, they work by reducing possible unwanted aggregations of the polypeptide chain that might otherwise slow down the search for the proper intermediate and they provide a more efficient pathway for the polypeptide chain to assume the correct conformations. This means that the polypeptide chain could theoretically fold into its native structure without the aid of chaperones, as demonstrated by protein folding experiments conducted
in vitro; A fully denatured protein lacks both tertiary and secondary structure, and exists as a so-called
random coil. Under certain conditions some proteins can refold; however, in many cases, denaturation is irreversible. Cells sometimes protect their proteins against the denaturing influence of heat with
enzymes known as
heat shock proteins (a type of chaperone), which assist other proteins both in folding and in remaining folded.
Heat shock proteins have been found in all species examined, from
bacteria to humans, suggesting that they evolved very early and have an important function. Some proteins never fold in cells at all except with the assistance of chaperones which either isolate individual proteins so that their folding is not interrupted by interactions with other proteins or help to unfold misfolded proteins, allowing them to refold into the correct native structure. This function is crucial to prevent the risk of
precipitation into
insoluble amorphous aggregates. The external factors involved in protein denaturation or disruption of the native state include temperature, external fields (electric, magnetic), molecular crowding, and even the limitation of space (i.e. confinement), which can have a big influence on the folding of proteins. High concentrations of
solutes, extremes of
pH, mechanical forces, and the presence of chemical denaturants can contribute to protein denaturation, as well. These individual factors are categorized together as stresses. Chaperones are shown to exist in increasing concentrations during times of cellular stress and help the proper folding of emerging proteins as well as denatured or misfolded ones. which of course requires that their full complement of vital proteins and protein assemblies be stable at that temperature or above. The bacterium
E. coli is the host for
bacteriophage T4, and the phage encoded gp31 protein () appears to be structurally and functionally homologous to
E. coli chaperone protein
GroES and able to substitute for it in the assembly of bacteriophage T4
virus particles during infection. Like GroES, gp31 forms a stable complex with
GroEL chaperonin that is absolutely necessary for the folding and assembly in vivo of the bacteriophage T4 major capsid protein gp23. == Misfolded proteins ==