Protein biosynthesis
Encyclopedia
Protein biosynthesis is the process in which cells
build or manufacture protein
s. The term is sometimes used to refer only to protein translation
but more often it refers to a multi-step process, beginning with amino acid synthesis
and transcription
of nuclear
DNA
into messenger RNA
, which is then used as input for translation.
The cistron
DNA is transcribed into a variety of RNA intermediates. The last version is used as a template in synthesis of a polypeptide chain. Proteins can often be synthesized directly from genes by translating
mRNA. When a protein must be available on short notice or in large quantities, a protein precursor
is produced. A proprotein is an inactive protein containing one or more inhibitory peptides that can be activated when the inhibitory sequence is removed by proteolysis
during posttranslational modification
. A preprotein is a form that contains a signal sequence (an N-terminal signal peptide
) that specifies its insertion into or through membranes, i.e., targets them for secretion. The signal peptide is cleaved off in the endoplasmic reticulum
. Preproproteins have both sequences (inhibitory and signal) still present.
For synthesis of protein, a succession of tRNA molecules charged with appropriate amino acids have to be brought together with an mRNA molecule and matched up by base-pairing through their anti-codons with each of its successive codons. The amino acids then have to be linked together to extend the growing protein chain, and the tRNAs, relieved of their burdens, have to be released. This whole complex of processes is carried out by a giant multimolecular machine, the ribosome
, formed of two main chains of RNA, called ribosomal RNA (rRNA
), and more than 50 different proteins. This molecular juggernaut latches onto the end of an mRNA molecule and then trundles along it, capturing loaded tRNA molecules and stitching together the amino acids they carry to form a new protein chain.
Protein biosynthesis, although very similar, is different for prokaryote
s and eukaryote
s.
double helix in the genome
as template. This strand is called the template strand. Transcription can be divided into 3 stages: Initiation, Elongation, and Termination, each regulated by a large number of proteins such as transcription factor
s and coactivators that ensure that the correct gene is transcribed.
The DNA strand is read in the 3' to 5' direction and the mRNA is transcribed in the 5' to 3' direction by the RNA polymerase
.
Transcription occurs in the cell nucleus
, where the DNA is held. The DNA structure of the cell is made up of two helixes made up of sugar and phosphate held together by the bases. The sugar and the phosphate are joined together by covalent bond. The DNA is "unzipped" by the enzyme helicase
, leaving the single nucleotide chain open to be copied. RNA polymerase reads the DNA strand from 3-prime (3') end to the 5-prime (5') end, while it synthesizes a single strand of messenger RNA in the 5'-to-3' direction. The general RNA structure is very similar to the DNA structure, but in RNA the nucleotide uracil
takes the place that thymine
occupies in DNA. The single strand of mRNA leaves the nucleus through nuclear pore
s, and migrates into the cytoplasm
.
The first product of transcription differs in prokaryotic
cells from that of eukaryotic
cells, as in prokaryotic cells the product is mRNA, which needs no post-transcriptional modification, whereas, in eukaryotic cells, the first product is called primary transcript, that needs post-transcriptional modification (capping with 7-methyl-guanosine, tailing with a poly A tail) to give hnRNA (heterophil nuclear RNA). hnRNA then undergoes splicing of intron
s (noncoding parts of the gene) via spliceosome
s to produce the final mRNA.
, where the ribosome
s are located. Ribosomes are made of a small and large subunit that surround the mRNA. In translation, messenger RNA (mRNA) is decoded to produce a specific polypeptide according to the rules specified by the trinucleotide genetic code
. This uses an mRNA sequence as a template to guide the synthesis of a chain of amino acid
s that form a protein. Translation proceeds in four phases: activation, initiation, elongation, and termination (all describing the growth of the amino acid chain, or polypeptide that is the product of translation).
In activation, the correct amino acid (AA) is joined to the correct transfer RNA (tRNA). While this is not, in the technical sense, a step in translation, it is required for translation to proceed. The AA is joined by its carboxyl group to the 3' OH of the tRNA by an ester bond. When the tRNA has an amino acid linked to it, it is termed "charged".
Initiation involves the small subunit of the ribosome binding to 5' end of mRNA with the help of initiation factors (IF), other proteins that assist the process.
Elongation occurs when the next aminoacyl-tRNA (charged tRNA) in line binds to the ribosome along with GTP
and an elongation factor
.
Termination of the polypeptide happens when the A site of the ribosome faces a stop codon (UAA, UAG, or UGA). When this happens, no tRNA can recognize it, but releasing factor
can recognize nonsense codons and causes the release of the polypeptide chain.
The capacity of disabling or inhibiting translation in protein biosynthesis is used by some antibiotic
s such as anisomycin
, cycloheximide
, chloramphenicol
, tetracycline, streptomycin
, erythromycin
, puromycin
, etc.
.
During and after synthesis, polypeptide chains often fold to assume, so called, native secondary
and tertiary structure
s. This is known as protein folding
.
Cell (biology)
The cell is the basic structural and functional unit of all known living organisms. It is the smallest unit of life that is classified as a living thing, and is often called the building block of life. The Alberts text discusses how the "cellular building blocks" move to shape developing embryos....
build or manufacture protein
Protein
Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form, facilitating a biological function. A polypeptide is a single linear polymer chain of amino acids bonded together by peptide bonds between the carboxyl and amino groups of...
s. The term is sometimes used to refer only to protein translation
Translation (genetics)
In molecular biology and genetics, translation is the third stage of protein biosynthesis . In translation, messenger RNA produced by transcription is decoded by the ribosome to produce a specific amino acid chain, or polypeptide, that will later fold into an active protein...
but more often it refers to a multi-step process, beginning with amino acid synthesis
Amino acid synthesis
For the non-biological synthesis of amino acids see: Strecker amino acid synthesisAmino acid synthesis is the set of biochemical processes by which the various amino acids are produced from other compounds. The substrates for these processes are various compounds in the organism's diet or growth...
and transcription
Transcription (genetics)
Transcription is the process of creating a complementary RNA copy of a sequence of DNA. Both RNA and DNA are nucleic acids, which use base pairs of nucleotides as a complementary language that can be converted back and forth from DNA to RNA by the action of the correct enzymes...
of nuclear
Cell nucleus
In cell biology, the nucleus is a membrane-enclosed organelle found in eukaryotic cells. It contains most of the cell's genetic material, organized as multiple long linear DNA molecules in complex with a large variety of proteins, such as histones, to form chromosomes. The genes within these...
DNA
DNA
Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions used in the development and functioning of all known living organisms . The DNA segments that carry this genetic information are called genes, but other DNA sequences have structural purposes, or are involved in...
into messenger RNA
Messenger RNA
Messenger RNA is a molecule of RNA encoding a chemical "blueprint" for a protein product. mRNA is transcribed from a DNA template, and carries coding information to the sites of protein synthesis: the ribosomes. Here, the nucleic acid polymer is translated into a polymer of amino acids: a protein...
, which is then used as input for translation.
The cistron
Cistron
A cistron is a gene. The term cistron is used to emphasize that genes exhibit a specific behavior in a cis-trans test; distinct positions within a genome are cistronic when mutations at the loci exhibit the same simple Mendelian inheritance as would mutations at a single locus.For example,...
DNA is transcribed into a variety of RNA intermediates. The last version is used as a template in synthesis of a polypeptide chain. Proteins can often be synthesized directly from genes by translating
Translation (genetics)
In molecular biology and genetics, translation is the third stage of protein biosynthesis . In translation, messenger RNA produced by transcription is decoded by the ribosome to produce a specific amino acid chain, or polypeptide, that will later fold into an active protein...
mRNA. When a protein must be available on short notice or in large quantities, a protein precursor
Protein precursor
A protein precursor, also called a pro-protein or pro-peptide, is an inactive protein that can be turned into an active form by posttranslational modification. The name of the precursor for a protein is often prefixed by pro...
is produced. A proprotein is an inactive protein containing one or more inhibitory peptides that can be activated when the inhibitory sequence is removed by proteolysis
Proteolysis
Proteolysis is the directed degradation of proteins by cellular enzymes called proteases or by intramolecular digestion.-Purposes:Proteolysis is used by the cell for several purposes...
during posttranslational modification
Posttranslational modification
Posttranslational modification is the chemical modification of a protein after its translation. It is one of the later steps in protein biosynthesis, and thus gene expression, for many proteins....
. A preprotein is a form that contains a signal sequence (an N-terminal signal peptide
Signal peptide
A signal peptide is a short peptide chain that directs the transport of a protein.Signal peptides may also be called targeting signals, signal sequences, transit peptides, or localization signals....
) that specifies its insertion into or through membranes, i.e., targets them for secretion. The signal peptide is cleaved off in the endoplasmic reticulum
Endoplasmic reticulum
The endoplasmic reticulum is an organelle of cells in eukaryotic organisms that forms an interconnected network of tubules, vesicles, and cisternae...
. Preproproteins have both sequences (inhibitory and signal) still present.
For synthesis of protein, a succession of tRNA molecules charged with appropriate amino acids have to be brought together with an mRNA molecule and matched up by base-pairing through their anti-codons with each of its successive codons. The amino acids then have to be linked together to extend the growing protein chain, and the tRNAs, relieved of their burdens, have to be released. This whole complex of processes is carried out by a giant multimolecular machine, the ribosome
Ribosome
A ribosome is a component of cells that assembles the twenty specific amino acid molecules to form the particular protein molecule determined by the nucleotide sequence of an RNA molecule....
, formed of two main chains of RNA, called ribosomal RNA (rRNA
Ribosomal RNA
Ribosomal ribonucleic acid is the RNA component of the ribosome, the enzyme that is the site of protein synthesis in all living cells. Ribosomal RNA provides a mechanism for decoding mRNA into amino acids and interacts with tRNAs during translation by providing peptidyl transferase activity...
), and more than 50 different proteins. This molecular juggernaut latches onto the end of an mRNA molecule and then trundles along it, capturing loaded tRNA molecules and stitching together the amino acids they carry to form a new protein chain.
Protein biosynthesis, although very similar, is different for prokaryote
Prokaryote
The prokaryotes are a group of organisms that lack a cell nucleus , or any other membrane-bound organelles. The organisms that have a cell nucleus are called eukaryotes. Most prokaryotes are unicellular, but a few such as myxobacteria have multicellular stages in their life cycles...
s and eukaryote
Eukaryote
A eukaryote is an organism whose cells contain complex structures enclosed within membranes. Eukaryotes may more formally be referred to as the taxon Eukarya or Eukaryota. The defining membrane-bound structure that sets eukaryotic cells apart from prokaryotic cells is the nucleus, or nuclear...
s.
Transcription
In transcription an mRNA chain is generated, with one strand of the DNADNA
Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions used in the development and functioning of all known living organisms . The DNA segments that carry this genetic information are called genes, but other DNA sequences have structural purposes, or are involved in...
double helix in the genome
Genome
In modern molecular biology and genetics, the genome is the entirety of an organism's hereditary information. It is encoded either in DNA or, for many types of virus, in RNA. The genome includes both the genes and the non-coding sequences of the DNA/RNA....
as template. This strand is called the template strand. Transcription can be divided into 3 stages: Initiation, Elongation, and Termination, each regulated by a large number of proteins such as transcription factor
Transcription factor
In molecular biology and genetics, a transcription factor is a protein that binds to specific DNA sequences, thereby controlling the flow of genetic information from DNA to mRNA...
s and coactivators that ensure that the correct gene is transcribed.
The DNA strand is read in the 3' to 5' direction and the mRNA is transcribed in the 5' to 3' direction by the RNA polymerase
RNA polymerase
RNA polymerase is an enzyme that produces RNA. In cells, RNAP is needed for constructing RNA chains from DNA genes as templates, a process called transcription. RNA polymerase enzymes are essential to life and are found in all organisms and many viruses...
.
Transcription occurs in the cell nucleus
Cell nucleus
In cell biology, the nucleus is a membrane-enclosed organelle found in eukaryotic cells. It contains most of the cell's genetic material, organized as multiple long linear DNA molecules in complex with a large variety of proteins, such as histones, to form chromosomes. The genes within these...
, where the DNA is held. The DNA structure of the cell is made up of two helixes made up of sugar and phosphate held together by the bases. The sugar and the phosphate are joined together by covalent bond. The DNA is "unzipped" by the enzyme helicase
Helicase
Helicases are a class of enzymes vital to all living organisms. They are motor proteins that move directionally along a nucleic acid phosphodiester backbone, separating two annealed nucleic acid strands using energy derived from ATP hydrolysis.-Function:Many cellular processes Helicases are a...
, leaving the single nucleotide chain open to be copied. RNA polymerase reads the DNA strand from 3-prime (3') end to the 5-prime (5') end, while it synthesizes a single strand of messenger RNA in the 5'-to-3' direction. The general RNA structure is very similar to the DNA structure, but in RNA the nucleotide uracil
Uracil
Uracil is one of the four nucleobases in the nucleic acid of RNA that are represented by the letters A, G, C and U. The others are adenine, cytosine, and guanine. In RNA, uracil binds to adenine via two hydrogen bonds. In DNA, the uracil nucleobase is replaced by thymine.Uracil is a common and...
takes the place that thymine
Thymine
Thymine is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine nucleobase. As the name suggests, thymine may be derived by methylation of uracil at...
occupies in DNA. The single strand of mRNA leaves the nucleus through nuclear pore
Nuclear pore
Nuclear pores are large protein complexes that cross the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are about on average 2000 nuclear pore complexes in the nuclear envelope of a vertebrate cell, but it varies depending on cell type and the stage in...
s, and migrates into the cytoplasm
Cytoplasm
The cytoplasm is a small gel-like substance residing between the cell membrane holding all the cell's internal sub-structures , except for the nucleus. All the contents of the cells of prokaryote organisms are contained within the cytoplasm...
.
The first product of transcription differs in prokaryotic
Prokaryote
The prokaryotes are a group of organisms that lack a cell nucleus , or any other membrane-bound organelles. The organisms that have a cell nucleus are called eukaryotes. Most prokaryotes are unicellular, but a few such as myxobacteria have multicellular stages in their life cycles...
cells from that of eukaryotic
Eukaryote
A eukaryote is an organism whose cells contain complex structures enclosed within membranes. Eukaryotes may more formally be referred to as the taxon Eukarya or Eukaryota. The defining membrane-bound structure that sets eukaryotic cells apart from prokaryotic cells is the nucleus, or nuclear...
cells, as in prokaryotic cells the product is mRNA, which needs no post-transcriptional modification, whereas, in eukaryotic cells, the first product is called primary transcript, that needs post-transcriptional modification (capping with 7-methyl-guanosine, tailing with a poly A tail) to give hnRNA (heterophil nuclear RNA). hnRNA then undergoes splicing of intron
Intron
An intron is any nucleotide sequence within a gene that is removed by RNA splicing to generate the final mature RNA product of a gene. The term intron refers to both the DNA sequence within a gene, and the corresponding sequence in RNA transcripts. Sequences that are joined together in the final...
s (noncoding parts of the gene) via spliceosome
Spliceosome
A spliceosome is a complex of snRNA and protein subunits that removes introns from a transcribed pre-mRNA segment. This process is generally referred to as splicing.-Composition:...
s to produce the final mRNA.
Translation
The synthesis of proteins is known as translation. Translation occurs in the cytoplasmCytoplasm
The cytoplasm is a small gel-like substance residing between the cell membrane holding all the cell's internal sub-structures , except for the nucleus. All the contents of the cells of prokaryote organisms are contained within the cytoplasm...
, where the ribosome
Ribosome
A ribosome is a component of cells that assembles the twenty specific amino acid molecules to form the particular protein molecule determined by the nucleotide sequence of an RNA molecule....
s are located. Ribosomes are made of a small and large subunit that surround the mRNA. In translation, messenger RNA (mRNA) is decoded to produce a specific polypeptide according to the rules specified by the trinucleotide genetic code
Genetic code
The genetic code is the set of rules by which information encoded in genetic material is translated into proteins by living cells....
. This uses an mRNA sequence as a template to guide the synthesis of a chain of amino acid
Amino acid
Amino acids are molecules containing an amine group, a carboxylic acid group and a side-chain that varies between different amino acids. The key elements of an amino acid are carbon, hydrogen, oxygen, and nitrogen...
s that form a protein. Translation proceeds in four phases: activation, initiation, elongation, and termination (all describing the growth of the amino acid chain, or polypeptide that is the product of translation).
In activation, the correct amino acid (AA) is joined to the correct transfer RNA (tRNA). While this is not, in the technical sense, a step in translation, it is required for translation to proceed. The AA is joined by its carboxyl group to the 3' OH of the tRNA by an ester bond. When the tRNA has an amino acid linked to it, it is termed "charged".
Initiation involves the small subunit of the ribosome binding to 5' end of mRNA with the help of initiation factors (IF), other proteins that assist the process.
Elongation occurs when the next aminoacyl-tRNA (charged tRNA) in line binds to the ribosome along with GTP
Guanosine triphosphate
Guanosine-5'-triphosphate is a purine nucleoside triphosphate. It can act as a substrate for the synthesis of RNA during the transcription process...
and an elongation factor
Elongation factor
Elongation factors are a set of proteins that facilitate the events of translational elongation, the steps in protein synthesis from the formation of the first peptide bond to the formation of the last one.Elongation is the most rapid step in translation:...
.
Termination of the polypeptide happens when the A site of the ribosome faces a stop codon (UAA, UAG, or UGA). When this happens, no tRNA can recognize it, but releasing factor
Release factor
A release factor is a protein that allows for the termination of translation by recognizing the termination codon or stop codon in a mRNA sequence....
can recognize nonsense codons and causes the release of the polypeptide chain.
The capacity of disabling or inhibiting translation in protein biosynthesis is used by some antibiotic
Antibiotic
An antibacterial is a compound or substance that kills or slows down the growth of bacteria.The term is often used synonymously with the term antibiotic; today, however, with increased knowledge of the causative agents of various infectious diseases, antibiotic has come to denote a broader range of...
s such as anisomycin
Anisomycin
Anisomycin, also known as flagecidin is an antibiotic produced by Streptomyces griseolus which inhibits protein synthesis. Partial inhibition of DNA synthesis occurs at anisomycin concentrations that effect 95% inhibition of protein synthesis...
, cycloheximide
Cycloheximide
Cycloheximide is an inhibitor of protein biosynthesis in eukaryotic organisms, produced by the bacterium Streptomyces griseus. Cycloheximide exerts its effect by interfering with the translocation step in protein synthesis thus blocking translational elongation...
, chloramphenicol
Chloramphenicol
Chloramphenicol is a bacteriostatic antimicrobial that became available in 1949. It is considered a prototypical broad-spectrum antibiotic, alongside the tetracyclines, and as it is both cheap and easy to manufacture it is frequently found as a drug of choice in the third world.Chloramphenicol is...
, tetracycline, streptomycin
Streptomycin
Streptomycin is an antibiotic drug, the first of a class of drugs called aminoglycosides to be discovered, and was the first antibiotic remedy for tuberculosis. It is derived from the actinobacterium Streptomyces griseus. Streptomycin is a bactericidal antibiotic. Streptomycin cannot be given...
, erythromycin
Erythromycin
Erythromycin is a macrolide antibiotic that has an antimicrobial spectrum similar to or slightly wider than that of penicillin, and is often used for people who have an allergy to penicillins. For respiratory tract infections, it has better coverage of atypical organisms, including mycoplasma and...
, puromycin
Puromycin
Puromycin is an antibiotic that is a protein synthesis inhibitor by inhibiting translation.-Inhibition of translation:Puromycin is an aminonucleoside antibiotic, derived from the Streptomyces alboniger bacterium, that causes premature chain termination during translation taking place in the...
, etc.
Events following protein translation
The events following biosynthesis include post-translational modification and protein foldingProtein folding
Protein folding is the process by which a protein structure assumes its functional shape or conformation. It is the physical process by which a polypeptide folds into its characteristic and functional three-dimensional structure from random coil....
.
During and after synthesis, polypeptide chains often fold to assume, so called, native secondary
Secondary structure
In biochemistry and structural biology, secondary structure is the general three-dimensional form of local segments of biopolymers such as proteins and nucleic acids...
and tertiary structure
Tertiary structure
In biochemistry and molecular biology, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates.-Relationship to primary structure:...
s. This is known as protein folding
Protein folding
Protein folding is the process by which a protein structure assumes its functional shape or conformation. It is the physical process by which a polypeptide folds into its characteristic and functional three-dimensional structure from random coil....
.
See also
- CistronCistronA cistron is a gene. The term cistron is used to emphasize that genes exhibit a specific behavior in a cis-trans test; distinct positions within a genome are cistronic when mutations at the loci exhibit the same simple Mendelian inheritance as would mutations at a single locus.For example,...
- Genetic codeGenetic codeThe genetic code is the set of rules by which information encoded in genetic material is translated into proteins by living cells....
- lac operonLac operonThe lac operon is an operon required for the transport and metabolism of lactose in Escherichia coli and some other enteric bacteria. It consists of three adjacent structural genes, lacZ, lacY and lacA. The lac operon is regulated by several factors including the availability of glucose and of...
- OperonOperonIn genetics, an operon is a functioning unit of genomic DNA containing a cluster of genes under the control of a single regulatory signal or promoter. The genes are transcribed together into an mRNA strand and either translated together in the cytoplasm, or undergo trans-splicing to create...
- Peptide synthesisPeptide synthesisIn organic chemistry, peptide synthesis is the production of peptides, which are organic compounds in which multiple amino acids are linked via amide bonds which are also known as peptide bonds...
- Central dogma of molecular biologyCentral dogma of molecular biologyThe central dogma of molecular biology was first articulated by Francis Crick in 1958 and re-stated in a Nature paper published in 1970:In other words, the process of producing proteins is irreversible: a protein cannot be used to create DNA....
External links
- Translation of mRNA Section of The Cell: A Molecular Approach by Geoffrey M. Cooper
- Science aid:Protein synthesis For high school
- Protein Synthesis
- Transcription
- Protein Synthesis with emphasis on ribosomes and DNA Ribosomes and DNA
- Protein Synthesis Animation Wesleyan University Learning Objects animation of protein synthesis.
- Interactive Java simulation of transcription initiation. From Center for Models of Life at the Niels Bohr Institute.
- From RNA to Protein Synthesis hypervideo.