BIO101 – Protein Synthesis: Transcription and Translation

As you may know, I have been teaching BIO101 (and also the BIO102 Lab) to non-traditional students in an adult education program for about twelve years now. Every now and then I muse about it publicly on the blog (see this, this, this, this, this, this and this for a few short posts about various aspects of it – from the use of videos, to the use of a classroom blog, to the importance of Open Access so students can read primary literature). The quality of students in this program has steadily risen over the years, but I am still highly constrained with time: I have eight 4-hour meetings with the students over eight weeks. In this period I have to teach them all of biology they need for their non-science majors, plus leave enough time for each student to give a presentation (on the science of their favourite plant and animal) and for two exams. Thus I have to strip the lectures to the bare bones, and hope that those bare bones are what non-science majors really need to know: concepts rather than factoids, relationship with the rest of their lives rather than relationship with the other sciences. Thus I follow my lectures with videos and classroom discussions, and their homework consists of finding cool biology videos or articles and posting the links on the classroom blog for all to see. A couple of times I used malaria as a thread that connected all the topics – from cell biology to ecology to physiology to evolution. I think that worked well but it is hard to do. They also write a final paper on some aspect of physiology.

Another new development is that the administration has realized that most of the faculty have been with the school for many years. We are experienced, and apparently we know what we are doing. Thus they recently gave us much more freedom to design our own syllabus instead of following a pre-defined one, as long as the ultimate goals of the class remain the same. I am not exactly sure when am I teaching the BIO101 lectures again (late Fall, Spring?) but I want to start rethinking my class early. I am also worried that, since I am not actively doing research in the lab and thus not following the literature as closely, that some of the things I teach are now out-dated. Not that anyone can possibly keep up with all the advances in all the areas of Biology which is so huge, but at least big updates that affect teaching of introductory courses are stuff I need to know.

I need to catch up and upgrade my lecture notes. And what better way than crowdsource! So, over the new few weeks, I will re-post my old lecture notes (note that they are just intros – discussions and videos etc. follow them in the classroom) and will ask you to fact-check me. If I got something wrong or something is out of date, let me know (but don’t push just your own preferred hypothesis if a question is not yet settled – give me the entire controversy explanation instead). If something is glaringly missing, let me know. If something can be said in a nicer language – edit my sentences. If you are aware of cool images, articles, blog-posts, videos, podcasts, visualizations, animations, games, etc. that can be used to explain these basic concepts, let me know. And at the end, once we do this with all the lectures, let’s discuss the overall syllabus – is there a better way to organize all this material for such a fast-paced class.

Today, we continue with the cell – the basic processes of DNA transcription, RNA translation, and protein synthesis. See the previous lectures:
Biology and the Scientific Method
BIO101 – Cell Structure

Follow me under the fold:

Protein Synthesis: Transcription and TranslationHere is the third BIO101 lecture (from May 08, 2006). Again, I’d appreciate comments on the correctness as well as suggestions for improvement.


BIO101 – Bora Zivkovic – Lecture 1 – Part 3

The DNA code

DNA is a long double-stranded molecule residing inside the nucleus of every cell. It is usually tightly coiled forming chromosomes in which it is protected by proteins.

Each of the two strands of the DNA molecule is a chain of smaller molecules. Each link in the chain is composed of one sugar molecule, one phosphate molecule and one nucleotide molecule. There are four types of nucleotides (or ‘bases’) in the DNA: adenine (A), thymine (T), guanine (G) and cytosine (C). The two strands of DNA are structured in such a way that an adenine on one strand is always attached to a thymine on the other strand, and the guanine of one strand is always bound to cytosine on the other strand. Thus, the two strands of the DNA molecule are mirror-images of each other.

The exact sequence of nucleotides on a DNA strand is the genetic code. The total genetic code of all of the DNA on all the chromosomes is the genome. Each cell in the body has exactly the same chromosomes and exactly the same genome (with some exceptions we will cover later).

A gene is a small portion of the genome – a sequence of nucleotides that is expressed together and codes for a single protein (polypeptide) molecule.

Cell uses the genes to synthesize proteins. This is a two-step process. The first step is transcription in which the sequence of one gene is replicated in an RNA molecule. The second step is translation in which the RNA molecule serves as a code for the formation of an amino-acid chain (a polypeptide).


For a gene to be expressed, i.e., translated into RNA, that portion of the DNA has to be uncoiled and freed of the protective proteins. An enzyme, called DNA polymerase, reads the DNA code (the sequence of bases on one of the two strands of the DNA molecule) and builds a single-stranded chain of the RNA molecule. Again, where there is a G in DNA, there will be C in the RNA and vice versa. Instead of thymine, RNA has uracil (U). Wherever in the DNA strand there is an A, there will be a U in the RNA, and wherever there is a T on the DNA molecule, there will be an A in the RNA.

Once the whole gene (100s to 10,000s of bases in a row) is transcribed, the RNA molecule detaches. The RNA (called messenger RNA or mRNA) may be further modified by addition of more A bases at its tail, by addition of other small molecules to some of the nucleotides and by excision of some portions (introns) out of the chain. The removal of introns (the non-coding regions) and putting together the remaining segments – exons – into a single chain again, is called RNA splicing. RNA splicing allows for one gene to code for multiple related kinds of proteins, as alternative patterns of splicing may be controlled by various factors in the cell.

Unlike DNA, the mRNA molecule is capable of exiting the nucleus through the pores in the nuclear membrane. It enters the endoplasmatic reticulum and attaches itself to one of the membranes in the rough ER.


Three types of RNA are involved in the translation process: mRNA which carries the code for the gene, rRNA which aids in the formation of the ribosome, and tRNA which brings individual amino-acids to the ribosome. Translation is controlled by various enzymes that recognize specific nucleotide sequences.

The genetic code (nucleotide sequence of a gene) translates into a polypeptide (amino-acid sequence of a protein) in a 3-to-1 fashion. Three nuclotides in a row code for one amino-acid. There are a total of 20 amino-acids used to build all proteins in our bodies. Some amino-acids are coded by a single triplet code, or codon. Other amino-acids may be coded by several different RNA sequences. There is also a START sequence (coding for fMet) and a STOP sequence that does not code for any amino-acid. The genetic code is (almost) universal. Except for a few microorganisms, all of life uses the same genetic code.

When the ribosome is assembled around a molecule of mRNA, the translation begins with the reading of the first triplet. Small tRNA molecules bring in the individual amino-acids and attach them to the mRNA, as well as to each other, forming a chain of amino-acids. When a stop signal is reached, the entire complex disassociates. The ribosome, the mRNA, the tRNAs and the enzymes are then either degraded or re-used for another translational event.

Protein synthesis – post-translational modifications

Translation of the DNA/RNA code into a sequence of amino-acids is just the beginning of the process of protein synthesis.

The exact sequence of amino-acids in a polypeptide chain is the primary structure of the protein.
As different amino-acids are molecules of somewhat different shapes, sizes and electrical polarities, they react with each other. The attractive and repulsive forces between amino-acids cause the chain to fold in various ways. The three-dimensional shape of the polypeptide chain due to the chemical properties of its component amino-acids is called the secondary structure of the protein.

Enzymes called chaperonins further modify the three-dimensional structure of the protein by folding it in particular ways. The 3D structure of a protein is its most important property as the functionality of a protein depends on its shape – it can react with other molecules only if the two molecules fit into each other like a key and a lock. The 3D structure of the fully folded protein is its tertiary structure.

Prions, the causes of such diseases as Mad Cow Disease, Scabies and Kreutzfeld-Jacob disease, are proteins. The primary and secondary structure of the prion is almost identical to the normally expressed proteins in our brain cells, but the tertiary structure is different – they are folded into different shapes. When a prion enters a healthy brain cell, it is capable of denaturing (unwinding) the native protein and then reshaping it in the same shape as the prion. Thus one prion molecule makes two – those two go on and make four, those four make eight, and so on, until the whole brain is just one liquifiied spongy mass.

Another aspect of the tertiary structure of the protein is addition of small molecules to the chain. For instance, phosphate groups may be attached to the protein (giving it additional energy). Also, short chains of sugars are usually bound to the tail-end of the protein. These sugar chains serve as “ZIP-code tags” for the protein, informing carrier molecules exactly where in the cell this protein needs to be carried to (usually within vesicles that bud off the RER or the Golgi apparatus). The elements of the cytoskeleton are used as conduits (“elevators and escalators”) to shuttle proteins to where in the cell they are needed.

Many proteins are composed of more than one polypeptide chain. For instance, hemoglobin is formed by binding together four subunits. Each subunit also has a heme molecule attached to it, and an ion of iron attached to the heme (this iron is where oxygen binds to hemogolobin). This larger, more complex structure of the protein is its quaternary structure.

See animations:

Previously in this series:
Biology and the Scientific Method
BIO101 – Cell Structure


19 responses to “BIO101 – Protein Synthesis: Transcription and Translation

  1. Bora, I’m definitely a fan of using any animations possible to help convey the concept to first-timers.

    I remembered a great PBS series all about DNA.
    ( )
    The series must have been reasonably long ago, since the video offered at the site is teeny-tiny realvideo.

    But someone got a great clip of the animation that covers the transcription-translation steps. No mention of modifications in the clip, though.

  2. Oh yes, that is a beauty. I usually ask them to find clips like this for homework and post links on the classroom blogs – they tend to find dozens of cool animations.

  3. The exact sequence of nucleotides on a DNA strand is the genetic code.

    This is incorrect. The genetic code is the correspondence between codons in the DNA and amino acids. For example, AAA -> a specific amino acid (I forget which one). Usually, the code is depicted in a table where you read down, across, etc. to find the amino acid.

  4. Actually, Hainish is correct. I’m so used to understanding the meaning of the words that I probably use the term loosely as well. Without referring to genes or promoters or other intergenic DNA bits, a string a bps is called a…? I guess just a DNA “sequence”, come to think of it. There’s DNA base pairs–> genes (+ etc)–> chromosome–> genome.

    On a second note, were you planning to get into DNA replication later (guessing at a cell division portion)?

  5. Hainish is technically correct — the best kind of correct! (-:

  6. Pingback: BIO101: Cell-Cell Interactions | A Blog Around The Clock

  7. Pingback: BIO101 – From One Cell To Two: Cell Division and DNA Replication | A Blog Around The Clock

  8. Pingback: BIO101 – From Two Cells To Many: Cell Differentiation and Embryonic Development | A Blog Around The Clock

  9. Pingback: BIO101 – From Genes To Traits: How Genotype Affects Phenotype | A Blog Around The Clock

  10. Pingback: BIO101 – From Genes To Species: A Primer on Evolution | A Blog Around The Clock

  11. Pingback: BIO101 – What Creatures Do: Animal Behavior | A Blog Around The Clock

  12. Pingback: BIO101 – Organisms In Time and Space: Ecology | A Blog Around The Clock

  13. Pingback: BIO101 – Origin of Biological Diversity | A Blog Around The Clock

  14. Pingback: BIO101 – Evolution of Biological Diversity | A Blog Around The Clock

  15. Pingback: Free Biology 101 Lectures, Around the Clock… « A Natural Evolution

  16. Pingback: BIO101 – Introduction to Anatomy and Physiology | A Blog Around The Clock

  17. Pingback: BIO101 – Physiology: Regulation and Control | A Blog Around The Clock

  18. Pingback: BIO101 – Physiology: Coordinated Response | A Blog Around The Clock

  19. Pingback: Best of August 2010 | A Blog Around The Clock