Molecule of the Month: Globin Evolution

The mechanisms of molecular evolution are revealed in globin sequences and structures.

Sequences of myoglobin from five different animals are compared to create a phylogenetic tree. Each molecule is colored to show differences from the human protein. Amino acids that are identical are pink, amino acids that are different but similar are lighter pink, and amino acids that are completely different are in white. The heme is shown in bright red. The tree was calculated with the online server at phylogeny.fr. PDB entries 3rgk, 1ymb, 1mbo, 1lhs, 2nrl.
Sequences of myoglobin from five different animals are compared to create a phylogenetic tree. Each molecule is colored to show differences from the human protein. Amino acids that are identical are pink, amino acids that are different but similar are lighter pink, and amino acids that are completely different are in white. The heme is shown in bright red. The tree was calculated with the online server at phylogeny.fr. PDB entries 3rgk, 1ymb, 1mbo, 1lhs, 2nrl.
Download high quality TIFF image
When Charles Darwin developed his theory of evolution, the inner workings of the cell were largely a mystery. His revolutionary insight, that populations of organisms evolve under natural selection, was based largely on his observations of plants and animals living around him. The inner mechanisms of evolution were only uncovered later, as the details of genetic information, inheritance, and mutation were revealed by scientists. Today, we have an abundance of data to explore, allowing us to see and understand the underlying molecular biology behind evolution.

Tree of Life

Ever since biologists started looking closely at living things, it was apparent that everything living on the Earth is related. Some are close relations: horses share many traits with cows. Others are more distant relatives: starfish are quite different from elephants. By carefully analyzing these similarities and differences, biologists built a tree a life, with similar organisms clustered closely together on individual branches and differing organisms scattered on distant branches. Today, the study of the genomic sequences of these organisms allows us to make this tree more quantitative. If we assume that differences in genomic sequences build up at an approximately equal rate as the organisms evolve, we can use this data to build a phylogenetic tree of life.

Globin Evolution

To build this type of tree, we need to have a protein that is essential in all the organisms we are studying, so that they all have a corresponding gene in their genome. The tree shown here uses the protein myoglobin, and is limited to vertebrates. As you can see, our closest relatives, mammals, have myoglobins that are quite similar, but reptiles and fish have had more time to diverge and their myoglobins have more changes. Trees that span a larger range of organisms, from bacteria to humans, are often built using biomolecules that are even more essential, such as the central machinery of protein synthesis or energy metabolism.

Globin Mutation

Darwinian evolution relies on diversity in a population, so that there are individuals with different traits, some of which might be beneficial and will be selected for a particular environment. This diversity is generated by mutation of the genome, producing slightly altered proteins with different functional characteristics. Many examples of these types of mutations, and their consequences, are found in the PDB archive. For instance, the myoglobin in aquatic mammals can be packed more densely than typical myoglobins, allowing it to store extra oxygen for their long dives, and the single-site mutation in human sickle-cell hemoglobin causes disease, but also may provide protection against infection by malaria.

A cluster of beta-like hemoglobin genes is found in the human genome on chromosome 11, as shown at the top from the  UCSC Genome Browser. The structures reveal that they are quite similar, with small mutations that tune the function for their tasks in embryonic, fetal, and adult life. The four globins at the bottom have different functions, as reflected in their quite different sequences. PDB entries 2hhb,  1shr, 1i3d, 1fdh, 1a9w, 3rgk, 1ut0, 1oj6,  all are colored based on differences with the hemoglobin beta-chain.
A cluster of beta-like hemoglobin genes is found in the human genome on chromosome 11, as shown at the top from the UCSC Genome Browser. The structures reveal that they are quite similar, with small mutations that tune the function for their tasks in embryonic, fetal, and adult life. The four globins at the bottom have different functions, as reflected in their quite different sequences. PDB entries 2hhb, 1shr, 1i3d, 1fdh, 1a9w, 3rgk, 1ut0, 1oj6, all are colored based on differences with the hemoglobin beta-chain.
Download high quality TIFF image

Mechanisms of Evolution

Looking closely at gene sequences and protein structures, we can also unravel some of the mechanisms of evolution. For example, you might imagine that it is difficult for a protein to evolve, since it needs to be continually active to keep the organism alive. Looking in the genome, we find that this problem is often solved through gene duplication. A copy of the gene is made, which is then free to evolve without compromising the organism. In the globins, this has happened multiple times in the distant past. There is a cluster of genes in our genome that encode several variants of the hemoglobin beta-chain. For instance, the gamma-2 and epsilon genes encode variants that subtly change the properties hemoglobin for the specialized needs of fetuses and embryos. Larger changes are seen in other globins, evolved from other gene duplication events, such as the hemoglobin alpha-chain, myoglobin, and two recently-discovered globins of unknown function.

The structure of mammoth hemoglobin has four amino acid changes (shown in blue) when compared to modern Asian elephants.
The structure of mammoth hemoglobin has four amino acid changes (shown in blue) when compared to modern Asian elephants.
Download high quality TIFF image

Reconstructing Ancient Globins

In special cases, we can also look back in history to study the evolution of life. Using DNA isolated from a mammoth bone found frozen in Siberia, researchers reconstructed the genes for for the alpha and beta chains of hemoglobin from this extinct species and used it to synthesize the protein in bacteria (PDB entry 3vrf). Mammoth hemoglobin is quite similar to the hemoglobin of modern elephants, but testing of the protein showed that the small changes make it function better in cold temperatures. The sequence also supported other studies that showed that mammoths are more closely related to Asian elephants than to African elephants.

Exploring the Structure

Sequence Conservation in the Hemoglobin Beta-Chain

Molecular biologists often look at evolutionary variation is terms of “conservation.” Regions of the protein with essential functions are very similar when we look at different organisms, whereas regions that are playing a supporting role are often quite different. Hemoglobin is shown here (PDB entry 2hhb), with one beta-chain colored by conservation, calculated using the online ConSurf Server and based on sequences from several hundred organisms. As you can see, regions involved in subunit contacts and lining the heme pocket are highly conserved (colored bright blue), but the exterior portions of the protein are not (colored white). To explore this structure in more detail, click on the image for an interactive JSmol.

Topics for Further Discussion

  1. You can use the “Pairwise Structure Alignment” tool to compare sequences and structures of proteins in the PDB archive. For example, try aligning the hemoglobin alpha and beta chains to see their similarities and differences.
  2. The “Protein Feature View” has a convenient menu for finding structures of a particular protein from different organisms. For example, take a look at the Protein Feature View for myoglobin and look for the little box in blue next to the image.

References

  1. J. F. Storz, J. C. Opazo & F. G. Hoffmann (2013) Gene duplication, genome duplication, and the functional diversification of vertebrate globins. Molecular Phylogenetics and Evolution 66, 469-478.
  2. H. Noguchi, K. L. Campbell, C. Ho, S. Unzai, S.-Y. Park & J. R. H. Tame (2012) Structures of haemoglobin from whooly mammoth in liganded and unliganded states. Acta Crystallographica D68, 1441-1449.
  3. 2nrl: E. R. Schreiter, M. M. Rodriguez, A. Weichsel, W. R. Montfort & J. Bonaventura (2007) S-nitrosylation-induced conformation change in blackfin tuna myoglobin. Journal of Biological Chemistry 282, 19773-19780.
  4. 1ut0: D. DeSanctis, S. Dewilde, A. Pesce, L. Moens, P. Ascenzi, T. Hankeln, T. Burmester & M. Bolognesi (2004) Crystal structure of cytoglobin: the fourth globin type discovered in man displays heme hexa-coordination. Journal of Molecular Biology 336, 917-927.
  5. 1shr: U. Sen, J. Dasgupta, D. Choudhury, P. Datta, A. Chakrabarti, S. B. Chakrabarty, A. Chakrabarty & J. K. Dattagupta (2004) Crystal structures of HbA2 and HbE and modeling of hemoglobin delta4: interpretation of the thermal stability and the antisickling effect of HbA2 and identification of the ferrocyanide binding site in Hb. Biochemistry 43, 12477-12488.
  6. 1oj6: A. Pesce, S. Dewilde, M. Nardini, L. Moens, P. Ascenzi, T. Hankeln, T. Burmester & M. Bolognesi (2003) Human brain neuroglobin structure reveals a distinct mode of controlling oxygen affinity. Structure 11, 1087-1095.
  7. 1i3d: R. D. Kidd, H. M. Baker, A. J. Mathews, T. Brittain & E. N. Baker (2001) Oligomerization and ligand binding in a homotetrameric hemoglobin: two high-resolution crystal structures of hemoglobin Bart’s (gamma(4)), a marker for alpha-thalassemia. Protein Science 10, 1739-1749.
  8. 1a9w: A. J. Sutherland-Smith, H. M. Baker, O. M. Hofmann, T. Brittain & E. N. Baker (1998) Crystal structure of a human embryonic haemoglobin: the carbonmonoxy form of gower II (alpha2 epsilon2) haemoglobin at 2.9 A resolution. Journal of Molecular Biology 280, 475-484.
  9. 1lhs: M. Nardini, C. Tarricone, M. Rizzi, A. Lania, A. Desideri, G. DeSanctis, M. Coletta, R. Petruzzelli, P. Ascenzi, A. Coda & M. Bolognesi (1995) Reptile heme protein structure: X-ray crystallographic study of the aquo-met and cyano-met derivatives of the loggerhead sea turtle (Caretta caretta) myoglobin at 2.0 A resolution. Journal of Molecular Biology 247, 459-465.
  10. 1ymb: S. V. Evans & G. D. Brayer (1990) High resolution study of the three-dimensional structure of horse heart metmyoglobin. Journal of Molecular Biology 213, 885-897.
  11. 3rgk: S. R. Hubbard, S. G. Lambright, S. G. Boxer & W. A. Hendrickson (1990) X-ray crystal structure of a recombinant human myoglobin mutant at 2.8 A resolution. Journal of Molecular Biology 20, 215-218.
  12. 2hhb: G. Fermi, M. F. Perutz, B. Shaanan & R. Fourme (1984) The crystal structure of human deoxyhaemoglobin at 1.74 A resolution. Journal of Molecular Biology 175, 159-174.
  13. 1mbo: S. E. Phillips (1980) Structure and refinement of oxymyoglobin at 1.6 A resolution. Journal of Molecular Biology 142, 531-554.
  14. 1fdh: J. A. Frier & M. F. Perutz (1977) Structure of human foetal deoxyhaemoglobin. Journal of Molecular Biology 112, 97-112.

February 2017, David Goodsell

http://doi.org/10.2210/rcsb_pdb/mom_2017_2
About Molecule of the Month
The RCSB PDB Molecule of the Month by David S. Goodsell (The Scripps Research Institute and the RCSB PDB) presents short accounts on selected molecules from the Protein Data Bank. Each installment includes an introduction to the structure and function of the molecule, a discussion of the relevance of the molecule to human health and welfare, and suggestions for how visitors might view these structures and access further details.More