Bioinformatics - Similarity, identity, and homology

Related things are often similar and similar things are often related.

Relationships are defined by common ancestry: siblings are related to and through their parents, nieces and nephews through their grandparents. On the same token different globin molecules are related: they all relate back to an ancestral heme-incorporating globin-like protein. Relationships among family members are determined by similarities: biologically through physical similarities, socially/legally by identical last names. Relationships of proteins and genes are determined by the degree of similarity among their sequences.
Everybody has their own, unique DNA sequence, just as everybody has their own, unique set of fingerprints. Any organism's DNA sequence has been provided by its forebears; parents for humans, and progenitor cells for bacteria and other single-cell organisms. Changes in DNA are introduced by two mechanisms: spontaneous mutations (infrequent) and recombination of paternal and maternal DNA during meiosis (frequent). Thus, a person's genome does usually not resemble either her father's or her mother's genome, it is combined of parts of both of these. The genomes of cells which are derived from mitotic procreation however, are almost always identical to the genome of the progenitor cell.
Homology is a term used for genes or proteins which are derived from the same ancestor. Homology cannot be expressed as fraction, either two sequences are derived from a common ancestor, i.e. homologous, or they are not. Scientists infer homology from sequence similarity.
Similarity is a measure for relatedness. 100% similarity would be identity. In order to find out whether things are similar, we frequently compare them side-by-side. Such as the two images below. Or we find some way to describe them. Such as the formula that is being used to describe the loops and whorls in fingerprints. Then, searching for matches does not require to compare images but formulas can be used to query databases.
Below are the images of regular hemoglobin and an artistically "mutated" hemoglobin molecule. Compare the two images and try to find the three defects in the right one, that would render a person's blood unable to transport oxygen if this molecule really existed.
  • Click left image.
  • To preserve the large molecule in a new browser window, hold down 'ctrl'-key and press 'n'.
  • Then click the right image.
  • Re-size screens that they fit next to each other.
  • Compare.