A team at the University of Oklahoma will search for uniquely human gene families and their sources. Zoologist Tom Ray, author of the artificial life program Tierra, will lead the team that conducts the research. The study seeks to discover whether Darwinism or strong panspermia accounts for the genes that make us human. Here is Ray's research plan, adopted 18 November 2001. (What'sNEW updates are posted by Klyce.)

Human Genome Search
by Tom Ray, University of Oklahoma

Correspondence, 2001-2005, during the establishment and pursuit of this research project.
The objective of the research is to use bioinformatic analysis of genomic data to address the question "Is sustained macroevolutionary progress possible in a closed system?" In the context of genomics, we believe that we can gain insights into this question by studying the origin and evolution of gene families.

By definition, "species" of higher organisms (especially Animalia) are considered to be genetically closed systems. Conventional Darwinian thinking is that most or all gene families arise through gene duplication and divergence. Strong panspermia suggests that all or most gene families enter a species lineage by horizontal transfer from other species, with these new genes ultimately arriving from outer space.

The primary target of the current phase of the research is a quantitative estimate of the relative proportions of new genes and gene families arising by duplication/divergence and by horizontal transfer. If this work demonstrates a significant proportion of horizontal transfer, we will publish this result and acknowledge that it is surprising to Darwinians, but predicted by the strong version of panspermia. Then we will develop a new phase that considers all possible sources for the genes.
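As a rough illustration of what such a quantitative estimate might look like, the Python sketch below assumes that each new gene family has already been assigned an origin label by some upstream analysis. The labels, function names, and the choice of a Wilson score interval are assumptions made for this example and are not part of the research plan.

```python
import math
from collections import Counter


def origin_proportions(labels):
    """Return the fraction of gene families assigned to each origin label."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no classified gene families")
    return {origin: n / total for origin, n in counts.items()}


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is about 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z ** 2 / total
    centre = (p + z ** 2 / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z ** 2 / (4 * total ** 2)) / denom
    return centre - half, centre + half


# Hypothetical labels, for illustration only (not real results).
example_labels = ["duplication", "duplication", "duplication", "horizontal"]
proportions = origin_proportions(example_labels)
low, high = wilson_interval(example_labels.count("horizontal"), len(example_labels))
```

With only a handful of classified families the interval is very wide, so a claim of a "significant proportion" would need a much larger sample than this toy input.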

The study of the evolution of gene families will begin with a comparison of the human and mouse genomes, to identify a set of genes and gene families present in humans, but absent in mice. This will be followed by a targeted search for these gene families in every available genome between the human and mouse. This will give us the first clear picture of the process by which new gene families emerge and evolve.
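Once genes have been grouped into families, the human-versus-mouse comparison reduces to a set difference. The minimal sketch below assumes a hypothetical input format (a mapping from gene identifier to family identifier for each genome) and hypothetical identifiers; the grouping of genes into families, which is the difficult part, is assumed to have been done beforehand.

```python
def families_unique_to(target, reference):
    """Return families present in `target` but absent from `reference`.

    Both arguments map gene identifiers to family identifiers.
    The result maps each unique family to the list of its target genes.
    """
    reference_families = set(reference.values())
    unique = {}
    for gene, family in target.items():
        if family not in reference_families:
            unique.setdefault(family, []).append(gene)
    return unique


# Hypothetical identifiers, for illustration only.
human = {"hGeneA": "fam1", "hGeneB": "fam2", "hGeneC": "fam2"}
mouse = {"mGeneA": "fam1"}
human_only = families_unique_to(human, mouse)
```

A family reported here is only a candidate: its absence from the reference set may reflect an incomplete genome assembly or annotation rather than a true absence, which is one reason the targeted search in other genomes matters.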

Currently, there is no other genome available between the human and mouse, but we expect many to be completed in the coming years, with the greatest concentration of available genomes among the primates, which should be ideal for this study.

However, to avoid having to wait for full genomes to become available, we will attempt to raise funds for a targeted sequencing of the candidate gene families (from the human-mouse comparison), in every species between human and mouse for which BACs are available.

In spite of the limited availability of genetic data, we believe that there is certainly enough data available to begin work and to maintain momentum for a year or two. If we find ourselves in the position of having to wait for data, we would reduce or suspend the funding until the relevant data become available.

In the first year, we anticipate the following activities:

  1. The continued training of Tom Ray in the use of bioinformatic techniques
  2. The selection, purchase, and assembly of a new Linux cluster for bioinformatic analysis, and the development of the skills needed to administer it
  3. Development of analytical protocols for addressing the questions of this study
    • a) How to define and recognize "gene families"
    • b) How to compare human and mouse genomes to recognize gene families present in humans but not mice
    • c) How to target the search for these gene families in the intermediate genomes as they become available
    • d) How to detect the emergence of new genes and gene families by duplication and divergence
    • e) How to detect the emergence of new genes and gene families by horizontal transfer
    • f) How to clearly discriminate between duplication/divergence and horizontal transfer
  4. Application of the protocols to address the questions of this study
    • a) Comparison of mouse and human genomes to identify genes and gene families present in humans but absent from mice
    • b) Attempt to locate genes in the human genome which arose recently through duplication and divergence
    • c) Attempt to locate genes in the human genome which arose recently through horizontal transfer
    • d) Steps b and c can be repeated for any higher organism whose genome has been completed (e.g., mouse, Drosophila)
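The protocols in item 3 are left open in the plan. Purely as an illustration of what a rule for discriminating the two origins could look like, the sketch below compares a gene's best similarity score within its own genome against its best scores in closely and distantly related species. The score scale, thresholds, class names, and field names are all hypothetical, and this heuristic is not the study's protocol.

```python
from dataclasses import dataclass


@dataclass
class HitScores:
    """Best similarity scores for one gene (hypothetical, unitless scale)."""
    paralog: float         # best hit to another gene in the same genome
    close_relative: float  # best hit in closely related species
    distant: float         # best hit in distantly related species


def classify_origin(scores, min_score=50.0, margin=20.0):
    """Label a gene by its most likely origin using illustrative thresholds."""
    best_expected = max(scores.paralog, scores.close_relative)
    # A strong distant hit that clearly beats every expected relative
    # is flagged as a candidate horizontal transfer.
    if scores.distant >= min_score and scores.distant - best_expected >= margin:
        return "candidate_horizontal_transfer"
    # A strong hit within the same genome suggests duplication and divergence.
    if scores.paralog >= min_score:
        return "candidate_duplication"
    return "unresolved"


# Hypothetical scores, for illustration only.
labels = [
    classify_origin(HitScores(paralog=80.0, close_relative=75.0, distant=10.0)),
    classify_origin(HitScores(paralog=10.0, close_relative=5.0, distant=90.0)),
    classify_origin(HitScores(paralog=10.0, close_relative=10.0, distant=10.0)),
]
```

Any gene flagged by a rule of this kind would still need independent checks, since contamination of the sequence data or gene loss in related lineages can produce the same pattern.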

While we cannot make commitments for others, we hope that in the first year a collaborative proposal can be developed and submitted for targeted sequencing of the relevant gene families in many species.

We expect that all or most of the work described above can be started in the first year. However, it is notoriously difficult to predict research progress. Proposals tend to be ambitious, and research tends to take longer than anticipated.

If the research should reveal a significant frequency of horizontal gene transfer among higher organisms, we anticipate that Tom Ray and Brig Klyce will coauthor a publication reporting this result.

