From cell proteins to viral capsids

Origin of virusesWe have previously discussed the idea that viruses originated from selfish genetic elements such as plasmids and transposons when these nucleic acids acquired structural proteins (see A plasmid on the road to becoming a virus). I want to explore in more detail the idea that the structural proteins of  viruses likely originated from cell proteins (link to paper).

Three ideas have emerged to explain the origin of viruses: 1. viruses evolved first on Earth, before cells, and when cells evolved, the viruses became their genetic parasites; 2. viruses are cells that lost many genes and became intracellular parasites; 3. viruses are collections of genes that escaped from cells. Missing from these hypothesis is how nucleic acids became virus particles – that is, how they acquired structural proteins. It seems likely that viral structural proteins originated from cellular genes.

An analysis of the sequence an structure of major virion proteins has identified likely ancestors in cellular proteins. Following are some examples to illustrate this conclusion.

A very common motif among viral capsid proteins is called the single jelly roll, made up of eight beta strands in two four-stranded sheets. Many cell proteins have jelly role motifs, and some form 60-subunit virus-like particles in cells. The extra sequences at the N-termini of viral jelly roll capsid proteins, involved in recognizing the viral genome, likely evolved after the capture of these proteins from cells.

The core proteins of alphaviruses (think Semliki Forest virus) has structural similarity with chymotrypsin-like serine proteases. The viral core protein retains protease activity, needed for cleavage from a protein precursor.

Retroviral structural proteins also appear to have originated from cell proteins, with clear homologies with matrix, capsid, and nucleocapsid proteins. The matrix Z proteins of arenaviruses are related to cellular RING domain proteins, and the matrix proteins of some negative strand RNA viruses are related to cellular cyclophilin. There are many more examples, providing support for the hypothesis that viruses evolved on multiple instances by recruiting different cell proteins.

Given this information on the origin of viral capsid proteins, we can modify the three hypotheses for the origin of viruses into one. Self-replicating, virus like nucleic acids emerged in the pre-cellular world and from the emerged the first cells. The replicating nucleic acids entered the cells, where they replicated and became genetic parasites. At some point these genetic elements acquired structural proteins from the cells and became bona fide virus particles. As cells evolved, new viruses emerged from them.

It is important to point out that the genes do not always flow from cells to viruses. We know that viral proteins can be returned to cells, where they serve useful functions. One example is syncytin, a retroviral protein used for the construction of the mammalian placenta.

TWiV 439: The purloined envelope

Paul Bieniasz joins the TWiV team to talk about the co-option, millions of years ago, of an endogenous retrovirus envelope protein by hominid ancestors for host defense against viral infection.

 

You can find TWiV #439 at microbe.tv/twiv, or listen. below.

Click arrow to play
Download TWiV 439 (74 MB .mp3, 122 min)
Subscribe (free): iTunesRSSemail

Become a patron of TWiV!

TWiV 424: FLERVergnügen

Trudy joins the the TWiVlords to discuss new tests for detecting prions in the blood, and evidence showing that foamy retroviruses originated in the seas with their jawed vertebrate hosts at least 450 million years ago.

You can find TWiV #424 at microbe.tv/twiv, or listen below.

Click arrow to play
Download TWiV 424 (67 MB .mp3, 111 min)
Subscribe (free): iTunesRSSemail

Become a patron of TWiV!

Interview with J. Michael Bishop

A major new feature of the fourth edition of Principles of Virology is the inclusion of 26 video interviews with leading scientists who have made significant contributions to the field of virology. For the chapter on Transformation and Oncogenesis, Vincent spoke with Nobel Laureate J. Michael Biship, of the University of California, San Francisco, about his career and his work on oncogenes.

TWiV 386: The dolphins did it

TWiVDid you know that the evolution of ancient retroviruses, millions of years ago, can be traced by studying their genomes in the chromosomes of contemporary animals? Ted Diehl and Welkin Johnson join the TWiV team to tell us how they did it with mammals. All without a single wet experiment! They also join in the discussion about virus dispersal by hand dryers.

You can find TWiV #386 at microbe.tv/twiv, or listen below.

Click arrow to play
Download TWiV 386 (86 MB .mp3, 119 min)
Subscribe (free): iTunesRSSemailGoogle Play Music

TWiV 382: Everyone’s a little bit viral

TWiVOn episode #382 of the science show This Week in Virology, Nels Elde and Ed Chuong join the TWiV team to talk about their observation that regulation of the human interferon response depends on regulatory sequences that were co-opted millions of years ago from endogenous retroviruses.

You can find TWiV #382 at microbe.tv/twiv, or listen below.

Click arrow to play
Download TWiV 382 (82 MB .mp3, 114 min)
Subscribe (free): iTunesRSSemail

A huge host contribution to virus mutation rates

HIV-1 mutation rateThe high mutation rate of RNA viruses enables them to evolve in the face of different selection pressures, such as entering a new host or countering host defenses. It has always been thought that the sources of such mutations are the enzymes that copy viral RNA genomes: they make random errors which they cannot correct. Now it appears that a cell enzyme makes an even greater contribution the mutation rate of an RNA virus.

Deep sequencing was used to determine the mutation rate of HIV-1 in the blood of AIDS patients by searching for premature stop codons in open reading frames of viral RNA. Because stop codons terminate protein synthesis, they do not allow production of infectious viruses. Therefore they can be used to calculate the mutation rate in the absence of selection. The mutation rate calculated in this way, 0.000093 mutations per base per cell, was slightly higher than previously calculated from studies in cell culture.

When HIV-1 infects a cell, the enzyme reverse transcriptase converts its RNA genome to DNA, which then integrates into the host cell genome. Identification of stop codons in integrated viral DNA should provide an even better estimate of the mutation rate of reverse transcriptase, because mutations that block the production of infectious virus have not yet been removed by selection. The mutation rate calculated by this approach was 0.0041 mutations per base per cell, or one mutation every 250 bases. This mutation rate is 44 times higher than the value calculated from viral RNA in patient plasma (illustrated).

Sequencing of integrated viral DNA from many patients revealed that the vast majority of mutations leading to insertion of stop codons – 98% – were the consequence of editing by the cellular enzyme APOBEC3G. This enzyme is a deaminase that changes dC to dU in the first strand of viral DNA synthesized by reverse transcriptase. APOBEC3G constitutes an intrinsic defense against HIV-1 infection, because extensive mutation of the viral DNA reduces viral infectivity. Indeed, most integrated HIV proviruses are not infectious as a consequence of APOBEC3G-induced mutations. That infection proceeds at all is due to incorporation of the viral protein vif in the virus particles. Vif binds APOBEC3G, leading to its degradation in cells.

The mutation rate of integrated HIV-1 DNA calculated by this method is much higher than that of other RNA viruses. This high mutation rate is driven by the cellular enzyme, APOBEC3G. At least half of the mutations observed in plasma viral RNAs are also contributed by this enzyme.

It has always been thought that error-prone viral RNA polymerases are largely responsible for the high mutation rates of RNA viruses. The results of this study add a new driver of viral variation, a cellular enzyme. APOBEC enzymes are known to introduce mutations in the genomes of other viruses, including hepatitis B virus, papillomaviruses, and herpesviruses. Furthermore, the cellular adenosine deaminase enzyme can edit the genomes of RNA viruses such as measles virus, parainfluenza virus, and respiratory syncytial virus. Cellular enzymes may therefore play a much greater role in the generation of viral diversity than previously imagined.

TWiV 360: From Southeastern Michigan

On episode #360 of the science show This Week in Virology, Vincent visits the University of Michigan where he and Kathy speak with Michael, Adam, and Akira about polyomaviruses, virus evolution, and virus assembly, on the occasion of naming the department of Microbiology & Immunology a Milestones in Microbiology site.

You can find TWiV #360 at www.microbe.tv/twiv. Or you can watch the video below.

TWiV 337: Steamer

On episode #337 of the science show This Week in Virology, Vincent meets up with Michael and Steve to discuss their finding of a transmissible tumor in soft-shell clams associated with a retrovirus-like element in the clam genome.

You can find TWiV #337 at www.microbe.tv/twiv.

Retroviral influence on human embryonic development

EmbryogenesisAbout eight percent of human DNA is viral: it consists of retroviral genomes produced by infections that occurred many years ago. These endogenous retroviruses are passed from parent to child in our DNA. Some of these viral genomes are activated for a brief time during human embryogenesis, suggesting that they may play a role in development.

There are over 500,000 endogenous retroviruses in the human genome, about 20 times more than human genes. They were acquired millions of years ago after retroviral infection. In this process, viral RNA is converted to DNA, which then integrates into cell DNA. If the retroviral infection takes place in the germ line, the integrated DNA may be passed on to offspring.

The most recent human retroviral infections leading to germ line integration took place with a subgroup of human endogenous retroviruses called HERVK(HML-2). The human genome contains ~90 copies of these viral genomes, which might have infected human ancestors as recently as 200,000 years ago. HERVs do not produce infectious virus: not only is the viral genome silenced – no mRNAs are produced – but they are littered with lethal mutations that have accumulated over time.

A recent study revealed that HERVK mRNAs are produced during normal human embryogenesis. Viral RNAs were detected beginning at the 8-cell stage, through epiblast cells in preimplantation embryos, until formation of embryonic stem cells (illustrated). At this point the production of HERVK mRNA ceases. Viral capsid protein was detected in blastocysts, and electron microscopy revealed the presence of virus-like particles similar to those found in reconstructed HERVK particles. These results indicate that retroviral proteins and particles are present during human development, up until implantation.

Retroviral particles in blastocysts are accompanied by induction of synthesis of an antiviral protein, IFITM1, that is known to block infection with a variety of viruses, including influenza virus. A HERVK protein known as Rec, produced in blastocysts, binds a variety of cell mRNAs and either increases or decreases their association with ribosomes.

Is there a function for HERVK expression during human embryogenesis? The authors speculate that modulation of the ribosome-binding activities of specific cell mRNAs by the viral Rec protein could influence aspects of early development. As Rec sequences are polymorphic in humans, the effects could even extend to individuals. In addition, HERVK induction of IFITM1 might conceivably protect embryos against infection with other viruses.

The maintenance of open reading frames in HERV genomes, over many years of evolution, suggests a functional role for these elements. Evidence for such function comes from the syncytin proteins, which  are essential for placental development: the genes encoding these proteins originated from HERV glycoproteins. However, not all endogenous retroviruses are beneficial: a number of malignant diseases have been associated with HERV-K expression.