The approach will allow researchers to understand the role individual genes play in normal cell growth and development, in aging, and in such diseases as cancer, said Shiri Levy, a postdoctoral fellow in UW Institute for Stem Cell and Regenerative Medicine (ISCRM) and the lead author of the paper.
"The beauty of this approach is we can safely upregulate specific genes to affect cell activity without permanently changing the genome and cause unintended mistakes," Levy said.
The project was led by Hannele Ruohola-Baker, professor of biochemistry and associate director of ISCRM. The AI-designed protein was developed at the UW Medicine Institute for Protein Design (IPD) under the leadership of David Baker, also a professor of biochemistry and head of the IPD.
The new technique controls gene activity without altering the DNA sequence of the genome by targeting chemical modifications that help package genes in our chromosomes and regulate their activity. Because these modifications occur not in, but on top of genes, they are called epigenetic, from the Greek epi "over" or "above" the genes. The chemical modifications that regulate gene activity are called epigenetic markers.
Scientists are particularly interested in epigenetic modifications because not only do they affect gene activity in normal cell function, epigenetic markers accumulate with time, contribute to aging, and can affect of the health of future generations as we can pass them on to our children.
In their work, Levy and her colleagues focused on a complex of proteins called PRC2 that silences genes by attaching a small molecule, called a methyl group, to a protein that packages genes called histones. These methyl groups must be refreshed so if PRC2 is blocked the genes it has silenced. it can be reawakened.
PRC2 is active throughout development but plays a particularly important role during the first days of life when embryonic cells differentiate into the variouscell types that will form the tissues and organs of the growing embryo. PRC2 can be blocked with chemicals, but they are imprecise, affecting PRC2 function throughout the genome. The goal of the UW researchers was to find a way to block PRC2 so that only one gene at a time would be affected.
To do this, David Baker and his colleagues use AI to create a protein that would bind to PRC2 and block a protein the PRC2 uses to modify the histones. Ruohola-Baker and Levy then fused this designed protein with a disabled version of a protein called Cas9.
Cas9 is the protein used in the gene editing process called CRISPR. Cas9 binds and uses RNA as an address-tag. The system allows scientists, by synthesizing a specific "address-tag" RNA, to bring Cas9 to a precise location in genome and therefore cut and splice genes at specific sites. In this experiment, however, the cutting function of the Cas9 protein is disabled so the genomic DNA sequence is unaltered. As a result, it's called dCas9, for "dead." However, the Cas9 function as a "vehicle" to deliver cargo to a specific location remains active. The AI-designed blocking protein was the cargo of the dCas9-RNA construct. "dCas9 is like UBER," says Levy, "It will take you anywhere on the genome you want to go. The guide RNA is like a passenger, telling the UBER where to go."
In the new paper, Levy and her colleagues show that by using this technique, they were able to block PRC2 and selectively turn on four different genes. They were also able to show they could transdifferentiate induced pluripotent stem cells to placental progenitor cells by simply turning on two genes.
"This technique allows us to avoid bombarding cells with various growth factors and gene activators and repressors to get them to differentiate," Levy said. "Instead, we can target specific sites on the gene transcription promoters' region, lift those marks and let the cell do the rest in an organic, holistic manner."
Finally, the researchers were able to show how the technique can be used to find the location of specific PRC2-controlled regulatory regions from where individual genes are activated. The location of many of these are unknown. In this case, they identified a promoter region - called a TATA box - for a gene called TBX18. Although current thinking is that these promotor regions are close to the gene, within in 30 DNA base pairs, they found for this gene the promoter region was more than 500 base pairs away.
"This was a very important finding," said Ruohola-Baker. "TATA boxes are scattered throughout the genome, and current thinking in biology is that the important TATA boxes are very close to the gene transcription site and the others don't seem to matter. The power of this tool is that it can find the critical PRC2 dependent elements, in this case TATA boxes that matter."
Epigenetic modifications decorate broad regions of the genome in normal and abnormal cells. However, the minimal functional unit for the epigenetic modification remains poorly understood, Ruohola-Baker notes, "With these two advances, AI-designed proteins and CRISPR technology, we can now find the precise epigenetic marks that are important for gene expression, learn the rules and utilize them to control cell function, drive cell differentiation and develop 21st century therapies."
Levy S, Somasundaram L, Raj IX, Ic-Mex D, Phal A, Schmidt S, Ng WI, Mar D, Decarreau J, Moss N, Alghadeer A, Honkanen H, Sarthy J, Vitanza N, Hawkins RD, Mathieu J, Wang Y, Baker D, Bomsztyk K, Ruohola-Baker H.
dCas9 fusion to computer-designed PRC2 inhibitor reveals functional TATA box in distal promoter region.
Cell Rep. 2022 Mar 1;38(9):110457. doi: 10.1016/j.celrep.2022.110457