RESEARCH: PROTEIN-SIMULATION-DUD-E
FOLDING PROJECT #12274 PROFILE

PROJECT TEAM

Manager(s): Louis Smith
Institution: University of Pennsylvania

WORK UNIT INFO

Atoms: 133,591
Core: 0x23
Status: Public

TLDR; PROJECT SUMMARY AI BETA

This project simulates how proteins interact with drugs using powerful computer models. They're focusing on a protein called Acetylcholinesterase, which is important for brain function and is a target for both pesticides and medicines. By simulating these interactions, they hope to better understand how drugs work and develop new ones more efficiently.

Note: This TLDR is a simplication and may not be 100% accurate.

OFFICAL PROJECT DESCRIPTION

In this series of projects we are simulating proteins that are part of the DUD-E benchmark data set for protein-ligand interactions, using simulations initialized from Alpha Fold. Simulation methods to study protein-small molecule interactions are of critical importance to the early stages of drug discovery, but most methods have a poor balance of accuracy relative to cost.

Much of the development process for new compounds happens via screening large libraries of compounds for activity against target proteins believed to be relevant for a disease.

Lending focus to this search makes developing new molecules into drugs more economical and faster. In order to do this kind of methods development, good reference data that is widely available is essential.

A classic dataset for benchmarking structural methods attempting to predict protein-ligand interactions known as DUD-E has been widely used because it has diverse proteins, and each protein is bound to a fairly large collection (usually more than fifty) of small molecules for which the ability to bind the receptor have been measured experimentally.

Using Folding@Home, we will create large reference quality simulations of these proteins.

Because we know how such simulations, and the binding methods we or others may test on them, should look and function we have a great yardstick for improving the methods we have and developing new ones. In this project series we have the following systems, many of which are known for their medical relevance in addition to having been extensively studied with both simulation and experiment in the past. 12234 - ACES: Acetylcholinesterase that is critical to nervous system function in animals.

It is the target of pesticides, and also numerous drugs.

If targeted in the correct way, it can reduce neural swelling.

This sequence happens to be from the Pacific Electric ray, Torpedo Californica, which was a landmark discovery in biomedical efforts to isolate neurotransmitter receptors and led to a mechanistic understanding of myasthenia gravis. 12235 - AKT2: serine-threonine kinase taking part in the insulin signal transduction pathway.

Implicated in some cancers, it has been a target of drug development campaigns in the past. 12236 - AMPC: A critical antibiotic resistance gene, it is a beta lactamase capable of opening the critical structural feature of celphalosporin-type antibiotics, rendering them ineffective. 12237 - BACE1: Beta secretase 1, an aspartic acid protease that helps form myelin sheaths in neurons.

It is the major generator of amyloid-beta peptides in neurons, and therefore is implicated in Alzheimer's disease. 12238 - BRAF: B-raf is involved in sending signals involved in cell growth, and as such is considered a proto-oncogen.

It is a serine/threonine kinase that has several known inhibitors, some of which are now anti-cancer medications. 12239 - CASP3: a caspase-type protease that participates in the execution of apoptosis, the process of programmed cell death.

It also acts to cleave one of the amyloid forming proteins and is therefore implicated in Alzheimer's dementia. 1240 - CDK2: one of the cyclin dependent kinases, this protein is a checkpoint kinase that signals transitions between growth and DNA synthesis phases in the cell cycle.

Dysfunction in this checkpoint is associated with cancer; inhibiting CDK2 can arrest cell cycle in cases of abmormal growth, so it has been an anti-cancer target for some time. 12241 - CSF1R: Colony stimulating factor 1 receptor, when bound by cognate ligands, will promote survival, proliferation and differentiation of many myeloid cell types.

It is thus involved in disease and is targeted in therapies for cancer, neurodegenerative diseases, nad inflammatory bone diseases. 12242 - DPP4: Dipeptidyl peptidase-4, a protein that cuts up certain other proteins on the surfaces of most cells.

Important in immune regulation, signal transduction, and apoptosis, molecules inhibiting its enzymatic activity can help treat type 2 diabetes because the peptide hormones (GLP-1, and GIP) are degraded by DPP4.

Thus, inhibiting DPP4 prolongs the effects of these hormones. 12243 - EGFR: Epidermal growth factor receptor; its deficient signaling is associated with Alzheimer's dementia, whereas its over-expression is a common characteristic of tumor cells.

It is thus an oncogene that is targeted by numerous anti-cancer molecules and drugs.

Many of these are targeted at the tyrosine kinase domain, because hampering its function prevents excessive transduction of the signals these receptors would otherwise send to the nucleus of the tumor cell. 12244 - ESR1: Estrogen Receptor Alpha is critical to many tissue differentiation processes across the body, and has been targeted by various drugs to both enhance and suppress its effects depending on associated conditions.

12245 - FA10: Coagulation factor X is an enzyme in the coagulation signaling cascade for forming blood clots.

It is a serine endopeptidase, and has been targeted by inhibitors to reduce coagulation in medical contexts where that is desirable.
12246 - FABP4: Fatty Acid Binding protein 4 is a protein that imports lipids between intra and extracellular membranes in macrophages and adipocytes.

Inhibiting it is associated with both preventing certain fat-tumor cancers and metabolic syndromes. 12247 - GRIA2: Glutamate ionotropic receptor AMPA type subunit 2 is a glutamate receptor, an essential neurotransmitter in humans.

Its pre-mRNA is A->I edited at a particular site that makes the channel impermeable to calcium.

Editing errors here can result in ALS, and some other diseases. 12248 - HSP90AA1: Heat shock protein 90kDa alpha A1 is a stress inducible protein that refolds misfolded or damaged proteins.

It is a relevant drug target because it interacts with a number of tumor promoting proteins and plays a large role in cellular adaptation to stress. 12249 - IGF1R: Insulin like growth factor 1 is an extracellular receptor with a tyrosine kinase domain.

It is critical for growth and development, and as such if overproduced can contribute to the cancer phenotype and certain other diseases.

Therefore inhibitors have been developed to target this extracellular receptor. 12250 - ITAL (LFA-1): Leukocyte adhesion cglycoprotein LFA-1 alpha is an integrin found on lymphocytes and other leukocytes.

It functions in the process of tissue emigration in lymphocytes, and in cytotoxic T-cell mediated killing of cells. 12251 - KIT: tyrosine-protein kinase KIT is a receptor tyrosine kinase and a proto-oncogene.

It senses cytokines, transducing signals that govern cell proliferation and survival.

As such it is often mutated in cancers, where its excessive activity maintains or enhances the tumor state. 12252 - MAPK2: Mitogen-activated protein kinase kinase is part of the MAPK pathway, which is famously aberrant in many types of cancers, particularly melanomas.

Inhibitors against it would slow the progression of cancer, so it has been targeted by therapies historically. 12253 - MET: tyrosine-protein kinase Met, also known as hepatocyte growth factor receptor, governs embryonic development, organogenesis and wound-healing.

Abnormal activation of MET sustains tumors by causing them to grow and become better supplied by blood vessels.

Extensive research has focused on inhibiting MET because of its correlation with poor prognosis in cancer, and many compounds are in various parts of the regulatory approval process. 12254 - MK10: MAPK-10 or mitogen-activated protein kinase 10 is associated with a wide variety of cellular processes associated with proliferatiation, differentiation, and development.

Mapk-10 is implicated in neuronal development, and when active can inhibit neuronal apoptosis.

12255 - MK14: MAPK-14 or p38-alpha is another stress and differentiation controlling kinase.

Because of its interaction with inflammatory signaling in the immune system, it is a relevant target for immune diseases and heart disease. 12256 - PPARD: Peroxisome proliferator-activated receptor delta is a nuclear hormone receptor that is implicated in the development of several classes of chronic disease.

Drugs stimulating it can act as biochemical substitutes for exercise, and decouple oxidative phosphorylation.

12257 - PPARG: Peroxisome proliferator activated receptor gamma is similar to the delta variant in some ways, but also serves as a master-regulator of fat cell differentiation.

It has been studied as a target for growth inhibition in cancer cell cultures.

It also is targeted by drugs that treat lipid metabolism disorders like hyperlipidemia and hyperglycemia, as well as for type 2 diabetes as an insulin sensitizer. 12258 - PTN1: Tyrosine-protein phosphatase non-receptor type 1 counteracts the effect of certain tyrosine kinases in protein signalling.

One of its targets is the phosphosite on the insulin receptor and several other receptor tyrosine kinases, including some from this list.

As such, it has implications for both the treatemnt of some cancers and also type 2 diabetes. 12259 - RENI: Renin is an endopeptidase that generates angiotensin 1, resulting in a blood pressure increasing signalling cascade that also causes sodium retention by the kidneys.

As such, renin inhibitors can serve to reduce blood pressure. 12260 - RXRA: Retinoid x receptor alpha is a nuclear receptor that binds retinoic acid, causing transcription of a large number of genes.

12261 - TGFR1: Transforming growth factor beta receptor 1 is a TGF-beta receptor that regulates differentiation in a number of endothelial cell types, and seems to have particular bearing on the development of reproductive tissues.

It has been targeted by studies working to develop cancer therapeutics. 12262 - THRB: Thyroid hormone receptor beta is a nuclear receptor that, when activated by thyroid hormone, initiates a large number of different genes.

Deficiencies in activity can result in thyroid hormone resistance which can cause goiter.

12263 - TRY1: Trypsin-1 is the main form of trypsinogen secreted by the pancreas.

It is an enzyme that breaks down proteins, and defective mutations of it can cause pancreatitis.

It is also a workhorse protein in modern biochemical and biophysical labs. 12264 - TRYB1: Tryptase beta-1 is a trypsin like protease that is secreted as part of Mast-cell activation.

As such it has roles in inflammation associated with asthma, and in cleaving flu's hemagglutinin surface protein (which initiates the experience of flu-like symptoms).

Attempts to produce inhibitors hve so far been difficult, but it has relevance to reducing the severity of the inflammatory response in these conditions. 12265 - VGFR2: Vascular endothelial growth factor receptor number 2 is a tyrosine kinase signaling receptor that binds the vascularization hormone, and causes tissue remodeling to form channels for blood vessel growth.

When over-active or over-expressed this protein supports the vascularization of tumor tissue, making inhibitors targeting it helpful in treating some cancers. 12267 - ABL1: ABL is a protein tyrosine kinase and a prominent proto-oncogen.

Mu
tations in this gene are involved in chronic and acute myelogenous leukemia (A/CML),
and it is the kinase in the fusion protein created by the translocation event that
causes the Philadelphia chromosome, a historically important type of hereditary canc
er risk factor that was visible from analysis of mitotic chromosomes.

Specific treat
ment of these cancers came in part from the development of tyrosine kinase inhibito
rs targeting this protein.
12268 - ANDR: The androgen receptor.

A steroid hormone activated transcription f
actor, it activates a collection of genes that are part of the secondary sex charact
eristics.

Implicated in several varieties of human disease, androgen receptor can no
tably undergo gene duplication events in cancer cells, causing aberrant growth in th
e presence of androgen.

Newer treatments for such cancers involve blocking this rece
ptor.
12270 - FAK1: Focal adhesion kinase.

A protein tyrosine kinase involved in the signaling pathways that cause cells to adhere to one another.

Excess activity is often part of cancer phenomena, and is implicated in metastasis.

It is also cleaved in the 'caspase' cascade resulting in programmed cell death, so this activity is sometimes disrupted in cancers, particularly breast cancers and certain types of ovarian cancers.

A specific class of drugs has emerged to inhibit this molecule as anti-cancer therapies.
12271 - FKBP1: FK506 (tacrolimus) binding protein.

An enzyme that takes part in immune regulation and basic cell processes.

It is the target of immunosuppresive drugs; it may also take part in the dysfunctions in cell signalling associated with some cancers.

Originally discovered as the target of the immunosuppresant tacrolimus used in organ transplants.
12272 - GCR: Glucocorticoid receptor.

Binds cortisol and other glucocorticoid hormones.

Many medications target this receptor because of the changes in gene expression activation or repression of its behavior can induce.
12273 - GLCM: Beta glucocerebrosidase (also called acid beta glucosidase).

An enzyme catalyzing a reaction in glycolipid metabolism; some of its mutants are implicated in parkinsons disease, and Gaucher's disease, a lysozomal storage disease.
12274 - HXK4: Hexokinase IV (glucokinase).

Occurring in the liver and pancreas, and is critical for carbohydrate metabolism in mammals.

An enzyme transcribed in response to insulin signaling, it is relevant for treatment of certain types of diabetes.

Genetic variation can result in genetic diseases similar to diabetes.
12276 - LCK: Lymphocite specific protein tyrosine kinase.

Responsible for the T-cell receptor signalling cascade. 12278 - MCR: Mineralocorticoid receptor.

MR is expressed in many tissues, and does myriad things.

It is activated by aldosterone and glucocorticoids like cortisol.

Targeting MR has relevance for numerous medically relevant biologies, but has been exploited for hypertension/cardiovascular disease.
12279 - PPARA: Peroxisome proliferator-activated receptor.

A receptor activated under conditions of energy deprivation--necessary for ketogenesis, whereby energy is produced from fats while an organism fasts.
12280 - PRGR: Progesterone Receptor.

This receptor binds a key sex hormone having to do with pregnancy and estrous in mammals.

Activating or deactivating this receptor pharmaceutically has been very important for contraceptives and other reproduction-related applications.
12281 - PUR2: GAR transformylase, Phosphoribosylglycinamide formyltransferase.

This enzyme is a critical part of nucleic acid biosynthesis.

Because nucleotides are the building blocks of DNA and RNA, cells that are very metabolically or mitotically active depend on this enzyme more heavily.

As such, this protein is the target of some chemotheraputic treatments.

RELATED TERMS GLOSSARY AI BETA

Note: Glossary items are a high level summary and may not be 100% accurate.

proteins

Large biomolecules essential for various biological functions.

scientific: pharmaceutical
biotechnology / structural biology

Proteins are complex molecules found in all living organisms. They perform a wide range of vital functions, including catalyzing biochemical reactions, transporting molecules, providing structural support, and regulating cellular processes.


DUD-E

Directory of Useful Decoys for Enhanced Evaluation (DUD-E)

acronym: pharmaceutical
biotechnology / drug discovery

DUD-E is a publicly available dataset used to evaluate the performance of computational methods for predicting protein-ligand interactions. It contains diverse proteins and their known binding affinities with various small molecules.


drug discovery

The process of identifying and developing new medications.

process: pharmaceutical
biotechnology / pharmaceutical research

Drug discovery is a complex and lengthy process involving multiple stages, including target identification, lead compound screening, preclinical testing, clinical trials, and regulatory approval. The goal is to develop safe and effective treatments for various diseases.


Alpha Fold

An artificial intelligence system for predicting protein structures.

software: research
biotechnology / protein structure prediction

AlphaFold is a groundbreaking AI system developed by DeepMind that can accurately predict the three-dimensional structure of proteins from their amino acid sequences. This has revolutionized structural biology and drug discovery.


simulations

Computer-based models that mimic real-world processes.

method: research
biotechnology / molecular modeling

Simulations are used to study complex systems, such as proteins and drug interactions, by creating virtual models and observing their behavior under different conditions. This can provide insights into molecular mechanisms and aid in drug development.


Folding@Home

Distributed computing project for protein folding research.

software: research
biotechnology / distributed computing

Folding@Home is a volunteer computing project that utilizes the processing power of donated computers to perform simulations of protein folding. It has contributed significantly to our understanding of protein structure and function.


acetylcholinesterase

An enzyme that breaks down acetylcholine in the nervous system.

enzyme: pharmaceutical
pharmacology / neurotransmission

Acetylcholinesterase is an enzyme responsible for terminating nerve impulses by breaking down the neurotransmitter acetylcholine. It plays a crucial role in muscle contraction, learning, memory, and other cognitive functions.


Torpedo Californica

The scientific name for the California Electric ray.

organism: research
biology / marine life

Torpedo Californica is a species of electric ray found in the waters off the coast of California. It was used in early biomedical research due to its potent electric organs.

PROJECT FOLDING PPD AVERAGES BY GPU

Data as of Tuesday, 14 April 2026 06:34:57
Rank
Project
Model Name
Folding@Home Identifier
Make
Brand
GPU
Model
PPD
Average
Points WU
Average
WUs Day
Average
WU Time
Average
1 GeForce RTX 4090
AD102 [GeForce RTX 4090]
Nvidia AD102 22,838,361 444,562 51.37 0 hrs 28 mins
2 GeForce RTX 4080
AD103 [GeForce RTX 4080]
Nvidia AD103 15,864,055 475,931 33.33 0 hrs 43 mins
3 GeForce RTX 4070 Ti
AD104 [GeForce RTX 4070 Ti]
Nvidia AD104 13,737,097 475,081 28.92 0 hrs 50 mins
4 GeForce RTX 4070 SUPER
AD104 [GeForce RTX 4070 SUPER]
Nvidia AD104 10,548,002 483,322 21.82 1 hrs 6 mins
5 GeForce RTX 3090
GA102 [GeForce RTX 3090]
Nvidia GA102 9,607,069 466,810 20.58 1 hrs 10 mins
6 GeForce RTX 3090 Ti
GA102 [GeForce RTX 3090 Ti]
Nvidia GA102 9,113,197 458,167 19.89 1 hrs 12 mins
7 GeForce RTX 3080 Ti
GA102 [GeForce RTX 3080 Ti]
Nvidia GA102 8,829,004 451,629 19.55 1 hrs 14 mins
8 GeForce RTX 3080
GA102 [GeForce RTX 3080]
Nvidia GA102 8,322,465 445,314 18.69 1 hrs 17 mins
9 GeForce RTX 3080 Lite Hash Rate
GA102 [GeForce RTX 3080 Lite Hash Rate]
Nvidia GA102 7,993,178 441,928 18.09 1 hrs 20 mins
10 Radeon RX 7900XT/XTX/GRE
Navi 31 [Radeon RX 7900XT/XTX/GRE]
AMD Navi 31 7,561,963 322,273 23.46 1 hrs 1 mins
11 GeForce RTX 4070
AD104 [GeForce RTX 4070]
Nvidia AD104 6,998,648 412,469 16.97 1 hrs 25 mins
12 GeForce RTX 3080 12GB
GA102 [GeForce RTX 3080 12GB]
Nvidia GA102 6,950,772 420,709 16.52 1 hrs 27 mins
13 GeForce RTX 2080 Ti Rev. A
TU102 [GeForce RTX 2080 Ti Rev. A] M 13448
Nvidia TU102 6,489,323 409,161 15.86 1 hrs 31 mins
14 GeForce RTX 2080 Ti
TU102 [GeForce RTX 2080 Ti] M 13448
Nvidia TU102 6,105,600 53,568 113.98 0 hrs 13 mins
15 Radeon RX 6900 XT
Navi 21 [Radeon RX 6900 XT]
AMD Navi 21 5,680,939 391,592 14.51 1 hrs 39 mins
16 GeForce RTX 3070 Ti
GA104 [GeForce RTX 3070 Ti]
Nvidia GA104 5,582,346 389,155 14.34 1 hrs 40 mins
17 Radeon RX 6950 XT
Navi 21 [Radeon RX 6950 XT]
AMD Navi 21 5,504,609 388,299 14.18 1 hrs 42 mins
18 GeForce RTX 3070
GA104 [GeForce RTX 3070]
Nvidia GA104 4,771,994 366,681 13.01 1 hrs 51 mins
19 GeForce RTX 4060
AD107 [GeForce RTX 4060]
Nvidia AD107 4,505,031 363,045 12.41 1 hrs 56 mins
20 RTX A4500
GA102GL [RTX A4500]
Nvidia GA102GL 4,316,718 354,129 12.19 1 hrs 58 mins
21 GeForce RTX 4060 Ti
AD106 [GeForce RTX 4060 Ti]
Nvidia AD106 4,208,711 339,266 12.41 1 hrs 56 mins
22 Radeon RX 6800/6800XT/6900XT
Navi 21 [Radeon RX 6800/6800XT/6900XT]
AMD Navi 21 3,766,360 338,880 11.11 2 hrs 10 mins
23 GeForce RTX 3060 Ti
GA104 [GeForce RTX 3060 Ti]
Nvidia GA104 3,704,733 339,104 10.93 2 hrs 12 mins
24 GeForce RTX 2080 Rev. A
TU104 [GeForce RTX 2080 Rev. A] 10068
Nvidia TU104 3,603,590 336,598 10.71 2 hrs 15 mins
25 GeForce RTX 2080
TU104 [GeForce RTX 2080]
Nvidia TU104 3,471,606 332,565 10.44 2 hrs 18 mins
26 GeForce RTX 3060 Ti Lite Hash Rate
GA104 [GeForce RTX 3060 Ti Lite Hash Rate]
Nvidia GA104 3,131,136 322,435 9.71 2 hrs 28 mins
27 GeForce RTX 2070
TU106 [GeForce RTX 2070]
Nvidia TU106 3,034,207 316,421 9.59 2 hrs 30 mins
28 GeForce RTX 2060 Super
TU106 [GeForce RTX 2060 SUPER]
Nvidia TU106 2,837,646 311,900 9.10 2 hrs 38 mins
29 GeForce GTX 1080 Ti
GP102 [GeForce GTX 1080 Ti] 11380
Nvidia GP102 2,639,599 298,634 8.84 2 hrs 43 mins
30 GeForce RTX 3060 Mobile / Max-Q
GA106M [GeForce RTX 3060 Mobile / Max-Q]
Nvidia GA106M 2,535,992 288,570 8.79 2 hrs 44 mins
31 TITAN Xp
GP102 [TITAN Xp] 12150
Nvidia GP102 2,486,794 295,080 8.43 2 hrs 51 mins
32 GeForce RTX 2060 12GB
TU106 [GeForce RTX 2060 12GB]
Nvidia TU106 2,407,811 296,519 8.12 2 hrs 57 mins
33 GeForce RTX 2060
TU106 [Geforce RTX 2060]
Nvidia TU106 2,339,581 293,007 7.98 3 hrs 0 mins
34 Radeon RX 6700/6700XT/6800M
Navi 22 XT-XL [Radeon RX 6700/6700XT/6800M]
AMD Navi 22 XT-XL 2,314,326 134,118 17.26 1 hrs 23 mins
35 GeForce RTX 3060 Lite Hash Rate
GA106 [GeForce RTX 3060 Lite Hash Rate]
Nvidia GA106 2,302,663 239,753 9.60 2 hrs 30 mins
36 GeForce RTX 3060
GA106 [GeForce RTX 3060]
Nvidia GA106 2,283,668 286,159 7.98 3 hrs 0 mins
37 GeForce RTX 2060
TU104 [GeForce RTX 2060]
Nvidia TU104 2,025,901 270,331 7.49 3 hrs 12 mins
38 GeForce RTX 2060 Mobile
TU106M [GeForce RTX 2060 Mobile]
Nvidia TU106M 1,743,445 265,406 6.57 3 hrs 39 mins
39 RTX A2000
GA106 [RTX A2000]
Nvidia GA106 1,701,851 261,638 6.50 3 hrs 41 mins
40 GeForce GTX 1080
GP104 [GeForce GTX 1080] 8873
Nvidia GP104 1,555,650 256,916 6.06 3 hrs 58 mins
41 GeForce GTX 1070
GP104 [GeForce GTX 1070] 6463
Nvidia GP104 1,544,727 262,619 5.88 4 hrs 5 mins
42 GeForce GTX 1660 SUPER
TU116 [GeForce GTX 1660 SUPER]
Nvidia TU116 1,513,380 175,265 8.63 2 hrs 47 mins
43 GeForce RTX 3050 Ti Mobile
GA107M [GeForce RTX 3050 Ti Mobile]
Nvidia GA107M 1,396,443 246,846 5.66 4 hrs 15 mins
44 GeForce GTX 980 Ti
GM200 [GeForce GTX 980 Ti] 5632
Nvidia GM200 429,567 164,452 2.61 9 hrs 11 mins
45 GeForce GTX 1650
TU116 [GeForce GTX 1650] 3091
Nvidia TU116 411,697 163,975 2.51 9 hrs 34 mins
46 GeForce GTX 1050 LP
GP107 [GeForce GTX 1050 LP] 1862
Nvidia GP107 345,600 53,568 6.45 3 hrs 43 mins

PROJECT FOLDING PPD AVERAGES BY CPU BETA

Data as of Tuesday, 14 April 2026 06:34:57
Rank
Project
CPU Model Logical
Processors (LP)
PPD-PLP
AVG PPD per 1 LP
ALL LP-PPD
(Estimated)
Make