Logo del repository
  1. Home
 
Opzioni

Search for relevant subsets of binary predictors in high dimensional regression for discovering the lead molecule

Valentina Mameli
•
Debora Slanzi
•
Irene Poli
•
Darren V. S. Green
2021
  • journal article

Periodico
PHARMACEUTICAL STATISTICS
Abstract
One of the main problems that the drug discovery research field confronts is to identify small molecules, modulators of protein function, which are likely to be therapeutically useful. Common practices rely on the screening of vast libraries of small molecules (often 1–2 million molecules) in order to identify a molecule, known as a lead molecule, which specifically inhibits or activates the protein function. To search for the lead molecule, we investigate the molecular structure, which generally consists of an extremely large number of fragments. Presence or absence of particular fragments, or groups of fragments, can strongly affect molecular properties. We study the relationship between molecular properties and its fragment composition by building a regression model, in which predictors, represented by binary variables indicating the presence or absence of fragments, are grouped in subsets and a bi-level penalization term is introduced for the high dimensionality of the problem. We evaluate the performance of this model in two simulation studies, comparing different penalization terms and different clustering techniques to derive the best predictor subsets structure. Both studies are characterized by small sets of data relative to the number of predictors under consideration. From the results of these simulation studies, we show that our approach can generate models able to identify key features and provide accurate predictions. The good performance of these models is then exhibited with real data about the MMP–12 enzyme.
DOI
10.1002/pst.2117
WOS
WOS:000632596300001
Archivio
http://hdl.handle.net/11390/1203679
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85103223586
https://onlinelibrary.wiley.com/doi/10.1002/pst.2117
Diritti
closed access
Scopus© citazioni
0
Data di acquisizione
Jun 14, 2022
Vedi dettagli
Web of Science© citazioni
0
Data di acquisizione
Mar 28, 2024
Visualizzazioni
1
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback