NUEVAS HERRAMIENTAS SINTETICAS Y QUIMIOINFORMATICAS PARA LA CONSTRUCCION Y DIVERSIFICACION DE HETEROCICLOS ¿DRUG-LIKE¿. ACTIVACION C-H Y MACHINE LEARNING

PID2019-104148GB-I00

Nombre agencia financiadora Agencia Estatal de Investigación
Acrónimo agencia financiadora AEI
Programa Programa Estatal de Generación de Conocimiento y Fortalecimiento Científico y Tecnológico del Sistema de I+D+i
Subprograma Subprograma Estatal de Generación de Conocimiento
Convocatoria Proyectos I+D
Año convocatoria 2019
Unidad de gestión Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020
Centro beneficiario UNIVERSIDAD DEL PAIS VASCO EUSKAL HERRIKO UNIBERTSITATEA
Identificador persistente http://dx.doi.org/10.13039/501100011033

Publicaciones

Found(s) 13 result(s)
Found(s) 1 page(s)

Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds

Digital.CSIC. Repositorio Institucional del CSIC
  • Diéguez, Karel
  • Casañola, Gerardo
  • Torres, Roldán
  • Rasulev, Bakhtiyor
  • Green, James R.
  • González-Díaz, Humberto
Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research., G.D.H. acknowledges financial support from grants from the Ministry of Science and Innovation (PID 2019-104148 GB-I00) and grant no. IT1045-16-2016–2021 from the Basque Government., Peer reviewed




Towards rational nanomaterial design by predicting drug–nanoparticle system interaction vs. bacterial metabolic networks

Digital.CSIC. Repositorio Institucional del CSIC
  • Diéguez, Karel
  • Rasulev, Bakhtiyor
  • González-Díaz, Humberto
The emergence of multidrug-resistant (MDR) strains with perturbed metabolic networks (MNs) pushes researchers to improve antibacterial drugs (ADs). Certain nanoparticles (NPs) may present antibacterial activity along with acting as delivery systems. Thus, developing dual antibacterial drug–nanoparticle (DADNP) systems becomes an option. However, testing DADNPs vs. strains with different MNs is a hard and costly task. Artificial intelligence (AI) or machine learning (ML) could accelerate this by predicting bacterial sensitivity. In this work, we used an information fusion perturbation-theory machine learning (IFPTML) analysis and mapping of DADNP (AD + NP) systems vs. MNs of pathogenic bacterial species as a new application of AI/ML methods. Furthermore, most existing AI/ML models do not use cj of experimental conditions of assays (i.e., bacteria species, strain, NP shape, etc.) as input vectors. A working solution may be the use of an AI/ML method with an information fusion (IF) additive approach. Additive IF uses the sets of vectors Ddk, Dnk, Dmk and cdk, cnk, csk as inputs with information about AD, NP, and MN structure and assays separately. Accordingly, the IFPTML algorithm was selected to seek predictive models based on a ChEMBL dataset of >160 000 AD assays enriched with 300 NP assays and >25 MNs of different bacterial species. IFPTML uses the IF process to join the three datasets, PT operators (PTOs) to codify Ddk, Dnk, Dsk and cdk, cnk, csk vector information, and ML algorithms to train the model. The IFPTML linear discriminant analysis (LDA) model with Sp ≈ 90% and Sn ≈ 80% and the best artificial neural network (ANN) model found with Sp ≈ Sn ≈ 95% in the training/validation series presented good results. This kind of model could be useful for DADNP system discovery. We also ran a simulation with >140 000 points of putative DADNP systems vs. wild type and knockout (KO) computationally generated bacterial strains. The linear and additive IFPTML model was able to predict 102 experimental cases of complex DADNPs with a high degree of structural and biological variety. This led us to introduce the concept of MDR computational surveillance that could help to detect new strains of MDR bacteria., G. D. H. acknowledges financial support from grants Ministry of Science and Innovation (PID2019-104148GB-I00) and grant (IT1045-16) – 2016–2021 of Basque Government., Peer reviewed




Prediction of Antileishmanial Compounds: General Model, Preparation, and Evaluation of 2-Acylpyrrole Derivatives

Digital.CSIC. Repositorio Institucional del CSIC
  • Santiago, Carlos
  • Ortega Tenezaca, Bernabé
  • Barbolla, Iratxe
  • Fundora, Brenda
  • Arrasate, Sonia
  • Dea-Ayuela, M. Auxiliadora
  • González-Díaz, Humberto
  • Sotomayor, Nuria
  • Lete, Esther
In this work, the SOFT.PTML tool has been used to pre-process a ChEMBL dataset of pre-clinical assays of antileishmanial compound candidates. A comparative study of different ML algorithms, such as logistic regression (LOGR), support vector machine (SVM), and random forests (RF), has shown that the IFPTML-LOGR model presents excellent values of specificity and sensitivity (81-98%) in training and validation series. The use of this software has been illustrated with a practical case study focused on a series of 28 derivatives of 2-acylpyrroles 5a,b, obtained through a Pd(II)-catalyzed C-H radical acylation of pyrroles. Their in vitro leishmanicidal activity against visceral (L. donovani) and cutaneous (L. amazonensis) leishmaniasis was evaluated finding that compounds 5bc (IC= 30.87 μM, SI > 10.17) and 5bd (IC= 16.87 μM, SI > 10.67) were approximately 6-fold more selective than the drug of reference (miltefosine) in in vitro assays against L. amazonensis promastigotes. In addition, most of the compounds showed low cytotoxicity, CC> 100 μg/mL in J774 cells. Interestingly, the IFPMTL-LOGR model predicts correctly the relative biological activity of these series of acylpyrroles. A computational high-throughput screening (cHTS) study of 2-acylpyrroles 5a,b has been performed calculating >20,700 activity scores vs a large space of 647 assays involving multiple Leishmania species, cell lines, and potential target proteins. Overall, the study demonstrates that the SOFT.PTML all-in-one strategy is useful to obtain IFPTML models in a friendly interface making the work easier and faster than before. The present work also points to 2-acylpyrroles as new lead compounds worthy of further optimization as antileishmanial hits., Ministerio de Ciencia e Innovación (PID2019-104148GB-I00) and Gobierno Vasco (IT1558-22) are gratefully acknowledged for their financial support. I.B. wishes to thank Fundación Biofísica Bizkaia/Biofisika Bizkaia Fundazioa (FBB) for a postdoctoral grant funded by BERC Basque Government program.




IFPTML Mapping of Drug Graphs with Protein and Chromosome Structural Networks vs. Pre-Clinical Assay Information for Discovery of Antimalarial Compounds

Digital.CSIC. Repositorio Institucional del CSIC
  • Quevedo-Tumailli, Viviana
  • Ortega Tenezaca, Bernabé
  • González-Díaz, Humberto
The parasite species of genus Plasmodium causes Malaria, which remains a major global health problem due to parasite resistance to available Antimalarial drugs and increasing treatment costs. Consequently, computational prediction of new Antimalarial compounds with novel targets in the proteome of Plasmodium sp. is a very important goal for the pharmaceutical industry. We can expect that the success of the pre-clinical assay depends on the conditions of assay per se, the chemical structure of the drug, the structure of the target protein to be targeted, as well as on factors governing the expression of this protein in the proteome such as genes (Deoxyribonucleic acid, DNA) sequence and/or chromosomes structure. However, there are no reports of computational models that consider all these factors simultaneously. Some of the difficulties for this kind of analysis are the dispersion of data in different datasets, the high heterogeneity of data, etc. In this work, we analyzed three databases ChEMBL (Chemical database of the European Molecular Biology Laboratory), UniProt (Universal Protein Resource), and NCBI-GDV (National Center for Biotechnology Information-Genome Data Viewer) to achieve this goal. The ChEMBL dataset contains outcomes for 17,758 unique assays of potential Antimalarial compounds including numeric descriptors (variables) for the structure of compounds as well as a huge amount of information about the conditions of assays. The NCBI-GDV and UniProt datasets include the sequence of genes, proteins, and their functions. In addition, we also created two partitions (cassayj = caj and cdataj = cdj) of categorical variables from theChEMBL dataset. These partitions contain variables that encode information about experimental conditions of preclinical assays (caj) or about the nature and quality of data (cdj). These categorical variables include information about 22 parameters of biological activity (ca0), 28 target proteins (ca1), and 9 organisms of assay (ca2), etc. We also created another partition of (cprotj = cpj) including categorical variables with biological information about the target proteins, genes, and chromosomes. These variables cover32 genes (cp0), 10 chromosomes (cp1), gene orientation (cp2), and 31 protein functions (cp3). We used a Perturbation-Theory Machine Learning Information Fusion (IFPTML) algorithm to map all this information (from three databases) into and train a predictive model. Shannon's entropy measure Shk (numerical variables) was used to quantify the information about the structure of drugs, protein sequences, gene sequences, and chromosomes in the same information scale. Perturbation Theory Operators (PTOs) with the form of Moving Average (MA) operators have been used to quantify perturbations (deviations) in the structural variables with respect to their expected values for different subsets (partitions) of categorical variables. We obtained three IFPTML models using General Discriminant Analysis (GDA), Classification Tree with Univariate Splits (CTUS), and Classification Tree with Linear Combinations (CTLC). The IFPTML-CTLC presented the better performance with Sensitivity Sn(%) = 83.6/85.1, and Specificity Sp(%) = 89.8/89.7 for training/validation sets, respectively. This model could become a useful tool for the optimization of preclinical assays of new Antimalarial compounds vs. different proteins in the proteome of Plasmodium., H.G.-D. personally acknowledges financial support from the Minister of Science and Innovation (PID2019-104148GB-I00) and a grant (IT1045-16)—2016–2021 from the Basque Government. V.Q.T. acknowledges Universidad EstatalAmazónica (UEA) scholarship for postgraduate studies; Ecuador Sciences PhD Program, (UEA.Res.26.2019.06.13)., Peer reviewed




IFPTML mapping of nanoparticle antibacterial activity vs. pathogen metabolic networks

Digital.CSIC. Repositorio Institucional del CSIC
  • Ortega Tenezaca, Bernabé
  • González-Díaz, Humberto
Nanoparticles are useful antimicrobial drug-release systems, but some nanoparticles also exhibit antibacterial activity. However, investigation of their antibacterial activity is a difficult and slow process due to the numerous combinations of nanoparticle size, shape, and composition vs. biological tests, assay organisms, and multiple activity parameters to be measured. Additionally, the overuse of antibiotics has led to the emergence of resistant bacterial strains with different metabolic networks. Computational models may speed up this process, but the models reported to date do not to consider all the previous factors, and the data sources are dispersed and not curated. Thus, herein, we used an information fusion, perturbation-theory machine learning (IFPTML) approach, which is introduced by us for the first time, to fit a model for the discovery of antibacterial nanoparticles. The dataset studied had 15 classes of nanoparticles (1-100 nm) with most cases in the range of 1-50 nm vs. >20 pathogenic bacteria species with different metabolic networks. The nanoparticles studied included metal nanoparticles of Au, Ag, and Cu; oxide nanoparticles of Zn, Cu, La, Al, Fe, Sn, Ti, Cd, and Si; and metal salt nanoparticles of CuI and CdS. We used the SOFT.PTML software (our own application) with a user-friendly interface for the IFPTML calculations and a control statistics package. Using SOFT.PTML, we found a linear logistic regression equation that could model 4 biological activity parameters using only 8 variables with χ2 = 2265.75, p-level <0.05, sensitivity, Sn = 79.4, and specificity, Sp = 99.3, for 3213 cases (nanoparticle-bacteria pairs) in the training series. The model had Sn = 80.8 and Sp = 99.3 for 2114 cases in the external validation series. We also developed a random forest non-linear model with higher values of Sn and Sp = 98-99% in the training/validation series, although it was more complicated to use. SOFT.PTML has been demonstrated to be a useful tool for the analysis of complex data in nanotechnology. We also introduced a new anabolism-catabolism unbalance index of metabolic networks to reveal the biological connotation of the IFPTML predictions for antibacterial nanoparticles. These new models open a new door for the discovery of NPs vs. new bacterial species and strains with different topological structures of their metabolic networks., G. D. H. personally acknowledges financial support from grants Minister of Science and Innovation (PID2019-104148GB-I00) and grant (IT1045-16) – 2016–2021 of Basque Government., Peer reviewed




Palladium-mediated synthesis and biological evaluation of C-10b substituted Dihydropyrrolo[1,2-b]isoquinolines as antileishmanial agents

Digital.CSIC. Repositorio Institucional del CSIC
  • Barbolla, Iratxe
  • Hernández-Suárez, Leidi
  • Quevedo-Tumailli, Viviana
  • Nocedo-Mena, Deyani
  • Arrasate, Sonia
  • Dea-Ayuela, M. Auxiliadora
  • González-Díaz, Humberto
  • Sotomayor, Nuria
  • Lete, Esther
The development of new molecules for the treatment of leishmaniasis is, a neglected parasitic disease, is urgent as current anti-leishmanial therapeutics are hampered by drug toxicity and resistance. The pyrrolo[1,2-b]isoquinoline core was selected as starting point, and palladium-catalyzed Heck-initiated cascade reactions were developed for the synthesis of a series of C-10 substituted derivatives. Their in vitro leishmanicidal activity against visceral (L. donovani) and cutaneous (L. amazonensis) leishmaniasis was evaluated. The best activity was found, in general, for the 10-arylmethyl substituted pyrroloisoquinolines. In particular, 2ad (IC50 = 3.30 μM, SI > 77.01) and 2bb (IC50 = 3.93 μM, SI > 58.77) were approximately 10-fold more potent and selective than the drug of reference (miltefosine), against L. amazonensis on in vitro promastigote assays, while 2ae was the more active compound in the in vitro amastigote assays (IC50 = 33.59 μM, SI > 8.93). Notably, almost all compounds showed low cytotoxicity, CC50 > 100 μg/mL in J774 cells, highest tested dose. In addition, we have developed the first Perturbation Theory Machine Learning (PTML) algorithm able to predict simultaneously multiple biological activity parameters (IC50, Ki, etc.) vs. any Leishmania species and target protein, with high values of specificity (>98%) and sensitivity (>90%) in both training and validation series. Therefore, this model may be useful to reduce time and assay costs (material and human resources) in the drug discovery process., Ministerio de Economía y Competitividad (CTQ2016-74881-P), Ministerio de Ciencia e Innovación (PID2019-104148 GB-I00) and Gobierno Vasco (IT1045-16) are gratefully acknowledged for their financial support. I. B. wishes to thank Fundación Biofísica Bizkaia/Biofisika Bizkaia Fundazioa (FBB) for a postdoctoral grant., Peer reviewed




Pd(II)-Catalyzed Fujiwara-Moritani Reactions for the Synthesis and Functionalization of Substituted Coumarins

Digital.CSIC. Repositorio Institucional del CSIC
  • Ortiz de Elguea, Verónica
  • Carral-Menoyo, Asier
  • Simón-Vidal, Lorena
  • Martínez-Nunes, Mikel
  • Barbolla, Iratxe
  • Lete, Marta G.
  • Sotomayor, Nuria
  • Lete, Esther
Highly substituted coumarins, privileged and versatile scaffolds for bioactive natural products and fluorescence imaging, are obtained via a Pd(II)-catalyzed direct C-H alkenylation reaction (Fujiwara-Moritani reaction), which has emerged as a powerful tool for the construction and functionalization of heterocyclic compounds because of its chemical versatility and its environmental advantages. Thus, a selective 6-endo cyclization led to 4-substituted coumarins in moderate yields. Selected examples have been further functionalized in C3 through a second intermolecular C-H alkenylation reaction to give coumarin-acrylate hybrids, whose fluorescence spectra have been measured., Ministerio de Economía y Competitividad (FEDER CTQ2016-74881-P), Ministerio de Ciencia e Innovación (PID2019-104148GB-I00), and Gobierno Vasco (IT1045-16) are gratefully acknowledged for their financial support. A.C.-M. wishes to thank Gobierno Vasco for a grant., Peer reviewed




Towards machine learning discovery of dual antibacterial drug-nanoparticle systems

Digital.CSIC. Repositorio Institucional del CSIC
  • Diéguez, Karel
  • González-Díaz, Humberto
Artificial Intelligence/Machine Learning (AI/ML) algorithms may speed up the design of DADNP systems formed by Antibacterial Drugs (AD) and Nanoparticles (NP). In this work, we used IFPTML = Information Fusion (IF) + Perturbation-Theory (PT) + Machine Learning (ML) algorithm for the first time to study of a large dataset of putative DADNP systems composed by >165 000 ChEMBL AD assays and 300 NP assays vs. multiple bacteria species. We trained alternative models with Linear Discriminant Analysis (LDA), Artificial Neural Networks (ANN), Bayesian Networks (BNN), K-Nearest Neighbour (KNN) and other algorithms. IFPTML-LDA model was simpler with values of Sp ≈ 90% and Sn ≈ 74% in both training (>124 K cases) and validation (>41 K cases) series. IFPTML-ANN and KNN models are notably more complicated even when they are more balanced Sn ≈ Sp ≈ 88.5%-99.0% and AUROC ≈ 0.94-0.99 in both series. We also carried out a simulation (>1900 calculations) of the expected behavior for putative DADNPs in 72 different biological assays. The putative DADNPs studied are formed by 27 different drugs with multiple classes of NP and types of coats. In addition, we tested the validity of our additive model with 80 DADNP complexes experimentally synthetized and biologically tested (reported in >45 papers). All these DADNPs show values of MIC < 50 μg mL-1 (cutoff used) better that MIC of AD and NP alone (synergistic or additive effect). The assays involve DADNP complexes with 10 types of NP, 6 coating materials, NP size range 5-100 nm vs. 15 different antibiotics, and 12 bacteria species. The IFPTML-LDA model classified correctly 100% (80 out of 80) DADNP complexes as biologically active. IFPMTL additive strategy may become a useful tool to assist the design of DADNP systems for antibacterial therapy taking into consideration only information about AD and NP components by separate., G. D. H acknowledges financial support from grants Minister of Science and Innovation (PID2019-104148GB-I00) and grant (IT1045-16) – 2016–2021 of Basque Government., Peer reviewed




Machine learning in antibacterial discovery and development: A bibliometric and network analysis of research hotspots and trends

Digital.CSIC. Repositorio Institucional del CSIC
  • Diéguez, Karel
  • González-Díaz, Humberto
Machine learning (ML) methods are used in cheminformatics processes to predict the activity of an unknown drug and thus discover new potential antibacterial drugs. This article conducts a bibliometric study to analyse the contributions of leading authors, universities/organisations and countries in terms of productivity, citations and bibliographic linkage. A sample of 1596 Scopus documents for the period 2006-2022 is the basis of the study. In order to develop the analysis, bibliometrix R-Tool and VOSviewer software were used. We determined essential topics related to the application of ML in the field of antibacterial development (Computer model in antibacterial drug design, and Learning algorithms and systems for forecasting). We identified obsolete and saturated areas of research. At the same time, we proposed emerging topics according to the various analyses carried out on the corpus of published scientific literature (Title, abstract and keywords). Finally, the applied methodology contributed to building a broader and more specific "big picture" of ML research in antibacterial studies for the focus of future projects., G.D.H acknowledges financial support from grants from the Basque Government IT1558-22 (2022-2025), SPRI ELKARTEK KK-2022/00032 (2022-2024), and MICIIN PID2019-104148GB-I00 (2020-2022)., Peer reviewed




Trends in Nanoparticles for Leishmania Treatment: A Bibliometric and Network Analysis

Digital.CSIC. Repositorio Institucional del CSIC
  • Mazón-Ortiz, Gabriel
  • Cerda-Mejía, Galo
  • Gutiérrez Morales, Eberto
  • Diéguez, Karel
  • Ruso, Juan M.
  • González-Díaz, Humberto
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/)., Leishmaniasis is a neglected tropical illness with a wide variety of clinical signs ranging from visceral to cutaneous symptoms, resulting in millions of new cases and thousands of fatalities reported annually. This article provides a bibliometric analysis of the main authors’ contributions, institutions, and nations in terms of productivity, citations, and bibliographic linkages to the application of nanoparticles (NPs) for the treatment of leishmania. The study is based on a sample of 524 Scopus documents from 1991 to 2022. Utilising the Bibliometrix R-Tool version 4.0 and VOSviewer software, version 1.6.17 the analysis was developed. We identified crucial subjects associated with the application of NPs in the field of antileishmanial development (NPs and drug formulation for leishmaniasis treatment, animal models, and experiments). We selected research topics that were out of date and oversaturated. Simultaneously, we proposed developing subjects based on multiple analyses of the corpus of published scientific literature (title, abstract, and keywords). Finally, the technique used contributed to the development of a broader and more specific “big picture” of nanomedicine research in antileishmanial studies for future projects., H.G.-D. acknowledges financial support from grants from the Basque Government IT1558-22 (2022–2025), SPRI ELKARTEK KK-2022/00032 (2022–2024), and MICIIN PID2019-104148GB-I00 (2020–2022)., Peer reviewed




Identification of Riluzole derivatives as novel calmodulin inhibitors with neuroprotective activity by a joint synthesis, biosensor, and computational guided strategy

Digital.CSIC. Repositorio Institucional del CSIC
  • Baltasar-Marchueta, Maider
  • Llona, Leire
  • M-Alicante, Sara
  • Barbolla, Iratxe
  • García Ibarluzea, Markel
  • Ramis, Rafael
  • Salomon, Ane Miren
  • Fundora, Brenda
  • Araujo, Ariane
  • Muguruza-Montero, Arantza
  • Núñez, Eider
  • Pérez-Olea, Scarlett
  • Villanueva, Christian
  • Leonardo, Aritz
  • Arrasate, Sonia
  • Sotomayor, Nuria
  • Villarroel, Álvaro
  • Bergara, Aitor
  • Lete, Esther
  • González-Díaz, Humberto
© 2024 The Authors. Published by Elsevier Masson SAS. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)., The development of new molecules for the treatment of calmodulin related cardiovascular or neurodegenerative diseases is an interesting goal. In this work, we introduce a novel strategy with four main steps: (1) chemical synthesis of target molecules, (2) Förster Resonance Energy Transfer (FRET) biosensor development and in vitro biological assay of new derivatives, (3) Cheminformatics models development and in vivo activity prediction, and (4) Docking studies. This strategy is illustrated with a case study. Firstly, a series of 4-substituted Riluzole derivatives 1–3 were synthetized through a strategy that involves the construction of the 4-bromoriluzole framework and its further functionalization via palladium catalysis or organolithium chemistry. Next, a FRET biosensor for monitoring Ca2+-dependent CaM-ligands interactions has been developed and used for the in vitro assay of Riluzole derivatives. In particular, the best inhibition (80%) was observed for 4-methoxyphenylriluzole 2b. Besides, we trained and validated a new Networks Invariant, Information Fusion, Perturbation Theory, and Machine Learning (NIFPTML) model for predicting probability profiles of in vivo biological activity parameters in different regions of the brain. Next, we used this model to predict the in vivo activity of the compounds experimentally studied in vitro. Last, docking study conducted on Riluzole and its derivatives has provided valuable insights into their binding conformations with the target protein, involving calmodulin and the SK4 channel. This new combined strategy may be useful to reduce assay costs (animals, materials, time, and human resources) in the drug discovery process of calmodulin inhibitors., Basque Government / Eusko Jaurlaritza (IT1558–22, IT1707-22) and SPRI ELKARTEK grant (CardiCaM KK-2020/00110) are acknowledged for financial support. We also acknowledge Ministry of Science and Innovation (PID2019–104148GB-100, PID2021–128286NB-100, PID2022–137365NB-100 funded by MCIN/AEI/10.13039/501100011033/FEDER, UE, including FEDER funds)., Peer reviewed




MATEO: intermolecular α-amidoalkylation theoretical enantioselectivity optimization. Online tool for selection and design of chiral catalysts and products

Digital.CSIC. Repositorio Institucional del CSIC
  • Carracedo-Reboredo, Paula
  • Aranzamendi, Eider
  • He, Shan
  • Arrasate, Sonia
  • Munteanu, Cristian Robert
  • Fernández-Lozano, Carlos
  • Sotomayor, Nuria
  • Lete, Esther
  • González-Díaz, Humberto
MATEO web server was implemented for public use by experimental organic chemists, see link: https://cptmltool.rnasa-imedir.com/CPTMLTools-Web/mateo.The code of the software was uploaded to a GitHub repository and is available free for use by cheminformatics researchers with MIT license. The links are the following. For the MATEO server code the link is: https://github.com/glezdiazh/MATEO. For libraries used to calculate the molecular descriptors the link is: https://github.com/muntisa/RMarkovTI.All data files (SI00, SI01, and SI02) have been uploaded to a public data repository and are available for use free of charge under universal commons creative license (CC0). The links are, SI00.pdf file link: https://doi.org/https://doi.org/10.6084/m9.figshare.21981740.v2, Additional file 2: https://doi.org/https://doi.org/10.6084/m9.figshare.21971690.v2, and Additional file 3: https://doi.org/https://doi.org/10.6084/m9.figshare.21971696.v2., The enantioselective Brønsted acid-catalyzed α-amidoalkylation reaction is a useful procedure is for the production of new drugs and natural products. In this context, Chiral Phosphoric Acid (CPA) catalysts are versatile catalysts for this type of reactions. The selection and design of new CPA catalysts for different enantioselective reactions has a dual interest because new CPA catalysts (tools) and chiral drugs or materials (products) can be obtained. However, this process is difficult and time consuming if approached from an experimental trial and error perspective. In this work, an Heuristic Perturbation-Theory and Machine Learning (HPTML) algorithm was used to seek a predictive model for CPA catalysts performance in terms of enantioselectivity in α-amidoalkylation reactions with R2 = 0.96 overall for training and validation series. It involved a Monte Carlo sampling of > 100,000 pairs of query and reference reactions. In addition, the computational and experimental investigation of a new set of intermolecular α-amidoalkylation reactions using BINOL-derived N-triflylphosphoramides as CPA catalysts is reported as a case of study. The model was implemented in a web server called MATEO: InterMolecular Amidoalkylation Theoretical Enantioselectivity Optimization, available online at: https://cptmltool.rnasa-imedir.com/CPTMLTools-Web/mateo . This new user-friendly online computational tool would enable sustainable optimization of reaction conditions that could lead to the design of new CPA catalysts along with new organic synthesis products., The authors acknowledge financial support from Grant PID2019-104148 GB-I00 and PID2022-137365NB-I00 funded by MCIN/ AEI/10.13039/501100011033 and Grant IT1558-22 funded by Basque Government/Eusko Jaurlaritza, 2022–2025.CITIC is funded by the Xunta de Galicia through the collaboration agreement between the Department of Culture, Education, Vocational Training and Universities and the Galician universities to strengthen the research centers of the Galician University System (CIGUS)., Peer reviewed




SI00 Experimental Section

Digital.CSIC. Repositorio Institucional del CSIC
  • González-Díaz, Humberto
The enantioselective Brønsted acid-catalyzed α-amidoalkylation reaction is a useful procedure is for the production of new drugs and natural products (products) or chiral catalysts (tools). The enantioselectivity is sensitive to many factors, from the nature of the nucleophile and the catalyst to the experimental conditions (solvent, temperature, etc.). Although computational chemistry has been used to rationalize experimental results, it is still difficult to understand the influence of different parameters (solvent, temperature, etc.) on the quantitative reaction outcome (as yield or regio- and stereoselectivities).Both experimental and computational (Quantum Chemistry) study of a large number of reactions may become costly in terms of resources and time. Thus, the development of fast-track public computational tools to predict the enantioselectivity [enantiomeric excess ee(%)obs] would be very useful. Furthermore, making the new tool available online could save time and experimental resources in many labs worldwide. We used an Heuristic Perturbation-Theory and Machine Learning (HPTML) algorithm to seek a predictive model with R2 = 0.91 in training and validation series has been developed. It involves a Monte Carlo sampling of>100,000 pairs of query and reference reactions. In addition, the computational and experimental investigation of a new set of intermolecular α-amidoalkylation reactions using BINOL-derived N-trifylphosphoramides as chiral catalysts is reported as a case of study. After validation of the model, it was implementedin a web server called MATEO: InterMolecular Amidoalkylation Theoretical Enantioselectivity Optimization. This tool is available online at:https://cptmltool.rnasa-imedir.com/CPTMLTools-Web/mateo.This new user-friendly online computational tool may become useful to explore a large number of combinations of reactants, catalysts, and experimental conditions. This public tool would enable sustainable optimization of reaction conditions that could lead to the design of new catalysts, substrates, nucleophiles, and/or products., Ministry of Science and Innovation (PID2019-104148GB-I00); Basque Government / Eusko Jaurlaritza (IT1558-22), Peer reviewed