REAL

Machine-learning model selection and parameter estimation from kinetic data of complex first-order reaction systems

Zimányi, László and Sipos, Áron and Sarlós, Ferenc and Nagypál, Rita and Groma, Géza (2021) Machine-learning model selection and parameter estimation from kinetic data of complex first-order reaction systems. PLOS ONE, 16 (8). ISSN 1932-6203

2424.pdf
Available under License Creative Commons Attribution.


Abstract

Dealing with a system of first-order reactions is a recurrent issue in chemometrics, especially in the analysis of data obtained by spectroscopic methods applied on complex biological systems. We argue that global multiexponential fitting, the still common way to solve such problems, has serious weaknesses compared to contemporary methods of sparse modeling. Combining the advantages of group lasso and elastic net—the statistical methods proven to be very powerful in other areas—we created an optimization problem tunable from very sparse to very dense distribution over a large pre-defined grid of time constants, fitting both simulated and experimental multiwavelength spectroscopic data with high computational efficiency. We found that the optimal values of the tuning hyperparameters can be selected by a machine-learning algorithm based on a Bayesian optimization procedure, utilizing widely used or novel versions of cross-validation. The derived algorithm accurately recovered the true sparse kinetic parameters of an extremely complex simulated model of the bacteriorhodopsin photocycle, as well as the wide peak of hypothetical distributed kinetics in the presence of different noise levels. It also performed well in the analysis of the ultrafast experimental fluorescence kinetics data detected on the coenzyme FAD in a very wide logarithmic time window. We conclude that the primary application of the presented algorithms—implemented in available software—covers a wide area of studies on light-induced physical, chemical, and biological processes carried out with different spectroscopic methods. The demand for this kind of analysis is expected to soar due to the emerging ultrafast multidimensional infrared and electronic spectroscopic techniques that provide very large and complex datasets. In addition, simulations based on our methods could help in designing the technical parameters of future experiments for the verification of particular hypothetical models.
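The kind of optimization problem the abstract describes can be illustrated with a minimal sketch. It is not the authors' algorithm or software. It assumes a dictionary of exponential decays on a logarithmic grid of time constants, a penalty that mixes a group-lasso term (one group per time constant, spanning all wavelengths) with a ridge term, and a plain proximal-gradient solver. The names `exp_dictionary`, `group_elastic_net`, `lam` and `alpha`, and the toy data are all hypothetical. The hyperparameters are fixed by hand here, whereas the abstract reports selecting them by Bayesian optimization with cross-validation.

```python
import numpy as np


def exp_dictionary(t, taus):
    """Columns are decays exp(-t / tau), one per time constant on the grid."""
    return np.exp(-t[:, None] / taus[None, :])


def group_elastic_net(D, Y, lam, alpha, n_iter=2000):
    """Proximal-gradient sketch (assumed formulation, not the paper's solver).

    Minimises 0.5 * ||Y - D X||_F^2
              + lam * (alpha * sum_j ||X[j, :]||_2
                       + 0.5 * (1 - alpha) * ||X||_F^2),
    where row j of X holds the amplitudes of time constant j at every wavelength.
    alpha = 1 gives a pure group lasso (sparse); alpha = 0 gives pure ridge (dense).
    """
    X = np.zeros((D.shape[1], Y.shape[1]))
    # Step size from the Lipschitz constant of the least-squares gradient.
    step = 1.0 / np.linalg.norm(D, 2) ** 2
    for _ in range(n_iter):
        Z = X - step * (D.T @ (D @ X - Y))  # gradient step on the data term
        norms = np.linalg.norm(Z, axis=1, keepdims=True)
        # Group soft-thresholding: shrink whole rows, zeroing the weak ones.
        shrink = np.maximum(0.0, 1.0 - step * lam * alpha / np.maximum(norms, 1e-12))
        # Closed-form shrinkage for the ridge part of the penalty.
        X = shrink * Z / (1.0 + step * lam * (1.0 - alpha))
    return X


# Hypothetical toy data: two decay components observed at three wavelengths.
rng = np.random.default_rng(0)
t = np.logspace(-2, 3, 200)    # sampling times
taus = np.logspace(-2, 3, 50)  # pre-defined grid of time constants
D = exp_dictionary(t, taus)
X_true = np.zeros((taus.size, 3))
X_true[15] = [1.0, -0.5, 0.3]
X_true[35] = [0.4, 0.8, -0.6]
Y = D @ X_true + 0.01 * rng.standard_normal((t.size, 3))

X_hat = group_elastic_net(D, Y, lam=1e-2, alpha=0.9)
row_norms = np.linalg.norm(X_hat, axis=1)  # amplitude per grid time constant
```

In this sketch, `row_norms` plays the role of a distribution over the time-constant grid. Moving `alpha` toward 1 favors a few isolated time constants, while moving it toward 0 spreads the amplitude over neighboring grid points. Neighboring exponentials are strongly correlated, so the recovered peaks should not be expected to coincide exactly with the grid points used to generate the data.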

Item Type: Article
Subjects: Q Science > QH Natural history > QH301 Biology > QH3020 Biophysics
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 07 Feb 2022 07:41
Last Modified: 26 Apr 2023 11:38
URI: http://real.mtak.hu/id/eprint/137459
