Data-Driven Identification of Stochastic Dynamical Systems
Main Article Content
Abstract
Identifying stochastic dynamical systems from observational data remains a major challenge in applied mathematics and engineering, particularly when complex systems are influenced by random perturbations and incomplete empirical information. This comprehensive review aims to examine state-of-the-art data-driven methods for discovering governing equations, estimating parameters, and predicting the behavior of stochastic dynamical systems. The review systematically analyzes key methodological approaches, including Sparse Identification of Nonlinear Dynamics (SINDy), Dynamic Mode Decomposition (DMD) and its extensions, Koopman operator theory, neural ordinary differential equations, and Bayesian inference. Each approach is evaluated in terms of its theoretical foundations, computational requirements, robustness to noise, and applicability to different classes of stochastic systems. Drawing on numerical experiments and real-world case studies, the findings show that no single method consistently outperforms others across all scenarios. Instead, hybrid approaches that integrate physics-informed constraints with machine learning demonstrate the strongest potential for advancing data-driven system identification. The review concludes that future research should address real-time identification, uncertainty quantification, and the integration of multi-fidelity data sources to improve the reliability and scalability of stochastic system modeling. This work contributes a comprehensive framework for guiding researchers and practitioners in selecting and implementing appropriate identification methods for stochastic dynamical systems.

Citation Metrics:
Downloads
Article Details

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
References
Arnold, L. (1974). Stochastic differential equations: Theory and applications. Wiley.
Bishop, C. M. (2006). Pattern recognition and machine learning. Springer. https://link.springer.com/book/9780387310732
Boulanger-Lewandowski, N., Bengio, Y., & Vincent, P. (2012). Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription. In Proceedings of the 29th International Conference on Machine Learning (pp. 1159–1166). https://icml.cc/2012/papers/590.pdf
Brunton, S. L., Proctor, J. L., & Kutz, J. N. (2016). Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proceedings of the National Academy of Sciences, 113(15), 3932–3937. https://doi.org/10.1073/pnas.1517384113
Champion, K., Lusch, B., Kutz, J. N., & Brunton, S. L. (2019). Data-driven discovery of coordinates and governing equations. Proceedings of the National Academy of Sciences, 116(45), 22445–22451. https://doi.org/10.1073/pnas.1906995116
Chen, R. T. Q., Rubanova, Y., Bettencourt, J., & Duvenaud, D. (2018). Neural ordinary differential equations. In Advances in Neural Information Processing Systems (Vol. 31, pp. 6571–6583). Neural Information Processing Systems Foundation. https://papers.neurips.cc/paper/7892-neural-ordinary-differential-equations
Cranmer, M., Greydanus, S., Hoyer, S., Battaglia, P., Spergel, D., & Ho, S. (2020). Lagrangian neural networks. In ICLR 2020 Workshop on Integration of Deep Neural Models and Differential Equations. https://openreview.net/forum?id=iE8tFa4Nq
de Bézenac, E., Pajot, A., & Gallinari, P. (2019). Deep learning for physical processes: Incorporating prior scientific knowledge. Journal of Statistical Mechanics: Theory and Experiment, 2019(12), Article 124009. https://doi.org/10.1088/1742-5468/ab3195
Freidlin, M. I., & Wentzell, A. D. (2012). Random perturbations of dynamical systems (3rd ed.). Springer. https://doi.org/10.1007/978-3-642-25847-3
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013). Bayesian data analysis (3rd ed.). CRC Press. https://doi.org/10.1201/b16018
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press. https://www.deeplearningbook.org
Guckenheimer, J., & Holmes, P. (1983). Nonlinear oscillations, dynamical systems, and bifurcations of vector fields. Springer. https://doi.org/10.1007/978-1-4612-1140-2
Hairer, M., & Mattingly, J. C. (2011). A theory of hypoellipticity and unique ergodicity for semilinear stochastic PDEs. Electronic Journal of Probability, 16, 658–738. https://doi.org/10.1214/EJP.v16-875
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770–778). https://doi.org/10.1109/CVPR.2016.90
Huang, G., Liu, Z., van der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4700–4708). https://doi.org/10.1109/CVPR.2017.243
Karniadakis, G. E., Kevrekidis, I. G., Lu, L., Perdikaris, P., Wang, S., & Yang, L. (2021). Physics-informed machine learning. Nature Reviews Physics, 3(6), 422–440. https://doi.org/10.1038/s42254-021-00314-5
Kloeden, P. E., & Platen, E. (1992). Numerical solution of stochastic differential equations. Springer. https://doi.org/10.1007/978-3-662-12616-5
Klus, S., Nüske, F., & Hamzi, B. (2020). Kernel-based approximation of the Koopman generator and Schrödinger operator. Entropy, 22(7), Article 722. https://doi.org/10.3390/e22070722
Kutz, J. N., Brunton, S. L., Brunton, B. W., & Proctor, J. L. (2016). Dynamic mode decomposition: Data-driven modeling of complex systems. SIAM. https://doi.org/10.1137/1.9781611974508
Lasota, A., & Mackey, M. C. (1994). Chaos, fractals, and noise: Stochastic aspects of dynamics. Springer. https://doi.org/10.1007/978-1-4612-4286-4
Li, X., Wong, T. K. L., Chen, R. T. Q., & Duvenaud, D. (2020). Scalable gradients for stochastic differential equations. In Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics (pp. 3870–3882). PMLR. https://proceedings.mlr.press/v108/li20i.html
Lusch, B., Kutz, J. N., & Brunton, S. L. (2018). Deep learning for universal linear embeddings of nonlinear dynamics. Nature Communications, 9, Article 4950. https://doi.org/10.1038/s41467-018-07210-0
Majda, A. J., & Harlim, J. (2012). Filtering complex turbulent systems. Cambridge University Press. https://doi.org/10.1017/CBO9781139061308
Mezić, I. (2013). Analysis of fluid flows via spectral properties of the Koopman operator. Annual Review of Fluid Mechanics, 45, 357–378. https://doi.org/10.1146/annurev-fluid-011212-140652
Morrill, J., Salvi, C., Kidger, P., & Foster, J. (2021). Neural rough differential equations for long time series. In Proceedings of the 38th International Conference on Machine Learning (pp. 7829–7838). PMLR. https://proceedings.mlr.press/v139/morrill21b.html
Murphy, K. P. (2012). Machine learning: A probabilistic perspective. MIT Press. https://mitpress.mit.edu/9780262018029/machine-learning-a-probabilistic-perspective/
Øksendal, B. (2003). Stochastic differential equations: An introduction with applications (6th ed.). Springer. https://doi.org/10.1007/978-3-642-14394-6
Pathak, J., Hunt, B., Girvan, M., Lu, Z., & Ott, E. (2018). Model-free prediction of large spatiotemporally chaotic systems from data: A reservoir computing approach. Physical Review Letters, 120(2), Article 024102. https://doi.org/10.1103/PhysRevLett.120.024102
Pavliotis, G. A. (2014). Stochastic processes and applications: Diffusion processes, the Fokker-Planck and Langevin equations. Springer. https://doi.org/10.1007/978-1-4939-1323-7
Purandare, R., Ratnaparkhi, A., & Bang, A. (2023). Resource optimization using the Taguchi technique for channel allocation. International Journal of Intelligent Systems and Applications in Engineering, 11(3s), 93–99. https://mail.ijisae.org/index.php/IJISAE/article/view/2535
Raissi, M., Perdikaris, P., & Karniadakis, G. E. (2019). Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics, 378, 686–707. https://doi.org/10.1016/j.jcp.2018.10.045
Risken, H. (1996). The Fokker-Planck equation: Methods of solution and applications (2nd ed.). Springer. https://doi.org/10.1007/978-3-642-61544-3
Rudy, S. H., Brunton, S. L., Proctor, J. L., & Kutz, J. N. (2017). Data-driven discovery of partial differential equations. Science Advances, 3(4), Article e1602614. https://doi.org/10.1126/sciadv.1602614
Sah, B. K., & Sahani, S. K. (2023). Development and characterization of a new linear exponential distribution for reliability analysis of complex systems. International Journal of Intelligent Systems and Applications in Engineering, 11(11s), 854–865. https://ijisae.org/index.php/IJISAE/article/view/7713
Sahani, S. K. (2019). Runge–Kutta-based dynamic simulation of a multi-degree-of-freedom vibrating system. International Journal of Intelligent Systems and Applications in Engineering, 7(4), 275–284. https://www.ijisae.org/index.php/IJISAE/article/view/7733
Sahani, S. K. (2020). Nonlinear dynamic analysis of multistory structures using Runge–Kutta integration. International Journal of Intelligent Systems and Applications in Engineering, 8(4), 375–385. https://www.ijisae.org/index.php/IJISAE/article/view/7732
Sahani, S. K. (2021). Differential equation on astrophysics: A fundamental approach to understanding cosmic structures and their dynamic evolution. Letters in High Energy Physics, 2021, 1–13. https://doi.org/10.52783/lhep.2021.1468
Sahani, S. K. (2022). A mathematical framework for incorporating neural networks into root-finding algorithms. Journal of Electrical Systems, 18(1), 99–109. https://doi.org/10.52783/jes.8947
Sahani, S. K. (2023). Application of numerical methods in structural health monitoring using IoT sensors. Journal of Electrical Systems, 19(1), 194–207. https://doi.org/10.52783/jes.8941
Sahani, S. K. (2023). Neural network surrogates for weather prediction using numerical solutions of the shallow water equations. International Journal of Intelligent Systems and Applications in Engineering, 11(3s), 356–368. https://doi.org/10.17762/ijisae.v11i3s.7686
Sahani, S. K. (2024). AI-enhanced finite element method (FEM) for structural analysis. Journal of Electrical Systems, 20(1), 661–676. https://doi.org/10.52783/jes.8946
Sahani, S. K., & Sah, D. K. (2022). A comprehensive study on predicting numerical integration errors using machine learning approaches. Letters in High Energy Physics, 2022, 96–103. https://doi.org/10.52783/lhep.2022.1465
Sahani, S. K., Oruganti, S. K., Kumar, K. S., Sahani, K., Pandey, B. K., & Pandey, D. (2025). Case study on mechanical and operational behavior in steel production: Performance and process behavior in steel manufacturing plant. Reports in Mechanical Engineering, 6(1), 180–197. https://doi.org/10.31181/rme496
Sahani, S. K., Raj, R. K., Sathishkumar, K., Keshava Murthy, G. N., Manojkumar, S. B., Naveen, K. B., Sah, B. K., Jayanthiladevi, A., Mandal, P., & Sahani, K. (2026). Mechanical process control and statistical process control for reducing butter-oil defects in industrial production. Reports in Mechanical Engineering, 7(1), 169–184. https://doi.org/10.31181/rme575
Särkkä, S., & Solin, A. (2019). Applied stochastic differential equations. Cambridge University Press. https://doi.org/10.1017/9781108186735
Schmid, P. J. (2010). Dynamic mode decomposition of numerical and experimental data. Journal of Fluid Mechanics, 656, 5–28. https://doi.org/10.1017/S0022112010001217
Strogatz, S. H. (2018). Nonlinear dynamics and chaos: With applications to physics, biology, chemistry, and engineering (2nd ed.). CRC Press. https://doi.org/10.1201/9780429492563
Takens, F. (1981). Detecting strange attractors in turbulence. In Dynamical systems and turbulence, Warwick 1980 (Lecture Notes in Mathematics, Vol. 898, pp. 366–381). Springer. https://doi.org/10.1007/BFb0091924
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems (Vol. 30, pp. 5998–6008). Neural Information Processing Systems Foundation. https://papers.neurips.cc/paper/7181-attention-is-all-you-need
Vlachas, P. R., Byeon, W., Wan, Z. Y., Sapsis, T. P., & Koumoutsakos, P. (2018). Data-driven forecasting of high-dimensional chaotic systems with long short-term memory networks. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 474(2213), Article 20170844. https://doi.org/10.1098/rspa.2017.0844
Wehmeyer, C., & Noé, F. (2018). Time-lagged autoencoders: Deep learning of slow collective variables for molecular kinetics. The Journal of Chemical Physics, 148(24), Article 241703. https://doi.org/10.1063/1.5011399
Wiggins, S. (2003). Introduction to applied nonlinear dynamical systems and chaos (2nd ed.). Springer. https://doi.org/10.1007/b97481
Williams, M. O., Kevrekidis, I. G., & Rowley, C. W. (2015). A data-driven approximation of the Koopman operator: Extending dynamic mode decomposition. Journal of Nonlinear Science, 25(6), 1307–1346. https://doi.org/10.1007/s00332-015-9258-5














