Spline-Based Approaches for Functional Data Representation and Estimation
DOI:
https://doi.org/10.31185/bsj.Vol21.Iss35.1252Keywords:
Functional Data Analysis, Splines, B-Splines, Penalized Regression, Smoothing Parameter Selection, Simulation Study, Python ImplementationAbstract
Functional data analysis (FDA) treats observations as realizations of smooth processes evolving or space. In this work, we focus on spline representations because they capture broad trends while accommodating local departures in a controlled way. We present an end-to-end workflow—implemented in Python/Colab—that prepares raw series (cleaning, interpolation, and registration on a common grid), expands them in cubic B-spline bases, and estimates coefficients under a penalized least-squares criterion. Smoothing is chosen automatically (generalized cross-validation with a one-standard-error rule) to balance fidelity and regularity. A simulation with noisy signals confirms that the estimator recovers the latent trajectory, and a real-data illustration shows how fitted curves and basis weights aid interpretation. The pipeline is transparent, reproducible, and adaptable, and is relevant to applications in health, environmental monitoring, and finance.
References
Chen, Y., Wang, X., & Zhao, Z. (2019). Spline regression models for functional data: A review. Journal of Statistical Planning and Inference, 202, 1–15. https://doi.org/10.1016/j.jspi.2018.12.001
de Boor, C. (2001). A practical guide to splines (Rev. ed.). Springer.
Di, C., Crainiceanu, C. M., Caffo, B. S., & Punjabi, N. M. (2019). Multilevel functional principal component analysis. Annals of Applied Statistics, 13(2), 1128–1152. https://doi.org/10.1214/18-AOAS1223
Eilers, P. H. C., & Marx, B. D. (1996). Flexible smoothing with B-splines and penalties. Statistical Science, 11(2), 89–121. https://doi.org/10.1214/ss/1038425655
Ferraty, F., & Vieu, P. (2006). Nonparametric functional data analysis: Theory and practice. Springer. https://doi.org/10.1007/0-387-36620-2
Geller, N. L., Müller, H. G., & Wang, J. L. (2021). Recent advances in functional data analysis for biomedical applications. Statistics in Medicine, 40(1), 127–149. https://doi.org/10.1002/sim.8799
Goldsmith, J., & Schwartz, J. E. (2020). A user’s guide to functional data analysis. Annual Review of Statistics and Its Application, 7(1), 287–311. https://doi.org/10.1146/annurev-statistics-031219-041101
Liu, R., Chen, K., & Müller, H. G. (2020). Spline-based approaches for multivariate functional data. Journal of Multivariate Analysis, 176, 104566. https://doi.org/10.1016/j.jmva.2019.104566
Malfait, N., & Ramsay, J. O. (2003). The historical functional linear model. The Canadian Journal of Statistics, 31(2), 115–128. https://doi.org/10.2307/3316063
Nason, G. P. (2008). Wavelet methods in statistics with R. Springer. https://doi.org/10.1007/978-0-387-75961-6
Ramsay, J. O., & Silverman, B. W. (2005). Functional data analysis (2nd ed.). Springer. https://doi.org/10.1007/b98888
Ruppert, D., Wand, M. P., & Carroll, R. J. (2003). Semiparametric regression. Cambridge University Press. https://doi.org/10.1017/CBO9780511755453
Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., … & Yutani, H. (2019). Welcome to the tidyverse. Journal of Open Source Software, 4(43), 1686. https://doi.org/10.21105/joss.01686
Wood, S. N. (2017). Generalized additive models: An introduction with R (2nd ed.). CRC Press. https://doi.org/10.1201/9781315370279
Zhou, S., Shen, X., & Wolfe, D. A. (1998). Local asymptotics for regression splines and confidence regions. The Annals of Statistics, 26(5), 1760–1782. https://doi.org/10.1214/aos/1024691251
Downloads
Published
Issue
Section
License
Copyright (c) 2026 ساهر علي خضر أسود الحرداني جامعة محقق أردبيلي، إيران/ كلية العلوم/ قسم الرياضيات

This work is licensed under a Creative Commons Attribution 4.0 International License.