Spline-Based Approaches for Functional Data Representation and Estimation

Authors

  • Saher Ali Khader Aswad Al-Hardani Mohaghegh Ardabili University, Iran/ College of Science/ Department of Mathematics .

DOI:

https://doi.org/10.31185/bsj.Vol21.Iss35.1252

Keywords:

Functional Data Analysis, Splines, B-Splines, Penalized Regression, Smoothing Parameter Selection, Simulation Study, Python Implementation

Abstract

       Functional data analysis (FDA) treats observations as realizations of smooth processes evolving or space. In this work, we focus on spline representations because they capture broad trends while accommodating local departures in a controlled way. We present an end-to-end workflow—implemented in Python/Colab—that prepares raw series (cleaning, interpolation, and registration on a common grid), expands them in cubic B-spline bases, and estimates coefficients under a penalized least-squares criterion. Smoothing is chosen automatically (generalized cross-validation with a one-standard-error rule) to balance fidelity and regularity. A simulation with noisy signals confirms that the estimator recovers the latent trajectory, and a real-data illustration shows how fitted curves and basis weights aid interpretation. The pipeline is transparent, reproducible, and adaptable, and is relevant to applications in health, environmental monitoring, and finance.

References

Chen, Y., Wang, X., & Zhao, Z. (2019). Spline regression models for functional data: A review. Journal of Statistical Planning and Inference, 202, 1–15. https://doi.org/10.1016/j.jspi.2018.12.001

de Boor, C. (2001). A practical guide to splines (Rev. ed.). Springer.

Di, C., Crainiceanu, C. M., Caffo, B. S., & Punjabi, N. M. (2019). Multilevel functional principal component analysis. Annals of Applied Statistics, 13(2), 1128–1152. https://doi.org/10.1214/18-AOAS1223

Eilers, P. H. C., & Marx, B. D. (1996). Flexible smoothing with B-splines and penalties. Statistical Science, 11(2), 89–121. https://doi.org/10.1214/ss/1038425655

Ferraty, F., & Vieu, P. (2006). Nonparametric functional data analysis: Theory and practice. Springer. https://doi.org/10.1007/0-387-36620-2

Geller, N. L., Müller, H. G., & Wang, J. L. (2021). Recent advances in functional data analysis for biomedical applications. Statistics in Medicine, 40(1), 127–149. https://doi.org/10.1002/sim.8799

Goldsmith, J., & Schwartz, J. E. (2020). A user’s guide to functional data analysis. Annual Review of Statistics and Its Application, 7(1), 287–311. https://doi.org/10.1146/annurev-statistics-031219-041101

Liu, R., Chen, K., & Müller, H. G. (2020). Spline-based approaches for multivariate functional data. Journal of Multivariate Analysis, 176, 104566. https://doi.org/10.1016/j.jmva.2019.104566

Malfait, N., & Ramsay, J. O. (2003). The historical functional linear model. The Canadian Journal of Statistics, 31(2), 115–128. https://doi.org/10.2307/3316063

Nason, G. P. (2008). Wavelet methods in statistics with R. Springer. https://doi.org/10.1007/978-0-387-75961-6

Ramsay, J. O., & Silverman, B. W. (2005). Functional data analysis (2nd ed.). Springer. https://doi.org/10.1007/b98888

Ruppert, D., Wand, M. P., & Carroll, R. J. (2003). Semiparametric regression. Cambridge University Press. https://doi.org/10.1017/CBO9780511755453

Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., … & Yutani, H. (2019). Welcome to the tidyverse. Journal of Open Source Software, 4(43), 1686. https://doi.org/10.21105/joss.01686

Wood, S. N. (2017). Generalized additive models: An introduction with R (2nd ed.). CRC Press. https://doi.org/10.1201/9781315370279

Zhou, S., Shen, X., & Wolfe, D. A. (1998). Local asymptotics for regression splines and confidence regions. The Annals of Statistics, 26(5), 1760–1782. https://doi.org/10.1214/aos/1024691251

Downloads

Published

2026-03-01

Issue

Section

Articles