Deep Learning-Based Prediction of Insurance Claim Severity Using Structured and Unstructured Data

Authors

  • Yukti Lnu KForce Author

Keywords:

Deep Learning, Insurance Claim Severity, Multimodal Learning, Structured Data, Unstructured Text, Transformer Embeddings, Regression Modeling

Abstract

Accurate prediction of insurance claim severity remains a fundamental challenge in actuarial science due to nonlinear feature interactions, contextual variability, and heavy-tailed loss distributions. Traditional statistical models primarily rely on structured policy variables and often fail to capture semantic signals embedded in unstructured claim narratives. This study proposes a multimodal deep learning framework that integrates structured insurance data with contextual embeddings derived from claim descriptions to enhance severity regression performance. The proposed architecture employs independent encoding layers for structured and textual inputs, followed by an intermediate fusion mechanism that learns cross-modal feature interactions. Log transformation is applied to stabilize skewed loss distributions, and model training is conducted using a robust optimization strategy with cross-validation. The framework is benchmarked against classical actuarial models, tree-based ensemble methods, and structured-only neural networks. Experimental evaluation demonstrates that multimodal integration significantly improves predictive accuracy, reduces error dispersion, and enhances model calibration in high-severity cases. The results confirm that textual narratives provide incremental predictive value beyond traditional structured attributes. From a practical perspective, improved severity estimation supports more precise pricing, reserve allocation, and risk management decisions. This research establishes a scalable and production-oriented multimodal architecture for insurance analytics and highlights the strategic importance of combining structured and unstructured data in actuarial prediction systems.

Downloads

Download data is not yet available.

References

Y. Wei, K. Xu, J. Yao, M. Sun, and Y. Sun, "Financial risk analysis using integrated data and transformer-based deep learning," Journal of Computer Science and Software Applications, vol. 4, no. 7, pp. 1-8, 2024.

P. C. Reddi, T. S. Prajwal, V. Vyas, N. Kumari, and R. Kumar, "Intelligent Model-Fusion Based Insurance Claim Analysis," in 2025 5th International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), 2025: IEEE, pp. 1-6.

R. Alasmari and A. A. Alhogail, "Protecting smart-home IoT devices from MQTT attacks: An empirical study of ML-based IDS," IEEE Access, vol. 12, pp. 25993-26004, 2024.

S. Kremsner, A. Steinicke, and M. Szölgyenyi, "A deep neural network algorithm for semilinear elliptic PDEs with applications in insurance mathematics," Risks, vol. 8, no. 4, p. 136, 2020.

E. Seyam and M. A. M. Osman, "Physics-Informed Transformer Networks for Multi-Peril Insurance Pricing: A Novel Hybrid Computational Framework Integrating Actuarial Principles with Deep Attention Mechanisms," Statistics, Optimization & Information Computing, vol. 15, no. 3, pp. 2105-2132, 2026.

J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "Bert: Pre-training of deep bidirectional transformers for language understanding," in Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers), 2019, pp. 4171-4186.

X. Wang, "Medical Entity-Driven Analysis of Insurance Claims Using a Multimodal Transformer Model," Journal of Computer Technology and Software, vol. 4, no. 3, 2025.

S. Gajula, "Architectural transformation of legacy financial systems: a framework for microservices, cloud, and API integration," Int. J. Inform. Technol. Manag. Inform. Syst, vol. 16, no. 2, pp. 1201-1218, 2025.

J. Gao, P. Li, Z. Chen, and J. Zhang, "A survey on deep learning for multimodal data fusion," Neural computation, vol. 32, no. 5, pp. 829-864, 2020.

Downloads

Published

12-12-2025

How to Cite

[1]
Yukti Lnu, “Deep Learning-Based Prediction of Insurance Claim Severity Using Structured and Unstructured Data”, American J Cognit Comput AI Syst, vol. 9, pp. 127–147, Dec. 2025, Accessed: May 30, 2026. [Online]. Available: https://ajccai.org/index.php/publication/article/view/55