Clustering in General Insurance Pricing

doi:10.1017/CBO9781139342681.007

6 - Clustering in General Insurance Pricing

Published online by Cambridge University Press: 05 August 2016

Ji Yao

Edited by

Edward W. Frees ,

Glenn Meyers and

Richard A. Derrig

Show author details

Ji Yao: Affiliation:
University of Kent
Edward W. Frees: Affiliation:
University of Wisconsin, Madison
Glenn Meyers: Affiliation:
ISO Innovative Analytics, New Jersey
Richard A. Derrig: Affiliation:
Temple University, Philadelphia

Book contents

Get access

Summary

Introduction

Clustering is the unsupervised classification of patterns into groups (Jain et al., 1999). It is widely studied and applied in many areas including computer science, biology, social science and statistics. A significant number of clustering methods have been proposed in Berkhin (2006), Filippone et al. (2008), Francis (2014), Han et al. (2001), Jain et al. (1999), Luxburg (2007), and Xu and Wunsch (2005). In the context of actuarial science, Guo (2003), Pelessoni and Picech (1998), and Sanche and Lonergan (2006) studied some possible applications of clustering methods in insurance. As to the territory ratemaking, Christopherson and Werland (1996) considered the use of geographical information systems. Athorough analysis of the application of clustering methods in insurance ratemaking is not known to the author.

The purpose of this chapter is twofold. The first part of the chapter will introduce the typical idea of clustering and state-of-the-art clustering methods with their application in insurance data. To facilitate the discussion, an insurance dataset is introduced before the discussion of clustering methods. Due to the large number of methods, it is not intended to give a detailed review of every clustering methods in the literature. Rather, the focus is on the key ideas of each methods, and more importantly their advantages and disadvantages when applied in insurance ratemaking.

In the second part, a clustering method called the exposure-adjusted hybrid (EAH) clustering method is proposed. The purpose of this section is not to advocate one certain clustering method but to illustrate the general approach that could be taken in territory clustering. Because clustering is subjective, it is well recognized that most details should be modified to accommodate the feature of the dataset and the purpose of the clustering.

The remainder of the chapter proceeds as follows. Section 6.2 introduces clustering and its application in insurance ratemaking. Section 6.3 introduces a typical insurance dataset that requires clustering analysis on geographic information. Section 6.4 reviews clustering methods and their applicability in insurance ratemaking. Section 6.5 proposes the EAH clustering method and illustrates this method step by step using U.K. motor insurance data with the results presented in Section 6.8. Section 6.7 discusses some other considerations, and conclusions are drawn in Section 6.8. Some useful references are listed in Section 6.9.

Type: Chapter
Information: Predictive Modeling Applications in Actuarial Science , pp. 159 - 179

DOI: https://doi.org/10.1017/CBO9781139342681.007 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2016

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Anderson, D., S., Feldblum, C., Modlin, D., Schirmacher, E., Schirmacher, and N., Thandi. A practitioner's guide to generalized linear models. Casualty Actuarial Society Discussion Paper Program, 2004.

Berkhin, P. Survey of clustering data mining techniques. Technical Report, Accrue Software, 2006.

Christopherson, S., and D. L., Werland. Using a Geographic Information System to identify territory boundaries. Casualty Actuarial Society Forum, Winter 1996.

Filippone, M., F., Camastra, F., Masulli, and S., Rovetta. A survey of kernel and spectral methods for clustering. Pattern Recognition, 41(1): 176–190, 2008.Google Scholar

Francis, L. Unsupervised learning. In E. W., Frees, G., Meyers, and R. A., Derrig (eds.), Predictive Modeling Applications in Actuarial Science: Volume 1, Predictive Modeling Techniques, pp. 280–311. Cambridge University Press, Cambridge, 2014.

Guo, L. Appling data mining techniques in property/casualty insurance. Casualty Actuarial Society Forum, Winter 2003.

Han, J., M., Kamber, and A. K. H., Tung. Spatial clustering methods in data mining: A survey. In Miller, H., and J., Han (eds.), Geographic Data Mining and Knowledge Discovery. Taylor and Francis, London, 2001.

Jain, A. K., M. N., Murty, and P. J., Flynn. Data clustering: A review. ACM Comput. Surveys, 31: 264–323, 1999.Google Scholar

Luxburg, U. V. A tutorial on spectral clustering. Statistics and Computing, 17(4): 395–416, 2007.Google Scholar

Pelessoni, R., and L., Picech. Some applications of unsupervised neural networks in rate making procedure. 1998 General Insurance Convention & ASTIN Colloquium, 1998.

Sanche, R., and K., Lonergan. Variable reduction for predictive modeling with clustering. Casualty Actuarial Society Forum, Winter 2006.

Xu, R., and D., Wunsch. Survey of clustering algorithms. IEEE Trans. Neural Networks, 16: 645–678, 2005.Google Scholar

Book contents

6 - Clustering in General Insurance Pricing

Summary

Access options

References

Save book to Kindle

Save book to Dropbox

Save book to Google Drive