Published online by Cambridge University Press: 05 August 2016
Introduction
Clustering is the unsupervised classification of patterns into groups (Jain et al., 1999). It is widely studied and applied in many areas including computer science, biology, social science and statistics. A significant number of clustering methods have been proposed in Berkhin (2006), Filippone et al. (2008), Francis (2014), Han et al. (2001), Jain et al. (1999), Luxburg (2007), and Xu and Wunsch (2005). In the context of actuarial science, Guo (2003), Pelessoni and Picech (1998), and Sanche and Lonergan (2006) studied some possible applications of clustering methods in insurance. As to the territory ratemaking, Christopherson and Werland (1996) considered the use of geographical information systems. Athorough analysis of the application of clustering methods in insurance ratemaking is not known to the author.
The purpose of this chapter is twofold. The first part of the chapter will introduce the typical idea of clustering and state-of-the-art clustering methods with their application in insurance data. To facilitate the discussion, an insurance dataset is introduced before the discussion of clustering methods. Due to the large number of methods, it is not intended to give a detailed review of every clustering methods in the literature. Rather, the focus is on the key ideas of each methods, and more importantly their advantages and disadvantages when applied in insurance ratemaking.
In the second part, a clustering method called the exposure-adjusted hybrid (EAH) clustering method is proposed. The purpose of this section is not to advocate one certain clustering method but to illustrate the general approach that could be taken in territory clustering. Because clustering is subjective, it is well recognized that most details should be modified to accommodate the feature of the dataset and the purpose of the clustering.
The remainder of the chapter proceeds as follows. Section 6.2 introduces clustering and its application in insurance ratemaking. Section 6.3 introduces a typical insurance dataset that requires clustering analysis on geographic information. Section 6.4 reviews clustering methods and their applicability in insurance ratemaking. Section 6.5 proposes the EAH clustering method and illustrates this method step by step using U.K. motor insurance data with the results presented in Section 6.8. Section 6.7 discusses some other considerations, and conclusions are drawn in Section 6.8. Some useful references are listed in Section 6.9.
To save this book to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Find out more about the Kindle Personal Document Service.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.