Hostname: page-component-cd9895bd7-8ctnn Total loading time: 0 Render date: 2025-01-03T16:10:22.828Z Has data issue: false hasContentIssue false

Stochastic Modeling of Multidimensional Particle Properties Using Parametric Copulas

Published online by Cambridge University Press:  23 April 2019

Orkun Furat*
Affiliation:
Institute of Stochastics, Ulm University, D-89069 Ulm, Germany
Thomas Leißner
Affiliation:
Institute of Mechanical Process Engineering and Mineral Processing, Technische Universität Bergakademie Freiberg, D-09599 Freiberg, Germany
Kai Bachmann
Affiliation:
Helmholtz Institute Freiberg for Resource Technology, Helmholtz-Zentrum Dresden-Rossendorf, D-01328 Dresden, Germany
Jens Gutzmer
Affiliation:
Helmholtz Institute Freiberg for Resource Technology, Helmholtz-Zentrum Dresden-Rossendorf, D-01328 Dresden, Germany
Urs Peuker
Affiliation:
Institute of Mechanical Process Engineering and Mineral Processing, Technische Universität Bergakademie Freiberg, D-09599 Freiberg, Germany
Volker Schmidt
Affiliation:
Institute of Stochastics, Ulm University, D-89069 Ulm, Germany
*
*Author for correspondence: Orkun Furat, E-mail: [email protected]

Abstract

In this paper, prediction models are proposed which allow the mineralogical characterization of particle systems observed by X-ray micro tomography (XMT). The models are calibrated using 2D image data obtained by a combination of scanning electron microscopy and energy dispersive X-ray spectroscopy in a planar cross-section of the XMT data. To reliably distinguish between different minerals the models are based on multidimensional distributions of certain particle characteristics describing, for example, their size, shape, and texture. These multidimensional distributions are modeled using parametric Archimedean copulas which are able to describe the correlation structure of complex multidimensional distributions with only a few parameters. Furthermore, dimension reduction of the multidimensional vectors of particle characteristics is utilized to make non-parametric approaches such as the computation of distributions via kernel density estimation viable. With the help of such distributions the proposed prediction models are able to distinguish between different types of particles among the entire XMT image.

Type
Software and Instrumentation
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © Microscopy Society of America 2019

Introduction

Processes which separate mixtures of particles based on criteria such as size, shape, or chemical composition of particles are used in many applications. Often the separation quality can have a critical influence on subsequent processing steps. For example, in the mining industry an essential step of ore dressing deals with separation processes which remove unwanted minerals from a system of particles with sizes smaller than 1 mm such that minerals of interest remain. In order to evaluate the separation success a mineralogical characterization of the particles prior and after separation is necessary. Common methods to characterize particles with respect to size, shape, and composition rely on two-dimensional (2D) techniques describing a three-dimensional (3D) reality. For example, a combination of scanning electron microscopy (SEM) and energy dispersive X-ray spectroscopy (EDS) can be used (Sunderland & Gottlieb, Reference Sunderland and Gottlieb1991) to achieve a mineralogical characterization of ore samples. However, SEM-EDS is limited to 2D profiles of samples, which makes the characterization of a whole 3D sample rather expensive or even impossible—at least in a non-destructive manner. Another approach, namely X-ray micro tomography (XMT) provides 3D image data depicting the morphology of particles in a sample. This allows a quantitative analysis of particle systems without the stereological bias of 2D measurement techniques (Wang et al., Reference Wang, Lin and Miller2017; Reyes et al., Reference Reyes, Lin, Cilliers and Neethling2018). Recently, Su & Yan (Reference Su and Yan2018) characterized sand particles observed in XMT with various shape descriptors and utilized spherical harmonic functions for both a parametric representation of single particles and a basis for stochastic modeling, see also Feinauer et al. (Reference Feinauer, Brereton, Spettl, Weber, Manke and Schmidt2015). Besides the morphology of particles, XMT provides information about local material specific constants. More precisely, the grayscale values of XMT images are related to the mass density of the observed material. When the considered minerals have distinct mass densities, the contrast-information from XMT can be utilized to distinguish between them. However, Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018) showed that grayscale values suffice only in a limited way to characterize the observed material in XMT data.

Therefore, for achieving a 3D mineralogical characterization of a sample it is necessary to use more information from the XMT data than the grayscale values alone. If we know, for example, that particles composed from a certain type of mineral are more spherical than others, we can utilize the additional information about their shape, which can also be obtained from XMT, to characterize them.

In the present paper, we consider a sample of milled greisen-type Li-ore which mainly comprises the minerals zinnwaldite and quartz, though these particles can have imperfections consisting of further components such as topaz, muscovite, kaolinite, and others, see Leißner et al. (Reference Leißner, Bachmann, Gutzmer and Peuker2016). Details on the sample preparation as well as on the analysis of the epoxy block by XMT and SEM-EDS can be found in Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018). We propose prediction models which can characterize 3D particles from XMT based on their size, shape, and grayscale values. They can reliably distinguish between consolidated particles which are dominated by either quartz or zinnwaldite grains. The calibration of these prediction models requires SEM-EDS data from only one planar cross-section of the sample. In order to be able to compute size and shape characteristics of XMT image data a particle-wise segmentation of the image data is required, such that it is possible to identify single particles in the XMT image. First, we give a short overview of the image processing steps which were necessary to obtain such a segmentation of our image data. Furthermore, we present several characteristics which can describe the size, shape, and grayscale texture of particles. The mineralogical characterization of 3D particles from XMT images will be conducted based on these characteristics. In addition to the XMT image, we have SEM-EDS data of the same sample, which lies in a planar cross-section of the XMT image. By utilizing the mineralogical characterization of SEM-EDS we know the mineralogical composition of the 3D particles that hit this cross-section. Therefore, we are able to link the size, shape, and grayscale characteristics of these 3D particles to different minerals. The prediction models, proposed in the present paper, require the probability density functions of the considered particle characteristics for each type of mineral. Since multiple characteristics are necessary to characterize particles, because only grayscale information of the particles does not suffice for characterization, we need multidimensional probability densities of the considered characteristics. An easy way to determine probability densities are the so-called kernel density estimators (Scott, Reference Scott2015), since they do not require the search for a suitable parametric family of distributions (e.g., normal distribution, exponential distribution, etc.). However, due to the “curse of dimensionality,” kernel density estimators require huge sample sizes for determining multidimensional densities which can limit their practicality. Therefore, we rigorously show how to determine multidimensional densities by fitting parametric families of distributions to the data. At first we fit one-dimensional (1D) parametric densities to single characteristics and, in a second step, we use the so-called Archimedean copulas (Nelsen, Reference Nelsen2006) to determine joint densities of the considered particle characteristics. As an alternative, we utilize dimension reduction of our multidimensional particle characteristics, in order to make kernel density estimation more viable. With the help of the fitted or estimated multidimensional densities, we propose prediction models which misclassify only 3–6% of composite zinnwaldite and quartz particles. In a further step, we validate the prediction models by comparing their predictions with the characterization of SEM-EDS data at a spatially different cross-section which was not used for the calibration of the prediction models.

The rest of this paper is organized as follows. To begin with, we briefly recall some techniques of image processing and segmentation which we have recently used in Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018). Then, various particle characteristics are explained, which are stochastically modeled with copulas later on. Afterwards, we deal with dimension reduction of data and kernel density estimation, as an alternative to the copula approach. In the “Results and Discussion” section classification models utilizing either copulas or kernel density estimation are compared and validated.

Materials and Methods

Image Processing

3D XMT Data Preprocessing and Segmentation

In this section, we give a short overview of the image preprocessing steps and the particle-wise segmentation of the 3D XMT data considered in the present paper; for more details see Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018). The first step consists of the reduction of noise in the XMT data with a non-local means denoising algorithm, see Buades et al. (Reference Buades, Coll and Morel2005). In contrast to the commonly used Gaussian filter, this nonlinear image filter has the advantage of smoothing homogeneous regions while preserving edges. In order to separate the particles from the background we use the local adaptive Sauvola thresholding algorithm, which determines binarization thresholds for each voxel based on its local neighborhood, see Shafait et al. (Reference Shafait, Keysers and Breuel2008). Such local thresholding techniques can produce better results than global thresholds, due to globally inconsistent grayscale values in XMT images. One crucial and nontrivial task in image processing is the particle-wise segmentation, which allows the extraction of single particles from the image data for quantitative analysis. In a first step, we use a marker-based watershed algorithm, see Spettl et al. (Reference Spettl, Wimmer, Werz, Heinze, Odenbach, Krill and Schmidt2015), to obtain an initial segmentation. This results in some particles being wrongly separated into multiple fragments. To remove such oversegmentations, we train a neural network (Hastie et al., Reference Hastie, Tibshirani and Friedman2009) to decide whether adjacent segments should be merged or not. To make this decision the neural network is supplied with information regarding the local morphology and grayscale values around two adjacent fragments. By applying the neural network as a post-processing step on the initial segmentation, we receive our final segmentation, see Figure 1b.

Fig. 1. a: Volumetric XMT data (gray) with two registered 2D SEM-EDS planes (blue and red). b: Cut-out of the particle-wise segmentation of the XMT data.

2D SEM-EDS Data and Registration

In addition to the 3D XMT image data, SEM-EDS data were considered by Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018). The latter were obtained with a mineral liberation analyzer from the same sample at two spatially different planar sections. An illustration of the SEM-EDS data can be seen in Figure 2a. It provides 2D information about the morphology of the particles in planar sections, but in contrast to the XMT data, see Figure 2b, it also provides information about the mineralogical composition of particles. For example, in the false color image of Figure 2a the blue phase indicates zinnwaldite.

Fig. 2. a: 2D SEM-EDS image, the blue phase indicates zinnwaldite and pink indicates quartz. b: The corresponding section registered in the 3D XMT data.

Thus, by localizing the 2D SEM-EDS data in the segmented 3D image (registration), see Figure 1a, we have knowledge about the mineralogical composition of each 3D particle that intersects with the planar SEM-EDS data. Due to the particle-wise segmentation described in the “3D XMT Data Preprocessing and Segmentation” section we also know the 3D morphology of each of these particles. Furthermore, we can link the particle-wise segmentation to the grayscale values of the 3D XMT image, since the grayscale values provide valuable information about the composition of particles, as shown in Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018). Therefore, both the grayscale values, and the morphology of particles in the XMT image that hit the SEM-EDS plane, allow us to adjust a classifier that can predict the mineralogical composition of an arbitrary particle solely based on information gained from the XMT image.

To be precise, let I : W ⊂ ℤ3 → [0, 1] be the grayscale XMT image, where W is a cuboid observation window. The 2D SEM-EDS data can be regarded as a map $L:H\cap W\subset { {\opf Z}}^3\to \lcub {0,1,2, \ldots} \rcub $, where H is a set of voxels representing a plane and the values of L indicate different minerals or the background, e.g., we put L (x) = 0 if background was observed at $x\in H\cap W$, or L (x) = 1 if zinnwaldite was observed at $x\in H\cap W$. Furthermore, let P 1, …, P n ⊂ W ⊂ ℤ3 be the sets of voxels corresponding to the particles that are obtained by the particle-wise segmentation of the 3D XMT image and that hit the plane H, i.e., $P_k\cap H = \lcub {x_1^{\lpar k \rpar }, \ldots, x_{\ell_k}^{\lpar k \rpar }} \rcub \ne \emptyset $ for each k = 1, …, n. Each particle P k gets the label

(1)$$L(P_k) = {\rm mode}\,\left( {L\left( {x_1^{( k ) }} \right) , \ldots, L\left( {x_{\ell_k}^{\lpar k \rpar }} \right) } \right) ,$$

where the mode of the sample of labels $\lpar {L\lpar {x_1^{\lpar k \rpar }} \rpar , \ldots, L\lpar {x_{\ell_k}^{\lpar k \rpar }} \rpar } \rpar $ is the label that appears most often. Thus, we assign to each particle P k the mineral that is observed most frequently in the intersecting plane H. This means, for example, that a composite particle P k which consists of zinnwaldite, quartz, and other minerals will be referred to as a zinnwaldite-composite particle if its main component in the intersection $P_k\cap H$ is zinnwaldite. Since we make this labeling only based on the mineralogical observation in the plane H we assume stationarity of the mineralogical composition of the particles P k, i.e., that the composition of a particle P k outside of the plane H is adequately represented by $P_k\cap H$. In our data, the set of particles observed in the plane H can be decomposed in three disjoint sets:

(2)$$\lcub {P_1, \ldots, P_n} \rcub = Z\cup Q\cup O$$

where Z = {P k : L(P k) = 1} are the zinnwaldite-composite particles and Q = {P k : L(P k) = 2} are the quartz-composite particles. The set O = {P k : L(P k) ≥ 3} contains the remaining particles that were observed in H. In the considered plane, H contains 861 particles, from which 342 belong to the set of zinnwaldite-composite particles Z and 462 to the set of quartz-composite particles Q. Since the set O contains only 57 particles, we disregard these observations from now on. Note that due to the labeling performed according to equation (1), a particle P ∈ Z does not necessarily consist solely of zinnwaldite. Figure 3a shows the distribution of the zinnwaldite volume fraction for particles labeled as zinnwaldite-composite particles in the cross-section $P\cap H$. Analogously, the quartz particles P ∈ Q are consolidated, see Figure 3b. The goal of the present paper is to develop a method which allows us to predict the main mineralogical component L(P) of particles when the additional planar SEM-EDS data are not available.

Fig. 3. a: Histogram of the zinnwaldite volume fractions for particles for which the majority mineral phase is zinnwaldite. The mean zinnwaldite volume fraction is 0.925 with a variance of 0.018. b: Histogram of the quartz volume fractions for particles for which the majority mineral phase is quartz. The mean quartz volume fraction is 0.937 with a variance of 0.013.

Particle Characteristics

In the “Stochastic Modeling of Particle Characteristics” section, we will describe a method which allows us to determine the mineralogical characterization of particles solely based on XMT information. This will be done by prediction models, which determine the mineralogical composition of a particle based on grayscale values and shape/size characteristics obtained from XMT data and its particle-wise segmentation. In order to adjust such a prediction model we use the ground truth information from a given planar SEM-EDS section, i.e., the sets of particles Z and Q, for which we know that they consist mostly of zinnwaldite and quartz, respectively. After adjusting the prediction model, we still used another spatially separated SEM-EDS section to validate the prediction model. In order to make such predictions, we first introduce some particle descriptors, by which a particle P ⊂ ℤ3 will be characterized.

Size Characteristics

Relevant size descriptors of a particle P′ ⊂ ℝ3, which we only observe on the grid as P ⊂ ℤ3, are the volume ν 3(P′) and the surface area a(P′). It is clear that the volume can be estimated by counting voxels belonging to the set P and the surface area can be estimated by considering suitably defined voxel configurations, see Schladitz et al. (Reference Schladitz, Ohser, Nagel, Kuba, Nyúl and Palágyi2006). We denote the estimated particle volume and surface area by ν 3(P) and a(P), respectively.

For our prediction models we use the following estimator of the volume equivalent radius

(3)$$r(P) = \root \scriptstyle{\vskip3pt \hskip-1.5pt 3} \of {\displaystyle{3 \over {4\pi}} \nu _3(P)}, $$

which is in a one-to-one relationship with the volume. The surface area a(P) will not be directly regarded as a particle characteristic, but it is required for the sphericity factor which is a shape characteristic considered in the next section.

Shape Characteristics

In the previous section, we only considered the volume equivalent radius r as a size characteristic, but this is not enough to reliably distinguish between zinnwaldite- and quartz-composite particles, since particles of similar sizes can have very different appearance. Therefore, we examine some further characteristics for describing the shape of particles. One of the shape characteristics we consider is the sphericity factor

(4)$$s(P) = \displaystyle{{\root \scriptstyle{\vskip1pt \hskip-1pt 3} \of {36\pi \nu _3^2 \,(P)}} \over {a(P)}},$$

which takes values in [0, 1], where s(P′) = 1 holds if P′ ⊂ ℝ3 is a sphere.

Another shape characteristic of interest is the convexity factor

(5)$$c(P) = \displaystyle{{\nu _3(P)} \over {\nu _3(q(P))}},$$

where q(P) ⊂ ℤ3 is the convex hull of P on the lattice ℤ3. Like the sphericity factor, the convexity factor takes values in [0, 1], where c(P) = 1 holds if and only if P is convex on ℤ3. There is a causality between the sphericity factor s and the convexity factor c since more spherically shaped particles also have higher convexity factors. Yet, a convexity factor of 1 does not necessarily imply a large sphericity factor, since, for example, ellipsoids have always a convexity factor of 1, yet their sphericity factor can be arbitrarily close to 0. Since we observe a large number of flat particles in our image data, we also consider characteristics that quantify elongation of particles. For each particle P ⊂ ℤ3 with the barycenter $\bar{x} = \,(\bar{x}_1,\bar{x}_3,\bar{x}_3)\in {\opf R}^3$, whose coordinates are given by

(6)$$\bar{x}_i = \displaystyle{1 \over {\nu _3(P)}}\mathop \sum \limits_{x\in P} x_i\quad {\rm for}\;i = 1,2,3,$$

where x = (x 1, x 2, x 3), we consider the (positive-semidefinite) covariance matrix:

(7)$$C = \left( {\displaystyle{1 \over {\nu_3(P)}}\mathop \sum \limits_{x\in P} (x_i-{\bar{x}}_i)(x_j-{\bar{x}}_j)} \right)_{i,j = 1,2,3}.$$

The eigenvalues 0 ≤ λ 1 ≤ λ 2 ≤ λ 3 of C, where we assume that λ 3 >0, are closely related to the axis lengths of the best fitting ellipsoid ε(P) corresponding to the particle P, i.e., the axis lengths a 1, a 2, a 3 of ε(P) are given by $a_i = \gamma \sqrt {\lambda _i} $ for i = 1, 2, 3, where γ is some scaling factor. The elongation factor e(P) of the particle P is then given by

(8)$$e(P) = \displaystyle{{a_1} \over {a_3}} = \sqrt {\displaystyle{{\lambda _1} \over {\lambda _3}}}. $$

Note that, like the previously considered characteristics, the elongation factor e(P) is normalized and takes values in [0, 1], where values close to 0 indicate elongated particles such as rod- or plate-shaped particles. By additionally analyzing the relationship between λ 2 and λ 3 it is possible to distinguish between rods and plates, but since we observed no rod-like particles in our data set, this was not necessary in the present paper.

Grayscale Characteristics

The XMT data do not only provide information about the 3D morphology of particles. In Furat et al. (Reference Furat, Leißner, Ditscherlein, Sedivy, Weber, Bachmann, Gutzmer, Peuker and Schmidt2018), we have seen that the grayscale values of XMT images contain substantial information about the mineralogical composition of particles. Recall that the grayscale values of a particle P = {x 1, …, x n} ⊂ W are given by y 1 = I(x 1), …, y n = I(x n) ∈ [0, 1], where I is the XMT image. Furthermore, we assume without loss of generality that the grayscale values are ordered, i.e., y 1 ≤ y 2 ≤ … ≤ y n. This allows us to define the following grayscale characteristics.

A natural choice could be the mean

(9)$$\bar{y} = \displaystyle{1 \over n}\mathop \sum \limits_{k = 1}^n y_k.$$

But, since particles often have some imperfections, such as small regions of high density, see Figure 2, the mean is not well suited for representing the dominant grayscale value. Therefore, we used the more robust median

(10)$$m\,(P) = y_{0.5}$$

as grayscale characteristic, where the median is defined by the empirical quantiles of the sample

(11)$$y_p = \left\{ {\matrix{ {y_{[ {np} ] + 1},} \hfill & {\;{\rm if}\;np\in {\rm {\opf N}},} \hfill \cr {\displaystyle{{(y_{[ {np} ] } + y_{[{np} ] + 1})} \over 2},} \hfill & {\;{\rm if}\;np\notin {\rm {\opf N}},} \hfill \cr}} \right.$$

for p = 0.5. A natural choice for measuring the variability of grayscale values of a particle could be the sample standard deviation

(12)$$\sigma = \sqrt {\displaystyle{1 \over {n-1}}\mathop \sum \limits_{k = 1}^n \,{(y_k-\bar{y})}^2}. $$

Similarly to the mean, the standard deviation is not a robust particle descriptor. Therefore, we use the interquartile range

(13)$${\rm iqr}\,(P) = y_{0.75}-y_{0.25}$$

to characterize the variability of the grayscale values of the dominant grayscale region of the particle. Further characteristics that describe the texture of particles (Shapiro & Stockman, Reference Shapiro and Stockman2001) will be investigated in a forthcoming paper.

Stochastic Modeling of Particle Characteristics

In this section, we fit parametric distributions to the particle characteristics discussed above. Moreover, we fit parametric distributions separately for zinnwaldite- and quartz-composite particles, since the mineralogical composition of particles that intersect with the SEM-EDS data is known by the labeling considered in equation (1). In Figure 4, the fitted 1D distributions of the median grayscale value for both zinnwaldite and quartz are shown. Note that the large variance of the median grayscale values of zinnwaldite-composite particles, in comparison with the median grayscale values of quartz-composite particles, can have multiple reasons. One reason might be the variable chemical composition of zinnwaldite, see Rieder et al. (Reference Rieder, Huka, Kučerová, Minařík, Obermajer and Povondra1970), which can lead to said increase of the variance. Another reason for a broader median distribution is partial volume effects of XMT data (Boas & Fleischmann, Reference Boas and Fleischmann2012). More precisely, Figure 5 indicates that zinnwaldite-composite particles have a smaller sphericity than quartz-composite particles, which in turn implies that the former have a relatively large surface area or more surface voxels in the XMT image. Due to partial volume effects, the grayscale values of such surface voxels are influenced by neighboring particles or the background. Nevertheless, we can see in Figure 4 that both minerals have quite distinct distributions. Still, there are overlapping regions which make a classification merely based on these distributions difficult. To be more precise, let $f_{m}^{\rm z},f_m^{\rm q}$ be the fitted probability density functions of the particle-wise median grayscale values of zinnwaldite and quartz, respectively. Using a likelihood approach for classification, we say that a particle P ⊂ ℤ3 with median grayscale value x = m(P) ∈ [0, 1] is mainly composed of zinnwaldite if

(14)$$f_m^{\rm z} (x) \gt f_m^{\rm q} (x),$$

otherwise we say that P is a quartz-composite particle. The probability of misclassifying zinnwaldite-composite particles is then given by

(15)$${\rm {\opf P}}\lpar {\,f_m^{\rm z} \,(X) \le f_m^{\rm q} \,(X)} \rpar = \int_0^1 {\,f_m^{\rm z} \,(x){\tf="PSX238E" 0}_{\,f_m^{\rm z} (x) \le f_m^{\rm q} \,(x)}{\rm d}x,} $$

where X is a random median grayscale value with probability density $f_m^{\rm z} $ and ${\tf="PSX238E" 0}_{f{_m^{\rm z} (x) \le f_m^{\rm q} (x)}} $ denotes the indicator function, i.e.,

(16)$${\tf="PSX238E" 0}_{\,f_m^{\rm z} (x) \le f_m^{\rm q}} = \left\{ {\matrix{ {1,} & {{\rm if}\;f_m^{\rm z} (x) \le f_m^{\rm q} (x),} \cr {0,} & {{\rm if}\;f_m^{\rm z} (x) \gt f_m^{\rm q} (x).} \cr}} \right.$$

This misclassification probability corresponds to the blue area in Figure 4 and has the size of 0.19. Furthermore, Table 1 shows a confusion matrix of the prediction rule given in equation (14), where we can see that 73 out of 342 zinnwaldite-composite particles in the SEM-EDS plane were wrongly classified as quartz-composite particles.

Fig. 4. 1D probability densities of the median grayscale value of zinnwaldite (blue curve) and quartz (red curve) particles. The empirical densities were fitted with beta distributions, see the “Modeling of Single Particle Characteristics” section. The orange area provides the probability of classifying a quartz-composite particle as zinnwaldite, whereas the blue area gives the probability of classifying zinnwaldite as quartz.

Fig. 5. Joint probability density of sphericity factor and median grayscale value for zinnwaldite (left) and quartz (right) particles.

Table 1. Confusion Matrix for Predicting the Particle Composition Based on 1D Densities of the Median Grayscale Value, Where the Particles Observed in the SEM-EDS Data Have Been Used for Adjusting the Prediction Model.

In order to reduce the misclassification probabilities we additionally consider multidimensional distributions of vectors of particle characteristics. For example, in Figure 5 the joint probability density of the sphericity factor and the median grayscale value is shown for zinnwaldite (left) and quartz (right) particles. We can see that these 2D distributions are much more distinct, in comparison with the 1D distributions of Figure 4. In accordance with this, the consideration of bivariate probability densities leads to a remarkable drop of the zinnwaldite misclassification probability from 0.19 to 0.06, where we used a 2D analog of the decision rule given in equation (14), see also equation (41) for the general definition of a multidimensional decision rule. Note that the number of wrongly classified zinnwaldite-composite particles dropped from 73 to 27, see Table 2, whereas the number of wrongly classified quartz-composite particles only increased slightly. Thus, by considering multidimensional probability distributions of particle characteristics we can achieve better prediction results. In the following, we describe in detail how to construct such two- or even higher-dimensional probability distributions and their corresponding decision rules.

Table 2. Confusion Matrix for Predicting the Particle Composition Based on Joint Densities of the Sphericity Factor and the Median Grayscale Value, Where the Particles Observed in the SEM-EDS Data Have Been Used for Adjusting the Prediction Model.

Modeling of Single Particle Characteristics

We now fit parametric probability distributions to the 1D particle characteristics described in the “Particle Characteristics” section. To be precise, we first fit parametric distributions to the volume equivalent radius, sphericity factor, convexity factor, elongation factor, median grayscale value, and the iqr for the 3D zinnwaldite-composite particles that intersect with the SEM-EDS plane. The chosen families of parametric distributions and the corresponding parameters that were determined to model the distributions of these characteristics are listed in Table 3. Note that we used gamma- and beta-distributions, whose probability density functions are given by

(17)$$f_{{\rm gamma}}(x) = \displaystyle{1 \over {\theta ^k\Gamma (k)}}x^{k-1}\exp \left( {-\displaystyle{x \over \theta}} \right){\rm \;} {\tf="PSX238E" 0}_{x \gt 0}$$

and

(18)$$f_{{\rm beta}}\,(x) = \displaystyle{{\Gamma (\alpha )\Gamma (\beta )} \over {\Gamma (\alpha + \beta )}}x^{\alpha -1}(1-x)^{\beta -1}{\tf="PSX238E" 0}_{x\in [{0,1}] },$$

respectively, where k, θ, α, β > 0 are some model parameters and the gamma function Γ:[0, ∞) → [0, ∞) is defined by the integral

(19)$${\rm \Gamma (}\alpha {\rm )} = \mathop \int \limits_0^\infty x^{\alpha -1}{\rm e}^{-x}{\rm d}x\quad {\rm for}\;{\rm each}\;\alpha \gt 0.$$

For modeling the volume equivalent radius we used the gamma distribution. Note that there are also other families of distributions which result in a good fit, like the log-normal distribution. The remaining characteristics were modeled with beta distributions, since these particle characteristics have only values in the interval [0, 1] which coincides with the support of the beta distribution. A visual comparison with the histograms of particle characteristics is given in Figure 6. In order to determine the model parameters (k, θ for the gamma and α, β for the beta distribution) for these 1D fits we used maximum-likelihood estimators (Held & Bové, Reference Held and Bové2014). Analogously, the 1D distributions of single particle characteristics were fitted for quartz-composite particles, see Figure 7 and Table 3.

Fig. 6. Histograms (blue) and fitted probability densities of zinnwaldite-composite particle characteristics for (a) volume equivalent radius, (b) sphericity factor, (c) convexity factor, (d) elongation factor, (e) median grayscale value, and (f) iqr, see also Table 3.

Fig. 7. Histograms (blue) and fitted probability densities of quartz-composite particle characteristics for (a) volume equivalent radius, (b) sphericity factor, (c) convexity factor, (d) elongation factor, (e) median grayscale value, and (f) iqr.

Table 3. Parameters of Fitted Distributions.

Modeling of Multidimensional Particle Characteristics

When modeling the joint distribution of independent random particle characteristics X,  Y with probability density functions f X and f Y, respectively, the joint distribution is immediately given by f (XY) (x, y) = f X(x)f Y(y). However, this approach is not available for correlated characteristics, as it is the case for our data, where, for example, we obtained a correlation coefficient of 0.36 for the sphericity factor and median grayscale value of zinnwaldite-composite particles.

Thus, in order to describe the joint distribution of vectors of particle characteristics, whose marginal distributions are given by the 1D distributions we fitted in the “Modeling of Single Particle Characteristics” section, see Table 3, we consider a more general approach, using the so-called Archimedean copulas, see Nelsen (Reference Nelsen2006).

To make the paper more self-contained, we first explain some basic ideas of the copula approach, which will be used to construct multivariate probability distributions with non-normal marginals. Therefore, for some d ≥ 2, let f 1, …, f d : ℝ → [0, ∞) denote the 1D probability densities of particle characteristics for which we want to determine the d-dimensional joint probability density f : ℝd → [0, ∞) whose marginal densities are given by f 1, …, f d. In order to construct the multivariate density f we need to consider, for technical reasons, the 1D (cumulative) distribution functions F 1, …, F d : ℝ → [0, 1] which are given by

(20)$$F_i\,(x) = \mathop \int \limits_{-\infty} ^x f_i\,(y)\,{\rm d}y\quad {\rm for}\;i = 1, \ldots, d.$$

Note that F 1(x) denotes the probability that the particle characteristic described by the density f 1 does not exceed the value x ∈ ℝ and that the density can be obtained from the distribution function F i by

(21)$$f_i(x) = \displaystyle{{\rm d} \over {{\rm d}x}}F_i(x)\quad {\rm for}\;i = 1, \ldots, d.$$

Analogously, the d-dimensional density f has a cumulative distribution function F : ℝd → [0, 1] which is given by

(22)$$\eqalign{&F(x) = \mathop \int \limits_{-\infty} ^{x_1} \ldots \mathop \int \limits_{-\infty} ^{x_d} f\,(y_1, \ldots, y_d)\,{\rm d}y_d \ldots {\rm d}y_1 \cr & {\rm for}\;x = (x_1, \ldots, x_d)\in {\rm {\opf R}}^d.$$

The joint density f is then obtained by the d-fold partial derivative

(23)$$f\,(x) = \displaystyle{{\partial ^d} \over {\partial x_1\cdots \partial x_d}}F(x)\quad {\rm for}\;x = (x_1, \ldots, x_d)\in {\rm {\opf R}}^d.$$

In order to show the connection between the multivariate distribution function f and the notion of copulas we begin with the definition of the latter. A d-dimensional copula is a multivariate cumulative distribution function K : ℝd → [0, 1], whose 1D marginal distributions are uniform distributions in the interval [0, 1]. For example, the marginal cumulative distribution function K 1 : ℝ → [0, 1] of the first component is given by

(24)$$\eqalign{&K_1(x_1) & = \mathop {\lim} \limits_{x_2, \ldots, x_d\to \infty} K\lpar {x_1,x_2, \ldots, x_d} \rpar \cr & = \left\{ {\matrix{ {0,} \hfill & {\;{\rm if}\;x_1 \lt 0,} \hfill \cr {x_1,} \hfill & {\;{\rm if}\;x_1\in [ {0,1} ],} \hfill \cr {1,} \hfill & {\;{\rm if}\;x_1 \gt 1.} \hfill \cr}} \right.}$$

Copulas are of special interest because of Sklar's theorem (Nelsen, Reference Nelsen2006). It states that F is a d-dimensional cumulative distribution function with marginal distribution functions F 1, …, F d if and only if there is a copula function K such that

(25)$$F(x) = K(F_1(x_1), \ldots, F_d(x_d))\quad {\rm for}\;x = (x_1, \ldots, x_d)\in {\rm {\opf R}}^d.$$

This means that every multivariate distribution function F can be represented by its marginals F 1, …, F d and a copula function K. Thus, when modeling the joint distribution function F of random vectors whose marginal distributions are given by F 1, …, F d, it suffices to model the copula K. Note that there are numerous classes of copula functions, for which many of them do not have an analytical representation. In the present paper we limit the search for an appropriate copula function to the so-called Archimedean copulas.

Archimedean Copulas and Differential Variant of Sklar's Theorem

For that purpose, let φ : [0, 1] → [0, ∞] be a continuous, strictly decreasing and convex function with φ(1) = 0 and φ(0) = ∞. Note that φ is called an Archimedean generator. Then, it can be shown that the function K : [0, 1]d → [0, 1] given by

(26)$$\eqalign{&K(u) = \varphi ^{-1}(\varphi (u_1) + \cdots + \varphi (u_d)) \cr & {\rm for}\;u = (u_1, \ldots, u_d)\in [0,1]^d$$

can be uniquely extended to a copula on ℝd, i.e., a function possessing the properties mentioned in the “Modeling of Multidimensional Particle Characteristics” section. It is called an Archimedean copula (Nelsen, Reference Nelsen2006). With the copula function K given in equation (26) which still depends on some abstract function φ we can construct a d-dimensional distribution function F by means of equation (25) which has the marginal distribution functions F 1, …, F d. Since, finally, we are interested in the joint density f of particle characteristics, applying equations (23)–(25) we get that

(27)$$\eqalign{&f\,(x) = f_1(x_1) \ldots f_d(x_d)\,k(F_1(x_1), \ldots, F_d(x_d)) \cr & {\rm for}\;x = (x_1, \ldots, x_d)\in {\rm {\opf R}}^d,$$

which can be seen as a differential variant of Sklar's theorem given in equation (25), where the function k : ℝd → [0, ∞) is the d-fold derivative

(28)$$k(u) = \displaystyle{{\partial ^d} \over {\partial u_1 \ldots \partial u_d}}K(u)\quad {\rm for}\;u = (u_1, \ldots, u_d)\in {\rm {\opf R}}^d.$$

Recall that the d-dimensional probability density f given by equation (27) has the marginal densities f 1, …, f d and, besides this, depends on the choice of a suitable copula function K or, equivalently, the choice of its derivative k. Thus, the task in modeling multidimensional probability distributions for given (1D) marginal densities is the determination of a suitable copula function K, which describes the correlation structure of the 1D marginals. Since in the present paper we only consider Archimedean copulas, which are defined by their generator φ, this task is equivalent to finding a suitable Archimedean generator.

Parametric Families of Archimedean Generators

Instead of analyzing single generator functions φ one by one, we can instead consider parametric families {φ θ : θ ∈ Θ} of generators. For such families the function φ θ is an Archimedean generator for each parameter θ ∈ Θ, where Θ is some set of admissible parameters. This has the advantage that we can consider a whole range of generators for which it is rather easy to determine an optimal choice of a generator among the family {φ θ : θ ∈ Θ} for modeling the joint density f.

An example of a one-parametric family of generators is the Ali-Mikhail-Haq generator, see Nelsen (Reference Nelsen2006), which is given by

(29)$$\varphi _\theta (u) = \log \left( {\displaystyle{{1-\theta \,(1-u)} \over u}} \right),$$

for each u ∈ [0, 1] and some θ ∈ Θ = ( − 1, 1). Note that the parameter θ regulates the correlation between the uniformly distributed marginals of the corresponding copula K θ. The relationship between the copula parameter θ and the common correlation coefficients, namely, the Pearson, Kendall, and Spearman correlation coefficients (Nelsen, Reference Nelsen2006), is shown in Figure 8 for 2D Ali-Mikhail-Haq copulas.

Fig. 8. Relationship between the copula parameter θ and the Pearson, Kendall, and Spearman correlation coefficients of the uniformly distributed marginals of 2D Ali-Mikhail-Haq copulas.

Note, that in the present paper, other one-parametric families of copulas are considered as well, e.g., the Frank copula whose generator is given by

(30)$$\varphi _\theta (u) = -\log \left( {\displaystyle{{{\rm e}^{-\theta u}-1} \over {{\rm e}^{-\theta} -1}}} \right),$$

for θ ≠ 0. An extensive list on even further copulas, such as the Clayton, Gumbel, Joe, and Plackett copula, can be found in Joe (Reference Joe1997). Beyond that, it is possible to obtain further copulas by rotating a given 2D copula by a multiple of 90°. Thus, there are four rotated versions of the aforementioned parametric families of copulas, which increases the list of considered parametric copula families even further.

Fitting a model from a parametric family of generators {φ θ : θ ∈ Θ} is equivalent to determining an optimal parameter $\hskip2.5pt \hat{\hskip-2.8pt\theta} $. This problem can be described as an optimization problem using maximum-likelihood methods in the following way.

The family of generators {φ θ : θ ∈ Θ} induces a family of copulas {K θ : θ ∈ Θ} given by

(31)$$\eqalign{&K_\theta (u) = \varphi _\theta ^{-1} (\varphi _\theta (u_1) + \cdots + \varphi _\theta (u_d)) \cr & {\rm for}\;u = (u_1, \ldots, u_d)\in [0,1]^d,$$

for which the optimal parameter $\hskip2.5pt \hat{\hskip-2.8pt\theta} $ has to be determined. Assuming that K θ is differentiable we can consider the d-fold derivative

(32)$$k_\theta (u) = \displaystyle{{\partial ^d} \over {\partial u_1 \ldots \partial u_d}}K_\theta (u),$$

and the corresponding d-dimensional distribution function F θ, which is given by equation (25), has the probability density function

(33)$$\eqalign{& f_\theta (x) = f_1(x_1) \ldots f_d(x_d)k_\theta (F_1(x_1), \ldots, F_d(x_d)) \cr & {\rm for}\;x = (x_1, \ldots, x_d)\in {\rm {\opf R}}^d,$$

where f 1, …, f d and F 1, …, F d are the 1D densities and distribution functions fitted in the “Modeling of Single Particle Characteristics” section to the image data considered in the present paper.

Note that, for a sample of ℓ ∈ ℕ observations x (1), …, x (ℓ) ∈ ℝd of the considered d-dimensional vector of particle characteristics, the so-called log-likelihood function is given by

(34)$$\eqalign{&\log {\rm {\cal L}}(\theta \vert x^{(1)}, \ldots, x^{(\ell )}) = \mathop \sum \limits_{k = 1}^\ell \log \,(\,f_\theta (x^{(k)})) \cr &\, {\rm for}\;\theta \in \Theta.$$

This leads to the maximum-likelihood estimator

(35)$$\hskip2.5pt \hat{\hskip-2.8pt\theta} = \mathop {{\rm argmax}}\limits_{\theta \in {\rm \Theta}} \;\log \,{\rm {\cal L}}(\theta \vert x^{(1)}, \ldots, x^{(\ell )}),$$

which can be considered as the optimal choice for the parameter θ when the sample x (1), …, x (ℓ) was observed. The fitted joint density f is then given by $f = f_{\hskip1.5pt \hat{\hskip-2pt\theta}}. $

Until now, the procedure discussed above solely explains how to estimate the copula parameter θ for a given parametric family of copulas. In order to select an adequate model type among several parametric copula families, one can use the so-called Akaike information criterion (AIC) which utilizes the log-likelihood function considered in equation (34) to compare parametric families of copulas and select the best performing model. Note that, besides utilizing the log-likelihood function considered in equation (34), the AIC additionally takes into consideration the number of model parameters to avoid overfitting models. For more details regarding the AIC and model selection, see e.g., Held & Bové (Reference Held and Bové2014).

Application to Particle Characteristics

In the context considered in the present paper, the estimation procedure described above can be applied in the following way.

In order to obtain the (2D) joint density of the sphericity and median grayscale value of zinnwaldite-composite particles, see Figure 5, we utilize the 2D vectors of particle characteristics x (j) = (s(P j), m(P j)) which have been computed in the “Particle Characteristics” section for each zinnwaldite-composite particle P 1, …, P  ∈ Z that hits the SEM-EDS plane. For both characteristics the parameters of the corresponding 1D probability densities $f_s^{\rm z}, f_m^{\rm z} $ for zinnwaldite-composite particles are given in Table 3. Now we consider, for example, the family of Frank copulas, assuming that it is able to describe the 2D density of the sphericity and median grayscale value. Then, we utilize the formula given in equation (33) and the previously determined 1D probability densities $f_s^{\rm z}, f_m^{\rm z} $ to obtain a parametric family of possible candidates for the 2D joint density of the considered particle characteristics. By inserting the computed vectors of particle characteristics x (j) into equation (34), we obtain the log-likelihood function $\log {\rm \;} {\rm {\cal L}}$, see Figure 9, which after maximization provides us the copula parameter $\hskip2.5pt \hat{\hskip-2.8pt\theta} = 2.45$. Thus, by means of equation (33) we obtain the 2D probability density of zinnwaldite-composite particle characteristics:

(36)$$f_{s,m}^{\rm z} (x_1,x_2) = f_s^{\rm z} (x_1)f_m^{\rm z} (x_2)k_{\hskip0.8pt \hat{\hskip-1.8pt\theta}} (F_s^{\rm z} (x_1),F_m^{\rm z} (x_2)),$$

where $F_s^{\rm z} $ and $F_m^{\rm z} $ are the distribution functions corresponding to the densities $f_s^{\rm z} $ and $f_m^{\rm z} $, respectively. Note that we found that the family of Frank copulas indeed provides a good fit, in comparison with other parametric families, by comparing the AIC of various families of copulas mentioned in the “Parametric Families of Archimedean Generators” section. Similarly, we showed that the Frank copula rotated by 90° provides a good fit for the 2D probability density $f_{s,m}^{\rm q} $ of the sphericity and median grayscale value of quartz-composite particles with the copula parameter $\hskip2.5pt \hat{\hskip-2.8pt\theta} = -1.14$, see Figure 5.

Fig. 9. Log-likelihood functions for fitting 2D copulas to the sphericity and median grayscale value of zinnwaldite-composite particles using a Frank copula (left) and of quartz-composite particles using a Frank copula rotated by 90° (right).

Recall that the quality of the decision, whether a particle mainly comprises zinnwaldite or quartz, improved significantly once we considered the 2D probability densities $f_{s \comma m}^{\rm z},\, $$f_{\vskip0.01pt s,m}^{\rm q} $ over 1D densities, compare Tables 1 and 2. Furthermore, recall that the estimation procedure described in the “Parametric Families of Archimedean Generators” section is not limited to the simultaneous consideration of two particle characteristics. But, in the same way, it can be applied for general d-dimensional vectors of particle characteristics. For example, we can investigate the six particle characteristics introduced in the “Particle Characteristics” section, thus considering the case d = 6.

For simplicity, we first fit one-parametric copulas to the six-dimensional (6D) data, which leads to better prediction results than just considering 2D particle characteristics, see Table 4. However, we are aware of the fact that one-parametric copulas do not entirely capture the correlation structure of the 6D data, because the empirical (pairwise) correlation coefficients can vary distinctly for each individual pair of particle characteristics. It turned out that, among the one-parametric copula families mentioned in the “Parametric Families of Archimedean Generators” section, the Ali-Mikhail-Haq copula yielded the best fit. In particular, using both the 6D vectors of characteristics x (j) = (r(P j),  s(P j),  c(P j),  e(P j),  m(P j),  iqr(P j)) of zinnwaldite particles P 1, …, P  ∈ Z and the corresponding 1D probability densities $f_r^{\rm z}, f_s^{\rm z}, f_c^{\rm z}, f_e^{\rm z}, f_m^{\rm z}, f_{{\rm iqr}}^{\rm z} $, see the “Modeling of Single Particle Characteristics” section, by means of equations (31)–(34) we obtain the copula parameter $\hskip2.5pt \hat{\hskip-2.8pt\theta} = 0.69$, and thus a parametric fit for the 6D density f z of zinnwaldite-composite particle characteristics.

Table 4. Confusion Matrices of the Copula-Based Decision Rule Given in Equation (41) for Particles Observed in the SEM-EDS Section Which Has Been Used for Adjusting the Prediction Model and for the Particles Observed in the SEM-EDS Section Which Was Withheld for Validation.

Analogously, we obtained the Ali-Mikhail-Haq copula with parameter $\hskip2.5pt \hat{\hskip-2.8pt\theta} = 0.47$ and thus the multidimensional density function f q for the characteristics of quartz particles.

One way to achieve even better prediction results would be to utilize multi-parametric copula families to fully capture the correlation structure of the 6D data, such as, e.g., vine copulas, see Bedford & Cooke (Reference Bedford and Cooke2002). However, in the following, we consider yet another approach to achieve this goal, which uses the 6D data, but projects it to a suitably chosen 2D subspace. In this way, we still exploit information of the 6D vector of particle characteristics while being able to consider 2D copulas, or even 2D kernel density estimators.

Kernel Density Estimation and Dimension Reduction

Recall that for describing multivariate probability densities of particle characteristics, which are necessary for improving the decision rule of the 1D case considered in equation (14), we used a parametric copula approach in the “Application to Particle Characteristics” section. Alternatively, such densities could be estimated in a nonparametric way, using the so-called kernel density estimators (Scott, Reference Scott2015). To be precise, from a sample x (1), …, x (ℓ) ∈ ℝ of a 1D characteristic of zinnwaldite-composite particles a non-parametric density $\hskip1pt\hat{\hskip-1pt f}_{\rm z}$, can be estimated by

(37)$$\hskip1pt\hat{\hskip-1pt f}_{\rm z}(x) = \displaystyle{1 \over \ell} \mathop \sum \limits_{\,j = 1}^\ell \kappa \left( {\displaystyle{{x-x^{(\,j)}} \over h}} \right)\quad {\rm for}\;x\in {\rm {\opf R}},$$

where h >0 is called the bandwidth and κ : ℝ → [0, ∞) is a kernel function, e.g.,

(38)$$\kappa (x) = \displaystyle{1 \over {\sqrt {2\pi}}} \exp {\rm \;} \left( {-\displaystyle{{x^2} \over 2}} \right)\quad {\rm for}\;x\in {\rm {\opf R}}.$$

Similarly, it would be possible to estimate non-parametric multivariate densities for vectors of particle characteristics. However, note that due to the “curse of dimensionality” the required sample size for adequate kernel density estimation increases exponentially with the considered dimension, see Scott (Reference Scott2015). Thus, it is difficult to utilize kernel density estimators for computing accurate 6D densities representing our 6D data. Nevertheless, the issue concerning the sample size of the 6D vectors of particle characteristics, caused by the “curse of dimensionality,” can be remedied by dimension reduction. Common techniques such as the principal component analysis (PCA) project the data to a lower dimensional subspace which maximizes the variance of the data (Hastie et al., Reference Hastie, Tibshirani and Friedman2009). However, PCA does not ensure a good separability of the two classes, namely zinnwaldite- and quartz-composite particles, after projection of the data. Other methods, such as linear discriminant analysis (LDA) can reduce the dimension of the data such that classification can still be performed reliably. Note that LDA always projects data on a c − 1 dimensional subspace, where c is the number of classes. In the setting of the present paper, LDA would project the 6D vectors of zinnwaldite- and quartz-composite particle characteristics on a 1D subspace such that the normalized discrepancy measure

(39)$$\Delta _1 = \displaystyle{{{(\mu _{\rm z}-\mu _{\rm q})}^2} \over {\sigma _{\rm z}^{\,2} + \sigma _{\rm q}^{\,2}}} $$

is maximized, where μ z, μ q ∈ ℝ are the means of the projected characteristics vectors of zinnwaldite and quartz, respectively. Analogously, the values $\sigma _{\rm z}^{\,2}, \sigma _{\rm q}^{\,2} \in {\rm {\open R}}$ are their variances. Heuristically speaking, the LDA maximizes the distance between clusters, while minimizing their variances. In order to fit multidimensional probability densities or to perform multidimensional kernel density estimation on the data after dimension reduction we modified the expression considered in equation (39) such that we reduce the data to two dimensions. More precisely, we project the 6D data on a 2D subspace such that the normalized discrepancy measure

(40)$$\Delta _2 = \displaystyle{{\parallel {\tilde{\mu}} _{\rm z}-{\tilde{\mu}} _{\rm q}\parallel ^2} \over {\sigma _{{\rm z},1}^{\,2} + \sigma _{{\rm z},2}^{\,2} + \sigma _{{\rm q},1}^{\,2} + \sigma _{{\rm q},2}^{\,2}}} $$

is maximized, where $\parallel \cdot \parallel $ denotes the Euclidean norm in ℝ2 and $\tilde{\mu} _{\rm z},\tilde{\mu} _{\rm q}\in {\opf R}^2$ are the means of the clusters corresponding to zinnwaldite- and quartz-composite particles after projection. The values $\sigma _{{\rm z},1}^{\,2}, \sigma _{{\rm z},2}^{\,2} $ are the respective variances of the first and second components for vectors of characteristics of zinnwaldite after projection. Analogously, $\sigma _{{\rm q},1}^{\,2}, \sigma _{{\rm q},2}^{\,2} $ denote the variances of the components for the quartz case. By maximizing the expression given in equation (40) we obtain the dimension reduction by projecting for each particle P the corresponding vectors of characteristics x = (r(P), s(P), c(P), e(P), m(P), iqr(P)) onto the plane with span vectors v 1 = (0.0014, 0.3189, 0.0189, 0.0562, − 0.9455, − 0.0285) and v 2 = (0.0001, − 0.0944, 0.1410, − 0.0193, − 0.0598, 0.9835). Figure 10 indicates that the vectors of characteristics for zinnwaldite- and quartz-composite particles after projections are relatively distinct. Furthermore, this reduction of dimension makes kernel density estimation more viable again. Note that the “ragged” probability densities in Figures 11a and 11b still indicate too small sample sizes for adequate kernel density estimation. However, the methods discussed in the “Modeling of Single Particle Characteristics” and “Parametric Families of Archimedean Generators” sections can be used to fit parametric copulas to the 2D data after dimension reduction, see Figures 11c and 11d.

Fig. 10. 6D data after dimension reduction obtained by maximizing cluster distance, while minimizing cluster variances.

Fig. 11. Kernel density estimation of vectors of characteristics after dimension reduction for (a) zinnwaldite- and (b) quartz-composite particles. The corresponding probability densities obtained by fitting parametric copulas for (c) zinnwaldite- and (d) quartz-composite particles.

Results and Discussion

At the beginning of the “Stochastic Modeling of Particle Characteristics” section we showed a way how the mineralogical composition of particles could be predicted on the basis of 1D probability distributions of single particle characteristics introduced in the “Particle Characteristics” section. However, in Figure 4 and Table 1 we have seen that 1D distributions do not suffice for making reliable decisions, whereas Figure 5 and Table 2 indicate that the predictive power of our decision rule can increase when we consider bivariate vectors of characteristics. Moreover, in the “Application to Particle Characteristics” section we showed how multidimensional density functions f z and f q for six characteristics of zinnwaldite- and quartz-composite particles can be determined. Then, analogously to the 1D case, it is possible to predict the composition of a particle P ⊂ ℤ3 observed in the XMT image data, using the following decision rule: If

(41)$$\eqalign{& f_{\rm z}(r(P),s(P),c(P),e(P),m(P),{\rm iqr}(P)) \cr & \gt f_{\rm q}(r(P),s(P),c(P),e(P),m(P),{\rm iqr}(P)),}$$

then we assume that P is mainly composed of zinnwaldite, otherwise of quartz. In other words, the decision rule given in equation (41) states that a particle P with the vector of characteristics x = (r(P), s(P), c(P), e(P), m(P), iqr(P)) is classified as a zinnwaldite-composite particle if the configuration of characteristics x is more likely to occur according to the distribution of characteristics of zinnwaldite-composite particles given by f z.

Table 4 shows that the decision rule given in equation (41) is rather accurate, i.e., only $2.7\% $ of zinnwaldite-composite particles are wrongly classified as quartz-composite particles and $5.7\% $ of the quartz-composite particles are wrongly classified as zinnwaldite-composite particles. In comparison with the decision rule which only used the sphericity factor and the median grayscale value, see Table 2, the number of wrongly classified zinnwaldite-composite particles dropped from 27 to 9, see also Figure 12. This is a significant improvement compared with the prediction based on two characteristics shown in Table 2. We suppose that the prediction results can become even better when we consider multi-parametric copulas.

Fig. 12. a: Cutout of the SEM-EDS section used for the calibration of the prediction model, where the blue phase indicates zinnwaldite and pink indicates quartz. b: The corresponding XMT section. Blue ellipses mark zinnwaldite-composite particles which were wrongly classified as quartz-composite particles by the decision rule given in equation (41). Analogously, red ellipses indicate wrongly classified quartz-composite particles.

Alternatively, we can use the dimension reduction considered in the “Kernel Density Estimation and Dimension Reduction” section, where the 6D vectors of particle characteristics were projected onto a 2D subspace and the corresponding probability densities required for classification were determined using either kernel density estimation (Figs. 11a and 11b) or Archimedean copulas (Figs. 11c and 11d). The classification results using the latter procedure are listed in Table 5, where the errors for the particles from the SEM-EDS section used for calibration are similar to the errors occurring by the usage of 6D copulas. The approach using kernel density estimation performs best on the calibration data, see Table 6. This was to be expected, since the entire sample information is encoded in the estimated densities.

Table 5. Confusion Matrix of the Copula-Based Decision Rule Utilizing Dimension Reduction.

The corresponding fitted probability densities are shown in Figures 11c and 11d.

Table 6. Confusion Matrix of the Decision Rule Utilizing Dimension Reduction Followed by Kernel Density Estimation.

The corresponding probability densities are shown in Figures 11a and 11b.

To validate the predictive capability of the presented models, we used an additional planar SEM-EDS data set measured at a spatially different location of the 3D sample, see red plane in Figure 1a. For particles that hit this second plane, we determined the particle characteristics and predicted the composition of the particles using the method based on 6D copulas, described in the “Application to Particle Characteristics” section, and the classification method based on dimension reduction followed by fitting parametric copulas or kernel density estimation. Due to the SEM-EDS data the true composition of these particles is known. A comparison between the ground truth and the predictions can be seen in Tables 4–6, where the predictions were again rather good.

Since in the latter case the mineralogical information of particles that hit the second SEM-EDS plane was not used for calibrating the prediction models, we can assume, based on the results of Tables 4–6, that the predictions will remain accurate for each particle of the entire 3D XMT image. Furthermore, the validation results show that the prediction models based on dimension reduction, followed by fitting parametric copulas or kernel density estimation, perform best on the validation section, which indicates that the approach based on 6D copulas can be improved by using more general, higher-parametric copula models.

Note that, even though the prediction method using multidimensional probability densities was presented for a classification problem with two classes, it is not limited to such binary cases. In general, for a prediction problem with n ∈ ℕ classes, the probability density f k of the considered d-dimensional feature vector is required for each class k = 1, …, n. Furthermore, it is necessary to have a priori knowledge on the occurrence probability p k of each class k. Then a vector of characteristics x ∈ ℝd is assigned to class i if

(42)$$f_i\,(x)\,p_i \gt f_k(x)\,p_k$$

for each k ≠ i. Note that we omitted the a priori occurrence probabilities p k in the present paper, since we had roughly the same amount of zinnwaldite- and quartz-composite particles. For more details regarding such classification methods we refer the reader to Duda et al. (Reference Duda, Hart and Stork2001).

Conclusions and Outlook

We presented methods for the mineralogical characterization of particles observed in 3D XMT image data which mainly consists of the minerals zinnwaldite and quartz. The grayscale value I(x) of a voxel x ∈ W ⊂ ℤ3 in the 3D XMT image already provides some information about the mass density of the observed mineral at the location x. However, this information does not suffice for reliably distinguishing between zinnwaldite- and quartz-composite particles. Therefore, we additionally considered the morphology of particles to characterize them. The proposed prediction models can then characterize particles based on their 3D morphology and grayscale values extracted solely from XMT data.

For the calibration of the prediction models we had to localize a 2D SEM-EDS data set in the 3D XMT image. The SEM-EDS data provided a mineralogical characterization of particles in a planar cross-section of the sample, such that we know the 3D morphology and the composition of particles that intersect with this cross-section. In order to extract single 3D particles that hit the planar cross-section, we computed a particle-wise segmentation of the 3D image data. This allowed us to determine for each particle its vector of characteristics which is relevant to distinguish between zinnwaldite- and quartz-composite particles. Among the considered characteristics are the median grayscale value of particles, but also particle size and shape characteristics such as the volume equivalent radius and the sphericity factor.

This resulted in vectors of characteristics for each particle. Since the mineralogical composition of particles that hit the SEM-EDS plane is known, we assigned the corresponding vectors of characteristics to the mineral the particle is mostly composed of. Therefore, we had a set of vectors of characteristics which corresponded to particles dominated by zinnwaldite and analogously another set of such vectors corresponding to particles dominated by quartz. The proposed prediction models require joint densities f z and f q of these characteristics for particles composed of zinnwaldite and quartz, respectively. Therefore, we fitted parametric copulas to each of the two data sets. This entailed fitting 1D parametric families of distributions to each considered characteristic (separately for zinnwaldite- and quartz-composite particles). In a second step these 1D densities were combined to obtain multidimensional densities. Since the considered characteristics were correlated, the multidimensional densities had to reflect the correlation structure. We achieved this for 2D densities with the help of parametric Archimedean copulas. Furthermore, we projected our 6D particle characteristics onto a 2D subspace, such that after this dimension reduction the vectors of particle characteristics corresponding to zinnwaldite- or quartz-composite particles show a good separability. This approach made kernel density estimation more viable again, such that it was possible to utilize the corresponding densities for a further prediction model.

The resulting decision rules characterize a particle with a vector of characteristics x ∈ ℝd as zinnwaldite-composite particle if f z(x) > f q(x), whereby the densities f z, f q were either fitted using a copula approach or kernel density estimation. We observed that this decision rule became more accurate when higher dimensional vectors of characteristics were considered. However, there was already a significant drop of the misclassification percentage when we considered 2D characteristics, instead of just considering single (i.e., 1D) particle characteristics. This error was reduced even further when we considered vectors of six characteristics. However, for simplicity we used only one-parametric copulas when considering six characteristics. We suppose that higher-parametric models for fitting 6D copulas can lead to even better results. As an alternative to higher-parametric copula models, we fitted 2D copulas and also used kernel density estimators to the data after dimension reduction—which led to the best predictive results. To validate the prediction models we used additional SEM-EDS data at a spatially different location to the SEM-EDS data set that was used to adjust the models. This allowed us to compare the predictions of the models with the ground truth obtained by SEM-EDS. Since we observed that the prediction models discussed in the “Results and Discussion” section were quite accurate in this validation step, we can assume that they can reliably distinguish between zinnwaldite- and quartz-composite particles in the entire XMT image—even for areas where no SEM-EDS data are available.

In a forthcoming study we will use the copula-based modeling considered in the present paper to quantify the success of particle separation processes. To be precise, assume that the multidimensional density f of characteristics of a mixture of zinnwaldite- and quartz-composite particles is given by the convex-combination

(43)$$f = (1-p)f_{\rm z} + pf_{\rm q},$$

for some p ∈ [0, 1]. When the p value drops to 0 after a separation process we can say that the zinnwaldite-composite particles are separated from quartz-composite particles. However, it is possible that the distribution of zinnwaldite-composite particle characteristics itself changes significantly during separation. For example, if the separation process only extracts small zinnwaldite-composite particles, the multidimensional density of zinnwaldite characteristics after separation would differ from the original density f z, prior to separation. In order to track and compare these changes in the densities, it is useful to represent the multidimensional densities f z prior and after separation with parametric copulas since they are described by only a few parameters. This complexity reduction is an advantage of parametric copulas, since it allows us to compare complicated multidimensional distributions by simply comparing the parameters of the distributions.

Acknowledgments

The financial support from the German Research Foundation (DFG) for funding the X-ray microscope (INST267/129-1) as well as the research projects (PE1160/22-1 and SCHM997/27-1) within the priority program SPP 2045 “Highly specific and multidimensional fractionation of fine particle systems with technical relevance” is gratefully acknowledged. The authors would like to thank Sabine Gilbricht for her work at the MLA and Roland Würkert for preparing the epoxy blocks.

References

Bedford, T and Cooke, RM (2002). Vines: A new graphical model for dependent random variables. Ann Stat 30(4), 10311068.Google Scholar
Boas, FE and Fleischmann, D (2012). CT artifacts: Causes and reduction techniques. Imaging Med 4(2), 229240.Google Scholar
Buades, A, Coll, B and Morel, JM (2005). A non-local algorithm for image denoising. In Computer Society Conference on Computer Vision and Pattern Recognition, CVPR, vol. 2, pp. 6065. San Diego: IEEE Computer Society.Google Scholar
Duda, RO, Hart, PE and Stork, DG (2001). Pattern Classification, 2nd ed. New York, Weinheim: Wiley.Google Scholar
Feinauer, J, Brereton, T, Spettl, A, Weber, M, Manke, I and Schmidt, V (2015). Stochastic 3D modeling of the microstructure of lithium-ion battery anodes via Gaussian random fields on the sphere. Comput Mater Sci 109, 137146.Google Scholar
Furat, O, Leißner, T, Ditscherlein, R, Sedivy, O, Weber, M, Bachmann, K, Gutzmer, J, Peuker, U and Schmidt, V (2018). Description of ore particles from XMT images, supported by SEM-based image analysis. Microsc Microanal 24, 461470.Google Scholar
Hastie, T, Tibshirani, R and Friedman, J (2009). The Elements of Statistical Learning: Data Mining, Inference and Prediction, 2nd ed. New York: Springer.Google Scholar
Held, L and Bové, DS (2014). Applied Statistical Inference: Likelihood and Bayes. Berlin, Heidelberg: Springer.Google Scholar
Joe, H (1997). Multivariate Models and Dependence Concepts. London: Chapman & Hall.Google Scholar
Leißner, T, Bachmann, K, Gutzmer, J and Peuker, U (2016). MLA-based partition curves for magnetic separation. Miner Eng 94, 94103.Google Scholar
Nelsen, R (2006). An Introduction to Copulas. New York: Springer.Google Scholar
Reyes, F, Lin, Q, Cilliers, J and Neethling, S (2018). Quantifying mineral liberation by particle grade and surface exposure using X-ray microCT. Miner Eng 125, 7582.Google Scholar
Rieder, M, Huka, M, Kučerová, D, Minařík, L, Obermajer, J and Povondra, P (1970). Chemical composition and physical properties of lithium-iron micas from the Krušné hory Mts.(erzgebirge). Part A: Chemical composition. Contrib Mineral Petrol 27(2), 131158.Google Scholar
Schladitz, K, Ohser, J and Nagel, W (2006). Measuring intrinsic volumes in digital 3D images. In Kuba, A, Nyúl, LG and Palágyi, K (eds), Discrete Geometry for Computer Imagery: 13th International Conference, DGCI 2006, Szeged, Hungary, October 25–27, 2006. Proceedings. Berlin, Heidelberg: Springer, pp. 247258.Google Scholar
Scott, DW (2015). Multivariate Density Estimation: Theory, Practice, and Visualization. New York: John Wiley & Sons.Google Scholar
Shafait, F, Keysers, D and Breuel, TM (2008). Efficient implementation of local adaptive thresholding techniques using integral images. In Proc. SPIE 6815, Document Recognition and Retrieval XV, 681510 (28 January 2008). doi: 10.1117/12.767755.Google Scholar
Shapiro, L and Stockman, G (2001). Computer Vision. Upper Saddle River, NJ, USA: Prentice Hall.Google Scholar
Spettl, A, Wimmer, R, Werz, T, Heinze, M, Odenbach, S, Krill, CE III. and Schmidt, V (2015). Stochastic 3D modeling of Ostwald ripening at ultra-high volume fractions of the coarsening phase. Modell Simul Mater Sci Eng 23(6), 065001.Google Scholar
Su, D and Yan, W (2018). 3D characterization of general-shape sand particles using microfocus X-ray computed tomography and spherical harmonic functions, and particle regeneration using multivariate random vector. Powder Technol 323, 823.Google Scholar
Sunderland, D and Gottlieb, P (1991). Application of automated quantitative mineralogy in mineral processing. Miner Eng 4(7–11), 753762.Google Scholar
Wang, Y, Lin, C and Miller, J (2017). Quantitative analysis of exposed grain surface area for multiphase particles using X-ray microtomography. Powder Technol 308, 368377.Google Scholar
Figure 0

Fig. 1. a: Volumetric XMT data (gray) with two registered 2D SEM-EDS planes (blue and red). b: Cut-out of the particle-wise segmentation of the XMT data.

Figure 1

Fig. 2. a: 2D SEM-EDS image, the blue phase indicates zinnwaldite and pink indicates quartz. b: The corresponding section registered in the 3D XMT data.

Figure 2

Fig. 3. a: Histogram of the zinnwaldite volume fractions for particles for which the majority mineral phase is zinnwaldite. The mean zinnwaldite volume fraction is 0.925 with a variance of 0.018. b: Histogram of the quartz volume fractions for particles for which the majority mineral phase is quartz. The mean quartz volume fraction is 0.937 with a variance of 0.013.

Figure 3

Fig. 4. 1D probability densities of the median grayscale value of zinnwaldite (blue curve) and quartz (red curve) particles. The empirical densities were fitted with beta distributions, see the “Modeling of Single Particle Characteristics” section. The orange area provides the probability of classifying a quartz-composite particle as zinnwaldite, whereas the blue area gives the probability of classifying zinnwaldite as quartz.

Figure 4

Fig. 5. Joint probability density of sphericity factor and median grayscale value for zinnwaldite (left) and quartz (right) particles.

Figure 5

Table 1. Confusion Matrix for Predicting the Particle Composition Based on 1D Densities of the Median Grayscale Value, Where the Particles Observed in the SEM-EDS Data Have Been Used for Adjusting the Prediction Model.

Figure 6

Table 2. Confusion Matrix for Predicting the Particle Composition Based on Joint Densities of the Sphericity Factor and the Median Grayscale Value, Where the Particles Observed in the SEM-EDS Data Have Been Used for Adjusting the Prediction Model.

Figure 7

Fig. 6. Histograms (blue) and fitted probability densities of zinnwaldite-composite particle characteristics for (a) volume equivalent radius, (b) sphericity factor, (c) convexity factor, (d) elongation factor, (e) median grayscale value, and (f) iqr, see also Table 3.

Figure 8

Fig. 7. Histograms (blue) and fitted probability densities of quartz-composite particle characteristics for (a) volume equivalent radius, (b) sphericity factor, (c) convexity factor, (d) elongation factor, (e) median grayscale value, and (f) iqr.

Figure 9

Table 3. Parameters of Fitted Distributions.

Figure 10

Fig. 8. Relationship between the copula parameter θ and the Pearson, Kendall, and Spearman correlation coefficients of the uniformly distributed marginals of 2D Ali-Mikhail-Haq copulas.

Figure 11

Fig. 9. Log-likelihood functions for fitting 2D copulas to the sphericity and median grayscale value of zinnwaldite-composite particles using a Frank copula (left) and of quartz-composite particles using a Frank copula rotated by 90° (right).

Figure 12

Table 4. Confusion Matrices of the Copula-Based Decision Rule Given in Equation (41) for Particles Observed in the SEM-EDS Section Which Has Been Used for Adjusting the Prediction Model and for the Particles Observed in the SEM-EDS Section Which Was Withheld for Validation.

Figure 13

Fig. 10. 6D data after dimension reduction obtained by maximizing cluster distance, while minimizing cluster variances.

Figure 14

Fig. 11. Kernel density estimation of vectors of characteristics after dimension reduction for (a) zinnwaldite- and (b) quartz-composite particles. The corresponding probability densities obtained by fitting parametric copulas for (c) zinnwaldite- and (d) quartz-composite particles.

Figure 15

Fig. 12. a: Cutout of the SEM-EDS section used for the calibration of the prediction model, where the blue phase indicates zinnwaldite and pink indicates quartz. b: The corresponding XMT section. Blue ellipses mark zinnwaldite-composite particles which were wrongly classified as quartz-composite particles by the decision rule given in equation (41). Analogously, red ellipses indicate wrongly classified quartz-composite particles.

Figure 16

Table 5. Confusion Matrix of the Copula-Based Decision Rule Utilizing Dimension Reduction.

Figure 17

Table 6. Confusion Matrix of the Decision Rule Utilizing Dimension Reduction Followed by Kernel Density Estimation.