Comparison of single-step methods for genomic prediction of age at first calving in dairy buffaloes

Jessica Cristina Gonçalves dos Santos; Francisco Ribeiro de Araújo Neto; Gabriela Stefani Fernandez; Daniel Jordan de Abreu Santos; Felipe Pereira Cunha; Rusbel Raul Aspilcueta-Borquis; Humberto Tonhati

doi:10.1017/S0021859624000364

Comparison of single-step methods for genomic prediction of age at first calving in dairy buffaloes

Published online by Cambridge University Press: 14 November 2024

Jessica Cristina Gonçalves dos Santos ,

Francisco Ribeiro de Araújo Neto

Gabriela Stefani Fernandez ,

Daniel Jordan de Abreu Santos ,

Felipe Pereira Cunha ,

Rusbel Raul Aspilcueta-Borquis and

Humberto Tonhati

Show author details

Jessica Cristina Gonçalves dos Santos: Affiliation:
Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal – UNESP, Jaboticabal, São Paulo, Brazil
Francisco Ribeiro de Araújo Neto*: Affiliation:
Instituto Federal de Ciência e Tecnologia Goiano – IFGoiano, Rio Verde, Goiás
Gabriela Stefani Fernandez: Affiliation:
Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal – UNESP, Jaboticabal, São Paulo, Brazil
Daniel Jordan de Abreu Santos: Affiliation:
Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal – UNESP, Jaboticabal, São Paulo, Brazil
Felipe Pereira Cunha: Affiliation:
Instituto Federal de Ciência e Tecnologia Goiano – IFGoiano, Rio Verde, Goiás
Rusbel Raul Aspilcueta-Borquis: Affiliation:
Universidade Tecnológica Federal do Paraná – UFTPR, Dois Vizinhos, Paraná
Humberto Tonhati: Affiliation:
Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal – UNESP, Jaboticabal, São Paulo, Brazil
*: Corresponding author: F. R. Araujo Neto; Email: [email protected]

Article contents

Abstract
Introduction
Materials and methods
Results
Discussion
Authors’ contributions
Funding statement
Competing interests
Ethical standards
References

Rights & Permissions

Abstract

The age at first calving (AFC) is an important trait to be considered in breeding programmes of dairy buffaloes, where new approaches and technologies, such as genomic selection, are constantly applied. Thus, the objective of this study was to compare the predictive ability of different genomic single-step methods using AFC information from Murrah buffaloes. From a pedigree file containing 3320 buffaloes, 2247 cows had AFC records and 553 animals were genotyped. The following models were performed: pedigree-based BLUP (PBLUP), single-step GBLUP (ssGBLUP), weighted single-step GBLUP (WssGBLUP), and single-step Bayesian regression methods (ssBR-BayesA, BayesBπ, BayesCπ, Bayes-Lasso, and BayesRR). To compare the methodologies, the accuracy and dispersion of (G)EBVs were assessed using the LR method. Accuracy estimates for the genotyped animals ranged from 0.30 (PBLUP) to 0.39 (WssGBLUP). Predictions with the traditional model (PBLUP) were very dispersed from what was expected, while BayesCπ (0.99) and WssGBLUP (1.00) obtained the lowest dispersion. The results indicate that the use of genomic information can improve the genetic gain for AFC by increasing the accuracy and reducing inflation/deflation of predictions compared to the traditional pedigree-based model. In addition, among all genomic single-step models studied, WssGBLUP and single-step BayesA were the most advantageous methods to be used in the genomic evaluation of AFC of buffaloes from this population.

Keywords

Bayesian inference Bubalus bubalis genomic selection reproductive traits

Type: Animal Research Paper
Information: The Journal of Agricultural Science , Volume 162 , Issue 4 , August 2024 , pp. 377 - 383

DOI: https://doi.org/10.1017/S0021859624000364 [Opens in a new window]
Copyright: Copyright © The Author(s), 2024. Published by Cambridge University Press

Introduction

Age at first calving (AFC) is an important indicator trait of reproductive efficiency that is genetically monitored by the Milk-Recording Buffalo Program in Brazil (Aspilcueta-Borquis et al., Reference Aspilcueta-Borquis, Seno, Araujo Neto, Santos, Hurtado-Lugo and Tonhati2022). It is well known, however, that the genetic progress obtained by direct selection of AFC can be very slow due to the low heritability magnitude of this trait in Buffaloes (Camargo et al., Reference Camargo, Aspilcueta-Borquis, Fortes, Porto-Neto, Cardoso, Santos, Lehnert, Reverter, Moore and Tonhati2015). Since genomic selection is more accurate than traditional selection, especially for low-heritability traits (Calus et al., Reference Calus, Meuwissen, De Roos and Veerkamp2008), genomic evaluation is the best strategy for predicting breeding values for AFC. Furthermore, Camargo et al. (Reference Camargo, Aspilcueta-Borquis, Fortes, Porto-Neto, Cardoso, Santos, Lehnert, Reverter, Moore and Tonhati2015) and Araujo Neto et al. (Reference Araujo Neto, Takada, Santos, Aspilcueta-Borquis, Cardoso, Nascimento, Leão, Oliveira and Tonhati2020a) have reported some specific genomic regions highly associated to this trait in GWAS studies in this population, indicating that differential shrinkage models, such as Bayesian alphabet and weighted single-step GBLUP models, would be a reasonable alternative to be explored.

The single-step GBLUP (ssGBLUP) method has become the main methodology used for genomic evaluations in dairy buffaloes (Araujo Neto et al., Reference Araujo Neto, Takada, Santos, Aspilcueta-Borquis, Cardoso, Nascimento, Leão, Oliveira and Tonhati2020a, Reference Araujo Neto, Santos, Fernandes Júnior, Aspilcueta-Borquis, Nascimento, Seno, Tonhati and Oliveira2020b; Cesarani et al., Reference Cesarani, Biffani, Garcia, Lourenco, Bertolini, Neglia, Misztal and Macciotta2021; Lázaro et al., Reference Lázaro, Tonhati, Oliveira, Silva, Nascimento, Santos, Stefani and Brito2021; Araujo Neto et al., Reference Araujo Neto, Santos, Silva Arce, Borquis, Santos, Guimarães, Nascimento, Oliveira and Tonhati2022) given the ease of integrating the relationship matrices, based on pedigree (A) and genomic (G) information, without major changes in the mixed model equations (Misztal et al., Reference Misztal, Lourenco and Legarra2020). Moreover, this methodology allows assigning different weights for each marker, originating the weighted single-step GBLUP method – WssGBLUP (Wang et al., Reference Wang, Misztal, Aguilar, Legarra and Muir2012; Zhang et al., Reference Zhang, Lourenco, Aguilar, Legarra and Misztal2016).

Subsequently, Fernando et al. (Reference Fernando, Dekkers and Garrick2014) proposed single-step methodologies combined to Bayesian regression methods (ssBR), as an option to ssGBLUP. Nonetheless, few studies have been developed comparing the results of the different single-step methodologies (Lee et al., Reference Lee, Cheng, Garrick, Golden, Dekkers, Park, Lee and Fernando2017; Gao et al., Reference Gao, Koivula, Jensen, Strandén, Madsen, Pitkänen, Aamand and Mäntysaari2018; Zhou et al., Reference Zhou, Mrode, Zhang, Zhang, Li and Liu2018). Based on this, it becomes interesting to perform similar analyses in buffalo databases to assist in decision making in breeding programmes that employ genomic technology. Thus, this work was designed with the aim of comparing the predictive ability of different single-step methods using AFC information from Murrah buffaloes.

Materials and methods

Dataset

The phenotypic dataset consisted of 2,314AFC records of Murrah buffalo cows born between 1995 and 2017. Contemporary groups (CG) were formed considering animals born on the same farm, year and season of birth, which was divided into two (dry and rainy season). Animals with records outside the range between ±3 standard deviations from their CG averages and animals belonging to GCs with less than five individuals were removed.

From a pedigree file containing 3320 buffaloes, 553 animals (539 dams and 14 sires) were genotyped with the Axiom Buffalo Genotyping Array 90 K (Iamartino et al., Reference Iamartino, Nicolazzi, Van Tassell, Reecy, Fritz-Waters, Koltes, Biffani, Sonstegard, Schroeder, Ajmone-Marsan, Negrini, Pasquariello, Ramelli, Coletta, Garcia, Ali, Ramunno, Cosenza, Oliveira, Drummond, Bastianetto, Davassi, Pirani, Brew and Williams2017). Only SNPs present on autosomal chromosomes (BBU1-BBU24 referenced by UOA_WB_1 genome assembly), as well as those with call rate > 95%, MAF > 3% and significance level for Hardy Weinberg equilibrium test was 10⁻⁵ , were remained in the analysis. All samples had a call rate >90%. The database description (phenotypic and genotypic information) after the consistency step is in Table 1.

Table 1. Description of age at first calving dataset used in this study

PBLUP, ssGBLUP and WssGBLUP models

This set of methodologies is based on the use of relationship matrices between individuals in mixed model equations, with the differences consisting of the type of information used (pedigree or genomic) and the way these matrices are constructed. The model using pedigree based BLUP (PBLUP) can be described as:

$$y = {\boldsymbol X}\beta + {\boldsymbol Z}\alpha + e$$

where, y is the vector of phenotypic records (AFC); β is the fixed effect vector (CG), X is the incidence matrix that associates β with y; α is the vector of additive genetic effects, Z is an incidence matrix associating α with y and, e is the vector of residuals. The following assumptions were made:

$$E( y ) = {\boldsymbol X}\beta \quad E( \alpha ) = 0\quad E( e ) = 0$$

$${\rm Var}( \alpha ) = A\sigma _\alpha ^2 \quad {\rm Var}( e ) = {\boldsymbol I}_n\sigma _\varepsilon ^2 $$

where $\sigma _\alpha ^2$ and $\sigma _\varepsilon ^2$ represent the additive genetic and residual variances; A and I_n are the pedigree relationship and an identity matrices, respectively.

The single step GBLUP (ssGBLUP) method is an extension of the PBLUP model, in which the pedigree-based (A) and genomic (G) relationship matrices are combined into a single matrix (H), as described by Aguilar et al. (Reference Aguilar, Misztal, Johnson, Legarra, Tsuruta and Lawlor2010). So, its inverse can be obtained as:

$${\boldsymbol H}^{{-}1} = {\boldsymbol A}^{{-}1} + \left[{\matrix{ 0 \hfill & 0 \hfill \cr 0 \hfill & {{\boldsymbol G}^{{-}1}-{\boldsymbol A}_{22}^{{-}1} } \hfill \cr } } \right]$$

where A⁻¹ and G⁻¹ are the inverse matrices of A and G respectively, and ${\boldsymbol A}_{22}^{{-}1}$ is the inverse of the section of A related to the genotyped animals. The G matrix was obtained according to VanRaden (Reference VanRaden2008) as:

$${\boldsymbol G} = \displaystyle{{{\boldsymbol ZD{Z}^{\prime}}} \over {2\;\sum {\,p_i} ( 1-p_i{) }^{\prime}}}$$

where p _i is the minor allele frequency (MAF) of SNP i, Z is a matrix relating genotypes of each locus centred by allele frequencies, and D is a diagonal matrix of weights for SNP variances, (with elements d _i = SNP i weight). In ssGBLUP model is set D = I, so that the weight of all SNPs is equal to 1.

In WssGBLUP method (Wang et al., Reference Wang, Misztal, Aguilar, Legarra and Muir2012) markers are assigned with different weights using an iterative process described with the following steps:

1. Set D ^(t) = I, when t = 1
2. The genomic relationship matrix is setup for t as ${\boldsymbol G}^{( t ) } = {\boldsymbol Z}{\boldsymbol D}^{( {\boldsymbol t} ) }{\boldsymbol {Z}^{\prime}}/( 2\;\sum p_i( {1-p_i} )$
3. The Genomic Breeding Value (GEBV or $\hat{\alpha }$) for t are obtained;
4. 4.The GEBV $\hat{\alpha }^{( t ) }$are converted to SNP effects ($\hat{u}$) as $\hat{u}^t = {\boldsymbol D}^{( t ) }{\boldsymbol {Z}^{\prime}}( {\boldsymbol G}^{( t ) }) ^{{-}1}\hat{\alpha }^{( t ) }$
5. The weight of the ith SNP (the ith element of D or d _i) is calculated;
6. SNP weights are normalized in D^(t+1) to have constant genetic variances of SNP effects;
7. Loop to step 2 three times and exit.

For the methods mentioned above, the program BLUPF90+ (Misztal et al., Reference Misztal, Tsuruta, Lourenço, Masuda, Aguilar, Legarra and Vitezica2014) was used for the estimation of the variance components and for the validation step. The estimation of the effects of the markers and the calculation of the weighting (WssGBLUP) were performed with the program POSTGSF90 (Misztal et al., Reference Misztal, Tsuruta, Lourenço, Masuda, Aguilar, Legarra and Vitezica2014). The heritability estimates for AFC were obtained from these estimated variance components.

Single-step Bayesian regression models (ssBR)

The ssBR models combine all available data from genotyped and non-genotyped animals and use imputed marker covariates for animals that are not genotyped (Fernando et al., Reference Fernando, Dekkers and Garrick2014), can be described as:

$$\left[{\matrix{ {y_n} \cr {y_g} \cr } } \right] = \left[{\matrix{ {{\boldsymbol X}_{\boldsymbol n}} \cr {{\boldsymbol X}_g} \cr } } \right]\beta + \left[{\matrix{ {{\boldsymbol Z}_n} & 0 \cr 0 & {{\boldsymbol Z}_g} \cr } } \right]\left[{\matrix{ {{\boldsymbol M}_n\alpha + \epsilon } \cr {{\boldsymbol M}_g\alpha \;\;\;\;\;\;\;} \cr } } \right] + e$$

where vectors and matrices for non-genotyped animals are denoted with the subscript n and for genotyped animals with the subscript g. Thus, y _n and y _g represent the vectors of phenotypic observations; β is the vector of systematic effects of GC (equivalent to the fixed effect in frequentist methods); X_n and X_g design matrices for the fixed effects; Z_n and Z_g represent incidence matrices associated with the genomic values of the animals; α, $\epsilon$ and e, represent in this order, the vectors of marker effects, imputation residual and model residual; M_g is the marker matrix for the genotyped animals and M_n is the imputed marker matrix for the non-genotyped animals. The matrix M_n can be obtained as:

$${\boldsymbol M}_n = {\boldsymbol A}_{ng}{\boldsymbol A}_{22}^{{-}1} {\boldsymbol M}_g$$

where A_ng represents the pedigree-based relationship matrix between genotyped and non-genotyped animals and; ${\boldsymbol A}_{2g2g}^{{-}1}$ represents the inverse of the pedigree-based relationship matrix between genotyped animals. The following ssBR methods were used: BayesA (ssBA- Meuwissen et al., Reference Meuwissen, Hayes and Goddard2001), BayesBπ (ssBBπ- Meuwissen et al., Reference Meuwissen, Hayes and Goddard2001), BayesCπ (ssBCπ- Habier et al., Reference Habier, Fernando, Kizilkaya and Garrick2011), Lasso Bayesian (ssBL- Yi and Xu, Reference Yi and Xu2008) and Bayesian ridge regression (ssBRR- Campos et al., Reference Campos, Hickey, Pong-Wong, Daetwyler and Calus2013), where the assumptions for markers effects are shown in Table 2. The hibayes package (Yin et al., Reference Yin, Zhang, Li, Zhao and Liu2022), available for the R program (R Core Team, 2021), was used to perform the ssBR analyses. A total of 350 000 samples and a burn-in period of 150 000 samples were generated. The convergence was evaluated using graphical analysis. The heritability estimates for AFC were also obtained using these models.

Table 2. Description of different prior for markers effects used in Bayesian single-step regression methods used for genomic analysis of age at first calving in dairy buffaloes

Validation of (G)EBVs

The predictive ability of the methods was assessed using accuracy and dispersion statistics proposed by Legarra and Reverter (Reference Legarra and Reverter2018). This methodology is based on predicting (G)EBV using full ($\hat{u} _w$) and a partial $( \hat{u} _p)$ datasets. The partial dataset (reference group) was obtained by omitting phenotypic records of genotyped animals born after 2010 (validation group). Thus, to assess the prediction ability of the models, only genotyped animals from the validation group were used to calculate the following statistics:

$$\matrix{ {{\rm Accuracy}} \hfill & {{\rm Dispersion}} \hfill \cr {r_{A\hat{A}} = \sqrt {\displaystyle{{{\rm cov}( {\hat{u}_w, \;\hat{u}_p} ) } \over {( {1-\bar{F}} ) \hat{\sigma }_g^2 }}} } \hfill & {b_{w, p} = \displaystyle{{{\rm cov}( {\hat{u}_w, \;\hat{u}_p} ) } \over {{\rm var}( {\hat{u}_p} ) }}} \hfill \cr } $$

where $\bar{F}$ is the average inbreeding coefficient of the validation individuals and $\hat{\sigma }_{g, i}^2$ is the estimate of additive genetic variance.

Results

In this study, the estimated heritability for AFC was 0.17 using BLUP-based methods and 0.16 to 0.18 using ssBR methods and, the estimates of the effects of the markers can be seen in Figure 1. Accuracy estimates for the genotyped animals ranged from 0.30 (ssBRR) to 0.39 (WssGBLUP) (Fig. 2). Estimates of accuracy with low magnitude were observed for non-genotyped animals, with values ranging from 0.29 (ssGBLUP) to 0.33 (ssBA). The PBLUP model presented accuracies of 0.30 and 0.29 for genotyped and non-genotyped animals, respectively.

Figure 1. Estimates of SNP marker effects (Manhattan plot) for the different methods employed (ssBA, Single step Bayes A; ssBπ, Single step Bayes Bπ; ssBCπ, Single step Bayes Cπ; ssBL, Single step Bayes Lasso; ssBRR, Single step Bayes Ridge Regression), and cluster analysis based on similarity between marker effects.

Figure 2. Estimates of accuracy and dispersion of genetic and genomic values using different single-step methods (ssBA, Single step Bayes A; ssBπ, Single step Bayes Bπ; ssBCπ, Single step Bayes Cπ; ssBL, Single step Bayes Lasso; ssBRR, Single step Bayes Ridge Regression) for the study of age at first calving in dairy buffaloes (genotyped animals).

Regarding the dispersion of the GEBV prediction (Fig. 2), ssBCπ (0.99) and WssGBLUP (1.00) obtained the best results considering only genotyped animals, while for non-genotyped animals only WssGBLUP (1.02) showed the best value, although it was also the higher than 1. The PBLUP model had the worst result with values straying 35% below from the reference value (1.0).

In the full dataset, Pearson's correlation between GEBVs and EBVs (predicted via PBLUP) ranged from 0.92 (ssBRR) to 0.93 (ssGBLUP), while for animals with both genotypic and phenotypic information, the correlation between GEBVs and EBVs ranged from 0.71 (ssBRR) to 0.77 (WssGBLUP). The correlations between all GEBVs predicted with the single-step genomic models were higher than 0.95.

Discussion

In this study, a heritability estimate obtained for AFC, had magnitude similar to those described in several studies with the buffalo species, with values ranging from 0.13 to 0.19 (Agudelo et al., Reference Agudelo-Gomez, Pineda-Sierra and Cerón-Muñoz2015; Thiruvenkadan et al., Reference Thiruvenkadan, Panneerselvam and Murali2015; Fernandes et al., Reference Fernandes, Marques, de Araujo Neto, de Oliveira, Hurtado-Lugo, Aspilcueta-Borquis and Tonhati2016; Araujo Neto et al., Reference Araujo Neto, Takada, Santos, Aspilcueta-Borquis, Cardoso, Nascimento, Leão, Oliveira and Tonhati2020a, Reference Araujo Neto, Santos, Fernandes Júnior, Aspilcueta-Borquis, Nascimento, Seno, Tonhati and Oliveira2020b). These results show that despite the greater information provided by the markers in the analysis, there was very little influence on the estimation of the genetic parameters, possibly due to the polygenic nature of the trait. There are few reports in the literature on comparison of genomic prediction for reproductive traits, and it is not yet possible to find studies that compared the predictive ability between different single-step methods for AFC. Moreover, several accuracy measures have been reported by different authors, which makes the comparison between their values even more difficult. For instance, Costa et al. (Reference Costa, Irano, Diaz, Takada, Hermisdorff, Carvalheiro, Baldi, Oliveira, Tonhati and Albuquerque2019), working only with genotyped Nellore breed animals and considering the correlation between GEBV and adjusted AFC (r _GEBV,Y*), estimated values with magnitudes similar to those found in our study with BayesCπ (0.31) and Bayes-Lasso (0.31), while Toghiani et al. (Reference Toghiani, Hay, Sumreddee, Geary, Rekaya and Roberts2017) reported lower values with BayesA (0.148) and BayesCπ (0.15), studying the same breed and using the same type of accuracy. In both cited studies, accuracy estimates were not presented for PBLUP model, which does not allow us to fully contrast with our findings. Possibly, for comparison purposes, checking the accuracy increments in relation to PBLUP is the best approach to discuss our results. Thus, all literature discussed about predictability were made based on the difference in performance between methods also used in our study.

Considering the percentage of accuracy increment in relation to PBLUP predictions in only genotyped animals, we observed three groups of models: increments less than 5% – ssBL and ssBRR; increments close 10% – ssBBπ, ssBCπ and ssGBLUP and; increments greater than 25% – ssBA and WssGBLUP. Estimates of accuracy increments using SNP markers with magnitudes similar to those we have found for AFC, have been described in the literature for productive traits in buffaloes (Cesarani et al., Reference Cesarani, Biffani, Garcia, Lourenco, Bertolini, Neglia, Misztal and Macciotta2021; Herrera et al., Reference Herrera, Flores, Duijvesteijn, Moghaddar and van der Werf2021).

For milk, fat and protein yield, greater increases in accuracy (r _GEBV,Y*) are presented by Herrera et al. (Reference Herrera, Flores, Duijvesteijn, Moghaddar and van der Werf2021) (13.04 to 76.47%) and Cesarani et al. (Reference Cesarani, Biffani, Garcia, Lourenco, Bertolini, Neglia, Misztal and Macciotta2021) (15.28 to 33.33%). Aspilcueta-Borquis et al. (Reference Aspilcueta-Borquis, Araujo Neto, Santos, Hurtado-Lugo, Silva and Tonhati2015), studying the same population of this study and ssGBLUP model, reported similar increase in the average accuracy (based on prediction error variance) of (G)EBVs, which ranged from 8.52 to 12.05% for several dairy traits. This gain in accuracy observed with the use of genomic information is due to the additional capture of both Mendelian sampling variations and relationships between animals (Christensen et al., Reference Christensen, Madsen, Nielsen, Ostersen and Su2012; Cesarani et al., Reference Cesarani, Garcia, Hidalgo, Degano, Vicario, Macciotta and Lourenco2020). The similarity in predictive performance between the different single-step genomic methods, however, could be related to the reduced number of animals evaluated, the small subset of genotyped animals analysed, and the polygenic nature of the AFC (Calus, Reference Calus2010; Campos et al., Reference Campos, Hickey, Pong-Wong, Daetwyler and Calus2013).

The proximity between ssBA and GRM-based model observed in this study (by the magnitude of estimated accuracies) may be explained by the polygenic nature of the trait and the robustness of ssBA method for different genetic architectures. Zhou et al. (Reference Zhou, Mrode, Zhang, Zhang, Li and Liu2018), studying the effects of QTL number on the accuracy prediction with simulated data, also verified that ssBA method was robust and performed similarly to ssGBLUP in scenarios with a large number of QTL.

The dispersion criterion is related to the degree of inflation or deflation of predictions, which was measured in our validation set as the slope of the regression of the GEBV obtained with the full dataset on GEBV from the partial dataset. Our results showed that the inclusion of genomic information reduces the dispersion of prediction compared to the traditional evaluations (PBLUP), regardless of the single-step model considered. The same improvement in prediction slopes by using genomic information was reported by Lázaro et al. (Reference Lázaro, Tonhati, Oliveira, Silva, Nascimento, Santos, Stefani and Brito2021) studying several milk-related traits with random regression models in the same buffalo population, while other authors, such as Gao et al. (Reference Gao, Koivula, Jensen, Strandén, Madsen, Pitkänen, Aamand and Mäntysaari2018) also analysing dairy traits in Finnish red dairy cattle, report no benefit. Moreover, the GEBVs predicted with WssGBPUP did not disperse in relation to what was expected for the subset of genotyped animals and presented little deflation for the prediction of non-genotyped animals. The low dispersion of GEBVs predicted with WssGBLUP compared to the predictions from other genomic models is probably due to the self-fitting of the SNP weights to the dataset (weighting steps), which regulates the scale of the additive effect to the full dataset previously (a priori).

The results indicate that the use of genomic information can improve the genetic gain for AFC by increasing the accuracy and reducing inflation/deflation of predictions of this trait compared to the traditional pedigree-based model. In addition, among all genomic single-step models studied, WssGBLUP and ssBA were the most advantageous models to be used for the genomic evaluation of AFC of buffaloes from this population.

Authors’ contributions

Jessica Cristina Gonçalves dos Santos: Methodology, Analysis , Writing – original draft.; Francisco Ribeiro de Araujo Neto: Conceptualization, Methodology, Statistical analysis, Writing – review; Gabriela Stefani Fernandez: Database; Daniel Jordan de Abreu Santos: Writing – review & editing; Felipe Pereira Cunha: Writing editing; Rusbel Raul Aspilcueta-Borquis: Statistical analysis; Humberto Tonhati: Conceptualization, Database, Supervision.

Funding statement

This research received no specific grant from any funding agency, commercial or not-for-profit sectors.

Competing interests

None.

Ethical standards

Not applicable (the work was developed with a database).

References

Agudelo-Gomez, D, Pineda-Sierra, S and Cerón-Muñoz, MF (2015) Genetic evaluation of dual-purpose buffaloes (Bubalus bubalis) in Colombia using principal component analysis. PLoS One 10, e0132811.CrossRef Google Scholar PubMed

Aguilar, I, Misztal, I, Johnson, DL, Legarra, A, Tsuruta, S and Lawlor, TJ (2010) Hot topic: a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. Journal of Dairy Science 93, 743–752. https://doi.org/10.3168/jds.2009-2730.CrossRef Google Scholar

Araujo Neto, FR, Takada, L, Santos, DJA, Aspilcueta-Borquis, RR, Cardoso, DF, Nascimento, AV, Leão, KM, Oliveira, HN and Tonhati, H (2020 a) Identification of genomic regions related to age at first calving and first calving interval in water buffalo using single-step GBLUP. Reproduction in Domestic Animals 55, 1565–1572. https://doi.org/10.1111/rda.13811.CrossRef Google Scholar PubMed

Araujo Neto, FR, Santos, DJA, Fernandes Júnior, GA, Aspilcueta-Borquis, RR, Nascimento, AVD, Seno, LO, Tonhati, H and Oliveira, HN (2020 b) Genome-wide association studies for growth traits in buffaloes using the single step genomic BLUP. Journal of Applied Genetics 61, 113–115. doi: 10.1007/s13353-019-00528-5.CrossRef Google Scholar PubMed

Araujo Neto, FR, Santos, JCG, Silva Arce, CD, Borquis, RRA, Santos, DJA, Guimarães, KC, Nascimento, AV, Oliveira, HN and Tonhati, H (2022) Genomic study of the resilience of buffalo cows to a negative energy balance. Journal of Applied Genetics 63, 379–388. doi: 10.1007/s13353-021-00680-x.CrossRef Google Scholar PubMed

Aspilcueta-Borquis, RR, Araujo Neto, FR, Santos, DJ, Hurtado-Lugo, NA, Silva, JA and Tonhati, H (2015) Multiple-trait genomic evaluation for milk yield and milk quality traits using genomic and phenotypic data in buffalo in Brazil. Genetics and Molecular Research 14, 18009–18017. doi: 10.4238/2015.December.22.27.CrossRef Google Scholar PubMed

Aspilcueta-Borquis, RR, Seno, LO, Araujo Neto, FR, Santos, DJA, Hurtado-Lugo, NA and Tonhati, H (2022) Lifetime productivity: genetic study of longevity and its associations with economically important traits in dairy buffaloes. Livestock Science 259, 104900. https://doi.org/10.1016/j.livsci.2022.104900.CrossRef Google Scholar

Calus, MPL (2010) Genomic breeding value prediction: methods and procedures. Animal 4, 157–164. https://doi.org/10.1017/S1751731109991352.CrossRef Google Scholar PubMed

Calus, MPL, Meuwissen, THE, De Roos, APW and Veerkamp, RF (2008) Accuracy of genomic selection using different methods to define haplotypes. Genetics 178, 553–561. https://doi.org/10.1534/genetics.107.080838.CrossRef Google Scholar PubMed

Camargo, GMF, Aspilcueta-Borquis, RR, Fortes, MRS, Porto-Neto, R, Cardoso, DF, Santos, DJA, Lehnert, SA, Reverter, A, Moore, SS and Tonhati, H (2015) Prospecting major genes in dairy buffaloes. BMC Genomics 16, 872. https://doi.org/10.1186/s12864-015-1986-2.CrossRef Google Scholar PubMed

Campos, G, Hickey, JM, Pong-Wong, R, Daetwyler, HD and Calus, MPL (2013) Whole-genome regression and prediction methods applied to plant and animal breeding. Genetics 193, 327–345. https://doi.org/10.1534/genetics.112.143313.CrossRef Google Scholar PubMed

Cesarani, A, Garcia, A, Hidalgo, J, Degano, L, Vicario, D, Macciotta, NPP and Lourenco, D (2020) Genomic information allows for more accurate breeding values for milk ability in dual-purpose Italian Simmental cattle. Journal of Dairy Science 104, 5719–5727. https://doi.org/10.3168/jds.2020-19838.CrossRef Google Scholar

Cesarani, A, Biffani, S, Garcia, A, Lourenco, D, Bertolini, G, Neglia, G, Misztal, I and Macciotta, NPP (2021) Genomic investigation of milk production in Italian buffalo. Italian Journal of Animal Science 20, 539–547. https://doi.org/10.1080/1828051X.2021.1902404.CrossRef Google Scholar

Christensen, OF, Madsen, P, Nielsen, B, Ostersen, T and Su, G (2012) Single-step methods for genomic evaluation in pigs. Animal 6, 1565–1571. https://doi.org/10.1017/S1751731112000742.CrossRef Google Scholar PubMed

Costa, RB, Irano, N, Diaz, IDPS, Takada, L, Hermisdorff, IC, Carvalheiro, R, Baldi, F, Oliveira, HN, Tonhati, H and Albuquerque, LG (2019) Prediction of genomic breeding values for reproductive traits in Nellore heifers. Theriogenology 125, 12–17. https://doi.org/10.1016/j.theriogenology.2018.10.014.CrossRef Google Scholar PubMed

Fernandes, PB, Marques, KO, de Araujo Neto, FR, de Oliveira, DP, Hurtado-Lugo, NA, Aspilcueta-Borquis, RR and Tonhati, H (2016) Genetic-quantitative study of the first-service pregnancy probability of Murrah heifers. Reproduction in Domestic Animals 51, 428–434.CrossRef Google Scholar PubMed

Fernando, RL, Dekkers, JC and Garrick, DJ (2014) A class of Bayesian methods to combine large numbers of genotyped and non-genotyped animals for whole-genome analyses. Genetics Selection Evolution 46, 50. https://doi.org/https://doi.org/10.1186/1297-9686-46-50.CrossRef Google Scholar PubMed

Gao, H, Koivula, M, Jensen, J, Strandén, I, Madsen, P, Pitkänen, T, Aamand, GP and Mäntysaari, EA (2018) Short communication: genomic prediction using different single-step methods in the Finnish red dairy cattle population. Journal of Dairy Science 101, 10082–10088. doi: 10.3168/jds.2018-14913.CrossRef Google Scholar PubMed

Habier, D, Fernando, RL, Kizilkaya, K and Garrick, DJ (2011) Extension of the Bayesian alphabet for genomic selection. BMC Bioinformatics 12, 186. https://doi.org/10.1186/1471-2105-12-186.CrossRef Google Scholar PubMed

Herrera, JRV, Flores, EB, Duijvesteijn, N, Moghaddar, N and van der Werf, JH (2021) Accuracy of genomic prediction for milk production traits in Philippine dairy buffaloes. Frontiers in Genetics 12, 682576. doi: 10.3389/fgene.2021.682576.CrossRef Google Scholar PubMed

Iamartino, D, Nicolazzi, EL, Van Tassell, CP, Reecy, JM, Fritz-Waters, ER, Koltes, JE, Biffani, S, Sonstegard, TS, Schroeder, SG, Ajmone-Marsan, P, Negrini, R, Pasquariello, R, Ramelli, P, Coletta, A, Garcia, JF, Ali, A, Ramunno, L, Cosenza, G, Oliveira, DAA, Drummond, MG, Bastianetto, E, Davassi, A, Pirani, A, Brew, F and Williams, JL (2017) Design and validation of a 90K SNP genotyping assay for the water buffalo (Bubalus bubalis). PLoS One 12, e0185220. https://doi.org/10.1371/journal.pone.0185220.CrossRef Google Scholar PubMed

Lázaro, SF, Tonhati, H, Oliveira, HR, Silva, AA, Nascimento, AV, Santos, DJA, Stefani, G and Brito, LF (2021) Genomic studies of milk-related traits in water buffalo (Bubalus bubalis) based on single-step genomic best linear unbiased prediction and random regression models. Journal of Dairy Science 104, 5768–5793. https://doi.org/10.3168/jds.2020-19534.CrossRef Google Scholar PubMed

Lee, J, Cheng, H, Garrick, D, Golden, B, Dekkers, J, Park, K, Lee, D and Fernando, R (2017) Comparison of alternative approaches to single-trait genomic prediction using genotyped and non-genotyped Hanwoo beef cattle. Genetics Selection Evolution 49, 2. https://doi.org/10.1186/s12711-016-0279-9.CrossRef Google Scholar PubMed

Legarra, A and Reverter, A (2018) Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method. Genetics Selection Evolution 50, 53. https://doi.org/10.1186/s12711-018-0426-6.CrossRef Google Scholar PubMed

Meuwissen, THE, Hayes, BJ and Goddard, ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829. doi: 10.1093/genetics/157.4.1819.CrossRef Google Scholar PubMed

Misztal, I, Tsuruta, S, Lourenço, D, Masuda, Y, Aguilar, I, Legarra, A and Vitezica, Z (2014) BLUPF90 family of programs. Disponível em: http://nce.ads.uga.edu/wiki/doku.php.Google Scholar

Misztal, I, Lourenco, D and Legarra, A (2020) Current status of genomic evaluation. Journal of Animal Science 98, skaa101. https://doi.org/10.1093/jas/skaa101.CrossRef Google Scholar PubMed

R Core Team (2021) R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.Google Scholar

Thiruvenkadan, AK, Panneerselvam, S and Murali, N (2015) Study on ages at first mating and calving of Murrah buffaloes in hot and humid climate of Tamil Nadu. India. Indian Journal of Animal Research 49, 591–594.Google Scholar

Toghiani, S, Hay, E, Sumreddee, P, Geary, TW, Rekaya, R and Roberts, AJ (2017) Genomic prediction of continuous and binary fertility traits of females in a composite beef cattle breed. Journal of Animal Science 95, 4787–4795. doi: 10.2527/jas2017.1944.CrossRef Google Scholar

VanRaden, PM (2008) Efficient methods to compute genomic predictions. Journal Dairy Science 91, 4414–4423.CrossRef Google Scholar PubMed

Wang, H, Misztal, I, Aguilar, I, Legarra, A and Muir, W (2012) Genome-wide association mapping including phenotypes from relatives without genotypes. Genetics Research 94, 73–83. doi: 10.1017/S0016672312000274.CrossRef Google Scholar PubMed

Yi, N and Xu, S (2008) Bayesian LASSO for quantitative trait loci mapping. Genetics 179, 1045–1055. doi: 10.1534/genetics.107.085589.CrossRef Google Scholar PubMed

Yin, L, Zhang, H, Li, X, Zhao, S and Liu, X (2022) Hibayes: an R package to fit individual-level, summary-level and single-step Bayesian regression models for genomic prediction and genome-wide association studies. bioRxiv. Accessed in December 2022. Available in: https://doi.org/10.1101/2022.02.12.480230.Google Scholar

Zhang, X, Lourenco, D, Aguilar, I, Legarra, A and Misztal, I (2016) Weighting strategies for single-step genomic BLUP: an iterative approach for accurate calculation of GEBV and GWAS. Frontiers in Genetics 7, 151. https://doi.org/10.3389/fgene.2016.00151.CrossRef Google Scholar PubMed

Zhou, L, Mrode, R, Zhang, S, Zhang, Q, Li, B and Liu, JF (2018) Factors affecting GEBV accuracy with single-step Bayesian models. Heredity 120, 100–109. doi: 10.1038/s41437-017-0010-9.CrossRef Google Scholar PubMed

Table 1. Description of age at first calving dataset used in this study

Table 2. Description of different prior for markers effects used in Bayesian single-step regression methods used for genomic analysis of age at first calving in dairy buffaloes

Article contents

Comparison of single-step methods for genomic prediction of age at first calving in dairy buffaloes

Abstract

Keywords

Introduction

Materials and methods

Dataset

PBLUP, ssGBLUP and WssGBLUP models

Single-step Bayesian regression models (ssBR)

Validation of (G)EBVs

Results

Discussion

Authors’ contributions

Funding statement

Competing interests

Ethical standards

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests