Microsatellite markers (MS) have been widely used for parentage verification in most of the livestock species over the past decades mainly due to their high polymorphic information content. In the genomic era, the spread of genotype information as single-nucleotide polymorphism (SNP) has raised the question to effectively use SNPs also for parentage testing. Despite the clear advantages of SNP panels in terms of cost, accuracy, and automation, the transition from MS to SNP markers for parentage verification is still very slow and, so far, only routinely applied in cattle. A major difficulty during this transition period is the need of SNP data for parents and offspring, which in most cases is not yet feasible due to the genotyping cost. To overcome the unavailability of same genotyping platform during the transition period, in this study we aimed to assess the feasibility of a MS imputation pipeline from SNPs in four native sheep dairy breeds: Comisana (N = 331), Massese (N = 210), Delle Langhe (N = 59) and Sarda (N = 1003). Those sheep were genotyped for 11 MS and with the Ovine SNP50 Bead Chip. Prior to imputation, a quality control (QC) was performed, and SNPs located within a window of 2 Mb from each MS were selected. The core of the developed pipeline was made up of three steps: (a) storing of both MS and SNP data in a Variant Call Format file, (b) masking MS information in a random sample of individuals (10%), (c) imputing masked MS based on non-missing individuals (90%) using an imputation program. The feasability of the proposed methodology was assessed also among different training - testing split ratio, population size, number of flanking SNPs as well as within and among breeds. The accuracy of the MS imputation was assessed based on the genotype concordance as well as at parentage verification level in a subset of animals in which assigned parents' MS were available. A total of 8 MS passed the QC, and 505 SNPs were located within the +/- 2 Mb window from each MS, with an average of 63 SNPs per MS. The results were encouraging since when excluding the worst imputed MS (OARAE129), and regardless on the analyses performed (within and across breeds) for all breeds, we achieved an overall concordance rate over 94%. In addition, on average, the imputed offspring MS resulted in equivalent parentage outcome in 94% of the cases when compared to verification using original MS, highlighting both the feasibility and the eventual practical advantage of using this imputation pipeline.
Microsatellite imputation using SNP data for parentage verification in four Italian sheep breeds / Ablondi, Michela; Stocco, Giorgia; Cortellari, Matteo; Carta, Antonello; Summer, Andrea; Negro, Alessio; Grande, Silverio; Crepaldi, Paola; Cipolat-Gotet, Claudio; Biffani, Stefano. - In: JOURNAL OF ANIMAL BREEDING AND GENETICS. - ISSN 0931-2668. - (2024). [10.1111/jbg.12839]
Microsatellite imputation using SNP data for parentage verification in four Italian sheep breeds
Ablondi, Michela;Stocco, Giorgia;Summer, Andrea;Crepaldi, Paola;Cipolat-Gotet, Claudio
;Biffani, Stefano
2024-01-01
Abstract
Microsatellite markers (MS) have been widely used for parentage verification in most of the livestock species over the past decades mainly due to their high polymorphic information content. In the genomic era, the spread of genotype information as single-nucleotide polymorphism (SNP) has raised the question to effectively use SNPs also for parentage testing. Despite the clear advantages of SNP panels in terms of cost, accuracy, and automation, the transition from MS to SNP markers for parentage verification is still very slow and, so far, only routinely applied in cattle. A major difficulty during this transition period is the need of SNP data for parents and offspring, which in most cases is not yet feasible due to the genotyping cost. To overcome the unavailability of same genotyping platform during the transition period, in this study we aimed to assess the feasibility of a MS imputation pipeline from SNPs in four native sheep dairy breeds: Comisana (N = 331), Massese (N = 210), Delle Langhe (N = 59) and Sarda (N = 1003). Those sheep were genotyped for 11 MS and with the Ovine SNP50 Bead Chip. Prior to imputation, a quality control (QC) was performed, and SNPs located within a window of 2 Mb from each MS were selected. The core of the developed pipeline was made up of three steps: (a) storing of both MS and SNP data in a Variant Call Format file, (b) masking MS information in a random sample of individuals (10%), (c) imputing masked MS based on non-missing individuals (90%) using an imputation program. The feasability of the proposed methodology was assessed also among different training - testing split ratio, population size, number of flanking SNPs as well as within and among breeds. The accuracy of the MS imputation was assessed based on the genotype concordance as well as at parentage verification level in a subset of animals in which assigned parents' MS were available. A total of 8 MS passed the QC, and 505 SNPs were located within the +/- 2 Mb window from each MS, with an average of 63 SNPs per MS. The results were encouraging since when excluding the worst imputed MS (OARAE129), and regardless on the analyses performed (within and across breeds) for all breeds, we achieved an overall concordance rate over 94%. In addition, on average, the imputed offspring MS resulted in equivalent parentage outcome in 94% of the cases when compared to verification using original MS, highlighting both the feasibility and the eventual practical advantage of using this imputation pipeline.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.