The Royal Society
Browse
rsos160457_si_001.pdf (379.69 kB)

Supplementary Materials from Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering

Download (379.69 kB)
Version 2 2016-12-21, 12:05
Version 1 2016-12-12, 14:42
journal contribution
posted on 2016-12-12, 14:42 authored by Suresh Andrew Sethi, Daniel Linden, John Wenburg, Cara Lewis, Patrick Lemons, Angela Fuller, Matthew Hare
Supplementary Materials are combined into a single .pdf document, with the following contents: Supplement 1: Detail of the error-tolerant likelihood-based match calling and sample clustering approach Supplement 2: R script to implement the error-tolerant likelihood-based match calling model and sample clustering algorithms: MSATs Supplement 3: R script to implement the error-tolerant likelihood-based match calling model and sample clustering algorithms : SNPs Supplement 4: Case study genetic marker characteristics Supplement 5: Detailed base case simulation results. Error-tolerant likelihood-based match calling presents a promising technique to accurately identify recapture events in genetic mark–recapture studies by combining probabilities of latent genotypes and probabilities of observed genotypes, which may contain genotyping errors. Combined with clustering algorithms to group samples into sets of recaptures based upon pairwise match calls, these tools can be used to reconstruct accurate capture histories for mark–recapture modelling. Here, we assess the performance of a recently introduced error-tolerant likelihood-based match-calling model and sample clustering algorithm for genetic mark–recapture studies. We assessed both biallelic (i.e. single nucleotide polymorphisms; SNP) and multiallelic (i.e. microsatellite; MSAT) markers using a combination of simulation analyses and case study data on Pacific walrus (Odobenus rosmarus divergens) and fishers (Pekania pennanti). A novel two-stage clustering approach is demonstrated for genetic mark–recapture applications. First, repeat captures within a sampling occasion are identified. Subsequently, recaptures across sampling occasions are identified. The likelihood-based matching protocol performed well in simulation trials, demonstrating utility for use in a wide range of genetic mark–recapture studies. Moderately sized SNP (more than or equal to 64) and MSAT (10–15) panels produced accurate match calls for recaptures and accurate non-match calls for samples from closely related individuals in the face of low to moderate genotyping error. Furthermore, matching performance remained stable or increased as the number of genetic markers increased, genotyping error notwithstanding.

History

Usage metrics

    Royal Society Open Science

    Categories

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC