LBG - FS2024 – Exercise 1

Author

Peter von Rohr

Problem 1: Genotype Data

Use the dataset on genotypes of animals to compute a first set of descriptive statistics of the gene-locations. The dataset is available in comma-separated format at: https://charlotte-ngs.github.io/lbgfs2024/data/geno_first_100.csv

  • Use the function readr::read_delim() with the delimitor set to comma (delim = ",") to read the data
  • Use either functions unique() or a combination of levels() and as.factor() to determine which genotypes are available for which locus, start with the first three loci as examples

An alternative is to use factors

  • Compute genotype frequencies using the function table()