For our homework, we’ll be digging a little further into linkage disequilibrium (LD). Now, this is a sometimes difficult concept to grasp (even I needed a refresher in order to teach it), so we’ll be focusing on the assigned readings in this homework to hopefully working through this concept and fill any gaps in understanding left by leture and the lab.
As before, you’ll be turning in your homework via an online interface. I recommend writing your essays in a document on your laptop and only engaging with the online interface when you’re ready to turn in all of your completed answers.
According to Slatkin (2008), what is the definition of linkage disequilibrium (LD)?
Slatkin (2008) differentiates between what LD can tell at the genomic scale (i.e., across the genome), and at the gene-region scale (i.e., within a gene). What different information are these two types of LD telling us?
We already know what HWE (and deviations from it) is telling us about a particular locus. According to Slatkin (2008), how is LD telling us both similar - and different - things about the loci we’re investigating?
What is the effect of genetic drift (like population bottlenecks) on LD? Why is this important to for understanding human evolutionary history?
According to the Claiborne Stephens et al. (2001) article why are rare variants (with low MAF) important to understanding recent population processes?
According to the readings, will a haplotype be a region with high or low mutual LD between all the loci that comprise it?
According to the readings, are loci that are physically close to each other (i.e., only a few base pairs away) more likely to be in LD than loci that are further away from each other on the chromosome? Why might this be?
Based on the small D’ LD heatmap you created in Lab 3 for your 1000 Genomes population, which SNPs are linked? Please take a screenshot or save a PNG or PDF file of your heatmap and upload it as part of your answer. NOTE: If you’re having trouble interpreting the LD heatmaps, this link can help!
Take a look at the LD matrix you created in the last portion of Lab 3. List the SNPs that are in LD with each of our four SNPs of interest for Lab 3.
In the Ramos et al. (2012) paper, which SNP was found to be associated with obesity in a Brazilian population?
Did they find that SNP to be in linkage disequilibrium with the other SNPs of interest? Did you find the same results in your population?
According to Ramos et al. (2012), why is it important to know that those SNPs are not in linkage disequilibrium? HINT: read the Discussion!
The following questions are for those students registered for AN 733. Students registered for AN 333 may also answer these questions for extra credit.
According to Claiborne Stephens et al. (2001), what might explain their findings regarding LD across the human genome, and how does that have bearing on the discovery of SNPs associated with various phenotypes of interest?
What is the Hill-Robertson effect? According to Slatkin (2008), why is this an important phenomenon to understand and take into account when inferring the evolution of gene function?
What effect does selection have on LD? How might you use LD to detect natural selection in a genomic locus?
Use the UCP3 data you downloaded for Midterm 1 to create an LD heatmap for this gene region. Which, if any, SNPs in UCP3 among the Chinese Dai in Xishuangbanna, China are in perfect LD according to the D’ statistic (i.e., LD = 1)? If there are none, name three pairs of SNPs that are the closest to being in perfect LD.