Homework: Introduction to the International Genome Sample Resource (formerly known as the 1000 Genomes Project)



Readings:





At this point, technical difficulties aside, we know a bit more about how to navigate the SCC and how to gather various kinds of data from Ensembl, so let’s put that knowledge to use with some practice!

This homework assignment is meant both to stretch the skills you built in the past two modules and to prepare you for what’s coming in the next module. If you can’t remember how to do something, check your Pre-Module slides/notes and Module 1.

To make things easier, I’ve also created an online interface where you can answer the questions.





Question 1 (25 points):

Go to the Ensembl web page for the human ACE2 gene, and look at the variant table.

Question 2 (50 points):

Go back to the variant table, and filter and sort it to find the Stop Gained variant with the highest minor allele frequency (MAF).
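If you would rather double-check your filtering and sorting outside the browser, here is a minimal Python sketch of the same logic. It assumes you have the variant table as CSV text, and it assumes the columns are named "Variant ID", "Conseq. Type" and "Global MAF"; check the header of your own file, since those names may differ. The rows below are made-up placeholders, not real ACE2 variants.

```python
import csv
import io

# Hypothetical rows standing in for a CSV copy of the variant table.
# The column names are assumptions; the variant IDs and MAFs are invented.
SAMPLE_CSV = """Variant ID,Conseq. Type,Global MAF
rs_example_1,missense variant,0.012
rs_example_2,stop gained,0.0004
rs_example_3,stop gained,-
rs_example_4,stop gained,0.0021
rs_example_5,synonymous variant,0.25
"""


def parse_maf(text):
    """Return the MAF as a float, or None when the cell is not a number."""
    try:
        return float(text)
    except ValueError:
        return None


def top_stop_gained(rows):
    """Return (variant_id, maf) for the stop gained row with the largest MAF."""
    best = None
    for row in rows:
        # Filter step: keep only rows whose consequence mentions "stop gained".
        if "stop gained" not in row["Conseq. Type"].lower():
            continue
        maf = parse_maf(row["Global MAF"])
        if maf is None:
            continue  # skip rows with no reported frequency
        # Sort step, done as a running maximum instead of a full sort.
        if best is None or maf > best[1]:
            best = (row["Variant ID"], maf)
    return best


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
print(top_stop_gained(rows))
```

To try it on your own table, replace SAMPLE_CSV with the contents of your file, or open the file and pass it to csv.DictReader directly.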

Question 3 (25 points):

We should probably get a little more practice using tabix and vcftools to download data into our SCC space.
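As a reminder of what those commands tend to look like, here is a small Python sketch that only assembles the command lines and prints them; it does not run anything or download any data. The URL, region, file name and output prefix are hypothetical placeholders, so swap in the ones from your notes. The flags shown (tabix -h to keep the VCF header, and vcftools --gzvcf, --freq and --out) are one common combination, not necessarily the exact one the assignment expects.

```python
import shlex


def tabix_command(vcf_url, region):
    """Build a tabix call that extracts one region and keeps the header (-h)."""
    return ["tabix", "-h", vcf_url, region]


def vcftools_command(vcf_path, out_prefix):
    """Build a vcftools call that reports allele frequencies for a gzipped VCF."""
    return ["vcftools", "--gzvcf", vcf_path, "--freq", "--out", out_prefix]


# Hypothetical placeholders: not a real 1000 Genomes URL or the real gene region.
EXAMPLE_URL = "https://example.org/path/to/chrX.vcf.gz"
EXAMPLE_REGION = "X:1000000-1100000"

tabix_cmd = tabix_command(EXAMPLE_URL, EXAMPLE_REGION)
vcftools_cmd = vcftools_command("my_region.vcf.gz", "my_region")

# Print shell-ready strings to copy into an SCC terminal.
print(shlex.join(tabix_cmd))
print(shlex.join(vcftools_cmd))
```

On the command line, tabix writes the extracted region to standard output, so you would normally redirect it into a file in your SCC space before handing that file to vcftools.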