Description of the project
Imputation is a method to predict the alleles of missing single nucleotide polymorphisms (SNPs). Imputation methods use the linkage disequilibrium (LD) structure to impute the alleles of the hidden SNPs. The HapMap, where a large number of SNPs are genotyped, is usually used as a reference data set to infer the correlation structure between SNPs. Then, using the observed SNPs and the correlation structure between the observed SNPs and hidden SNPs, the alleles of the hidden SNPs can be predicted. A variety of imputation methods based on haplotype proxies or Hidden Markov Models (HMM) have been recently proposed. In this project, I will explore the various imputation methods and figure out how to develop a simple imputation algorithm. Then, I will implement the simple algorithm using the HapMap as the reference data set.
About me
I am a first-year PhD student in computer science.
Goal for end of quarter
To design and implement simple software for imputation.
Weekly schedule
Week 10
- Weekly progress
- Next week plan
- Grade for week: A
- Problems that came up
- Problems solved this week
Week 9
- Weekly progress
- Completed the 2nd version of the software, which is called Multi-SNP method
- Analyzed the single-snp and multi-snp imputation methods.
- Next week plan
- Prepare for the presentation
- Grade for week: A
- Problems that came up
- Problems solved this week
1% of Missingness | 5% of Missingness | 10% of Missingness |
Week 8
- Weekly progress
- Submitted the preliminary report.
- Completed the 1st version of the software, which is called Single-SNP method
- Next week plan
- Test and improve the software
- Grade for week: A
- Problems that came up
- Problems solved this week
Week 7
- Weekly progress
- finished the parsing module.
- started the correlation module.
- Next week plan
- Write a preliminary report.
- Finished the first version of the software.
- Grade for week: A
- Problems that came up
- Problems solved this week
Week 6
- Weekly progress
- Analyzed the HapMap data - Phasing data.
- Designed the data structure and started to write the module for parsing the input file.
- Next week plan
- Finish the parsing module.
- Design the correlation module.
- Grade for week: A
- Problems that came up
- Problems solved this week
Week 5
- Weekly progress
- Searched and reviewed papers related to imputation.
- Downloaded the HapMap data
- Next week plan
- Define the input format.
- Start to design a simple algorithm.
- Grade for week: A
- Problems that came up
- Problems solved this week
Week 4
- Weekly progress
- Chose a project topic.
- Wrote a project proposal
- Created a project page
- Next week plan
- Review literature
- Get the HapMap data
- Grade for week: A
- Problems that came up
- Problems solved this week