Variant annotation and prioritization

image-title-here

It has been found that most of the SNPs identified in GWAS are located in non coding region of the DNA, and yet they have an incidence on the trait they are associated to. A current hypothesis to explain this fact is that they are located in functional regions which can regulate gene expression (e.g. promoter, enhancer, Transcription Factor Binding Sites, regulators of chromatin opening). Therefore, we chose to use a variant prioritization approach based on DNA functional region matching, following Lu et al. method and prioritization tool “GenoWAP” [1,2,3]. This method integrates 22 genomic and epigenomic annotations to predict whole-genome functional annotation [1]. Then, it computes a score for each variant locus assessing its functionality given the annotations at this locus [3]. This scoring will prioritize variants and reveal the ones located in functional sites, thus having more chance to be causal variants for the trait. A visualization of prioritized GWAS signal compared to GWAS signal can be seen in the figure above.

Secondly, we used a declination of this approach, following Lu et al. method and tool “GenoSkyline” [2] which integrate 111 tissue specific epigenomic annotations to predict tissue-specific functional annotation. This allows to compute tissue-specific enrichment for GWAS signal, thus indicating in which tissue GWAS variants hit the most functional regions. This can identify relevant tissues and can further improve prioritization through integrating functional annotations of relevant tissue types.

Lu, Q. et al. (2015). A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data. Sci. Rep., 5, 10576.

Lu, Q., et al. (2016). Integrative tissue- specific functional annotations in the human genome provide novel insights on many complex traits and improve signal prioritization in genome wide association studies. PLoS Genet, 12(4).

Lu, Q. et al. (2015). GenoWAP: GWAS signal prioritization through integrated analysis of genomic functional annotation. Bioinformatics, 32(4), 542-548.

Thorvaldsdóttir, H. et al. (2013). Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. ‎Brief. Bioinform, 14(2), 178-192.