題:
解析帶註釋的VCF文件的工具或腳本
arupgsh
2017-08-11 12:44:08 UTC
view on stackexchange narkive permalink

我已經使用snpEff註釋了VCF文件,並正在尋找工具或腳本來解析VCF文件並清理該文件,以使生物學家可以解釋該文件。

一 回答:
Pierre
2017-08-11 13:04:59 UTC
view on stackexchange narkive permalink

(編輯)您可以使用 snpsift 過濾VCF註釋,我還編寫了 VcfFilterSequenceOntology http://lindenb.github.io/jvarkit /VcfFilterSequenceOntology.html

我寫了vcf2table: http://lindenb.github.io/jvarkit/VcfToTable.html它解碼了VEP和SNPeff註釋:

  >>chr1 / 10001 / T(n 1)變體+ -------- + -------------------- + |關鍵價值| + -------- + -------------------- + |鉻chr1 | (....)VEP + -------------------------- + ------ + -------- -------- + ------------ + ----------------- + -------- +- ----------------- + -------------------------------- --------------- + ------------- + --------- + ---------- ------- + ---------------------- + | PolyPhen | EXON | SIFT | ALLELE_NUM |基因|符號|蛋白質位置|後果|氨基酸|密碼子|功能|生物型| + -------------------------- + ------ + --------------- -+ ------------ + ----------------- + -------- + -------- ---------- + --------------------------------------- -------- + ------------- + --------- + ----------------- + ---------------------- + |可能損壞(0.956)| 8/9 |有害的(0)| 1 | ENSG00000102967 | DHODH | 346/395 | missense_variant |讀/寫| Cgg / Tgg | ENST00000219240 | protein_coding | | | 3/4 | | 1 | ENSG00000102967 | DHODH | | non_coding_exon_variant&nc_transcript_variant | | | ENST00000571392 | Reserved_intron | | | | | 1 | ENSG00000102967 | DHODH | |下游_基因_變異| | ENST00000572003 | Reserved_intron |
| | | | 1 | ENSG00000102967 | DHODH | |下游_基因_變異| | ENST00000573843 | Reserved_intron | | | | | 1 | ENSG00000102967 | DHODH | |下游_基因_變異| | ENST00000573922 |已處理的文字| | | | | 1 | ENSG00000102967 | DHODH | -/ 193 | intron_variant | | | ENST00000574309 | protein_coding | |可能損壞(0.946)| 8/9 |有害的(0)| 1 | ENSG00000102967 | DHODH | 344/393 | missense_variant |讀/寫| Cgg / Tgg | ENST00000572887 | protein_coding | + -------------------------- + ------ + --------------- -+ ------------ + ----------------- + -------- + -------- ---------- + --------------------------------------- -------- + ------------- + --------- + ----------------- + ---------------------- +基因型+ --------- + ------ + ------- + ---- + ---- + ----- + --------- + |樣品類型廣告| DP | GQ | GT | PL | + --------- + ------ + ------- + ---- + ---- + ----- + -------- -+ | M10475 | HET | 10,2 | 15 | 10 | 0/1 | 25,0,10 | | M10478 | HET | 10,4 | 16 | 5 | 0/1 | 40,0,5 | | M10500 | HET | 10,10 | 21 | 7 | 0/1 | 111,0,7 | | M128215 | HET | 15,5 | 24 | 0 | 0/1 | 49,0,0 | + --------- + ------ + ------- + ---- + ---- + ----- + -------- -+  


該問答將自動從英語翻譯而來。原始內容可在stackexchange上找到,我們感謝它分發的cc by-sa 3.0許可。
Loading...