| | 117 | |
| | 118 | === Filtering === |
| | 119 | In the '''ttpmh''' mode, !ImputationTool applies the following filtering between a study and a reference dataset: |
| | 120 | |
| | 121 | The filtering steps imputation tool does when comparing to reference: |
| | 122 | |
| | 123 | * assesses alleles and swaps SNP if needed |
| | 124 | ref: C/T |
| | 125 | GWAS: A/G --> needs to be swapped and inverted to become C/T |
| | 126 | * checks Hardy-Weinberg equilibrium <= 0.0001, MAF < 0.01, callrate < 0.95. If above threshold, SNP is removed |
| | 127 | * checks if SNP is present in reference data, if not, SNP is removed from GWAS data |
| | 128 | * checks if SNP has null alleles, if so, SNP is removed from GWAS data |
| | 129 | * checks if allele frequency is comparable to reference. If not (>25% difference), SNP is removed from GWAS data. |
| | 130 | * Assesses if the haplotype structure is comparable between reference and GWAS data. This is performed by pairwise comparison of r-squared between SNPs in both reference and GWAS. For SNPs in LD (r-squared > 0.1), the allele frequencies are compared. SNPs are removed from the GWAS data when the major allele differs more often than it is identical. |
| | 131 | |