Scroll to navigation

VCF_FILTER(1) User Commands VCF_FILTER(1)


vcf_filter - Filter a VCF file


vcf_filter [-h] [--no-short-circuit] [--no-filtered] [--output OUTPUT] [--local-script LOCAL_SCRIPT] input filter [filter_args] [filter [filter_args]] ...


This script is part of PyVCF.


positional arguments:

File to process (use - for STDIN) (default: None)

optional arguments:

Show this help message and exit. (default: False)
Do not stop filter processing on a site if any filter is triggered (default: False)
Filename to output [STDOUT] (default: <_io.TextIOWrapper name='<stdout>' mode='w' encoding='ANSI_X3.4-1968'>)
Output only sites passing the filters (default: False)
Python file in current working directory with the filter classes (default: None)


Filters sites with only low quality variants. It is possible to have a high site quality with many low quality calls. This filter demands at least one call be above a threshold quality.
Filter sites with no genotypes above this quality (default: 50)


Choose only SNP variants


Threshold read depth per sample
Minimum required coverage in each sample (default: 5)


Threshold average read depth per sample (read_depth / sample_count)
Minimum required average coverage per sample (default: 3)


Filter sites that look like correlated sequencing errors. Some sequencing technologies, notably pyrosequencing, produce mutation hotspots where there is a constant level of noise, producing some reference and some heterozygote calls. This filter computes a Bayes Factor for each site by comparing the binomial likelihood of the observed allelic depths under: * A model with constant error equal to the MAF. * A model where each sample is the ploidy reported by the caller. The test value is the log of the bayes factor. Higher values are more likely to be errors. Note: this filter requires rpy2
Filter sites above this error log odds ratio (default: -10)


Filter low quailty sites
Filter sites below this quality (default: 30)
October 2015 0.6.7