.\" DO NOT MODIFY THIS FILE! It was generated by help2man 1.47.8. .TH HHFILTER "1" "February 2019" "hhfilter 3.0~beta3+dfsg" "User Commands" .SH NAME hhfilter \- filter an alignment by maximum sequence identity of match states and minimum coverage .SH SYNOPSIS .B hhfilter \fI\,\-i infile \-o outfile \/\fR[\fI\,options\/\fR] .SH DESCRIPTION HHfilter 3.0.0 (15\-03\-2015) Filter an alignment by maximum pairwise sequence identity, minimum coverage, minimum sequence identity, or score per column to the first (seed) sequence.n(C) Johannes Soeding, Michael Remmert, Andreas Biegert, Andreas Hauser Remmert M, Biegert A, Hauser A, and Soding J. HHblits: Lightning\-fast iterative protein sequence searching by HMM\-HMM alignment. Nat. Methods 9:173\-175 (2011). .TP \fB\-i\fR read input file in A3M/A2M or FASTA format .TP \fB\-o\fR write to output file in A3M format .TP \fB\-a\fR append to output file in A3M format .SH OPTIONS .TP \fB\-v\fR verbose mode: 0:no screen output 1:only warings 2: verbose .TP \fB\-id\fR [0,100] maximum pairwise sequence identity (%) (def=90) .TP \fB\-diff\fR [0,inf[ filter MSA by selecting most diverse set of sequences, keeping at least this many seqs in each MSA block of length 50 (def=0) .TP \fB\-cov\fR [0,100] minimum coverage with query (%) (def=0) .TP \fB\-qid\fR [0,100] minimum sequence identity with query (%) (def=0) .TP \fB\-qsc\fR [0,100] minimum score per column with query (def=\-20.0) .TP \fB\-neff\fR [1,inf] target diversity of alignment (default=off) .SS "Input alignment format:" .TP \fB\-M\fR a2m use A2M/A3M (default): upper case = Match; lower case = Insert; \&'\-' = Delete; '.' = gaps aligned to inserts (may be omitted) .TP \fB\-M\fR first use FASTA: columns with residue in 1st sequence are match states .TP \fB\-M\fR [0,100] use FASTA: columns with fewer than X% gaps are match states .PP Example: hhfilter \fB\-id\fR 50 \fB\-i\fR d1mvfd_.a2m \fB\-o\fR d1mvfd_.fil.a2m