.TH "split_fasta" 1 "" "CCTools 7.1.2 FINAL" "Cooperative Computing Tools" .SH NAME .LP \fBsplit_fasta\fP - Split a fasta file according to sequence and character counts .SH SYNOPSIS .LP \FC\fBsplit_fasta query_granularity character_granularity fasta_file\fP\FT .SH DESCRIPTION .LP \fBsplit_fasta\fP is a simple script to split a fasta file according to user provided parameters. The script iterates over the given file, generating a new sub_file called input.i each time the contents of the previous file (input.(i-1)) exceed the number of queries given by query_granularity or the number of characters given by character_granularity. .SH OPTIONS .LP .LP .SH EXIT STATUS .LP On success, returns zero. On failure, returns non-zero. .SH ENVIRONMENT VARIABLES .LP .SH EXAMPLES .LP To split a fasta file smallpks.fa into pieces no larger than 500 queries and with no piece receiving additional sequences if it exceeds 10000 characters we would do: .fam C .nf .nh .IP "" 8 python split_fasta 500 10000 smallpks.fa .fi .hy .fam .P This would generate files input.0, input.1, ..., input.N where N is the number of appropriately constrained files necessary to contain all sequences in smallpks.fa. .SH COPYRIGHT .LP The Cooperative Computing Tools are Copyright (C) 2005-2019 The University of Notre Dame. This software is distributed under the GNU General Public License. See the file COPYING for details. .SH SEE ALSO .LP .IP \(bu 4 \fBCooperative Computing Tools Documentation\fP .IP \(bu 4 \fBMakeflow User Manual\fP .IP \(bu 4 \fBmakeflow(1)\fP \fBmakeflow_monitor(1)\fP \fBmakeflow_analyze(1)\fP \fBmakeflow_viz(1)\fP \fBmakeflow_graph_log(1)\fP \fBstarch(1)\fP \fBmakeflow_ec2_setup(1)\fP \fBmakeflow_ec2_cleanup(1)\fP