.TH "makeflow_blast" 1 "" "CCTools 7.0.9 FINAL" "Cooperative Computing Tools" .SH NAME .LP \fBmakeflow_blast\fP - Generate a Makeflow to parallelize and distribute blastall jobs .SH SYNOPSIS .LP \FC\fBmakeflow_blast query_granularity character_granularity [blast_options]\fP\FT .SH DESCRIPTION .LP \fBmakeflow_blast\fP is a script to generate \fBmakeflow()\fP workflows to execute blastall jobs. Essentially, the script uses query_granularity (the maximum number of sequences per fasta file split) and character_granularity (the maximum number of characters per fasta file split) to determine how to break up the input fasta file. It then creates a makeflow that will execute a blastall with the desired parameters on each part and concatenate the results into the desired output file. For simplicity, all of the arguments following query_granularity and character_granularity are passed through as the options to \fBblastall()\fP. .PP \fBmakeflow_blast\fP executes a small test BLAST job with the user provided parameters in order to be sure that the given parameters are sane. It then calculates the number of parts the provided fasta input file will require, prints a makeflow rule to generate those parts using \fBsplit_fasta()\fP, and enumerates makeflow rules to execute blastall with the given parameters on each part. Subsequent rules to condense and clean the intermediate input and output are then produced. .PP \fBmakeflow_blast\fP expects a blastall in the path, and should be used from the directory containing the input files and databases. For distribution convenience, it is required that the files constituting a given BLAST database must be stored in a folder with the same name as that database. .SH OPTIONS .LP .LP .TP .BI \ -i \ . Specifiy the input fasta file for querying the BLAST database .TP .BI \ -o \ . Specify the output file for final results .TP .BI \ -d \ . Specify the BLAST database to be queried .SH EXIT STATUS .LP On success, returns zero. On failure, returns non-zero. .SH ENVIRONMENT VARIABLES .LP .SH EXAMPLES .LP To generate a makeflow to run blastall -p blastn on smallpks.fa and testdb, splitting smallpks.fa every 500 sequences or 10000 characters and placing the blast output into test.txt do: .fam C .nf .nh .IP "" 8 python makeflow_blast 500 10000 -i smallpks.fa -o test -d testdb/testdb -p blastn > Makeflow .fi .hy .fam .P You can then execute this workflow in a variety of distributed and parallel environments using the makeflow command. .SH COPYRIGHT .LP The Cooperative Computing Tools are Copyright (C) 2003-2004 Douglas Thain and Copyright (C) 2005-2015 The University of Notre Dame. This software is distributed under the GNU General Public License. See the file COPYING for details. .SH SEE ALSO .LP .IP \(bu 4 \fBCooperative Computing Tools Documentation\fP .IP \(bu 4 \fBMakeflow User Manual\fP .IP \(bu 4 \fBmakeflow(1)\fP \fBmakeflow_monitor(1)\fP \fBmakeflow_analyze(1)\fP \fBmakeflow_viz(1)\fP \fBmakeflow_graph_log(1)\fP \fBstarch(1)\fP \fBmakeflow_ec2_setup(1)\fP \fBmakeflow_ec2_cleanup(1)\fP