.\" Automatically generated by Pod::Man 4.10 (Pod::Simple 3.35) .\" .\" Standard preamble: .\" ======================================================================== .de Sp \" Vertical space (when we can't use .PP) .if t .sp .5v .if n .sp .. .de Vb \" Begin verbatim text .ft CW .nf .ne \\$1 .. .de Ve \" End verbatim text .ft R .fi .. .\" Set up some character translations and predefined strings. \*(-- will .\" give an unbreakable dash, \*(PI will give pi, \*(L" will give a left .\" double quote, and \*(R" will give a right double quote. \*(C+ will .\" give a nicer C++. Capital omega is used to do unbreakable dashes and .\" therefore won't be available. \*(C` and \*(C' expand to `' in nroff, .\" nothing in troff, for use with C<>. .tr \(*W- .ds C+ C\v'-.1v'\h'-1p'\s-2+\h'-1p'+\s0\v'.1v'\h'-1p' .ie n \{\ . ds -- \(*W- . ds PI pi . if (\n(.H=4u)&(1m=24u) .ds -- \(*W\h'-12u'\(*W\h'-12u'-\" diablo 10 pitch . if (\n(.H=4u)&(1m=20u) .ds -- \(*W\h'-12u'\(*W\h'-8u'-\" diablo 12 pitch . ds L" "" . ds R" "" . ds C` "" . ds C' "" 'br\} .el\{\ . ds -- \|\(em\| . ds PI \(*p . ds L" `` . ds R" '' . ds C` . ds C' 'br\} .\" .\" Escape single quotes in literal strings from groff's Unicode transform. .ie \n(.g .ds Aq \(aq .el .ds Aq ' .\" .\" If the F register is >0, we'll generate index entries on stderr for .\" titles (.TH), headers (.SH), subsections (.SS), items (.Ip), and index .\" entries marked with X<> in POD. Of course, you'll have to process the .\" output yourself in some meaningful fashion. .\" .\" Avoid warning from groff about undefined register 'F'. .de IX .. .nr rF 0 .if \n(.g .if rF .nr rF 1 .if (\n(rF:(\n(.g==0)) \{\ . if \nF \{\ . de IX . tm Index:\\$1\t\\n%\t"\\$2" .. . if !\nF==2 \{\ . nr % 0 . nr F 2 . \} . \} .\} .rr rF .\" .\" Accent mark definitions (@(#)ms.acc 1.5 88/02/08 SMI; from UCB 4.2). .\" Fear. Run. Save yourself. No user-serviceable parts. . \" fudge factors for nroff and troff .if n \{\ . ds #H 0 . ds #V .8m . ds #F .3m . ds #[ \f1 . 
ds #] \fP .\} .if t \{\ . ds #H ((1u-(\\\\n(.fu%2u))*.13m) . ds #V .6m . ds #F 0 . ds #[ \& . ds #] \& .\} . \" simple accents for nroff and troff .if n \{\ . ds ' \& . ds ` \& . ds ^ \& . ds , \& . ds ~ ~ . ds / .\} .if t \{\ . ds ' \\k:\h'-(\\n(.wu*8/10-\*(#H)'\'\h"|\\n:u" . ds ` \\k:\h'-(\\n(.wu*8/10-\*(#H)'\`\h'|\\n:u' . ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'^\h'|\\n:u' . ds , \\k:\h'-(\\n(.wu*8/10)',\h'|\\n:u' . ds ~ \\k:\h'-(\\n(.wu-\*(#H-.1m)'~\h'|\\n:u' . ds / \\k:\h'-(\\n(.wu*8/10-\*(#H)'\z\(sl\h'|\\n:u' .\} . \" troff and (daisy-wheel) nroff accents .ds : \\k:\h'-(\\n(.wu*8/10-\*(#H+.1m+\*(#F)'\v'-\*(#V'\z.\h'.2m+\*(#F'.\h'|\\n:u'\v'\*(#V' .ds 8 \h'\*(#H'\(*b\h'-\*(#H' .ds o \\k:\h'-(\\n(.wu+\w'\(de'u-\*(#H)/2u'\v'-.3n'\*(#[\z\(de\v'.3n'\h'|\\n:u'\*(#] .ds d- \h'\*(#H'\(pd\h'-\w'~'u'\v'-.25m'\f2\(hy\fP\v'.25m'\h'-\*(#H' .ds D- D\\k:\h'-\w'D'u'\v'-.11m'\z\(hy\v'.11m'\h'|\\n:u' .ds th \*(#[\v'.3m'\s+1I\s-1\v'-.3m'\h'-(\w'I'u*2/3)'\s-1o\s+1\*(#] .ds Th \*(#[\s+2I\s-2\h'-\w'I'u*3/5'\v'-.3m'o\v'.3m'\*(#] .ds ae a\h'-(\w'a'u*4/10)'e .ds Ae A\h'-(\w'A'u*4/10)'E . \" corrections for vroff .if v .ds ~ \\k:\h'-(\\n(.wu*9/10-\*(#H)'\s-2\u~\d\s+2\h'|\\n:u' .if v .ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'\v'-.4m'^\v'.4m'\h'|\\n:u' . \" for low resolution devices (crt and lpr) .if \n(.H>23 .if \n(.V>19 \ \{\ . ds : e . ds 8 ss . ds o a . ds d- d\h'-1'\(ga . ds D- D\h'-1'\(hy . ds th \o'bp' . ds Th \o'LP' . ds ae ae . ds Ae AE .\} .rm #[ #] #H #V #F C .\" ======================================================================== .\" .IX Title "PREDICTPROTEIN 1" .TH PREDICTPROTEIN 1 "2019-08-02" "1.1.09" "User Commands" .\" For nroff, turn off justification. Always turn off hyphenation; it makes .\" way too many mistakes in technical documents. 
.if n .ad l .nh .SH "NAME" predictprotein \- analyse protein sequence .SH "SYNOPSIS" .IX Header "SYNOPSIS" predictprotein [\-\-blast\-processors] [\-\-num\-cpus|c] [\-\-debug|d] [\-\-help] [\-\-make\-file|m] [\-\-makedebug] [\-\-man] [\-\-method] [\-\-dryrun|n] [\-\-numresmax] [\-\-output\-dir|o] [\-\-print\-ext\-method\-map] [\-\-profnumresmin] [\-\-psicexe] [\-\-prot\-name|p] [\-\-sequence|seq|s] [\-\-seqfile] [\-\-spkeyidx] [\-\-target]* [\-\-version|v] [\-\-work\-dir|w] .PP predictprotein [\-\-bigblastdb] [\-\-big80blastdb] [\-\-pfam2db] [\-\-pfam3db] [\-\-prodomblastdb] [\-\-prositedat] [\-\-prositeconvdat] [\-\-swissblastdb] .PP predictprotein [\-\-setacl|acl] [\-\-cache\-merge] [\-\-force\-cache\-store] [\-\-use\-cache] .SH "DESCRIPTION" .IX Header "DESCRIPTION" predictprotein runs a set of protein sequence analysis methods: .SS "Standard methods" .IX Subsection "Standard methods" These methods are run by the default target 'all': .PP .Vb 10 \& Feature Target Extension Man page \& \-\-\-\-\-\-\- \-\-\-\-\-\- \-\-\-\-\-\-\-\-\- \-\-\-\-\-\-\-\- \& atom mobility profbval profbval, profb4snap profbval(1) \& bacterial transmem\- proftmb proftmb, proftmbdat proftmb(1) \& brane beta barrels \& coiled\-coils coiledcoils coils, coils_raw coils\-wrap(1) \& ncoils(1) \& disulfide bridges disulfinder disulfinder disulfinder(1) \& Gene Ontology terms metastudent metastudent.BPO.txt, metastudent(1) \& metastudent.CCO.txt, \& metastudent.MFO.txt \& local alignment blast blastPsiOutTmp, chk, blastpgp(1) \& blastPsiMat, \& blastPsiAli, \& blastpSwissM8 blastall(1) \& local complexity ncbi\-seg segNorm, segNormGCG ncbi\-seg(1) \& non\-regular secondary norsp nors, sumNors norsp(1) \& structure \& nuclear localization predictnls nls, nlsDat, nlsSum predictnls(1) \& Pfam scan hmmer v2 hmm2pfam hmm2pfam hmm2pfam(1) \& Pfam scan hmmer v3 hmm3pfam hmm3pfam, hmm3pfamTbl, hmmscan(1) \& hmm3pfamDomTbl \& PROSITE scan prosite prosite prosite_scan(1) \& protein\-protein profisis isis 
profisis(1) \& interaction sites \& secondary structure, prof profRdb prof(1) \& accessibility from \& sequence profile \& secondary structure, prof prof1Rdb prof(1) \& accessibility from \& single sequence \& secondary structure, reprof reprof reprof(1) \& accessibility from \& single sequence \& transmembrane phd phdPred, phdRdb prof(1) \& helices \& unstructured loops norsnet norsnet norsnet(1) .Ve .SS "Optional methods" .IX Subsection "Optional methods" These methods are non-redistributable or depend on non-redistributable software (indicated by '*'). You have to acquire the non-redistributable components yourself before you can use these methods. .PP These methods are run by the target 'optional'. .PP .Vb 10 \& Feature Target Extension Man page \& \-\-\-\-\-\-\- \-\-\-\-\-\- \-\-\-\-\-\-\-\-\- \-\-\-\-\-\-\-\- \& disordered regions metadisorder mdisorder metadisorder(1) \& subcellular loctree3 {arch,bact,euka}.lc3 loctree3(1) \& tmhmm* tmhmm n.a. \& protein\-RNA, somena somena somena(1) \& protein\-DNA \& interaction sites \& DNA\- , RNA\- and prona prona prona2019(1) \& Protein\-binding \& protein and binding \& sites \& position\-specific psic* psic, clustalngz psic(1), \& independent counts runNewPSIC(1), \& and its base multi\- clustalw(1) \& ple alignment \& transmembrane helices tmhmm tmhmm n.a. 
\& tmseg tmseg tmseg(1) \& functional regions consurf _consurf.grades consurf(1) .Ve .SS "Resources" .IX Subsection "Resources" .Vb 9 \& Database Cmd line argument \& \-\-\-\-\-\-\-\- \-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\- \& big (Uniprot+PDB) blast database \-\-bigblastdb \& big_80 (big @ 80% sequence identity \-\-big80blastdb \& redundancy level) blast database \& swiss blast database \-\-swissblastdb \& pfam v2 database \-\-pfam2db \& pfam v3 database \-\-pfam3db \& prosite_convert.dat \-\-prositeconvdat .Ve .PP \fIResources for optional targets\fR .IX Subsection "Resources for optional targets" .PP .Vb 6 \& Database Cmd line argument \& \-\-\-\-\-\-\-\- \-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\- \& big (Uniprot+PDB) blast database \-\-bigblastdb \& prosite.dat \-\-prositedat \& Swiss\-Prot keyword\-to\-accession \-\-spkeyidx \& \*(Aqindex\*(Aq for loctree .Ve .SS "Generating Resources" .IX Subsection "Generating Resources" Courtesy of Wiktor Jurkowski: .PP .Vb 3 \& * rostlab\-data\-prosite_convert prosite.dat prosite_convert.dat \& * perl /usr/share/loctree/perl/keyindex4loctree.pl < keyindex.txt > keyindex_loctree.txt \& * hmmpress Pfam\-A.hmm .Ve .SS "Output format" .IX Subsection "Output format" Method outputs are deposited into \fB\-\-output\-dir\fR. Each method has one or more file name extensions associated with it, see the table above. Refer to the man page of the individual methods for further details. Extensions ending with `gz' are compressed with \fBgzip\fR\|(1). .SH "REFERENCES" .IX Header "REFERENCES" .IP "Rost, B., Yachdav, G., and Liu, J. (2004). The PredictProtein server. Nucleic Acids Res, 32(Web Server issue), W321\-6." 4 .IX Item "Rost, B., Yachdav, G., and Liu, J. (2004). The PredictProtein server. Nucleic Acids Res, 32(Web Server issue), W321-6." 
.PP In case you find predictprotein and the tools within useful please cite: .PP * the references for PredictProtein, see above .PP * the references for the tools you used, see \s-1REFERENCES\s0 on the man page of the tool .SH "OPTIONS" .IX Header "OPTIONS" .IP "\fB\-\-blast\-processors\fR" 4 .IX Item "--blast-processors" Number of processors to use, default = 1 .IP "\fB\-c\fR, \fB\-\-num\-cpus\fR" 4 .IX Item "-c, --num-cpus" Number of make jobs, default = 1 .IP "\fB\-d\fR, \fB\-\-debug\fR" 4 .IX Item "-d, --debug" .PD 0 .IP "\fB\-\-help\fR" 4 .IX Item "--help" .PD Print a brief help message and exit. .IP "\fB\-m\fR, \fB\-\-make\-file\fR" 4 .IX Item "-m, --make-file" make file to use, default = /usr/share/predictprotein/MakefilePP.mk .IP "\fB\-\-makedebug\fR" 4 .IX Item "--makedebug" debug argument for make, see \fBmake\fR\|(1) .IP "\fB\-\-man\fR" 4 .IX Item "--man" This documentation page .IP "\fB\-\-method\fR" 4 .IX Item "--method" Describes method control parameters and requests methods to run when \fB\-\-target\fR is not \fIall\fR. Format example: .Sp .Vb 1 \& \-\-method=norsp,win=50 .Ve .Sp * begin with the method name, e.g. `norsp' .Sp * list method control parameters, e.g. win=50 .Sp Not all methods support passing control parameters in this way due to their primitive command line interfaces. .IP "\fB\-n\fR, \fB\-\-dryrun\fR" 4 .IX Item "-n, --dryrun" Do not execute, just show what is about to be run .IP "\fB\-\-numresmax\fR" 4 .IX Item "--numresmax" Maximum sequence length, default: \fI6000\fR. Sequences longer than this will make predictprotein fail with the respective error code, see \s-1ERRORS\s0. .IP "\fB\-o\fR, \fB\-\-output\-dir\fR" 4 .IX Item "-o, --output-dir" Final location of output files, required unless caching is used. .IP "\fB\-\-print\-ext\-method\-map\fR" 4 .IX Item "--print-ext-method-map" Print extension-to-method map. Useful as input file for consistency checkers. Format: . 
.IP "\fB\-\-profnumresmin\fR" 4 .IX Item "--profnumresmin" Minimum sequence length required by prof, default: \fI17\fR. Sequences shorter than this will make predictprotein fail with the respective error code, see \s-1ERRORS\s0. .IP "\fB\-\-psicexe\fR" 4 .IX Item "--psicexe" psic wrapper executable, default: /usr/share/rost\-runpsic/runNewPSIC.pl .IP "\fB\-p\fR, \fB\-\-prot\-name\fR" 4 .IX Item "-p, --prot-name" Base name of result files and protein name in \- for example \- \s-1FASTA\s0 files. Default = `query'. .Sp Valid names are of the character set \f(CW\*(C`[[:alnum:]._\-]\*(C'\fR. .IP "\fB\-s\fR, \fB\-\-seq\fR, \fB\-\-sequence\fR" 4 .IX Item "-s, --seq, --sequence" one letter amino acid sequence input .IP "\fB\-\-seqfile\fR" 4 .IX Item "--seqfile" \&\s-1FASTA\s0 amino acid sequence file; if `\-', standard input is read .IP "\fB\-\-spkeyidx\fR" 4 .IX Item "--spkeyidx" Swiss-Prot keyword-to-identifier 'index' file for \fBloctree\fR\|(1). .IP "\fB\-\-target\fR=\fIstring\fR" 4 .IX Item "--target=string" Method groups to run. Give this argument for each target you need. Default: the value of `default_targets' in the configuration file; `all' if that is not given. .Sp Some targets of interest: .RS 4 .IP "\fIall\fR" 4 .IX Item "all" methods that are \s-1GPL\s0 or redistributable to non-commercial entities .IP "\fIoptional\fR" 4 .IX Item "optional" methods that do not fit into \fIall\fR .RE .RS 4 .Sp Look at /usr/share/predictprotein/MakefilePP.mk for a list of targets (\*(L"Use the source Luke\*(R"). 
.RE .IP "\fB\-v\fR, \fB\-\-version\fR" 4 .IX Item "-v, --version" Print package version .IP "\fB\-w\fR, \fB\-\-work\-dir\fR" 4 .IX Item "-w, --work-dir" Working directory, optional .SS "Database options" .IX Subsection "Database options" .IP "\fB\-\-bigblastdb\fR" 4 .IX Item "--bigblastdb" Path to comprehensive blast database .IP "\fB\-\-big80blastdb\fR" 4 .IX Item "--big80blastdb" Path to comprehensive blast database at 80% sequence identity redundancy level .IP "\fB\-\-pfam2db\fR" 4 .IX Item "--pfam2db" Pfam v2 database, e.g. \fIPfam_ls\fR .IP "\fB\-\-pfam3db\fR" 4 .IX Item "--pfam3db" Pfam v3 database, e.g. \fIPfam\-A.hmm\fR .IP "\fB\-\-prodomblastdb\fR" 4 .IX Item "--prodomblastdb" Obsolete. This argument is kept only to maintain compatibility with older versions. .IP "\fB\-\-prositedat\fR" 4 .IX Item "--prositedat" Path to `prosite.dat' file, see .IP "\fB\-\-prositeconvdat\fR" 4 .IX Item "--prositeconvdat" Path to `prosite_convert.dat' file, see .IP "\fB\-\-swissblastdb\fR" 4 .IX Item "--swissblastdb" Path to SwissProt blast database .SS "Cache related options" .IX Subsection "Cache related options" .IP "\fB\-\-acl\fR, \fB\-\-setacl\fR" 4 .IX Item "--acl, --setacl" Set access control lists. Access control lists are set \fIonly\fR in case results are stored in the cache. This option is ineffective otherwise. All previous ACLs are lost \- no merging. The read bit controls browsability of results. Other bits are not used. E.g. .Sp .Vb 1 \& u:lkajan:4,u:gyachdav:4,g:lkajan:4,o::0 .Ve .IP "\fB\-\-cache\-merge\fR" 4 .IX Item "--cache-merge" .PD 0 .IP "\fB\-\-nocache\-merge\fR" 4 .IX Item "--nocache-merge" .PD Merge/do not merge results into cache. \fB\-\-cache\-merge\fR reuses results already in cache; this turns \fB\-\-use\-cache\fR on automatically. \fB\-\-cache\-merge\fR is incompatible with \fB\-\-force\-cache\-store\fR. 
.Sp \&\fB\-\-nocache\-merge\fR is the default \s-1UNLESS\s0 .RS 4 .IP "\(bu" 4 \&\fB\-\-use\-cache\fR is on and .IP "\(bu" 4 \&\fB\-\-noforce\-cache\-store\fR is in effect and .IP "\(bu" 4 \&\fB\-\-target\fR is used and .IP "\(bu" 4 the cache is not empty .RE .RS 4 .Sp \&\fB\-\-cache\-merge\fR is silently ignored in case the cache is empty. .RE .IP "\fB\-\-force\-cache\-store\fR" 4 .IX Item "--force-cache-store" .PD 0 .IP "\fB\-\-noforce\-cache\-store\fR" 4 .IX Item "--noforce-cache-store" .PD Enable/disable forcing storage of results into cache. Implies \fB\-\-use\-cache\fR. Default: \fB\-\-noforce\-cache\-store\fR .Sp With \fB\-\-noforce\-cache\-store\fR when predictprotein finds cached results it simply fetches them from the cache and does no processing (even if the results are incomplete). With \fB\-\-force\-cache\-store\fR predictprotein does not fetch anything from the cache but does store the results, completely replacing what was cached. .Sp \&\fB\-\-force\-cache\-store\fR is incompatible with \fB\-\-cache\-merge\fR. .IP "\fB\-\-use\-cache\fR" 4 .IX Item "--use-cache" .PD 0 .IP "\fB\-\-nouse\-cache\fR" 4 .IX Item "--nouse-cache" .PD Use/do not use cache for predictprotein results. Default: \fB\-\-nouse\-cache\fR. .Sp Option `use_cache' may be given in configuration files to override default. .SH "ERRORS" .IX Header "ERRORS" .IP "\fI253\fR" 4 .IX Item "253" Sequence is too long, see \fB\-\-numresmax\fR .IP "\fI254\fR" 4 .IX Item "254" Sequence is too short, shorter than minimum length required by prof. See \fB\-\-profnumresmin\fR. 
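.PP
A calling script can tell the two documented failures apart by their codes. The following is a minimal sketch, assuming predictprotein reports these codes as its process exit status and that 0 means success. The function name pp_explain_status and the message wording are hypothetical and not part of predictprotein.

```shell
#!/bin/sh
# Map a predictprotein exit status to a short message.
# 253 and 254 are the codes listed under ERRORS; treating 0 as success and
# every other value as a generic failure is an assumption of this sketch.
pp_explain_status() {
    case "$1" in
        0)   echo "ok" ;;
        253) echo "sequence too long (see --numresmax)" ;;
        254) echo "sequence too short for prof (see --profnumresmin)" ;;
        *)   echo "failed with status $1" ;;
    esac
}

# Usage, commented out so the sketch runs without predictprotein installed:
# predictprotein --seqfile query.fasta --output-dir /tmp/pp
# pp_explain_status $?
```

Keeping the mapping in one function lets a batch script log a readable reason for each skipped sequence.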
.SH "EXAMPLES" .IX Header "EXAMPLES" .Vb 1 \& predictprotein \-\-seqfile /usr/share/doc/predictprotein/examples/tquick.fasta \-\-output\-dir /tmp/pp \& \& predictprotein \-\-seqfile /usr/share/doc/predictprotein/examples/tquick.fasta \-\-output\-dir /tmp/pp \-\-target query.profRdb \-\-target loctree3 \& \& predictprotein \-\-seqfile /usr/share/doc/predictprotein/examples/tquick.fasta \-\-method=norsp,win=100 \-\-output\-dir /tmp/pp .Ve .SS "Cache examples" .IX Subsection "Cache examples" .IP "Store results in cache, do not care about storing files in \fB\-\-output\-dir\fR:" 4 .IX Item "Store results in cache, do not care about storing files in --output-dir:" .Vb 1 \& predictprotein \-\-seqfile /usr/share/doc/predictprotein/examples/tquick.fasta \-\-method=norsp,win=100 \-\-use\-cache \-\-setacl g:rostlab:7 .Ve .IP "If not in cache store, otherwise fetch results from cache into \fB\-\-output\-dir\fR:" 4 .IX Item "If not in cache store, otherwise fetch results from cache into --output-dir:" .Vb 1 \& predictprotein \-\-seqfile /usr/share/doc/predictprotein/examples/tquick.fasta \-\-method=norsp,win=100 \-\-use\-cache \-\-setacl g:rostlab:7 \-\-output\-dir /tmp/pp .Ve .SH "ENVIRONMENT" .IX Header "ENVIRONMENT" .IP "\s-1PREDICTPROTEINCONF\s0" 4 .IX Item "PREDICTPROTEINCONF" Location of predictproteinrc configuration file to use, overriding other configuration files .SH "FILES" .IX Header "FILES" .IP "\fI/usr/share/predictprotein/predictproteinrc.default\fR" 4 .IX Item "/usr/share/predictprotein/predictproteinrc.default" Default configuration file. See this file for a description of the parameters. 
.IP "\fI/etc/predictproteinrc\fR" 4 .IX Item "/etc/predictproteinrc" System configuration file overriding values in \fI/usr/share/predictprotein/predictproteinrc.default\fR .IP "\fI~/.predictproteinrc\fR" 4 .IX Item "~/.predictproteinrc" User configuration file overriding values in \fI/etc/predictproteinrc\fR .SH "NOTES" .IX Header "NOTES" .SS "Popularity Contest" .IX Subsection "Popularity Contest" The pp-popularity-contest package included with this image sets up a cron job that will periodically anonymously submit to the Rost Lab developers statistics about the most used Rost Lab packages on this system. .PP This information helps us make decisions such as which packages should receive high priority when fixing bugs. It also helps us decide which packages should receive funding for further development and support. This information is also very important when the Rost Lab applies for funding. .PP Without the funding we receive based on the usage statistics you volunteer, none of the packages on this image could be made available to you at no cost. .PP In case you do not wish to participate in the popularity contest please remove the pp-popularity-contest package. .SH "AUTHOR" .IX Header "AUTHOR" Burkhard Rost, Antoine de Daruvar, Jinfeng Liu, Guy Yachdav, Laszlo Kajan .SH "SEE ALSO" .IX Header "SEE ALSO" \&\fBppc_store\fR\|(1), \fBppc_fetch\fR\|(1), \fBppqsub\fR\|(1)