Scroll to navigation

RSPAMD_STATS(8) RSPAMD_STATS(8)

NAME

rspamd_stats - analyze Rspamd rules by parsing log files

SYNOPSIS

rspamd_stats [options] [--symbol=SYM1 [--symbol=SYM2...]] [--log file]

DESCRIPTION

rspamd_stats will read the given log file (or standard input) and provide statistics for the specified symbols:

Symbol: BAYES_SPAM (weight 3.763) (381985 hits, 26.827%)
Ham hits: 184557 (48.315%), total ham: 1095487 (ham with BAYES_SPAM: 16.847%)
Spam hits: 15134 (3.962%), total spam: 16688 (spam with BAYES_SPAM: 90.688%)
Junk hits: 182294 (47.723%), total junk: 311699 (junk with BAYES_SPAM: 58.484%)
Spam changes (ham/junk -> spam): 7026 (1.839%), total percentage (changes / spam hits): 42.102%
Junk changes (ham -> junk): 95192 (24.920%), total percentage (changes / junk hits): 30.540%

    

Where there are the following attributes:

Weight: average score for a symbols
Total hits: total number of hits and percentage of symbol hits divided by total number of messages
HAM hits: provides the following information about HAM messages with the specified symbol (from left to right):
1.
total symbol hits: number of messages that has this symbol and are HAM
2.
ham percentage: number of symbol hits divided by overall HAM messages count
3.
total ham hits: overall number of HAM messages
4.
ham with symbol percentage: percentage of number of hits with specified symbol in HAM messages divided by total number of HAM messages.
SPAM hits: provides the following information about SPAM messages - same as previous but for SPAM class.
Junk hits: provides the following information about Junk messages - same as previous but for JUNK class.
Spam changes: displays data about how much messages switched their class because of the specific symbol weight.
Junk changes: displays data about how much messages switched their class because of the specific symbol weight.

OPTIONS

--log
Specifies log file or directory to read data from. If a directory is specified rspamd_stats analyses files in the directory including known compressed file types. Number of log files can be limited using --num-logs and --exclude-logs options. This assumes that files in the log directory have newsyslog(8)- or logrotate(8)-like name format with numeric indexes. Files without indexes (generally it is merely one file) are considered the most recent and files with lower indexes are considered newer.
--reject-score
Specifies the reject (spam) threshold.
--junk-score
Specifies the junk (add header or rewrite subject) threshold.
--alpha-score
Specifies the minimum score for a symbol to be considered by this script.
--symbol
Add symbol or pattern (pcre format) to analyze.
--num-logs
If set, limits number of analyzed logfiles in the directory to the specified value.
--exclude-logs
Number of latest logs to exclude (0 by default).
--correlations
Additionally print correlation rate for each symbol displayed. This routine calculates merely paired correlations between symbols.
--search-pattern
Do not process input unless finding the specified regular expression. Useful to skip logs to a certain position.
--exclude
Exclude log lines if certain symbols are fired (e.g. GTUBE). You may specify this option multiple time to skip multiple symbols.
--start
Select log entries after this time. Format: "YYYY-MM-DD HH:MM:SS" (can be truncated to any desired accuracy). If used with --end select entries between --start and --end. The omitted date defaults to the current date if you supply the time.
--end
Select log entries before this time. Format: "YYYY-MM-DD HH:MM:SS" (can be truncated to any desired accuracy). If used with --start select entries between --start and --end. The omitted date defaults to the current date if you supply the time.
--help
Print a brief help message and exits.
--man
Prints the manual page and exits.

AUTHORS

Vsevolod Stakhov.
March 5, 2018