Scroll to navigation

BOLT(1) General Commands Manual BOLT(1)

NAME

bolt - Efficient large cohorts genome-wide Bayesian mixed-model association testing

SYNOPSIS

bolt [options]

DESCRIPTION

The BOLT-LMM software package currently consists of two main algorithms, the BOLT-LMM algorithm for mixed model association testing, and the BOLT-REML algorithm for variance components analysis (i.e., partitioning of SNP-heritability and estimation of genetic correlations).

The BOLT-LMM algorithm computes statistics for testing association between phenotype and genotypes using a linear mixed model. By default, BOLT-LMM assumes a Bayesian mixture-of-normals prior for the random effect attributed to SNPs other than the one being tested. This model generalizes the standard infinitesimal mixed model used by previous mixed model association methods, providing an opportunity for increased power to detect associations while controlling false positives. Additionally, BOLT-LMM applies algorithmic advances to compute mixed model association statistics much faster than eigendecomposition-based methods, both when using the Bayesian mixture model and when specialized to standard mixed model association.

The BOLT-REML algorithm estimates heritability explained by genotyped SNPs and genetic correlations among multiple traits measured on the same set of individuals. BOLT-REML applies variance components analysis to perform these tasks, supporting both multi-component modeling to partition SNP-heritability and multi-trait modeling to estimate correlations. BOLT-REML applies a Monte Carlo algorithm that is much faster than eigendecomposition-based methods for variance components analysis at large sample sizes.

OPTIONS

-h [ --help ] print help message with typical options

print help message with full option list
prefix of PLINK .fam, .bim, .bed files
prefix of PLINK .fam.gz, .bim.gz, .bed.gz files
PLINK .fam file (note: file names ending in .gz are auto-[de]compressed)
PLINK .bim file(s); for >1, use multiple --bim and/or {i:j}, e.g., data.chr{1:22}.bim
PLINK .bed file(s); for >1, use multiple --bim and/or {i:j} expansion
Oxford-format file for interpolating genetic distances: tables/genetic_map_hg##.txt.gz
file(s) listing individuals to ignore (no header; FID IID must be first two columns)
file(s) listing SNPs to ignore (no header; SNP ID must be first column)
QC filter: max missing rate per SNP

--maxMissingPerIndiv arg (=0.1) QC filter: max missing rate per person

phenotype file (header required; FID IID must be first two columns)
phenotype column header
use last (6th) column of .fam file as phenotype
covariate file (header required; FID IID must be first two columns)
categorical covariate column(s); for >1, use multiple --covarCol and/or {i:j} expansion
quantitative covariate column(s); for >1, use multiple --qCovarCol and/or {i:j} expansion
include samples with missing covariates in analysis via missing indicator method (default: ignore such samples)
run variance components analysis to precisely estimate heritability (but not compute assoc stats)
compute assoc stats under the inf model and with Bayesian non-inf prior (VB approx), if power gain expected
compute mixed model assoc stats under the infinitesimal model
compute non-inf assoc stats even if BOLT-LMM expects no power gain
file(s) listing SNPs to use in model (i.e., GRM) (default: use all non-excluded SNPs)
LD Scores for calibration of Bayesian assoc stats: tables/LDSCORE.1000G_EUR.tab.gz
number of computational threads
output file for assoc stats at PLINK genotypes
file(s) containing imputed SNP dosages to test for association (see manual for format)
file listing FIDs and IIDs of samples in dosageFile(s), one line per sample
output file for assoc stats at dosage format genotypes
list of [chr file] pairs containing IMPUTE2 SNP probabilities to test for association
file listing FIDs and IIDs of samples in IMPUTE2 files, one line per sample
MAF threshold on IMPUTE2 genotypes; lower-MAF SNPs will be ignored
file(s) containing Oxford BGEN-format genotypes to test for association
file containing Oxford sample file corresponding to BGEN file(s)
list of [bgen sample] file pairs containing BGEN imputed variants to test for association
MAF threshold on Oxford BGEN-format genotypes; lower-MAF SNPs will be ignored
INFO threshold on Oxford BGEN-format genotypes; lower-INFO SNPs will be ignored
output file for assoc stats at BGEN-format genotypes
output file for assoc stats at IMPUTE2 format genotypes
list of [map dosage] file pairs with 2-dosage SNP probabilities (Ricopili/plink2 --dosage format=2) to test for association
output file for assoc stats at 2-dosage format genotypes

SEE ALSO

https://data.broadinstitute.org/alkesgroup/BOLT-LMM/

COPYRIGHT

Copyright © 2014-2018 Harvard University. Distributed under the GNU GPLv3+ open source license.

May 2018 2.3.2