BEAGLE Genetic Analysis Software Package

Program: BEAGLE
Version: 3.3.2
Copyright (c) 2007-2013 Brian L. Browning
Email: browning@uw.edu
This page was last updated on 11 Oct 2013

Contents


Introduction

BEAGLE is a state of the art software package for analysis of large-scale genetic data sets with hundreds of thousands of markers genotyped on thousands of samples. BEAGLE can

Beagle can also be used in conjunction with PRESTO, a program for fast and flexible permutation testing. PRESTO can compute empirical distributions of order statistics, analyze stratified data, and determine significance levels for one-stage and two-stage genetic association studies.

BEAGLE is written in Java and runs on any computing platform with a Java version 1.6 interpreter (e.g. Windows, Unix, Linux, Solaris, Mac).

back to contents

Beagle version 4

You can download the most recent stable snapshot of version 4 here. Although, Beagle version 4 is still being developed, in general we recommend using version 4 in preference to version 3 because version 4 has improved algorithms for genotype calling, genotype phasing, imputation of ungenotyped markers, and identity-by-descent segment detection. Version 4 does not perform association testing, but version 3 is available for this purpose.

back to contents


Citation

If you use BEAGLE in a published analysis, please report the BEAGLE version used and cite the appropriate publication or publications listed below.

  1. BEAGLE's fastIBD method is described in
    B L Browning and S R Browning (2011)  A fast, powerful method for detecting identity by descent.  The American Journal of Human Genetics 88:173-182.  [link to article]
  2. BEAGLE's methods for detecting homozygosity-by-descent and identity-by-descent are described in
    S R Browning and B L Browning (2010)  High-resolution detection of identity by descent in unrelated individuals.   The American Journal of Human Genetics 86:526-539.  [link to article]
  3. BEAGLE's methods for calling genotypes from genotype likelihood data are described in
    B L Browning and Z Yu (2009)  Simultaneous genotype calling and haplotype phase inference improves genotype accuracy and reduces false positive associations for genome-wide association studies.   The American Journal of Human Genetics 85:847-861.  [link to article]
  4. BEAGLE's methods for imputing ungenotyped markers and phasing parent-offspring trios are described in
    B L Browning and S R Browning (2009) A unified approach to genotype imputation and haplotype phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 84:210-223. [link to article]
  5. BEAGLE's methods for inferring haplotype phase or sporadic missing data in unrelated individuals are described in
    S R Browning and B L Browning (2007) Rapid and accurate haplotype phasing and missing data inference for whole genome association studies using localized haplotype clustering. Am J Hum Genet 81:1084-1097. [link to article]
  6. BEAGLE's methods for association testing are described in
    B L Browning and S R Browning (2007) Efficient multilocus association mapping for whole genome association studies using localized haplotype clustering. Genet Epidemiol 31:365-375. [link to article]
  7. BEAGLE's haplotype frequency model was first described in:

    S R Browning (2006) Multilocus association mapping using variable-length Markov chains. Am J Hum Genet 78:903-13. [link to article]

back to contents


Download BEAGLE

You may download and use the BEAGLE executable file for non-commercial and commercial research.   BEAGLE is distributed on an "AS IS" basis.  There is NO WARRANTY for the program, to the extent permitted by applicable law.   In no event unless required by applicable law will any BEAGLE copyright holder be liable to you for damages, including any general, special, incidental or consequential damages arising out of the use or inability to use the program (including but not limited to loss of data or data being rendered inaccurate or losses sustained by you or third parties or a failure of the program to operate with any other programs).

The following files are available for download:

  1. beagle_3.3.2_31Oct11.pdf - The documentation for BEAGLE.
  2. beagle.jar - The BEAGLE executable file.
  3. beagle_example.zip - A ZIP-compressed directory with example input and output files from a BEAGLE analysis.  The analysis of the example files is described in the BEAGLE documentation.

back to contents


Download Phased Imputation Reference Panels

Phased genotype data in BEAGLE format from phase 1 of the 1000 Genomes Project are available for download from

http://bochet.gcc.biostat.washington.edu/beagle/1000_Genomes.phase1_release_v3/

back to contents


Utility Programs

The BEAGLE Utilities web site has a number of utility programs that can be used to prepare input files and process output files.

When using multiple Beagle input files, the alleles for a marker must be on the same chromosome strand in each input file. Sharon Browning has written a python script that checks that alleles are defined with respect to the same chromosome strand and switches alleles if necessary to make the chromosome strands consistent. Click here to download Sharon's strand-checking utility.

Sharon Browning has written some utilities for processing output files from BEAGLE HBD/IBD analysis. Click here to download Sharon's HBD/IBD utilities.

When dealing with large sample sizes, it might be necessary to divide a sample so that imputation can be performed in each subsample separately. Click here to download a simple shell script for dividing your sample.

Two specialized programs, pseudomarker.jar and cluster2haps.jar are described in the BEAGLE documentation, and may be useful to some users:

back to contents


Version History

back to contents


Old Versions

back to contents