Program: BEAGLE
Version: 3.3.1
Copyright (c) 2007-2010 Brian L. Browning
Email: browning@uw.edu
This page was last updated on 26 Dec 2010
BEAGLE is a state of the art software package for analysis of large-scale genetic data sets with hundreds of thousands of markers genotyped on thousands of samples. BEAGLE can
BEAGLE is written in Java and runs on any computing platform with a Java version 1.6 interpreter (e.g. Windows, Unix, Linux, Solaris, Mac).
back to contents
BEAGLE 3.1 extends BEAGLE to accept genotype likelihoods (rather than called genotypes) for unrelated individuals. Genotype likelihoods enable genotype uncertainty to be modelled during phasing and imputation. The methods genotype likelihood methods in Beagle 3.1 have also been incorporated in the BEAGLECALL software package for calling SNP genotypes from microarray genotype data.
BEAGLE 3.2 extends BEAGLE to perform homozygosity-by-descent (HBD) detection and identity-by-descent (IBD) detection for unrelated individuals.
BEAGLE 3.3 adds a computationally efficient identity-by-descent detection method called fastIBD. The fastIBD method can analyze large data sets with thousands of samples.
If you use BEAGLE in a published analysis, please report the BEAGLE version used and cite the appropriate publication or publications listed below.
S R Browning and B L Browning (2010) High-resolution detection of identity by descent in unrelated individuals. The American Journal of Human Genetics 86:526-539. [link to article]
B L Browning and Z Yu (2009) Simultaneous genotype calling and haplotype phase inference improves genotype accuracy and reduces false positive associations for genome-wide association studies. The American Journal of Human Genetics 85:847-861. [link to article]
B L Browning and S R Browning (2009) A unified approach to genotype imputation and haplotype phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 84:210-223. [link to article]
S R Browning and B L Browning (2007) Rapid and accurate haplotype phasing and missing data inference for whole genome association studies using localized haplotype clustering. Am J Hum Genet 81:1084-1097. [link to article]
B L Browning and S R Browning (2007) Efficient multilocus association mapping for whole genome association studies using localized haplotype clustering. Genet Epidemiol 31:365-375. [link to article]
S R Browning (2006) Multilocus association mapping using variable-length Markov chains. Am J Hum Genet 78:903-13. [link to article]
A BEAGLE analysis of a large genome-wide association study is described in the following paper:
B L Browning and S R Browning (2008) Haplotypic analysis of Wellcome Trust Case Control Consortium data. Human Genetics 123:273-280. [link to article]
The preceding reference describes strategies for eliminating false-positive associations due to genotyping artefacts. However, in our experience, a much better way to eliminate false-positive associations due to genotyping artefacts is to simultaneously call and phase genotypes using BEAGLECALL prior to performing a haplotypic analysis using BEAGLE.
You may download and use the BEAGLE executable file for non-commercial and commercial research. BEAGLE is distributed on an "AS IS" basis. There is NO WARRANTY for the program, to the extent permitted by applicable law. In no event unless required by applicable law will any BEAGLE copyright holder be liable to you for damages, including any general, special, incidental or consequential damages arising out of the use or inability to use the program (including but not limited to loss of data or data being rendered inaccurate or losses sustained by you or third parties or a failure of the program to operate with any other programs).
The following files are available for download:
Phased genotype data in BEAGLE format from individuals sequenced by the 1000 Genomes Project (2010.08.04 sequence index) are available for download:
The BEAGLE-format files were created from corresponding MACH-format files. Information regarding the samples in each file and the marker filters applied is available from the MACH web site. The phasing of the data in these files was performed by the Broad Institute using BEAGLE. Some individuals of admixed ancestry are included in more than one file.
The BEAGLE Utilities web site has a number of utility programs that can be used to prepare input files and process output files.
When using multiple Beagle input files, the alleles for a marker must be on the same chromosome strand in each input file. Sharon Browning has written a python script that checks that alleles are defined with respect to the same chromosome strand and switches alleles if necessary to make the chromosome strands consistent. Click here to download Sharon's strand-checking utility.
Sharon Browning has written some utilities for processing output files from BEAGLE HBD/IBD analysis. Click here to download Sharon's HBD/IBD utilities.
When dealing with large sample sizes, it might be necessary to divide a sample so that imputation can be performed in each subsample separately. Click here to download a simple shell script for dividing your sample.
Two specialized programs, pseudomarker.jar and cluster2haps.jar, are described in the BEAGLE documentation, and may be useful to some users: