BEAGLE Utilities

VCF file utilities

gtstats.jar - calculates genotype statistics for each marker in a VCF file with GT field data.
splitvcf.jar - splits a single VCF file into multiple VCF files corresponding to overlapping chromosome intervals.
mergevcf.jar - merges multiple VCF files corresponding to overlapping chromosome intervals into a single VCF file.
consensusvcf.jar - creates a VCF file with a consensus phasing from a set of VCF files with phased GT field data for the same samples and markers.
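
To make "overlapping chromosome intervals" concrete, the following Python sketch splits an ordered run of markers into windows in which consecutive windows share a fixed number of markers. The function name and the window and overlap parameters are hypothetical and chosen only for illustration; they are not the options or the algorithm of splitvcf.jar or mergevcf.jar.

```python
def overlapping_windows(n_markers, window, overlap):
    """Return (start, end) index pairs, end exclusive, that cover range(n_markers).

    Consecutive windows share `overlap` markers. Hypothetical helper for
    illustration only; not taken from splitvcf.jar.
    """
    if window <= 0 or not 0 <= overlap < window:
        raise ValueError("need window > 0 and 0 <= overlap < window")
    step = window - overlap          # distance between consecutive window starts
    windows = []
    start = 0
    while start < n_markers:
        end = min(start + window, n_markers)
        windows.append((start, end))
        if end == n_markers:         # last window reached the final marker
            break
        start += step
    return windows
```

With 10 markers, a window of 4, and an overlap of 2, the sketch yields four windows, each sharing two markers with its neighbor. A merge step would need to keep only one copy of the markers in each shared region.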

File conversion utilities

base2genetic.jar - converts NCBI base positions to genetic map positions.
beagle2gprobs.jar - converts a Beagle v3 genotypes file to a Beagle v3 genotype probabilities file.
beagle2linkage.jar - converts a Beagle v3 genotypes file to a linkage file.
beagle2vcf.jar - converts a Beagle v3 genotypes file to VCF format.
gprobs2beagle.jar - converts a Beagle v3 genotype probabilities file to a Beagle v3 genotypes file.
linkage2beagle.jar - converts a linkage file to a Beagle v3 genotypes file.
vcf2beagle.jar - converts a VCF file with GT field data to a Beagle v3 genotypes file.
vcf2gprobs.jar - converts a VCF file with GP field data to a Beagle v3 genotype probabilities file.
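
As an illustration of what reading "GT field data" involves, the Python sketch below decodes one VCF GT value into allele strings using the REF and ALT columns. It relies only on the VCF convention that index 0 is REF, indices 1 and up are the comma-separated ALT alleles, and "." is a missing allele. The function name and the "?" missing-allele code are assumptions for this example, not the behavior of vcf2beagle.jar.

```python
import re

def gt_to_alleles(ref, alt, gt, missing="?"):
    """Decode a VCF GT string such as "0|1" or "./." into allele strings.

    Hypothetical helper for illustration; `missing` is an assumed code
    for a missing allele.
    """
    # Index 0 is REF; ALT may hold several comma-separated alleles or "."
    alleles = [ref] + ([] if alt == "." else alt.split(","))
    decoded = []
    for index in re.split(r"[|/]", gt):   # "|" is phased, "/" is unphased
        decoded.append(missing if index == "." else alleles[int(index)])
    return decoded
```

For example, with REF "A" and ALT "G", the GT value "0|1" decodes to the alleles "A" and "G". The sketch expects the GT subfield alone, so any other colon-separated sample subfields must be stripped first.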

Data QC utilities

gprobshwe.jar - calculates exact Hardy-Weinberg equilibrium p-values for each marker in a Beagle v3 genotype probabilities file.
gprobsmetrics.jar - calculates per-marker statistics from a Beagle v3 genotype probabilities file.
gprobsmissing.jar - calculates the missing genotype proportion for each marker in a Beagle v3 genotype probabilities file.
gprobssamplemissing.jar - calculates the missing genotype proportion for each sample in a Beagle v3 genotype probabilities file.
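
For readers unfamiliar with exact Hardy-Weinberg tests, the Python sketch below computes a standard exact p-value for one diallelic marker from hard genotype counts. It conditions on the observed allele counts and sums the probabilities of all heterozygote counts that are no more likely than the observed one. This is a minimal sketch under stated assumptions: the function name is hypothetical, it takes genotype counts rather than a genotype probabilities file, and it is not necessarily the method used by gprobshwe.jar.

```python
from math import exp, lgamma, log

def hwe_exact_pvalue(n_aa, n_ab, n_bb):
    """Exact HWE p-value from counts of AA, AB and BB genotypes.

    Hypothetical helper for illustration only.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        return 1.0
    n_a = 2 * n_aa + n_ab            # copies of allele A
    n_b = 2 * n_bb + n_ab            # copies of allele B

    def log_prob(het):
        # log P(het heterozygotes | n samples, n_a copies of allele A)
        hom_a = (n_a - het) // 2
        hom_b = (n_b - het) // 2
        return (het * log(2.0)
                + lgamma(n + 1) - lgamma(hom_a + 1)
                - lgamma(het + 1) - lgamma(hom_b + 1)
                + lgamma(n_a + 1) + lgamma(n_b + 1) - lgamma(2 * n + 1))

    # The heterozygote count must have the same parity as the allele counts.
    probs = {het: exp(log_prob(het))
             for het in range(n_a % 2, min(n_a, n_b) + 1, 2)}
    observed = probs[n_ab]
    # Small relative tolerance so ties with the observed probability count.
    total = sum(q for q in probs.values() if q <= observed * (1 + 1e-7))
    return min(1.0, total)
```

With one AA and one BB sample, the only possible heterozygote counts are 0 and 2, with probabilities 1/3 and 2/3, so the p-value for the observed count of 0 is 1/3.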

File manipulation utilities

changecolumn.jar - replaces values in a column of a file.
changeline.jar - replaces values in a line of a file.
cut.jar - extracts columns from a file.
filtercolumns.jar - filters columns of input data according to the values in a line.
filterlines.jar - filters lines of input data according to the values in a column.
paste.jar - pastes together files that have shared initial columns followed by data columns.
transpose.jar - transposes rows and columns of a file.
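
As a concrete picture of a row and column transpose, the Python sketch below transposes whitespace-delimited text lines. The function name, the whitespace delimiter, and the single-space output separator are assumptions for this example; it is not the source code of transpose.jar.

```python
def transpose(lines):
    """Transpose whitespace-delimited lines; returns a list of output lines.

    Hypothetical helper for illustration only. Blank lines are skipped and
    every remaining line must have the same number of fields.
    """
    rows = [line.split() for line in lines if line.strip()]
    if not rows:
        return []
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all lines must have the same number of fields")
    # zip(*rows) groups the i-th field of every row into the i-th column
    return [" ".join(column) for column in zip(*rows)]
```

Given the two lines "a b c" and "1 2 3", the sketch returns the three lines "a 1", "b 2", and "c 3". It holds the whole input in memory, which may not suit very large files.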

Genotype imputation utilities

updategprobs.jar - updates a Beagle genotype probabilities file with data from another Beagle genotype probabilities file.

IBD utilities (Beagle v4 IBD files)

ibdmerge.jar - merges Beagle v4 IBD files.

Introduction

This page provides simple utility programs for manipulating text files. If you are performing analyses with BEAGLECALL or Beagle, you may find some of these programs useful for preparing input files and for working with output files. The Beagle utilities are written in Java and run on all common computing platforms (e.g., Windows, Unix, Linux, Solaris, Mac).

All the utility programs on this web page are licensed under the Apache version 2.0 open source license. You may obtain a copy of the License from http://www.apache.org/licenses/LICENSE-2.0

Contents

Introduction

VCF file utilities

File conversion utilities

Data QC utilities

File manipulation utilities

Genotype imputation utilities

IBD utilities (Beagle v4 IBD files)

Java Source Code

gtstats.jar

Description:

Usage:

Notes:

splitvcf.jar

Description:

Usage:

Notes:

mergevcf.jar

Description:

Usage:

Notes:

consensusvcf.jar

Description:

Usage:

Notes:

base2genetic.jar

Description:

Usage:

Notes:

beagle2gprobs.jar

Description:

Usage:

Notes:

beagle2linkage.jar

Description:

Usage:

Notes:

beagle2vcf.jar

Description:

Usage:

Notes:

gprobs2beagle.jar

Description:

Usage:

Notes:

linkage2beagle.jar

Description:

Usage:

Notes:

vcf2beagle.jar

Description:

Usage:

Notes:

vcf2gprobs.jar

Description:

Usage:

Notes:

gprobshwe.jar

Description:

Usage:

Notes:

gprobsmetrics.jar

Description:

Usage:

Notes:

gprobsmissing.jar

Description:

Usage:

Notes:

gprobssamplemissing.jar

Description:

Usage:

Notes:

changecolumn.jar

Description:

Usage:

Notes:

changeline.jar