ukbtools: Manipulate and Explore UK Biobank Data |
|
UKB DataframeFunctions to wrangle the UKB data into a dataframe with meaningful column names. |
|
---|---|
Reads a UK Biobank phenotype fileset and returns a single dataset. |
|
Makes a UKB data-field to variable name table for reference or lookup. |
|
Recursively join a list of UKB datasets |
|
Checks for duplicated names within a UKB dataset |
|
Inserts UKB centre names into data |
|
Demographics of a UKB sample subset |
|
Genetic MetadataFunctions to query the associated genetic sample QC information |
|
Reads a PLINK format fam file |
|
Reads an Oxford format sample file |
|
Relatedness count |
|
Subset of the UKB relatedness dataframe with data |
|
Related samples (with data on the variable of interest) to remove |
|
Sample QC column names |
|
Writes a BGENIE format phenotype or covariate file. |
|
Writes a PLINK format phenotype or covariate file |
|
Disease DiagnosesFunctions to query the UKB hospital episodes statistics. |
|
Retrieves description for a ICD code. |
|
Retrieves diagnoses for an individual. |
|
Frequency of an ICD diagnosis by a target variable |
|
Retrieves diagnoses containing a description. |
|
Returns the prevalence for an ICD diagnosis |
|
Datasets |
|
UKB assessment centre |
|
International Classification of Diseases Revision 10 (ICD-10) chapters |
|
International Classification of Diseases Revision 10 (ICD-10) codes |
|
International Classification of Diseases Revision 9 (ICD-9) chapters |
|
International Classification of Diseases Revision 9 (ICD-9) codes |
|
DefunctThe genetic metadata functions were written to retrieve genetic metadata from the phenotype file for the interim genotype release. The fields retrieved became obselete when the full genotyping results were released at the end of 2017. With the release of the full sample (500K individuals) genotypes, sample QC (ukb_sqc_v2.txt) and relatedness (ukbA_rel_sP.txt) data are now supplied as separate files. The contents of these files, along with all other genetic files are described in UKB Resource 531. |
|
Sample exclusions |
|
Inserts NA into phenotype for genetic metadata exclusions |
|
Heterozygosity outliers |
|
Genetic metadata |
|
Genetic principal components |
|
Creates a table of related individuals |
|
Writes a PLINK format file for combined exclusions |