Splicing QTLs for NYGC ALS Consortium Paper
Description
The files below contain nominal and permuted quantitative trait loci (QTL) associations between common genetic variants derived from whole genome sequencing and Leafcutter splice junction phenotypes generated from RNA-seq of post-mortem tissue sections. All QTLs were mapped with TensorQTL.
Top association files are gzip-compressed tab-separated value (TSV) files named *cis_qtl.txt.gz.
Nominal association files are stored as Parquet files to save space. These can be converted to text files using the following code snippet:
pip install pandas pyarrow
conda install -c bioconda htslib # provides bgzip
python3 -c "
import sys
import pandas as pd
df = pd.read_parquet('your_file.parquet')
# write tab-separated text to stdout so that bgzip receives it through the pipe
df.to_csv(sys.stdout, sep='\t', index=False)
" | bgzip > your_file.tsv.gz
NYGC_all_common_variants_alleles.tsv.gz - Allele information for all SNPs tested in the sQTL analysis
NYGC_gene_junction_annotation_GENCODE_v46.txt.gz - Gene annotation for each junction (GENCODE v46 Ensembl IDs and gene names). TSV format. Junctions mapping to multiple genes have all genes assigned, split by comma.
Nominal QTL results include all SNP-gene pairs tested using a 1 Mb window on each side of the transcription start site (TSS) of the gene. Table columns are formatted as follows:
- phenotype_id - splice junction coordinates in hg38
- variant_id - SNP tested for association (rsid or chr:position:ref:alt)
- tss_distance - distance of the SNP to the gene transcription start site (TSS)
- maf - minor allele frequency in cohort
- ma_samples - number of samples carrying the minor allele
- ma_count - total number of minor alleles across individuals
- pval_nominal - nominal P-value from linear regression
- slope - slope of the linear regression
- slope_se - standard error of the slope
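The slope and slope_se columns above are enough to derive a z-score and an approximate confidence interval for each association. The sketch below assumes a normal approximation (the 1.96 multiplier) and uses a hypothetical helper name, `add_effect_summaries`. It illustrates how the columns relate to each other and is not a step from the original analysis.

```python
import pandas as pd


def add_effect_summaries(nominal: pd.DataFrame) -> pd.DataFrame:
    """Add a z-score and an approximate 95% confidence interval per association."""
    out = nominal.copy()
    out["zscore"] = out["slope"] / out["slope_se"]
    # Normal approximation; reasonable when the sample size is large.
    out["ci95_lower"] = out["slope"] - 1.96 * out["slope_se"]
    out["ci95_upper"] = out["slope"] + 1.96 * out["slope_se"]
    return out
```

The sign of slope is relative to the ALT allele, so a positive z-score indicates higher junction usage with each additional copy of ALT.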
Top association results include only the top SNP-gene association for each gene. Table columns are formatted as follows:
- phenotype_id - Ensembl ID of the gene tested (GENCODE v30)
- num_var - total number of variants tested in cis
- beta_shape1 - first parameter value of the fitted beta distribution
- beta_shape2 - second parameter value of the fitted beta distribution
- true_df - effective degrees of freedom of the beta distribution approximation
- pval_true_df - empirical P-value for the beta distribution approximation
- variant_id - ID of the top variant (rsid or chr:position:ref:alt)
- tss_distance - distance of the SNP to the gene transcription start site (TSS)
- ma_samples - number of samples carrying the minor allele
- ma_count - total number of minor alleles across individuals
- maf - minor allele frequency in MiGA cohort
- ref_factor - flag indicating if the alternative allele is the minor allele in the cohort (1 if AF <= 0.5, -1 if not)
- pval_nominal - nominal P-value from linear regression
- slope - slope of the linear regression
- slope_se - standard error of the slope
- pval_perm - first permutation P-value, obtained directly from the permutations (direct method)
- pval_beta - second permutation P-value, obtained via the beta approximation. This is the one to use for downstream analysis.
- qval - Storey q-value derived from pval_beta (FDR adjusted)
- pval_nominal_threshold - nominal P-value threshold for calling a variant-gene pair significant for the gene
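One common way to use the qval and pval_nominal_threshold columns together is to keep phenotypes that pass an FDR cutoff in the top association file, then keep every nominal pair for those phenotypes whose pval_nominal falls at or below the phenotype-specific threshold. The sketch below makes two assumptions that should be checked against the actual files: a 5% FDR cutoff, and that phenotype_id values match between the nominal and top association tables. The function name `significant_pairs` is hypothetical.

```python
import pandas as pd


def significant_pairs(nominal: pd.DataFrame, top: pd.DataFrame, fdr: float = 0.05) -> pd.DataFrame:
    """Return nominal associations that pass the per-phenotype nominal threshold."""
    # Phenotypes with a significant top association at the chosen FDR.
    sig_top = top.loc[top["qval"] <= fdr, ["phenotype_id", "pval_nominal_threshold"]]
    # Attach each significant phenotype's threshold to its nominal pairs.
    merged = nominal.merge(sig_top, on="phenotype_id", how="inner")
    keep = merged["pval_nominal"] <= merged["pval_nominal_threshold"]
    return merged.loc[keep].reset_index(drop=True)
```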
Allele Information for each variant:
- CHROM - chromosome of the variant
- POS - position of the variant in the chromosome
- REF - reference allele (GRCh38)
- ALT - alternative allele (this is the effect allele in the sQTL analysis)
- ID - variant id (rsid or chr:position:ref:alt)
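Because the QTL tables carry only variant_id, the allele table is needed to recover which allele each slope refers to. A minimal sketch of that join is shown below, assuming the ID column of the allele table holds the same identifiers as variant_id and that each ID appears once. The output column names effect_allele and other_allele and the function name `annotate_alleles` are illustrative.

```python
import pandas as pd


def annotate_alleles(qtl: pd.DataFrame, alleles: pd.DataFrame) -> pd.DataFrame:
    """Attach chromosome, position, and effect/other alleles to QTL results."""
    allele_cols = alleles.rename(
        columns={"ID": "variant_id", "ALT": "effect_allele", "REF": "other_allele"}
    )[["variant_id", "CHROM", "POS", "other_allele", "effect_allele"]]
    # validate raises an error if a variant ID is duplicated in the allele table.
    return qtl.merge(allele_cols, on="variant_id", how="left", validate="many_to_one")
```

A left join keeps every QTL row, so variants missing from the allele table show up with empty allele columns rather than being dropped silently.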
Files
(46.5 GB)
| MD5 checksum | Size |
|---|---|
| 18ee71d3d684bef9651a44901d9d63b9 | 82.1 MB |
| d7c42d10edfccc9ec78403c214bf328e | 1.6 MB |
| ef062b2985b2bb9a237b0c5e75baae34 | 4.3 GB |
| 74e3b352a2906613058bcac3fc398b47 | 1.8 MB |
| 1d57ee5ed908d63377a0bdba0dfbfba3 | 4.8 GB |
| 8c6dfa06628fa9a56ad042f396966f0d | 1.8 MB |
| eb91e7398d1f37790dc59e42bbda2efa | 5.0 GB |
| 2499a44fb928aaf56eb64fbba212fe5c | 2.0 MB |
| 363942483f50d1aa7918bb4ccbbd7e79 | 1.8 MB |
| c40f33b7a474e36141c02c7086c46be8 | 3.7 GB |
| 5c9ef830eac48e0f6cb8a8c4bedeff04 | 1.8 MB |
| 41d3aa28777ccfe5d6a175386ac781d3 | 4.7 GB |
| 8e86fc5381cd4052a1e822166d0b2324 | 1.8 MB |
| fbcbb8cac0053b180daf19d29de45414 | 4.2 GB |
| ff5b5fdf63c215d3f51e67aa417413f2 | 1.8 MB |
| 00d5c84e9465382469a899915e1310c4 | 4.2 GB |
| 27b055d0bb2291bf6ccbb73e2125919c | 1.8 MB |
| e7862b0d8bb6a24be359d7013be48273 | 4.2 GB |
| 2cd368f582a031c6bc2711b3b672219f | 1.8 MB |
| 12e7a71d2f974a28c27930c090400fd1 | 3.7 GB |
| 285e509eb0519123efc37b506b6b970d | 1.8 MB |
| 79321f7c05e80463b189e4dca1046a16 | 4.0 GB |
| 18260adcbc8427b5524185a7786446ac | 1.7 MB |
| 5a620f385c56d75e47d90c5007b47f85 | 3.6 GB |
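Since several files are multiple gigabytes, it is worth checking each download against its listed MD5 checksum. The sketch below uses only the Python standard library and reads the file in chunks so that large files do not need to fit in memory. The helper names `md5sum` and `verify_md5` are illustrative, and the pairing of local file names with checksums is left to the user.

```python
import hashlib


def md5sum(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Return True if the file matches the expected checksum.

    Accepts either a bare hex digest or the 'md5:<digest>' form.
    """
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5sum(path) == expected_hex
```

For example, `verify_md5('your_file.parquet', 'md5:<checksum from the listing>')` returns False for a truncated or corrupted download.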