Preparing to use PRSice
To install PRSice v.2.3.5 (Linux 64 bit)
{% code overflow="wrap" %}
wget https://github.com/choishingwan/PRSice/releases/download/2.3.5/PRSice_linux.zip
unzip PRSice_linux.zip
{% endcode %}
To get the required scoring files
The following scoring files were taken from PGS Catalog on the basis of the cohort distribution, year of compilation, and number of variants for the required phenotype. Each of these scoring files had 103 variants for colorectal cancer for white cohorts.
These scoring files were loaded using the following command on the Linux terminal:
{% code overflow="wrap" %}
wget https://ftp.ebi.ac.uk/pub/databases/spot/pgs/scores/PGS000074/ScoringFiles/PGS000074.txt.gz
gunzip PGS000074.txt.gz
{% endcode %}
{% code overflow="wrap" %}
wget https://ftp.ebi.ac.uk/pub/databases/spot/pgs/scores/PGS000785/ScoringFiles/PGS000785.txt.gz
gunzip PGS000785.txt.gz
{% endcode %}
Quality control of the scoring files
To remove metadata (in Linux terminal)
sed '1,14d' PGS000074.txt > PGS000074.txt
sed '1,14d' PGS000785.txt > PGS000785.txt
Since scoring files from the PGS Catalog do not include p-values, it was set to a default value of 0 (in R)
df1 <- read.table("PGS000074.txt", header=TRUE, sep="\t")
df1$P <- '0'
write.table(df1, "PGS000074.txt", quote=FALSE, row.names=FALSE)
df2 <- read.table("PGS000785.txt", header=TRUE, sep="\t")
df2$P <- '0'
write.table(df2, "PGS000785.txt", quote=FALSE, row.names=FALSE)