Read a GMT file into a data frame
Arguments
- file
A connection object or a character string (can be a URL).
- geneset_label
Column name for gene sets (first column of the GMT file) in the output data frame.
- gene_label
Column name for genes (variable columns of the GMT file) in the output data frame.
Value
A data frame with gene sets as the first column and genes as the second column (one gene per row).
Examples
gmt <- "http://software.broadinstitute.org/gsea/msigdb/supplemental/scsig.all.v1.0.symbols.gmt"
gmt_tbl <- read_gmt(gmt)
head(gmt_tbl)
#> # A tibble: 6 × 2
#> celltype gene
#> <chr> <chr>
#> 1 Zheng_Cord_Blood_C1_Putative_Megakaryocyte_Progenitor ABCC3
#> 2 Zheng_Cord_Blood_C1_Putative_Megakaryocyte_Progenitor ABCC4
#> 3 Zheng_Cord_Blood_C1_Putative_Megakaryocyte_Progenitor ACTN1
#> 4 Zheng_Cord_Blood_C1_Putative_Megakaryocyte_Progenitor ARHGAP18
#> 5 Zheng_Cord_Blood_C1_Putative_Megakaryocyte_Progenitor ARHGAP6
#> 6 Zheng_Cord_Blood_C1_Putative_Megakaryocyte_Progenitor BANK1