Compiled date: 2024-10-29
Last edited: 2022-01-12
License: GPL-3
Run the following code to install the Bioconductor version of the package.
# install.packages("BiocManager")
BiocManager::install("fobitools")
fobitools
library(fobitools)
You can also load some additional packages that will be very useful in this vignette.
library(dplyr)
library(kableExtra)
metaboliteUniverse
and metaboliteList
In microarrays, for example, we can study almost all the genes of an organism in our sample, so it makes sense to perform an over representation analysis (ORA) considering all the genes present in Gene Ontology (GO). Since most of the GO pathways would be represented by some gene in the microarray.
This is different in nutrimetabolomics. Targeted nutrimetabolomics studies sets of about 200-500 diet-related metabolites, so it would not make sense to use all known metabolites (for example in HMDB or CHEBI) in an ORA, as most of them would not have been quantified in the study.
In nutrimetabolomic studies it may be interesting to study enriched or over represented foods/food groups by the metabolites resulting from the study statistical analysis, rather than the enriched metabolic pathways, as would make more sense in genomics or other metabolomics studies.
The Food-Biomarker Ontology (FOBI) provides a biological knowledge for conducting these enrichment analyses in nutrimetabolomic studies, as FOBI provides the relationships between several foods and their associated dietary metabolites (Castellano-Escuder et al. 2020).
Accordingly, to perform an ORA with the fobitools
package, it is necessary to provide a metabolite universe (all metabolites included in the statistical analysis) and a list of selected metabolites (selected metabolites according to a statistical criterion).
Here is an example:
# select 300 random metabolites from FOBI
idx_universe <- sample(nrow(fobitools::idmap), 300, replace = FALSE)
metaboliteUniverse <- fobitools::idmap %>%
dplyr::slice(idx_universe) %>%
pull(FOBI)
# select 10 random metabolites from metaboliteUniverse that are associated with 'Red meat' (FOBI:0193),
# 'Lean meat' (FOBI:0185) , 'egg food product' (FOODON:00001274),
# or 'grape (whole, raw)' (FOODON:03301702)
fobi_subset <- fobitools::fobi %>% # equivalent to `parse_fobi()`
filter(FOBI %in% metaboliteUniverse) %>%
filter(id_BiomarkerOf %in% c("FOBI:0193", "FOBI:0185", "FOODON:00001274", "FOODON:03301702")) %>%
dplyr::slice(sample(nrow(.), 10, replace = FALSE))
metaboliteList <- fobi_subset %>%
pull(FOBI)
fobitools::ora(metaboliteList = metaboliteList,
metaboliteUniverse = metaboliteUniverse,
subOntology = "food",
pvalCutoff = 0.01)
className | classSize | overlap | pval | padj | overlapMetabolites |
---|---|---|---|---|---|
black tea leaf (dry) | 7 | 5 | 0.0000003 | 0.0000176 | FOBI:030…. |
kale leaf (raw) | 7 | 5 | 0.0000003 | 0.0000176 | FOBI:030…. |
Red meat | 13 | 6 | 0.0000003 | 0.0000176 | FOBI:030…. |
apple juice | 8 | 5 | 0.0000007 | 0.0000211 | FOBI:030…. |
orange juice | 8 | 5 | 0.0000007 | 0.0000211 | FOBI:030…. |
green tea leaf (dry) | 9 | 5 | 0.0000015 | 0.0000293 | FOBI:030…. |
red tea | 9 | 5 | 0.0000015 | 0.0000293 | FOBI:030…. |
red velvet | 9 | 5 | 0.0000015 | 0.0000293 | FOBI:030…. |
lemon (whole, raw) | 17 | 6 | 0.0000024 | 0.0000398 | FOBI:030…. |
White fish | 5 | 4 | 0.0000031 | 0.0000398 | FOBI:030…. |
white bread | 5 | 4 | 0.0000031 | 0.0000398 | FOBI:030…. |
white wine | 5 | 4 | 0.0000031 | 0.0000398 | FOBI:030…. |
white sugar | 6 | 4 | 0.0000092 | 0.0001007 | FOBI:030…. |
cherry (whole, raw) | 12 | 5 | 0.0000092 | 0.0001007 | FOBI:030…. |
grapefruit (whole, raw) | 13 | 5 | 0.0000148 | 0.0001329 | FOBI:030…. |
sweet potato vegetable food product | 13 | 5 | 0.0000148 | 0.0001329 | FOBI:030…. |
tomato (whole, raw) | 13 | 5 | 0.0000148 | 0.0001329 | FOBI:030…. |
soybean (whole) | 23 | 6 | 0.0000180 | 0.0001526 | FOBI:030…. |
almond (whole, raw) | 7 | 4 | 0.0000212 | 0.0001703 | FOBI:030…. |
soybean oil | 3 | 3 | 0.0000269 | 0.0002061 | FOBI:030…. |
cocoa | 8 | 4 | 0.0000416 | 0.0003032 | FOBI:030…. |
bread food product | 9 | 4 | 0.0000737 | 0.0004697 | FOBI:030…. |
rye food product | 9 | 4 | 0.0000737 | 0.0004697 | FOBI:030…. |
whole bread | 9 | 4 | 0.0000737 | 0.0004697 | FOBI:030…. |
eggplant (whole, raw) | 4 | 3 | 0.0001058 | 0.0006228 | FOBI:030…. |
stem or spear vegetable | 4 | 3 | 0.0001058 | 0.0006228 | FOBI:030…. |
carrot root (whole, raw) | 10 | 4 | 0.0001208 | 0.0006601 | FOBI:030…. |
dairy food product | 10 | 4 | 0.0001208 | 0.0006601 | FOBI:030…. |
wine (food product) | 20 | 5 | 0.0001606 | 0.0008471 | FOBI:030…. |
grain plant | 11 | 4 | 0.0001867 | 0.0008927 | FOBI:030…. |
grain product | 11 | 4 | 0.0001867 | 0.0008927 | FOBI:030…. |
oregano (ground) | 11 | 4 | 0.0001867 | 0.0008927 | FOBI:030…. |
herb | 5 | 3 | 0.0002599 | 0.0012050 | FOBI:030…. |
olive (whole, ripe) | 13 | 4 | 0.0003913 | 0.0017609 | FOBI:030…. |
black pepper food product | 6 | 3 | 0.0005106 | 0.0021693 | FOBI:030…. |
vinegar | 6 | 3 | 0.0005106 | 0.0021693 | FOBI:030…. |
coffee (liquid drink) | 14 | 4 | 0.0005388 | 0.0021693 | FOBI:030…. |
cumin seed (whole, dried) | 14 | 4 | 0.0005388 | 0.0021693 | FOBI:030…. |
strawberry (whole, raw) | 15 | 4 | 0.0007225 | 0.0028345 | FOBI:030…. |
egg food product | 7 | 3 | 0.0008777 | 0.0033572 | FOBI:030…. |
chicory (whole, raw) | 2 | 2 | 0.0010033 | 0.0037442 | FOBI:030…. |
beer | 17 | 4 | 0.0012183 | 0.0043348 | FOBI:030…. |
flour | 17 | 4 | 0.0012183 | 0.0043348 | FOBI:030…. |
ale | 8 | 3 | 0.0013793 | 0.0044902 | FOBI:030…. |
blackberry (whole, raw) | 8 | 3 | 0.0013793 | 0.0044902 | FOBI:030…. |
blueberry (whole, raw) | 8 | 3 | 0.0013793 | 0.0044902 | FOBI:030…. |
wheat | 8 | 3 | 0.0013793 | 0.0044902 | FOBI:030…. |
oil | 19 | 4 | 0.0019184 | 0.0061148 | FOBI:030…. |
pear (whole, raw) | 9 | 3 | 0.0020322 | 0.0062184 | FOBI:030…. |
raspberry (whole, raw) | 9 | 3 | 0.0020322 | 0.0062184 | FOBI:030…. |
black coffee | 3 | 2 | 0.0029562 | 0.0082235 | FOBI:030…. |
black turtle bean (whole) | 3 | 2 | 0.0029562 | 0.0082235 | FOBI:030…. |
chickpea (whole) | 3 | 2 | 0.0029562 | 0.0082235 | FOBI:030…. |
lentil (whole) | 3 | 2 | 0.0029562 | 0.0082235 | FOBI:030…. |
turnip (whole, raw) | 3 | 2 | 0.0029562 | 0.0082235 | FOBI:030…. |
black currant (whole, raw) | 11 | 3 | 0.0038505 | 0.0101574 | FOBI:030…. |
orange (whole, raw) | 11 | 3 | 0.0038505 | 0.0101574 | FOBI:030…. |
tea food product | 11 | 3 | 0.0038505 | 0.0101574 | FOBI:030…. |
cauliflower (whole, raw) | 4 | 2 | 0.0058065 | 0.0145638 | FOBI:030…. |
celery stalk (raw) | 4 | 2 | 0.0058065 | 0.0145638 | FOBI:030…. |
pea (whole) | 4 | 2 | 0.0058065 | 0.0145638 | FOBI:030…. |
legume food product | 111 | 8 | 0.0063296 | 0.0156197 | FOBI:030…. |
grape (whole, raw) | 14 | 3 | 0.0080462 | 0.0192355 | FOBI:030…. |
meat food product | 14 | 3 | 0.0080462 | 0.0192355 | FOBI:030…. |
Whole grain cereal products | 117 | 8 | 0.0092659 | 0.0217036 | FOBI:030…. |
pomegranate (whole, raw) | 5 | 2 | 0.0095042 | 0.0217036 | FOBI:030…. |
rice grain food product | 5 | 2 | 0.0095042 | 0.0217036 | FOBI:030…. |
metaboliteList
termsThen, with the fobi_graph
function we can visualize the metaboliteList
terms with their corresponding FOBI relationships.
terms <- fobi_subset %>%
pull(id_code)
# create the associated graph
fobitools::fobi_graph(terms = terms,
get = "anc",
labels = TRUE,
legend = TRUE)
sessionInfo()
#> R version 4.4.1 (2024-06-14)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 24.04.1 LTS
#>
#> Matrix products: default
#> BLAS: /home/biocbuild/bbs-3.20-bioc/R/lib/libRblas.so
#> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.12.0
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_GB LC_COLLATE=C
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> time zone: America/New_York
#> tzcode source: system (glibc)
#>
#> attached base packages:
#> [1] stats4 stats graphics grDevices utils datasets methods
#> [8] base
#>
#> other attached packages:
#> [1] SummarizedExperiment_1.36.0 Biobase_2.66.0
#> [3] GenomicRanges_1.58.0 GenomeInfoDb_1.42.0
#> [5] IRanges_2.40.0 S4Vectors_0.44.0
#> [7] BiocGenerics_0.52.0 MatrixGenerics_1.18.0
#> [9] matrixStats_1.4.1 metabolomicsWorkbenchR_1.16.0
#> [11] POMA_1.16.0 ggrepel_0.9.6
#> [13] rvest_1.0.4 kableExtra_1.4.0
#> [15] lubridate_1.9.3 forcats_1.0.0
#> [17] stringr_1.5.1 dplyr_1.1.4
#> [19] purrr_1.0.2 readr_2.1.5
#> [21] tidyr_1.3.1 tibble_3.2.1
#> [23] ggplot2_3.5.1 tidyverse_2.0.0
#> [25] fobitools_1.14.0 BiocStyle_2.34.0
#>
#> loaded via a namespace (and not attached):
#> [1] rstudioapi_0.17.1 jsonlite_1.8.9
#> [3] MultiAssayExperiment_1.32.0 magrittr_2.0.3
#> [5] magick_2.8.5 farver_2.1.2
#> [7] rmarkdown_2.28 zlibbioc_1.52.0
#> [9] vctrs_0.6.5 memoise_2.0.1
#> [11] tinytex_0.53 S4Arrays_1.6.0
#> [13] htmltools_0.5.8.1 curl_5.2.3
#> [15] qdapRegex_0.7.8 tictoc_1.2.1
#> [17] SparseArray_1.6.0 sass_0.4.9
#> [19] parallelly_1.38.0 bslib_0.8.0
#> [21] impute_1.80.0 RecordLinkage_0.4-12.4
#> [23] cachem_1.1.0 igraph_2.1.1
#> [25] lifecycle_1.0.4 pkgconfig_2.0.3
#> [27] Matrix_1.7-1 R6_2.5.1
#> [29] fastmap_1.2.0 GenomeInfoDbData_1.2.13
#> [31] future_1.34.0 selectr_0.4-2
#> [33] digest_0.6.37 syuzhet_1.0.7
#> [35] colorspace_2.1-1 ps_1.8.1
#> [37] RSQLite_2.3.7 labeling_0.4.3
#> [39] fansi_1.0.6 timechange_0.3.0
#> [41] abind_1.4-8 httr_1.4.7
#> [43] polyclip_1.10-7 compiler_4.4.1
#> [45] proxy_0.4-27 bit64_4.5.2
#> [47] withr_3.0.2 BiocParallel_1.40.0
#> [49] viridis_0.6.5 DBI_1.2.3
#> [51] highr_0.11 ggforce_0.4.2
#> [53] MASS_7.3-61 lava_1.8.0
#> [55] DelayedArray_0.32.0 textclean_0.9.3
#> [57] tools_4.4.1 chromote_0.3.1
#> [59] future.apply_1.11.3 nnet_7.3-19
#> [61] glue_1.8.0 promises_1.3.0
#> [63] grid_4.4.1 fgsea_1.32.0
#> [65] generics_0.1.3 gtable_0.3.6
#> [67] lexicon_1.2.1 tzdb_0.4.0
#> [69] class_7.3-22 websocket_1.4.2
#> [71] data.table_1.16.2 hms_1.1.3
#> [73] XVector_0.46.0 tidygraph_1.3.1
#> [75] xml2_1.3.6 utf8_1.2.4
#> [77] pillar_1.9.0 limma_3.62.0
#> [79] vroom_1.6.5 later_1.3.2
#> [81] splines_4.4.1 tweenr_2.0.3
#> [83] lattice_0.22-6 survival_3.7-0
#> [85] bit_4.5.0 tidyselect_1.2.1
#> [87] knitr_1.48 gridExtra_2.3
#> [89] bookdown_0.41 svglite_2.1.3
#> [91] xfun_0.48 graphlayouts_1.2.0
#> [93] statmod_1.5.0 UCSC.utils_1.2.0
#> [95] stringi_1.8.4 yaml_2.3.10
#> [97] evaluate_1.0.1 codetools_0.2-20
#> [99] evd_2.3-7.1 ggraph_2.2.1
#> [101] BiocManager_1.30.25 cli_3.6.3
#> [103] ontologyIndex_2.12 rpart_4.1.23
#> [105] xtable_1.8-4 systemfonts_1.1.0
#> [107] struct_1.18.0 munsell_0.5.1
#> [109] processx_3.8.4 jquerylib_0.1.4
#> [111] Rcpp_1.0.13 globals_0.16.3
#> [113] parallel_4.4.1 blob_1.2.4
#> [115] ff_4.5.0 listenv_0.9.1
#> [117] viridisLite_0.4.2 ipred_0.9-15
#> [119] scales_1.3.0 prodlim_2024.06.25
#> [121] e1071_1.7-16 crayon_1.5.3
#> [123] clisymbols_1.2.0 rlang_1.1.4
#> [125] ada_2.0-5 cowplot_1.1.3
#> [127] fastmatch_1.1-4
Castellano-Escuder, Pol, Raúl González-Domı́nguez, David S Wishart, Cristina Andrés-Lacueva, and Alex Sánchez-Pla. 2020. “FOBI: An Ontology to Represent Food Intake Data and Associate It with Metabolomic Data.” Database 2020.