#ensure you have mzR installed
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("mzR", version = "3.8")
# install the package directly from Github
library(devtools)
devtools::install_github("JosieLHayes/adductomicsR")
#install the data package containing the data
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("ExperimentHub", version = "3.9")
#or download the packages and install from source
library(devtools)
devtools::install("path_to_dir/adductomicsR")
devtools::install("path_to_dir/adductData")
After installation of the adductomics package and all dependencies attach the adductomics package by typing (copying and pasting) this line of code into the R console and hitting enter:
We have provided 2 mzXML files for use in this vignette in adductData.
Mass drift correction: Usually mass drift is corrected using lock masses on the mass spectrometer. If this has not been done a python script is provided in the directory in which the package is saved on your computer at /inst/extdata/thermo_MassDriftCalc.py and can be launched from within python using (replace the path to the python script in your system): exec(open(“thermo_MassDriftCalc.py“).read())
Each sample is corrected for retention time drift using the rtDevModeling function. To run this with the default parameters enter the path of the directory containing your mzXML files and the run order file (order in which samples were run). For further information on parameters see ??rtDevModelling. An example run order file is available in inst/extdata (within the directory where the package is saved on your computer) and 2 mzXML files are available in adductData/ExperimentHub.These files will be used in this vignette automatically.
Download the mzXML files from ExperimentHub for use in this vignette. They must have .mzXML to be recognized by the package so they are renamed as well.
eh = suppressMessages(suppressWarnings(ExperimentHub::ExperimentHub()))
temp = suppressMessages(suppressWarnings(
AnnotationHub::query(eh, 'adductData')))
## EH1957
## "/home/biocbuild/.cache/R/ExperimentHub/2115eb32f57b2_1957"
## [1] TRUE
## see ?adductData and browseVignettes('adductData') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## EH1958
## "/home/biocbuild/.cache/R/ExperimentHub/2115eb5065dfa_1958"
## [1] TRUE
The specSimPepId function detects adducts present on the peptide. To run this with the default parameters (the largest triply charged peptide of human serum albumin) enter the path of your mzxml files and rtDevModels object. For further information on running this with different peptides see ??specSimPepId. This produces MS2 spectra plots, each in a separate directory for each sample. A plot of the model spectrum is also saved in the mzXML files directory for comparison. The spectra are grouped based on the mz and RT windows, and plots of these groups are also provided based on the raw RT and adjusted RT. These plots can be used to determine whether multiple groups pertain to the same peak.
A list of the adducts for quantification and their monoisotopic mass (MIM), retention time (RT), peptide and charge is generated using the following command. Substitute the file path of the allResults file to the location of your allResults file from the previous step.
generateTargTable(
allresultsFile=paste0(system.file("extdata",package =
"adductomicsR"),'/allResults_ALVLIAFAQYLQQCPFEDHVK_example.csv'),
csvDir=tempdir(check = FALSE)
)
It is recommended that the allGroups plot ( m/z vs RT) is used to ensure that the adducts in the target table do not pertain to the same peak, as the quantification step can be computationally intensive.
See ??adductQuant for an explanation on the parameters for this function. To use your target table produced in the previous step, alter the value in the ‘targTable’ option to the path of your target table. Similarly replaced the path to the directory of your own mzXML files in filePaths (set as “Users/Documents/mzXMLfiles” here.
adductQuant(
nCores=2,
targTable=paste0(system.file("extdata",
package="adductomicsR"),
'/exampletargTable2.csv'),
intStdRtDrift=30,
rtDevModels= paste0(hubCache(temp),'/rtDevModels.RData'),
filePaths=list.files(hubCache(temp),pattern=".mzXML",
all.files=FALSE,full.names=TRUE),
quantObject=NULL,
indivAdduct=NULL,
maxPpm=5,
minSimScore=0.8,
spikeScans=1,
minPeakHeight=100,
maxRtDrift=20,
maxRtWindow=240,
isoWindow=80,
hkPeptide='LVNEVTEFAK',
gaussAlpha=16
)
It is recommended that spectra for each of the adducts found are checked manually using LC-MS software, either at this step or before quantification.
To load your adductquantif object set the path to the file on your system. In the example it assumes the file is present in your working directory.
Mass spectrometry data is inherently noisy, and the filterAdductTable() function will filter out samples and adducts based on set thresholds. It is recommended to use this filter function to remove adducts that have many missing values and samples where the housekeeping peptide is weak, suggestive of misintegration. Substitute the name of the peaklist file with the path and the name of your peaklist file produced in the previous step.
## R version 4.2.1 (2022-06-23)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 20.04.5 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.16-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.16-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_GB LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] adductomicsR_1.14.0 ExperimentHub_2.6.0 AnnotationHub_3.6.0
## [4] BiocFileCache_2.6.0 dbplyr_2.2.1 BiocGenerics_0.44.0
## [7] adductData_1.13.0
##
## loaded via a namespace (and not attached):
## [1] bitops_1.0-7 bit64_4.0.5
## [3] filelock_1.0.2 httr_1.4.4
## [5] prabclus_2.3-2 GenomeInfoDb_1.34.0
## [7] tools_4.2.1 bslib_0.4.0
## [9] utf8_1.2.2 R6_2.5.1
## [11] DT_0.26 DBI_1.1.3
## [13] ade4_1.7-19 nnet_7.3-18
## [15] withr_2.5.0 tidyselect_1.2.0
## [17] bit_4.0.4 curl_4.3.3
## [19] compiler_4.2.1 rvest_1.0.3
## [21] cli_3.4.1 Biobase_2.58.0
## [23] pastecs_1.3.21 xml2_1.3.3
## [25] sass_0.4.2 diptest_0.76-0
## [27] DEoptimR_1.0-11 robustbase_0.95-0
## [29] rappdirs_0.3.3 stringr_1.4.1
## [31] digest_0.6.30 rmarkdown_2.17
## [33] XVector_0.38.0 pkgconfig_2.0.3
## [35] htmltools_0.5.3 fastmap_1.1.0
## [37] htmlwidgets_1.5.4 rlang_1.0.6
## [39] RSQLite_2.2.18 shiny_1.7.3
## [41] jquerylib_0.1.4 generics_0.1.3
## [43] jsonlite_1.8.3 mclust_6.0.0
## [45] dplyr_1.0.10 RCurl_1.98-1.9
## [47] magrittr_2.0.3 modeltools_0.2-23
## [49] GenomeInfoDbData_1.2.9 Rcpp_1.0.9
## [51] S4Vectors_0.36.0 fansi_1.0.3
## [53] lifecycle_1.0.3 stringi_1.7.8
## [55] yaml_2.3.6 MASS_7.3-58.1
## [57] zlibbioc_1.44.0 plyr_1.8.7
## [59] flexmix_2.3-18 grid_4.2.1
## [61] blob_1.2.3 parallel_4.2.1
## [63] promises_1.2.0.1 crayon_1.5.2
## [65] lattice_0.20-45 Biostrings_2.66.0
## [67] KEGGREST_1.38.0 knitr_1.40
## [69] pillar_1.8.1 boot_1.3-28
## [71] fpc_2.2-9 reshape2_1.4.4
## [73] codetools_0.2-18 stats4_4.2.1
## [75] glue_1.6.2 BiocVersion_3.16.0
## [77] evaluate_0.17 BiocManager_1.30.19
## [79] png_0.1-7 vctrs_0.5.0
## [81] httpuv_1.6.6 foreach_1.5.2
## [83] purrr_0.3.5 kernlab_0.9-31
## [85] assertthat_0.2.1 cachem_1.0.6
## [87] xfun_0.34 mime_0.12
## [89] xtable_1.8-4 later_1.3.0
## [91] class_7.3-20 tibble_3.1.8
## [93] iterators_1.0.14 AnnotationDbi_1.60.0
## [95] memoise_2.0.1 IRanges_2.32.0
## [97] cluster_2.1.4 ellipsis_0.3.2
## [99] interactiveDisplayBase_1.36.0