Import
Import.Rmd
Introduction
This vignette gives an overview about importing files for downstream analysis with mpwR.
Import
Importing the output files from each software can be easily performed
with prepare_mpwR
. Please put all output files in one
folder and follow the guidelines for naming the files. No other
files/subfolders are allowed.
files <- prepare_mpwR(path = "Path_to_Folder_with_files")
Naming Files
The software output files need to be named according some simple rules.
Suffix
Please use the following suffixes for the corresponding software:
- Spectronaut
- *.tsv - _Report
- MaxQuant
- evidence.txt - _evidence
- peptides.txt - _peptides
- proteinGroups.txt - _proteinGroups
- DIA-NN
- *.tsv - _Report
- ProteomeDiscoverer
- PSMs.txt - _PSMs
- PeptideGroups.txt - _PeptideGroups
- Proteins.txt - _Proteins
- ProteinGroups.txt - _ProteinGroups
Note that the suffixes for MaxQuant and ProteomeDiscoverer can be used as is.
Prefix
Please use the same prefix for your analysis and the respective software output file(s). Each analysis needs a unique identifier.
Examples:
- Spectronaut
- WorkflowA1_Report.tsv
- MaxQuant
- WorkflowA2_evidence.txt
- WorkflowA2_peptide.txt
- WorkflowA2_proteinGroups.txt
- DIA-NN
- WorkflowA3_Report.tsv
- ProteomeDiscoverer
- WorkflowA4_PSMs.txt
- WorkflowA4_PeptideGroups.txt
- WorkflowA4_Proteins.txt
- WorkflowA4_ProteinGroups.txt
Handling Error Messages
Subsequently, common error messages are explained and solutions are pointed out.
Wrong suffix
Error in prepare_mpwR(path = path) : Unknown file suffix detected! Please use: _evidence, _peptides, _proteinGroups, _PSMs, _Proteins, _PeptideGroups, _ProteinGroups, _ReportSolution: Check your suffixes and make sure to use the correct spelling and the correct one for the respective software.
Wrong number of files
Error in prepare_mpwR(path = path) : Wrong number of analyses detected for WorkflowA2! MaxQuant requires 3 input files. Remember: Use unique name for each analysis.Solution: Make sure to include all output files from the respective software. MaxQuant requires 3 files, PD 4 files, DIA-NN and Spectronaut 1 file, respectively. Also use the same name (prefix) for all files.
Subfolder
Error in data.table::fread(input_file, …) : File ‘DIR//subfolder’ is a directory. Not yet implemented.Solution: Do not use subfolders in the directory of the output files. Put all files in a folder and use this path for prepare_mpwR.
Same file name
Error in prepare_mpwR(path = path) : Please use a unique filename for each analysis.
Solution: Please use a unique name (prefix) for each respective output file(s). E.g. if you used…:
- WorkflowA1_Report.tsv
- WorkflowA1_evidence.txt
- WorkflowA1_peptide.txt
- WorkflowA1_proteinGroups.txt
… the error message is displayed. As a solution, for example add the used software to the name:
- WorkflowA1_Spectronaut_Report.tsv
- WorkflowA1_MQ_evidence.txt
- WorkflowA1_MQ_peptide.txt
- WorkflowA1_MQ_proteinGroups.txt
Missing columns
Error in prepare_input(ordered_files[[i]][[“data”]][[“DIA-NN”]], software = “DIA-NN”) : Not all required columns present in submitted data.
Here example for DIA-NN. Similar messages for other software tools. Please make sure to include all required columns for the used software. Details are in the vignette Requirements.