configfile
Headers
The configfile is an ini file which is separated in different blocks:
Dirs
The Dirs block defines path information to important directories.
baseDir: the base directory where the sequencer writes the output into.
outputDir: the directory where demultiplexing will be performed.
flowLogDir: the directory where dissectBCL will write its log files into.
seqFacDir: the directory where the sequencing facility has access to. Lightweight QC files will be written here.
piDir: The base directory that holds each principal investigator’s (PI) folder (See PIs).
Internals
The Internals block defines which PI is internal. Upon completion, projects are either copied into the ‘periphery’ or uploaded via fexsend so external users can download the project. Inside this block there are two elements:
PIs: a list of principal investigators.
seqDir: the directory inside a PI’s directory where the sequencing data can be deposited.
fex: Boolean that indicates if an external project (PI not in PIs list) should be packed as a tar and uploaded using fexsend.
If a project is from an internal PI, it will be copied over into:
piDir/PI/seqDir
Note that multiple seqDirs per PI are allowed. For example if seqDir = sequencing_data:
sequencing_data
sequencing_data1
sequencing_data2
can exist, and the latest (e.g. the one with the highest number) will be used to copy over the data.
parkour
The parkour block contains all necessary information to communicate with parkour <https://github.com/maxplanck-ie/parkour2>. Note that this block contains sensitive information.
pullURL: URL to pull data from in parkour2.
pushURL: URL to push metrics to in parkour2.
user: the username for API requests
pw: the password for API requests
cert: the pem certificate for API requests
URL: the URL to Parkour2, e.g. https://parkour.yourdomain.tld.
software
The software block contains paths to all the necessary software and files that are NOT included in the conda installation.
bclconvert: path to the bcl-convert executable
bases2fastq: path to the bases2fastq executable (for aviti runs)
fastqc_adapters: a (custom) list of adapters used by fastqc.
kraken2db: path to your kraken database (created with contam, or sourced from elsewhere <https://github.com/DerrickWood/kraken2/blob/master/docs/MANUAL.markdown>)
misc
the misc block contains a path to an image file that will be put in the multiQC file. Additionaly, a default explanation on the custom kraken2 database is provided, which will be available in the multiqc report as well. The number of threads (total number) that will be used by dissectBCL needs to be set here too.
threads: the number of threads that will be used by dissectBCL.
mpiImg: path to jpg file.
krakenExpl: explanation string.
communication
The communication block has four elements, all of which are related to email communication by the pipeline.
subject: Part of the subject line for e-mails. This string can be used to organise and filter dissectBCL’s e-mails easier.
fromAddress: the e-mail address where the emails come from.
host: the email host <https://docs.python.org/3/library/smtplib.html>
finishedTo: email address(es) to send a notification upon completion of a flowcell. If multiple emails, these are comma separated.
bioinfoCore: email address of the core unit, where error messages go to.
debug_mode: boolean (True or False) that indicates if the log entries should be printed to stdout as well. If set to False, they are written to a flowcell specific file in the specified flowLogDir dir.
example
[Dirs]
baseDir=/path/to/bcl/folder
outputDir=/path/to/fastq/output/folder
flowLogDir=/path/to/log/folder
seqFacDir=/path/to/share/qc/with/facility
piDir=/base/with/enduser/folders
bioinfoCoreDir=/path/to/share/qc/with/core
[Internals]
PIs=[pi1,pi2,pi3,pi4,pi5]
seqDir=seqfolderstr
[parkour]
pullURL=parkour.pull.url/api/analysis_list/analysis_list
pushURL=parkour.push.url/api/run_statistics/upload
user=parkourUser
password=parkourPw
cert=/path/to/cert.pem
URL=parkour.domain.tld
[software]
bclconvert=/path/to/bclconvert
fastqc_adapters=/path/to/fastqc_adapters.txt
kraken2db=/path/to/kraken2_contaminome/contaminomedb
[misc]
mpiImg=/path/to/multiqc_headerimg.jpg
[communication]
deepSeq=email@seqfacility.de
bioinfoCore=email@bioinfocore.de
fromAddress=sender@dissectbcl.de
host=hostmail.address.de