Overview

The Sequencing Run Details in SeqSphere+ contain the details of an Illumina or a PacBio sequencing run. A SeqSphere+ assembling pipeline reads and imports them automatically if the required Illumina or PacBio run info files (see below) are found together with the FASTQ files. The Sequencing Run Details are stored in the database and can be used for further quality control at run level. The Sample procedure details field Sequencing Run ID holds the run ID as a reference to the stored run details, and the field Sequencing Run QC holds QC warnings. If they are not set by manually defined procedure details, the Sequencing Length and Sequencing Platform are also filled from the run details.

Important: To import Illumina Sequencing Run Details on a Windows client computer, either the Windows Subsystem for Linux or the Microsoft Visual C++ 2015 libraries must be installed.

Illumina Run Details

The following Illumina run directories and files are required to import Sequencing Run Details:
If the pipeline is defined to import FASTQ files from a Directory, this directory must contain the InterOp directory and all other required files mentioned above. If the InterOp folder is not available or SeqSphere+ cannot read it, the following values will be missing: Cluster Density, percentage of reads passing Q30, Error Rate, and Output number of bases. If the input source type of the pipeline is a MiSeq Repository, the run files are automatically detected and imported from the run folder.

When importing samples in a pipeline, SeqSphere+ compares the imported sample names with the names listed in the SampleSheet.csv. If any of the processed samples does not appear in the sample sheet, a warning message is written to the pipeline log. If a sample that is listed in the sample sheet is missing from the processed samples, no warning message is issued. However, both extra and missing samples are listed and highlighted in the Sequencing Run Details window.

The sequencing run details are automatically quality controlled for the two parameters %>=Q30 and Cluster Density. A warning is given if a parameter does not meet the following thresholds:
The thresholds are based on the Illumina specifications for MiSeq and NextSeq.

PacBio Run Details

The following PacBio run directories and files are required to import Sequencing Run Details:
If the pipeline is defined to import FASTA/BAM files from a Directory, this directory must contain all required files mentioned above. If required files are not available or SeqSphere+ cannot read them, some values will be missing. These figures are the productivity percentages P0, P1, and P2.

When importing samples in a pipeline, SeqSphere+ compares the imported sample names with the names listed in the sample sheet. If both a CSV and an XML sample sheet are available, the CSV file is used for comparing the imported sample names. If any of the processed samples does not appear in the sample sheet, a warning message is written to the pipeline log. If a sample that is listed in the sample sheet is missing from the processed samples, no warning message is issued. However, both extra and missing samples are listed and highlighted in the Sequencing Run Details window.

Browse Sequencing Run Details

The stored Sequencing Run Details can be accessed in SeqSphere+ using the menu Options | Sequencing Run Details.... The table shows the stored Sequencing Run Details, filtered by one of the two time criteria:
The details of a run can be opened in a new dialog window by double-clicking a row in the table, or by selecting a row and pressing the button Open Run Details. Alternatively, the run details can be opened by right-clicking the field Sequencing Run ID in the procedure tab of a loaded sample.

The upper part of the window contains the parameters of the sequencing run. For Illumina runs, the two quality-checked parameters %>=Q30 and Cluster Density are highlighted green if they pass the quality control; otherwise they are highlighted yellow. The tooltips for those fields show the recommended value ranges according to kit version and read length.

Below the parameters, the sample sheet is listed in a table together with the samples that were already processed. The first five columns are values from the sample sheet; the sixth (Sample ID) and following columns are values from the processed samples. Grayed-out rows (with only the first five columns filled in) are samples that are defined in the sample sheet but were not yet processed. Red rows (with only the sixth and following columns filled in) are samples that were processed but were not found in the sample sheet. The processed samples can be opened directly from this table by selecting them and using the load button.

The parameters, the samples table, and the original sample sheet can be exported.
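
The comparison between sample sheet and processed samples described above can be reproduced outside SeqSphere+ on an exported sample sheet. The following Python sketch is not part of SeqSphere+ and makes no claim about how SeqSphere+ works internally. The function names `read_sample_ids` and `compare_samples` are hypothetical. It also assumes that the sample sheet has a `[Data]` section with a `Sample_ID` column, and that samples are matched on that column; adjust the `column` argument if your names are stored elsewhere.

```python
import csv
import io


def read_sample_ids(sheet_text, column="Sample_ID"):
    """Return the sample IDs listed in the [Data] section of a sample sheet.

    Assumption: the sample table starts on the line after "[Data]" and its
    first row is a header that contains the given column name.
    """
    lines = sheet_text.splitlines()
    start = next(
        i for i, line in enumerate(lines) if line.strip().startswith("[Data]")
    ) + 1
    reader = csv.DictReader(io.StringIO("\n".join(lines[start:])))
    return [row[column].strip() for row in reader if row.get(column)]


def compare_samples(sheet_ids, processed_ids):
    """Classify samples the way the Sequencing Run Details table displays them."""
    sheet, processed = set(sheet_ids), set(processed_ids)
    return {
        # Defined in the sample sheet but not yet processed (grayed-out rows).
        "not_yet_processed": sorted(sheet - processed),
        # Processed but not found in the sample sheet (red rows).
        "not_in_sample_sheet": sorted(processed - sheet),
    }


if __name__ == "__main__":
    # Made-up example data, for illustration only.
    example_sheet = (
        "[Header]\n"
        "Date,2020-01-01\n"
        "[Data]\n"
        "Sample_ID,Sample_Name\n"
        "S1,a\n"
        "S2,b\n"
        "S3,c\n"
    )
    print(compare_samples(read_sample_ids(example_sheet), ["S2", "S4"]))
```

Using set differences keeps the two cases symmetric: one difference yields the samples that would appear as grayed-out rows, the other the samples that would appear as red rows.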