C. magister MiSeq data processing

I am helping to analyze some initial MBDSeq data from a Dungeness crab OA experiment, run by NOAA (Krista Nichols’ group, with Mac) on the MiSeq to do a quick QC before sending the libraries off for full sequencing.

My task: Run the MiSeq data through a trimming/mapping and alignment pipeline so we can get an idea of mapping efficiency and bisulfite conversion rate (i.e. CHG methylation from Bismark). The pipeline is based on the one developed by the MethCompare group (check out their repo), and which I tested for the WGBS data. See my WGBS pipeline in this Jupyter Notebook Notebook-01_Exploring-WGBS-data.ipynb.

Check out my Jupyter Notebook entry for more details!: MBD-01 Processing QC MiSeq data.ipynb

Here’s the Bismark summary report for the MiSeq data (NOTE: 6 of the samples are not here, since their data files were corrupted during transfer from NOAA). Click on the image to view in HTML format:

Bismark Summary Report

Questions for Science Hour:

I could not get Jupyter Notebook to work on Mox despite following instructions on the Mox wiki. I emailed the Mox folks, but haven’t heard back. I instead just summarized my steps in the above Jupyter Notebook after running things on Mox.

I encountered a path/directory issue running Bismark on Mox. I had to set my working directory to the folder which housed my trimmed reads (rather than specifying the read location). According to the scripts written by others, for example the one from the MethCompare group, this isn’t necessary. What gives?

Written on December 15, 2020