Modified single cell tutorial and initiated bulk tutorial #368

Vivian0105 · 2025-01-27T15:35:01Z

PR checklist

…ials.

nf-core-bot · 2025-01-27T15:35:39Z

Warning

Newer version of the nf-core template is available.

Your pipeline is using an old version of the nf-core template: 3.0.2.
Please update your pipeline to the latest version.

For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation.

ggabernet · 2025-03-08T22:22:59Z

docs/usage/sc_AIRRseq/single_cell_tutorial.md

+
+
+## Running airrflow pipeline from two different input formats
+There are two acceptable input formats for airrflow single-cell AIRRseq pipeline: AIRR rearrangement or fastq format. 


@Vivian0105 it would be great to show here what is the output message that appears on the console once the tests pass successfully

docs/usage/sc_AIRRseq/single_cell_tutorial.md

ggabernet · 2025-03-09T01:20:39Z

docs/usage/sc_AIRRseq/single_cell_tutorial.md

+   - If the automatic threshold is unsatisfactory, you can set the threshold manually and re-run the pipeline. 
+   (Tip: use -resume whenever running the Nextflow pipeline to avoid duplicating previous work). 
+   - For TCR data, where somatic hypermutation does not occur, set the clonal_threshold to 0 when running the Airrflow pipeline.  
+   - Once the threshold is established, clones are assigned to the sequences. A variety of tables and plots associated with clonal analysis were added to the folder 'clonal_analysis/define_clones', such as  sequences_per_locus_table, sequences_per_c_call_table, sequences_per_constant_region_table,num_clones_table, clone_sizes_table,clone size distribution plot, clonal abundance plot, diversity plot and etc. 


I've added more information on the clonal analysis in a dedicated section now, could you rewrite the section here to just point out to the respective find_threshold report, clonal_analysis report and lineage_threshold reports?

ggabernet · 2025-03-09T01:26:50Z

docs/usage/sc_AIRRseq/single_cell_tutorial.md

+
+6. Other reporting.
+   - Additional reports are also generated, including: a multiqc report which summarizes QC metrics across all samples, pipeline_info reports and report_file_size reports.
+


Could you rewrite this part a bit focusing on the different html reports and a general description on what is inside them?

ggabernet

@Vivian0105 I've reviewed and edited the single-cell tutorial for now and added also some comments to this review on how it can still be improved. Let me know if you have any questions!

ggabernet · 2025-03-11T00:15:03Z

docs/usage/bulk_AIRRseq/bulk_tutorial.md

+nextflow run nf-core/airrflow -r 4.2.0 -profile test,docker --outdir test_results
+```
+
+## Running airrflow pipeline 


@Vivian0105 could you also add the output message here?

ggabernet · 2025-03-11T00:16:28Z

docs/usage/sc_AIRRseq/single_cell_tutorial.md

+
+> [Tip]
+> When launching a Nextflow pipeline with the `-resume` option, any processes that have already been run with the exact same code, settings and inputs will be cached and the pipeline will resume from the last step that changed or failed with an error. The benefit of using `-resume` is to avoid duplicating previous work and save time when re-running a pipeline.
+> We include `-resume` in our Nextflow command as a precaution in case anything goes wrong during execution. After fixing the issue, you can relaunch the pipeline with the same command, it will resume running from the point of failure, significantly reducing runtime and resource usage.


@Vivian0105 could you add this here as well?

ggabernet · 2025-03-11T00:20:31Z

docs/usage/sc_AIRRseq/single_cell_tutorial.md

+   - Pipeline_info report: various reports relevant to the running and execution of the pipeline.
+   - Report_file_size report: Summary of the number of sequences left after each of the most important pipeline steps.
+
+## Understanding error messages


@Vivian0105 I just thought of a new section to explain how to understand error messages and facilitate debugging. What do you think?

ggabernet · 2025-03-11T02:59:50Z

docs/usage/bulk_AIRRseq/bulk_tutorial.md

+
+- A configuration file requiring memory, cpu and time. Before setting the configuration file, we recommend verifying the available memory and cpus on your system. Otherwise, exceeding the system's capacity may result in unexpected errors.
+
+- Information on bulk library generation method(protocol).


@Vivian0105 it would be great to also provide the samplesheet and configuration file example for this tutorial for download.

ggabernet · 2025-03-11T03:20:12Z

docs/usage/bulk_AIRRseq/bulk_tutorial.md

+
+After launching the pipeline the following will be printed to the console output:
+
+```bash


@Vivian0105 Could you provide here the console output examples for the tutorial?

ggabernet · 2025-03-11T03:21:43Z

docs/usage/bulk_AIRRseq/bulk_tutorial.md

+
+After running the pipeline, several reports are generated under the result folder.
+
+![example of result folder](bulk_tutorial_images/AIRRFLOW_BULK_RESULT.png)


@Vivian0105 Here as well to make it easier to maintain I would just list the output directories as a text list here.

ggabernet · 2025-03-11T03:22:48Z

docs/usage/bulk_AIRRseq/bulk_tutorial.md

+The analysis steps and their corresponding folders, where the results are stored, are listed below.
+
+
+1. QC


@Vivian0105 I would focus in this section on pointing to the reports for each of these analysis steps that you are mentioning here and what they contain.

Vivian0105 and others added 8 commits January 15, 2025 11:39

Add links for docker installation.

28727e6

add *.Rhistory in .gitignore.

867a44c

Modified single_cell_tutorial.md and added files needed for the tutor…

43209dc

…ials.

Delete docs/usage/single_cell_tutorial.md

3beee19

added steps on running pipeline from 10x fastq files.

387e2f6

Added explaination on airrflow results

eec5916

change airrflow_result_folder_example.png

9f43919

Modified single cell tutorial and initiated bulk tutorial

0ad712d

Vivian0105 added 5 commits January 30, 2025 14:23

Added more contents for bulk tutorials

fa6d385

Modified bulk tutorial

d096162

Modified bulk tutorial

d510476

Polish single cell tutorial

cb6cc5c

Polish bulk tutorial

393d2fe

ggabernet self-requested a review March 8, 2025 00:16

ggabernet reviewed Mar 8, 2025

View reviewed changes

ggabernet reviewed Mar 9, 2025

View reviewed changes

docs/usage/sc_AIRRseq/single_cell_tutorial.md Outdated Show resolved Hide resolved

ggabernet reviewed Mar 9, 2025

View reviewed changes

ggabernet and others added 3 commits March 8, 2025 20:33

improve single cell tutorial

0524552

Gisela modified single cell tutorial

f00b4f5

Add output file description

3af3432

ggabernet self-assigned this Mar 10, 2025

ggabernet reviewed Mar 11, 2025

View reviewed changes

update sc tutorial

ea4b3ec

ggabernet reviewed Mar 11, 2025

View reviewed changes

updates to bulk and sc tutorials

db2d953

ggabernet reviewed Mar 11, 2025

View reviewed changes

ggabernet and others added 6 commits March 10, 2025 23:23

update bulk tutorial

fa8a487

Add message regarding running airrflow pipeline

226aa19

Add message regarding running airrflow for bulk tutorial

b585114

updates bulk tutorial

788949a

updates tutorials

dfeddb1

Further editing Understand the result part

36265da

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modified single cell tutorial and initiated bulk tutorial #368

Modified single cell tutorial and initiated bulk tutorial #368

Vivian0105 commented Jan 27, 2025

nf-core-bot commented Jan 27, 2025

ggabernet Mar 8, 2025

ggabernet Mar 9, 2025

ggabernet Mar 9, 2025

ggabernet left a comment

ggabernet Mar 11, 2025

ggabernet Mar 11, 2025

ggabernet Mar 11, 2025

ggabernet Mar 11, 2025

ggabernet Mar 11, 2025

ggabernet Mar 11, 2025

ggabernet Mar 11, 2025



		## Running airrflow pipeline from two different input formats
		There are two acceptable input formats for airrflow single-cell AIRRseq pipeline: AIRR rearrangement or fastq format.


		6. Other reporting.
		- Additional reports are also generated, including: a multiqc report which summarizes QC metrics across all samples, pipeline_info reports and report_file_size reports.


		- A configuration file requiring memory, cpu and time. Before setting the configuration file, we recommend verifying the available memory and cpus on your system. Otherwise, exceeding the system's capacity may result in unexpected errors.

		- Information on bulk library generation method(protocol).


		After launching the pipeline the following will be printed to the console output:

		```bash


		After running the pipeline, several reports are generated under the result folder.

		![example of result folder](bulk_tutorial_images/AIRRFLOW_BULK_RESULT.png)

		The analysis steps and their corresponding folders, where the results are stored, are listed below.


		1. QC

Modified single cell tutorial and initiated bulk tutorial #368

Are you sure you want to change the base?

Modified single cell tutorial and initiated bulk tutorial #368

Conversation

Vivian0105 commented Jan 27, 2025

PR checklist

nf-core-bot commented Jan 27, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ggabernet left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment