Pipeline for Single Cell Data (deprecated)
Source:R/methods-SingleCellPipeline.R
sc_long_pipeline.Rd
This function is deprecated. Please use [SingleCellPipeline()] instead.
Usage
sc_long_pipeline(
annotation,
fastq,
outdir,
genome_fa,
minimap2 = NULL,
barcodes_file = NULL,
expect_cell_number = NULL,
config_file = NULL
)
Arguments
- annotation
The file path to the annotation file in GFF3 format
- fastq
The file path to input fastq file
- outdir
The path to directory to store all output files.
- genome_fa
The file path to genome fasta file.
- minimap2
Path to minimap2, optional.
- barcodes_file
The file with expected cell barcodes, with each barcode on a new line.
- expect_cell_number
The expected number of cells in the sample. This is used if
barcodes_file
is not provided. SeeBLAZE
for more details.- config_file
File path to the JSON configuration file.
See also
SingleCellPipeline
for the new pipeline interface,
BulkPipeline
for bulk long data,
MultiSampleSCPipeline
for multi sample single cell pipelines.
Examples
outdir <- tempfile()
dir.create(outdir)
bc_allow <- file.path(outdir, "bc_allow.tsv")
genome_fa <- file.path(outdir, "rps24.fa")
R.utils::gunzip(
filename = system.file("extdata", "bc_allow.tsv.gz", package = "FLAMES"),
destname = bc_allow, remove = FALSE
)
R.utils::gunzip(
filename = system.file("extdata", "rps24.fa.gz", package = "FLAMES"),
destname = genome_fa, remove = FALSE
)
sce <- FLAMES::sc_long_pipeline(
genome_fa = genome_fa,
fastq = system.file("extdata", "fastq", "musc_rps24.fastq.gz", package = "FLAMES"),
annotation = system.file("extdata", "rps24.gtf.gz", package = "FLAMES"),
outdir = outdir,
barcodes_file = bc_allow,
config_file = FLAMES::create_config(
outdir,
pipeline_parameters.demultiplexer = "flexiplex"
)
)
#> Writing configuration parameters to: /tmp/RtmpdufN84/file7a181a6a699/config_file_31256.json
#> Configured steps:
#> barcode_demultiplex: TRUE
#> genome_alignment: TRUE
#> gene_quantification: TRUE
#> isoform_identification: TRUE
#> read_realignment: TRUE
#> transcript_quantification: TRUE
#> samtools not found, will use Rsamtools package instead
#> ── Running step: barcode_demultiplex @ Tue Aug 19 07:47:35 2025 ────────────────
#> Using flexiplex for barcode demultiplexing.
#> FLEXIPLEX 0.96.2
#> Setting max barcode edit distance to 2
#> Setting max flanking sequence edit distance to 8
#> Setting read IDs to be replaced
#> Setting number of threads to 8
#> Search pattern:
#> primer: CTACACGACGCTCTTCCGATCT
#> BC: NNNNNNNNNNNNNNNN
#> UMI: NNNNNNNNNNNN
#> polyT: TTTTTTTTT
#> Setting known barcodes from /tmp/RtmpdufN84/file7a181a6a699/bc_allow.tsv
#> Number of known barcodes: 143
#> Processing file: /__w/_temp/Library/FLAMES/extdata/fastq/musc_rps24.fastq.gz
#> Searching for barcodes...
#> Number of reads processed: 393
#> Number of reads where at least one barcode was found: 368
#> Number of reads with exactly one barcode match: 364
#> Number of chimera reads: 1
#> All done!
#> Reads Barcodes
#> 10 2
#> 9 2
#> 8 5
#> 7 4
#> 6 3
#> 5 7
#> 4 14
#> 3 14
#> 2 29
#> 1 57
#> ── Running step: genome_alignment @ Tue Aug 19 07:47:35 2025 ───────────────────
#> Creating junction bed file from GFF3 annotation.
#> Aligning sample /tmp/RtmpdufN84/file7a181a6a699/matched_reads.fastq.gz -> /tmp/RtmpdufN84/file7a181a6a699/align2genome.bam
#> Warning: samtools not found, using Rsamtools instead, this could be slower and might fail for large BAM files.
#> Sorting BAM files by genome coordinates with 8 threads...
#> Indexing bam files
#> ── Running step: gene_quantification @ Tue Aug 19 07:47:35 2025 ────────────────
#> 07:47:35 AM Tue Aug 19 2025 quantify genes
#> Using BAM(s): /tmp/RtmpdufN84/file7a181a6a699/align2genome.bam
#> ── Running step: isoform_identification @ Tue Aug 19 07:47:35 2025 ─────────────
#> ── Running step: read_realignment @ Tue Aug 19 07:47:36 2025 ───────────────────
#> Checking for fastq file(s) /__w/_temp/Library/FLAMES/extdata/fastq/musc_rps24.fastq.gz
#> files found
#> Checking for fastq file(s) /tmp/RtmpdufN84/file7a181a6a699/matched_reads.fastq.gz
#> files found
#> Checking for fastq file(s) /tmp/RtmpdufN84/file7a181a6a699/matched_reads_dedup.fastq.gz
#> files found
#> Realigning sample /tmp/RtmpdufN84/file7a181a6a699/matched_reads_dedup.fastq.gz -> /tmp/RtmpdufN84/file7a181a6a699/realign2transcript.bam
#> Warning: samtools not found, using Rsamtools instead, this could be slower and might fail for large BAM files.
#> Sorting BAM files by 8 with CB threads...
#> ── Running step: transcript_quantification @ Tue Aug 19 07:47:36 2025 ──────────
#> Pipeline saved to /tmp/RtmpdufN84/file7a181a6a699/pipeline.rds