Pre-processing Pipeline

The TI-Toolbox pre-processing pipeline prepares anatomical MRI data for TI simulations by converting DICOM files to BIDS-compliant NIfTI format, performing FreeSurfer cortical reconstruction, and creating SimNIBS head models. This comprehensive pipeline ensures that all subsequent steps have access to high-quality, standardized neuroimaging data.

Overview

The pre-processing pipeline consists of three main stages:

DICOM to NIfTI Conversion - Convert raw DICOM files to BIDS-compliant NIfTI format
FreeSurfer recon-all - Cortical reconstruction and segmentation
SimNIBS charm - Head model creation for electromagnetic simulations

Required Input Data Structure

BIDS Format Requirements

The toolbox expects data to be organized following the BIDS (Brain Imaging Data Structure) standard:

project_root/
├── sourcedata/
│   └── sub-{subject_id}/
│       ├── T1w/
│       │   ├── dicom/          # Raw T1w DICOM files
│       │   └── *.tgz           # Compressed DICOM archives (optional)
│       └── T2w/
│           ├── dicom/          # Raw T2w DICOM files
│           └── *.tgz           # Compressed DICOM archives (optional)
└── sub-{subject_id}/
    └── anat/                   # Converted NIfTI files (created by pipeline)
        ├── anat-T1w_acq-MPRAGE.nii.gz
        ├── anat-T1w_acq-MPRAGE.json
        ├── anat-T2w_acq-CUBE.nii.gz
        └── anat-T2w_acq-CUBE.json

Data Requirements

Requirement	Description	Status
T1-weighted MRI	High-resolution anatomical image (typically MPRAGE)	Required
T2-weighted MRI	High-resolution anatomical image (typically CUBE/SPACE)	Recommended
Image Resolution	Minimum 1mm isotropic voxels	Required
Subject ID	Numeric identifier (e.g., 101, 102)	Required

Supported Input Formats

DICOM files (.dcm, .dicom)
Compressed DICOM archives (.tgz)
NIfTI files (.nii, .nii.gz) - if already converted

Processing Stages

Stage 1: DICOM to NIfTI Conversion

Script: dicom2nifti.sh
Purpose: Convert raw DICOM files to BIDS-compliant NIfTI format

Features

Automatic T1w/T2w Detection: Identifies scan types based on DICOM series descriptions
Compressed Archive Support: Handles .tgz compressed DICOM archives
BIDS Compliance: Generates proper BIDS naming conventions
Metadata Preservation: Maintains scan parameters in JSON sidecars

Process Flow

graph LR
    A[Raw DICOM Files] --> B[Extract Archives]
    B --> C[dcm2niix Conversion]
    C --> D[Rename Based on SeriesDescription]
    D --> E[BIDS-compliant NIfTI + JSON]

Usage

# Convert DICOM files for a single subject
./dicom2nifti.sh /path/to/sub-101

# Quiet mode (no console output)
./dicom2nifti.sh /path/to/sub-101 --quiet

Generated Output Structure

sub-101/
└── anat/
    ├── anat-T1w_acq-MPRAGE.nii.gz    # T1-weighted image
    ├── anat-T1w_acq-MPRAGE.json      # T1 metadata
    ├── anat-T2w_acq-CUBE.nii.gz      # T2-weighted image
    └── anat-T2w_acq-CUBE.json        # T2 metadata

Stage 2: FreeSurfer recon-all

Script: recon-all.sh
Purpose: Cortical reconstruction, segmentation, and surface generation

Features

T1 + T2 Processing: Utilizes both T1 and T2 images when available for improved pial surface reconstruction
Parallel Processing: Configurable for single-threaded or multi-threaded execution
Resilient Execution: Continues processing other subjects even if some fail

Process Flow

graph TD
    A[Basic Input Validation] --> B[T1/T2 Detection]
    B --> C[FreeSurfer Environment Check]
    C --> D[Motion Correction]
    D --> E[Intensity Normalization]
    E --> F[Skull Stripping]
    F --> G[White Matter Segmentation]
    G --> H[Surface Generation]
    H --> I[T2pial Refinement]
    I --> J[Basic Completion Check]

Usage

# Single subject processing (1 core)
./recon-all.sh /path/to/sub-101

# With parallel processing (all available cores for this subject)
./recon-all.sh /path/to/sub-101 --parallel

# Quiet mode
./recon-all.sh /path/to/sub-101 --quiet

Note: The --parallel flag in recon-all.sh enables FreeSurfer’s internal parallelization (multiple cores for one subject). This is different from the --parallel flag in structural.sh which enables processing multiple subjects simultaneously.

Generated Output Structure

derivatives/
└── freesurfer/
    └── sub-101/
        ├── mri/           # Volumetric data
        ├── surf/          # Surface meshes
        ├── label/         # Anatomical labels
        └── scripts/

Stage 3: SimNIBS charm (Head Model Creation)

Script: charm.sh
Purpose: Create head models for TI simulation

Features

Input: Supports T1-only or T1+T2 processing
Sequential Processing: Runs one subject at a time

Process Flow

graph TD
    A[T1/T2 Detection] --> B[SimNIBS Environment Check]
    B --> C[Memory Safeguards Setup]
    C --> D[Tissue Segmentation]
    D --> E[Mesh Generation]
    E --> F[Electrode Positioning]
    F --> G[Quality Control]
    G --> H[Head Model Output]

Usage

# Create head model for single subject
./charm.sh /path/to/sub-101

# Quiet mode
./charm.sh /path/to/sub-101 --quiet

Generated Output Structure

derivatives/
└── SimNIBS/
    └── sub-101/
        └── m2m_101/

Orchestration Script

structural.sh - Pipeline Orchestrator

Purpose: Coordinates all pre-processing stages with flexible execution options

Command Line Interface

# Sequential mode (default) - one subject at a time, all cores per subject
./structural.sh /path/to/sub-101 /path/to/sub-102 recon-all --convert-dicom --create-m2m

# Parallel mode - multiple subjects simultaneously, 1 core per subject
./structural.sh /path/to/sub-101 /path/to/sub-102 recon-all --parallel --convert-dicom --create-m2m

# Recon-all only
./structural.sh /path/to/sub-101 recon-all --recon-only

# Subject ID format
./structural.sh --subjects 101,102,103 recon-all --parallel

Processing Options

Option	Description	Usage
`recon-all`	Run FreeSurfer reconstruction	Always required
`--convert-dicom`	Include DICOM conversion stage	Optional
`--create-m2m`	Include SimNIBS head model creation	Optional
`--parallel`	Enable parallel processing mode (multiple subjects, 1 core each)	Optional
`--recon-only`	Skip all non-recon steps	Optional
`--quiet`	Suppress console output	Optional

Processing Mode Selection

Default (Sequential Mode):

# Best for: Small datasets (1-3 subjects), maximum per-subject speed
./structural.sh /path/sub-101 /path/sub-102 recon-all --convert-dicom

Parallel Mode:

# Best for: Large datasets (4+ subjects), maximum throughput
./structural.sh /path/sub-101 /path/sub-102 /path/sub-103 /path/sub-104 recon-all --parallel --convert-dicom

Parallelization Strategy

Two-Mode Processing Architecture

The pipeline implements a simple and efficient two-mode parallelization strategy:

Processing Modes

graph TD
    A[Processing Mode Selection] --> B{--parallel flag?}
    B -->|No| C[Sequential Mode]
    B -->|Yes| D[Parallel Mode]
    
    C --> E[One subject at a time<br/>All cores per subject<br/>Maximum speed per subject]
    D --> F[Multiple subjects simultaneously<br/>1 core per subject<br/>Maximum throughput]
    
    G[8 CPU cores example] --> H[Sequential: 1 subject × 8 cores]
    G --> I[Parallel: 8 subjects × 1 core each]

Mode Comparison

Mode	Command	Subjects Running	Cores per Subject	Best For
Sequential (Default)	`./structural.sh sub-101 sub-102 recon-all`	1 at a time	All available	Small datasets, fastest per-subject
Parallel	`./structural.sh sub-101 sub-102 recon-all --parallel`	Multiple	1 each	Large datasets, maximum throughput

Implementation Details

Sequential Mode:

# Each subject uses all available cores
export OMP_NUM_THREADS=$AVAILABLE_CORES
export ITK_GLOBAL_DEFAULT_NUMBER_OF_THREADS=$AVAILABLE_CORES

# Process subjects one by one
for subject in subjects; do
    recon-all.sh $subject --parallel  # FreeSurfer internal parallelization
done

Parallel Mode:

# Each subject uses single core
export OMP_NUM_THREADS=1
export ITK_GLOBAL_DEFAULT_NUMBER_OF_THREADS=1

# Process multiple subjects with GNU Parallel
parallel --jobs $AVAILABLE_CORES recon-all.sh {} ::: "${SUBJECTS[@]}"

Performance Characteristics

System Specs	Sequential Mode	Parallel Mode	Recommendation
4-8 cores, 1-3 subjects	~6-8 hours/subject	~6-8 hours/subject	Sequential - simpler
8+ cores, 4+ subjects	6-8 hours × N subjects	~6-8 hours total	Parallel - much faster
Limited memory (<16GB)	Recommended	May cause OOM	Sequential - safer
Abundant resources	Good	Optimal	Parallel - maximum efficiency

SimNIBS Processing

SimNIBS charm processing is always sequential regardless of mode:

One subject processed at a time to prevent PETSC memory conflicts
Full CPU cores available per subject
Memory safeguards to prevent segmentation faults

Complete Pipeline Execution

Processing Mode Examples

Sequential Mode (Default) - Maximum Speed per Subject

# Best for: 1-3 subjects, fastest individual processing
./structural.sh \
    /mnt/study_data/sub-101 \
    /mnt/study_data/sub-102 \
    recon-all \
    --convert-dicom \
    --create-m2m

# Estimated timing (8-core system):
# Subject 1: ~6-8 hours (all 8 cores)
# Subject 2: ~6-8 hours (all 8 cores)
# Total: ~12-16 hours

Parallel Mode - Maximum Throughput

# Best for: 4+ subjects, fastest total processing
./structural.sh \
    /mnt/study_data/sub-101 \
    /mnt/study_data/sub-102 \
    /mnt/study_data/sub-103 \
    /mnt/study_data/sub-104 \
    /mnt/study_data/sub-105 \
    /mnt/study_data/sub-106 \
    /mnt/study_data/sub-107 \
    /mnt/study_data/sub-108 \
    recon-all \
    --parallel \
    --convert-dicom \
    --create-m2m

# Estimated timing (8-core system):
# All 8 subjects: ~6-8 hours total (8 subjects × 1 core each)
# 66% faster than sequential for 8 subjects!

Stage-by-Stage Execution

# Stage 1: DICOM conversion only
./dicom2nifti.sh /mnt/study_data/sub-101

# Stage 2: FreeSurfer reconstruction only
# Sequential mode (all cores for this subject)
./recon-all.sh /mnt/study_data/sub-101 --parallel

# Parallel mode (1 core for this subject)
./recon-all.sh /mnt/study_data/sub-101

# Stage 3: SimNIBS head model only  
./charm.sh /mnt/study_data/sub-101

Quick Decision Guide

# How many subjects do you have?

# 1-3 subjects → Use Sequential Mode (default)
./structural.sh sub-101 sub-102 sub-103 recon-all --convert-dicom

# 4+ subjects → Use Parallel Mode
./structural.sh sub-101 sub-102 sub-103 sub-104 sub-105 recon-all --parallel --convert-dicom

# Limited memory/resources → Always use Sequential Mode
./structural.sh sub-101 sub-102 recon-all --convert-dicom

# Time-critical analysis → Use Parallel Mode for maximum speed
./structural.sh sub-{101..120} recon-all --parallel --convert-dicom

Output Directory Structure

Complete Processing Output

project_root/
├── sourcedata/                     # Original DICOM data
│   └── sub-101/
│       ├── T1w/dicom/
│       └── T2w/dicom/
├── sub-101/                        # BIDS data
│   └── anat/
│       ├── anat-T1w_acq-MPRAGE.nii.gz
│       └── anat-T2w_acq-CUBE.nii.gz
└── derivatives/                    # Processed outputs
    ├── freesurfer/                 # FreeSurfer outputs
    │   └── sub-101/
    │       ├── mri/
    │       ├── surf/
    │       └── scripts/
    ├── SimNIBS/                    # SimNIBS outputs
    │   └── sub-101/
    │       └── m2m_101/
    └── logs/                       # Processing logs
        └── sub-101/
            ├── dicom2nifti_20250625_120000.log
            ├── recon-all_20250625_130000.log
            └── charm_20250625_140000.log

Logging and Monitoring

Log File Organization

derivatives/logs/sub-{subject_id}/
├── dicom2nifti_{timestamp}.log     # DICOM conversion logs
├── recon-all_{timestamp}.log       # FreeSurfer processing logs
└── charm_{timestamp}.log           # SimNIBS processing logs

Log Content Examples

Successful Processing

[2025-06-25 13:45:23] [recon-all] [INFO] Starting FreeSurfer recon-all for subject: sub-101
[2025-06-25 13:45:24] [recon-all] [INFO] Found T1 image: /mnt/study/sub-101/anat/anat-T1w_acq-MPRAGE.nii.gz
[2025-06-25 13:45:24] [recon-all] [INFO] Found T2 image: /mnt/study/sub-101/anat/anat-T2w_acq-CUBE.nii.gz
[2025-06-25 13:45:24] [recon-all] [INFO] T2 image will be used for improved pial surface reconstruction
[2025-06-25 15:23:45] [recon-all] [INFO] Verification results: Essential files found: 9/9
[2025-06-25 15:23:45] [recon-all] [INFO] FreeSurfer completion verification PASSED

Error Detection

[2025-06-25 14:15:32] [recon-all] [ERROR] Command failed with critical system error: recon-all -subject sub-103...
[2025-06-25 14:15:32] [recon-all] [ERROR] System error details: Illegal instruction
[2025-06-25 14:15:32] [recon-all] [ERROR] FreeSurfer recon-all verification failed for subject: sub-103

Monitoring Progress

Monitor processing progress in real-time:

# Monitor all logs for a subject
tail -f /mnt/project/derivatives/logs/sub-101/*.log

# Monitor specific stage
tail -f /mnt/project/derivatives/logs/sub-101/recon-all_*.log

# Check processing status across subjects
ls -la /mnt/project/derivatives/freesurfer/*/mri/aseg.mgz

Troubleshooting

Common Issues

Issue	Symptoms	Solution
Missing T1 Image	“No T1 image found” error	Ensure DICOM conversion completed successfully
Illegal Instruction	FreeSurfer crashes early	✅ Fixed in latest version with improved resource management
Memory Issues	OOM errors, crashes	Use sequential mode, check Docker memory allocation
PETSC Segmentation Fault	SimNIBS charm crashes	Ensure sequential processing, check memory limits
Partial FreeSurfer Output	Some files missing	Check log files, results may still be usable
Missing T2 Image	Warning in logs	Processing continues with T1 only

Recent Improvements

Version 2024.12+:

✅ Resolved “Illegal instruction” errors through improved parallelization strategy
✅ Simplified processing modes - clear sequential vs parallel options
✅ Better resource management - proper thread and memory allocation
✅ Cleaner error handling - no false retry attempts
✅ Improved logging - clearer progress indication

Processing Behavior

The pipeline now implements a more flexible approach:

Partial Results: Keeps partial results instead of deleting them
Continued Processing: Continues with other subjects even if some fail
T2 Handling: Gracefully falls back to T1-only if T2 is missing/unreadable
Basic Validation: Simple existence checks replace strict file validation
Error Reporting: Provides warnings instead of errors for non-critical issues

System Requirements

Component	Minimum	Recommended
CPU Cores	4	8+
RAM	8 GB	16+ GB
Disk Space	10 GB per subject	20+ GB per subject
Docker Memory	6 GB	12+ GB

Performance Optimization

Parallel Processing: Use --parallel flag for multiple subjects
Memory Management: Ensure adequate Docker memory allocation
Disk I/O: Use fast storage (SSD) for improved performance
CPU Utilization: Match number of parallel jobs to available cores
Failure Handling: Pipeline continues even if some subjects fail

Integration with Analysis Pipeline

The pre-processing pipeline generates all necessary inputs for downstream TI analysis:

FreeSurfer surfaces → ex-search electrode optimization
SimNIBS head models → simulator electromagnetic field computation
BIDS anatomical data → analyzer ROI analysis and visualization

See the Ex-Search and Simulator documentation for details on using pre-processed data in TI analysis workflows.

Wiki Guides

Pre-processing Pipeline

Overview

Required Input Data Structure

BIDS Format Requirements

Data Requirements

Supported Input Formats

Processing Stages

Stage 1: DICOM to NIfTI Conversion

Features

Process Flow

Usage

Generated Output Structure

Stage 2: FreeSurfer recon-all

Features

Process Flow

Usage

Generated Output Structure

Stage 3: SimNIBS charm (Head Model Creation)

Features

Process Flow

Usage

Generated Output Structure

Orchestration Script

structural.sh - Pipeline Orchestrator

Command Line Interface

Processing Options

Processing Mode Selection

Parallelization Strategy

Two-Mode Processing Architecture

Processing Modes

Mode Comparison

Implementation Details

Performance Characteristics

SimNIBS Processing

Complete Pipeline Execution

Processing Mode Examples

Sequential Mode (Default) - Maximum Speed per Subject

Parallel Mode - Maximum Throughput

Stage-by-Stage Execution

Quick Decision Guide

Output Directory Structure

Complete Processing Output

Logging and Monitoring

Log File Organization

Log Content Examples

Successful Processing

Error Detection

Monitoring Progress

Troubleshooting

Common Issues

Recent Improvements

Processing Behavior

System Requirements

Performance Optimization

Integration with Analysis Pipeline