Migration Guide

This page consolidates upgrade notes for moving between exo2micro versions. Start with your current version and walk forward.

From 2.3 to 2.4

Pipeline behavior is unchanged at defaults. Most existing scripts will run identically after the upgrade. There are two new behaviors that can change what happens, both governed by an opt-out flag:

Pre-flight resource checks

Both run_batch() and SampleDye.run() now estimate peak RAM and total disk output from raw TIFF headers before any task runs, and raise MemoryError or OSError if the estimate exceeds available headroom.

For most existing scripts on machines that previously ran successfully, this is silent: the check passes and the run proceeds. The change affects two categories of script:

Scripts that always ran on machines with comfortable RAM headroom — no change visible.
Scripts that occasionally tipped a low-RAM machine into swap-thrashing or OOM-kill — those scripts now raise MemoryError immediately rather than failing later with a dead kernel and no traceback. The error message includes a remediation list with the current values of n_workers, pad, etc. so you can see what to change.

To restore the pre-2.4 behavior of “just try it”:

results = e2m.run_batch(..., force_run=True)
# or
run.run(force_run=True)

This downgrades the hard fail to a warning. Not recommended for routine use; a run that actually OOM-kills mid-batch can leave a corrupt checkpoint file that the next run can’t read. See Memory and Performance for full details.

The pre/post histogram plot

plot_pre_post_histograms() (stage 4 output pre_post_histograms.png) has been rewritten to give an apples-to-apples comparison between the discrete-valued post image and a now-also-discrete raw pre image. The previous version overlaid the post against the bilinearly-interpolated warped pre, which produced misleading shape differences.

Three changes in the saved PNG:

The pre-stain foreground histogram is now the raw padded pre (discrete 8-bit) rather than the warped pre (continuous float).
The warped pre is still shown, but as a faint grey background reference.
The y-axis switches to log automatically when the zero bin dominates (more than 5× the next-tallest bin). Linear y remains the default.

If you’ve been calling plot_pre_post_histograms() directly from your own scripts with only post_im and pre_im arguments, the function falls back to the previous two-curve layout for backward compatibility. To get the new three-curve layout, pass raw_pre_im= as well (this is what stage 4 does internally now).

Recommended new features

If your collaborator’s kernel dies partway through batches, try parallel='subprocess' mode in run_batch(). Each task gets a fresh Python process; the OS reclaims memory between tasks reliably. See Memory and Performance.
For unattended overnight batches, set timeout_per_task=1800 (or similar) so a wedged task doesn’t block the rest of the run.
For diagnosing intermittent memory problems, pass memory_debug=True to run_batch(). Prints RSS snapshots between tasks so you can tell whether you have a leak (RSS climbs) or just an oversize task (RSS spikes and recovers). Requires the optional psutil dependency.

From 2.2 to 2.3

Pipeline behaviour is unchanged at defaults — if you were running with all default parameters, nothing about the output files changes except that the final difference image is now correctly named 04_difference_difference.tiff instead of the mislabelled 05_difference_difference.tiff. The stage-4 diagnostic plots are unchanged.

Breaking changes

1. Removed parameters. These eight parameters no longer exist in PARAMETER_REGISTRY and will raise ValueError if passed to SampleDye.set_params():

signal_threshold
dilation_iters
signal_percentile
robust_percentile
noise_floor_percentile
boundary_erosion
n_hist_bins
zoom_box

The first two belonged to the tissue-masking stage that was deleted in v2.2; the rest belonged to the LS/robust-percentile scaling code that was superseded by the Moffat fit in v2.2. See Scale Estimation Methods for why the old methods were dropped.

If you have scripts like this:

run.set_params(robust_percentile=90, boundary_erosion=30)

…rewrite them:

# v2.3 equivalent: the old "robust_percentile=90" produced a
# scale estimate at the 90th percentile of the log-ratio
# distribution. In v2.3 that's scale_percentile=90.0.
#
# Note however that the default workflow has changed — the
# Moffat fit now runs unconditionally, and scale_percentile
# produces an *additional* difference image rather than
# replacing the main one. So you can drop the parameter
# entirely if you just want the Moffat result.
run.set_params(scale_percentile=90.0)

The dead scaling parameters (boundary_erosion and friends) are simply dropped — there’s nothing for them to do in v2.3.

2. ``scaling.py`` is gone. If you have imports like:

from exo2micro.scaling import optimize_subtraction

…rewrite as:

from exo2micro.legacy import optimize_subtraction

The function itself is unchanged. It’s no longer called by any stage of the v2.3 pipeline.

3. ``SampleDye.run`` return dict has new keys. In v2.2:

{'sample': ..., 'dye': ..., 'scale_estimate': ..., 'status': ...}

In v2.3:

{'sample': ..., 'dye': ..., 'scale_estimate': ...,
 'scale_percentile_value': ..., 'manual_scale': ...,
 'status': ...}

The new keys are None when the corresponding method is not active, so existing code that only reads scale_estimate continues to work unchanged.

4. :func:`plot_excess_heatmap` gained a ``scales=`` kwarg. The old single-line scale= kwarg still works and is deprecated in behaviour but not removed. For overplotting multiple scale lines (one per active method), use:

plot_excess_heatmap(
    post, pre,
    scales=[
        ('Moffat fit',  1.234, '#00cc88'),
        ('ratio p50',   1.198, '#ff9933'),
        ('manual',      1.250, '#ff3366'),
    ],
    save_path='excess.png')

Recommended new features

For interactive inspection of difference images, use the new Zoom & Inspect panel in the GUI, or plot_zoom() / plot_zoom_multi() from Python.
For diagnosing alignment quality, use the Blink Comparison panel in the GUI.
For manual scale overrides, use the new manual_scale parameter.

From 2.1 to 2.2

Important

Upgrading from 2.1 (or earlier) requires you to re-run the pipeline from stage 1. The image loading changed in a way that affects the actual pixel values of stage-1 checkpoints: v2.2 extracts the fluorescence-bearing RGB channel directly rather than converting RGB to grayscale with luminance weights. The old v2.1 stage-1 files have ~41% less dynamic range than the new ones, and all downstream diagnostics depend on precise pixel values.

To re-run:

run = e2m.SampleDye('CD070', 'SybrGld_microbe')
run.run(from_stage=1, force=True)

Pipeline changes

Pipeline shrank from 5 stages to 4. Stage 4 (masking) was removed; the old stage 5 (scaling + plots) was replaced by a new stage 4 (diagnostics + subtraction using a Moffat-fit scale estimate).
Code that called run(from_stage=5) should now use run(from_stage=4). The default to_stage is now 4.

Plotting module changes

Ten legacy plotting functions moved from exo2micro.plotting to exo2micro.legacy. If you import them directly, update:

# OLD
from exo2micro.plotting import plot_signal_scatter, plot_im_sub

# NEW
from exo2micro.legacy import plot_signal_scatter, plot_im_sub

The functions still work, but some of the full-resolution contour-overlay ones crashed on images larger than ~25000×25000 pixels due to matplotlib’s cell block limit. The new stage 4 avoids these paths entirely.

Scale estimation

In v2.1 and earlier, scale estimation used either least-squares or a robust-percentile method on the log-ratio distribution. v2.2 replaced both with a Moffat fit on the left wing of the same distribution, which fits empirical microscopy noise better. The old methods are preserved in exo2micro.legacy if you need to reproduce earlier results.

From 2.0 to 2.1

New stage 3: interior SIFT alignment. Runs automatically after stage 2 unless interior_ecc=False is set. No action required for most users — the pipeline just gets a better alignment for free.
Stage 2 is no longer just “coarse alignment” — it now includes both boundary correlation and ICP refinement in a single stage. Filenames changed from 02_coarse_aligned_pre to 02_icp_aligned_pre.
The old stage 3 03_fine_aligned_pre checkpoint is now 03_interior_aligned_pre.
Old-style checkpoint filenames won’t be found by the new pipeline. Re-run from stage 1 or rename files manually.

From 1.x to 2.0

v2.0 was a full package refactor. The monolithic exo2micro.py was split into eight modules, a class-based SampleDye API was introduced, and the pipeline became checkpoint-driven. For 1.x users, the migration path is:

Replace any calls to the module-level functions with the class-based API:

run = e2m.SampleDye('CD070', 'SybrGld_microbe')
run.set_params(...)
run.run()

Move any output directory configuration into the SampleDye constructor (output_dir=, raw_dir=).
Let the new parameter-versioning mechanism handle filenames. Remove any manual filename construction in your scripts.

Beyond that, the underlying algorithms in v2.0 were largely unchanged from 1.x. If you have specific 1.x scripts that don’t translate cleanly, the v2.0 changelog in the repository has a detailed function-by-function migration list.