Proofreading Strategies for Connectome Reconstruction

Instructor Notes

This document is a standalone instructor script. It provides the full narrative, real references, worked examples, and practical decision frameworks. Adapt depth and pacing to your audience; the material is intentionally detailed so that nothing needs to be improvised.


1. Overview: Why Strategy Matters

Not all proofreading is created equal. A naive approach – start at one corner of the volume and fix every error you encounter – is almost never the right choice. The strategy you select determines:

The five major strategies described below are not mutually exclusive. Most real projects use a hybrid approach, and the worked example at the end of this document illustrates how to combine them.


2. Exhaustive Local Proofreading

2.1 Definition

Correct every detectable error within a defined subvolume. The goal is a “saturated” reconstruction: every neurite in the volume is correctly segmented, every synapse is correctly assigned, and every cell boundary is accurate.

2.2 When to Use

2.3 How It Works

  1. Define a bounding box (typically 10-50 um on a side).
  2. Enumerate all segments that have any part inside the box.
  3. For each segment, inspect in 2D and 3D. Fix all merge errors (split), split errors (merge), and boundary errors (manual painting).
  4. Cross-check every synapse annotation within the box for correct pre/post assignment.
  5. Perform inter-annotator agreement on a subset (typically 10-20 %) to estimate residual error rate.

2.4 Cost

Kasthuri et al. (2015) described saturated reconstruction of a 1,500 cubic micrometer volume from mouse neocortex. The effort required thousands of person-hours for what amounts to a tiny fraction of even one cortical column. Extrapolating, exhaustive proofreading of a full MICrONS-scale volume (roughly 1 mm^3) would require an estimated 50,000-500,000 person-hours. This makes exhaustive proofreading impractical for large volumes.

2.5 Instructor Tip

Ask students: “If exhaustive proofreading is so expensive, why do it at all?” The answer: without at least some exhaustive ground truth, you cannot measure the quality of any other strategy. Exhaustive proofreading of small regions is the yardstick against which everything else is calibrated.


3. Targeted / Skeleton-Guided Proofreading

3.1 Definition

Select specific neurons of interest and proofread only those neurons, following each branch from soma to tip and fixing errors along the way. The rest of the volume remains uncorrected.

3.2 When to Use

3.3 How It Works

  1. Start at the soma. Identify the cell body of your target neuron.
  2. Trace the primary arbor. Follow the main dendrite or axon in 3D, scrolling through 2D slices to verify continuity.
  3. At each branch point, push one branch onto a stack and continue along the other. Process branches depth-first or breadth-first depending on preference.
  4. When you encounter a merge error: The segment suddenly includes processes that do not belong to your neuron. Split the merge. Continue tracing your neuron on the correct fragment.
  5. When you encounter a split error: Your neuron’s segment ends abruptly. Search for a continuation fragment nearby (same caliber, same trajectory, within a few sections). Merge the fragments. Continue.
  6. When you encounter a dead end: Determine whether it is a true biological termination (terminal bouton with vesicles, or a dendrite tip) or a split error. If uncertain, flag for review.
  7. Record all edits in the annotation system for auditability.

3.4 Handling Ambiguous Continuations

The hardest judgment call in targeted proofreading is: “Is this orphan fragment the continuation of my neuron, or a different neuron?” Criteria to evaluate:

If two or more fragments are plausible candidates, flag the ambiguity rather than guessing. A wrong merge is worse than a documented gap.

3.5 Cost

Targeted proofreading of a single neuron in dense neuropil typically takes 30 minutes to several hours, depending on arbor complexity and error density. A pyramidal cell with an extensive axonal arbor spanning hundreds of micrometers may require 4-8 hours. This is far cheaper than exhaustive proofreading of the same volume, but scales linearly with the number of neurons of interest.


4. Priority-Ranked Proofreading

4.1 Definition

Use automated error detection to generate a ranked list of candidate errors, sorted by estimated downstream impact. Fix the highest-ranked candidates first, working down the list until a quality target or time budget is reached.

4.2 When to Use

4.3 Automated Error Detection

Machine learning models can be trained to flag likely errors using features such as:

Lu et al. (2019) trained a classifier on these features and achieved 80-90 % precision at 60-70 % recall for merge error detection. Zung et al. (2017) proposed a metric learning approach that embeds supervoxels such that errors can be detected by distance in embedding space.

4.4 Ranking Heuristics

Once candidate errors are detected, rank them by impact:

  1. Large neurons first. Errors on neurons with extensive arbors affect more synapses and more downstream analyses.
  2. High-connectivity errors. A merge that adds 50 false synapses is worse than one that adds 2.
  3. Region of interest. If the scientific question focuses on layer 4, errors in layer 4 rank higher.
  4. Error type. Merge errors generally rank above split errors because they corrupt connectivity more severely.

4.5 Instructor Tip

Draw an analogy to triage in emergency medicine: you cannot treat every patient simultaneously, so you prioritize by severity and treatability. Similarly, you cannot fix every segmentation error, so you prioritize by impact and confidence.


5. Crowd-Sourced Proofreading

5.1 The FlyWire Model

Dorkenwald et al. (2024) described the FlyWire project, which produced a whole-brain connectome of Drosophila melanogaster using 287 proofreaders distributed globally. Key features:

5.2 The EyeWire Model

The Seung lab’s EyeWire project (Kim et al., 2014) demonstrated that citizen scientists with no neuroscience background could contribute meaningful proofreading for retinal connectomics. Key innovations:

5.3 When to Use

5.4 Challenges


6. Hybrid Strategies

Most real-world projects combine multiple strategies. A common recipe:

  1. Priority-ranked pass (weeks 1-4). Run automated error detection on the full volume. Fix the top 1,000 highest-impact candidate errors. This raises the overall quality floor quickly.
  2. Targeted pass (weeks 5-12). For each neuron in the scientific study, perform skeleton-guided proofreading. This ensures the specific neurons of interest are correct.
  3. Exhaustive pass on focal region (weeks 13-16). In the core region of the study (e.g., a 50 x 50 x 50 um cube), proofread exhaustively to produce a gold-standard reference.
  4. Metrics and iteration (ongoing). Compute quality metrics (VI, ERL) on the exhaustive region. If metrics are below target, iterate.

6.1 The FlyWire Hybrid

FlyWire used exactly this pattern: automated segmentation was followed by priority-ranked automated error detection, then targeted neuron-by-neuron proofreading by the crowd, with expert review of critical neurons and exhaustive proofreading of small benchmark regions.


7. When to Stop Proofreading

7.1 Diminishing Returns

Proofreading follows a classic diminishing-returns curve. The first few hours of proofreading fix high-impact errors and dramatically improve metrics. As obvious errors are corrected, remaining errors become harder to find and less impactful.

7.2 Quality Targets

Define stopping criteria before starting:

7.3 Cost-Benefit Analysis

At some point, the cost of finding and fixing the next error exceeds the benefit of correcting it. This threshold depends on the scientific question. A study of rare cell types tolerates lower overall quality if those specific cells are correct. A study of network-wide statistics needs higher overall quality but can tolerate errors on individual neurons.

7.4 Instructor Tip

Ask students: “You have fixed 95 % of detected errors and your ERL is 120 um. Your collaborator wants you to keep going. How do you decide?” This opens a discussion about opportunity cost, statistical power, and the difference between “perfect data” and “data good enough to answer the question.”


8. Worked Example: Designing a Proofreading Campaign

8.1 Scenario

You have a freshly segmented 200 x 200 x 100 um volume of mouse visual cortex (V1, layers 2/3-4). Your scientific goal is to characterize the connectivity of 100 inhibitory interneurons of four subtypes (PV, SST, VIP, Lamp5). You have a team of 5 proofreaders and 3 months.

8.2 Campaign Design

Phase 1: Automated triage (week 1)

Phase 2: Priority-ranked global cleanup (weeks 2-4)

Phase 3: Targeted neuron proofreading (weeks 5-10)

Phase 4: Connectivity verification (weeks 11-12)

Phase 5: Exhaustive benchmark (week 13)

8.3 Milestones and Decision Points


9. References


End of instructor script: Proofreading Strategies for Connectome Reconstruction