Lesson Flow

Learn

Goals and Concepts

Start with the capability target and concept set for this module.

Practice

Studio Activity

Apply the ideas in a guided activity tied to realistic outputs.

Check

Assessment Rubric

Use the rubric to verify competency and identify improvement targets.

Interactive Lab

Practice in short loops: checkpoint quiz, microtask decision, and competency progress tracking.

Checkpoint Quiz

Q1. Which output most clearly demonstrates module competency?

Competency is shown through measurable, method-linked evidence.

Q2. What should always accompany a technical claim in this curriculum?

Every claim should include boundaries and uncertainty.

Q3. What is the best next step after identifying a gap in understanding?

Progress improves when gaps become explicit practice targets.

Microtask Decision

Choose the action that best improves scientific reliability.

Progress Tracker

State is saved locally in your browser for this module.

0% complete

Annotation Challenge

Click the hotspot with the strongest evidence for the requested feature.

Connectomics training scene

Selected hotspot: none

Capability target

Produce a reproducible preprocessing release that transforms raw or intermediate connectomics outputs into analysis-ready data, with explicit quality gates and full provenance.

Why this module matters

Most downstream failures in connectome analysis are not model failures first; they are data-quality and preprocessing failures. This module teaches how to clean data without erasing signal, and how to document each transformation so conclusions remain defensible.

Concept set

1) Cleaning vs distortion

2) Provenance as a scientific requirement

3) QC metrics must be decision-linked

Core workflow: preprocessing for connectomics

  1. Ingest and integrity validation
    • Confirm file completeness, schema conformance, and version compatibility.
    • Log dataset identifiers and checksums.
  2. Artifact and anomaly screening
    • Identify missing values, label conflicts, geometric outliers, and suspicious connectivity spikes.
    • Triage issues by likely biological impact.
  3. Cleaning transforms
    • Apply deterministic corrections (schema normalization, unit harmonization, explicit missing-value policy).
    • Isolate heuristic transforms for extra review.
  4. QC and drift checks
    • Compare pre/post distributions and topology statistics.
    • Verify no unacceptable biological-signal loss.
  5. Release packaging
    • Publish analysis-ready tables/volumes plus transform log, metric report, and known limitations.

Studio activity: preprocessing release simulation

Scenario: Your team receives a mixed-quality connectomics export with missing labels, duplicated IDs, and inconsistent units.

Tasks

  1. Define cleaning policy for each issue category.
  2. Implement a preprocessing pipeline (pseudocode or notebook-level steps).
  3. Run pre/post QC metrics and justify any tradeoffs.
  4. Produce a release note that includes lineage metadata and known residual risks.

Expected outputs

Assessment rubric

Teaching resources

60-minute tutorial run-of-show

Materials

Timing and flow

  1. **00:00-08:00 Setup and target**
    • Define release objective and non-negotiable quality gates.
  2. **08:00-18:00 Instructor modeling**
    • Live demonstration of ingest checks and anomaly triage logic.
  3. **18:00-32:00 Team preprocessing design**
    • Teams draft cleaning rules and escalation criteria.
  4. **32:00-44:00 QC pass**
    • Teams compute/estimate pre-post metrics and decide release/no-release.
  5. **44:00-54:00 Cross-team review**
    • Teams audit each other’s transform logs for reproducibility gaps.
  6. **54:00-60:00 Competency checkpoint**
    • Submit one release note with provenance, thresholds, and residual risk.

Success criteria for this session

Evidence anchors from connectomics practice

Key papers to use in this module

Key datasets to practice on

Competency checks

Quick practice prompt

Take one connectomics table (real or mock) and write:

  1. Three cleaning rules with rationale.
  2. Two QC thresholds and associated actions.
  3. One limitation that remains after preprocessing.

Teaching Materials

Activity Worksheet

Learner worksheet aligned to the studio activity and rubric.

Open worksheet

Slide Source

Marp source file for editing and rendering.

course/decks/marp/modules/module18.marp.md

Related Content