Lesson Flow

Learn

Goals and Concepts

Start with the capability target and concept set for this module.

Practice

Studio Activity

Apply the ideas in a guided activity tied to realistic outputs.

Check

Assessment Rubric

Use the rubric to verify competency and identify improvement targets.

Interactive Lab

Practice in short loops: checkpoint quiz, microtask decision, and competency progress tracking.

Checkpoint Quiz

Q1. Which output most clearly demonstrates module competency?

Competency is shown through measurable, method-linked evidence.

Q2. What should always accompany a technical claim in this curriculum?

Every claim should include boundaries and uncertainty.

Q3. What is the best next step after identifying a gap in understanding?

Progress improves when gaps become explicit practice targets.

Microtask Decision

Choose the action that best improves scientific reliability.

Progress Tracker

State is saved locally in your browser for this module.

0% complete

Annotation Challenge

Click the hotspot with the strongest evidence for the requested feature.

Connectomics training scene

Selected hotspot: none

Capability target

Build one connectome graph representation and justify two metric choices for a defined hypothesis.

Concept set

1) From EM to graph: a lossy transformation

A connectome graph is an abstraction. The path from EM images to a graph involves: raw images → segmentation → synapse detection → neuron-to-neuron edge list → graph. Each step loses information: the graph retains connectivity topology but discards spatial relationships, organelle details, and membrane geometry. This is a feature (graphs are computationally efficient and analytically tractable) and a limitation (the graph cannot answer questions that require spatial context).

Key decision: What are your nodes? What are your edges? Neurons as nodes and synapses as directed edges is the default, but alternatives exist — compartment-level nodes (axon vs dendrite of the same neuron), type-level nodes (aggregating neurons by class), and different edge weightings (binary, synapse count, cleft area).

2) The threshold problem

In real connectomes, many neuron pairs share only 1-2 synapses. Are these “real” connections or detection noise? The choice of minimum synapse threshold for defining an edge dramatically changes the graph:

There is no universally correct threshold. Report results across multiple thresholds (sensitivity analysis) and justify your choice based on the analysis question.

3) Graph metrics and what they mean biologically

4) Null models and the interpretation trap

A graph metric is meaningless without a baseline. “Clustering coefficient = 0.15” tells you nothing until you compare to a random graph with the same degree distribution (where clustering might be 0.02 — making 0.15 highly significant). Every metric computation should include the corresponding null-model comparison.

Core workflow

  1. Define node/edge schema: what are your nodes, what are your edges, what weighting scheme?
  2. Construct graph from synapse table (e.g., using CAVEclient + NetworkX). Inspect: number of nodes, edges, density, connected components.
  3. Compute candidate metrics: degree distribution, clustering, path length, reciprocity, modularity.
  4. Compare each metric to null-model expectation (degree-preserving random graph as minimum).
  5. Interpret metrics against hypothesis. Report which metrics are significant and which are not.
  6. Document abstraction limits: what information was lost in the graph construction?

60-minute tutorial run-of-show

Pre-class preparation (10 min async)

Minute-by-minute plan

  1. **00:00-08:00 Graph abstraction choices**
    • Show the same circuit as: (a) 3D EM rendering, (b) adjacency matrix, (c) node-link diagram. “These are three views of the same biology. Today we work with (b) and (c).”
    • Discussion: “What did we gain and lose in each transformation?”
  2. **08:00-20:00 Graph build demo**
    • Live coding: load a synapse table, construct a NetworkX DiGraph, apply threshold, print basic stats.
    • Visualize the graph with spring layout. Color nodes by cell type.
    • “Notice: the spatial layout in this diagram is arbitrary. The graph doesn’t know where neurons are in the brain.”
  3. **20:00-34:00 Metric computation**
    • Hands-on: learners compute degree distribution, clustering coefficient, and average path length.
    • Plot degree distribution (log-log). Is it heavy-tailed?
    • Compute clustering and compare to a random graph (NetworkX: nx.watts_strogatz_graph for comparison).
  4. **34:00-46:00 Interpretation and null concerns**
    • “Your clustering coefficient is 3× higher than the random graph. What does that mean biologically?”
    • Discuss: spatial proximity as a confound. Would a spatially constrained null model change the conclusion?
    • Walk through one example: reciprocal connections. Count in real data vs degree-preserving null.
  5. **46:00-60:00 Competency check**
    • Each learner writes a 1-paragraph graph analysis summary:
      • Schema (nodes, edges, threshold)
      • Two metrics with values and null-model comparisons
      • One biological interpretation and one limitation
    • Exit ticket: “Name one reason a graph metric might be misleading in your dataset.”

Studio activity: graph analysis report (60-75 minutes)

Scenario: You have the connectivity graph of 500 neurons in a cortical column from the MICrONS dataset. Your PI asks: “Is this circuit small-world? Are there hub neurons? Are there communities?”

Task sequence:

  1. Load the graph and compute basic statistics (nodes, edges, density, components).
  2. Compute: degree distribution, clustering coefficient, average path length.
  3. Compare to degree-preserving random graph and Watts-Strogatz small-world reference.
  4. Identify candidate hub neurons (top 5% by degree or betweenness centrality).
  5. Run community detection (Louvain or Leiden). Do detected communities align with cell types?
  6. Write a 1-page graph analysis report with figures, metrics, null comparisons, and biological interpretation.

Expected outputs:

Assessment rubric

Content library references

Teaching resources

References

Quick practice prompt

State one reason a graph metric might be misleading in your current dataset.

Teaching Materials

Activity Worksheet

Learner worksheet aligned to the studio activity and rubric.

Open worksheet

Slide Source

Marp source file for editing and rendering.

course/decks/marp/modules/module10.marp.md

Related Content