CryoVesNet: A dedicated framework for synaptic vesicle segmentation in cryo-electron tomograms

Khosrozadeh, Amin; Seeger, Raphaela; Witz, Guillaume; Radecke, Julika; Sørensen, Jakob B.; Zuber, Benoît

doi:10.1083/jcb.202402169

Cryo-electron tomography (cryo-ET) has the potential to reveal cell structure down to atomic resolution. Nevertheless, cellular cryo-ET data is highly complex, requiring image segmentation for visualization and quantification of subcellular structures. Due to noise and anisotropic resolution in cryo-ET data, automatic segmentation based on classical computer vision approaches usually does not perform satisfactorily. Communication between neurons relies on neurotransmitter-filled synaptic vesicle (SV) exocytosis. Cryo-ET study of the spatial organization of SVs and their interconnections allows a better understanding of the mechanisms of exocytosis regulation. Accurate SV segmentation is a prerequisite to obtaining a faithful connectivity representation. Hundreds of SVs are present in a synapse, and their manual segmentation is a bottleneck. We addressed this by designing a workflow consisting of a convolutional network followed by post-processing steps. Alongside, we provide an interactive tool for accurately segmenting spherical vesicles. Our pipeline can in principle segment spherical vesicles in any cell type as well as extracellular and in vitro spherical vesicles.

Introduction

The fine architecture of cells can be investigated by cryo-electron tomography (cryo-ET) (Bäuerlein and Baumeister, 2021). Cellular structures are preserved down to the atomic scale through vitrification and observation of the samples in a fully hydrated state. When a macromolecule is present in a sufficient number of copies in the cells imaged by cryo-ET, it is possible to obtain its atomic structure in situ using subtomogram averaging (Ni et al., 2022; Obr et al., 2022; Tegunov et al., 2021) Cellular cryo-ET datasets are usually extremely complex, making them difficult to analyze. This is aggravated by the sensitivity of biological samples to electron radiation, which limits the signal-to-noise ratio in cryo-ET datasets (Lucić et al., 2005). Tomographic reconstructions are generated from a series of images of the sample acquired at different viewing angles. The geometry of the samples prevents acquisition at certain angles, resulting in anisotropic spatial coverage. The resolution in the directions close to the axis of the electron beam incident on the untilted sample is strongly reduced. This effect, commonly referred to as the missing-wedge artifact, further complicates data analysis. In particular, organelles fully bounded by a membrane appear to have holes at their top and bottom (relative to the electron beam axis) (Lucić et al., 2005).

The synapse is a specialized cellular contact at which information is transmitted from one neuron to another, the presynaptic and postsynaptic synapses, respectively. In most cases, the signal is transmitted by the release of neurotransmitters into the intercellular space. Neurotransmitters are stored in synaptic vesicles (SVs) and are released following the fusion of an SV with the presynaptic plasma membrane. A synapse contains hundreds of SVs and their mobility and recruitability for neurotransmitter release depends on inter-vesicle interactions through so-called connector structures (Fernández-Busnadiego et al., 2010; Radecke et al., 2023; Zuber and Lucić, 2019, 2022). The characterization of these interactions can be performed automatically with the Pyto software, which implements a hierarchical connectivity approach to segment connectors (Lucić et al., 2016). For accurate connector segmentation, precise segmentation of SVs is a prerequisite. To date, SV segmentation has been performed manually, but given the large number of SVs per dataset, it is an extremely time-consuming process. Typically, one person spends three to eight working days segmenting a single dataset. Attempts to perform this task automatically based on classical computer vision algorithms have not yielded sufficiently accurate results (Martinez-Sanchez et al., 2014). Subsequent automation efforts, such as the 3D ART VeSElecT (Kaltdorf et al., 2017), enabled automatic segmentation of SVs in freeze-substituted resin-embedded samples. However, these methods still faced challenges in achieving the level of accuracy required for detailed analyses of SV pools and their interactions in fully hydrated, vitrified cryo-ET samples. To alleviate this situation, we decided to develop an approach based on deep learning.

Convolutional neural networks (CNN) have shown promise in segmenting cryo-ET data and have been integrated into EMAN2 package (Chen et al., 2017). However, while these approaches have been sufficient for some visualization purposes, they have not yet met the stringent requirements for segmenting tethers and connectors in detailed analyses such as those performed with Pyto. Later on, Imbrosci et al. described accurate SV segmentation of transmission electron microscopy images using CNN, but this approach is limited to 2-dimensional (2D) images of resin-embedded synapses (Imbrosci et al., 2022). In the former study, cryo-ET data are decomposed in individual 2D slices, which are handed as separate inputs to the CNN. The independent output 2D prediction images are then reassembled in a 3-dimensional (3D) stack (Chen et al., 2017; Held et al., 2024). As discussed above, membranes oriented approximately parallel to the plane of the 2D tomographic images are not resolved. In the absence of contextual knowledge of the other 2D images, the CNN fails to segment these regions of the vesicles. Hence, spherical vesicles appear open, whereas we expect closed spherical objects. Recently Zhou et al. addressed this issue by implementing a downstream fitting step based on a Gaussian process approach, allowing for the smooth closure of the membranes (Zhou et al., 2023). Ideally, 3D networks should be used to segment 3D cryo-ET data. In this context, several groups have published applications of 3D networks in cryo-ET for other tasks, such as particle picking and classification, to perform subtomogram averaging (de Teresa-Trueba et al., 2023; Lamm et al., 2022; Moebel et al., 2021). However, these papers have not focused on accurately segmenting membranes in cryo-ET data.

We opted to employ a 3D U-Net CNN to process 3D images as input (Çiçek et al., 2016). Weigert et al. (2018) implemented a U-Net for content-aware restoration (CARE) of 3D fluorescence microscopy datasets. They showed that it can restore information from anisotropic and very noisy datasets. Such networks have been used in the last couple of years in cryo-ET analysis, mainly to perform denoising and object detection (Buchholz et al., 2019; de Teresa-Trueba et al., 2023; Liu et al., 2022; Moebel et al., 2021). As is typically the case, segmentation methods can benefit from denoising techniques. However, a dedicated tool is still required to achieve accurate segmentation. Such a tool is essential for enabling detailed studies of SV pool regulation in the presynaptic terminal. Recent work combined 3D network with multiple rounds of manual annotation to obtain better results, which comes with the extra cost of active learning (Lamm et al., 2024, Preprint). We implemented a 3D U-Net based on CARE building blocks and trained it with manually segmented datasets. This method provided good accuracy and was not strongly affected by the missing wedge artifact. Nevertheless, it was not sufficient for our downstream Pyto analysis. Hence, we developed a post-processing method, which transforms the segmented objects into spheres and refines their radius and center location. The workflow includes outlier detection based on the radial profile features of the segmented objects. Then, these mis-segmented vesicles can be either removed or refined. This leads to a substantial improvement in accuracy, which is reflected in Pyto performances comparable with those obtained after manual SV segmentation. We also introduced a semiautomatic method to quickly fix wrongly segmented or missed SVs. This tool can potentially be used to generate a larger training set.

Although our set of procedures was developed with the use case of SV segmentation in mind, it can be used to segment any other types of biological spherical vesicles, such as transport vesicles, secretory vesicles, endocytic vesicles, and extracellular vesicles. Furthermore, the method can also extend to segment membrane-bound organelles that mildly deviate from spherical shape such as endosome.

Results

Overview of training and test paradigm

In view of the effort required for the manual segmentation of SVs, we decided to develop an automatic segmentation procedure. Since we had previously manually segmented a number of tomograms with the program IMOD, we could use these segmentations as the ground truth (Kremer et al., 1996). We trained a U-Net with a set of nine segmented tomograms of rat synaptosomes (see Materials and methods).

We sought to further improve segmentation accuracy by feeding the probability map output by the U-Net to a series of post-processing steps (Fig. 1). Three sets of tomograms were used to assess the performance of the pipeline:

(1)
Train tomograms: the nine rat synaptosome tomograms that have been used for U-Net training.
(2)
Within-distribution test tomograms: nine additional rat synaptosome tomograms.
(3)
Out-of-distribution test tomograms: 12 mouse primary neuronal culture tomograms.

Figure 1.

View large Download slide

Pipeline of automatic segmentation. The pipeline consists of two main stages: Training (top row) involving data preparation, U-Net segmentation, until achieving the probability map, and post processing (bottom row) including thresholding, radial profile refinement, and outlier removal to produce the final segmented vesicles.

Additionally, to further explore the generalizability of CryoVesNet across data from different species and its potential applicability to non-synaptic spherical vesicles, we applied our method to a set of publicly available tomograms. Due to the absence of ground truth annotations for these datasets, we conducted a qualitative assessment of the segmentation results on these tomograms:

(4)
Generalization tomograms:
- •
  Excitatory and inhibitory synapses in cultured rat hippocampal neurons (EMD-30364, EMD-30365) (Tao et al., 2018).
- •
  In vitro reconstituted SVs (EMPIAR-10498) (Ginger et al., 2020).
- •
  Cell bodies in a Drosophila melanogaster brain cryo-FIB lamella (EMD-12727) (Bäuerlein et al., 2021, Preprint).
- •
  Axons in a human neuronal organoid culture (EMPIAR-10805) (Hoffmann et al., 2021).
- •
  Cell body in a Caenorhabditis elegans cryo-FIB lamella (EMD-4869) (Schaffer et al., 2019).

U-Net segmentation workflow

Our automated SV segmentation pipeline consists of several key steps, beginning with the generation of a probability map and then combining this with spherical refinement to optimize the segmentation. This approach aims to accurately identify and segment SVs in cryo-electron tomograms, addressing challenges such as noise and the missing wedge artifact inherent in cryo-ET data.

Each tomogram was split into patches of 32³ voxels. These patches were fed into the U-Net, which outputs a probability map for those patches. To obtain a complete probability map, the patches were stitched back together (Figs. 1 and S1). The resulting probability map underwent binarization using a global threshold to create an initial segmentation. Except where mentioned, we restricted the analysis to the region of the tomogram corresponding to the presynaptic terminal by masking it.

+ Expand view − Collapse view

Figure S1.

View large Download slide

2D slice of an automatically segmented dataset. (A) A section through a presynaptic terminal in a neuron tomogram. Bar, 100 nm. (B) Predicted probability map restricted to the segmentation region. Purple corresponds to a low SV probability and yellow to a high SV probability. (C) SV labels after post processing.

At this stage, we noticed that some vesicles were not segmented accurately. Indeed, vesicles in close proximity to one another were at times misidentified as a single entity. Separating them necessitated adjusting the detection threshold to a more stringent value (see “Adaptative local thresholding” section in Materials and methods and Fig. S2). Additionally, there were instances where the initial detection captured only a fraction of a vesicle. In this situation, our adaptative local thresholding strategy loosened the threshold for a more accurate segmentation. In any case, the assignment of a unique label to each segmented vesicle was essential for the subsequent analysis steps.

+ Expand view − Collapse view

Figure S2.

View large Download slide

Adaptative local thresholding. (A) Virtual section through a probability map overlaid on the corresponding tomogram. The probability map is shown as colored contours ranging from 0.5 to 1 and logarithmically spaced (steps get finer and finer from 0.5 to 1). Red box size: 305 × 180 nm. (B) Probability map close-up corresponding to the red box in A. In the center, two SVs were initially detected as a single object. All the contours lower or equal to 0.8210 detect a single object. The contours equal to 0.9128 and above detect two separated objects.

Sphericalization and radial profile-based refinement

Although at first glance the segmentation looked good after these steps, we noticed that it was not extremely accurate. For example, the vesicles were not always centered in the segment or the radius of the segment was inexact. Very often, the vesicle segment looked shrunk in the z-direction, whereas the actual vesicles were spherical. This would be highly problematic for automatic connector and tether segmentation. To address these issues, we represented each vesicle as a sphere. We determined the center and radius of the sphere as described in the Materials and methods section. We then performed a radial averaging of the intensity around the center of the sphere. And we adjusted iteratively the position and radius of the sphere to match the actual structure in the tomogram (Fig. 2 and see Materials and methods). The radial profile refinement is a pivotal tool as it ensures that the segmented vesicles are a true representation of their form in the tomogram.

Figure 2.

View large Download slide

Vesicle radius and position refinement through radial profile and cross-correlation. (A) Initial segmentation of a vesicle. (B) Radial profile. The blue range is from the membrane center to the outer white halo center. This is defined as the search range for the optimal radius. (C) The second derivative of radial profile was used to define the exact edge of the membrane. (D) Central cross-section in the three-dimensional radial average of the vesicle in its initial position. (E–H) Same as (A–D) after refinement.

While research conducted on synapses from rodent brains indicates that SVs are generally spherical, and, indicating a predominant spherical morphology within synapses, elliptical vesicles can be observed with a higher prevalence in inhibitory synapses (Tao et al., 2018). To distinguish between spherical and elliptical vesicles, we adopted the criteria by Tao et al. (2018). Using this approach, we could either retain the initial segmentation of elliptical vesicles or discard them entirely (see Materials and methods). We demonstrate its use in an excitatory and an inhibitory synapse (Fig. 3). We did not restrict our analysis to the presynaptic terminal as a higher proportion of elliptical vesicles was observed in regions outside of it. The majority of vesicles were spherical (blue), while some were elliptical (yellow). In a magnified view, elliptical vesicles are marked with asterisks (Fig. 3 C), and these vesicles were discarded in this instance. In contrast, Fig. 3 D shows the segmentation retaining them after thresholding. Although our network was initially trained on spherical vesicles, this figure indicates capability in detecting and segmenting elliptical vesicles as well.

Figure 3.

View large Download slide

Segmentation of spherical and elliptical synaptic vesicles in rat hippocampal neurons. (A and B) 3D reconstructions showing spherical (blue) and elliptical (yellow) vesicles in excitatory (A) and inhibitory (B) synapses. (C and D) 2D slice of the tomogram shown in A. (C) The labels for elliptical vesicles were discarded but the position of these vesicles is indicated with asterisk (*). (D) In this panel, the post-thresholding label of the elliptical vesicles was retained. Scale bar, 100 nm. Tomograms are EMD-30364 and EMD-30365 (Tao et al., 2018).

Outlier detection and refinement

Despite the improvement brought by the radial profile refinement, some vesicles were still not segmented accurately. By quantifying several parameters of the segmented vesicles, such as radius, membrane thickness, membrane intensity, and lumen intensity, we were able to spot outliers using multivariate statistics, specifically by calculating the Mahalanobis distance (see Materials and methods). Our outlier detection method proved particularly effective in identifying three main types of segmentation inaccuracies: (1) non-vesicular structures mistakenly segmented as vesicles, (2) correctly segmented vesicles with abnormal characteristics (e.g., unusually large radius), and (3) misplaced vesicles with otherwise normal features.

This outlier detection step was important in preparing our data for downstream analyses, as it helped eliminate potential sources of error that could have skewed our results. An example of outlier detection is shown in Fig. 4. In this example, three outliers are highlighted. Outlier 1 (red, top row) corresponds to the mistaken segmentation of a non-vesicular membrane-bound structure. The high Mahalanobis distance of this outlier can be explained by a vesicle radius, membrane thickness, and intensity that are very different from the average of the dataset. Outlier 2 (green, middle row) is correctly segmented but is flagged for its abnormally high radius. Indeed, both the membrane thickness and intensity are close to the average of the dataset but the highly increased radius leads to a high Mahalanobis distance. Outlier 3 (blue, bottom row) is initially detected but is misplaced. Its radius was not divergent from the average but its membrane thickness and intensity were. We could then refine these outliers or remove them if refinement failed.

Figure 4.

View large Download slide

Multidimensional outlier detection. The scatter plot (left panel) represents vesicle features in the space defined by membrane intensity, radius, and thickness, with points colored according to the P value of their Mahalanobis distance, identifying potential outliers. Central panels: outliers are highlighted. Right panels: outliers have been either removed (top and middle row) or fixed by refinement(bottom row). In addition, the right panels show the final vesicle segmentation boundaries. Scale bars, 50 nm.

Expanding segmentation scope

Traditional manual segmentation, while precise, is time-consuming and often limited in scope. In previous cryo-ET studies of presynaptic terminals, the analysis of spatial organization was restricted within 250 nm of the active zone to keep segmentation time reasonable (yellow SVs in Fig. 5). This limitation inherently narrows the scope of synaptic analyses. The advent of deep learning-based segmentation offers a promising alternative, providing both speed and scalability. We were able to segment all SVs (blue, Fig. 5 D) in a fraction of the time that manual segmentation would take. This increased efficiency enables comprehensive analysis of all SV pools, providing a more complete picture of synaptic organization and function.

Figure 5.

View large Download slide

Scalability was provided by automatic segmentation. (A) 3D representation of a synaptosome slice visualized in an orthogonal view. (B) Probability map output by the neural network. The map is represented with a color gradient ranging from blue (0.9968) to red (1), indicating the likelihood of SV presence. (C) Segmentation after global threshold optimization. (D) The final representation of SVs segmentation post-processing. CryoVesNet segmented vesicles are depicted in light blue, combined with expert annotation in yellow (restricted within 250 nm of the active zone shown in red). Scale bar, 100 nm.

Performance

The performance of all steps was quantitatively assessed by comparing the obtained segmentation with the ground truth using the Sørensen-Dice coefficient (DICE) metric (see Materials and methods). The DICE of the probability map was 0.80 ± 0.04 for the train tomograms, and in test datasets 0.78 ± 0.03 for the within-distribution, and 0.71 ± 0.08 for the out-of-distribution tomograms (Tables 1, 2, 3, S1, S2, S3, S4, and S5; and Fig. 6). The probability map was then binarized with a global threshold step, which led to a DICE of 0.80 ± 0.05 and 0.83 ± 0.05 in the train and within-distribution test datasets, respectively, while it led to a slight increase of DICE to 0.73 ± 0.10 in the out-of-distribution test dataset. While the localized thresholding step does not significantly improve the DICE, it is essential for reducing false negative vesicle detections. Specifically, the ability of this step to separate falsely connected vesicles is not reflected in the DICE score. This separation corrects instances where two distinct vesicles were initially identified as a single object. Although not captured by the DICE, this process is necessary for accurately determining the center and radius of individual vesicles. The radial profile refinement and final outlier removal steps led to a DICE of 0.88 ± 0.04, 0.85 ± 0.05 and 0.82 ± 0.08, respectively.

Table 1.

Evaluation of the segmentation on the synaptosomal train set

Dataset	Map DICE	Final DICE	δ_d	Δ_c (nm)	Number of Vesicles	TP	FN	FP	F1-score
SC 1	0.72	0.78	0.08	2.55 ± 1.91	223	188	35	19	0.874
SC 2	0.84	0.91	0.06	2.08 ± 1.06	105	104	1	1	0.990
SC 3	0.82	0.92	0.05	1.77 ± 0.85	128	128	0	2	0.992
SC 4	0.83	0.89	0.03	1.78 ± 0.9	144	138	6	2	0.972
SC 5	0.79	0.86	0.04	1.84 ± 1.01	214	190	24	3	0.934
SC 6	0.76	0.88	0.04	1.86 ± 1.03	104	102	2	7	0.958
SC 7	0.78	0.90	0.06	1.92 ± 0.94	184	184	0	10	0.974
SC 8	0.81	0.90	0.05	1.76 ± 1.12	132	128	4	1	0.981
SC 9	0.82	0.91	0.05	2.0 ± 1.15	135	132	3	3	0.978
Average	0.80 ± 0.04	0.88 ± 0.04	0.05 ± 0.01	1.95 ± 1.11	152.11	95.64%	4.36%	3.25%	0.961 ± 0.04

Dataset	Map DICE	Final DICE	δ_d	Δ_c (nm)	Number of Vesicles	TP	FN	FP	F1-score
SC 1	0.72	0.78	0.08	2.55 ± 1.91	223	188	35	19	0.874
SC 2	0.84	0.91	0.06	2.08 ± 1.06	105	104	1	1	0.990
SC 3	0.82	0.92	0.05	1.77 ± 0.85	128	128	0	2	0.992
SC 4	0.83	0.89	0.03	1.78 ± 0.9	144	138	6	2	0.972
SC 5	0.79	0.86	0.04	1.84 ± 1.01	214	190	24	3	0.934
SC 6	0.76	0.88	0.04	1.86 ± 1.03	104	102	2	7	0.958
SC 7	0.78	0.90	0.06	1.92 ± 0.94	184	184	0	10	0.974
SC 8	0.81	0.90	0.05	1.76 ± 1.12	132	128	4	1	0.981
SC 9	0.82	0.91	0.05	2.0 ± 1.15	135	132	3	3	0.978
Average	0.80 ± 0.04	0.88 ± 0.04	0.05 ± 0.01	1.95 ± 1.11	152.11	95.64%	4.36%	3.25%	0.961 ± 0.04

Map DICE: soft DICE for the predicted map; Final DICE: final DICE after post-processing; δ d: average relative diameter deviation over all correctly detected vesicles; Δ c: center residual error average and SD (nm); Number of Vesicles: number of expected vesicles; TP: True Positive; FN: False Negative; FP: False Positive; F1-score: F1-score based on TP, FN, and FP.

View large

Table 2.

Evaluation of the segmentation on the synaptosomal within-distribution test set

Dataset	Map DICE	Final DICE	δ_d	Δ_c (nm)	Number of Vesicles	TP	FN	FP	F1-score
ST 1	0.76	0.84	0.11	2.61 ± 1.51	699	683	16	2	0.987
ST 2	0.74	0.79	0.11	2.31 ± 1.72	122	117	5	1	0.975
ST 3	0.74	0.75	0.10	3.6 ± 2.28	434	380	54	22	0.909
ST 5	0.79	0.86	0.08	2.22 ± 1.27	535	514	21	15	0.966
ST 6	0.76	0.85	0.06	1.98 ± 0.99	373	355	18	23	0.945
ST 7	0.77	0.82	0.06	2.26 ± 1.23	110	107	3	7	0.955
ST 8	0.83	0.91	0.04	2.14 ± 1.05	100	100	0	1	0.995
SC 10	0.80	0.89	0.07	1.92 ± 1.22	129	127	2	5	0.973
ST 10	0.81	0.90	0.04	1.89 ± 1.0	76	75	1	5	0.962
Average	0.78 ± 0.03	0.85 ± 0.05	0.07 ± 0.03	2.33 ± 1.36	286.44	96.31%	3.69%	3.63%	0.963 ± 0.03

Dataset	Map DICE	Final DICE	δ_d	Δ_c (nm)	Number of Vesicles	TP	FN	FP	F1-score
ST 1	0.76	0.84	0.11	2.61 ± 1.51	699	683	16	2	0.987
ST 2	0.74	0.79	0.11	2.31 ± 1.72	122	117	5	1	0.975
ST 3	0.74	0.75	0.10	3.6 ± 2.28	434	380	54	22	0.909
ST 5	0.79	0.86	0.08	2.22 ± 1.27	535	514	21	15	0.966
ST 6	0.76	0.85	0.06	1.98 ± 0.99	373	355	18	23	0.945
ST 7	0.77	0.82	0.06	2.26 ± 1.23	110	107	3	7	0.955
ST 8	0.83	0.91	0.04	2.14 ± 1.05	100	100	0	1	0.995
SC 10	0.80	0.89	0.07	1.92 ± 1.22	129	127	2	5	0.973
ST 10	0.81	0.90	0.04	1.89 ± 1.0	76	75	1	5	0.962
Average	0.78 ± 0.03	0.85 ± 0.05	0.07 ± 0.03	2.33 ± 1.36	286.44	96.31%	3.69%	3.63%	0.963 ± 0.03

For the meaning of the columns, see Table 1.

View large

Table 3.

Evaluation of the segmentation on the neuronal out-of-distribution test set

Dataset	Map DICE	Final DICE	δ_d	Δ_c (nm)	Number of Vesicles	TP	FN	FP	F1-score
N 134	0.73	0.80	0.08	2.73 ± 2.53	629	530	99	79	0.856
N 80	0.73	0.81	0.07	2.78 ± 2.73	110	101	9	13	0.902
N 73	0.72	0.83	0.08	3.14 ± 2.77	510	480	30	39	0.933
N 133	0.80	0.90	0.05	2.21 ± 2.06	516	498	18	12	0.971
N 116	0.66	0.83	0.07	2.57 ± 2.59	294	281	13	44	0.908
N 84	0.85	0.90	0.07	1.6 ± 1.89	481	464	17	29	0.953
N 115	0.56	0.64	0.08	3.41 ± 3.11	163	130	33	62	0.732
N 102	0.64	0.89	0.05	1.52 ± 0.83	103	96	7	2	0.955
N 123	0.64	0.68	0.05	2.3 ± 1.8	65	59	6	6	0.908
N 132	0.70	0.87	0.04	1.61 ± 1.03	135	125	10	17	0.903
N 114	0.71	0.83	0.07	2.71 ± 2.07	127	115	12	23	0.868
N 128	0.75	0.90	0.04	2.11 ± 1.17	248	239	9	23	0.937
Average	0.71 ± 0.08	0.82 ± 0.08	0.06 ± 0.02	2.39 ± 2.05	281.75	91.83%	8.17%	11.22%	0.902 ± 0.06

Dataset	Map DICE	Final DICE	δ_d	Δ_c (nm)	Number of Vesicles	TP	FN	FP	F1-score
N 134	0.73	0.80	0.08	2.73 ± 2.53	629	530	99	79	0.856
N 80	0.73	0.81	0.07	2.78 ± 2.73	110	101	9	13	0.902
N 73	0.72	0.83	0.08	3.14 ± 2.77	510	480	30	39	0.933
N 133	0.80	0.90	0.05	2.21 ± 2.06	516	498	18	12	0.971
N 116	0.66	0.83	0.07	2.57 ± 2.59	294	281	13	44	0.908
N 84	0.85	0.90	0.07	1.6 ± 1.89	481	464	17	29	0.953
N 115	0.56	0.64	0.08	3.41 ± 3.11	163	130	33	62	0.732
N 102	0.64	0.89	0.05	1.52 ± 0.83	103	96	7	2	0.955
N 123	0.64	0.68	0.05	2.3 ± 1.8	65	59	6	6	0.908
N 132	0.70	0.87	0.04	1.61 ± 1.03	135	125	10	17	0.903
N 114	0.71	0.83	0.07	2.71 ± 2.07	127	115	12	23	0.868
N 128	0.75	0.90	0.04	2.11 ± 1.17	248	239	9	23	0.937
Average	0.71 ± 0.08	0.82 ± 0.08	0.06 ± 0.02	2.39 ± 2.05	281.75	91.83%	8.17%	11.22%	0.902 ± 0.06

For the meaning of the columns, see Table 1.

View large

Figure 6.

View large Download slide

DICE improvement during post-processing. DICE development at different post-processing steps of predicted masks (different colors correspond to different steps). Each dot corresponds to a tomogram. (A–C) synaptosomal train set (B) synaptosomal within-distribution test set (C) neuronal out-of-distribution test set. Boxplot: Box edges show 25th and 75th percentiles, the middle line is the median. Whiskers extend to the last points within 1.5 times the interquartile range. Points beyond are outliers.

In addition to the DICE metric, which is a voxel-wise evaluation, we performed a vesicle-wise evaluation. Namely, we quantified vesicle diameter deviation and center residual (Fig. 6; and Tables 1, 2, and 3). Results show that our method transfers well across datasets even without fine-tuning which shows robustness and generalization, with an F1-score of 0.96 ± 0.04, 0.96 ± 0.03, and 0.90 ± 0.06, fort the train, within-distribution test, and out-of-distribution datasets, respectively (Tables S3, S4, and S5).

Comparison with other methods

To provide a comparison with non-deep learning-based approaches, we assessed the performance of 3D ART VeSElecT, a pipeline designed to segment automatically SVs in electron tomograms of freeze-substituted samples (Kaltdorf et al., 2017). The workflow of 3D ART VeSElecT involves several preprocessing steps, including smoothing, contrast, and edge enhancement. The segmentation process involves image thresholding and the watershed algorithm for 2D and 3D segmentation. Fig. 7 A shows the cryo-ET dataset used for the test, while Fig. 7 B shows the 3D distance map obtained from the 3D ART VeSElecT algorithm before binarization and the final labels output by the algorithm. It is apparent that most SVs do not get correctly segmented by this procedure. This can likely be attributed to the low signal-to-noise ratio inherent to cryo-ET.

Figure 7.

View large Download slide

Prediction mask and label comparison. (A and B) Orthogonal slices of a mouse neuron tomogram, (B) 3D ART VeSElecT; left: distance map before binarization; right: final output. (C) VGGNet probability map (left) and segmentation after global thresholding (right). (D) CryoVesNet probability map (left) and segmentation after global thresholding (right). The color bar is from 0 to 1 and scale bar, 100 nm.

We then went on to compare our approach to well-known 2D network architectures and the model implemented in EMAN2 (Chen et al., 2017). Held et al. (2024) have recently trained the latter model on a set of manually annotated SV cross-sections and they have subsequently automatically segmented additional SVs. For the comparison, we conducted extensive training experiments using several well-known 2D networks (Last et al., 2024, Preprint). These networks consisted of InceptionNet (Szegedy et al., 2015, Preprint), ResNet (Szegedy et al., 2017, Preprint), 2D U-Net (Ronneberger et al., 2015), VGGNet (Simonyan and Zisserman, 2014, Preprint), and the model in EMAN2 (Galaz-Montoya et al., 2015). They were trained under conditions similar to those used for our proposed method, ensuring a fair and accurate comparison (see Material and methods and Fig. S3).

+ Expand view − Collapse view

Figure S3.

View large Download slide

Training performance of 2D networks. DICE coefficient and training loss for VGGNet, 2D U-Net, EMAN2, ResNet, and InceptionNet over 200 epochs.

Fig. 7, C and D illustrates the probability maps and the labels after initial thresholding generated by the best-performing 2D network (VGGNet) and by CryoVesNet, respectively. While 2D networks can effectively capture information in the x-y plane, they struggle with the full 3D complexity of the data. This limitation is particularly evident when dealing with the missing wedge effect in cryo-electron tomography, which results in anisotropic resolution and incomplete information in the reconstructed volumes (Fig. S4) Consequently, 2D networks trained on these tomograms often miss the top and bottom parts of the vesicles along the z-axis, as it can be observed in Fig. 7 C. This shortcoming highlights the need for a 3D approach like CryoVesNet.

+ Expand view − Collapse view

Figure S4.

View large Download slide

Probability map comparison. (A) Orthogonal slice of a mouse neuron tomogram. Scale bar, 100 nm. (B) VGGNet as best 2D network probability map. (C) CryoVesNet probability map.

We quantitatively compared performance, measured by DICE score, of six different deep learning models (CryoVesNet, VGGNet, 2D U-Net, EMAN2, ResNet, and InceptionNet) across all post-processing steps except adaptive thresholding (Fig. 8). This step was omitted due to its tendency to expand small vesicles in noisy probability maps, which were obtained by some 2D networks, leading to poorer results. The results show that our post-processing steps consistently improve segmentation performance across all assessed models. This highlights the potential of our pipeline as a versatile tool in cryo-ET data analysis. Despite the superior performance of 3D models in theory, the practical reality of cryo-ET data analysis often relies on 2D annotations due to time constraints and the complexity of 3D manual segmentation (Last et al., 2024, Preprint). Our approach bridges this gap, allowing researchers to leverage existing 2D annotations while benefiting from improved 3D segmentation accuracy. By providing a robust post-processing pipeline that enhances results across multiple architectures, we offer a practical solution that can be integrated into various existing workflows, potentially reducing the need for time-consuming 3D manual annotations while improving overall segmentation quality.

Figure 8.

View large Download slide

Post-processing performance of different models on out-of-distribution dataset. The box plots show the DICE scores for six models (CryoVesNet, VGGNet, 2D U-Net, EMAN2, ResNet, and InceptionNet) at four stages of post-processing: probability map (soft DICE), map thresholding, radial profile refinement, and outlier removal. Boxplot definition: see Fig. 6.

Generalization and interactive tool usage

To assess the generalization capabilities of our procedure, we used our tools on publicly available datasets on the Electron Microscopy Public Image Archive (EMPIAR) and Electron Microscopy Data Bank (EMDB). The generalization dataset contains in vitro–reconstituted SVs (EMPIAR-10498), cell bodies in a D. melanogaster brain cryo-FIB lamella (EMD-12727), axons in a human neuronal organoid culture (EMPIAR-10805), and a cell body in a C. elegans cryo-FIB lamella (EMD-4869). Due to the lack of ground truth annotations, quantitative performance metrics could not be calculated. However, visual inspection and qualitative analysis demonstrate the method’s potential applicability across diverse sample types (Fig. 9).

Figure 9.

View large Download slide

Tool generalization across diverse samples and interactive cleaning application. Left column: tomographic x-y slice. Middle column: initial probability map. Right column: final segmentation after interactive cleaning in our Napari tool. In the right column, correctly segmented vesicles are shown in green, false positives are shown in red, and false negatives are shown in blue. Color bar range is 0–1 and scale bar, 100 nm.

Even if the pipeline efficiently and accurately segments vesicles, we typically get a few percent of regions mistakenly segmented as vesicles (false positives) and missed vesicles (false negatives; Tables 1, 2, and 3). To fix them, we developed an interactive tool using Napari, a multidimensional image viewer for Python. It enables users to manually remove incorrectly identified vesicles in a single click. Users can also click the approximate center of a missed vesicle and the tool will automatically refine the center position and find the radius of the vesicle. An example of the output of the tool is shown in Fig. 9; and Videos 1 and 2. The right column shows in green the vesicles that were correctly segmented by CryoVesNet without any user intervention. The wrongly segment vesicles are depicted in red, while the missed vesicles were segmented using the Napari tool and are shown in blue. More details on the function of the tool are given in Materials and methods and in Videos 1 and 2.

Video 1.

Demonstration of Napari interactive tool usage. This video illustrates the use of our Napari-based interactive tool. The video highlights key features of the tool, including Compute Labels to segment synaptic vesicle and Remove Labels for eliminating incorrect annotations. Users can iterate through this process, refining the segmentation as needed, and correct mistakes. Once satisfied with the results, users can save the final segmentation in MRC format. Additional features include adding spherical labels for the manual addition of vesicle labels and undoing a function to revert the last modifications. 30 frames per second.

Video 2.

Demonstration of Napari Interactive Tool to segment vesicles from scratch. This video shows how to use the tool to segment vesicles from scratch. This can be used for example if only a handful of vesicles are present in a tomogram or to quickly build a training set. The tool does not need a probability map. The user puts a point in each vesicle to segment. The compute labels function then instantly refines the vesicle center position and computes the vesicle radius. 30 frames per second.

Downstream analysis and application

Pyto is a software package designed for the analysis of pleomorphic membrane-bound molecular complexes in 3D images, particularly in the context of synaptic cryo-ET (Lucić et al., 2016). A key feature of Pyto is its ability to accurately segment connectors and tethers within the pre-synaptic terminal, a task that requires a high level of vesicle segmentation precision. This segmentation process is hierarchical and connectivity-based, detecting densities interconnecting vesicles (connectors) and densities connecting vesicles to the active-zone plasma membrane (tethers). CryoVesNet has been designed to be compatible with Pyto (Lucić et al., 2016) and an application is demonstrated in Fig. 10 and Video 3. This enables us to investigate SV connectivity and priming at the ultrastructural level to better understand the structural basis of SV exocytosis regulation. A detailed close-up of Fig. 10 D is provided in Fig. S5.

Figure 10.

View large Download slide

Downstream connector segmentation to a CryoVesNet-segmented cultured mouse neuron synapse. (A) Slice through a tomogram. (B) Threshold-based segmentation of plasma membrane (light blue), mitochondria (dark blue), endosomes (green), microtubules (dark magenta). (C) SVs (orange) and connectors (black) segmented with CryoVesNet and Pyto, respectively. (D) Combination of B and C. Scale bar, 100 nm.

Video 3.

Tomogram and segmentation of a cultured mouse neuron synapse. The video begins with a complete tomogram of the presynaptic terminal in a cultured mouse neuron. It then displays the segmentation of synaptic vesicles (SVs) using CryoVesNet, where SVs are shown in orange and connectors in black. Additional overlays illustrate the segmentation of microtubules (dark magenta), the plasma membrane (light blue), mitochondria (dark blue), and endosomes (green), providing a comprehensive annotation of the neuronal structures. 30 frames per second.

+ Expand view − Collapse view

Figure S5.

View large Download slide

Close-up of segmented SVs and connectors. Close-up of Fig. 10 D, SVs are depicted in orange and connectors in black. Scale bar, 100 nm.

Discussion

Synaptic vesicles: Molecular interactions and functions

SVs play a central role in neurotransmission, facilitating the release of neurotransmitters into the synaptic cleft. These vesicles undergo a series of molecular interactions with various protein complexes, transitioning from a tethered to a primed state, and eventually to neurotransmitter release through exocytosis. Synapsins have been identified as key proteins in regulating the availability of SVs for exocytosis. It has been hypothesized that synapsins crosslink SVs, thereby preventing their premature release. Cryo-ET emerges as a powerful tool to address these challenges, offering unparalleled insights into the molecular architecture of synapses. Accurate segmentation of structures such as vesicles, connectors, and tethers is essential for a comprehensive understanding of synaptic function. Cryo-ET, however, is not without its challenges. The technique suffers from a high level of noise and anisotropic resolution (known as the missing wedge phenomenon), which complicates data analysis and interpretation. Addressing these challenges is crucial for obtaining clear and accurate tomographic reconstructions.

CryoVesNet: Automatic vesicle segmentation in Cryo-ET

By utilizing a U-Net architecture trained on manually segmented tomograms and postprocessing steps, we have developed a system that can efficiently and accurately segment SVs in tomographic datasets. In particular, CryoVesNet is uniquely insensitive to the missing wedge and can segment complete vesicles even if the membrane is not fully visible in the tomogram. Recent methods like IsoNet learn about the effects of the missing wedge by artificially introducing additional missing wedge artifacts during training (Liu et al., 2022). This approach enables them to partially restore information lost due to the experimental missing wedge. IsoNet applies this strategy to reduce tomogram resolution anisotropy. In contrast, our approach takes a different path. CryoVesNet inherently addresses the missing information problem during training as the ground truth segmentations consist of perfect spheres. By learning from these idealized representations, the network can accurately segment vesicles even when faced with incomplete data in real tomograms. The results obtained from our method, as evidenced by the DICE and other evaluation metrics, demonstrate its robustness and accuracy. Notably, the applicability of our method across different datasets, namely from rat synaptosomes and primary neuronal cultures, underscores its versatility. The potential of CryoVesNet to generalize across species and to segment not only SVs but also other spherical membrane-bound organelles illustrates its broad utility in structural biology research. This adaptability suggests that researchers could potentially apply CryoVesNet to a broad range of studies involving spherical vesicles or similar structures. The enhancement in segmentation quality provided by the post-processing steps was observed for both our 3D U-Net and popular 2D networks. Such improvements underscore the crucial role of post-processing in refining results, especially when dealing with image noise and anisotropic resolution, which are characteristic of cryo-ET data.

In our segmentation approach, the use of both global and adaptive localized thresholding techniques further refines the segmentation, addressing challenges posed by closely packed vesicles. Our results highlight the effectiveness and robustness of our post-processing steps, including radial profile refinement and the removal of outliers. The radial profile, in particular, ensures that the segmented vesicles closely match their actual structure in the tomogram, providing a more accurate representation. Although our network was trained exclusively on spherical vesicles, it also detects and segments elliptical vesicles. This facilitates the analysis of inhibitory synapses, which consistently contain a modest yet significant proportion of elliptical SVs. Furthermore, our method’s compatibility with software like Pyto, which is designed for the analysis of pleomorphic membrane-bound molecular complexes in cryo-electron tomograms, enhances its utility. By integrating our segmentation approach with tools like Pyto, researchers can gain deeper insights into vesicle interactions.

By visualizing and quantifying changes in vesicle distribution, connectivity, and tether morphology, we can infer how molecular manipulations translate into structural alterations that ultimately affect synaptic transmission. We focus on nanometer-scale morphological analysis that quantifies SV distribution, tether morphology, and connectivity patterns. This level of analysis is essential for understanding the spatial organization and interactions of synaptic components, offering a comprehensive view of synaptic architecture that bridges the gap between molecular-level interactions and overall synaptic function.

Conclusion

In conclusion, CryoVesNet for automatic segmentation in cryo-ET represents a step forward in the study of SVs and their associated structures. By combining the power of deep learning with optimized post-processing techniques, we offer a solution that is both efficient and precise. As the emerging field of structural cell biology develops, tools like ours will contribute to advancing our understanding of complex cellular structures and processes. Future studies could further expand on this approach by combining our morphological analysis with higher-resolution techniques like subtomogram averaging, potentially providing a multiscale view of synaptic organization from individual protein complexes to overall vesicle arrangements. This integration of methods could significantly advance our understanding of the structure–function relationships in synapses and how they are regulated in various physiological and pathological conditions.

Materials and methods

Cryo-electron tomography datasets

In this study, we used datasets originating from either rat synaptosomes or mouse primary neuron cultures. They represent a total of 30 tomograms with heterogeneous pixel sizes, defocus, and resolution, and we split them into three groups: (1) Train set: nine synaptosome tomograms were used for training. (2) Within-distribution test set: nine independent synaptosome tomograms were used for testing. (3) Out-of-distribution test set: 12 neuron tomograms were used for assessing transfer learning potential. (4) Generalization test set: six tomograms from publicly available databank to demonstrate the generalization across species. The preparation procedure of the samples from which the datasets were obtained as well as the biological analysis of these datasets was previously reported (Radecke et al., 2023).

Manual segmentation and automatic interboundary segment detection

Manual segmentation of SVs, the presynaptic cytoplasm, and the active zone plasma membrane was done in IMOD (Kremer et al., 1996). SVs were segmented as spheres. The presynaptic cytoplasm marked the region to be analyzed by Pyto (Lucić et al., 2016). Later on, we refer to this region as the cytoplasmic segmentation region. It consisted of the volume comprising the active zone and the cluster of SVs. The analysis by Pyto was essentially the same as described previously (Fernández-Busnadiego et al., 2010; Lucić et al., 2016). In short, the segmented region is divided into one voxel thick layers parallel to the active zone for distance calculations. A hierarchical connectivity segmentation detects densities interconnecting boundaries. The boundaries were SVs and the active zone plasma membrane. Detected intervesicular segments are termed connectors and segments connecting vesicles to the active zone plasma membrane are called tethers. Distance calculations respective to SVs were done from the SV center. The segmentation procedure is conservative and tends to miss some tethers and connectors because of noise. Consequently, the numbers of tethers and connectors should not be considered as absolute values, but rather to compare experimental groups. As it was done before, an upper limit was set between 2,100 and 3,200 nm³ on segment volume. The tomograms that were used for this analysis were binned by a factor of 2–3, resulting in voxel sizes between 2.1 and 2.4 nm.

Train and validation set generation

In the preparation of our train set, we utilized segmented 3D image volumes. The primary volume was systematically divided into 32³ cubic sub-volumes. To ensure the relevance and richness of the data, only those sub-volumes that were sufficiently occupied by vesicles, specifically containing >1,000 voxels, were retained. 900 sub-volumes were used for training and 200 sub-volumes were used for validation in our experimental setup, we configured the model with a dropout rate of 0.2 to stabilize the validation loss.

Network architecture and training procedure

We used a U-Net with two downsampling stages and two convolutional layers per stage, with a kernel size of 3, and ReLU activation function based on the open-source CARE framework (Fig. 1) (Weigert et al., 2018). We employed the Adam optimizer on a weighted (10:1) binary cross-entropy loss function. The learning progress was tracked by calculating the DICE and the loss value after each training epoch (Fig. S6). The DICE for the train set was initially started at ∼0.25 and rose to just below 0.9 during training, the validation DICE score remained stable around 0.8 after 50 epochs. The loss for the train set decreased from over 1.2 to values close to 0.2 after 50 epochs, whereas for the validation set, the loss initially dropped slightly below 0.5 and then exhibited slight increases and fluctuations. A dropout rate of 0.2 was used to prevent overfitting. We used 50 sub-volumes per batch, and the training was conducted for 200 epochs.

+ Expand view − Collapse view

Figure S6.

View large Download slide

Dice coefficient and loss value for train and validation set over epoch. (A–D) Training DICE coefficient, (B) training loss, (C) validation DICE coefficient, and (D) validation loss.

Probability map construction

Our U-Net model, trained on 32³-voxel patches, utilizes a 24-voxel region of interest (ROI). To mitigate tiling effects during testing, the network input can be expanded to accommodate larger volumes specifically in this case of 64³ voxels. The tomogram undergoes padding to align with the ROI, ensuring reduced edge artifacts. Segmentation is executed in tiles, where the U-Net predicts the SV probability for each tile. Only the central part of the segmented patch, corresponding to the ROI, is retained. Finally, the segmented tiles are reassembled, yielding a continuous SV probability map of the entire volume.

Global thresholding

Segmenting implies turning the probability map into a binary mask. To find the optimal threshold value, we iterated through potential threshold values ranging from 0.8 to 1 in increments of 0.01. A binary mask was generated for each threshold. Subsequently, an erosion operation was applied to the binary mask, and the difference between the original and eroded masks produced the vesicle shell. The voxel intensity values of the original image corresponding to this shell were recorded for each threshold. We minimized the average intensity of the shell voxels to determine the optimal threshold value since the shell of correctly segmented vesicles corresponds to the vesicle membrane, which in cryo-ET appears darker, i.e., with lower intensity values.

Adaptative local thresholding

Each individual segment was given a label using the scikit-image label method (van der Walt et al., 2014). A majority of vesicles were correctly segmented but we noticed some segments included more than one vesicle. We therefore evaluated each segment with two criteria based on the fact that SVs have a homogenous size and are spherical. Firstly, we calculated the volume z-score z for each segment:

z (S_{i}) = \frac{V (S_{i}) - μ}{σ},

(1)

where S_i is the segment i, V(S_i) is the volume of S_i, μ the average volume of all segments, and σ the standard deviation of the segment volumes. Secondly, we computed the segment extent e as:

e (S_{i}) = \frac{V (S_{i})}{B (S_{i})},

(2)

where B(S_i) is the volume of segment i bounding box. The extent of a sphere equals

\frac{π}{6}

⁠. Segments with both a z-score z > 1 and an extent e < 0.25 were considered as potentially comprising more than one vesicle. For each of these segments, the threshold was increased stepwise until two distinct segments were generated. Subsequently, the extent and volume of all segments were evaluated again. Any segment with e < 0.25, or e > 0.75, or V < k was discarded. k was defined as the volume of a sphere with a radius of 12 nm. This ensured that segments deviating highly from spherical shape and segments with a volume smaller than an acceptable volume k were removed.

Segmentation refinement using radial profile

Even if most SVs were detected and well-segmented, segmentation accuracy was not sufficient for our downstream application. To improve accuracy, each segment was converted to a spherical segment and its radius and position were refined. The initial spherical conversion was done by setting the center of the sphere C at the position of the centroid of the segment, while the radius r was defined as half the length of the bounding box’s longest edge. The segment position and radius were iteratively refined as follows.

(1)
The radial average $〈 I (d) 〉$ was computed:

⟨ I (d) ⟩ = \frac{1}{4 π^{2} r^{2}} \int_{0}^{2 π} \int_{0}^{2 π} I (d, θ, ϕ) d ϕ d θ,

(3)

where d is the radial distance from the segment center, θ the polar angle, and ϕ the azimuthal angle.

(2)
The radius of the vesicle r was updated as:

r = d_{m} + \frac{t_{m}}{2},

(4)

where d_m is the radial distance of the center of the vesicle membrane and t_m is the thickness of the vesicle membrane. d_m was defined as the radial distance for which the radial average was minimal.

\frac{t_{m}}{2}

was calculated as the distance between the center of the vesicle membrane and the minimum of the second derivative of the radial profile in the interval between the center of the vesicle membrane and the maximum of the Fresnel fringe outside the membrane.

(3)
The radial average was back-projected in three dimensions:

I (x, y, z) = ⟨ I (\sqrt{x^{2} + y^{2} + z^{2}}) ⟩,

(5)

where (x,y,z) = (0,0,0) is the coordinate of the segment center.

(4)
We computed by cross-correlation the shift between the obtained 3-D average and the 3-D image in the cubic box with central coordinates C and edge length l = 2r + c, where c is a constant. C was updated by subtraction of the shift.
(5)
Steps 1–4 were repeated for a maximum of 10 iterations until convergence or until a total shift of $\frac{1}{2} \sqrt{3 l_{o}^{2}}$ ⁠, where l_o is the edge length of the initial box. The feature space of predicted vesicle labels was computed, containing membrane thickness t_m, membrane intensity ρ, and vesicle radius r. ρ was defined as the mean intensity of the radial average within the radial distance interval $[d_{m} - \frac{t_{m}}{2}, d_{m} + \frac{t_{m}}{2}]$ ⁠.

Outlier detection and refinement

Following radial profile calculation, key features, namely thickness, radius, and membrane intensity, were extracted. Using these criteria, the Mahalanobis distance D² was calculated for each data point to quantify its distance from the distribution of these features using Scipy’s implementation (Virtanen et al., 2020), as follows:

D^{2} = {(x - μ)}^{⊤} S^{- 1} (x - μ),

(6)

where x is the vector of the three features, μ is the mean vector of the features, and S⁻¹ is the inverse covariance matrix of the features. The significance of the Mahalanobis distance was interpreted as follows:

p = 1 - C D F_{χ^{2}} (D^{2}, ν),

(7)

where

C D F_{χ^{2}}

is the cumulative distribution function of the chi-squared distribution and ν is the number of degrees of freedom, which is equal to the number of features - 1. The resulting P value provides a measure of the statistical significance of each data point’s distance (Fig. 3).

With the computed P values, outlier detection and refinement were conducted. Each vesicle with a P value lower than a given threshold was defined as an outlier. In this study, we empirically set the threshold to 0.3, but other values can be used, depending on the use case. The radial profile and P value of the outliers were recalculated using a different box size. We performed this step iteratively. At each iteration, the box size was made larger by 2 × 2 × 2 voxels. For each outlier, the iteration stopped when its P value was higher than the threshold. A maximum of 10 iterations was performed. Vesicles that did not meet the P value criteria were removed from the dataset.

Criteria for spherical vesicle selection

To distinguish between spherical and elliptical vesicles, we adopted the criteria used in the study by Tao et al. (2018). Vesicles are initially segmented using CryoVesNet, and their shapes are analyzed based on the major and minor axes. A vesicle is classified as spherical if the ratio of its major to minor axis is ≤1.14. This corresponds to an eccentricity (E) of 0.481, which we use as a threshold for classification. Eccentricity (E) is a measure of how much a shape deviates from being circular. It is calculated using the formula:

E = \sqrt{1 - {(\frac{m i n o r a x i s}{m a j o r a x i s})}^{2}} .

(8)

Eccentricity values range from 0 to 1, with 0 indicating a perfect sphere and values closer to 1 indicating more elongated ellipses. Vesicles with an eccentricity <0.48 are considered spherical. We classify vesicles with an eccentricity exceeding 0.95 as too elongated to be considered SVs.

Method comparison

For our 2D networks, we adopted code from the Ais codebase (Last et al., 2024, Preprint). To ensure a fair comparison, we extracted slices with label density matching our model’s training set. These networks were trained under conditions identical to our proposed method, including the use of the same optimizer and class weight ratio (10:1). Training continued for 200 epochs, during which we observed a consistent decrease in training loss and an increase in validation DICE (Fig. S3). We utilized a batch size of 25 slices throughout the training process.

To compare our approach with non-deep learning algorithms, we employed the FIJI Macro 3D ART VeSElecT v2. To align our cryo-ET data more closely with the characteristics expected by this method, we applied non-anisotropic denoising (NAD) and histogram matching as preprocessing steps. We used the default characterization parameters of the VeSElecT method, which include minimum radius, sphericity, and elongation. This preparation ensured that this non-deep learning approach could be applied to our dataset, allowing for a more meaningful comparison between traditional and deep learning-based segmentation techniques.

Interactive vesicle segmentation tool

The interactive Napari tool offers several key features for image visualization and segmentation editing. It displays the original cryo-ET image alongside an editable label layer representing the current segmentation. Users can add, modify, or remove vesicle annotations using a point-based interface. It is particularly useful for addressing false positives and false negatives in vesicle detection.

The tool provides various modification operations, including computing labels, removing labels, and adding spherical labels. The “compute labels” function uses a radial profile refinement step to segment vesicles based on user-specified points as vesicle center approximations. In this process, the function initially creates a bounding box around each point using a default diameter of 45 nm (larger than a typical SV diameter) as a starting point for analysis. The refinement step then examines the intensity profile radially from the center of each potential vesicle, adjusting the position and radius to best fit the actual vesicle boundaries in the image data iteratively (see “Segmentation refinement using radial profile” section). The “remove labels” function will remove any label under the user-specified points. Users can also employ the “add spherical labels” feature as a fully manual annotation method, in which they interactively define the center and radius of the label to add. To enhance user efficiency, keyboard shortcuts for common operations have been implemented. The procedure is demonstrated in Video 1.

Evaluation metrics

The evaluation framework was designed to assess the capabilities of the proposed toolbox for automatic SV segmentation. We defined as ground truth the manual segmentation of SVs. The evaluation was performed within the cytoplasmic segmentation region (see “Manual segmentation and automatic interboundary segment detection”). We performed per-vesicle evaluation and voxel-wise evaluation. For the former, we defined a vesicle as correctly segmented if the center of the predicted vesicle was located inside the ground truth vesicle. Based on that we calculated an F1 score. For the voxel-wise evaluation, we calculated the DICE between the prediction and the ground truth.

Voxel-wise evaluation

During training, a DICE for subvolumes probabilistic masks was calculated after each epoch as a performance quantification (Kremer et al., 1996) (Fig. S1). After reconstructing the probability map after training or prediction, we employed the DICE metric for the whole tomogram to evaluate the similarity of the predicted probability map with ground truth (Tables 1, 2, and 3). The DICE is defined as:

1 - \frac{2 \sum_{v o x e l s} y_{t r u e} y_{p r e d}}{\sum_{v o x e l s} y_{t r u e}^{2} + \sum_{v o x e l s} y_{p r e d}^{2}},

(9)

where y_true and y_pred are the ground truth and predicted probability values, respectively, for each voxel.

The DICE was also employed to monitor all stages of post-processing of the labels and to observe the effect of each post-processing step.

Vesicle diameter and position deviation

In addition to the F1 score, we also evaluated the precision of our segmentation by calculating the deviation of the estimated vesicle diameter and position from the ground truth. Given the diameter of a ground-truth manually segmented vesicles d_GTi and the predicted diameters of the same vesicles d_Pi, the average deviation in diameter estimation across all vesicles can be expressed as δ_d, where δ_d is calculated as:

δ_{d} = \frac{1}{n} \sum_{i = 1}^{n} (1 - \frac{\min (d P i, d G T i)}{\max (d P i, d G T i)}) .

(10)

Here, n represents the total number of vesicles, dPi is the diameter of i-th vesicle predicted by the segmentation, and dGTi is the diameter of the i-th vesicle in the ground truth. This formula offers an insight into the congruence between our estimated diameter and the manual segmentation, with a diminished value of δd signifying a closer approximation.

Similarly, the average deviation in position estimation across all vesicles can be expressed as Δ_c. It corresponds to the average Euclidean distance between the center of ground truth vesicles and the center of predicted vesicles.

Statistical comparison

Multiple pairwise ANOVA comparisons with Benjamini-Hochberg correction were performed on the DICE values summarized in Table S2 to assess the statistical significance of the differences between the DICE values (Benjamini and Hochberg, 1995). We performed Benjamini-Hochberg correction with the multipletests function implemented in the Python module statsmodels (Seabold and Perktold, 2010). A list of P-values resulting from pairwise comparisons was input, and multipletests output a list of corrected P-values. The used implementation of the Benjamini-Hochberg correction does not require a false discovery rate to be input. This variation of the original Benjamini-Hochberg correction algorithm was proposed by Yekutieli and Benjamini (1999). If a corrected P-value is smaller than the defined acceptable false discovery rate, then the null hypothesis is rejected, i.e., the difference is considered statistically significant. This algorithm enables to test multiple false discovery rates in one step and its conclusions are exactly the same as the original Benjamini-Hochberg correction algorithm run multiple times with different false discovery rates.

Computational setup

All experiments were conducted using 4 × NVIDIA 2080 Ti GPUs with CUDA 10.1. The software environment was set up with Python 3. Key libraries and packages used include TensorFlow 2.4.1 with GPU support and Keras 2.4.3. Image visualization was achieved with UCSF ChimeraX and Amira 2022.2 (Thermo Fisher Scientific). The contouring shown in Fig. S2 was done using MATLAB version R2023a (The MathWorks, Inc.). Surface rendering was performed using the volume tracer and color zone in UCSF ChimeraX.

Manuscript preparation

The first version of the manuscript was written with the open and collaborative scientific writing package Manubot (Himmelstein et al., 2019).

Online supplemental material

Fig. S1 shows a 2D slice of an automatically segmented dataset, including a presynaptic terminal section and predicted probability maps. Fig. S2 illustrates the adaptive local thresholding applied to the probability map. Fig. S3 presents the training performance of various 2D networks. Fig. S4 shows the reconstructed neuron tomogram and probability map output of VGGNet and CryoVesnet. Fig. S5 provides a close-up of segmented SVs and connectors. Fig. S6 illustrates the DICE coefficient and loss values for both training and validation sets over epochs. Table S1 lists DICE statistical values across different algorithm steps for various datasets. Table S2 provides corrected P-values for DICE value comparisons between different algorithms. Table S3 summarizes per-tomogram train set metrics, including F1 scores and false positives. Table S4 presents metrics for within-distribution test sets. Table S5 details out-of-distribution test set metrics. Video 1 demonstrates the usage of the Napari Interactive Tool for segmenting SVs. Video 2 shows how to use the tool for segmenting SVs from scratch. Video 3 shows a tomogram and segmentation of a cultured mouse neuron synapse, highlighting various neuronal structures.

Data availability

The training dataset annotation, along with the corresponding raw data, binned tomograms, and processed tomograms, has been deposited to EMPIAR (Iudin et al., 2023) under accession code EMPIAR-12195. CryoVesNet is available on GitHub at https://github.com/Zuber-group/CryoVesNet.

Acknowledgments

This work was supported by the Swiss National Science Foundation (grant number 179520 to B. Zuber), ERA-NET NEURON (NEURON-119 to B. Zuber), and the University of Bern Research Foundation (to Ioan Iacovache).

Author contributions: A. Khosrozadeh: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing - original draft, Writing - review & editing, R. Seeger: Data curation, Formal analysis, Investigation, Validation, Visualization, Writing - original draft, G. Witz: Methodology, Software, J. Radecke: Investigation, Resources, Writing - review & editing, J.B. Sørensen: Resources, Writing - review & editing, B. Zuber: Conceptualization, Data curation, Formal analysis, Funding acquisition, Methodology, Project administration, Software, Supervision, Writing - review & editing.

References

Bäuerlein

,

F.J.

,

J.C.

Pastor-Pareja

, and

R.

Fernández-Busnadiego

.

2021

.

Cryo-electron tomography of native Drosophila tissues vitrified by plunge freezing

.

bioRxiv

.

https://doi.org/10.1101/2021.04.14.437159

(Preprint posted April 14, 2021)

.

Google Scholar

Bäuerlein

,

F.J.B.

, and

W.

Baumeister

.

2021

.

Towards visual proteomics at high resolution

.

J. Mol. Biol.

433

:

167187

.

https://doi.org/10.1016/j.jmb.2021.167187

Google Scholar

Crossref

PubMed

Benjamini

,

Y.

, and

Y.

Hochberg

.

1995

.

Controlling the false discovery rate: A practical and powerful approach to multiple testing

.

J. R. Stat. Soc. Ser. B Stat. Methodol.

57

:

289

–

300

.

https://doi.org/10.1111/j.2517-6161.1995.tb02031.x

Google Scholar

Crossref

Buchholz

,

T.-O.

,

M.

Jordan

,

G.

Pigino

, and

F.

Jug

.

2019

.

Cryo-CARE: Content-aware image restoration for cryo-transmission electron microscopy data

. In

2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019)

.

IEEE

,

Piscataway, NJ, USA

.

502

–

506

.

https://doi.org/10.1109/ISBI.2019.8759519

Google Scholar

Crossref

Chen

,

M.

,

W.

Dai

,

S.Y.

Sun

,

D.

Jonasch

,

C.Y.

He

,

M.F.

Schmid

,

W.

Chiu

, and

S.J.

Ludtke

.

2017

.

Convolutional neural networks for automated annotation of cellular cryo-electron tomograms

.

Nat. Methods

.

14

:

983

–

985

.

https://doi.org/10.1038/nmeth.4405

Google Scholar

Crossref

PubMed

Çiçek

,

Ö.

,

A.

Abdulkadir

,

S.S.

Lienkamp

,

T.

Brox

, and

O.

Ronneberger

.

2016

.

3D U-Net: Learning dense volumetric segmentation from sparse annotation

. In

Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016

.

19th International Conference

,

Athens, Greece

,

October 17–21, 2016

.

Part II 19

.

Springer

,

New York, NY, USA

.

424

-

432

.

Google Scholar

Crossref

de Teresa-Trueba

,

I.

,

S.K.

Goetz

,

A.

Mattausch

,

F.

Stojanovska

,

C.E.

Zimmerli

,

M.

Toro-Nahuelpan

,

D.W.C.

Cheng

,

F.

Tollervey

,

C.

Pape

,

M.

Beck

, et al.

2023

.

Convolutional networks for supervised mining of molecular patterns within cellular context

.

Nat. Methods

.

20

:

284

–

294

.

https://doi.org/10.1038/s41592-022-01746-2

Google Scholar

Crossref

PubMed

Fernández-Busnadiego

,

R.

,

B.

Zuber

,

U.E.

Maurer

,

M.

Cyrklaff

,

W.

Baumeister

, and

V.

Lucic

.

2010

.

Quantitative analysis of the native presynaptic cytomatrix by cryoelectron tomography

.

J. Cell Biol.

188

:

145

–

156

.

https://doi.org/10.1083/jcb.200908082

Google Scholar

Crossref

PubMed

Galaz-Montoya

,

J.G.

,

J.

Flanagan

,

M.F.

Schmid

, and

S.J.

Ludtke

.

2015

.

Single particle tomography in EMAN2

.

J. Struct. Biol.

190

:

279

–

290

.

https://doi.org/10.1016/j.jsb.2015.04.016

Google Scholar

Crossref

PubMed

Ginger

,

L.

,

J.

Malsam

,

A.F.

Sonnen

,

D.

Morado

,

A.

Scheutzow

,

T.H.

Söllner

, and

J.A.G.

Briggs

.

2020

.

Arrangements of proteins at reconstituted synaptic vesicle fusion sites depend on membrane separation

.

FEBS Lett.

594

:

3450

–

3463

.

https://doi.org/10.1002/1873-3468.13916

Google Scholar

Crossref

PubMed

Held

,

R.G.

,

J.

Liang

, and

A.T.

Brunger

.

2024

.

Nanoscale architecture of synaptic vesicles and scaffolding complexes revealed by cryo-electron tomography

.

Proc. Natl. Acad. Sci. USA

.

121

:e2403136121.

https://doi.org/10.1073/pnas.2403136121

Google Scholar

Himmelstein

,

D.S.

,

V.

Rubinetti

,

D.R.

Slochower

,

D.

Hu

,

V.S.

Malladi

,

C.S.

Greene

, and

A.

Gitter

.

2019

.

Open collaborative writing with Manubot

.

PLOS Comput. Biol.

15

:e1007128.

https://doi.org/10.1371/journal.pcbi.1007128

Google Scholar

Hoffmann

,

P.C.

,

S.L.

Giandomenico

,

I.

Ganeva

,

M.R.

Wozny

,

M.

Sutcliffe

,

M.A.

Lancaster

, and

W.

Kukulski

.

2021

.

Electron cryo-tomography reveals the subcellular architecture of growing axons in human brain organoids

.

Elife

.

10

:e70269.

https://doi.org/10.7554/eLife.70269

Google Scholar

Imbrosci

,

B.

,

D.

Schmitz

, and

M.

Orlando

.

2022

.

Automated detection and localization of synaptic vesicles in electron microscopy images

.

eNeuro

.

9

:

ENEURO.0400-20.2021

.

https://doi.org/10.1523/ENEURO.0400-20.2021

Google Scholar

Crossref

PubMed

Iudin

,

A.

,

P.K.

Korir

,

S.

Somasundharam

,

S.

Weyand

,

C.

Cattavitello

,

N.

Fonseca

,

O.

Salih

,

G.J.

Kleywegt

, and

A.

Patwardhan

.

2023

.

EMPIAR: The electron microscopy public image archive

.

Nucleic Acids Res.

51

:

D1503

–

D1511

.

https://doi.org/10.1093/nar/gkac1062

Google Scholar

Crossref

PubMed

Kaltdorf

,

K.V.

,

K.

Schulze

,

F.

Helmprobst

,

P.

Kollmannsberger

,

T.

Dandekar

, and

C.

Stigloher

.

2017

.

FIJI Macro 3D ART VeSElecT: 3D automated reconstruction tool for vesicle structures of electron tomograms

.

PLOS Comput. Biol.

13

:e1005317.

https://doi.org/10.1371/journal.pcbi.1005317

Google Scholar

Kremer

,

J.R.

,

D.N.

Mastronarde

, and

J.R.

McIntosh

.

1996

.

Computer visualization of three-dimensional image data using IMOD

.

J. Struct. Biol.

116

:

71

–

76

.

https://doi.org/10.1006/jsbi.1996.0013

Google Scholar

Crossref

PubMed

Lamm

,

L.

,

R.D.

Righetto

,

W.

Wietrzynski

,

M.

Pöge

,

A.

Martinez-Sanchez

,

T.

Peng

, and

B.D.

Engel

.

2022

.

MemBrain: A deep learning-aided pipeline for detection of membrane proteins in cryo-electron tomograms

.

Comput. Methods Programs Biomed.

224

:

106990

.

https://doi.org/10.1016/j.cmpb.2022.106990

Google Scholar

Crossref

PubMed

Lamm

,

L.

,

S.

Zufferey

,

R.D.

Righetto

,

W.

Wietrzynski

,

K.A.

Yamauchi

,

A.

Burt

,

Y.

Liu

,

H.

Zhang

,

A.

Martinez-Sanchez

, and

S.

Ziegler

.

2024

.

MemBrain v2: An end-to-end tool for the analysis of membranes in cryo-electron tomography

.

bioRxiv

.

https://doi.org/10.1101/2024.01.05.574336

(Preprint posted January 05, 2024)

.

Google Scholar

Last

,

M.G.F.

,

L.

Abendstein

,

L.M.

Voortman

, and

T.H.

Sharp

.

2024

.

Ais: Streamlining segmentation of cryo-electron tomography datasets

.

bioRxiv

.

https://doi.org/10.1101/2024.04.04.586917

(Preprint posted April 04, 2024)

.

Google Scholar

Liu

,

Y.T.

,

H.

Zhang

,

H.

Wang

,

C.L.

Tao

,

G.Q.

Bi

, and

Z.H.

Zhou

.

2022

.

Isotropic reconstruction for electron tomography with deep learning

.

Nat. Commun.

13

:

6482

.

https://doi.org/10.1038/s41467-022-33957-8

Google Scholar

Crossref

PubMed

Lučić

,

V.

,

R.

Fernández-Busnadiego

,

U.

Laugks

, and

W.

Baumeister

.

2016

.

Hierarchical detection and analysis of macromolecular complexes in cryo-electron tomograms using Pyto software

.

J. Struct. Biol.

196

:

503

–

514

.

https://doi.org/10.1016/j.jsb.2016.10.004

Google Scholar

Crossref

PubMed

Lucić

,

V.

,

F.

Förster

, and

W.

Baumeister

.

2005

.

Structural studies by electron tomography: From cells to molecules

.

Annu. Rev. Biochem.

74

:

833

–

865

.

https://doi.org/10.1146/annurev.biochem.73.011303.074112

Google Scholar

Crossref

PubMed

Martinez-Sanchez

,

A.

,

I.

Garcia

,

S.

Asano

,

V.

Lucic

, and

J.J.

Fernandez

.

2014

.

Robust membrane detection based on tensor voting for electron tomography

.

J. Struct. Biol.

186

:

49

–

61

.

https://doi.org/10.1016/j.jsb.2014.02.015

Google Scholar

Crossref

PubMed

Moebel

,

E.

,

A.

Martinez-Sanchez

,

L.

Lamm

,

R.D.

Righetto

,

W.

Wietrzynski

,

S.

Albert

,

D.

Larivière

,

E.

Fourmentin

,

S.

Pfeffer

,

J.

Ortiz

, et al.

2021

.

Deep learning improves macromolecule identification in 3D cellular cryo-electron tomograms

.

Nat. Methods

.

18

:

1386

–

1394

.

https://doi.org/10.1038/s41592-021-01275-4

Google Scholar

Crossref

PubMed

Ni

,

T.

,

T.

Frosio

,

L.

Mendonça

,

Y.

Sheng

,

D.

Clare

,

B.A.

Himes

, and

P.

Zhang

.

2022

.

High-resolution in situ structure determination by cryo-electron tomography and subtomogram averaging using emClarity

.

Nat. Protoc.

17

:

421

–

444

.

https://doi.org/10.1038/s41596-021-00648-5

Google Scholar

Crossref

PubMed

Obr

,

M.

,

W.J.H.

Hagen

,

R.A.

Dick

,

L.

Yu

,

A.

Kotecha

, and

F.K.M.

Schur

.

2022

.

Exploring high-resolution cryo-ET and subtomogram averaging capabilities of contemporary DEDs

.

J. Struct. Biol.

214

:

107852

.

https://doi.org/10.1016/j.jsb.2022.107852

Google Scholar

Crossref

PubMed

Radecke

,

J.

,

R.

Seeger

,

A.

Kádková

,

U.

Laugks

,

A.

Khosrozadeh

,

K.N.

Goldie

,

V.

Lučić

,

J.B.

Sørensen

, and

B.

Zuber

.

2023

.

Morphofunctional changes at the active zone during synaptic vesicle exocytosis

.

EMBO Rep.

24

:e55719.

https://doi.org/10.15252/embr.202255719

Google Scholar

Ronneberger

,

O.

,

P.

Fischer

, and

T.

Brox

.

2015

.

U-net: Convolutional networks for biomedical image segmentation

. In

Medical image computing and computer-assisted intervention–MICCAI 2015

.

18th International Conference

.

Munich, Germany

,

October 5–9, 2015

.

Part III 18

.

Springer

,

New York, NY, USA

.

234

–

241

.

Google Scholar

Crossref

Schaffer

,

M.

,

S.

Pfeffer

,

J.

Mahamid

,

S.

Kleindiek

,

T.

Laugks

,

S.

Albert

,

B.D.

Engel

,

A.

Rummel

,

A.J.

Smith

,

W.

Baumeister

, and

J.M.

Plitzko

.

2019

.

A cryo-FIB lift-out technique enables molecular-resolution cryo-ET within native Caenorhabditis elegans tissue

.

Nat. Methods

.

16

:

757

–

762

.

https://doi.org/10.1038/s41592-019-0497-5

Google Scholar

Crossref

PubMed

Seabold

,

S.

, and

J.

Perktold

.

2010

.

Statsmodels: Econometric and statistical modeling with python

.

SciPy

.

7

:

1

.

https://doi.org/10.25080/Majora-92bf1922-011

Google Scholar

Crossref

Simonyan

,

K.

, and

A.

Zisserman

.

2014

.

Very deep convolutional networks for large-scale image recognition

.

arXiv

.

https://doi.org/10.48550/arXiv.1409.1556

(Preprint posted February 19, 2022)

.

Google Scholar

Szegedy

,

C.

,

S.

Ioffe

,

V.

Vanhoucke

, and

A.

Alemi

.

2017

.

Inception-v4, inception-resnet and the impact of residual connections on learning

.

arXiv

.

https://doi.org/10.48550/arXiv.1602.07261

(Preprint posted February 23, 2016)

.

Google Scholar

Szegedy

,

C.

,

W.

Liu

,

Y.

Jia

,

P.

Sermanet

,

S.

Reed

,

D.

Anguelov

,

D.

Erhan

,

V.

Vanhoucke

, and

A.

Rabinovich

.

2015

.

Going deeper with convolutions

.

arXiv

.

https://doi.org/10.48550/arXiv.1409.4842

(Preprint posted September 17, 2014)

.

Google Scholar

Tao

,

C.L.

,

Y.T.

Liu

,

R.

Sun

,

B.

Zhang

,

L.

Qi

,

S.

Shivakoti

,

C.L.

Tian

,

P.

Zhang

,

P.M.

Lau

,

Z.H.

Zhou

, and

G.Q.

Bi

.

2018

.

Differentiation and characterization of excitatory and inhibitory synapses by cryo-electron tomography and correlative microscopy

.

J. Neurosci.

38

:

1493

–

1510

.

https://doi.org/10.1523/JNEUROSCI.1548-17.2017

Google Scholar

Crossref

PubMed

Tegunov

,

D.

,

L.

Xue

,

C.

Dienemann

,

P.

Cramer

, and

J.

Mahamid

.

2021

.

Multi-particle cryo-EM refinement with M visualizes ribosome-antibiotic complex at 3.5 Å in cells

.

Nat. Methods

.

18

:

186

–

193

.

https://doi.org/10.1038/s41592-020-01054-7

Google Scholar

Crossref

PubMed

van der Walt

,

S.

,

J.L.

Schonberger

,

J.

Nunez-Iglesias

,

F.

Boulogne

,

J.D.

Warner

,

N.

Yager

,

E.

Gouillart

,

T.

Yu

, and

scikit-image contributors

.

2014

.

scikit-image: Image processing in Python

.

PeerJ

.

2

:e453.

https://doi.org/10.7717/peerj.453

Google Scholar

Virtanen

,

P.

,

R.

Gommers

,

T.E.

Oliphant

,

M.

Haberland

,

T.

Reddy

,

D.

Cournapeau

,

E.

Burovski

,

P.

Peterson

,

W.

Weckesser

,

J.

Bright

, et al.

2020

.

Author correction: SciPy 1.0: Fundamental algorithms for scientific computing in Python

.

Nat. Methods

.

17

:

352

.

https://doi.org/10.1038/s41592-020-0772-5

Google Scholar

Crossref

PubMed

Weigert

,

M.

,

U.

Schmidt

,

T.

Boothe

,

A.

Müller

,

A.

Dibrov

,

A.

Jain

,

B.

Wilhelm

,

D.

Schmidt

,

C.

Broaddus

,

S.

Culley

, et al.

2018

.

Content-aware image restoration: Pushing the limits of fluorescence microscopy

.

Nat. Methods

.

15

:

1090

–

1097

.

https://doi.org/10.1038/s41592-018-0216-7

Google Scholar

Crossref

PubMed

Yekutieli

,

D.

, and

Y.

Benjamini

.

1999

.

Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics

.

J. Stat. Plan. Inference

.

82

:

171

–

196

.

https://doi.org/10.1016/S0378-3758(99)00041-5

Google Scholar

Crossref

Zhou

,

L.

,

C.

Yang

,

W.

Gao

,

T.

Perciano

,

K.M.

Davies

, and

N.K.

Sauter

.

2023

.

A machine learning pipeline for membrane segmentation of cryo-electron tomograms

.

J. Comput. Sci.

66

:

101904

.

https://doi.org/10.1016/j.jocs.2022.101904

Google Scholar

Crossref

Zuber

,

B.

, and

V.

Lučić

.

2019

.

Molecular architecture of the presynaptic terminal

.

Curr. Opin. Struct. Biol.

54

:

129

–

138

.

https://doi.org/10.1016/j.sbi.2019.01.008

Google Scholar

Crossref

PubMed

Zuber

,

B.

, and

V.

Lučić

.

2022

.

Neurons as a model system for cryo-electron tomography

.

J. Struct. Biol. X

.

6

:

100067

.

https://doi.org/10.1016/j.yjsbx.2022.100067

Google Scholar

Crossref

PubMed

Author notes

Disclosures: The authors declare no competing interests exist.

2024

Khosrozadeh et al.

This article is distributed under the terms of an Attribution–Noncommercial–Share Alike–No Mirror Sites license for the first six months after the publication date (see http://www.rupress.org/terms/). After six months it is available under a Creative Commons License (Attribution–Noncommercial–Share Alike 4.0 International license, as described at https://creativecommons.org/licenses/by-nc-sa/4.0/).

CryoVesNet: A dedicated framework for synaptic vesicle segmentation in cryo-electron tomograms

Introduction

Results

Overview of training and test paradigm

U-Net segmentation workflow

Sphericalization and radial profile-based refinement

Outlier detection and refinement

Expanding segmentation scope

Performance

Comparison with other methods

Generalization and interactive tool usage

Downstream analysis and application

Discussion

Synaptic vesicles: Molecular interactions and functions

CryoVesNet: Automatic vesicle segmentation in Cryo-ET

Conclusion

Materials and methods

Cryo-electron tomography datasets

Manual segmentation and automatic interboundary segment detection

Train and validation set generation

Network architecture and training procedure

Probability map construction

Global thresholding

Adaptative local thresholding

Segmentation refinement using radial profile

Outlier detection and refinement

Criteria for spherical vesicle selection

Method comparison

Interactive vesicle segmentation tool

Evaluation metrics

Voxel-wise evaluation

Vesicle diameter and position deviation

Statistical comparison

Computational setup

Manuscript preparation

Online supplemental material

Data availability

Acknowledgments

References

Author notes

Supplementary data

Suggested Content

Email alerts