Home · Viewer · How It Works · Comparison · Developer Guide · WebGPU · WebGL · GitHub

Layout Comparison

How does BitZoom's layout compare to established graph layout algorithms?
Full report with methodology, all configurations, and limitations.

Why compare?

Graph layout is a solved problem in many contexts. Force-directed algorithms like ForceAtlas2 have decades of refinement. Dimensionality reduction methods like UMAP and t-SNE produce excellent 2D embeddings. BitZoom takes a different approach: it positions nodes by property similarity first, uses topology as a secondary signal, and produces a hierarchical zoom structure from stored coordinates.

The natural question is what you gain and what you give up. This comparison measures both topology preservation (do connected nodes end up nearby?) and property-similarity preservation (do nodes with similar attributes end up nearby?). Topology metrics favor force-directed methods; property metrics evaluate BitZoom's core design goal.

Which method should I use?

Method	Best when	Tradeoff
BitZoom	Property similarity matters more than connectivity. Interactive exploration, zoom levels, millisecond speed.	Weaker on sparse chain topologies.
ForceAtlas2	Topology IS the signal. Social networks, connectivity analysis, chain-like graphs.	Slower (minutes). No property awareness.
UMAP	Dimensionality reduction with interpretable embeddings. High-dimensional feature data.	Seconds, not milliseconds. Seed-dependent.
t-SNE	Revealing fine-grained local clusters. Best local topology preservation.	Requires perplexity tuning. Slow.

What we measured

EdgeLen: How close are connected nodes? Normalized Euclidean distance between edge endpoints. Lower = tighter topology.
TopoNbrP: Do a node's graph neighbors end up as its layout neighbors? Jaccard overlap at k=10. Higher = better topology fidelity.
PropNbrP: Do a node's most property-similar nodes end up as its layout neighbors? Jaccard overlap at k=10. Higher = better property grouping. Measured using per-node token sets (the same tokens BitZoom's MinHash operates on).
Silhouette: Do known communities form visible clusters? Score vs ground-truth labels, -1 to +1. Only available for Email-EU (42 departments).

We use k=10 neighbors as a balance between local structure fidelity and global layout quality.

The short version

On topology metrics, ForceAtlas2 and t-SNE lead. On property-similarity, the answer depends on whether properties correlate with graph connectivity. When they don't (MITRE, Synth Packages), BitZoom with property weights leads by 2.6-6x. When they do (BitZoom Source, where call edges track file structure), topology-based methods capture property structure incidentally and score higher.

Dataset	Nodes	Props	BitZoom	PropNbrP	TopoNbrP	Takeaway
MITRE	4,736	rich	α=0 wt	2.57x	0.37x	Properties ≠ topology; BitZoom leads
Synth Pkg	1,868	rich	α=0 wt	6.02x	0.31x	Properties ≠ topology; BitZoom leads
BZ Source	433	rich	α=0.5 wt gauss	0.95x	0.28x	Properties ≈ topology; near FA2 with gaussian
Email-EU	1,005	none	α=1.0	0.87x	0.87x	Edge-only; topology comparable
Facebook	4,039	none	α=1.0	0.79x	0.74x	Dense; smoothing works well
Power Grid	4,941	none	α=0.75	0.68x	0.01x	Sparse chains; smoothing limited

Ratios are BitZoom / ForceAtlas2. Higher = better for both NbrP columns.

Dataset by dataset

Email-EU: a dense communication network

1,005 researchers at a European institution, 16.7K emails, 42 departments as ground truth. Edge-only (no node properties beyond auto-generated tokens).

Layout	Time	EdgeLen	TopoNbrP	PropNbrP	Silhouette	Note
BitZoom α=0	2ms	0.466	0.006	0.007	-0.47	Near-random layout
BitZoom α=1.0	1ms	0.221	0.056	0.007	-0.29	Best BitZoom for topology
ForceAtlas2	54s	0.008	0.064	0.008	-0.40	Shortest edges
UMAP	11s	0.182	0.107	0.010	+0.01	Only positive silhouette
t-SNE	3s	0.154	0.109	0.009	-0.12	Highest TopoNbrP

PropNbrP is uniformly low (0.007-0.010) across all methods. Without real node properties, auto-generated tokens provide little differentiation. UMAP and t-SNE recover department structure best on topology metrics.

Facebook: dense ego networks

4,039 users, 88K friendship edges. Dense community structure.

Layout	Time	EdgeLen	TopoNbrP	PropNbrP	Note
BitZoom α=1.0	6ms	0.063	0.110	0.003	74% of FA2's TopoNbrP
ForceAtlas2	172s	0.011	0.150	0.003	Shortest edges
t-SNE	15s	0.071	0.176	0.003	Highest TopoNbrP

Dense ego-network structure responds well to topology smoothing. BitZoom at α=1.0 reaches 74% of ForceAtlas2's TopoNbrP. PropNbrP is uniformly low (edge-only dataset).

Power Grid: sparse chains

4,941 substations, 6.6K transmission lines. Average degree 2.7, diameter ~46.

Layout	Time	EdgeLen	TopoNbrP	PropNbrP	Note
BitZoom α=0.75	8ms	0.271	0.003	0.002	Best BitZoom; α=1.0 is worse
ForceAtlas2	152s	0.005	0.197	0.002	Traces chains via global forces
t-SNE	26s	0.178	0.041	0.002	Limited by sparse adjacency

ForceAtlas2 dominates. Its global optimization traces long chains (diameter ~46) that 5-pass local smoothing cannot reach. α=0.75 outperforms α=1.0 because pure topology with few passes oversmooths hubs while leaving chains unresolved.

MITRE ATT&CK: the property-similarity test

4,736 nodes (techniques, tactics, software, mitigations) with rich properties: platforms, kill chain phases, aliases. This is the dataset that tests BitZoom's core claim.

Layout	Time	EdgeLen	TopoNbrP	PropNbrP	Note
BitZoom α=0	10ms	0.461	0.001	0.007	No property weights
BitZoom α=0 wt	8ms	0.534	0.002	0.034	Property weights: best PropNbrP
BitZoom α=0.5 wt	8ms	0.482	0.002	0.034	Adding topology barely changes PropNbrP
ForceAtlas2	178s	0.203	0.004	0.013	Shorter edges; lower PropNbrP
t-SNE	23s	0.292	0.004	0.026	Second-highest PropNbrP

BitZoom with property weights (group=5, platforms=6, killchain=4) scores 2.6x higher than ForceAtlas2 on PropNbrP (0.034 vs 0.013). Without property weights, BitZoom's PropNbrP drops to 0.007 — the weights are what provide the signal. All methods score low on TopoNbrP because graph neighbors are often semantically different node types. t-SNE scores second on PropNbrP (0.026), likely because adjacency partially correlates with property similarity.

Synth Packages: designed group structure

1,868 synthetic packages, 4K co-reference edges. Properties: group, downloads, license, version, depcount. Edges are co-reference links, not property-based.

Layout	Time	EdgeLen	TopoNbrP	PropNbrP	Note
BitZoom α=0	2ms	0.470	0.003	0.010	No property weights
BitZoom α=0 wt	2ms	0.387	0.006	0.049	Property weights: best PropNbrP (6x FA2)
BitZoom α=0.5 wt	3ms	0.306	0.011	0.039	Topology trades PropNbrP for TopoNbrP
ForceAtlas2	151s	0.030	0.018	0.008	Shortest edges; low PropNbrP
t-SNE	9s	0.218	0.001	0.012	Low on both metrics

BitZoom with weights scores 6x higher than ForceAtlas2 on PropNbrP (0.049 vs 0.008). Graph connectivity and property similarity diverge: edges are co-reference links, not property-based. Increasing α from 0 to 0.5 trades PropNbrP for TopoNbrP, directly demonstrating the α tradeoff. Gaussian quantization shows negligible difference here (±2%), suggesting the post-blend distribution is approximately uniform.

BitZoom Source: when topology tracks properties

433 functions/methods/classes from this project's source code, 940 call edges. Properties: kind, file, lines, bytes, age.

Layout	Time	EdgeLen	TopoNbrP	PropNbrP	Note
BitZoom α=0 wt	1ms	0.420	0.019	0.172	Property weights, rank quant
BitZoom α=0.5 wt	2ms	0.267	0.032	0.179	Rank quant; topology helps
BitZoom α=0.5 wt gauss	1ms	0.301	0.035	0.198	Gaussian quant: +11% PropNbrP
ForceAtlas2	12s	0.031	0.127	0.208	High PropNbrP via adjacency
UMAP	13s	0.115	0.082	0.244	Highest PropNbrP
t-SNE	3s	0.278	0.048	0.204	High PropNbrP via adjacency

Topology-based methods score higher on PropNbrP than BitZoom with rank quantization (0.20-0.24 vs 0.17-0.18). Functions that call each other tend to share file and kind, so adjacency captures property structure incidentally. Switching to Gaussian quantization improves BitZoom's PropNbrP by 11% (0.179 → 0.198), reaching 95% of ForceAtlas2. Gaussian quantization preserves density structure rather than forcing uniform occupancy, which helps when similar nodes form tight clusters.

What each method is good at

Aspect	ForceAtlas2	UMAP / t-SNE	BitZoom
Edge length	Best (optimizes for this)	Moderate	Improves with α
Topology preservation	Strong; global forces	Best overall (t-SNE)	Comparable on dense graphs
Property grouping	Incidental; depends on adjacency	Moderate (via adjacency)	Best when props ≠ topology
Sparse / chain graphs	Strong (global forces)	Limited	Limited (local smoothing)
Speed	Minutes (O(n log n)/iter)	Seconds	Milliseconds (O(n))
Hierarchical zoom	No	No	14 levels from 4 bytes/node
Determinism	Seed-dependent	Seed-dependent	Fully deterministic

Rank vs Gaussian quantization

BitZoom supports two quantization modes. Rank quantization sorts nodes by position and assigns grid cells uniformly. Gaussian quantization uses fixed CDF boundaries, preserving density structure: tight clusters stay tight, sparse regions stay spread out.

Dataset	Config	Rank PropNbrP	Gauss PropNbrP	Change
BZ Source	α=0 wt	0.172	0.179	+4%
BZ Source	α=0.5 wt	0.179	0.198	+11%; closes gap to FA2 (0.208)
Synth Pkg	α=0 wt	0.049	0.050	+1%
Synth Pkg	α=0.5 wt	0.039	0.038	-2%

Gaussian quantization helps when the post-blend distribution has meaningful density variation (BZ Source: functions cluster by file). When the distribution is approximately uniform (Synth Packages), the two modes produce similar results. The effect is dataset-dependent, but Gaussian quantization never hurts substantially and can close the gap to topology-based methods by preserving cluster tightness.

Caveats

PropNbrP uses the same similarity BitZoom optimizes. Token-set Jaccard is both the ground-truth similarity and the signal BitZoom's MinHash approximates. This is partially circular. A fully independent property-similarity metric (e.g., domain-expert labels) would be stronger evidence.

Edge-only datasets show no property differentiation. Three of six datasets have no real node properties. PropNbrP is uniformly low across all methods on these graphs.

When adjacency correlates with properties, topology-based methods win PropNbrP too. On BitZoom Source, ForceAtlas2 and UMAP outscore BitZoom on PropNbrP because call-graph edges track file/kind similarity. BitZoom's advantage is specific to datasets where property structure differs from graph connectivity.

Hierarchical zoom is not measured. BitZoom derives 14 aggregation levels from 4 stored bytes per node. No other method produces a zoom hierarchy. This capability is not captured by any metric above.