Track: Track1; Team name: PushparajD; Model: Graph Attention with self/neighbour-separated attention by Pdevadiga45 · Pull Request #355 · geometric-intelligence/topobench

Pdevadiga45 · 2026-06-14T04:38:35Z

Checklist

My pull request has a clear and explanatory title.
My pull request passes the Linting test.
I added appropriate unit tests and I made sure the code passes all unit tests.
My PR follows PEP8 guidelines.
My code is properly documented, using numpy docs conventions, and I made sure the documentation renders properly.
I linked to issues and PRs that are relevant to this PR.

Description

Adds GATE (Graph Attention with self/neighbour-separated attention) as a Track 1 graph backbone.

Nimrah Mustafa, Rebekka Burkholz. GATE: How to Keep Out Intrusive Neighbors. ICML 2024 - arXiv:2406.00418
code: RelationalML/GATE.

GATE is GATv2 with one targeted change (Eq. 4): the attention logit uses a separate learnable vector for a node's self-loop (a_t) versus its neighbours (a_s), so a node can parameterize its own self-attention independently and suppress aggregation from unrelated ("intrusive") neighbours. That's what makes it robust on heterophilic graphs - the axis GraphUniverse sweeps.

Files

topobench/nn/backbones/graph/gate.py : GATEConv (the attention layer) and GATE (stacked backbone). Docstrings cite the paper's Eq. 1/2/4.
configs/model/graph/gate.yaml : Hydra config on GNNWrapper + NoReadOut; one config serves both challenge tasks.
test/nn/backbones/graph/test_gate.py : 10 tests, 100% backbone coverage.
test/pipeline/test_pipeline.py : registers graph/gate for the CI MUTAG integration test.
2026_tdl_challenge/outputs/.../results.json : GraphUniverse grid output

Fidelity
The reference repo doesn't run under modern PyG (it ships a modified, older-PyG MessagePassing base), so I implemented a clean standalone version directly from the paper's equations, and validated it three ways:

an independent dense reimplementation of the GATE update (parity across configurations);
reduction to PyG's official GATv2Conv : in the shared-attention special case our layer matches PyG bit-for-bit (external reference for the routing/softmax/aggregation);
a property test of Thm. 4.3 - with zero-initialized attention the layer is exactly uniform mean-aggregation at init.

I follow the paper's GATE (Eq. 1/2/4), not the reference repo's optional omega gate or separate self-loop value transform, neither is part of the published model (the paper adds only the d-dimensional a_t). Init follows the paper: zero attention (Thm. 4.3) + random-orthogonal weights.

Initialization
I apply the paper's prescription where it bears on the GATE mechanism: zero attention vectors (Thm. 4.3 - no initial inductive bias, so the layer starts as uniform mean-aggregation) and random-orthogonal weight matrices. I deliberately do not reproduce the paper's full looks-linear (channel-mirroring) weight construction: zero-attention already delivers the at-init uniform-aggregation property that looks-linear is there to support, and orthogonal init captures the random-orthogonal specification it builds on. The mirroring would add implementation surface without changing the attention mechanism this PR contributes. This is documented in the module docstring.

TopoBench integration

Backbone returns node embeddings; the readout owns the classification head.
forward(x, edge_index, edge_weight=None, **kwargs) accepts the GNNWrapper arguments; hidden width matches the encoder so the wrapper's residual is consistent.
The config uses the fully-qualified _target_ (topobench.nn.backbones.graph.gate.GATE): the backbone auto-discovery loads files under a synthetic module name, which otherwise breaks PyG MessagePassing's inspector.

Cost. Attention is O(E·H·d); the benchmarked config (hidden 64, 4 heads, 2 layers) has 16,896 trainable parameters.

Results (72 runs, seeds 42/43/44). In-distribution community-detection accuracy 0.31–0.69 (mean 0.45; chance ≈ 0.05 over 20 communities); triangle-count MSE/triangles 0.015–3.80, all finite. Per-setting/per-seed/OOD values are in results.json. Run on CPU (no CUDA device) under WANDB_MODE=offline, so the optional W&B fields are empty; metrics are seeded and
device-independent.

On results.json generation. The shipped evaluation notebook can't run as-is (its integrity-check cell's stored hash doesn't match its own cells, so it aborts). Without modifying the notebook or utils.py, I called the functions it wraps i.e., run_challenge_grid + save_challenge_artifacts - which run the identical pipeline.

Issue

Track 1 entry for the TDL Challenge 2026.

Additional context

Python 3.11, torch 2.3.0. pre-commit (ruff-format, ruff, numpydoc-validation, standard hooks) passes; 10 unit tests at 100% backbone coverage; MUTAG pipeline test passes.

…t, cite Eq. 1/2/4)

Pdevadiga45 added 6 commits June 13, 2026 19:59

Add GATE backbone for Track 1

b9580e6

test(gate): add PyG GATv2 reduction and omega switch-off fidelity tests

478ac90

fix(gate): align to paper (drop omega/self-value, zero+orthogonal ini…

92b0f4b

…t, cite Eq. 1/2/4)

style(gate): apply ruff-format (pinned v0.9.9)

466b0ed

Add GATE benchmark results (GraphUniverse grid, 72 runs)

43fd41f

feat(gate): guard hidden_channels divisibility by heads

e44ec4b

gbg141 added the track-1-gnn 2026 Topological Deep Learning Challenge -- Track 1 GNNs label Jun 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Track: Track1; Team name: PushparajD; Model: Graph Attention with self/neighbour-separated attention#355

Track: Track1; Team name: PushparajD; Model: Graph Attention with self/neighbour-separated attention#355
Pdevadiga45 wants to merge 6 commits into
geometric-intelligence:mainfrom
Pdevadiga45:track1-gate

Pdevadiga45 commented Jun 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Pdevadiga45 commented Jun 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Description

Issue

Additional context

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Pdevadiga45 commented Jun 14, 2026 •

edited

Loading