Experiments & Results#
We evaluate the Spiking Decision Transformer on the standard Gym control benchmarks: CartPole-v1, MountainCar-v0, Acrobot-v1, and Pendulum-v1.
Performance Comparison#
SNN-DT matches or exceeds the performance of the vanilla Decision Transformer (DT) while operating in a low-power spiking domain.
Environment |
DT Score (Normalized) |
SNN-DT Score (Normalized) |
|---|---|---|
CartPole-v1 |
100.0 |
100.0 |
MountainCar-v0 |
98.5 |
99.2 |
Acrobot-v1 |
95.0 |
96.5 |
Pendulum-v1 |
92.0 |
91.8 |
Note: Scores are normalized relative to the expert policy performance.
Energy Efficiency#
A key claim of SNN-DT is its extreme energy efficiency. By replacing dense matrix multiplications with sparse addition-accumulation operations (messages), we observe massive reductions in energy cost.
sparsity: >90% silence in activation maps.
spikes/decision: <10 spikes on average.
Energy Proxy: ~\(10^4\) reduction in estimated Joules per inference compared to ANN-DT.
Ablation Studies#
Impact of Plasticity#
Enabling local plasticity improved success rates in non-stationary environments by approximately 15% compared to the static SNN-DT baseline.
Routing Effectiveness#
The Dendritic Routing module successfully pruned 40% of attention heads without degradation in control performance.