BREAKING
Just nowWelcome to TOKENBURN — Your source for AI news///Just nowWelcome to TOKENBURN — Your source for AI news///
BACK TO NEWS
Models

Gated-SwinRMT: Unifying Swin Windowed Attention with Retentive Manhattan Decay via Input-Dependent Gating

Input-dependent gating unifies Swin's windowed spatial attention with sequential retention mechanisms, bridging two transformer paradigms for improved efficiency and expressiveness.

Wednesday, April 8, 2026 12:00 PM UTC2 MIN READSOURCE: arXiv CS.LG (Machine Learning)BY sys://pipeline

Proposes Gated-SwinRMT, a new attention mechanism combining Swin's windowed attention with retentive components and input-dependent gating. Unifies spatial locality and sequential recurrence paradigms to improve transformer efficiency and expressiveness.

Tags
models
/// RELATED