An experimental sequence modeling engine that runs on constant memory.
Symphony replaces the standard Transformer KV-Cache. Instead of eating up more VRAM as your context grows, it uses a fixed, constant amount of memory.
- ASH-C (Memory Core): Compresses historical context into a fixed-size matrix. VRAM stays flat no matter how long the input text is.
- HEP-DNA (Pointer Core): Tracks token positions using a simple list of integer IDs instead of huge vectors. This allows 100% exact word-for-word copy-paste from past context.
- Plug and Play: Drops directly into the MLP layers of standard models. Currently tested and working on Qwen-1.5B and 7B.
- No Full Retraining: The base model stays completely frozen. We only train the newly injected memory layers.
- Infinite Context: Trained at 43k tokens to handle noise. Thanks to NTK rescaling, it can test at 100k+ tokens during inference without extra fine-tuning.
pip install torch transformers accelerate bitsandbytes matplotlib numpy- Generate Data: Create the long-range reasoning tasks.
python training/build_universal_dataset.py- Phase 4 (Alignment): Train on short context (512 tokens) so the model learns the basic memory math.
python training/titan_phase4_kaggle_train_fft_math.py- Outdated (Need Update)Phase 7 (Long Range): Stretch the context to 43k+ tokens to fine-tune the pointer network.
python training/titan_phase7_hysparse_training.py- ( "Fully Updated" )**Phase 7 Kaagle Script(All Combined Of core files ) (Long Range):
python training/titan_phase7_kaggle_hysparse.pyLook inside the logs/ folder to verify the model's performance.
Based on concepts from FHRR/VSAs, Pointer Networks, and State Space Models.
📄 Read the Latest Technical Paper (PDF)
(Updated with learned routing and dual-stream FHRR mechanics)
Previous Draft (Archived):
Jeevan Joshi (2026). Symphony: Constant-Memory Sequence Modeling via Holographic Recurrence and Coordinate Pointer Networks.
DOI: 10.5281/zenodo.20566771
@misc{joshi2026symphony,
author = {Joshi, Jeevan},
title = {Symphony: Constant-Memory Sequence Modeling via Holographic Recurrence and Coordinate Pointer Networks},
year = 2026,
publisher = {Zenodo},
doi = {10.5281/zenodo.20566771}
}MIT License.