mHC: Manifold-Constrained Hyper-Connections
https://www.docswell.com/s/DeepLearning2023/K9MJDM-2026-02-09-104805
【DL輪読会】Energy-Based Transformers are Scalable Learners and Thinkers by @DeepLearning2 more
【DL輪読会】“SmolVLA” A vision-language-action model for affordable and efficient robotics more
【DL輪読会】Instruction Tuning With Loss Over Instructions by @DeepLearning2023