Category:
Diffusion for World Modeling: Visual Details Matter in Atari
【Diffusion勉強会】Diffusion for World Modeling: Visual Details Matter in Atari by @DeepLe more
Deep Researcher with Test-Time Diffusion
【DL輪読会】”Deep Researcher with Test-Time Diffusion” by @DeepLearning2023
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
【DL輪読会】 V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and P more
Between Circuits and Chomsky: Pre- pretraining on Formal Languages Imparts Linguistic Bias
【DL輪読会】Between Circuits and Chomsky: Pre- pretraining on Formal Languages Imparts Lin more
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
【DL輪読会】V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Pl more
Energy-Based Transformers are Scalable Learners and Thinkers
【DL輪読会】Energy-Based Transformers are Scalable Learners and Thinkers by @DeepLearning2 more
“SmolVLA” A vision-language-action model for affordable and efficient robotics
【DL輪読会】“SmolVLA” A vision-language-action model for affordable and efficient robotics more
Instruction Tuning With Loss Over Instructions
【DL輪読会】Instruction Tuning With Loss Over Instructions by @DeepLearning2023
Robust Function-Calling for On-Device Language Model via Function Masking
【DL輪読会】Robust Function-Calling for On-Device Language Model via Function Masking by @ more