投稿者: risa.murata
Diffusion for World Modeling: Visual Details Matter in Atari
【Diffusion勉強会】Diffusion for World Modeling: Visual Details Matter in Atari by @DeepLe more
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
【DL輪読会】V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Pl more
Robust Function-Calling for On-Device Language Model via Function Masking
【DL輪読会】Robust Function-Calling for On-Device Language Model via Function Masking by @ more
TaskBench: Benchmarking Large Language Models for Task Automation
【DL輪読会】TaskBench: Benchmarking Large Language Models for Task Automation by @DeepLear more
Learning to Reason without External Rewards
【DL輪読会】Learning to Reason without External Rewards by @DeepLearning2023
VisionZip: Longer is Better but Not Necessary in Vision Language Models [S. Yang+CVPR’25]
【DL輪読会】VisionZip: Longer is Better but Not Necessary in Vision Language Models [S. Ya more
Novelty Detection in Reinforcement Learning withWorld Models
【DL輪読会】Novelty Detection in Reinforcement Learning with World Models by @DeepLearning more
Programmatic Video Prediction Using Large Language Models
【DL輪読会】Programmatic Video Prediction Using Large Language Models by @DeepLearning2023