“SmolVLA” A vision-language-action model for affordable and efficient robotics
【DL輪読会】“SmolVLA” A vision-language-action model for affordable and efficient robotics more
【DL輪読会】“SmolVLA” A vision-language-action model for affordable and efficient robotics more
【DL輪読会】Instruction Tuning With Loss Over Instructions by @DeepLearning2023
【DL輪読会】Robust Function-Calling for On-Device Language Model via Function Masking by @ more
【DL輪読会】TaskBench: Benchmarking Large Language Models for Task Automation by @DeepLear more
【DL輪読会】Learning to Reason without External Rewards by @DeepLearning2023
【DL輪読会】VisionZip: Longer is Better but Not Necessary in Vision Language Models [S. Ya more
【DL輪読会】Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked more
【DL輪読会】Novelty Detection in Reinforcement Learning with World Models by @DeepLearning more
【DL輪読会】Programmatic Video Prediction Using Large Language Models by @DeepLearning2023
【DL輪読会】 WoMAP: World Models For Embodied Open-Vocabulary Object Localization by @Deep more