Category: dls-2024
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
[DL Reading Group] The Surprising Effectiveness of Test-Time Training for Abstract Reasoning by @D...
On The Role of Attention Heads in Large Language Model Safety
[DL Reading Group] On The Role of Attention Heads in Large Language Model Safety by @DeepLearning...
Intelligence at the Edge of Chaos
[DL Reading Group] Intelligence at the Edge of Chaos by @DeepLearning2023
π0: A Vision-Language-Action Flow Model for General Robot Control
[DL Reading Group] π0: A Vision-Language-Action Flow Model for General Robot Control by @DeepLe...
PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
[DL Reading Group] PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Naviga...
Transferable Tactile Transformers for Representation Learning Across Diverse Sensors and Tasks
[DL Reading Group] Transferable Tactile Transformers for Representation Learning Across Diverse...
BEYOND AUTOREGRESSION: FAST LLMS VIA SELF-DISTILLATION THROUGH TIME
[DL Reading Group] BEYOND AUTOREGRESSION: FAST LLMS VIA SELF-DISTILLATION THROUGH TIME by @DeepLe...
Mechanistic Interpretability for AI Safety: A Review
[DL Reading Group] Mechanistic Interpretability for AI Safety: A Review by @DeepLearning2023
Imagine yourself: Tuning-Free Personalized Image Generation
[DL Reading Group] Imagine yourself: Tuning-Free Personalized Image Generation by @DeepLearning20...