Category: dls-2024
Do Differentiable Simulators Give Better Policy Gradients?
[DL Reading Group] Do Differentiable Simulators Give Better Policy Gradients? by @DeepLearning20…
DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs
[DL Reading Group] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantize…
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization (ACL 2024) + An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models (2024)
[DL Reading Group] RomanSetu: Efficiently unlocking multilingual capabilities of Large Language M…
Denoising Diffusion Bridge Models
[DL Reading Group] Denoising Diffusion Bridge Models by @DeepLearning2023
Classifier-Free Guidance is a Predictor-Corrector
[DL Reading Group] Classifier-Free Guidance is a Predictor-Corrector by @DeepLearning2023
Foundation Models for 3D Scene Understanding
Foundation Models for 3D Scene Understanding by @DeepLearning2023
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
Sorry, this entry is only available in Japanese.
What Do Language Models Learn in Context? The Structured Task Hypothesis
What Do Language Models Learn in Context? The Structured Task Hypothesis by @DeepLear…