DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs
DL輪読会 (DL Reading Group)