BEYOND AUTOREGRESSION: FAST LLMS VIA SELF-DISTILLATION THROUGH TIME
[DL Reading Group] BEYOND AUTOREGRESSION: FAST LLMS VIA SELF-DISTILLATION THROUGH TIME by @DeepLe...
[DL Reading Group] Mechanistic Interpretability for AI Safety: A Review by @DeepLearning2023
[DL Reading Group] Imagine yourself: Tuning-Free Personalized Image Generation by @DeepLearning20...
[DL Reading Group] Improving Text Embeddings with Large Language Models by @DeepLearning2023
[DL Reading Group] Training-Free Diffusion Models Alignment with Sampling Demons by @DeepLearnin...
[DL Reading Group] Do Differentiable Simulators Give Better Policy Gradients? by @DeepLearning20...
[DL Reading Group] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantize...
[DL Reading Group] RomanSetu: Efficiently unlocking multilingual capabilities of Large Language M...