PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding
https://www.docswell.com/s/DeepLearning2023/ZQR4GN-2025-03-13-145315