[AI]■ STORY TIMELINE
BYTEDANCE: QA TRAINING BEATS TRANSCRIPTION FOR LLMS
ByteDance researchers found that training large language models through question-answering outperforms transcription methods for processing long, image-heavy documents. A 7B model trained this way matched larger models' performance on documents four times longer than its training data.
The Decoder+0m
ByteDance Seed shows that a 7B model can answer questions on long, image-heavy documents more reliably than much larger…