#1 · cs.CV · cs.CL · cs.AI
1.3K cites · ▲ 28/7d
Gemini 2.0: A Family of Highly Capable Multimodal Models
Gemini Team, Oriol Vinyals, Jeff Dean · 4mo ago
We present Gemini 2.0, a family of multimodal models achieving state-of-the-art performance across text, image, video, and audio understanding with native tool use and long-context…
#2 · cs.RO · cs.CV
892 cites · ▲ 31/7d
World Models for Autonomous Driving: A Comprehensive Survey
Dragomir Anguelov, Yuning Chai · 5mo ago
This survey covers 340+ papers on world models applied to autonomous driving, categorizing approaches by architecture, training paradigm, and evaluation methodology, identifying ke…
#6 · cs.LG · cs.DC
445 cites · ▲ 19/7d
Efficient Inference on Consumer Hardware: Quantization Beyond 4-bit
Song Han, Ji Lin, William Dally · 4mo ago · MLSys 2026
We demonstrate 2-bit quantization of 70B+ parameter models with less than 1% quality loss through a novel mixed-precision scheme, enabling frontier-class inference on consumer GPUs…
#8 · cs.CL
356 cites · ▲ 15/7d
Scaling Laws for Neural Machine Translation Revisited
Angela Fan, Mike Lewis · 4mo ago · ACL 2026
We revisit scaling laws for machine translation and find that previously established power-law relationships break down above 100B parameters, requiring new architectural innovatio…