Machine Perception
- Algorithms & Theory
- Climate & Sustainability
- Conferences & Events
- Data Management
- Data Mining & Modeling
- Distributed Systems & Parallel Computing
- Economics & Electronic Commerce
- Education Innovation
- General Science
- Generative AI
- Global
- Hardware & Architecture
- Health & Bioscience
- Human-Computer Interaction and Visualization
- Machine Intelligence
- Machine Perception
- Machine Translation
- Mobile Systems
- Natural Language Processing
- Networking
- Open Source Models & Datasets
- Photography
- Product
- Programs
- Quantum
- RAI-HCT Highlights
- Responsible AI
- Robotics
- Security, Privacy and Abuse Prevention
- Software Systems & Engineering
- Sound & Accoustics
- Speech Processing
- Year in Review
-
March 18, 2024
MELON: Reconstructing 3D objects from images with unknown poses -
March 14, 2024
Cappy: Outperforming and boosting large multi-task language models with a small scorer -
March 8, 2024
Health-specific embedding tools for dermatology and pathology -
February 22, 2024
VideoPrism: A foundational visual encoder for video understanding -
January 31, 2024
MobileDiffusion: Rapid text-to-image generation on-device -
December 19, 2023
VideoPoet: A large language model for zero-shot video generation -
December 15, 2023
StyleDrop: Text-to-image generation in any style -
November 21, 2023
Open sourcing Project Guideline: A platform for computer vision accessibility technology -
November 14, 2023
Scaling multimodal understanding to long videos -
October 9, 2023
SANPO: A Scene understanding, Accessibility, Navigation, Pathfinding, & Obstacle avoidance dataset -
September 28, 2023
DynIBaR: Space-time view synthesis from videos of dynamic scenes -
September 26, 2023
Google Research embarks on effort to map a mouse brain