The Sound of Water: Inferring Physical Properties from Pouring Liquids

Open Access
Authors
Publication date 2025
Book title ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Book subtitle Hyderabad, India, 6-11 April 2025
ISBN
  • 9798350368758
ISBN (electronic)
  • 9798350368741
Event 2025 IEEE International Conference on Acoustics, Speech and Signal Processing
Pages (from-to) 736-740
Number of pages 5
Publisher Piscataway, NJ: IEEE
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We study the connection between audio-visual observations and the underlying physics of a mundane yet intriguing everyday activity: pouring liquids. Given only the sound of liquid pouring into a container, our objective is to automatically infer physical properties such as the liquid level, the shape and size of the container, the pouring rate, and the time to fill. To this end, we: (i) show in theory that these properties can be determined from the fundamental frequency (pitch); (ii) train a pitch detection model with supervision from simulated data and visual data with a physics-inspired objective; (iii) introduce a new large dataset of real pouring videos for a systematic study; (iv) show that the trained model can indeed infer these physical properties for real data; and finally, (v) we demonstrate strong generalization to various container shapes, other datasets, and in-the-wild YouTube videos. Our work presents a keen understanding of a narrow yet rich problem at the intersection of acoustics, physics, and learning. It opens up applications to enhance multisensory perception in robotic pouring.
Document type Conference contribution
Language English
Published at https://doi.org/10.1109/ICASSP49660.2025.10889950
Other links https://www.proceedings.com/81086.html
Downloads
Permalink to this page
Back