Search results
Results: 2
Number of items: 2
-
Salehi, S., Dorkenwald, M., Thoker, F. M., Gavves, E., Snoek, C. G. M., & Asano, Y. M. (2025). SIGMA: Sinkhorn-Guided Masked Video Modeling. In A. Leonardis, E. Ricci, S. Roth, O. Russakovsky, T. Sattler, & G. Varol (Eds.), Computer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024 : proceedings (Vol. XXIV, pp. 293-312). (Lecture Notes in Computer Science; Vol. 15082). Springer. https://doi.org/10.1007/978-3-031-72691-0_17 -
Dorkenwald, M., Barazani, N., Snoek, C. G. M., & Asano, Y. M. (2024). PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2024 : Seattle, Washington, USA, 16-22 June 2024 : proceedings (pp. 13548-13558). IEEE Computer Society. https://doi.org/10.48550/arXiv.2402.08657, https://doi.org/10.1109/CVPR52733.2024.01286
Page of