Follow
Zihui Xue
Title
Cited by
Cited by
Year
What makes multi-modal learning better than single (provably)
Y Huang, C Du, Z Xue, X Chen, H Zhao, L Huang
Advances in Neural Information Processing Systems 34, 10944-10956, 2021
2122021
On feature decorrelation in self-supervised learning
T Hua, W Wang, Z Xue, S Ren, Y Wang, H Zhao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
1872021
Co-advise: Cross inductive bias distillation
S Ren, Z Gao, T Hua, Z Xue, Y Tian, S He, H Zhao
Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022
442022
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives
K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
412024
Dynamic multimodal fusion
Z Xue, R Marculescu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
352023
Multimodal knowledge expansion
Z Xue, S Ren, Z Gao, H Zhao
Proceedings of the IEEE/CVF International Conference on Computer Vision, 854-863, 2021
252021
The modality focusing hypothesis: Towards understanding crossmodal knowledge distillation
Z Xue, Z Gao, S Ren, H Zhao
arXiv preprint arXiv:2206.06487, 2022
212022
Learning fine-grained view-invariant representations from unpaired ego-exo videos via temporal alignment
ZS Xue, K Grauman
Advances in Neural Information Processing Systems 36, 53688-53710, 2023
122023
Sugar: Efficient subgraph-level training via resource-aware graph partitioning
Z Xue, Y Yang, R Marculescu
IEEE Transactions on Computers, 2023
82023
Learning object state changes in videos: An open-world perspective
Z Xue, K Ashutosh, K Grauman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
72024
Egocentric video task translation
Z Xue, Y Song, K Grauman, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
72023
Put myself in your shoes: Lifting the egocentric perspective from exocentric videos
M Luo, Z Xue, A Dimakis, K Grauman
arXiv preprint arXiv:2403.06351, 2024
42024
Egocentric video task translation@ ego4d challenge 2022
Z Xue, Y Song, K Grauman, L Torresani
arXiv preprint arXiv:2302.01891, 2023
32023
Sampling graphlets of multiplex networks: a restricted random walk approach
S Jiao, Z Xue, X Chen, Y Xu
ACM Transactions on the Web (TWEB) 15 (4), 1-31, 2021
32021
Detours for navigating instructional videos
K Ashutosh, Z Xue, T Nagarajan, K Grauman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
22024
Anytime depth estimation with limited sensing and computation capabilities on mobile devices
Y Yang, Z Xue, R Marculescu
Conference on Robot Learning, 609-618, 2022
22022
Training-free robust multimodal learning via sample-wise jacobian regularization
Z Gao, S Ren, Z Xue, S Li, H Zhao
arXiv preprint arXiv:2204.02485, 2022
12022
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
C Chen, P Peng, A Baid, Z Xue, WN Hsu, D Harwarth, K Grauman
arXiv preprint arXiv:2406.09272, 2024
2024
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
Z Xue, M Luo, C Chen, K Grauman
arXiv preprint arXiv:2406.07754, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–19