Tam: Temporal adaptive module for video recognition Z Liu, L Wang, W Wu, C Qian, T Lu Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 329 | 2021 |
Teinet: Towards an efficient architecture for video recognition Z Liu, D Luo, Y Wang, L Wang, Y Tai, C Wang, J Li, F Huang, T Lu Proceedings of the AAAI conference on artificial intelligence 34 (07), 11669 …, 2020 | 251 | 2020 |
Motionbert: A unified perspective on learning human motion representations W Zhu, X Ma, Z Liu, L Liu, W Wu, Y Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 125* | 2023 |
Dynamic sampling networks for efficient action recognition in videos YD Zheng*, Z Liu*, T Lu, L Wang (* denotes equal contribution) IEEE Transactions on Image Processing 29, 7970-7983, 2020 | 86 | 2020 |
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ... arXiv preprint arXiv:2305.05662, 2023 | 69 | 2023 |
Context-aware attention LSTM network for flood prediction Z Liu, W Xu, J Feng, S Palaiahnakote, T Lu 2018 24th international conference on pattern recognition (ICPR), 1301-1306, 2018 | 36 | 2018 |
Joint-modal label denoising for weakly-supervised audio-visual video parsing H Cheng, Z Liu, H Zhou, C Qian, W Wu, L Wang European Conference on Computer Vision, 431-448, 2022 | 26 | 2022 |
Controlllm: Augment language models with tools by searching on graphs Z Liu, Z Lai, Z Gao, E Cui, Z Li, X Zhu, L Lu, Q Chen, Y Qiao, J Dai, ... arXiv preprint arXiv:2310.17796, 2023 | 22 | 2023 |
Progressive attention on multi-level dense difference maps for generic event boundary detection J Tang, Z Liu, C Qian, W Wu, L Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 18 | 2022 |
DeeperForensics Challenge 2020 on real-world face forgery detection: Methods and results L Jiang, Z Guo, W Wu, Z Liu, Z Liu, CC Loy, S Yang, Y Xiong, W Xia, ... arXiv preprint arXiv:2102.09471, 2021 | 15 | 2021 |
LLMs Meet Multimodal Generation and Editing: A Survey Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ... arXiv preprint arXiv:2405.19334, 2024 | 5 | 2024 |
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang, Z Chen, X Zhu, L Lu, T Lu, ... arXiv preprint arXiv:2406.08394, 2024 | 4 | 2024 |
Context and temporal aware attention model for flood prediction Z Liu, Y Wu, Y Ding, J Feng, T Lu Advances in Multimedia Information Processing–PCM 2018: 19th Pacific-Rim …, 2018 | 4 | 2018 |
Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation H Cheng, Z Liu, W Wu, L Wang International Conference on Learning Representations 2023 (ICLR), 0 | 3* | |
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling Z Tian, Z Liu, R Yuan, J Pan, X Huang, Q Liu, X Tan, Q Chen, W Xue, ... arXiv preprint arXiv:2406.04321, 2024 | 2 | 2024 |
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions X Chi, Y Wang, A Cheng, P Fang, Z Tian, Y He, Z Liu, X Qi, J Pan, ... arXiv preprint arXiv:2407.20962, 2024 | 1 | 2024 |
VLG: General Video Recognition with Web Textual Knowledge J Lin, Z Liu, W Wang, W Wu, L Wang International Journal of Computer Vision (IJCV) 2024, 2022 | 1 | 2022 |
Submission to generic event boundary detection challenge@ cvpr 2022: Local context modeling and global boundary decoding approach J Tang, Z Liu, J Tan, C Qian, W Wu, L Wang arXiv preprint arXiv:2206.15268, 2022 | 1 | 2022 |
A Unified Pretraining Framework for Human Motion Analysis W Zhu, X Ma, Z Liu, L Liu, W Wu, Y Wang | | |
1 Details of optimizing network P AVV Parsing, H Cheng, Z Liu, H Zhou, C Qian, W Wu, L Wang | | |