Welcome to the Intelligent Media Group at Shanghai Jiao Tong University (MediaX@SJTU)

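MediaX is part of the Cooperative Medianet Innovation Center (未来媒体网络协同创新中心) at Shanghai Jiao Tong University and focuses on frontier research at the intersection of computer vision, machine learning, and generative intelligent media. We work to advance multimodal media (2D/3D/4D) in generation, restoration and enhancement, reconstruction and compression, and quality assessment. Our mission is to build intelligent systems that can understand, model, and manipulate complex human-centric visual content, enabling high-quality, high-efficiency production of next-generation intelligent media.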

🎯 Research Directions

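Media Perception and Quality Assessment
Building a multi-dimensional intelligent quality assessment framework for UGC, PGC, and AIGC content. (F-Bench, FineVQ, etc.)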

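Video Restoration and Generation
High-quality video enhancement, controllable generation, and editing, with support for 4K/8K resolution. (StoryGen, Dr2, etc.)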

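3D/4D Reconstruction and Generation
Efficient representation and compression of immersive dynamic scenes, based on 3D Gaussian modeling and generative AI. (4DGC, VARFVV, etc.)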

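Intelligent Media Creation Platform
Building collaborative, multi-agent-driven systems for automated and interactive media production. (An intelligent enhancement and production platform for CCTV 4K/8K ultra-high-definition media)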

📢 Join Us

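We welcome PhD students, master's students, and undergraduate research assistants to join the team on an ongoing basis.
If you are passionate about intelligent media and generative AI, please send your CV and transcript to: mediax@sjtu.edu.cn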

Contact us: GitHub, WeChat

    🔥 News:

    [2025/10] One paper is accepted to JSTSP 2025
    [2025/9] First Prize, Intelligent Restoration and Enhancement Track, 4th Broadcast and Online Audio-Visual Artificial Intelligence Application Innovation Competition
    [2025/9] Two papers are accepted to NeurIPS 2025
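    [2025/9] The MediaX team's ultra-high-definition AI restoration technology supported the gala commemorating the 80th anniversary of victory in the War of Resistance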
    [2025/8] Second Place, ICCV 2025 MIPI Challenge – Detailed Image Quality Assessment
    [2025/7] Second Place, ICCV 2025 VQualA Challenge – GenAI-Bench AIGC Video Quality Assessment
    [2025/7] Two papers are accepted to ACM MM 2025
    [2025/6] Two papers are accepted to ICCV 2025
    [2025/5] One paper is accepted to ICML 2025
    [2025/3] Two papers are accepted to ICME 2025
    [2025/2] Two papers are accepted to CVPR 2025
    [2025/2] NTIRE 2025 XGC Quality Assessment Challenge Organizer
    [2025/1] One paper is accepted to JSAC 2025
    [2024/12] One paper is accepted to AAAI 2025
    [2024/7] One paper is accepted to TCSVT 2024
    [2024/7] One paper is accepted to ACM MM 2024
    [2024/6] One paper is accepted to ICIP 2024

      Publications

      [JSTSP'2025] MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration

      Lu Liu, Chunlei Cai, Shaocheng Shen, Jianfeng Liang, Weimin Ouyang, Tianxiao Ye, Jian Mao, Huiyu Duan, Jiangchao Yao, Xiaoyun Zhang, Qiang Hu, Guangtao Zhai

      IEEE Journal of Selected Topics in Signal Processing (JSTSP), 2025.

      [NeurIPS'2025] 4DGCPro: Efficient Hierarchical 4D Gaussian Compression for Progressive Volumetric Video Streaming

      Zihan Zheng, Zhenlong Wu, Houqiang Zhong, Yuan Tian, Ning Cao, Lan Xu, Jiangchao Yao, Xiaoyun Zhang, Qiang Hu, Wenjun Zhang

      Conference on Neural Information Processing Systems (NeurIPS), 2025.

      [NeurIPS'2025] Long-tailed Recognition with Model Rebalancing

      Jiaan Luo, Feng Hong, Qiang Hu, Xiaofeng Cao, Feng Liu, Jiangchao Yao

      Conference on Neural Information Processing Systems (NeurIPS), 2025.

      [MM'2025] BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video

      Yu Hong, Yize Wu, Zhehao Shen, Chengcheng Guo, Yuheng Jiang, Yingliang Zhang, Qiang Hu, Jingyi Yu, Lan Xu

      Proceedings of the ACM International Conference on Multimedia (MM), 2025.

      [MM'2025] MultiEgo: A Multi-View Egocentric Video Dataset For 4D Scene Reconstruction

      Bate Li, Houqiang Zhong, Zhengxue Cheng, Qiang Hu, Qiang Wang, Li Song, Wenjun Zhang

      Proceedings of the ACM International Conference on Multimedia (MM), 2025.

      [ICCV'2025] F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration

      Lu Liu, Huiyu Duan, Qiang Hu, Liu Yang, Chunlei Cai, Tianxiao Ye, Huayu Liu, Xiaoyun Zhang, Guangtao Zhai

      IEEE/CVF International Conference on Computer Vision (ICCV), 2025.

      [CVPR'2025] 4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video

      Qiang Hu, Zihan Zheng, Houqiang Zhong, Sihua Fu, Li Song, Xiaoyun Zhang, Guangtao Zhai, Yanfeng Wang

      IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.

      [CVPR'2025] FineVQ: Fine-Grained User Generated Content Video Quality Assessment

      Huiyu Duan, Qiang Hu, Jiarui Wang, Liu Yang, Zitong Xu, Lu Liu, Xiongkuo Min, Chunlei Cai, Tianxiao Ye, Xiaoyun Zhang, Guangtao Zhai

      IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.

      [JSAC'2025] VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming with Edge Computing

      Qiang Hu, Qihan He, Houqiang Zhong, Guo Lu, Xiaoyun Zhang, Guangtao Zhai, Yanfeng Wang

      IEEE Journal on Selected Areas in Communications (JSAC), 2025.

      [AAAI'2025] VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

      Qiang Hu, Houqiang Zhong, Zihan Zheng, Xiaoyun Zhang, Zhengxue Cheng, Li Song, Guangtao Zhai, Yanfeng Wang

      AAAI Conference on Artificial Intelligence (AAAI), 2025.

      [MM'2024] HPC: Hierarchical Progressive Coding Framework for Volumetric Video

      Zihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang

      Proceedings of the ACM International Conference on Multimedia (MM), 2024.

      [CVPR'2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

      Chang Liu, Haoning Wu, Yujie Zhong, Xiaoyun Zhang, Yanfeng Wang, Weidi Xie

      IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

      More on the publication page