先进制造业知识服务平台
国家科技图书文献中心机械分馆  工信部产业技术基础公共服务平台  国家中小企业公共服务示范平台

会议文集


文集名AAAI Technical Tracks (Computer Vision VI)
会议名39th AAAI Conference on Artificial Intelligence (AAAI-25), 37th Conference on Innovative Applications of Artificial Intelligence (IAAI-25), 15th Symposium on Educational Advances in Artificial Intelligence (EAAI-25)
中译名《第三十九届AAAI人工智能会议,第三十七届人工智能创新应用会议,第十五届人工智能教育进展讨论会,卷7-2》
机构Association for the Advancement of Artificial Intelligence (AAAI)
会议日期25 February - 4 March 2025
会议地点Philadelphia, Pennsylvania, USA
出版年2025
馆藏号358187


题名作者出版年
Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI AnalysisKunming Tang; Zhiguo Jiang; Jun Shi; Wei Wang; Haibo Wu; Yushan Zheng2025
Compressing Streamable Free-Viewpoint Videos to 0.1 MB per FrameLuyang Tang; Jiayu Yang; Rui Peng; Yongqi Zhai; Shihe Shen; Ronggang Wang2025
Sign-IDD: Iconicity Disentangled Diffusion for Sign Language ProductionShengeng Tang; Jiayi He; Dan Guo; Yanyan Wei; Feng Li; Richang Hong2025
BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous DrivingTao Tang; Dafeng Wei; Zhengyu Jia; Tian Gao; Changwei Cai; Chengkai Hou; Peng Jia; Kun Zhan; Haiyang Sun; Jingchen Fan; Yixing Zhao; Xiaodan Liang; Xianpeng Lang; Yang Wang2025
More Text, Less Point: Towards 3D Data-Efficient Point-Language UnderstandingYuan Tang; Xu Han; Xianzhi Li; Qiao Yu; Jinfeng Xu; Yixue Hao; Long Hu; Min Chen2025
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal UnderstandingYunlong Tang; Daiki Shimada; Jing Bi; Mingqian Feng; Hang Hua; Chenliang Xu2025
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with DiffusionYunlong Tang; Gen Zhan; Li Yang; Yiting Liao; Chenliang Xu2025
RAGG: Retrieval-Augmented Grasp Generation ModelZhenhua Tang; Bin Zhu; Yanbin Hao; Chong-Wah Ngo; Richang Hong2025
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction CycleZhenyu Tang; Junwu Zhang; Xinhua Cheng; Wangbo Yu; Chaoran Feng; Yatian Pang; Bin Lin; Li Yuan2025
From Representation Space to Prognostic Insights: Whole Slide Image Generation with Hierarchical Diffusion Model for Survival PredictionZhihao Tang; Xi Zhang; Chaozhuo Li2025
3D~2-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar ModelingZichen Tang; Hongyu Yang; Hanchen Zhang; Jiaxin Chen; Di Huang2025
Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly DetectionFenfang Tao; Guo-Sen Xie; Fang Zhao; Xiangbo Shu2025
Relieving Universal Label Noise for Unsupervised Visible-Infrared Person Re-Identification by Inferring from NeighborsXiao Teng; Long Lan; Dingyao Chen; Kele Xu; Nan Yin2025
Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton VideosHaitao Tian; Pierre Payeur2025
DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view InputQijian Tian; Xin Tan; Yuan Xie; Lizhuang Ma2025
Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT ReconstructionXuanyu Tian; Lixuan Chen; Qing Wu; Chenhe Du; Jingjing Shi; Hongjiang Wei; Yuyao Zhang2025
AI-generated Image Quality Assessment in Visual CommunicationYu Tian; Yixuan Li; Baoliang Chen; Hanwei Zhu; Shiqi Wang; Sam Kwong2025
ChatterBox: Multimodal Referring and Grounding with Chain-of-QuestionsYunjie Tian; Tianren Ma; Lingxi Xie; Qixiang Ye2025
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image GenerationXie Tianyidan; Rui Ma; Qian Wang; Xiaoqian Ye; Feixuan Liu; Ying Tai; Zhenyu Zhang; Lanjun Wang; Zili Yi2025
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4oTony Cheng Tong; Sirui He; Zhiwen Shao; Dit-Yan Yeung2025
1234