先进制造业知识服务平台
国家科技图书文献中心机械分馆  工信部产业技术基础公共服务平台  国家中小企业公共服务示范平台

会议文集


文集名AAAI Technical Tracks (Computer Vision II)
会议名39th AAAI Conference on Artificial Intelligence (AAAI-25), 37th Conference on Innovative Applications of Artificial Intelligence (IAAI-25), 15th Symposium on Educational Advances in Artificial Intelligence (EAAI-25)
中译名《第三十九届AAAI人工智能会议,第三十七届人工智能创新应用会议,第十五届人工智能教育进展讨论会,卷3-2》
机构Association for the Advancement of Artificial Intelligence (AAAI)
会议日期25 February - 4 March 2025
会议地点Philadelphia, Pennsylvania, USA
出版年2025
馆藏号358179


题名作者出版年
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer NetworkXiang Fang; Wanlong Fang; Changshuo Wang; Daizong Liu; Keke Tang; Jianfeng Dong; Pan Zhou; Beibei Li2025
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger ScenesChaoran Feng; Wangbo Yu; Xinhua Cheng; Zhenyu Tang; Junwu Zhang; Li Yuan; Yonghong Tian2025
PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial AttacksChen Feng; Ziquan Liu; Zhuo Zhi; Ilija Bogunovic; Carsten Gerner-Beuerle; Miguel Rodrigues2025
VQA4CIR: Boosting Composed Image Retrieval with Visual Question AnsweringChun-Mei Feng; Yang Bai; Tao Luo; Zhen Li; Salman Khan; Wangmeng Zuo; Rick Siow Mong Goh; Yong Liu2025
PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose ManipulationDong Feng; Ping Guo; Encheng Peng; Mingmin Zhu; Wenhao Yu; Peng Wang2025
Residual Diffusion Deblurring Model for Single Image Defocus DeblurringHaoxuan Feng; Haohui Zhou; Tian Ye; Sixiang Chen; Lei Zhu2025
DiT4Edit: Diffusion Transformer for Image EditingKunyu Feng; Yue Ma; Bingyuan Wang; Chenyang Qi; Haozhe Chen; Qifeng Chen; Zeyu Wang2025
Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-LocalizationMingtao Feng; Fenghao Tian; Jianqiao Luo; Zijie Wu; Weisheng Dong; Yaonan Wang; Ajmal Mian2025
Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels FiltrationSiyang Feng; Huadeng Wang; Chu Han; Zhenbing Liu; Hualong Zhang; Rushi Lan; Xipeng Pan2025
HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text GenerationTonghui Feng; Chunsheng Yan; Qianru Wang; Jiangtao Cui; Xiaotian Qiao2025
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy PredictionYi Feng; Yu Han; Xijing Zhang; Tanghui Li; Yanting Zhang; Rui Fan2025
Simplifying Control Mechanism in Text-to-Image Diffusion ModelsZhida Feng; Li Chen; Yuenan Sun; Jiaxiang Liu; Shikun Feng2025
BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample MiningChenlin Fu; Yingying Zhu2025
Foundation Model Driven Appearance Extraction for Robust Multiple Object TrackingTeng Fu; Haiyang Yu; Ke Niu; Bin Li; Xiangyang Xue2025
Exploring Unbiased Deepfake Detection via Token-Level Shuffling and MixingXinghe Fu; Zhiyuan Yan; Taiping Yao; Shen Chen; Xi Li2025
MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform WatermarkKeke Gai; Dongjue Wang; Jing Yu; Mohan Wang; Liehuang Zhu; Qi Wu2025
DFDNet: Disentangling and Filtering Dynamics for Enhanced Video PredictionLianqiang Gan; Junyu Lai; Jingze Ju; Lianli Gao; Yi Bin2025
PNVC: Towards Practical INR-based Video CompressionGe Gao; Ho Man Kwan; Fan Zhang; David Bull2025
AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context LearningJun Gao; Qian Qiao; Tianxiang Wu; Zili Wang; Ziqiang Cao; Wenjie Li2025
TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal ConsiderationsMingze Gao; Jingyu Liu; Mingda Li; Jiangtao Xie; Qingbin Liu; Kevin Zhao; Xi Chen; Hui Xiong2025
1234