no. | title | author | journal | year | vol. | issue | page(s) | type |
1 | A Closer Look at Benchmarking Self-supervised Pre-training with Image Classification | Marks, Markus | | | 133 | 8 | p. 5013-5025 | article |
2 | ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning | Hao, Zhiwei | | | 133 | 8 | p. 5527-5543 | article |
3 | Advances in 3D Neural Stylization: A Survey | Chen, Yingshu | | | 133 | 8 | p. 5026-5061 | article |
4 | A Fast and Lightweight 3D Keypoint Detector | Yang, Chengzhuan | | | 133 | 8 | p. 5216-5237 | article |
5 | A2M2-Net: Adaptively Aligned Multi-scale Moment for Few-Shot Action Recognition | Gao, Zilin | | | 133 | 8 | p. 5363-5378 | article |
6 | Animal-CLIP: A Dual-Prompt Enhanced Vision-Language Model for Animal Action Recognition | Jing, Yinuo | | | 133 | 8 | p. 5062-5082 | article |
7 | An Information Theory-Inspired Strategy for Automated Network Pruning | Zheng, Xiawu | | | 133 | 8 | p. 5455-5482 | article |
8 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Ai, Hao | | | 133 | 8 | p. 4973-5012 | article |
9 | Autoregressive Temporal Modeling for Advanced Tracking-by-Diffusion | Nguyen, Pha | | | 133 | 8 | p. 5505-5526 | article |
10 | AvatarStudio: High-Fidelity and Animatable 3D Avatar Creation from Text | Zhang, Xuanmeng | | | 133 | 8 | p. 5178-5196 | article |
11 | BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning | Wu, Baoyuan | | | 133 | 8 | p. 5700-5787 | article |
12 | Bamboo: Building Mega-Scale Vision Dataset Continually with Human–Machine Synergy | Zhang, Yuanhan | | | 133 | 8 | p. 5806-5821 | article |
13 | CLIMS++: Cross Language Image Matching with Automatic Context Discovery for Weakly Supervised Semantic Segmentation | Xie, Jinheng | | | 133 | 8 | p. 5569-5588 | article |
14 | Correction: Consistent Prompt Tuning for Generalized Category Discovery | Yang, Muli | | | 133 | 8 | p. 5872-5881 | article |
15 | Creatively Upscaling Images with Global-Regional Priors | Qian, Yurui | | | 133 | 8 | p. 5197-5215 | article |
16 | C2RF: Bridging Multi-modal Image Registration and Fusion via Commonality Mining and Contrastive Learning | Tang, Linfeng | | | 133 | 8 | p. 5262-5280 | article |
17 | Data-Adaptive Weight-Ensembling for Multi-task Model Fusion | Tang, Anke | | | 133 | 8 | p. 5396-5412 | article |
18 | Diffusion-Enhanced Test-Time Adaptation with Text and Image Augmentation | Feng, Chun-Mei | | | 133 | 8 | p. 5083-5098 | article |
19 | DocScanner: Robust Document Image Rectification with Progressive Learning | Feng, Hao | | | 133 | 8 | p. 5343-5362 | article |
20 | D3T: Dual-Domain Diffusion Transformer in Triplanar Latent Space for 3D Incomplete-View CT Reconstruction | Liu, Xuhui | | | 133 | 8 | p. 5238-5261 | article |
21 | Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos | Verma, Dhruv | | | 133 | 8 | p. 5302-5325 | article |
22 | Few-Shot Referring Video Single- and Multi-Object Segmentation Via Cross-Modal Affinity with Instance Sequence Matching | Liu, Heng | | | 133 | 8 | p. 5610-5628 | article |
23 | Free Lunch to Meet the Gap: Intermediate Domain Reconstruction for Cross-Domain Few-Shot Learning | Zhang, Tong | | | 133 | 8 | p. 5118-5137 | article |
24 | Generalized Relative Pose and Scale from Affine Correspondences | Xu, Wanting | | | 133 | 8 | p. 5840-5856 | article |
25 | High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion | Zhang, Libo | | | 133 | 8 | p. 5788-5805 | article |
26 | HiLM-D: Enhancing MLLMs with Multi-scale High-Resolution Details for Autonomous Driving | Ding, Xinpeng | | | 133 | 8 | p. 5379-5395 | article |
27 | Image Captions are Natural Prompts for Training Data Synthesis | Lei, Shiye | | | 133 | 8 | p. 5435-5454 | article |
28 | Interaction Confidence Attention for Human–Object Interaction Detection | Zhang, Hong-Bo | | | 133 | 8 | p. 5629-5648 | article |
29 | IPAD: Iterative, Parallel, and Diffusion-Based Network for Scene Text Recognition | Yang, Xiaomeng | | | 133 | 8 | p. 5589-5609 | article |
30 | Local Concept Embeddings for Analysis of Concept Distributions in Vision DNN Feature Spaces | Mikriukov, Georgii | | | 133 | 8 | p. 5649-5699 | article |
31 | NU-AIR: A Neuromorphic Urban Aerial Dataset for Detection and Localization of Pedestrians and Vehicles | Iaboni, Craig | | | 133 | 8 | p. 5099-5117 | article |
32 | Paragraph-to-Image Generation with Information-Enriched Diffusion Model | Wu, Weijia | | | 133 | 8 | p. 5413-5434 | article |
33 | P2Object: Single Point Supervised Object Detection and Instance Segmentation | Chen, Pengfei | | | 133 | 8 | p. 5544-5568 | article |
34 | P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds | Nie, Jiahao | | | 133 | 8 | p. 5326-5342 | article |
35 | RGB-D Visual Perception for Occluded Scenes via Event Camera | Li, Siqi | | | 133 | 8 | p. 5483-5504 | article |
36 | Segment Anything in 3D with Radiance Fields | Cen, Jiazhong | | | 133 | 8 | p. 5138-5160 | article |
37 | Simplified Concrete Dropout - Improving the Generation of Attribution Masks for Fine-grained Classification | Korsch, Dimitri | | | 133 | 8 | p. 5857-5871 | article |
38 | SimZSL: Zero-Shot Learning Beyond a Pre-defined Semantic Embedding Space | Atigh, Mina Ghadimi | | | 133 | 8 | p. 5161-5177 | article |
39 | Supplementary Prompt Learning for Vision-Language Models | Zeng, Rongfei | | | 133 | 8 | p. 5822-5839 | article |
40 | SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting | Huang, Mingxin | | | 133 | 8 | p. 5281-5301 | article |