No. | Title | Author | Journal | Year | Vol. | Issue | Page(s) | Type |
1 | An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-training | Gao, Jin | | | 133 | 7 | p. 3918-3950 | article |
2 | A Solution to Co-occurrence Bias in Pedestrian Attribute Recognition: Theory, Algorithms, and Improvements | Zhou, Yibo | | | 133 | 7 | p. 4712-4726 | article |
3 | A Survey on Deep Stereo Matching in the Twenties | Tosi, Fabio | | | 133 | 7 | p. 4245-4276 | article |
4 | Attribute-Centric Compositional Text-to-Image Generation | Cong, Yuren | | | 133 | 7 | p. 4555-4570 | article |
5 | Bootstrapping Vision-Language Models for Frequency-Centric Self-Supervised Remote Physiological Measurement | Yue, Zijie | | | 133 | 7 | p. 4112-4133 | article |
6 | Camouflaged Object Detection with Adaptive Partition and Background Retrieval | Yin, Bowen | | | 133 | 7 | p. 4877-4893 | article |
7 | Consistent Prompt Tuning for Generalized Category Discovery | Yang, Muli | | | 133 | 7 | p. 4014-4041 | article |
8 | Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks | Cui, Shuang | | | 133 | 7 | p. 4134-4157 | article |
9 | Contrastive Decoupled Representation Learning and Regularization for Speech-Preserving Facial Expression Manipulation | Chen, Tianshui | | | 133 | 7 | p. 3822-3838 | article |
10 | Correction: Generalized Robot Vision-Language Model via Linguistic Foreground-Aware Contrast | Liu, Kangcheng | | | 133 | 7 | p. 4971 | article |
11 | Correction: SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy Labels | Kim, Daehwan | | | 133 | 7 | p. 4970 | article |
12 | CT3D++: Improving 3D Object Detection with Keypoint-Induced Channel-wise Transformer | Sheng, Hualian | | | 133 | 7 | p. 4817-4836 | article |
13 | Deep Convolutional Neural Network Enhanced Non-uniform Fast Fourier Transform for Undersampled MRI Reconstruction | Li, Yuze | | | 133 | 7 | p. 4158-4176 | article |
14 | Deep Hierarchical Learning for 3D Semantic Segmentation | Li, Chongshou | | | 133 | 7 | p. 4420-4441 | article |
15 | DiffuVolume: Diffusion Model for Volume based Stereo Matching | Zheng, Dian | | | 133 | 7 | p. 3807-3821 | article |
16 | DustNet++: Deep Learning-Based Visual Regression for Dust Density Estimation | Michel, Andreas | | | 133 | 7 | p. 4220-4244 | article |
17 | Exemplar-Free Continual Learning of Vision Transformers via Gated Class-Attention and Cascaded Feature Drift Compensation | Cotogni, Marco | | | 133 | 7 | p. 4571-4589 | article |
18 | Expressive Image Generation and Editing with Rich Text | Ge, Songwei | | | 133 | 7 | p. 4604-4622 | article |
19 | Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation | Wang, Yin | | | 133 | 7 | p. 4277-4293 | article |
20 | FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms | Bogensperger, Lea | | | 133 | 7 | p. 4864-4876 | article |
21 | Fully Decoupled End-to-End Person Search: An Approach without Conflicting Objectives | Zhang, Pengcheng | | | 133 | 7 | p. 4795-4816 | article |
22 | Fusion4DAL: Offline Multi-modal 3D Object Detection for 4D Auto-labeling | Yang, Zhiyuan | | | 133 | 7 | p. 3951-3969 | article |
23 | Fusion for Visual-Infrared Person ReID in Real-World Surveillance Using Corrupted Multimodal Data | Josi, Arthur | | | 133 | 7 | p. 4690-4711 | article |
24 | Guest Editorial: Special Issue on Biometrics Security and Privacy | Wan, Jun | | | 133 | 7 | p. 4966-4969 | article |
25 | Guest Editorial: Special Issue on Large-Scale Generative Models for Content Creation and Manipulation | He, Shengfeng | | | 133 | 7 | p. 4962-4965 | article |
26 | Image Matting and 3D Reconstruction in One Loop | Liu, Xinshuang | | | 133 | 7 | p. 4091-4111 | article |
27 | Imbuing, Enrichment and Calibration: Leveraging Language for Unseen Domain Extension | Jiang, Chenyi | | | 133 | 7 | p. 4064-4090 | article |
28 | I2MD: 3D Action Representation Learning with Inter- and Intra-Modal Mutual Distillation | Mao, Yunyao | | | 133 | 7 | p. 4944-4961 | article |
29 | Informative Scene Graph Generation via Debiasing | Gao, Lianli | | | 133 | 7 | p. 4196-4219 | article |
30 | Instance-Level Moving Object Segmentation from a Single Image with Events | Wan, Zhexiong | | | 133 | 7 | p. 4042-4063 | article |
31 | Investigating Self-Supervised Methods for Label-Efficient Learning | Nandam, Srinivasa Rao | | | 133 | 7 | p. 4522-4537 | article |
32 | LaMD: Latent Motion Diffusion for Image-Conditional Video Generation | Hu, Yaosi | | | 133 | 7 | p. 4384-4400 | article |
33 | LaneCorrect: Self-Supervised Lane Detection | Nie, Ming | | | 133 | 7 | p. 4894-4908 | article |
34 | Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation | Xu, Tianyang | | | 133 | 7 | p. 3858-3876 | article |
35 | Learning to Generalize Heterogeneous Representation for Cross-Modality Image Synthesis via Multiple Domain Interventions | Huang, Yawen | | | 133 | 7 | p. 4727-4748 | article |
36 | LiDAR-guided Geometric Pretraining for Vision-Centric 3D Object Detection | Huang, Linyan | | | 133 | 7 | p. 3877-3890 | article |
37 | LMD: Light-Weight Prediction Quality Estimation for Object Detection in Lidar Point Clouds | Riedlinger, Tobias | | | 133 | 7 | p. 4349-4365 | article |
38 | LR-ASD: Lightweight and Robust Network for Active Speaker Detection | Liao, Junhua | | | 133 | 7 | p. 4749-4769 | article |
39 | METS: Motion-Encoded Time-Surface for Event-Based High-Speed Pose Tracking | Xu, Ninghui | | | 133 | 7 | p. 4401-4419 | article |
40 | Multi-Source Domain Adaptation by Causal-Guided Adaptive Multimodal Diffusion Networks | Cai, Ziyun | | | 133 | 7 | p. 4623-4645 | article |
41 | Multi-Text Guidance Is Important: Multi-Modality Image Fusion via Large Generative Vision-Language Model | Wang, Zeyu | | | 133 | 7 | p. 4646-4668 | article |
42 | Not All Pixels are Equal: Learning Pixel Hardness for Semantic Segmentation | Xiao, Xin | | | 133 | 7 | p. 4669-4689 | article |
43 | On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook | Fan, Mingyuan | | | 133 | 7 | p. 4317-4348 | article |
44 | Parameter Efficient Fine-Tuning for Multi-modal Generative Vision Models with Möbius-Inspired Transformation | Duan, Haoran | | | 133 | 7 | p. 4590-4603 | article |
45 | Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding | Liu, Yi | | | 133 | 7 | p. 4483-4503 | article |
46 | PointSea: Point Cloud Completion via Self-structure Augmentation | Zhu, Zhe | | | 133 | 7 | p. 4770-4794 | article |
47 | Preconditioned Score-Based Generative Models | Ma, Hengyuan | | | 133 | 7 | p. 4837-4863 | article |
48 | Pre-training for Action Recognition with Automatically Generated Fractal Datasets | Svyezhentsev, Davyd | | | 133 | 7 | p. 4923-4943 | article |
49 | Realistic Evaluation of Deep Active Learning for Image Classification and Semantic Segmentation | Mittal, Sudhanshu | | | 133 | 7 | p. 4294-4316 | article |
50 | ScenarioDiff: Text-to-video Generation with Dynamic Transformations of Scene Conditions | Zhang, Yipeng | | | 133 | 7 | p. 4909-4922 | article |
51 | Semantics-Conditioned Generative Zero-Shot Learning via Feature Refinement | Chen, Shiming | | | 133 | 7 | p. 4504-4521 | article |
52 | Smaller But Better: Unifying Layout Generation with Smaller Large Language Models | Zhang, Peirong | | | 133 | 7 | p. 3891-3917 | article |
53 | Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | Wang, Wenjing | | | 133 | 7 | p. 4177-4195 | article |
54 | Temporal Transductive Inference for Few-Shot Video Object Segmentation | Siam, Mennatullah | | | 133 | 7 | p. 4465-4482 | article |
55 | Towards Boosting Out-of-Distribution Detection from a Spatial Feature Importance Perspective | Zhu, Yao | | | 133 | 7 | p. 3839-3857 | article |
56 | UMSCS: A Novel Unpaired Multimodal Image Segmentation Method Via Cross-Modality Generative and Semi-supervised Learning | Yang, Feiyang | | | 133 | 7 | p. 4442-4464 | article |
57 | UniFace++: Revisiting a Unified Framework for Face Reenactment and Swapping via 3D Priors | Xu, Chao | | | 133 | 7 | p. 4538-4554 | article |
58 | Unknown Support Prototype Set for Open Set Recognition | Jiang, Guosong | | | 133 | 7 | p. 4366-4383 | article |
59 | VideoQA in the Era of LLMs: An Empirical Study | Xiao, Junbin | | | 133 | 7 | p. 3970-3993 | article |
60 | VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models | Liang, Jiawei | | | 133 | 7 | p. 3994-4013 | article |