ICCV 2023|Universal data enhancement technology, random quantization is suitable for any data modality

Click “Xiaobai Xue Vision” above and choose to add “star” or “Pin“ Heavy stuff, delivered as soon as possible Source丨Heart of Machine Editor | Jishi Platform Introduction to Jishi This paper proposes a self-supervised learning data enhancement technique suitable for arbitrary data modalities. Self-supervised learning algorithms have made significant progress in fields such as natural […]

MICCAI 2023 | IFE: Feature Enhancement for Medical Image Segmentation

Click the Card below and follow the “CVer” public account AI/CV heavy-duty information, delivered as soon as possible Click to enter->[NeRF and Transformer] Communication Group Author: Glenn (Source: Zhihu, authorized) https://zhuanlan.zhihu.com/p/635329786 Reply in the background of the CVer WeChat public account: IFE, you can download the pdf and code of this paper and start learning! […]

YOLOv5 uses ICCV2023 snake convolution DSConv

[Remind everyone: The module has only been briefly packaged and tested. Everyone’s data is different. Whether it can increase the point requires more experiments. You can try different positions or make further improvements] DSConv is a fusion topology-controlled convolution proposed in ICCV2023 inspired by deformed convolution. It is mainly aimed at segmenting elongated morphological targets […]

ICCV 2023 | The only way to 3D sensing large models! UniTR: unified multi-modal Transformer Encoder!

Click the Card below and follow the “CVer” public account AI/CV heavy-duty information, delivered as soon as possible Click to enter->[3D Point Cloud and Transformer] Communication Group Reply in the background of CVer WeChat public account: UniTR, you can download the pdf and code of this paper Unified multi-modal transformer encoder for 3D perception UniTR: […]

ICCV 2023 | Apple proposes FastViT: fast convolution and Transformer hybrid architecture

Click the Card below and follow the “CVer” public account AI/CV heavy-duty information, delivered as soon as possible Click to enter->[Target Detection and Transformer] Communication group Reply in the background of CVer WeChat public account: FastViT, you can download the pdf and code of this paper Reprinted from: Jishi Platform | Author: Technology Beast Introduction […]

ICCV 2023 Oral | DDFM: The first method to use diffusion model for multi-modal image fusion

Click the Card below and follow the “CVer” public account AI/CV heavy-duty information, delivered as soon as possible Click to enter->[Image Fusion and Diffusion Model] Communication Group Author: Oppenheimer (Source: Zhihu, authorized) | Editor: CVer public account https://zhuanlan.zhihu.com/p/653761272 Reply in the background of CVer WeChat public account: DDFM, you can download the pdf and code […]

ICCV2023MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition

MRN reading Paper reading Background and motivation Rehearsal-imbalance MRN solves Rehearsal Imbalance problem MRN network structure experiment Summarize code run Build environment Create a virtual environment activate environment Install dependencies and packages start operation Paper address: MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition Source code address: https://github.com/simplify23/MRN Paper reading This is a paper […]

ICCV 2023 | SAFMN: Spatially Adaptive Feature Modulation for Efficient Image Super-Resolution

Click the Card below and follow the “CVer” public account AI/CV heavy-duty information, delivered as soon as possible Click to enter->[Super Resolution and Transformer] Communication Group Author: It’s really hard to name a cat (source: Zhihu, authorized) | Editor: CVer public account https://zhuanlan.zhihu.com/p/652234003 Reply in the background of CVer WeChat public account: SAFMN, you can […]

ICCV 2023 | Universal Data Augmentation Technology! Stochastic quantization suitable for arbitrary data modalities

Click the Card below and follow the “CVer” public account AI/CV heavy-duty information, delivered as soon as possible Click to enter->[Target Detection and Transformer] Communication Group Reprinted from: Heart of the Machine This paper proposes a self-supervised learning data enhancement technique suitable for arbitrary data modalities. Self-supervised learning algorithms have made significant progress in fields […]