Volume 33 Issue 4 | Journal of Electronic Imaging

Journal of Electronic Imaging

VOL. 33 · NO. 4 | July 2024

CONTENTS

IN THIS ISSUE

JEI Letters (1)

Regular Articles (58)

< Previous Issue | Next Issue >

Receive Email Alerts

VIEW ALL ABSTRACTS +

JEI Letters

Additive cosine margin for unsupervised softmax embedding

Dan Wang, Jianwei Yang, Cailing Wang

Journal of Electronic Imaging, Vol. 33, Issue 04, 040501, (August 2024) https://doi.org/10.1117/1.JEI.33.4.040501

TOPICS: Feature extraction, Visualization, Machine learning, Image retrieval, Education and training, Mathematical optimization, Overfitting, Curium, Deep learning, Data modeling

Read Abstract +

Regular Articles

Deep unsupervised nonconvex optimization for edge-preserving image smoothing

Yiwen Xiong, Yang Yang, Lanling Zeng, Xinyu Wang, Zhigeng Pan, Lei Jiang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043001, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043001

TOPICS: Tunable filters, Image filtering, Image enhancement, High dynamic range imaging, Education and training, Digital filtering, Smoothing, Electrophoretic light scattering, Machine learning, Data modeling

Read Abstract +

SDANet: scale-deformation awareness network for crowd counting

Jianyong Wang, Xiangyu Guo, Qilei Li, Ahmed Abdelmoniem, Mingliang Gao

Journal of Electronic Imaging, Vol. 33, Issue 04, 043002, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043002

TOPICS: Convolution, Deformation, Head, Network architectures, Education and training, Tunable filters, Adverse weather, Visualization, Image processing, Image fusion

Read Abstract +

Real-world image denoising via efficient diffusion model with controllable noise generation

Cheng Yang, Cong Wang, Lijing Liang, Zhixun Su

Journal of Electronic Imaging, Vol. 33, Issue 04, 043003, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043003

TOPICS: Diffusion, Image denoising, Image processing, Denoising, Education and training, Performance modeling, Process modeling, Image enhancement, Visualization, Mathematical modeling

Read Abstract +

Copy-move forgery detection algorithm based on binarized statistical image features and principal component analysis

Azzedine Bensaad, Khaled Loukhaoukha, Said Sadoudi, Aissa Snani

Journal of Electronic Imaging, Vol. 33, Issue 04, 043004, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043004

TOPICS: Counterfeit detection, Detection and tracking algorithms, Tunable filters, Principal component analysis, Visualization, Matrices, Digital imaging, Image processing, Feature extraction, Chromium

Read Abstract +

Multi-scale point pair normal encoding for local feature description and 3D object recognition

Chu’ai Zhang, Yating Wang, Qiao Wu, Jiangbin Zheng, Jiaqi Yang, Siwen Quan, Yanning Zhang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043005, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043005

TOPICS: Object recognition, Point clouds, 3D modeling, Clutter, Laser range finders, Data modeling, 3D image processing, Histograms, Voxels, Matrices

Read Abstract +

Improved self-supervised learning for disease identification in chest X-ray images

Yongjun Ma, Shi Dong, Yuchao Jiang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043006, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043006

TOPICS: Chest imaging, Data modeling, Image classification, Machine learning, Diseases and disorders, Education and training, Performance modeling, Medical imaging, Transformers, Ablation

Read Abstract +

Robust video hashing with canonical polyadic decomposition and Hahn moments

Zhenjun Tang, Huijiang Zhuang, Mengzhu Yu, Lv Chen, Xiaoping Liang, Xianquan Zhang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043007, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043007

TOPICS: Video, Feature extraction, Video acceleration, Video compression, Discrete wavelet transforms, Matrices, Visualization, Detection and tracking algorithms, 3D video compression, Tunable filters

Read Abstract +

Progressive reversible data hiding in encrypted images based on polynomial secret sharing and Chinese remainder theorem

Chao Jiang, Minqing Zhang, Zongbao Jiang, Yongjun Kong, Fuqiang Di

Journal of Electronic Imaging, Vol. 33, Issue 04, 043008, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043008

TOPICS: Image encryption, Image restoration, Data hiding, Computer security, Histograms, Visualization, Interpolation, Infrared imaging, Defense and security, Image quality

Read Abstract +

Deep degradation-aware up-sampling-based depth video coding

Zhaoqing Pan, Yuqing Niu, Bo Peng, Ge Li, Sam Kwong, Jianjun Lei

Journal of Electronic Imaging, Vol. 33, Issue 04, 043009, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043009

TOPICS: Video coding, Video, Video compression, Lawrencium, Education and training, Visualization, Spatial resolution, Feature extraction, Video acceleration, 3D video compression

Read Abstract +

Radar spectrum-image fusion using dual 2D-3D convolutional neural network to transformer inspired multi-headed self-attention bi-long short-term memory network for vehicle recognition

Ferris Arnous, Ram Narayanan

Journal of Electronic Imaging, Vol. 33, Issue 04, 043010, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043010

TOPICS: Signal to noise ratio, Education and training, Radar, Data modeling, Image fusion, Feature fusion, Image segmentation, Synthetic aperture radar, Visualization, 3D modeling

Read Abstract +

Underwater object detection by integrating YOLOv8 and efficient transformer

Jing Liu, Kaiqiong Sun, Xiao Ye, Yaokun Yun

Journal of Electronic Imaging, Vol. 33, Issue 04, 043011, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043011

TOPICS: Object detection, Target detection, Submerged target modeling, Feature fusion, Transformers, Submerged target detection, Head, Detection and tracking algorithms, Education and training, Small targets

Read Abstract +

ResRetinaFace: an efficient face detection network based on RetinaFace and residual structure

Xuanyu Liu, Shuliang Zhang, Junjie Hu, Peiyu Mao

Journal of Electronic Imaging, Vol. 33, Issue 04, 043012, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043012

TOPICS: Facial recognition systems, Convolution, Detection and tracking algorithms, Deformation, Target detection, Education and training, Feature extraction, Data modeling, Performance modeling, Neural networks

Read Abstract +

The detection of multiple faces in unconstrained environment in deep learning suffers from insufficient detection accuracy and inefficiency; at the same time, the detection of blurred, occluded, and very small faces is even more unsatisfactory. The detection of blurred, occluded, and very small faces in multiple face detection in unconstrained environment is a hard problem in face detection nowadays. It is difficult to balance the detection accuracy and real-time efficiency in face detection with the improved RetinaFace chosen in this study. Therefore, in order to improve the efficiency of detecting blurred, occluded, and very small faces among multiple faces in unconstrained environments, we introduce deformable convolution, feature pyramid networks (FPN), and coordinate attention (CA) attention mechanism based on RetinaFace algorithm. Deformable convolution can be dynamically adjusted according to the shape and deformation of the recognized object and is no longer limited to a fixed-size square receptive field to improve the image feature extraction capability of the convolutional layer. FPN enhances the feature semantic information of the lower layers with a small increase in computational effort and improves the robustness of the detection algorithm to detect targets of different sizes. CA is a novel, lightweight, and efficient attention mechanism module for improving model performance, which can be easily integrated into mobile networks to improve accuracy with little additional computational overhead. The improved ResRetinaFace algorithm does not increase the computational overhead too much while improving the recognition accuracy, and it can better combine the characteristics of multiple postures and deformations of faces in complex scenes, adapt to the deformation state of faces’ postures, and provide more effective features for face detection, so as to pay better attention to the detection target and enhance the network characterization ability. Meanwhile, the improved algorithm combines the feature pyramid with the context module, which improves the detection effect in the case of blurred, occluded, and very small faces. The experimental outcomes demonstrate that, in contrast to the method before enhancement, the accuracy rates for easy, medium, and hard classification scenarios on the WIDER FACE dataset, utilizing the ResNet50 backbone network, are 94.83%, 93.28%, and 84.99%, respectively. Accompanied by a frames-per-second rate of 7.704, this meets the precision and real-time criteria for face measurement tasks. Validation on the WIDER FACE dataset further affirms that ResRetinaFace consistently achieves reliable face detection while maintaining high detection efficiency.

Improving the deblurring method of D²Net network for infrared videos

Jia Zhang, Yanzhu Zhang, Fan Yang, Tingxue Li, Yuhai Li, He Zhao, Jixiong Pu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043013, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043013

TOPICS: Deblurring, Video, Infrared radiation, Infrared imaging, Thermography, Feature extraction, Education and training, Video acceleration, Image quality, Image processing

Read Abstract +

PGDIG-YOLO: a lightweight method for airport runway foreign object detection

Liushuai Zheng, Xinyu Chen, Liuchuang Zheng

Journal of Electronic Imaging, Vol. 33, Issue 04, 043014, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043014

TOPICS: Object detection, Performance modeling, Convolution, Instrument modeling, Data modeling, Feature extraction, Education and training, Detection and tracking algorithms, Target detection, Semantics

Read Abstract +

Hyperspectral image denoising via self-modulated cross-attention deformable convolutional neural network

Ying Wang, Jie Qiu, Yanxiang Zhao

Journal of Electronic Imaging, Vol. 33, Issue 04, 043015, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043015

TOPICS: Denoising, Education and training, Convolution, Deep learning, Hyperspectral imaging, Image denoising, Deformation, Network architectures, Visualization, Modulation

Read Abstract +

Scale separation: video crowd counting with different density maps

Ao Zhang, Xin Deng, Baoying Liu, Weiwei Zhang, Jun Guo, Linrui Xie

Journal of Electronic Imaging, Vol. 33, Issue 04, 043016, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043016

TOPICS: Video, Education and training, Convolution, Feature extraction, Ablation, Visualization, Image processing, Data modeling, Video processing, Covariance

Read Abstract +

D-YOLOv7-tiny: a lightweight network for defect detection of prefabricated steel pipe

Qian Gu, Xiangdi Yue, Yang Huang, Anquan Jian, Xiuxiang Huang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043017, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043017

TOPICS: Defect detection, Pipes, Convolution, Target detection, Performance modeling, Detection and tracking algorithms, Head, Buildings, Small targets, Feature extraction

Read Abstract +

Temporal residual neural radiance fields for monocular video dynamic human body reconstruction

Tianle Du, Jie Wang, Xiaolong Xie, Wei Li, Pengxiang Su, Jiee Liu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043018, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043018

TOPICS: Education and training, Video, 3D modeling, Video acceleration, Neural networks, Modeling, RGB color model, Image restoration, Video coding, Reconstruction algorithms

Read Abstract +

StyleWA: adaptive discriminator-based wavelet knowledge distillation

Rui Li, Yihao Bao

Journal of Electronic Imaging, Vol. 33, Issue 04, 043019, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043019

TOPICS: Gallium nitride, Wavelets, Education and training, Image quality, RGB color model, Performance modeling, Discrete wavelet transforms, Baryon acoustic oscillations, Image processing, Image enhancement

Read Abstract +

Efficient and lightweight multiscale network for person reidentification

Yunzuo Zhang, Yuehui Yang, Weili Kang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043020, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043020

TOPICS: Feature extraction, Convolution, Education and training, Design, Feature fusion, Data modeling, Mathematical optimization, Image segmentation, Semantics, Image processing

Read Abstract +

Pose-guided node and trajectory construction transformer for occluded person re-identification

Chentao Hu, Yanbing Chen, Lingyi Guo, Lingbing Tao, Zhixin Tie, Wei Ke

Journal of Electronic Imaging, Vol. 33, Issue 04, 043021, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043021

TOPICS: Transformers, Feature extraction, Semantics, Matrices, Education and training, Pose estimation, Head, Data modeling, Image segmentation, Image processing algorithms and systems

Read Abstract +

Three-dimensional human pose estimation based on contact pressure

Ning Yin, Ke Wang, Nian Wang, Jun Tang, Wenxia Bao

Journal of Electronic Imaging, Vol. 33, Issue 04, 043022, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043022

TOPICS: Pose estimation, Feature extraction, Visualization, Education and training, Data modeling, 3D image processing, Video, Network architectures, Neural networks, Design

Read Abstract +

Background-focused contrastive learning for unpaired image-to-image translation

Mingwen Shao, Minggui Han, Lingzhuang Meng, Fukang Liu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043023, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043023

TOPICS: Education and training, Visualization, Semantics, Image processing, Image quality, Feature extraction, Projection systems, Gallium nitride, Ablation, Image classification

Read Abstract +

Early quadtree with nested multitype tree partitioning algorithm based on convolution neural network for the versatile video coding standard

Bouthaina Abdallah, Sonda Ben Jdidia, Fatma Belghith, Mohamed Ali Ben Ayed, Nouri Masmoudi

Journal of Electronic Imaging, Vol. 33, Issue 04, 043024, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043024

TOPICS: Video coding, Copper, Video, Evolutionary algorithms, Education and training, High efficiency video coding, Neural networks, Decision trees, Convolution, Standards development

Read Abstract +

Lightweight human activity recognition system for resource constrained environments

Mihir Karandikar, Ankit Jain, Abhishek Srivastava

Journal of Electronic Imaging, Vol. 33, Issue 04, 043025, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043025

TOPICS: Shape memory alloys, Action recognition, Windows, Data modeling, 3D modeling, Semantics, Classification systems, Performance modeling, Machine learning, Feature extraction

Read Abstract +

Highly compressed image encryption algorithm via fractal and semi-tensor product compressed sensing

Lin Fan, Meng Li

Journal of Electronic Imaging, Vol. 33, Issue 04, 043026, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043026

TOPICS: Image compression, Image encryption, Fractal analysis, Image restoration, Computer security, Matrices, Diffusion, Image transmission, Data storage, Image storage

Read Abstract +

High-resolution cloud detection network

Jingsheng Li, Tianxiang Xue, Jiayi Zhao, Jingmin Ge, Yufang Min, Wei Su, Kun Zhan

Journal of Electronic Imaging, Vol. 33, Issue 04, 043027, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043027

TOPICS: Clouds, Education and training, Object detection, Feature fusion, Image segmentation, Remote sensing, Semantics, Image fusion, Image processing, Feature extraction

Read Abstract +

Event-frame object detection under dynamic background condition

Wenhao Lu, Zehao Li, Junying Li, Yuncheng Lu, Tae Hyoung (Tony) Kim

Journal of Electronic Imaging, Vol. 33, Issue 04, 043028, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043028

TOPICS: Object detection, Detection and tracking algorithms, Internet of things, Cameras, Sensors, Interference (communication), Histograms, Surveillance, Lutetium, Tunable filters

Read Abstract +

No-reference video quality assessment based on human visual perception

Zhou Zhou, Guangqian Kong, Xun Duan, Huiyun Long

Journal of Electronic Imaging, Vol. 33, Issue 04, 043029, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043029

TOPICS: Video, Feature extraction, Databases, Visualization, Education and training, Video processing, Transformers, Video compression, Molybdenum, Image segmentation

Read Abstract +

Research on ground-based cloud image classification combining local and global features

Xin Zhang, Wanting Zheng, Jianwei Zhang, Weibin Chen, Liangliang Chen

Journal of Electronic Imaging, Vol. 33, Issue 04, 043030, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043030

TOPICS: Clouds, Image classification, Feature extraction, Transformers, Rain, Education and training, Image fusion, Convolution, Performance modeling, Visual process modeling

Read Abstract +

Scene adaptive color compensation and multi-weight fusion of underwater image

Muhammad Aon, Huibing Wang, Muhammad Noman Waleed, Yulin Wei, Xianping Fu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043031, (July 2024) https://doi.org/10.1117/1.JEI.33.4.043031

TOPICS: Image fusion, Color, Image enhancement, Image processing, Image quality, Visualization, Tunable filters, Image sharpness, Image filtering, Attenuation

Read Abstract +

Chaotic multiple-image encryption scheme: a simple and highly efficient solution for diverse applications

K. Abhimanyu Kumar Patro, Pulkit Singh, Narendra Khatri, Bibhudendra Acharya

Journal of Electronic Imaging, Vol. 33, Issue 04, 043032, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043032

TOPICS: Image encryption, Histograms, Diffusion, Computer security, Chaos, Image processing, RGB color model, Optical image encryption, Statistical analysis, Bismuth

Read Abstract +

Multi-scale adaptive low-light image enhancement based on deep learning

Taotao Cao, Taile Peng, Hao Wang, Xiaotong Zhu, Jia Guo, Zhen Zhang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043033, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043033

TOPICS: Image enhancement, Visualization, Deep learning, Image quality, Education and training, Convolution, Visual process modeling, Denoising, Light sources and illumination, Data modeling

Read Abstract +

Deep inner-knuckle-print recognition using lightweight Siamese network

Hongxia Wang, Hongwu Yuan

Journal of Electronic Imaging, Vol. 33, Issue 04, 043034, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043034

TOPICS: Printing, Feature extraction, Detection and tracking algorithms, Biometrics, Education and training, Deep learning, Performance modeling, Databases, Image processing, Data modeling

Read Abstract +

Fine-tuned Siamese neural network–based multimodal vein biometric system with hybrid firefly–particle swarm optimization

Gurunathan Velliangiri, Sudhakar Radhakrishnan

Journal of Electronic Imaging, Vol. 33, Issue 04, 043035, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043035

TOPICS: Veins, Neural networks, Biometrics, Feature extraction, Classification systems, Particle swarm optimization, Mathematical optimization, Binary data, Machine learning, Image processing

Read Abstract +

Video frame interpolation based on depthwise over-parameterized recurrent residual convolution

Xiaohui Yang, Weijing Liu, Shaowen Wang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043036, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043036

TOPICS: Interpolation, Video, Convolution, Feature extraction, Optical flow, Education and training, Visualization, Motion estimation, Video processing, Performance modeling

Read Abstract +

FCCText: frequency-color complementary bistream structure for scene text detection

Ruiyi Han, Xin Li

Journal of Electronic Imaging, Vol. 33, Issue 04, 043037, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043037

TOPICS: RGB color model, Feature extraction, Feature fusion, Lithium, Image segmentation, Convolution, Visualization, Transformers, Image processing, Education and training

Read Abstract +

DeepLab-Rail: semantic segmentation network for railway scenes based on encoder-decoder structure

Qingsong Zeng, Linxuan Zhang, Yuan Wang, Xiaolong Luo, Yannan Chen

Journal of Electronic Imaging, Vol. 33, Issue 04, 043038, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043038

TOPICS: Semantics, Image segmentation, Feature extraction, Convolution, Education and training, Detection and tracking algorithms, Data modeling, Surface plasmons, Transformers, Signal attenuation

Read Abstract +

Robust auto-weighted and dual-structural representation learning for image clustering

Kun Jiang, Zhaoli Liu, Qindong Sun

Journal of Electronic Imaging, Vol. 33, Issue 04, 043039, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043039

TOPICS: Machine learning, Data modeling, Databases, Matrices, Performance modeling, Sun, Statistical modeling, Mathematical optimization, Algorithms, Received signal strength

Read Abstract +

Semantic segmentation of multiclass walls in complex architectural floor plan image

Zhongguo Xu, Naresh Jha, Syed Mehadi, Santi Maity, Mrinal Mandal

Journal of Electronic Imaging, Vol. 33, Issue 04, 043040, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043040

TOPICS: Image segmentation, Semantics, Convolution, Education and training, Computed tomography, Visualization, Spatial learning, Feature extraction, Matrices, Convolutional neural networks

Read Abstract +

SiamGPF: temporal correlation-based visual tracking

Shengxue Cao, Biao Zhu, Keyan Kong, Lixiang Ma, Bingyou Liu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043041, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043041

TOPICS: Performance modeling, Optical tracking, Education and training, Data modeling, Video, Detection and tracking algorithms, Network architectures, Deformation, Ablation, Head

Read Abstract +

Image-text multimodal classification via cross-attention contextual transformer with modality-collaborative learning

Qianyao Shi, Wanru Xu, Zhenjiang Miao

Journal of Electronic Imaging, Vol. 33, Issue 04, 043042, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043042

TOPICS: Machine learning, Transformers, Image classification, Data modeling, Performance modeling, Visualization, Feature extraction, Education and training, Information fusion, Image fusion

Read Abstract +

Nowadays, we are surrounded by various types of data from different modalities, such as text, images, audio, and video. The existence of this multimodal data provides us with rich information, but it also brings new challenges: how do we effectively utilize this data for accurate classification? This is the main problem faced by multimodal classification tasks. Multimodal classification is an important task that aims to classify data from different modalities. However, due to the different characteristics and structures of data from different modalities, effectively fusing and utilizing them for classification is a challenging problem. To address this issue, we propose a cross-attention contextual transformer with modality-collaborative learning for multimodal classification (CACT-MCL-MMC) to better integrate information from different modalities. On the one hand, existing multimodal fusion methods ignore the intra- and inter-modality relationships, and there is unnoticed information in the modalities, resulting in unsatisfactory classification performance. To address the problem of insufficient interaction of modality information in existing algorithms, we use a cross-attention contextual transformer to capture the contextual relationships within and among modalities to improve the representativeness of the model. On the other hand, due to differences in the quality of information among different modalities, some modalities may have misleading or ambiguous information. Treating each modality equally may result in modality perceptual noise, which reduces the performance of multimodal classification. Therefore, we use modality-collaborative to filter misleading information, alleviate the quality difference of information among modalities, align modality information with high-quality and effective modalities, enhance unimodal information, and obtain more ideal multimodal fusion information to improve the model’s discriminative ability. Our comparative experimental results on two benchmark datasets for image-text classification, CrisisMMD and UPMC Food-101, show that our proposed model outperforms other classification methods and even state-of-the-art (SOTA) multimodal classification methods. Meanwhile, the effectiveness of the cross-attention module, multimodal contextual attention network, and modality-collaborative learning was verified through ablation experiments. In addition, conducting hyper-parameter validation experiments showed that different fusion calculation methods resulted in differences in experimental results. The most effective feature tensor calculation method was found. We also conducted qualitative experiments. Compared with the original model, our proposed model can identify the expected results in the vast majority of cases. The codes are available at https://github.com/KobeBryant8-24-MVP/CACT-MCL-MMC. The CrisisMMD is available at https://dataverse.mpisws.org/dataverse/icwsm18, and the UPMC-Food-101 is available at https://visiir.isir.upmc.fr/.

Yarn hairiness measurement based on multi-camera system and perspective maximization model

Hongyan Cao, Zhenze Chen, Haihua Hu, Xiangbing Huai, Hao Zhu, Zhongjian Li

Journal of Electronic Imaging, Vol. 33, Issue 04, 043043, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043043

TOPICS: Image segmentation, Image processing, Cameras, Histograms, Matrices, Convolution, Image enhancement, 3D modeling, Cotton, Equipment

Read Abstract +

Squeeze-and-excitation attention and bi-directional feature pyramid network for filter screens surface detection

Junpeng Xu, Xiangbo Zhu, Lei Shi, Jin Li, Ziman Guo

Journal of Electronic Imaging, Vol. 33, Issue 04, 043044, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043044

TOPICS: Tunable filters, Education and training, Displays, Detection and tracking algorithms, Defect detection, Object detection, Feature fusion, Feature extraction, Neck, Image filtering

Read Abstract +

Joint merging and pruning: adaptive selection of better token compression strategy

Wei Peng, Liancheng Zeng, Lizhuo Zhang, Yue Shen

Journal of Electronic Imaging, Vol. 33, Issue 04, 043045, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043045

TOPICS: Education and training, Performance modeling, Transformers, Visual process modeling, Matrices, Statistical modeling, Data modeling, Visualization, Mathematical optimization, Computer vision technology

Read Abstract +

LGD-FCOS: driver distraction detection using improved FCOS based on local and global knowledge distillation

Kunbiao Li, Xiaohui Yang, Jing Wang, Feng Zhang, Tao Xu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043046, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043046

TOPICS: Object detection, Education and training, Detection and tracking algorithms, Data modeling, Sensors, Roads, Safety, Performance modeling, Convolution, Feature extraction

Read Abstract +

D²Net: discriminative feature extraction and details preservation network for salient object detection

Qianqian Guo, Yanjiao Shi, Jin Zhang, Jinyu Yang, Qing Zhang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043047, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043047

TOPICS: Feature extraction, Convolution, Performance modeling, Education and training, Object detection, Design, Data modeling, Semantics, Feature fusion, Bromine

Read Abstract +

Fusion 3D object tracking method based on region and point cloud registration

Yixin Jin, Jiawei Zhang, Yinhua Liu, Wei Mo, Hua Chen

Journal of Electronic Imaging, Vol. 33, Issue 04, 043048, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043048

TOPICS: 3D tracking, Point clouds, 3D modeling, Contour modeling, 3D image processing, Image segmentation, Detection and tracking algorithms, Pose estimation, Matrices, Cameras

Read Abstract +

Frequency domain-based reversible adversarial attacks for privacy protection in Internet of Things

Yang Lu, Tianfeng Ma, Zilong Pang, Xiuli Chai, Zhen Chen, Zongwei Tang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043049, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043049

TOPICS: Radium, Visualization, Image processing, Education and training, Image classification, Neural networks, Discrete wavelet transforms, Image quality, Data modeling, Deep learning

Read Abstract +

Fast and robust object region segmentation with self-organized lattice Boltzmann based active contour method

Fatema Albalooshi, Vijayan Asari

Journal of Electronic Imaging, Vol. 33, Issue 04, 043050, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043050

TOPICS: Image segmentation, Education and training, Thermography, Medical imaging, Laser induced fluorescence, Visualization, Image processing, Tumors, Image processing algorithms and systems, Contour modeling

Read Abstract +

Edge-oriented unrolling network for infrared and visible image fusion

Tian-Hui Yuan, Zongliang Gan, Changhong Chen, Ziguan Cui

Journal of Electronic Imaging, Vol. 33, Issue 04, 043051, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043051

TOPICS: Image fusion, Infrared radiation, Infrared imaging, Image enhancement, Visible radiation, Feature extraction, Feature fusion, Education and training, Convolution, Image quality

Read Abstract +

Enhancing hyperspectral image classification with graph attention neural network

Niruban Rathakrishnan, Deepa Raja

Journal of Electronic Imaging, Vol. 33, Issue 04, 043052, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043052

TOPICS: Image classification, Education and training, Hyperspectral imaging, Image enhancement, Machine learning, Matrices, Independent component analysis, Optical filters, Neural networks, Data modeling

Read Abstract +

Precipitation nowcasting based on ConvLSTM-UNet deep spatiotemporal network

Xiangming Zheng, Huawang Qin, Haoran Chen, Weixi Wang, Piao Shi

Journal of Electronic Imaging, Vol. 33, Issue 04, 043053, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043053

TOPICS: Data modeling, Rain, Performance modeling, Convolution, Atmospheric modeling, Radar, Feature extraction, Meteorology, Education and training, Image restoration

Read Abstract +

Spatio-temporal enhancement method based on dense connection structure for compressed video

Hongyao Li, Xiaohai He, Xiaodong Bi, Shuhua Xiong, Honggang Chen

Journal of Electronic Imaging, Vol. 33, Issue 04, 043054, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043054

TOPICS: Video, Video compression, Feature fusion, Image enhancement, Image quality, Feature extraction, Convolution, Education and training, Deformation, Image fusion

Read Abstract +

Alternative evaluation of industrial surface defect synthesis data based on analytic hierarchy process

Yang Lu, Hang Hao, Linhui Chen, Longfei Yang, Xiaoheng Jiang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043055, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043055

TOPICS: Data modeling, Education and training, Defect detection, Image quality, Matrices, Statistical modeling, Gallium nitride, Signal to noise ratio, Performance modeling, Magnetism

Read Abstract +

Settlement detection from satellite imagery using fully convolutional network

Tayaba Anjum, Ahsan Ali, Muhammad Tahir Naseem

Journal of Electronic Imaging, Vol. 33, Issue 04, 043056, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043056

TOPICS: Image segmentation, Convolution, Feature extraction, Satellites, Earth observing sensors, Satellite imaging, Education and training, RGB color model, Tunable filters, Object detection

Read Abstract +

SGTformer: improved Shifted Window Transformer network for white blood cell subtype classification

Xiangyu Deng, Lihao Pan, Zhiyan Dang

Journal of Electronic Imaging, Vol. 33, Issue 04, 043057, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043057

TOPICS: White blood cells, Transformers, Feature extraction, Image classification, Windows, Convolution, Education and training, Image enhancement, Image processing, Data modeling

Read Abstract +

Coded target recognition algorithm for vision measurement

Peng Zhang, Qing Liu, Shengpeng Li, Fei Liu, Wenjing Liu

Journal of Electronic Imaging, Vol. 33, Issue 04, 043058, (August 2024) https://doi.org/10.1117/1.JEI.33.4.043058

TOPICS: Target recognition, Detection and tracking algorithms, Image segmentation, Light sources and illumination, Image processing algorithms and systems, Image compression, Cameras, Visualization, Binary data, Environmental sensing

Read Abstract +

Keywords/Phrases

Search In:

Publication Years