Papers with the keyword Image (Page 2)

#21

JOURNAL ARTICLE

Contrastive Graph Pooling for Explainable Classification of Brain Networks.

Jiaxing Xu, Qingtian Bian, Xinhang Li, Aihu Zhang, Yiping Ke, Miao Qiao, Wei Zhang, Wei Khang Jeremy Sim, Balazs Gulyas

Functional magnetic resonance imaging (fMRI) is a commonly used technique to measure neural activation. Its application has been particularly important in identifying underlying neurodegenerative conditions such as Parkinson's, Alzheimer's, and Autism. Recent analysis of fMRI data models the brain as a graph and extracts features by graph neural networks (GNNs). However, the unique characteristics of fMRI data require a special design of GNN. Tailoring GNN to generate effective and domain-explainable features remains challenging...

38656865

April 24, 2024: IEEE Transactions on Medical Imaging

#22

JOURNAL ARTICLE

TreEducation: A Visual Education Platform for Teaching Treemap Layout Algorithms.

Johannes Fuchs, Bastian Jackl, Michael Juttler, Daniel A Keim, Rita Sevastjanova

Treemaps are a powerful tool for representing hierarchical data in a space-efficient manner and are used in various domains, including network security or software development. However, interpreting the topology encoded by nested rectangles can be challenging, particularly compared to tree-structured representations like node-link diagrams or icicle plots. To address this challenge, we introduce TreEducation, a visual education platform designed to improve the visualization literacy skills required for reading treemaps among non-expert users...

38656864

April 24, 2024: IEEE Transactions on Visualization and Computer Graphics

#23

JOURNAL ARTICLE

Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference.

Fan Zhang, Zhaohan Wang, Xin Lyu, Siyuan Zhao, Mengjian Li, Weidong Geng, Naye Ji, Hui Du, Fuxing Gao, Hao Wu, Shunman Li

Speech-driven gesture generation is an emerging field within virtual human creation. However, a significant challenge lies in accurately determining and processing the multitude of input features (such as acoustic, semantic, emotional, personality, and even subtle unknown features). Traditional approaches, reliant on various explicit feature inputs and complex multimodal processing, constrain the expressiveness of resulting gestures and limit their applicability. To address these challenges, we present Persona-Gestor, a novel end-to-end generative model designed to generate highly personalized 3D full-body gestures solely relying on raw speech audio...

38656863

April 24, 2024: IEEE Transactions on Visualization and Computer Graphics

#24

JOURNAL ARTICLE

What Makes Deviant Places?

Jin-Hwi Park, Young-Jae Park, Ilyung Cheong, Junoh Lee, Young Eun Huh, Hae-Gon Jeon

Urban safety plays an essential role in the quality of citizens' lives and in the sustainable development of cities. In recent years, researchers have attempted to apply machine learning techniques to identify the role of location-specific attributes in the development of urban safety. However, existing studies have mainly relied on limited images (e.g., map images, single- or four-directional images) of areas based on a relatively large geographical unit and have narrowly focused on severe crime rates, which limits their predictive performance and implications for urban safety...

38656859

April 24, 2024: IEEE Transactions on Pattern Analysis and Machine Intelligence

#25

JOURNAL ARTICLE

Novel Uncertainty Quantification through Perturbation-Assisted Sample Synthesis.

Yifei Liu, Rex Shen, Xiaotong Shen

This paper introduces a novel Perturbation-Assisted Inference (PAI) framework utilizing synthetic data generated by the Perturbation-Assisted Sample Synthesis (PASS) method. The framework focuses on uncertainty quantification in complex data scenarios, particularly involving unstructured data while utilizing deep learning models. On one hand, PASS employs a generative model to create synthetic data that closely mirrors raw data while preserving its rank properties through data perturbation, thereby enhancing data diversity and bolstering privacy...

38656858

April 24, 2024: IEEE Transactions on Pattern Analysis and Machine Intelligence

#26

JOURNAL ARTICLE

A Survey on Efficient Vision Transformers: Algorithms, Techniques, and Performance Benchmarking.

Lorenzo Papa, Paolo Russo, Irene Amerini, Luping Zhou

Vision Transformer (ViT) architectures are becoming increasingly popular and widely employed to tackle computer vision applications. Their main feature is the capacity to extract global information through the self-attention mechanism, outperforming earlier convolutional neural networks. However, ViT deployment and performance have grown steadily with their size, number of trainable parameters, and operations. Furthermore, self-attention's computational and memory cost quadratically increases with the image resolution...

38656856

April 24, 2024: IEEE Transactions on Pattern Analysis and Machine Intelligence

#27

JOURNAL ARTICLE

NeuralRecon: Real-Time Coherent 3D Scene Reconstruction from Monocular Video.

Xi Chen, Jiaming Sun, Yiming Xie, Hujun Bao, Xiaowei Zhou

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction...

38656855

April 24, 2024: IEEE Transactions on Pattern Analysis and Machine Intelligence

#28

JOURNAL ARTICLE

Memory-based Cross-modal Semantic Alignment Network for Radiology Report Generation.

Yitian Tao, Liyan Ma, Jing Yu, Han Zhang

Generating radiology reports automatically reduces the workload of radiologists and helps the diagnoses of specific diseases. Many existing methods take this task as modality transfer process. However, since the key information related to disease accounts for a small proportion in both image and report, it is hard for the model to learn the latent relation between the radiology image and its report, thus failing to generate fluent and accurate radiology reports. To tackle this problem, we propose a memory-based cross-modal semantic alignment model (MCSAM) following an encoder-decoder paradigm...

38656853

April 24, 2024: IEEE Journal of Biomedical and Health Informatics

#29

JOURNAL ARTICLE

Efficient click-based interactive segmentation for medical image with improved Plain-ViT.

Mengxing Huang, Jie Zou, Yu Zhang, Uzair Aslam Bhatti, Jing Chen

The primary objective of interactive medical image segmentation systems is to achieve more precise segmentation outcomes with reduced human intervention. This endeavor holds significant clinical importance for both pre-diagnostic pathological assessments and prognostic recovery. Among the various interaction methods available, click-based interactions stand out as an intuitive and straightforward approach compared to alternatives such as graffiti, bounding boxes, and extreme points. To improve the model's ability to interpret click-based interactions, we propose a comprehensive interactive segmentation framework that leverages an iterative weighted loss function based on user clicks...

38656851

April 24, 2024: IEEE Journal of Biomedical and Health Informatics

#30

JOURNAL ARTICLE

Balanced Unfolding Induced Tensor Nuclear Norms for High-Order Tensor Completion.

Yuning Qiu, Guoxu Zhou, Andong Wang, Qibin Zhao, Shengli Xie

The recently proposed tensor tubal rank has been witnessed to obtain extraordinary success in real-world tensor data completion. However, existing works usually fix the transform orientation along the third mode and may fail to turn multidimensional low-tubal-rank structure into account. To alleviate these bottlenecks, we introduce two unfolding induced tensor nuclear norms (TNNs) for the tensor completion (TC) problem, which naturally extends tensor tubal rank to high-order data. Specifically, we show how multidimensional low-tubal-rank structure can be captured by utilizing a novel balanced unfolding strategy, upon which two TNNs, namely, overlapped TNN (OTNN) and latent TNN (LTNN), are developed...

38656849

April 24, 2024: IEEE Transactions on Neural Networks and Learning Systems

#31

JOURNAL ARTICLE

Fine-Grained Essential Tensor Learning for Robust Multi-View Spectral Clustering.

Chong Peng, Kehan Kang, Yongyong Chen, Zhao Kang, Chenglizhao Chen, Qiang Cheng

Multi-view subspace clustering (MVSC) has drawn significant attention in recent study. In this paper, we propose a novel approach to MVSC. First, the new method is capable of preserving high-order neighbor information of the data, which provides essential and complicated underlying relationships of the data that is not straightforwardly preserved by the first-order neighbors. Second, we design log-based nonconvex approximations to both tensor rank and tensor sparsity, which are effective and more accurate than the convex approximations...

38656843

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#32

JOURNAL ARTICLE

CS2DIPs: Unsupervised HSI Super-Resolution Using Coupled Spatial and Spectral DIPs.

Yuan Fang, Yipeng Liu, Chong-Yung Chi, Zhen Long, Ce Zhu

In recent years, fusing high spatial resolution multispectral images (HR-MSIs) and low spatial resolution hyperspectral images (LR-HSIs) has become a widely used approach for hyperspectral image super-resolution (HSI-SR). Various unsupervised HSI-SR methods based on deep image prior (DIP) have gained wide popularity thanks to no pre-training requirement. However, DIP-based methods often demonstrate mediocre performance in extracting latent information from the data. To resolve this performance deficiency, we propose a coupled spatial and spectral deep image priors (CS2DIPs) method for the fusion of an HR-MSI and an LR-HSI into an HR-HSI...

38656842

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#33

JOURNAL ARTICLE

Multi-Stage Network with Geometric Semantic Attention for Two-View Correspondence Learning.

Shuyuan Lin, Xiao Chen, Guobao Xiao, Hanzi Wang, Feiran Huang, Jian Weng

The removal of outliers is crucial for establishing correspondence between two images. However, when the proportion of outliers reaches nearly 90%, the task becomes highly challenging. Existing methods face limitations in effectively utilizing geometric transformation consistency (GTC) information and incorporating geometric semantic neighboring information. To address these challenges, we propose a Multi-Stage Geometric Semantic Attention (MSGSA) network. The MSGSA network consists of three key modules: the multi-branch (MB) module, the GTC module, and the geometric semantic attention (GSA) module...

38656841

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#34

JOURNAL ARTICLE

Model-based Explainable Deep Learning for Light-field Microscopy Imaging.

Pingfan Song, Herman Verinaz Jadan, Carmel L Howe, Amanda J Foust, Pier Luigi Dragotti

In modern neuroscience, observing the dynamics of large populations of neurons is a critical step of understanding how networks of neurons process information. Light-field microscopy (LFM) has emerged as a type of scanless, high-speed, three-dimensional (3D) imaging tool, particularly attractive for this purpose. Imaging neuronal activity using LFM calls for the development of novel computational approaches that fully exploit domain knowledge embedded in physics and optics models, as well as enabling high interpretability and transparency...

38656840

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#35

JOURNAL ARTICLE

Graph-Represented Distribution Similarity Index for Full-Reference Image Quality Assessment.

Wenhao Shen, Mingliang Zhou, Jun Luo, Zhengguo Li, Sam Kwong

In this paper, we propose a graph-represented image distribution similarity (GRIDS) index for full-reference (FR) image quality assessment (IQA), which can measure the perceptual distance between distorted and reference images by assessing the disparities between their distribution patterns under a graph-based representation. First, we transform the input image into a graph-based representation, which is proven to be a versatile and effective choice for capturing visual perception features. This is achieved through the automatic generation of a vision graph from the given image content, leading to holistic perceptual associations for irregular image regions...

38656839

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#36

JOURNAL ARTICLE

Learning Contrast-enhanced Shape-biased Representations for Infrared Small Target Detection.

Fanzhao Lin, Kexin Bao, Yong Li, Dan Zeng, Shiming Ge

Detecting infrared small targets under cluttered background is mainly challenged by dim textures, low contrast and varying shapes. This paper proposes an approach to facilitate infrared small target detection by learning contrast-enhanced shape-biased representations. The approach cascades a contrast-shape encoder and a shape-reconstructable decoder to learn discriminative representations that can effectively identify target objects. The contrast-shape encoder applies a stem of central difference convolutions and a few large-kernel convolutions to extract shape-preserving features from input infrared images...

38656838

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#37

JOURNAL ARTICLE

Mitigating Search Interference with Task-Aware Nested Search.

Jiho Lee, Eunwoo Kim

Neural Architecture Search (NAS) has emerged as a promising tool in the field of AutoML for designing more accurate and efficient architectures. The majority of NAS works employ a weight-sharing technique to reduce the search cost by sharing the weights of a supernet, which is a composite of all architectures produced from the search space. Nonetheless, this method has a significant drawback in that negative interference may arise when candidate architectures share the same weights. This issue becomes even more severe in multi-task searches, where a supernet is shared across tasks...

38656837

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#38

JOURNAL ARTICLE

Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question Answering.

Ting Yu, Kunhao Fu, Jian Zhang, Qingming Huang, Jun Yu

Long-term Video Question Answering (VideoQA) is a challenging vision-and-language bridging task focusing on semantic understanding of untrimmed long-term videos and diverse free-form questions, simultaneously emphasizing comprehensive cross-modal reasoning to yield precise answers. The canonical approaches often rely on off-the-shelf feature extractors to detour the expensive computation overhead, but often result in domain-independent modality-unrelated representations. Furthermore, the inherent gradient blocking between unimodal comprehension and cross-modal interaction hinders reliable answer generation...

38656836

April 24, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#39

JOURNAL ARTICLE

Automatic segmentation of 2D echocardiography ultrasound images by means of generative adversarial network.

Noreen Fatima, Sajjad Afrakhteh, Giovanni Iacca, Libertario Demi

Automated cardiac segmentation from two-dimensional (2D) echocardiographic images is a crucial step toward improving clinical diagnosis. Anatomical heterogeneity and inherent noise, however, present technical challenges and lower segmentation accuracy. The objective of this study is to propose a method for the automatic segmentation of the ventricular endocardium, the myocardium, and the left atrium, in order to accurately determine clinical indices. Specifically, we suggest using the recently introduced pixel-to-pixel Generative Adversarial Network (Pix2Pix GAN) model for accurate segmentation...

38656835

April 24, 2024: IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control

#40

JOURNAL ARTICLE

Comparison of transient elastography and shear wave elastography in patients with MAFLD: A single-center experience.

Mohamed Ahmed Samy Kohla, Ahmed El Fayoumi, Eman Abdel Sameea, Maha Elsabaawy, Rasha Abdelhafiz Aly, Sally Waheed, Mina Gerges, Medhat Assem Mahrous

BACKGROUND: Metabolic-associated fatty liver disease and liver fibrosis are intimately linked to insulin resistance, type 2 diabetes, obesity, and metabolic syndrome. Transient elastography (TE) and point shear wave elastography (pSWE) were used to measure liver stiffness in patients who met the ultrasound criteria for steatotic liver diseases (SLD). This study compared two methods for estimating liver stiffness in patients with SLD, which in turn correlated with liver fibrosis. METHOD: Ultrasound B-mode imaging was used to identify SLD...

38656816

April 24, 2024: Romanian Journal of Internal Medicine

Use the keywords feature with a free QxMD account.

Image

Save your favorite articles in one place with a free QxMD account.

Read

Search Tips