journal
Journals Journal of the American Statis...

Journal of the American Statistical Association

https://read.qxmd.com/read/38143785/on-robustness-of-individualized-decision-rules
#21
JOURNAL ARTICLE
Zhengling Qi, Jong-Shi Pang, Yufeng Liu
With the emergence of precision medicine, estimating optimal individualized decision rules (IDRs) has attracted tremendous attention in many scientific areas. Most existing literature has focused on finding optimal IDRs that can maximize the expected outcome for each individual. Motivated by complex individualized decision making procedures and the popular conditional value at risk (CVaR) measure, we propose a new robust criterion to estimate optimal IDRs in order to control the average lower tail of the individuals' outcomes...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/38099062/generalized-liquid-association-analysis-for-multimodal-data-integration
#22
JOURNAL ARTICLE
Lexin Li, Jing Zeng, Xin Zhang
Multimodal data are now prevailing in scientific research. One of the central questions in multimodal integrative analysis is to understand how two data modalities associate and interact with each other given another modality or demographic variables. The problem can be formulated as studying the associations among three sets of random variables, a question that has received relatively less attention in the literature. In this article, we propose a novel generalized liquid association analysis method, which offers a new and unique angle to this important class of problems of studying three-way associations...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/38046816/assessing-disparities-in-americans-exposure-to-pcbs-and-pbdes-based-on-nhanes-pooled-biomonitoring-data
#23
JOURNAL ARTICLE
Yan Liu, Dewei Wang, Li Li, Dingsheng Li
The National Health and Nutrition Examination Survey (NHANES) has been continuously biomonitoring Americans' exposure to two families of harmful environmental chemicals: polychlorinated biphenyls (PCBs) and polybrominated diphenyl ethers (PBDEs). However, biomonitoring these chemicals is expensive. To save cost, in 2005, NHANES resorted to pooled biomonitoring; i.e., amalgamating individual specimens to form a pool and measuring chemical levels from pools. Despite being publicly available, these pooled data gain limited applications in health studies...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37997574/a-correlated-network-scale-up-model-finding-the-connection-between-subpopulations
#24
JOURNAL ARTICLE
Ian Laga, Le Bao, Xiaoyue Niu
Aggregated relational data (ARD), formed from "How many X's do you know?" questions, is a powerful tool for learning important network characteristics with incomplete network data. Compared to traditional survey methods, ARD is attractive as it does not require a sample from the target population and does not ask respondents to self-reveal their own status. This is helpful for studying hard-to-reach populations like female sex workers who may be hesitant to reveal their status. From December 2008 to February 2009, the Kiev International Institute of Sociology (KIIS) collected ARD from 10,866 respondents to estimate the size of HIV-related groups in Ukraine...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37982009/genetic-underpinnings-of-brain-structural-connectome-for-young-adults
#25
JOURNAL ARTICLE
Yize Zhao, Changgee Chang, Jingwen Zhang, Zhengwu Zhang
With distinct advantages in power over behavioral phenotypes, brain imaging traits have become emerging endophenotypes to dissect molecular contributions to behaviors and neuropsychiatric illnesses. Among different imaging features, brain structural connectivity (i.e., structural connectome) which summarizes the anatomical connections between different brain regions is one of the most cutting edge while under-investigated traits; and the genetic influence on the structural connectome variation remains highly elusive...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37982008/a-general-framework-for-inference-on-algorithm-agnostic-variable-importance
#26
JOURNAL ARTICLE
Brian D Williamson, Peter B Gilbert, Noah R Simon, Marco Carone
In many applications, it is of interest to assess the relative contribution of features (or subsets of features) toward the goal of predicting a response - in other words, to gauge the variable importance of features. Most recent work on variable importance assessment has focused on describing the importance of features within the confines of a given prediction algorithm. However, such assessment does not necessarily characterize the prediction potential of features, and may provide a misleading reflection of the intrinsic value of these features...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37808547/causal-inference-in-transcriptome-wide-association-studies-with-invalid-instruments-and-gwas-summary-data
#27
JOURNAL ARTICLE
Haoran Xue, Xiaotong Shen, Wei Pan
Transcriptome-wide association studies (TWAS) have recently emerged as a popular tool to discover (putative) causal genes by integrating an outcome GWAS dataset with another gene expression/transcriptome GWAS (called eQTL) dataset. In our motivating and target application, we'd like to identify causal genes for low-density lipoprotein cholesterol (LDL), which is crucial for developing new treatments for hyperlipidemia and cardiovascular diseases. The statistical principle underlying TWAS is (two-sample) two-stage least squares (2SLS) using multiple correlated SNPs as instrumental variables (IVs); it is closely related to typical (two-sample) Mendelian randomization (MR) using independent SNPs as IVs, which is expected to be impractical and lower-powered for TWAS (and some other) applications...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37791295/tukey-s-depth-for-object-data
#28
JOURNAL ARTICLE
Xiongtao Dai, Sara Lopez-Pintado
We develop a novel exploratory tool for non-Euclidean object data based on data depth, extending celebrated Tukey's depth for Euclidean data. The proposed metric halfspace depth, applicable to data objects in a general metric space, assigns to data points depth values that characterize the centrality of these points with respect to the distribution and provides an interpretable center-outward ranking. Desirable theoretical properties that generalize standard depth properties postulated for Euclidean data are established for the metric halfspace depth...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37781353/discussion-of-lesa-longitudinal-elastic-shape-analysis-of-brain-subcortical-structures
#29
COMMENT
Moo K Chung, Jamie L Hanson, Richard J Davidson, Seth D Pollak
No abstract text is available yet for this article.
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37771513/sparse-topic-modeling-computational-efficiency-near-optimal-algorithms-and-statistical-inference
#30
JOURNAL ARTICLE
Ruijia Wu, Linjun Zhang, T Tony Cai
Sparse topic modeling under the probabilistic latent semantic indexing (pLSI) model is studied. Novel and computationally fast algorithms for estimation and inference of both the word-topic matrix and the topic-document matrix are proposed and their theoretical properties are investigated. Both minimax upper and lower bounds are established and the results show that the proposed algorithms are rate-optimal, up to a logarithmic factor. Moreover, a refitting algorithm is proposed to establish asymptotic normality and construct valid confidence intervals for the individual entries of the word-topic and topic-document matrices...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37771512/multifile-partitioning-for-record-linkage-and-duplicate-detection
#31
JOURNAL ARTICLE
Serge Aleshin-Guendel, Mauricio Sadinle
Merging datafiles containing information on overlapping sets of entities is a challenging task in the absence of unique identifiers, and is further complicated when some entities are duplicated in the datafiles. Most approaches to this problem have focused on linking two files assumed to be free of duplicates, or on detecting which records in a single file are duplicates. However, it is common in practice to encounter scenarios that fit somewhere in between or beyond these two settings. We propose a Bayesian approach for the general setting of multifile record linkage and duplicate detection...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37771511/time-to-event-analysis-with-unknown-time-origins-via-longitudinal-biomarker-registration
#32
JOURNAL ARTICLE
Tianhao Wang, Sarah J Ratcliffe, Wensheng Guo
In observational studies, the time origin of interest for time-to-event analysis is often unknown, such as the time of disease onset. Existing approaches to estimating the time origins are commonly built on extrapolating a parametric longitudinal model, which rely on rigid assumptions that can lead to biased inferences. In this paper, we introduce a flexible semiparametric curve registration model. It assumes the longitudinal trajectories follow a flexible common shape function with person-specific disease progression pattern characterized by a random curve registration function, which is further used to model the unknown time origin as a random start time...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37771510/real-time-regression-analysis-of-streaming-clustered-data-with-possible-abnormal-data-batches
#33
JOURNAL ARTICLE
Lan Luo, Ling Zhou, Peter X-K Song
This paper develops an incremental learning algorithm based on quadratic inference function (QIF) to analyze streaming datasets with correlated outcomes such as longitudinal data and clustered data. We propose a renewable QIF (RenewQIF) method within a paradigm of renewable estimation and incremental inference, in which parameter estimates are recursively renewed with current data and summary statistics of historical data, but with no use of any historical subject-level raw data. We compare our renewable estimation method with both offline QIF and offline generalized estimating equations (GEE) approach that process the entire cumulative subject-level data all together, and show theoretically and numerically that our renewable procedure enjoys statistical and computational efficiency...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37771509/orthogonalized-kernel-debiased-machine-learning-for-multimodal-data-analysis
#34
JOURNAL ARTICLE
Xiaowu Dai, Lexin Li
Multimodal imaging has transformed neuroscience research. While it presents unprecedented opportunities, it also imposes serious challenges. Particularly, it is difficult to combine the merits of the interpretability attributed to a simple association model with the flexibility achieved by a highly adaptive nonlinear model. In this article, we propose an orthogonalized kernel debiased machine learning approach, which is built upon the Neyman orthogonality and a form of decomposition orthogonality, for multimodal data analysis...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37519438/multivariate-temporal-point-process-regression
#35
JOURNAL ARTICLE
Xiwei Tang, Lexin Li
Point process modeling is gaining increasing attention, as point process type data are emerging in a large variety of scientific applications. In this article, motivated by a neuronal spike trains study, we propose a novel point process regression model, where both the response and the predictor can be a high-dimensional point process. We model the predictor effects through the conditional intensities using a set of basis transferring functions in a convolutional fashion. We organize the corresponding transferring coefficients in the form of a three-way tensor, then impose the low-rank, sparsity, and subgroup structures on this coefficient tensor...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37448462/feature-screening-for-interval-valued-response-with-application-to-study-association-between-posted-salary-and-required-skills
#36
JOURNAL ARTICLE
Wei Zhong, Chen Qian, Wanjun Liu, Liping Zhu, Runze Li
It is important to quantify the differences in returns to skills using the online job advertisements data, which have attracted great interest in both labor economics and statistics fields. In this paper, we study the relationship between the posted salary and the job requirements in online labor markets. There are two challenges to deal with. First, the posted salary is always presented in an interval-valued form, for example, 5k-10k yuan per month. Simply taking the mid-point or the lower bound as the alternative for salary may result in biased estimators...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37409267/-ipromix-a-mixture-model-for-studying-the-function-of-ace2-based-on-bulk-proteogenomic-data
#37
JOURNAL ARTICLE
Xiaoyu Song, Jiayi Ji, Pei Wang
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused over six million deaths in the ongoing COVID-19 pandemic. SARS-CoV-2 uses ACE2 protein to enter human cells, raising a pressing need to characterize proteins/pathways interacted with ACE2. Large-scale proteomic profiling technology is not mature at single-cell resolution to examine the protein activities in disease-relevant cell types. We propose iProMix , a novel statistical framework to identify epithelial-cell specific associations between ACE2 and other proteins/pathways with bulk proteomic data...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37366472/statistical-inference-for-high-dimensional-generalized-linear-models-with-binary-outcomes
#38
JOURNAL ARTICLE
T Tony Cai, Zijian Guo, Rong Ma
This paper develops a unified statistical inference framework for high-dimensional binary generalized linear models (GLMs) with general link functions. Both unknown and known design distribution settings are considered. A two-step weighted bias-correction method is proposed for constructing confidence intervals and simultaneous hypothesis tests for individual components of the regression vector. Minimax lower bound for the expected length is established and the proposed confidence intervals are shown to be rate-optimal up to a logarithmic factor...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37347088/communication-efficient-accurate-statistical-estimation
#39
JOURNAL ARTICLE
Jianqing Fan, Yongyi Guo, Kaizheng Wang
When the data are stored in a distributed manner, direct applications of traditional statistical inference procedures are often prohibitive due to communication costs and privacy concerns. This paper develops and investigates two Communication-Efficient Accurate Statistical Estimators (CEASE), implemented through iterative algorithms for distributed optimization. In each iteration, node machines carry out computation in parallel and communicate with the central processor, which then broadcasts aggregated information to node machines for new updates...
2023: Journal of the American Statistical Association
https://read.qxmd.com/read/37347087/matching-one-sample-according-to-two-criteria-in-observational-studies
#40
JOURNAL ARTICLE
B Zhang, D S Small, K B Lasater, M McHugh, J H Silber, P R Rosenbaum
Multivariate matching has two goals: (i) to construct treated and control groups that have similar distributions of observed covariates, and (ii) to produce matched pairs or sets that are homogeneous in a few key covariates. When there are only a few binary covariates, both goals may be achieved by matching exactly for these few covariates. Commonly, however, there are many covariates, so goals (i) and (ii) come apart, and must be achieved by different means. As is also true in a randomized experiment, similar distributions can be achieved for a high-dimensional covariate, but close pairs can be achieved for only a few covariates...
2023: Journal of the American Statistical Association
journal
journal
25475
2
3
Fetch more papers »
Fetching more papers... Fetching...
Remove bar
Read by QxMD icon Read
×

Save your favorite articles in one place with a free QxMD account.

×

Search Tips

Use Boolean operators: AND/OR

diabetic AND foot
diabetes OR diabetic

Exclude a word using the 'minus' sign

Virchow -triad

Use Parentheses

water AND (cup OR glass)

Add an asterisk (*) at end of a word to include word stems

Neuro* will search for Neurology, Neuroscientist, Neurological, and so on

Use quotes to search for an exact phrase

"primary prevention of cancer"
(heart or cardiac or cardio*) AND arrest -"American Heart Association"

We want to hear from doctors like you!

Take a second to answer a survey question.