样式: 排序: IF: - GO 导出 标记为已读
-
A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-21 Namjoon Suh, Guang Cheng
In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics, and generative models. In the first part, results on excess risks for neural networks are reviewed in the nonparametric framework of regression. These results rely on explicit constructions of neural networks, leading to fast convergence rates of excess risks
-
Models and Rating Systems for Head-to-Head Competition Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-20 Mark E. Glickman, Albyn C. Jones
One of the most important tasks in sports analytics is the development of binary response models for head-to-head game outcomes to estimate team and player strength. We discuss commonly used probability models for game outcomes, including the Bradley–Terry and Thurstone–Mosteller models, as well as extensions to ties as a third outcome and to the inclusion of a home-field advantage. We consider dynamic
-
A Review of Reinforcement Learning in Financial Applications Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-15 Yahui Bai, Yuhe Gao, Runzhe Wan, Sheng Zhang, Rui Song
In recent years, there has been a growing trend of applying reinforcement learning (RL) in financial applications. This approach has shown great potential for decision-making tasks in finance. In this review, we present a comprehensive study of the applications of RL in finance and conduct a series of meta-analyses to investigate the common themes in the literature, such as the factors that most significantly
-
Joint Modeling of Longitudinal and Survival Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-14 Jane-Ling Wang, Qixian Zhong
In medical studies, time-to-event outcomes such as time to death or relapse of a disease are routinely recorded along with longitudinal data that are observed intermittently during the follow-up period. For various reasons, marginal approaches to model the event time, corresponding to separate approaches for survival data/longitudinal data, tend to induce bias and lose efficiency. Instead, a joint
-
Infectious Disease Modeling Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12 Jing Huang, Jeffrey S. Morris
Infectious diseases pose a persistent challenge to public health worldwide. Recent global health crises, such as the COVID-19 pandemic and Ebola outbreaks, have underscored the vital role of infectious disease modeling in guiding public health policy and response. Infectious disease modeling is a critical tool for society, informing risk mitigation measures, prompting timely interventions, and aiding
-
Tensors in High-Dimensional Data Analysis: Methodological Opportunities and Theoretical Challenges Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12 Arnab Auddy, Dong Xia, Ming Yuan
Large amounts of multidimensional data represented by multiway arrays or tensors are prevalent in modern applications across various fields such as chemometrics, genomics, physics, psychology, and signal processing. The structural complexity of such data provides vast new opportunities for modeling and analysis, but efficiently extracting information content from them, both statistically and computationally
-
Empirical Likelihood in Functional Data Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12 Hsin-wen Chang, Ian W. McKeague
Functional data analysis (FDA) studies data that include infinite-dimensional functions or objects, generalizing traditional univariate or multivariate observations from each study unit. Among inferential approaches without parametric assumptions, empirical likelihood (EL) offers a principled method in that it extends the framework of parametric likelihood ratio–based inference via the nonparametric
-
Excess Mortality Estimation Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12 Jon Wakefield, Victoria Knutson
Estimating the mortality associated with a specific mortality crisis event (for example, a pandemic, natural disaster, or conflict) is clearly an important public health undertaking. In many situations, deaths may be directly or indirectly attributable to the mortality crisis event, and both contributions may be of interest. The totality of the mortality impact on the population (direct and indirect
-
Neural Methods for Amortized Inference Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12 Andrew Zammit-Mangion, Matthew Sainsbury-Dale, Raphaël Huser
Simulation-based methods for statistical inference have evolved dramatically over the past 50 years, keeping pace with technological advancements. The field is undergoing a new revolution as it embraces the representational capacity of neural networks, optimization libraries, and graphics processing units for learning complex mappings between data and inferential targets. The resulting tools are amortized
-
Causal Mediation Analysis for Integrating Exposure, Genomic, and Phenotype Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-30 Haoyu Yang, Zhonghua Liu, Ruoyu Wang, En-Yu Lai, Joel Schwartz, Andrea A. Baccarelli, Yen-Tsung Huang, Xihong Lin
Causal mediation analysis provides an attractive framework for integrating diverse types of exposure, genomic, and phenotype data. Recently, this field has seen a surge of interest, largely driven by the increasing need for causal mediation analyses in health and social sciences. This article aims to provide a review of recent developments in mediation analysis, encompassing mediation analysis of a
-
Designs for Vaccine Studies Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-30 M. Elizabeth Halloran
Due to dependent happenings, vaccines can have different effects in populations. In addition to direct protective effects in the vaccinated, vaccination in a population can have indirect effects in the unvaccinated individuals. Vaccination can also reduce person-to-person transmission to vaccinated individuals or from vaccinated individuals compared with unvaccinated individuals. Design of vaccine
-
A Statistical Viewpoint on Differential Privacy: Hypothesis Testing, Representation, and Blackwell's Theorem Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-18 Weijie J. Su
Differential privacy is widely considered the formal privacy for privacy-preserving data analysis due to its robust and rigorous guarantees, with increasingly broad adoption in public services, academia, and industry. Although differential privacy originated in the cryptographic context, in this review we argue that, fundamentally, it can be considered a pure statistical concept. We leverage Blackwell's
-
Reproducibility in the Classroom Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-09 Mine Dogucu
Difficulties in reproducing results from scientific studies have lately been referred to as a reproducibility crisis. Scientific practice depends heavily on scientific training. What gets taught in the classroom is often practiced in labs, fields, and data analysis. The importance of reproducibility in the classroom has gained momentum in statistics education in recent years. In this article, we review
-
Generalized Additive Models Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-07 Simon N. Wood
Generalized additive models are generalized linear models in which the linear predictor includes a sum of smooth functions of covariates, where the shape of the functions is to be estimated. They have also been generalized beyond the original generalized linear model setting to distributions outside the exponential family and to situations in which multiple parameters of the response distribution may
-
Statistics in Phonetics Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-01 Shahin Tavakoli, Beatrice Matteo, Davide Pigoli, Eleanor Chodroff, John Coleman, Michele Gubian, Margaret E.L. Renwick, Morgan Sonderegger
Phonetics is the scientific field concerned with the study of how speech is produced, heard, and perceived. It abounds with data, such as acoustic speech recordings, neuroimaging data, or articulatory data. In this article, we provide an introduction to different areas of phonetics (acoustic phonetics, sociophonetics, speech perception, articulatory phonetics, speech inversion, sound change, and speech
-
Hawkes Models and Their Applications Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-01 Patrick J. Laub, Young Lee, Philip K. Pollett, Thomas Taimre
The Hawkes process is a model for counting the number of arrivals to a system that exhibits the self-exciting property—that one arrival creates a heightened chance of further arrivals in the near future. The model and its generalizations have been applied in a plethora of disparate domains, though two particularly developed applications are in seismology and in finance. As the original model is elegantly
-
Identification and Inference with Invalid Instruments Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-09-26 Hyunseung Kang, Zijian Guo, Zhonghua Liu, Dylan Small
Instrumental variables (IVs) are widely used to study the causal effect of an exposure on an outcome in the presence of unmeasured confounding. IVs require an instrument, a variable that (a) is associated with the exposure, (b) has no direct effect on the outcome except through the exposure, and (c) is not related to unmeasured confounders. Unfortunately, finding variables that satisfy conditions b
-
Measuring the Functioning Human Brain Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-09-11 Martin A. Lindquist, Bonnie B. Smith, Arunkumar Kannan, Angela Zhao, Brian Caffo
The emergence of functional magnetic resonance imaging (fMRI) marked a significant technological breakthrough in the real-time measurement of the functioning human brain in vivo. In part because of their 4D nature (three spatial dimensions and time), fMRI data have inspired a great deal of statistical development in the past couple of decades to address their unique spatiotemporal properties. This
-
High-Dimensional Gene–Environment Interaction Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-09-11 Mengyun Wu, Yingmeng Li, Shuangge Ma
Beyond the main genetic and environmental effects, gene–environment (G–E) interactions have been demonstrated to significantly contribute to the development and progression of complex diseases. Published analyses of G–E interactions have primarily used a supervised framework to model both low-dimensional environmental factors and high-dimensional genetic factors in relation to disease outcomes. In
-
A Theoretical Review of Modern Robust Statistics Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-08-21 Po-Ling Loh
Robust statistics is a fairly mature field that dates back to the early 1960s, with many foundational concepts having been developed in the ensuing decades. However, the field has drawn a new surge of attention in the past decade, largely due to a desire to recast robust statistical principles in the context of high-dimensional statistics. In this article, we begin by reviewing some of the central
-
Crafting 10 Years of Statistics Explanations: Points of Significance Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-08-21 Naomi Altman, Martin Krzywinski
Points of Significance is an ongoing series of short articles about statistics in Nature Methods that started in 2013. Its aim is to provide clear explanations of essential concepts in statistics for a nonspecialist audience. The articles favor heuristic explanations and make extensive use of simulated examples and graphical explanations, while maintaining mathematical rigor. Topics range from basic
-
Statistical Data Integration for Health Policy Evidence-Building Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-08-19 Susan M. Paddock, Carolina Franco, F. Jay Breidt, Brenda Betancourt
Health policy evidence-building requires data sources such as health care claims, electronic health records, probability and nonprobability survey data, epidemiological surveillance databases, administrative data, and more, all of which have strengths and limitations for a given policy analysis. Data integration techniques leverage the relative strengths of input sources to obtain a blended source
-
The Role of the Bayes Factor in the Evaluation of Evidence Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-04-24 Colin Aitken, Franco Taroni, Silvia Bozza
The use of the Bayes factor as a metric for the assessment of the probative value of forensic scientific evidence is largely supported by recommended standards in different disciplines. The application of Bayesian networks enables the consideration of problems of increasing complexity. The lack of a widespread consensus concerning key aspects of evidence evaluation and interpretation, such as the adequacy
-
Convergence Diagnostics for Entity Resolution Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-04-24 Serge Aleshin-Guendel, Rebecca C. Steorts
Entity resolution is the process of merging and removing duplicate records from multiple data sources, often in the absence of unique identifiers. Bayesian models for entity resolution allow one to include a priori information, quantify uncertainty in important applications, and directly estimate a partition of the records. Markov chain Monte Carlo (MCMC) sampling is the primary computational method
-
Manifold Learning: What, How, and Why Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29 Marina Meilă, Hanyu Zhang
Manifold learning (ML), also known as nonlinear dimension reduction, is a set of methods to find the low-dimensional structure of data. Dimension reduction for large, high-dimensional data is not merely a way to reduce the data; the new representations and descriptors obtained by ML reveal the geometric shape of high-dimensional point clouds and allow one to visualize, denoise, and interpret them.
-
Maps: A Statistical View Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29 Lance A. Waller
Maps provide a data framework for the statistical analysis of georeferenced data observations. Since the middle of the twentieth century, the field of spatial statistics has evolved to address key inferential questions relating to spatially defined data, yet many central statistical properties do not translate to spatially indexed and spatially correlated data, and the development of statistical inference
-
Communication of Statistics and Evidence in Times of Crisis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29 Claudia R. Schneider, John R. Kerr, Sarah Dryhurst, John A.D. Aston
This review provides an overview of concepts relating to the communication of statistical and empirical evidence in times of crisis, with a special focus on COVID-19. In it, we consider topics relating to both the communication of numbers, such as the role of format, context, comparisons, and visualization, and the communication of evidence more broadly, such as evidence quality, the influence of changes
-
Recent Advances in Text Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29 Zheng Tracy Ke, Pengsheng Ji, Jiashun Jin, Wanshan Li
Text analysis is an interesting research area in data science and has various applications, such as in artificial intelligence, biomedical research, and engineering. We review popular methods for text analysis, ranging from topic modeling to the recent neural language models. In particular, we review Topic-SCORE, a statistical approach to topic modeling, and discuss how to use it to analyze the Multi-Attribute
-
Statistical Brain Network Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-28 Sean L. Simpson, Heather M. Shappell, Mohsen Bahrami
The recent fusion of network science and neuroscience has catalyzed a paradigm shift in how we study the brain and led to the field of brain network analysis. Brain network analyses hold great potential in helping us understand normal and abnormal brain function by providing profound clinical insight into links between system-level properties and health and behavioral outcomes. Nonetheless, methods
-
Relational Event Modeling Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-28 Federica Bianchi, Edoardo Filippi-Mazzola, Alessandro Lomi, Ernst C. Wit
Advances in information technology have increased the availability of time-stamped relational data, such as those produced by email exchanges or interaction through social media. Whereas the associated information flows could be aggregated into cross-sectional panels, the temporal ordering of the events frequently contains information that requires new models for the analysis of continuous-time interactions
-
Competing Risks: Concepts, Methods, and Software Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-22 Ronald B. Geskus
The role of competing risks in the analysis of time-to-event data is increasingly acknowledged. Software is readily available. However, confusion remains regarding the proper analysis: When and how do I need to take the presence of competing risks into account? Which quantities are relevant for my research question? How can they be estimated and what assumptions do I need to make? The main quantities
-
Distributed Computing and Inference for Big Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-17 Ling Zhou, Ziyang Gong, Pengcheng Xiang
Data are distributed across different sites due to computing facility limitations or data privacy considerations. Conventional centralized methods—those in which all datasets are stored and processed in a central computing facility—are not applicable in practice. Therefore, it has become necessary to develop distributed learning approaches that have good inference or predictive accuracy while remaining
-
Causal Inference in the Social Sciences Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-17 Guido W. Imbens
Knowledge of causal effects is of great importance to decision makers in a wide variety of settings. In many cases, however, these causal effects are not known to the decision makers and need to be estimated from data. This fundamental problem has been known and studied for many years in many disciplines. In the past thirty years, however, the amount of empirical as well as methodological research
-
Interpretable Machine Learning for Discovery: Statistical Challenges and Opportunities Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-17 Genevera I. Allen, Luqin Gan, Lili Zheng
New technologies have led to vast troves of large and complex data sets across many scientific domains and industries. People routinely use machine learning techniques not only to process, visualize, and make predictions from these big data, but also to make data-driven discoveries. These discoveries are often made using interpretable machine learning, or machine learning models and techniques that
-
Geometric Methods for Cosmological Data on the Sphere Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-06 Javier Carrón Duque, Domenico Marinucci
This review is devoted to recent developments in the statistical analysis of spherical data, strongly motivated by applications in cosmology. We start from a brief discussion of cosmological questions and motivations, arguing that most cosmological observables are spherical random fields. Then, we introduce some mathematical background on spherical random fields, including spectral representations
-
Stochastic Models of Rainfall Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-31 Paul J. Northrop
Rainfall is the main input to most hydrological systems. To assess flood risk for a catchment area, hydrologists use models that require long series of subdaily, perhaps even subhourly, rainfall data, ideally from locations that cover the area. If historical data are not sufficient for this purpose, an alternative is to simulate synthetic data from a suitably calibrated model. We review stochastic
-
Shape-Constrained Statistical Inference Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13 Lutz Dümbgen
Statistical models defined by shape constraints are a valuable alternative to parametric models or nonparametric models defined in terms of quantitative smoothness constraints. While the latter two classes of models are typically difficult to justify a priori, many applications involve natural shape constraints, for instance, monotonicity of a density or regression function. We review some of the history
-
Analysis of Microbiome Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13 Christine B. Peterson, Satabdi Saha, Kim-Anh Do
The microbiome represents a hidden world of tiny organisms populating not only our surroundings but also our own bodies. By enabling comprehensive profiling of these invisible creatures, modern genomic sequencing tools have given us an unprecedented ability to characterize these populations and uncover their outsize impact on our environment and health. Statistical analysis of microbiome data is critical
-
Distributional Regression for Data Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13 Nadja Klein
The flexible modeling of an entire distribution as a function of covariates, known as distributional regression, has seen growing interest over the past decades in both the statistics and machine learning literature. This review outlines selected state-of-the-art statistical approaches to distributional regression, complemented with alternatives from machine learning. Topics covered include the similarities
-
Role of Statistics in Detecting Misinformation: A Review of the State of the Art, Open Issues, and Future Research Directions Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13 Zois Boukouvalas, Allison Shafer
With the evolution of social media, cyberspace has become the default medium for social media users to communicate, especially during high-impact events such as pandemics, natural disasters, terrorist attacks, and periods of political unrest. However, during such events, misinformation can spread rapidly on social media, affecting decision-making and creating social unrest. Identifying and curtailing
-
An Update on Measurement Error Modeling Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13 Mushan Li, Yanyuan Ma
The issues caused by measurement errors have been recognized for almost 90 years, and research in this area has flourished since the 1980s. We review some of the classical methods in both density estimation and regression problems with measurement errors. In both problems, we consider when the original error-free model is parametric, nonparametric, and semiparametric, in combination with different
-
Making Sense of Censored Covariates: Statistical Methods for Studies of Huntington's Disease Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-09-08 Sarah C. Lotspeich, Marissa C. Ashner, Jesus E. Vazquez, Brian D. Richardson, Kyle F. Grosser, Benjamin E. Bodek, Tanya P. Garcia
The landscape of survival analysis is constantly being revolutionized to answer biomedical challenges, most recently the statistical challenge of censored covariates rather than outcomes. There are many promising strategies to tackle censored covariates, including weighting, imputation, maximum likelihood, and Bayesian methods. Still, this is a relatively fresh area of research, different from the
-
Variable Importance Without Impossible Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-08-25 Masayoshi Mase, Art B. Owen, Benjamin B. Seiler
The most popular methods for measuring importance of the variables in a black-box prediction algorithm make use of synthetic inputs that combine predictor variables from multiple observations. These inputs can be unlikely, physically impossible, or even logically impossible. As a result, the predictions for such cases can be based on data very unlike any the black box was trained on. We think that
-
Bayesian Inference for Misspecified Generative Models Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-08-24 David J. Nott, Christopher Drovandi, David T. Frazier
Bayesian inference is a powerful tool for combining information in complex settings, a task of increasing importance in modern applications. However, Bayesian inference with a flawed model can produce unreliable conclusions. This review discusses approaches to performing Bayesian inference when the model is misspecified, where, by misspecified, we mean that the analyst is unwilling to act as if the
-
Inverse Problems for Physics-Based Process Models Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-08-16 Derek Bingham, Troy Butler, Don Estep
We describe and compare two formulations of inverse problems for a physics-based process model in the context of uncertainty and random variability: the Bayesian inverse problem and the stochastic inverse problem. We describe the foundations of the two problems in order to create a context for interpreting the applicability and solutions of inverse problems important for scientific and engineering
-
Graph-Based Change-Point Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09 Hao Chen, Lynna Chu
Recent technological advances allow for the collection of massive data in the study of complex phenomena over time and/or space in various fields. Many of these data involve sequences of high-dimensional or non-Euclidean measurements, where change-point analysis is a crucial early step in understanding the data. Segmentation, or offline change-point analysis, divides data into homogeneous temporal
-
Surrogate Endpoints in Clinical Trials Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09 Michael R. Elliott
Surrogate markers are often used in clinical trials settings when obtaining a final outcome to evaluate the effectiveness of a treatment requires a long wait, is expensive to obtain, or both. Formal definitions of surrogate marker quality resulting from a large variety of estimation approaches have been proposed over the years. I review this work, with a particular focus on approaches that use the
-
High-Dimensional Data Bootstrap Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09 Victor Chernozhukov, Denis Chetverikov, Kengo Kato, Yuta Koike
This article reviews recent progress in high-dimensional bootstrap. We first review high-dimensional central limit theorems for distributions of sample mean vectors over the rectangles, bootstrap consistency results in high dimensions, and key techniques used to establish those results. We then review selected applications of high-dimensional bootstrap: construction of simultaneous confidence sets
-
Second-Generation Functional Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09 Salil Koner, Ana-Maria Staicu
Modern studies from a variety of fields record multiple functional observations according to either multivariate, longitudinal, spatial, or time series designs. We refer to such data as second-generation functional data because their analysis—unlike typical functional data analysis, which assumes independence of the functions—accounts for the complex dependence between the functional observations and
-
A Brief Tour of Deep Learning from a Statistical Perspective Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09 Eric Nalisnick, Padhraic Smyth, Dustin Tran
We expose the statistical foundations of deep learning with the goal of facilitating conversation between the deep learning and statistics communities. We highlight core themes at the intersection; summarize key neural models, such as feedforward neural networks, sequential neural networks, and neural latent variable models; and link these ideas to their roots in probability and statistics. We also
-
Statistical Deep Learning for Spatial and Spatiotemporal Data Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09 Christopher K. Wikle, Andrew Zammit-Mangion
Deep neural network models have become ubiquitous in recent years and have been applied to nearly all areas of science, engineering, and industry. These models are particularly useful for data that have strong dependencies in space (e.g., images) and time (e.g., sequences). Indeed, deep models have also been extensively used by the statistical community to model spatial and spatiotemporal data through
-
Confidentiality Protection in the 2020 US Census of Population and Housing Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-29 John M. Abowd, Michael B. Hawes
In an era where external data and computational capabilities far exceed statistical agencies’ own resources and capabilities, they face the renewed challenge of protecting the confidentiality of underlying microdata when publishing statistics in very granular form and ensuring that these granular data are used for statistical purposes only. Conventional statistical disclosure limitation methods are
-
Statistical Methods for Exoplanet Detection with Radial Velocities Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-22 Nathan C. Hara, Eric B. Ford
Exoplanets can be detected with various observational techniques. Among them, radial velocity (RV) has the key advantages of revealing the architecture of planetary systems and measuring planetary mass and orbital eccentricities. RV observations are poised to play a key role in the detection and characterization of Earth twins. However, the detection of such small planets is not yet possible due to
-
Statistical Machine Learning for Quantitative Finance Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-22 M. Ludkovski
We survey the active interface of statistical learning methods and quantitative finance models. Our focus is on the use of statistical surrogates, also known as functional approximators, for learning input–output relationships relevant for financial tasks. Given the disparate terminology used among statisticians and financial mathematicians, we begin by reviewing the main ingredients of surrogate construction
-
Approximate Methods for Bayesian Computation Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-22 Radu V. Craiu, Evgeny Levi
Rich data generating mechanisms are ubiquitous in this age of information and require complex statistical models to draw meaningful inference. While Bayesian analysis has seen enormous development in the last 30 years, benefitting from the impetus given by the successful application of Markov chain Monte Carlo (MCMC) sampling, the combination of big data and complex models conspire to produce significant
-
Fifty Years of the Cox Model Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-21 John D. Kalbfleisch, Douglas E. Schaubel
The Cox model is now 50 years old. The seminal paper of Sir David Cox has had an immeasurable impact on the analysis of censored survival data, with applications in many different disciplines. This work has also stimulated much additional research in diverse areas and led to important theoretical and practical advances. These include semiparametric models, nonparametric efficiency, and partial likelihood
-
Statistical Data Privacy: A Song of Privacy and Utility Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19 Aleksandra Slavković, Jeremy Seeman
To quantify trade-offs between increasing demand for open data sharing and concerns about sensitive information disclosure, statistical data privacy (SDP) methodology analyzes data release mechanisms that sanitize outputs based on confidential data. Two dominant frameworks exist: statistical disclosure control (SDC) and the more recent differential privacy (DP). Despite framing differences, both SDC
-
Innovation Diffusion Processes: Concepts, Models, and Predictions Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19 Mariangela Guidolin, Piero Manfredi
Innovation diffusion processes have attracted considerable research attention for their interdisciplinary character, which combines theories and concepts from disciplines such as mathematics, physics, statistics, social sciences, marketing, economics, and technological forecasting. The formal representation of innovation diffusion processes historically used epidemic models borrowed from biology, departing
-
Simulation-Based Bayesian Analysis Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19 Martyn Plummer
I consider the development of Markov chain Monte Carlo (MCMC) methods, from late-1980s Gibbs sampling to present-day gradient-based methods and piecewise-deterministic Markov processes. In parallel, I show how these ideas have been implemented in successive generations of statistical software for Bayesian inference. These software packages have been instrumental in popularizing applied Bayesian modeling
-
The Role of Statistics in Promoting Data Reusability and Research Transparency Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19 Sarah M. Nusser
The value of research data has grown as the emphasis on research transparency and data-intensive research has increased. Data sharing is now required by funders and publishers and is becoming a disciplinary expectation in many fields. However, practices promoting data reusability and research transparency are poorly understood, making it difficult for statisticians and other researchers to reframe