Annual Review of Statistics and Its Application期刊新发论文, 统计, 本领域期刊类期刊,

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-21
Namjoon Suh, Guang Cheng

In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics, and generative models. In the first part, results on excess risks for neural networks are reviewed in the nonparametric framework of regression. These results rely on explicit constructions of neural networks, leading to fast convergence rates of excess risks

更新日期：2024-11-21

详情收藏

Models and Rating Systems for Head-to-Head Competition

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-20
Mark E. Glickman, Albyn C. Jones

One of the most important tasks in sports analytics is the development of binary response models for head-to-head game outcomes to estimate team and player strength. We discuss commonly used probability models for game outcomes, including the Bradley–Terry and Thurstone–Mosteller models, as well as extensions to ties as a third outcome and to the inclusion of a home-field advantage. We consider dynamic

更新日期：2024-11-20

详情收藏

A Review of Reinforcement Learning in Financial Applications

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-15
Yahui Bai, Yuhe Gao, Runzhe Wan, Sheng Zhang, Rui Song

In recent years, there has been a growing trend of applying reinforcement learning (RL) in financial applications. This approach has shown great potential for decision-making tasks in finance. In this review, we present a comprehensive study of the applications of RL in finance and conduct a series of meta-analyses to investigate the common themes in the literature, such as the factors that most significantly

更新日期：2024-11-15

详情收藏

Joint Modeling of Longitudinal and Survival Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-14
Jane-Ling Wang, Qixian Zhong

In medical studies, time-to-event outcomes such as time to death or relapse of a disease are routinely recorded along with longitudinal data that are observed intermittently during the follow-up period. For various reasons, marginal approaches to model the event time, corresponding to separate approaches for survival data/longitudinal data, tend to induce bias and lose efficiency. Instead, a joint

更新日期：2024-11-14

详情收藏

Infectious Disease Modeling

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12
Jing Huang, Jeffrey S. Morris

Infectious diseases pose a persistent challenge to public health worldwide. Recent global health crises, such as the COVID-19 pandemic and Ebola outbreaks, have underscored the vital role of infectious disease modeling in guiding public health policy and response. Infectious disease modeling is a critical tool for society, informing risk mitigation measures, prompting timely interventions, and aiding

更新日期：2024-11-12

详情收藏

Tensors in High-Dimensional Data Analysis: Methodological Opportunities and Theoretical Challenges

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12
Arnab Auddy, Dong Xia, Ming Yuan

Large amounts of multidimensional data represented by multiway arrays or tensors are prevalent in modern applications across various fields such as chemometrics, genomics, physics, psychology, and signal processing. The structural complexity of such data provides vast new opportunities for modeling and analysis, but efficiently extracting information content from them, both statistically and computationally

更新日期：2024-11-12

详情收藏

Empirical Likelihood in Functional Data Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12
Hsin-wen Chang, Ian W. McKeague

Functional data analysis (FDA) studies data that include infinite-dimensional functions or objects, generalizing traditional univariate or multivariate observations from each study unit. Among inferential approaches without parametric assumptions, empirical likelihood (EL) offers a principled method in that it extends the framework of parametric likelihood ratio–based inference via the nonparametric

更新日期：2024-11-12

详情收藏

Excess Mortality Estimation

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12
Jon Wakefield, Victoria Knutson

Estimating the mortality associated with a specific mortality crisis event (for example, a pandemic, natural disaster, or conflict) is clearly an important public health undertaking. In many situations, deaths may be directly or indirectly attributable to the mortality crisis event, and both contributions may be of interest. The totality of the mortality impact on the population (direct and indirect

更新日期：2024-11-12

详情收藏

Neural Methods for Amortized Inference

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-11-12
Andrew Zammit-Mangion, Matthew Sainsbury-Dale, Raphaël Huser

Simulation-based methods for statistical inference have evolved dramatically over the past 50 years, keeping pace with technological advancements. The field is undergoing a new revolution as it embraces the representational capacity of neural networks, optimization libraries, and graphics processing units for learning complex mappings between data and inferential targets. The resulting tools are amortized

更新日期：2024-11-12

详情收藏

Causal Mediation Analysis for Integrating Exposure, Genomic, and Phenotype Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-30
Haoyu Yang, Zhonghua Liu, Ruoyu Wang, En-Yu Lai, Joel Schwartz, Andrea A. Baccarelli, Yen-Tsung Huang, Xihong Lin

Causal mediation analysis provides an attractive framework for integrating diverse types of exposure, genomic, and phenotype data. Recently, this field has seen a surge of interest, largely driven by the increasing need for causal mediation analyses in health and social sciences. This article aims to provide a review of recent developments in mediation analysis, encompassing mediation analysis of a

更新日期：2024-10-30

详情收藏

Designs for Vaccine Studies

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-30
M. Elizabeth Halloran

Due to dependent happenings, vaccines can have different effects in populations. In addition to direct protective effects in the vaccinated, vaccination in a population can have indirect effects in the unvaccinated individuals. Vaccination can also reduce person-to-person transmission to vaccinated individuals or from vaccinated individuals compared with unvaccinated individuals. Design of vaccine

更新日期：2024-10-30

详情收藏

A Statistical Viewpoint on Differential Privacy: Hypothesis Testing, Representation, and Blackwell's Theorem

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-18
Weijie J. Su

Differential privacy is widely considered the formal privacy for privacy-preserving data analysis due to its robust and rigorous guarantees, with increasingly broad adoption in public services, academia, and industry. Although differential privacy originated in the cryptographic context, in this review we argue that, fundamentally, it can be considered a pure statistical concept. We leverage Blackwell's

更新日期：2024-10-18

详情收藏

Reproducibility in the Classroom

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-09
Mine Dogucu

Difficulties in reproducing results from scientific studies have lately been referred to as a reproducibility crisis. Scientific practice depends heavily on scientific training. What gets taught in the classroom is often practiced in labs, fields, and data analysis. The importance of reproducibility in the classroom has gained momentum in statistics education in recent years. In this article, we review

更新日期：2024-10-09

详情收藏

Generalized Additive Models

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-07
Simon N. Wood

Generalized additive models are generalized linear models in which the linear predictor includes a sum of smooth functions of covariates, where the shape of the functions is to be estimated. They have also been generalized beyond the original generalized linear model setting to distributions outside the exponential family and to situations in which multiple parameters of the response distribution may

更新日期：2024-10-07

详情收藏

Statistics in Phonetics

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-01
Shahin Tavakoli, Beatrice Matteo, Davide Pigoli, Eleanor Chodroff, John Coleman, Michele Gubian, Margaret E.L. Renwick, Morgan Sonderegger

Phonetics is the scientific field concerned with the study of how speech is produced, heard, and perceived. It abounds with data, such as acoustic speech recordings, neuroimaging data, and articulatory data. In this article, we provide an introduction to different areas of phonetics (acoustic phonetics, sociophonetics, speech perception, articulatory phonetics, speech inversion, sound change, and speech

更新日期：2024-10-01

详情收藏

Hawkes Models and Their Applications

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-10-01
Patrick J. Laub, Young Lee, Philip K. Pollett, Thomas Taimre

The Hawkes process is a model for counting the number of arrivals to a system that exhibits the self-exciting property—that one arrival creates a heightened chance of further arrivals in the near future. The model and its generalizations have been applied in a plethora of disparate domains, though two particularly developed applications are in seismology and in finance. As the original model is elegantly

更新日期：2024-10-01

详情收藏

Identification and Inference with Invalid Instruments

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-09-26
Hyunseung Kang, Zijian Guo, Zhonghua Liu, Dylan Small

Instrumental variables (IVs) are widely used to study the causal effect of an exposure on an outcome in the presence of unmeasured confounding. IVs require an instrument, a variable that (a) is associated with the exposure, (b) has no direct effect on the outcome except through the exposure, and (c) is not related to unmeasured confounders. Unfortunately, finding variables that satisfy conditions b

更新日期：2024-09-26

详情收藏

Measuring the Functioning Human Brain

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-09-11
Martin A. Lindquist, Bonnie B. Smith, Arunkumar Kannan, Angela Zhao, Brian Caffo

The emergence of functional magnetic resonance imaging (fMRI) marked a significant technological breakthrough in the real-time measurement of the functioning human brain in vivo. In part because of their 4D nature (three spatial dimensions and time), fMRI data have inspired a great deal of statistical development in the past couple of decades to address their unique spatiotemporal properties. This

更新日期：2024-09-11

详情收藏

High-Dimensional Gene–Environment Interaction Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-09-11
Mengyun Wu, Yingmeng Li, Shuangge Ma

Beyond the main genetic and environmental effects, gene–environment (G–E) interactions have been demonstrated to significantly contribute to the development and progression of complex diseases. Published analyses of G–E interactions have primarily used a supervised framework to model both low-dimensional environmental factors and high-dimensional genetic factors in relation to disease outcomes. In

更新日期：2024-09-11

详情收藏

A Theoretical Review of Modern Robust Statistics

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-08-21
Po-Ling Loh

Robust statistics is a fairly mature field that dates back to the early 1960s, with many foundational concepts having been developed in the ensuing decades. However, the field has drawn a new surge of attention in the past decade, largely due to a desire to recast robust statistical principles in the context of high-dimensional statistics. In this article, we begin by reviewing some of the central

更新日期：2024-08-21

详情收藏

Crafting 10 Years of Statistics Explanations: Points of Significance

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-08-21
Naomi Altman, Martin Krzywinski

Points of Significance is an ongoing series of short articles about statistics in Nature Methods that started in 2013. Its aim is to provide clear explanations of essential concepts in statistics for a nonspecialist audience. The articles favor heuristic explanations and make extensive use of simulated examples and graphical explanations, while maintaining mathematical rigor. Topics range from basic

更新日期：2024-08-21

详情收藏

Statistical Data Integration for Health Policy Evidence-Building

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-08-19
Susan M. Paddock, Carolina Franco, F. Jay Breidt, Brenda Betancourt

Health policy evidence-building requires data sources such as health care claims, electronic health records, probability and nonprobability survey data, epidemiological surveillance databases, administrative data, and more, all of which have strengths and limitations for a given policy analysis. Data integration techniques leverage the relative strengths of input sources to obtain a blended source

更新日期：2024-08-19

详情收藏

The Role of the Bayes Factor in the Evaluation of Evidence

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-04-24
Colin Aitken, Franco Taroni, Silvia Bozza

The use of the Bayes factor as a metric for the assessment of the probative value of forensic scientific evidence is largely supported by recommended standards in different disciplines. The application of Bayesian networks enables the consideration of problems of increasing complexity. The lack of a widespread consensus concerning key aspects of evidence evaluation and interpretation, such as the adequacy

更新日期：2024-04-24

详情收藏

Convergence Diagnostics for Entity Resolution

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2024-04-24
Serge Aleshin-Guendel, Rebecca C. Steorts

Entity resolution is the process of merging and removing duplicate records from multiple data sources, often in the absence of unique identifiers. Bayesian models for entity resolution allow one to include a priori information, quantify uncertainty in important applications, and directly estimate a partition of the records. Markov chain Monte Carlo (MCMC) sampling is the primary computational method

更新日期：2024-04-24

详情收藏

Manifold Learning: What, How, and Why

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29
Marina Meilă, Hanyu Zhang

Manifold learning (ML), also known as nonlinear dimension reduction, is a set of methods to find the low-dimensional structure of data. Dimension reduction for large, high-dimensional data is not merely a way to reduce the data; the new representations and descriptors obtained by ML reveal the geometric shape of high-dimensional point clouds and allow one to visualize, denoise, and interpret them.

更新日期：2023-11-29

详情收藏

Maps: A Statistical View

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29
Lance A. Waller

Maps provide a data framework for the statistical analysis of georeferenced data observations. Since the middle of the twentieth century, the field of spatial statistics has evolved to address key inferential questions relating to spatially defined data, yet many central statistical properties do not translate to spatially indexed and spatially correlated data, and the development of statistical inference

更新日期：2023-11-29

详情收藏

Communication of Statistics and Evidence in Times of Crisis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29
Claudia R. Schneider, John R. Kerr, Sarah Dryhurst, John A.D. Aston

This review provides an overview of concepts relating to the communication of statistical and empirical evidence in times of crisis, with a special focus on COVID-19. In it, we consider topics relating to both the communication of numbers, such as the role of format, context, comparisons, and visualization, and the communication of evidence more broadly, such as evidence quality, the influence of changes

更新日期：2023-11-29

详情收藏

Recent Advances in Text Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-29
Zheng Tracy Ke, Pengsheng Ji, Jiashun Jin, Wanshan Li

Text analysis is an interesting research area in data science and has various applications, such as in artificial intelligence, biomedical research, and engineering. We review popular methods for text analysis, ranging from topic modeling to the recent neural language models. In particular, we review Topic-SCORE, a statistical approach to topic modeling, and discuss how to use it to analyze the Multi-Attribute

更新日期：2023-11-29

详情收藏

Statistical Brain Network Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-28
Sean L. Simpson, Heather M. Shappell, Mohsen Bahrami

The recent fusion of network science and neuroscience has catalyzed a paradigm shift in how we study the brain and led to the field of brain network analysis. Brain network analyses hold great potential in helping us understand normal and abnormal brain function by providing profound clinical insight into links between system-level properties and health and behavioral outcomes. Nonetheless, methods

更新日期：2023-11-28

详情收藏

Relational Event Modeling

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-28
Federica Bianchi, Edoardo Filippi-Mazzola, Alessandro Lomi, Ernst C. Wit

Advances in information technology have increased the availability of time-stamped relational data, such as those produced by email exchanges or interaction through social media. Whereas the associated information flows could be aggregated into cross-sectional panels, the temporal ordering of the events frequently contains information that requires new models for the analysis of continuous-time interactions

更新日期：2023-11-28

详情收藏

Competing Risks: Concepts, Methods, and Software

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-22
Ronald B. Geskus

The role of competing risks in the analysis of time-to-event data is increasingly acknowledged. Software is readily available. However, confusion remains regarding the proper analysis: When and how do I need to take the presence of competing risks into account? Which quantities are relevant for my research question? How can they be estimated and what assumptions do I need to make? The main quantities

更新日期：2023-11-22

详情收藏

Distributed Computing and Inference for Big Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-17
Ling Zhou, Ziyang Gong, Pengcheng Xiang

Data are distributed across different sites due to computing facility limitations or data privacy considerations. Conventional centralized methods—those in which all datasets are stored and processed in a central computing facility—are not applicable in practice. Therefore, it has become necessary to develop distributed learning approaches that have good inference or predictive accuracy while remaining

更新日期：2023-11-17

详情收藏

Causal Inference in the Social Sciences

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-17
Guido W. Imbens

Knowledge of causal effects is of great importance to decision makers in a wide variety of settings. In many cases, however, these causal effects are not known to the decision makers and need to be estimated from data. This fundamental problem has been known and studied for many years in many disciplines. In the past thirty years, however, the amount of empirical as well as methodological research

更新日期：2023-11-17

详情收藏

Interpretable Machine Learning for Discovery: Statistical Challenges and Opportunities

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-17
Genevera I. Allen, Luqin Gan, Lili Zheng

New technologies have led to vast troves of large and complex data sets across many scientific domains and industries. People routinely use machine learning techniques not only to process, visualize, and make predictions from these big data, but also to make data-driven discoveries. These discoveries are often made using interpretable machine learning, or machine learning models and techniques that

更新日期：2023-11-17

详情收藏

Geometric Methods for Cosmological Data on the Sphere

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-11-06
Javier Carrón Duque, Domenico Marinucci

This review is devoted to recent developments in the statistical analysis of spherical data, strongly motivated by applications in cosmology. We start from a brief discussion of cosmological questions and motivations, arguing that most cosmological observables are spherical random fields. Then, we introduce some mathematical background on spherical random fields, including spectral representations

更新日期：2023-11-06

详情收藏

Stochastic Models of Rainfall

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-31
Paul J. Northrop

Rainfall is the main input to most hydrological systems. To assess flood risk for a catchment area, hydrologists use models that require long series of subdaily, perhaps even subhourly, rainfall data, ideally from locations that cover the area. If historical data are not sufficient for this purpose, an alternative is to simulate synthetic data from a suitably calibrated model. We review stochastic

更新日期：2023-10-31

详情收藏

Shape-Constrained Statistical Inference

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13
Lutz Dümbgen

Statistical models defined by shape constraints are a valuable alternative to parametric models or nonparametric models defined in terms of quantitative smoothness constraints. While the latter two classes of models are typically difficult to justify a priori, many applications involve natural shape constraints, for instance, monotonicity of a density or regression function. We review some of the history

更新日期：2023-10-13

详情收藏

Analysis of Microbiome Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13
Christine B. Peterson, Satabdi Saha, Kim-Anh Do

The microbiome represents a hidden world of tiny organisms populating not only our surroundings but also our own bodies. By enabling comprehensive profiling of these invisible creatures, modern genomic sequencing tools have given us an unprecedented ability to characterize these populations and uncover their outsize impact on our environment and health. Statistical analysis of microbiome data is critical

更新日期：2023-10-13

详情收藏

Distributional Regression for Data Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13
Nadja Klein

The flexible modeling of an entire distribution as a function of covariates, known as distributional regression, has seen growing interest over the past decades in both the statistics and machine learning literature. This review outlines selected state-of-the-art statistical approaches to distributional regression, complemented with alternatives from machine learning. Topics covered include the similarities

更新日期：2023-10-13

详情收藏

Role of Statistics in Detecting Misinformation: A Review of the State of the Art, Open Issues, and Future Research Directions

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13
Zois Boukouvalas, Allison Shafer

With the evolution of social media, cyberspace has become the default medium for social media users to communicate, especially during high-impact events such as pandemics, natural disasters, terrorist attacks, and periods of political unrest. However, during such events, misinformation can spread rapidly on social media, affecting decision-making and creating social unrest. Identifying and curtailing

更新日期：2023-10-13

详情收藏

An Update on Measurement Error Modeling

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-10-13
Mushan Li, Yanyuan Ma

The issues caused by measurement errors have been recognized for almost 90 years, and research in this area has flourished since the 1980s. We review some of the classical methods in both density estimation and regression problems with measurement errors. In both problems, we consider when the original error-free model is parametric, nonparametric, and semiparametric, in combination with different

更新日期：2023-10-13

详情收藏

Making Sense of Censored Covariates: Statistical Methods for Studies of Huntington's Disease

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-09-08
Sarah C. Lotspeich, Marissa C. Ashner, Jesus E. Vazquez, Brian D. Richardson, Kyle F. Grosser, Benjamin E. Bodek, Tanya P. Garcia

The landscape of survival analysis is constantly being revolutionized to answer biomedical challenges, most recently the statistical challenge of censored covariates rather than outcomes. There are many promising strategies to tackle censored covariates, including weighting, imputation, maximum likelihood, and Bayesian methods. Still, this is a relatively fresh area of research, different from the

更新日期：2023-09-08

详情收藏

Variable Importance Without Impossible Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-08-25
Masayoshi Mase, Art B. Owen, Benjamin B. Seiler

The most popular methods for measuring importance of the variables in a black-box prediction algorithm make use of synthetic inputs that combine predictor variables from multiple observations. These inputs can be unlikely, physically impossible, or even logically impossible. As a result, the predictions for such cases can be based on data very unlike any the black box was trained on. We think that

更新日期：2023-08-25

详情收藏

Bayesian Inference for Misspecified Generative Models

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-08-24
David J. Nott, Christopher Drovandi, David T. Frazier

Bayesian inference is a powerful tool for combining information in complex settings, a task of increasing importance in modern applications. However, Bayesian inference with a flawed model can produce unreliable conclusions. This review discusses approaches to performing Bayesian inference when the model is misspecified, where, by misspecified, we mean that the analyst is unwilling to act as if the

更新日期：2023-08-24

详情收藏

Inverse Problems for Physics-Based Process Models

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-08-16
Derek Bingham, Troy Butler, Don Estep

We describe and compare two formulations of inverse problems for a physics-based process model in the context of uncertainty and random variability: the Bayesian inverse problem and the stochastic inverse problem. We describe the foundations of the two problems in order to create a context for interpreting the applicability and solutions of inverse problems important for scientific and engineering

更新日期：2023-08-16

详情收藏

Graph-Based Change-Point Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09
Hao Chen, Lynna Chu

Recent technological advances allow for the collection of massive data in the study of complex phenomena over time and/or space in various fields. Many of these data involve sequences of high-dimensional or non-Euclidean measurements, where change-point analysis is a crucial early step in understanding the data. Segmentation, or offline change-point analysis, divides data into homogeneous temporal

更新日期：2023-03-09

详情收藏

Surrogate Endpoints in Clinical Trials

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09
Michael R. Elliott

Surrogate markers are often used in clinical trials settings when obtaining a final outcome to evaluate the effectiveness of a treatment requires a long wait, is expensive to obtain, or both. Formal definitions of surrogate marker quality resulting from a large variety of estimation approaches have been proposed over the years. I review this work, with a particular focus on approaches that use the

更新日期：2023-03-09

详情收藏

High-Dimensional Data Bootstrap

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09
Victor Chernozhukov, Denis Chetverikov, Kengo Kato, Yuta Koike

This article reviews recent progress in high-dimensional bootstrap. We first review high-dimensional central limit theorems for distributions of sample mean vectors over the rectangles, bootstrap consistency results in high dimensions, and key techniques used to establish those results. We then review selected applications of high-dimensional bootstrap: construction of simultaneous confidence sets

更新日期：2023-03-09

详情收藏

Second-Generation Functional Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09
Salil Koner, Ana-Maria Staicu

Modern studies from a variety of fields record multiple functional observations according to either multivariate, longitudinal, spatial, or time series designs. We refer to such data as second-generation functional data because their analysis—unlike typical functional data analysis, which assumes independence of the functions—accounts for the complex dependence between the functional observations and

更新日期：2023-03-09

详情收藏

A Brief Tour of Deep Learning from a Statistical Perspective

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09
Eric Nalisnick, Padhraic Smyth, Dustin Tran

We expose the statistical foundations of deep learning with the goal of facilitating conversation between the deep learning and statistics communities. We highlight core themes at the intersection; summarize key neural models, such as feedforward neural networks, sequential neural networks, and neural latent variable models; and link these ideas to their roots in probability and statistics. We also

更新日期：2023-03-09

详情收藏

Statistical Deep Learning for Spatial and Spatiotemporal Data

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2023-03-09
Christopher K. Wikle, Andrew Zammit-Mangion

Deep neural network models have become ubiquitous in recent years and have been applied to nearly all areas of science, engineering, and industry. These models are particularly useful for data that have strong dependencies in space (e.g., images) and time (e.g., sequences). Indeed, deep models have also been extensively used by the statistical community to model spatial and spatiotemporal data through

更新日期：2023-03-09

详情收藏

Confidentiality Protection in the 2020 US Census of Population and Housing

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-29
John M. Abowd, Michael B. Hawes

In an era where external data and computational capabilities far exceed statistical agencies’ own resources and capabilities, they face the renewed challenge of protecting the confidentiality of underlying microdata when publishing statistics in very granular form and ensuring that these granular data are used for statistical purposes only. Conventional statistical disclosure limitation methods are

更新日期：2022-11-29

详情收藏

Statistical Methods for Exoplanet Detection with Radial Velocities

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-22
Nathan C. Hara, Eric B. Ford

Exoplanets can be detected with various observational techniques. Among them, radial velocity (RV) has the key advantages of revealing the architecture of planetary systems and measuring planetary mass and orbital eccentricities. RV observations are poised to play a key role in the detection and characterization of Earth twins. However, the detection of such small planets is not yet possible due to

更新日期：2022-11-22

详情收藏

Statistical Machine Learning for Quantitative Finance

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-22
M. Ludkovski

We survey the active interface of statistical learning methods and quantitative finance models. Our focus is on the use of statistical surrogates, also known as functional approximators, for learning input–output relationships relevant for financial tasks. Given the disparate terminology used among statisticians and financial mathematicians, we begin by reviewing the main ingredients of surrogate construction

更新日期：2022-11-22

详情收藏

Approximate Methods for Bayesian Computation

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-22
Radu V. Craiu, Evgeny Levi

Rich data generating mechanisms are ubiquitous in this age of information and require complex statistical models to draw meaningful inference. While Bayesian analysis has seen enormous development in the last 30 years, benefitting from the impetus given by the successful application of Markov chain Monte Carlo (MCMC) sampling, the combination of big data and complex models conspire to produce significant

更新日期：2022-11-22

详情收藏

Fifty Years of the Cox Model

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-21
John D. Kalbfleisch, Douglas E. Schaubel

The Cox model is now 50 years old. The seminal paper of Sir David Cox has had an immeasurable impact on the analysis of censored survival data, with applications in many different disciplines. This work has also stimulated much additional research in diverse areas and led to important theoretical and practical advances. These include semiparametric models, nonparametric efficiency, and partial likelihood

更新日期：2022-11-21

详情收藏

Statistical Data Privacy: A Song of Privacy and Utility

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19
Aleksandra Slavković, Jeremy Seeman

To quantify trade-offs between increasing demand for open data sharing and concerns about sensitive information disclosure, statistical data privacy (SDP) methodology analyzes data release mechanisms that sanitize outputs based on confidential data. Two dominant frameworks exist: statistical disclosure control (SDC) and the more recent differential privacy (DP). Despite framing differences, both SDC

更新日期：2022-11-19

详情收藏

Innovation Diffusion Processes: Concepts, Models, and Predictions

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19
Mariangela Guidolin, Piero Manfredi

Innovation diffusion processes have attracted considerable research attention for their interdisciplinary character, which combines theories and concepts from disciplines such as mathematics, physics, statistics, social sciences, marketing, economics, and technological forecasting. The formal representation of innovation diffusion processes historically used epidemic models borrowed from biology, departing

更新日期：2022-11-19

详情收藏

Simulation-Based Bayesian Analysis

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19
Martyn Plummer

I consider the development of Markov chain Monte Carlo (MCMC) methods, from late-1980s Gibbs sampling to present-day gradient-based methods and piecewise-deterministic Markov processes. In parallel, I show how these ideas have been implemented in successive generations of statistical software for Bayesian inference. These software packages have been instrumental in popularizing applied Bayesian modeling

更新日期：2022-11-19

详情收藏

The Role of Statistics in Promoting Data Reusability and Research Transparency

Annu. Rev. Stat. Appl. (IF 7.4) Pub Date : 2022-11-19
Sarah M. Nusser

The value of research data has grown as the emphasis on research transparency and data-intensive research has increased. Data sharing is now required by funders and publishers and is becoming a disciplinary expectation in many fields. However, practices promoting data reusability and research transparency are poorly understood, making it difficult for statisticians and other researchers to reframe

更新日期：2022-11-19

详情收藏