论文标题
线性回归及其对嘈杂网络连锁数据的推断
Linear regression and its inference on noisy network-linked data
论文作者
论文摘要
网络连接观测的线性回归是对响应与协变量与其他网络结构之间关系建模的重要工具。先前的方法要么缺乏推理工具,要么依赖于对社会效应的限制性假设,并且通常假定没有错误观察到网络。本文提出了具有非参数网络效应的回归模型。该模型不假定关系数据或网络结构已完全观察到,并且可以证明对网络扰动非常健壮。渐近推理框架是在网络观察错误的一般要求下建立的,当错误来自随机网络模型时,在特定环境中研究了该方法的鲁棒性。当没有可用的网络模型知识时,我们发现有关网络密度的推理有效性的相位转换现象,同时还显示出通过了解网络模型的显着改善。进行了仿真研究以验证这些理论结果,并证明在不同数据生成模型下,在准确性和计算效率方面,提出的方法比现有工作的优点。然后将该方法应用于中学生的网络数据,以研究教育研讨会在减少学校冲突中的有效性。
Linear regression on network-linked observations has been an essential tool in modeling the relationship between response and covariates with additional network structures. Previous methods either lack inference tools or rely on restrictive assumptions on social effects and usually assume that networks are observed without errors. This paper proposes a regression model with nonparametric network effects. The model does not assume that the relational data or network structure is exactly observed and can be provably robust to network perturbations. Asymptotic inference framework is established under a general requirement of the network observational errors, and the robustness of this method is studied in the specific setting when the errors come from random network models. We discover a phase-transition phenomenon of the inference validity concerning the network density when no prior knowledge of the network model is available while also showing a significant improvement achieved by knowing the network model. Simulation studies are conducted to verify these theoretical results and demonstrate the advantage of the proposed method over existing work in terms of accuracy and computational efficiency under different data-generating models. The method is then applied to middle school students' network data to study the effectiveness of educational workshops in reducing school conflicts.