论文标题

部分可观测时空混沌系统的无模型预测

Energy Trees: Regression and Classification With Structured and Mixed-Type Covariates

论文作者

Giubilei, Riccardo, Padellini, Tullia, Brutti, Pierpaolo

论文摘要

数据的增加需要可以有效处理复杂结构的方法和模型,因为简化它们将导致信息丢失。尽管已经开发了几种分析工具来使用其原始形式的复杂数据对象,但这些工具通常仅限于单型变量。在这项工作中,我们建议能量树作为能够适应各种类型的结构化协变量的回归和分类模型。能量树利用能量统计来扩展条件推理树的能力,它们从中继承了合理的统计基础,可解释性,规模不变性以及免于分配假设的自由。我们特别关注功能和图形结构的协变量,同时还突出了模型在集成其他变量类型方面的灵活性。广泛的仿真研究表明,在可变选择和鲁棒性过度拟合方面,模型的竞争性能。最后,我们通过两个涉及人类生物学数据的经验分析来评估模型的预测能力。能量树在r包装中实现。

The increasing complexity of data requires methods and models that can effectively handle intricate structures, as simplifying them would result in loss of information. While several analytical tools have been developed to work with complex data objects in their original form, these tools are typically limited to single-type variables. In this work, we propose energy trees as a regression and classification model capable of accommodating structured covariates of various types. Energy trees leverage energy statistics to extend the capabilities of conditional inference trees, from which they inherit sound statistical foundations, interpretability, scale invariance, and freedom from distributional assumptions. We specifically focus on functional and graph-structured covariates, while also highlighting the model's flexibility in integrating other variable types. Extensive simulation studies demonstrate the model's competitive performance in terms of variable selection and robustness to overfitting. Finally, we assess the model's predictive ability through two empirical analyses involving human biological data. Energy trees are implemented in the R package etree.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源