Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder
Liu PJ(刘朋杰)1,2,3,4; Pan FC(潘福成)1,2,3; Zhou XF(周晓锋)1,2,3; Li S(李帅)1,2,3,4; Zeng PY(曾鹏宇)1,2,3,4; Liu SR(刘舒锐)1,2,3; Jin L(金樑)1,2,3
刊名Neural Computing and Applications
2022
页码1-22
关键词Automated machine learning Parallel automated system Dual-stacked autoencoder Selective ensemble
ISSN号0941-0643
产权排序1
英文摘要

Finding a high-performance machine learning pipeline (ML pipeline) for a supervised learning task takes much time. It requires many choices, including preprocessing datasets, selecting algorithms, tuning hyperparameters, and ensembling candidate models. With increasing pipelines arises a combination explosion problem. This work presents a new automated machine learning (AutoML) system called Dsa-PAML to address this challenge by recommending, training, and ensembling suitable models for supervised learning tasks. Dsa-PAML is a parallel automated system based on a dual-stacked autoencoder (Dsa). Firstly, meta-features of datasets and ML pipelines are used to alleviate cold-start recommendation problems. Secondly, a novel dual-stacked autoencoder is used to simultaneously learn the latent features of datasets and ML pipelines, efficiently learning collaborations of both datasets and ML pipelines and recommending suitable ML pipelines for a new dataset. Thirdly, Dsa-PAML can train the recommended ML pipelines on the new dataset in a parallel method, which substantially reduces the time complexity of the proposed method. Finally, a parallel selective ensemble system is embedded into Dsa-PAML. It selects base models from candidate ML pipelines according to their runtime, classification performance, and diversity on the validation set, enhancing Dsa-PAML’s stability for most datasets. Amounts of experiments on 30 UCI datasets show that our approach outperforms current state-of-the-art methods.

资助项目National Key R&D Program of China[2019B090916002]
WOS关键词PREDICTION
WOS研究方向Computer Science
语种英语
WOS记录号WOS:000773831700002
资助机构National Key R&D Program of China under Grant No. 2019B090916002
内容类型期刊论文
源URL[http://ir.sia.cn/handle/173321/30743]  
专题沈阳自动化研究所_数字工厂研究室
通讯作者Zhou XF(周晓锋)
作者单位1.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China
2.Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, China
3.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China
4.University of Chinese Academy of Sciences, Beijing 100049, China
推荐引用方式
GB/T 7714
Liu PJ,Pan FC,Zhou XF,et al. Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder[J]. Neural Computing and Applications,2022:1-22.
APA Liu PJ.,Pan FC.,Zhou XF.,Li S.,Zeng PY.,...&Jin L.(2022).Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder.Neural Computing and Applications,1-22.
MLA Liu PJ,et al."Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder".Neural Computing and Applications (2022):1-22.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace