Cost-effective ensemble models selection using deep reinforcement learning

Yoni Birman, Shaked Hindi, Gilad Katz, Asaf Shabtai

Information Fusion 77, 133-148, 2022

Ensemble learning–the application of multiple learning models on the same task–is a common technique in multiple domains. While employing multiple models enables reaching higher classification accuracy, this process can be time consuming, costly, and make scaling more difficult. Given that each model may have different capabilities and costs, assigning the most cost-effective set of learners for each sample is challenging. We propose SPIREL, a novel method for cost-effective classification. Our method enables users to directly associate costs to correct/incorrect label assignment, computing resources and run-time, and then dynamically establishes a classification policy. For each analyzed sample, SPIREL dynamically assigns a different set of learning models, as well as its own classification threshold. Extensive evaluation on two large malware datasets–a domain in which the application of multiple analysis …