Empirical assessment of machine learning-based malware detectors for Android: Measuring the gap between in-the-lab and in-the-wild validation scenarios

Allix Kevin; Bissyande Tegawende F.; Jerome Quentin; Klein Jacques; State Radu; Le Traon Yves

Abstract

To address the issue of malware detection across large sets of applications, researchers have recently started to investigate the capabilities of machine-learning techniques for proposing effective approaches. So far, several promising results have been recorded in the literature, with many approaches assessed in what we call in-the-lab validation scenarios. This paper revisits the purpose of malware detection to discuss whether such in-the-lab validation scenarios provide reliable indications of the performance of malware detectors in real-world settings, i.e., in the wild. To this end, we have devised several machine learning classifiers that rely on a set of features built from applications' control-flow graphs (CFGs). We use a sizeable dataset of over 50,000 Android applications collected from sources where state-of-the-art approaches have selected their data. We show that, in the lab, our approach outperforms existing machine learning-based approaches. However, this high performance does not translate into high performance in the wild. The performance gap we observed (F-measures dropping from over 0.9 in the lab to below 0.1 in the wild) raises one important question: How do state-of-the-art approaches perform in the wild?
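
The abstract reports its results as F-measures. As a reminder of what that metric captures, the following is a minimal sketch of the standard F-measure (F1) computed from true positives, false positives, and false negatives. It assumes that malware is treated as the positive class; the function names and the label vectors are hypothetical illustrations, not code or data from the paper.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true positives, false positives, and false negatives."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return tp, fp, fn


def f_measure(tp, fp, fn):
    """F-measure (F1): harmonic mean of precision and recall."""
    if tp == 0:
        # No correct detections: precision or recall is zero (or undefined),
        # so the score is reported as 0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels (1 = malware, 0 = benign); NOT data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 1, 0]

tp, fp, fn = confusion_counts(y_true, y_pred)
score = f_measure(tp, fp, fn)
print(f"TP={tp} FP={fp} FN={fn} F-measure={score:.3f}")
```

Because the F-measure combines precision and recall, a detector that misses most malware or raises many false alarms on a test set scores low even if its accuracy on benign applications is high.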


Bibliographic record

Source: Empirical Software Engineering, 2016, Issue 1, pp. 183-211 (29 pages)
Author affiliation (all six authors): Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust, 4 Rue Alphonse Weicker, L-2721 Luxembourg, Luxembourg
Original format: PDF
Language: English
Keywords: Machine learning; Ten-Fold; Malware; Android


Similar literature

1. A new machine learning-based method for android malware detection on imbalanced dataset [J]. Dehkordy Diyana Tehrany, Rasoolzadegan Abbas. Multimedia Tools and Applications, 2021, Issue 16.
2. Empirical Evaluation of a System Call-Based Android Malware Detector [J]. Vinod P., Viswalakshmi P. Arabian Journal for Science and Engineering, 2018, Issue 12.
3. An HMM and structural entropy based detector for Android malware: An empirical study [J]. Gerardo Canfora, Francesco Mercaldo, Corrado Aaron Visaggio. Computers & Security, 2016.
4. On the Deterioration of Learning-Based Malware Detectors for Android [C]. Xiaoqin Fu, Haipeng Cai. 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings, 2019.
5. Risk Assessment of Android Malwares Using Machine Learning Techniques [D]. Padrithi, Deepthi Naidu. 2017.
6. Validation of Embedded Experience Sampling (EES) for Measuring Non-cognitive Facets of Problem-Solving Competence in Scenario-Based Assessments [O]. Andreas Rausch, Kristina Kögler, Jürgen Seifried. 2005.
7. Empirical assessment of machine learning-based malware detectors for Android: Measuring the Gap between In-the-Lab and In-the-Wild Validation Scenarios [O]. Allix, Kevin; Bissyande, Tegawendé François D Assise; Jerome, Quentin. 2014.
8. Sweetening Android Lemon Markets: Measuring and Curbing Malware in Application Marketplaces [R]. N. Christin, T. Vidas. 2012.
