Abstract
Android is a platform that hosts roughly 99% of known mobile malware to date and is thus the focus of much research efforts in mobile malware detection. One of the main tools used in this effort is supervised machine learning. While a decade of work has made a lot of progress in detection accuracy, there is an obstacle that each stream of research is forced to overcome, feature selection, i.e., determining which attributes of Android are most effective as inputs into machine learning models. This research tackles the feature selection problem by providing the community with an exhaustive analysis of the three primary types of Android features used by researchers: Permissions, Intents and API Calls. We applied a wide spectrum of feature selection techniques including eleven different algorithms which consisted of filter methods, wrapper methods and embedded methods. Results were evaluated with three different supervised learning classifiers, Random Forest, Support Vector Machine and Neural Network, on a dataset with over 119K Android apps and over 400 features. The results showed that using a combination of Permissions, Intents and API Calls produced higher accuracy than using any of those alone or in any other combination. The results also showed that feature selection should be performed on the combined dataset, not by feature type and then combined and that the negative effects of not doing so are more pronounced the larger the feature set.
| Original language | English |
|---|---|
| Title of host publication | 2022 IEEE/ACIS 20th International Conference on Software Engineering Research, Management and Applications, SERA 2022 |
| Editors | Juyeon Jo, Yeong-Tae Song, Lin Deng, Junghwan Rhee |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 149-154 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781665483506 |
| ISBN (Print) | 9781665483506 |
| DOIs | |
| State | Published - Jun 30 2022 |
| Event | 20th IEEE/ACIS International Conference on Software Engineering Research, Management and Applications, SERA 2022 - Las Vegas, United States Duration: May 25 2022 → May 27 2022 |
Publication series
| Name | 2022 IEEE/ACIS 20th International Conference on Software Engineering Research, Management and Applications, SERA 2022 |
|---|
Conference
| Conference | 20th IEEE/ACIS International Conference on Software Engineering Research, Management and Applications, SERA 2022 |
|---|---|
| Country/Territory | United States |
| City | Las Vegas |
| Period | 5/25/22 → 5/27/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
ASJC Scopus Subject Areas
- Management of Technology and Innovation
- Computer Networks and Communications
- Computer Science Applications
- Software
- Safety, Risk, Reliability and Quality
Keywords
- Android
- feature selection
- machine learning
- malware detection
- mobile malware
- static analysis
Fingerprint
Dive into the research topics of 'Android Feature Selection based on Permissions, Intents, and API Calls'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS