Abstract
An important component of knowledge management is the organization of documents for quick and easy access. One effective way of organizing these documents is to group them by a fixed set of specific knowledge categories. For large-scale technical teams, the number of categories can reach thousands or even tens of thousands, which makes this type of cataloging especially useful. Text classification is a sophisticated process that involves data preprocessing, transformation, dimensionality reduction, application of classification techniques, classifier evaluation, and classifier validation. This paper describes the preliminary results from phase one of a design-science research study for the development of a model that can be used for classification of financial software development documentation in knowledge discovery systems using machine learning. Specifically, testing with a small dataset of 64 documents from the Natural Language Technical Project Documentation dataset was conducted to assess the effectiveness of traditional text classification methods. Results indicate limitations to these traditional methods for classifying technical documents. The next steps include evaluating the performance on a bigger dataset of technical documents and testing new deep learning techniques.
| Original language | English |
|---|---|
| Pages (from-to) | 67-72 |
| Number of pages | 6 |
| Journal | Issues in Information Systems |
| Volume | 21 |
| Issue number | 4 |
| DOIs | |
| State | Published - 2020 |
Bibliographical note
Publisher Copyright:© 2020 Issues in Information Systems. All rights reserved.
ASJC Scopus Subject Areas
- General Business,Management and Accounting
Keywords
- Knowledge Discovery
- Knowledge Management
- Machine Learning
- Text Classification