Research on the Impact of Data Volume on the Accuracy of Anomaly Detection Methods in Network Traffic

Anastasia Ma; Elena Avksentieva; Nikolai Zhukov

doi:10.20397/2177-6652/2025.v25i2.3161

Research on the Impact of Data Volume on the Accuracy of Anomaly Detection Methods in Network Traffic

Authors

Anastasia Ma ITMO University, Saint Petersburg, Russia https://orcid.org/0009-0009-4942-4211
Elena Avksentieva ITMO University, Saint Petersburg, Russia https://orcid.org/0000-0001-5000-4868
Nikolai Zhukov ITMO University, Saint Petersburg, Russia; The Herzen State Pedagogical University of Russia, Saint-Petersburg, Russia https://orcid.org/0000-0002-5641-1613

DOI:

https://doi.org/10.20397/2177-6652/2025.v25i2.3161

Abstract

This article discusses the use of machine learning algorithms to detect anomalies based on the CICIDS2017 dataset, which was specifically designed to simulate real- world network attack scenarios. Special attention is paid to three popular algorithms: logistic regression, random forest and neural networks. These algorithms were chosen due to their ability to efficiently process large amounts of data and identify complex patterns. Within the framework of this article, a series of experiments has been conducted in which the amount of training data will vary and the performance of models will be evaluated, both on pure and noisy data. For noisy data, neural networks retain their lead with a slight accuracy drop, while random forest performs well but is less effective than on clean data. Logistic regression, though most sensitive to noise, improves with larger datasets, emphasizing the need for thorough preprocessing.The results of this study will help to better understand how different algorithms respond to changes in the amount of data and the quality of input information, which is an important aspect for developing effective cyber security systems

Author Biographies

Anastasia Ma, ITMO University, Saint Petersburg, Russia

Faculty of Software Engineering and Computer Systems

Elena Avksentieva, ITMO University, Saint Petersburg, Russia

Faculty of Software Engineering and Computer Systems

Nikolai Zhukov, ITMO University, Saint Petersburg, Russia; The Herzen State Pedagogical University of Russia, Saint-Petersburg, Russia

1Faculty of Software Engineering and Computer Systems
2 Institute of Computer Science and Technology Education

Downloads

Published

2025-04-07

How to Cite

Ma, A., Avksentieva, E., & Zhukov, N. (2025). Research on the Impact of Data Volume on the Accuracy of Anomaly Detection Methods in Network Traffic. Revista Gestão & Tecnologia, 25(2), 108–125. https://doi.org/10.20397/2177-6652/2025.v25i2.3161

Download Citation

Issue

Vol. 25 No. 2 (2025): Special Edition - Invited Articles

Section

ARTIGO

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Authors who publish with this journal agree to the following terms: • 1. The author(s) authorize the publication of the article in the journal. • 2. The author(s) ensure that the contribution is original and unpublished and is not being evaluated in other journal(s). • 3. The journal is not responsible for the opinions, ideas and concepts expressed in the texts because they are the sole responsibility of the author(s). • 4. The publishers reserve the right to make adjustments and textual adaptation to the norms of APA. • 5. Authors retain copyright and grant the journal right of first publication, with the work [SPECIFY PERIOD OF TIME] after publication simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal. • 6. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal. • 7. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access) at http://opcit.eprints.org/oacitation-biblio.html

Research on the Impact of Data Volume on the Accuracy of Anomaly Detection Methods in Network Traffic

Authors

DOI:

Abstract

Author Biographies

Anastasia Ma, ITMO University, Saint Petersburg, Russia

Elena Avksentieva, ITMO University, Saint Petersburg, Russia

Nikolai Zhukov, ITMO University, Saint Petersburg, Russia; The Herzen State Pedagogical University of Russia, Saint-Petersburg, Russia

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

indexing

Keywords

visitors