Home
Esportazione
Statistica
Opzioni
Visualizza tutti i metadati (visione tecnica)
Robust URL Classification With Generative Adversarial Networks
Martino Trevisan
•
Idilio Drago
2019
journal article
Periodico
PERFORMANCE EVALUATION REVIEW
Abstract
Classifying URLs is essential for different applications, such as parental control, URL filtering and Ads/tracking protection. Such systems historically identify URLs by means of regular expressions, even if machine learning alternatives have been proposed to overcome the time-consuming maintenance of classification rules. Classical machine learning algorithms, however, require large samples of URLs to train the models, covering the diverse classes of URLs (i.e., a ground truth), which somehow limits the applicability of the approach. We here give a first step towards the use of Generative Adversarial Neural Networks (GANs) to classify URLs. GANs are attractive for this problem for two reasons. First, GANs can produce samples of URLs belonging to specific classes even if exposed to a limited training set, outputting both synthetic traces and a robust discriminator. Second, a GAN can be trained to discriminate a class of URLs without being exposed to all other URLs classes – i.e., GANs are robust even if not exposed to uninteresting URL classes during training. Experiments on real data show that not only the generated synthetic traces are somehow realistic, but also the URL classification is accurate with GANs. © is is held held by by author/owner(s). author/owner(s).
DOI
10.1145/3308897.3308959
Archivio
http://hdl.handle.net/11368/3025221
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85061525599
https://dl.acm.org/citation.cfm?id=3308959
Diritti
open access
license:copyright dell'editore
license:digital rights management non definito
license uri:publisher
license uri:iris.pri00
FVG url
https://arts.units.it/request-item?handle=11368/3025221
Soggetti
Generative Adversaria...
Machine Learning
Neural Network
URL generation
google-scholar
Vedi dettagli