Logo del repository
  1. Home
 
Opzioni

Regex-based Entity Extraction with Active Learning and Genetic Programming

BARTOLI, Alberto
•
DE LORENZO, ANDREA
•
MEDVET, Eric
•
TARLAO, FABIANO
2016
  • journal article

Periodico
APPLIED COMPUTING REVIEW
Abstract
We consider the long-standing problem of the automatic generation of regular expressions for text extraction, based solely on examples of the desired behavior. We investigate several active learning approaches in which the user annotates only one desired extraction and then merely answers extraction queries generated by the system. The resulting framework is attractive because it is the system, not the user, which digs out the data in search of the samples most suitable to the specific learning task. We tailor our proposals to a state-of-the-art learner based on Genetic Programming and we assess them experimentally on a number of challenging tasks of realistic complexity. The results indicate that active learning is indeed a viable framework in this application domain and may thus significantly decrease the amount of costly annotation effort required.
DOI
10.1145/2993231.2993232
WOS
WOS:000382652100001
Archivio
http://hdl.handle.net/11368/2880141
http://dl.acm.org/citation.cfm?id=2993232
Diritti
open access
license:creative commons
license:digital rights management non definito
license uri:http://creativecommons.org/licenses/by-nc-nd/3.0/it/
FVG url
https://arts.units.it/bitstream/11368/2880141/1/2016_ACR_ActiveLearningRegex (1).pdf
Soggetti
  • Information Extractio...

  • Entity Extraction

  • Programming by Exampl...

  • Machine Learning

Web of Science© citazioni
2
Data di acquisizione
Mar 12, 2024
Visualizzazioni
1
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback