Logo del repository
  1. Home
 
Opzioni

Learning Text Patterns using Separate-and-Conquer Genetic Programming

BARTOLI, Alberto
•
DE LORENZO, ANDREA
•
MEDVET, Eric
•
TARLAO, FABIANO
2015
  • conference object

Abstract
The problem of extracting knowledge from large volumes of unstructured textual information has become increasingly important. We consider the problem of extracting text slices that adhere to a syntactic pattern and propose an approach capable of generating the desired pattern automatically, from a few annotated examples. Our approach is based on Genetic Programming and generates extraction patterns in the form of regular expressions that may be input to existing engines without any post-processing. Key feature of our proposal is its ability of discovering automatically whether the extraction task may be solved by a single pattern, or rather a set of multiple patterns is required. We obtain this property by means of a separate-and-conquer strategy: once a candidate pattern provides adequate performance on a subset of the examples, the pattern is inserted into the set of final solutions and the evolutionary search continues on a smaller set of examples including only those not yet solved adequately. Our proposal outperforms an earlier state-of-the-art approach on three challenging datasets.
DOI
10.1007/978-3-319-16501-1_2
WOS
WOS:000361758600002
Archivio
http://hdl.handle.net/11368/2832545
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-84925059040
http://link.springer.com/chapter/10.1007/978-3-319-16501-1_2
http://link.springer.com/book/10.1007/978-3-319-16501-1
Diritti
open access
license:digital rights management non definito
FVG url
https://arts.units.it/bitstream/11368/2832545/2/2015_EuroGP_LearningMultiPatterns.pdf
Soggetti
  • machine learning

  • evolutinary computing...

  • natural language proc...

  • genetic programming

Web of Science© citazioni
17
Data di acquisizione
Mar 26, 2024
Visualizzazioni
3
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback