Logo del repository
  1. Home
 
Opzioni

Elastic-Degenerate String Matching with 1 Error

Bernardini, Giulia
•
Gabory, Esteban
•
Pissis, Solon P.
altro
Zuba, Wiktor
2022
  • conference object

Abstract
An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N, introduced to represent a set of related DNA sequences, also known as a pangenome. The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an O~(nmω−1)+O(N)-time algorithm [Bernardini et al., SIAM J. Comput. 2022], where ω denotes the matrix multiplication exponent and the O~(⋅) notation suppresses polylog factors. In the k-EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most k errors. k-EDSM can be solved in O(k2mG+kN) time under edit distance, where G denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately, G is only bounded by N, and so even for k=1, the existing algorithm runs in Ω(mN) time in the worst case. Here we make progress in this direction. We show that 1-EDSM can be solved in O((nm2+N)logm) or O(nm3+N) time under edit distance. For the decision version of the problem, we present a faster O(nm2logm−−−−−√+Nloglogm)-time algorithm. Our algorithms rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or range emptiness), which we show how to solve efficiently.
DOI
10.1007/978-3-031-20624-5_2
WOS
WOS:001416662300002
Archivio
https://hdl.handle.net/11368/3032860
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85144528755
Diritti
open access
license:copyright editore
license:digital rights management non definito
license:digital rights management non definito
license uri:iris.pri02
license uri:iris.pri00
license uri:iris.pri00
Soggetti
  • String algorithm

  • Approximate string ma...

  • Edit distance

  • Degenerate string

  • Elastic-degenerate st...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback