Logo del repository
  1. Home
 
Opzioni

Redefining text-to-SQL metrics by incorporating semantic and structural similarity

Pinna, Giovanni
•
Perezhohin, Yuriy
•
Manzoni, Luca
altro
De Lorenzo, Andrea
2025
  • journal article

Periodico
SCIENTIFIC REPORTS
Abstract
The rapid advancements in text-to-SQL systems have driven the scientific community to create increasingly complex benchmarks for this task. However, evaluation metrics often rely on simplistic or binary approaches that fail to capture the similarities and differences between equivalent SQL queries. Current metrics overlook critical aspects such as partial correctness, structural differences, and semantic equivalence. To address these limitations, we propose a novel metric for SQL query comparison, designed to offer a more precise assessment of the similarity between SQL queries at both the semantic (string) and execution result (resultant table) levels. This new metric allows for a granular evaluation of SQL query similarity, supporting a more accurate assessment and ranking of text-to-SQL tools and models. The proposed approach could have a meaningful impact on text-to-SQL research and development. It might improve evaluation by distinguishing between models that handle simple queries and those capable of tackling more complex ones. The metric could also help to identify where the differences between two queries lie. Additionally, it may support the development of more accurate language models by offering precise training signals to help the model recognize query similarities. The experimental results highlight the metric’s effectiveness over existing evaluation methodologies, allowing us to identify the current best text-to-SQL models through distribution analysis. In some cases, the metric allows the detection of missing aggregation operators or variations in query ordering operators.
DOI
10.1038/s41598-025-04890-9
WOS
WOS:001523033000029
Archivio
https://hdl.handle.net/11368/3112298
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-105009902701
https://doi.org/10.1038/s41598-025-04890-9
Diritti
open access
license:creative commons
license:creative commons
license uri:http://creativecommons.org/licenses/by/4.0/
license uri:http://creativecommons.org/licenses/by/4.0/
FVG url
https://arts.units.it/bitstream/11368/3112298/1/s41598-025-04890-9.pdf
Soggetti
  • SQL metric

  • Evaluation metric

  • Text-to-SQL

  • Benchmark SQL

  • SQL similarity

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback