A4 Article in conference proceedings
Explaining Causes Behind SQL Query Formulation Errors (2020)


Taipalus, T. (2020). Explaining Causes Behind SQL Query Formulation Errors. In FIE 2020 : Proceedings of the 50th IEEE Frontiers in Education Conference. IEEE. Conference proceedings : Frontiers in Education Conference. https://doi.org/10.1109/FIE44824.2020.9274114


JYU authors or editors


Publication details

All authors or editorsTaipalus, Toni

Parent publicationFIE 2020 : Proceedings of the 50th IEEE Frontiers in Education Conference

Conference:

  • Frontiers in Education Conference

Place and date of conferenceUppsala, Sweden21.-24.10.2020

ISBN978-1-7281-8962-8

eISBN9781728189611

Journal or seriesConference proceedings : Frontiers in Education Conference

ISSN1539-4565

eISSN2377-634X

Publication year2020

PublisherIEEE

Publication countryUnited States

Publication languageEnglish

DOIhttps://doi.org/10.1109/FIE44824.2020.9274114

Publication open accessOther way freely accessible online

Publication channel open access

Publication is parallel published (JYX)https://jyx.jyu.fi/handle/123456789/73017

Web address where publication is availablehttps://www.fie2020.org/abstracts-and-papers/


Abstract

This Full Research Paper presents the most prominent query formulation errors in Structured Query Language (SQL), and maps these errors to their cognitive explanations. Understanding query formulation errors is a key to teaching SQL. more effectively. However, studies on what kind of errors novices struggle with are relatively scarce when compared to, for example, programming languages. Although committing errors is a crucial part in learning, some errors are relatively easy to fix, and their commonness is not necessarily an indication of their difficulty. Other errors, however, halt the learning process, and are never fixed by the query writer. Using a previously established error taxonomy and queries from four cohorts with a total of 987 students, we set out to identify common errors which students are unable to correct, i.e., errors that are likely to cause query formulation failures. Our results indicate that on a general level, logical errors are the most common cause for query formulation failures, while syntax and semantic errors are usually fixed by query writers. Although query concepts, for example, expressions, joins and grouping, have a strong influence on what types of errors are committed, some errors are common regardless of query concepts. Specifically, our results indicate that missing expressions, extraneous or omitted grouping columns, incorrect comparison operators, missing joins, and missing ordering columns are the most common errors that novices are unable to fix. Based on the results, we speculate on the reasons behind the most common persistent errors using previously identified cognitive explanations. Finally, we present that solutions for mitigating the causes behind query formulation errors are already available. In order to more effectively teach query formulation, educators should emphasize natural language patterns, query planning, and increasingly ambiguous exercises.


Keywordserrorsquery languagesprogramming languagesSQLeducation and traininglearning

Free keywordsStructured Query Language (SQL); database; error; education; novice


Contributing organizations


Ministry reportingYes

VIRTA submission year2020

JUFO rating1


Last updated on 2024-22-04 at 13:27