Throw exception for cast of nan and infinity to int types #22917

rschlussel · 2024-06-04T18:03:38Z

Description

Fix cast of nan and infinity from DOUBLE/REAL to
BIGINT/INT/SMALLINT/TINYTINT. Previously all except double -> bigint would return zero. Now they will all throw an INVALID_CAST_ARGUMENT exception.

Fix silent overflow for casting from REAL to BIGINT type. We now throw
an INVALID_CAST_ARGUMENT if the value is out of the BIGINT range.

Change error code from NUMERIC_VALUE_OUT_OF_RANGE to
INVALID_CAST_ARGUMENT for out of range values in casts from floating
point to integer types. With the library we now use for the
implementaiton for these casts, we can't always tell by the exception
type that the value is out of range, so we return the more generic, but
still relevant INVALID_CAST_ARGUMENT instead.

Motivation and Context

Fixes #22910.
Nan and infinity can't be represented as an integer, so casting to an int type should throw an error.

This also brings Presto and velox behavior in alignment. We are choosing to modify Presto in this case because velox behavior is correct here (we would want to change Presto regardless of the native worker migration)

Impact

CAST of nan and infinity to BIGINT, INTEGER, SMALLINT, and TINYINT types will now return an exception with the INVALID_CAST_ARGUMENT error code

Cast of out of range values for DOUBLE or FLOAT to BIGINT, INTEGER, SMALLINT, or TINYINT will now return an INVALID_CAST_ARGUMENT error code rather than NUMERIC_VALUE_OUT_OF_RANGE error code

CAST of real values outside of BIGINT range will now return a NUMERIC_VALUE_OUT_OF_RANGE error. Previously they would silently overflow.

Test Plan

unit tests

Contributor checklist

Please make sure your submission complies with our development, formatting, commit message, and attribution guidelines.
PR description addresses the issue accurately and concisely. If the change is non-trivial, a GitHub Issue is referenced.
Documented new properties (with its default value), SQL syntax, functions, or other functionality.
If release notes are required, they follow the release notes guidelines.
Adequate tests were added if applicable.
CI passed.

Release Notes

Please follow release notes guidelines and fill in the release notes below.

== RELEASE NOTES ==

General Changes
* Fix cast of NaN and Infinity from DOUBLE or REAL to  BIGINT, INTEGER, SMALLINT, and TINYINT. It will now return an exception with the INVALID_CAST_ARGUMENT error code. Previously it would return zero.
* Fix CAST of REAL values outside of BIGINT range to return an exception with an INVALID_CAST_ARGUMENT error code. Previously they would silently overflow.
* Change error code for cast from DOUBLE or REAL to BIGINT, INTEGER, SMALLINT or TINYINT for out of range values from ``NUMERIC_VALUE_OUT_OF_RANGE`` to ``INVALID_CAST_ARGUMENT``.

ClarenceThreepwood · 2024-06-04T22:25:07Z

This also brings Presto and velox behavior in alignment. We are choosing to modify Presto in this case because velox behavior is correct here (we would want to change Presto regardless of the native worker migration)

Another data point - this is also the approach that most other db engines take (e.g. Postgres)

elharo

Keying off exception messages is very brittle.

elharo · 2024-06-05T12:23:30Z

presto-main/src/main/java/com/facebook/presto/type/DoubleOperators.java

        }
        catch (ArithmeticException e) {
-            throw new PrestoException(NUMERIC_VALUE_OUT_OF_RANGE, "Out of range for integer: " + value, e);
+            if (e.getMessage().equals("not in range")) {


How can this be done without relying on undocumented exception messages that are subject to change between JDKs and JDK versions?

If it can't, I'd drop the conditional and simply always use an INVALID_CAST_ARGUMENT

yeah, I didn't love it, but didn't see other options.. I'll switch to always using INVALID_CAST_ARGUMENT instead.

elharo · 2024-06-05T12:24:47Z

presto-main/src/main/java/com/facebook/presto/type/DoubleOperators.java

+            return Shorts.checkedCast(DoubleMath.roundToInt(value, HALF_UP));
+        }
+        catch (ArithmeticException e) {
+            if (e.getMessage().equals("not in range")) {


elharo · 2024-06-05T12:25:35Z

presto-main/src/main/java/com/facebook/presto/type/DoubleOperators.java

+            return SignedBytes.checkedCast(DoubleMath.roundToInt(value, HALF_UP));
+        }
+        catch (ArithmeticException e) {
+            if (e.getMessage().equals("not in range")) {


elharo · 2024-06-05T12:25:53Z

presto-main/src/main/java/com/facebook/presto/type/RealOperators.java

+            return DoubleMath.roundToLong(intBitsToFloat((int) value), HALF_UP);
+        }
+        catch (ArithmeticException e) {
+            if (e.getMessage().equals("not in range")) {


elharo · 2024-06-05T12:26:01Z

presto-main/src/main/java/com/facebook/presto/type/RealOperators.java

        }
        catch (ArithmeticException e) {
-            throw new PrestoException(NUMERIC_VALUE_OUT_OF_RANGE, "Out of range for integer: " + value, e);
+            if (e.getMessage().equals("not in range")) {


Fix cast of nan and infinity from DOUBLE/REAL to BIGINT/INT/SMALLINT/TINYTINT. Previously all except double -> bigint would return zero. Now they will all throw an INVALID_CAST_ARGUMENT exception. Fix silent overflow for casting from REAL to BIGINT type. We now throw an INVALID_CAST_ARGUMENT if the value is out of the BIGINT range. Change error code from NUMERIC_VALUE_OUT_OF_RANGE to INVALID_CAST_ARGUMENT for out of range values in casts from floating point to integer types. With the library we now use for the implementaiton for these casts, we can't always tell by the exception type that the value is out of range, so we return the more generic, but still relevant INVALID_CAST_ARGUMENT instead.

rschlussel requested a review from a team as a code owner June 4, 2024 18:03

rschlussel requested a review from presto-oss June 4, 2024 18:03

rschlussel force-pushed the cast-nan branch from 487903d to a857670 Compare June 4, 2024 19:05

ClarenceThreepwood previously approved these changes Jun 4, 2024

View reviewed changes

elharo requested changes Jun 5, 2024

View reviewed changes

rschlussel dismissed ClarenceThreepwood’s stale review via 8a5be08 June 5, 2024 14:29

rschlussel force-pushed the cast-nan branch from a857670 to 8a5be08 Compare June 5, 2024 14:29

rschlussel force-pushed the cast-nan branch from 8a5be08 to 45ef7e3 Compare June 5, 2024 16:03

rschlussel requested a review from elharo June 5, 2024 17:52

elharo approved these changes Jun 5, 2024

View reviewed changes

arhimondr approved these changes Jun 5, 2024

View reviewed changes

rschlussel merged commit b5b0dd8 into prestodb:master Jun 5, 2024
56 checks passed

wanglinsong mentioned this pull request Jun 25, 2024

Add release notes for 0.288 #23079

Merged

36 tasks

tdcmeehan mentioned this pull request Jul 29, 2024

Presto allows cast from out-of-range floating point value to bigint and returns "bogus" result #22640

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Throw exception for cast of nan and infinity to int types #22917

Throw exception for cast of nan and infinity to int types #22917

rschlussel commented Jun 4, 2024 •

edited

Loading

ClarenceThreepwood commented Jun 4, 2024

elharo left a comment

elharo Jun 5, 2024

rschlussel Jun 5, 2024

elharo Jun 5, 2024

elharo Jun 5, 2024

elharo Jun 5, 2024

elharo Jun 5, 2024

Throw exception for cast of nan and infinity to int types #22917

Throw exception for cast of nan and infinity to int types #22917

Conversation

rschlussel commented Jun 4, 2024 • edited Loading

Description

Motivation and Context

Impact

Test Plan

Contributor checklist

Release Notes

ClarenceThreepwood commented Jun 4, 2024

elharo left a comment

Choose a reason for hiding this comment

elharo Jun 5, 2024

Choose a reason for hiding this comment

rschlussel Jun 5, 2024

Choose a reason for hiding this comment

elharo Jun 5, 2024

Choose a reason for hiding this comment

elharo Jun 5, 2024

Choose a reason for hiding this comment

elharo Jun 5, 2024

Choose a reason for hiding this comment

elharo Jun 5, 2024

Choose a reason for hiding this comment

rschlussel commented Jun 4, 2024 •

edited

Loading