Handle invalid rows in the Storage Api sink #17423
Conversation
Codecov Report
@@ Coverage Diff @@
## master #17423 +/- ##
==========================================
+ Coverage 73.98% 74.00% +0.01%
==========================================
Files 696 696
Lines 91851 91851
==========================================
+ Hits 67958 67975 +17
+ Misses 22644 22627 -17
Partials 1249 1249
@reuvenlax - Is this ready for a review?
@aaltay this is now ready for review. Who would be the best reviewer for this?
friendly ping
Thanks.
StorageApiWritePayload payload = messageConverter.toMessage(element.getValue());
o.get(successfulWritesTag).output(KV.of(element.getKey(), payload));
} catch (TableRowToStorageApiProto.SchemaConversionException e) {
TableRow tableRow = messageConverter.toTableRow(element.getValue());
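The routing pattern under discussion can be sketched outside Beam as a plain try/catch router: only the conversion exception is caught and diverted to a failed-rows output, while any other exception still propagates. This is a minimal illustration, not the actual Beam DoFn API; the class, converter, and list names here are hypothetical stand-ins for the tagged outputs.

```java
import java.util.ArrayList;
import java.util.List;

/** Minimal sketch of the dead-letter routing in StorageApiConvertMessages.
 *  Names are illustrative; the real code emits to TupleTag-ged outputs. */
public class DeadLetterRouter {
    /** Stand-in for TableRowToStorageApiProto.SchemaConversionException. */
    static class SchemaConversionException extends Exception {
        SchemaConversionException(String msg) { super(msg); }
    }

    /** Stand-in converter: rejects values that are not integers. */
    static byte[] toMessage(String value) throws SchemaConversionException {
        try {
            Integer.parseInt(value);
            return value.getBytes();
        } catch (NumberFormatException e) {
            throw new SchemaConversionException("not an integer: " + value);
        }
    }

    static final List<String> successes = new ArrayList<>();
    static final List<String> failures = new ArrayList<>();

    /** Mirrors the try/catch above: only the conversion exception is
     *  caught and routed to the failure output; anything else propagates. */
    static void process(String element) {
        try {
            toMessage(element);
            successes.add(element); // o.get(successfulWritesTag).output(...)
        } catch (SchemaConversionException e) {
            failures.add(element);  // o.get(failedWritesTag).output(...)
        }
    }
}
```

The key property, which the review thread turns on, is that the catch is narrow: a failure of any other kind is not silently sent to the dead-letter output.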
I'm a bit worried about just pushing all messages from an exception handler to a DLQ.
(1) This could result in errors from downstream fused steps being sent to the DLQ instead of being retried.
(2) Messages being sent to a DLQ in an unintended way may be perceived as data loss by a user of the I/O connector.
I think we should build a retry policy around this (or use the existing BQ retry policy) so that users explicitly mark messages that should be sent to a DLQ.
WDYT?
We're not pushing all errors, just the SchemaConversionExceptions, which are
thrown when converting the JSON to a proto. If a downstream ParDo threw our
internal SchemaConversionException, that would be a very weird thing to do
(we could make it package private to ensure this can't happen).
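Making the exception package private, as suggested here, is just a matter of declaring it with default visibility, so that code outside the connector's package cannot throw (or even name) it and be mistakenly routed to the DLQ. A minimal sketch, assuming the exception lives in the BigQuery I/O package:

```java
// Illustrative sketch: in org.apache.beam.sdk.io.gcp.bigquery, declaring
// the exception without the `public` modifier gives it package-private
// visibility. User code in other packages cannot reference the type, so a
// downstream ParDo cannot throw it and end up in the dead-letter output.
class SchemaConversionException extends Exception {
    SchemaConversionException(String message) {
        super(message);
    }
}
```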
In sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/StorageApiConvertMessages.java:
      throws Exception {
    dynamicDestinations.setSideInputAccessorFromProcessContext(c);
    MessageConverter<ElementT> messageConverter =
        messageConverters.get(
            element.getKey(), dynamicDestinations, getDatasetService(pipelineOptions));
-   StorageApiWritePayload payload = messageConverter.toMessage(element.getValue());
-   o.output(KV.of(element.getKey(), payload));
+   try {
+     StorageApiWritePayload payload = messageConverter.toMessage(element.getValue());
+     o.get(successfulWritesTag).output(KV.of(element.getKey(), payload));
+   } catch (TableRowToStorageApiProto.SchemaConversionException e) {
+     TableRow tableRow = messageConverter.toTableRow(element.getValue());
Can we build a retry policy that only includes "SchemaConversionExceptions" by default? And can we expose this through the API, similar to the existing failedInsertRetryPolicy?
https://github.com/apache/beam/blob/0c9cf43a7edae2e2a2622a8f4241b64a638121bb/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.java#L2340
I'm not sure I understand. Retry _never_ makes sense here. If we failed to
convert once, we will continue to fail to convert. The Storage Write API
does not currently return per-row errors, however that is being added.
However the plan is for it to only return errors for rows that cannot be
inserted at all (e.g. the row is larger than allowed, a field does not
match a schema constraint, etc.). A retry policy won't make sense there
either, since such a failure means that the insert will continue to fail no
matter how many times we retry it.
The failedInsertRetryPolicy made sense for the old InsertAll API, since
that API would return errors that were retryable, and the user sometimes
needed to specify whether to retry or not (though that API was always
incomplete, since users often wanted to specify a maximum number of retries,
which was not supported). Here I don't think it really makes sense.
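For context, the existing failedInsertRetryPolicy style boils down to a predicate over insert errors. A minimal stand-in (not the actual BigQueryIO `InsertRetryPolicy` API; the error-reason strings here are illustrative) shows why such a predicate is useless for schema-conversion failures: they are deterministic, so no policy choice can make a retry succeed.

```java
/** Sketch of a retry-policy predicate in the style of BigQueryIO's
 *  failedInsertRetryPolicy. The real API inspects per-row insert errors
 *  from the InsertAll response; this stand-in classifies error reasons. */
public class RetryPolicySketch {
    interface RetryPolicy {
        boolean shouldRetry(String errorReason);
    }

    /** Retry transient errors, but never permanent ones such as schema
     *  mismatches, which fail identically on every attempt. */
    static final RetryPolicy RETRY_TRANSIENT_ERRORS = reason ->
        !reason.equals("invalid") && !reason.equals("schemaMismatch");

    /** Send every failed row straight to the dead-letter output. */
    static final RetryPolicy NEVER_RETRY = reason -> false;
}
```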
Ok. Thanks for clarifying. LGTM. +1 for making SchemaConversionException package private.
Run Java PreCommit
Run Kotlin_Examples PreCommit
Run Java_Examples_Dataflow PreCommit
Run Java PreCommit
2 similar comments
Run Java PreCommit
Run Java PreCommit
Run Java PreCommit
4 similar comments
Run Java PreCommit
Run Java PreCommit
Run Java PreCommit
Run Java PreCommit
Run Java PreCommit
2 similar comments
Run Java PreCommit
Run Java PreCommit
This seems to be a step toward solving the BEAM-13158 issue (https://issues.apache.org/jira/browse/BEAM-13158). Maybe the issue and PR should be linked? Will there be other PRs regarding row-level error handling via the Storage API?
This handles the case where there are "obvious" schema incompatibilities
(e.g. wrong field names, wrong types, etc.). We want to do better, but
that will require support from BigQuery, which should be coming.