Separate validation of operation and coercion of variables into separate steps #3658

IvanGoncharov · 2022-06-22T12:27:10Z

This draft PR shows what I'm currently working on.
It passes tests but fails due to lower coverage.
I will try to split it into smaller commits with proper descriptions and separate discussions.
The purpose of this Draft PR is to show the end goal of these smaller comments.

Any feedback/review is welcome but please keep in mind this PR is in the exploration phase (and code is still raw) so please focus only on high-level stuff.

netlify · 2022-06-22T12:27:19Z

✅ Deploy Preview for compassionate-pike-271cb3 ready!

Name	Link
🔨 Latest commit	`086a4f7`
🔍 Latest deploy log	https://app.netlify.com/sites/compassionate-pike-271cb3/deploys/62b30aa053b498000877b169
😎 Deploy Preview	https://deploy-preview-3658--compassionate-pike-271cb3.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site settings.

github-actions · 2022-06-22T12:44:12Z

@github-actions publish-pr-on-npm

@IvanGoncharov The latest changes of this PR are available on NPM as
graphql@17.0.0-alpha.1.canary.pr.3658.null
Note: no gurantees provided so please use your own discretion.

Also you can depend on latest version built from this PR:
npm install --save graphql@canary-pr-3658

IvanGoncharov · 2022-06-22T13:29:19Z

It's interesting that this change made execute faster.

I ran it a few times and it was always faster.

yaacovCR · 2022-06-22T17:47:45Z

src/execution/__tests__/executor-test.ts

@@ -871,7 +871,6 @@ describe('Execute: Handles basic execution tasks', () => {
    expectJSON(
      executeSync({ schema, document, operationName: 'Q' }),
    ).toDeepEqual({
-      data: null,


this change fits your proposed simplified criteria here: graphql/graphql-spec#894 (comment) but I am not sure about it

yaacovCR · 2022-06-22T17:47:57Z

src/execution/__tests__/executor-test.ts

@@ -883,7 +882,6 @@ describe('Execute: Handles basic execution tasks', () => {
    expectJSON(
      executeSync({ schema, document, operationName: 'M' }),
    ).toDeepEqual({
-      data: null,


same as above

yaacovCR · 2022-06-22T17:48:04Z

src/execution/__tests__/executor-test.ts

@@ -895,7 +893,6 @@ describe('Execute: Handles basic execution tasks', () => {
    expectJSON(
      executeSync({ schema, document, operationName: 'S' }),
    ).toDeepEqual({
-      data: null,


same as above

yaacovCR · 2022-06-22T17:55:19Z

src/execution/__tests__/subscribe-test.ts

@@ -424,6 +424,7 @@ describe('Subscription Initialization Phase', () => {
        {
          message: 'The subscription field "unknownField" is not defined.',
          locations: [{ line: 1, column: 16 }],
+          path: ['unknownField'],


I am not sure about this one. There is not an exact corollary to queries/mutations, because unknown fields there are silently dropped rather than raising an error.

The spec at https://spec.graphql.org/draft/#sel-HAPHRPJABABmE-1Y says:

This field should be a list of path segments starting at the root of the response and ending with the field associated with the error. Path segments that represent fields should be strings, and path segments that represent list indices should be 0-indexed integers. If the error happens in an aliased field, the path to the error should use the aliased name, since it represents a path in the response, not in the request.

The last sentence seems odd to me as the paths in the response and the request are the same -- perhaps it should be interpreted that the path is the path in the response, not in the schema. That might argue as you have here that a field not in the schema can have a path. Although, I am not sure about this, as perforce if the field is not in the schema, it cannot be in the response. For subscriptions, of course, there is no path in the response of CreateSourceEventStream, as the response for that section of the algorithm is not a map, it's an AsyncIterable. So, overall, I think we are better off without the path, although it seems to me that it could go either way.

yaacovCR · 2022-06-22T17:56:56Z

src/execution/execute.ts

-  errors: ReadonlyArray<GraphQLError>,
-): ExecutionResult {
-  return errors.length === 0 ? { data } : { errors, data };
+interface GraphQLExecutionPlanOptions {


These options are to be passed into the makeExecutionPlan function and are used to construct the Plan below. But they also just form part of the Plan itself, and so they are not that different in kind from the other properties of the Plan, which are all essentially options. It seems as if these options might be better named "MakeExecutionPlanOptions", see my comment below on that function.

yaacovCR · 2022-06-22T18:00:06Z

src/execution/execute.ts


+type ResultOrGraphQLErrors<T> =


Is this helpful in terms of #3649 ?? -- I'm afraid I did not try to reproduce that error, and have not yet hit it with my TS version.

Yes, exactly.
I missed #3649 but I always dislike this particular check.
I think we need to have a generic convention and this one seems like one that suits our use case the best since the error response is a valid GraphQL response.

yaacovCR · 2022-06-22T18:12:22Z

src/execution/execute.ts

+  schema: GraphQLSchema,
+  document: DocumentNode,
+  operationName: Maybe<string>,
+  options: GraphQLExecutionPlanOptions = {},


Why are some of these named options, but not others. Is the function going to be memoized?

yaacovCR · 2022-06-22T18:20:47Z

src/execution/execute.ts

+  | ExecutableMutationRequest
+  | ExecutableSubscriptionRequest;
+
+export interface ExecutableQueryRequest {


@IvanGoncharov This is the key comment I believe in my review of this approach. It's more of a question than a comment.

I know I have asked elsewhere to export interfaces so that third parties can customize, add hooks, etc. I am generally happy to see the suggestion adopted, but I have to say that I don't know if the added complexity in this particular instance is so helpful. We currently have a pipeline of schema generation => request parse => request validate => request execute. Hidden in there also as a step is that without "assumeValid", schema validation is automatically performed.

My thinking in overall direction was to break out additional recommended steps, i.e., "validateSchema" would be its own step, request document parsing would be its own step, variable coercion would be its own step, and execute would then operation on the results of the previous stages, such that ExecutionContext would be broken down into its constituent parts, some cacheable (document), some not cacheable/not as cacheable (variables). Perhaps this interface/implementation approach has some additional benefits I am not appreciating?

Hidden in there also as a step is that without "assumeValid", schema validation is automatically performed.

It's not hidden, it's "optional" you can perform it explicitly if you want.
All GraphQL servers I saw just call validateSchema and even our own graphql function does that, for example, https://github.com/graphql/express-graphql/blob/f4414b44996f25a5328e523d9a4b213fd1d70b16/src/index.ts#L268
But I agree it's a bad API design so I plan to work on the GraphQLExecutableSchema class to make it more explicit from an API standpoint.

I know I have asked elsewhere to export interfaces so that third parties can customize, add hooks, etc. I am generally happy to see the suggestion adopted, but I have to say that I don't know if the added complexity in this particular instance is so helpful.

I still have the same position about hooks, this PR adds interfaces that can be used by external libraries to customize executor.
Different here is that we don't have any single line of code that can run 3rd-party ExecutableRequest, so it's not a hook.

I'm for adding configuration/interfaces if the below rules are satisfied:

Use case for which we add it is fully spec-compliant.

We can't incorporate this use case in graphql-js. For example, there is no "one size fits all" solution.

Custom GraphQL executors satisfy both rules, JIT is a valid use case but JIT is not a universal one (not everyone trusts generated code to run through eval) it also applies some limitations: https://github.com/zalando-incubator/graphql-jit#differences-to-graphql-js
Even JIT aside, custom execute engines purposely built for complex scenarios like federation, stitching, etc. will always outperform generic ones even with all necessary hooks provided (hooks add their own cost and prevent certain JIT/compilation optimizations).
So it's a valid use case for adding interfaces.

This PR basically implements the #3314 proposal and defines clean interfaces that can be used by GraphQL servers to support specialized GraphQL executors.

I have to say that I don't know if the added complexity in this particular instance is so helpful.

Just to be clear, are you referring to interfaces specifically or your comment is about the entire approach of using class instances vs having functions accept query plan as an argument?

I don't think we need an ExecutableSchema class. I think if validateSchema is not called, the schema should not be validated, plain and simple.

In terms of your last question -- I am not sure I understand the distinction. I commented below in terms of relationship to #3314.

Basically, if the goal is to separate out coercion, and storing result of document analysis into separate step, I think that can be done as separate exported functions, like analyzeDocument and coerceVariables, and then the user can cache them. We actually don't have to deprecate execute, we can still include it for those who want it but just be explicit about which exported pipeline steps it performs: analyzeDocument, coerceVariables, getRootType, executeOperation...

yaacovCR · 2022-06-22T18:25:59Z

src/execution/execute.ts

+  schema: GraphQLSchema;
+  operation: OperationDefinitionNode;
+  fragments: ObjMap<FragmentDefinitionNode>;
+  rootType: GraphQLObjectType;


It's nice that this is cacheable at this point, so that we don't have to repeatedly check that it exists for the same schema/operation.

yaacovCR · 2022-06-22T18:31:34Z

src/execution/execute.ts

+  /**
+   * Implements the "ExecuteQuery" algorithm described in the GraphQL specification.
+   */
+  executeOperation: (


see above, I like how executeOperation now actually executes every operation (whereas previously the term was used for retrieving the data-only for queries/mutations). I just think we can deprecate the execute stage of the pipeline and utilize executeOperation, which would take a set of cacheable parameters (schema/document) as well as the 3 arguments here).

I just think we can deprecate the execute stage of the pipeline and utilize executeOperation, which would take a set of cacheable parameters (schema/document) as well as the 3 arguments here).

We can do that as an intermediate step.
Just to clarify you are proposing using WeakMap internally to cache stuff based on schema/document/operationName as you do in graphql-executor?
Or are you proposing to use externally catchable ExecutionPlanas a replacement for schema/document/operationName?

The latter.

But as I wrote above, although the beginnings of #3314 r here, there is very little that is being actually done and stored within the plan (afaict) besides storing the a validated root type. In terms of the exported interface, it also would not be enough (afaict) to build a customized stitching executor which needs access to later steps in the execution algorithm. Although I think I see the direction, in terms of functionality, I am not sure this as yet brings enough minimum value to be worth the complexity.

I was constrained within graphql-executor because I wanted it to be a drop in replacement for graphql-js. So there, I used a simple memoization strategy that assumed if you have cached a document, everything about it should be cached.

If we are talking about changing the graphql-js api - I think exporting a few more low level building blocks is the low-hanging fruit.

IvanGoncharov added 5 commits June 21, 2022 17:18

subscribe: fix missing path on unknown field error

dbd1118

step 1

a1b86ac

step 2

974d92d

step3

f5c9d66

temp

086a4f7

This comment has been minimized.

Sign in to view

yaacovCR reviewed Jun 22, 2022

View reviewed changes

This was referenced Jun 24, 2022

alternative refactor #3660

Closed

execute: integrate subscriptions and refactor pipeline #3644

Closed

yaacovCR mentioned this pull request Jul 21, 2022

Allow separation of variable value coercion from the rest of execution #3679

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate validation of operation and coercion of variables into separate steps #3658

Separate validation of operation and coercion of variables into separate steps #3658

IvanGoncharov commented Jun 22, 2022

netlify bot commented Jun 22, 2022 •

edited

Loading

This comment has been minimized.

github-actions bot commented Jun 22, 2022

IvanGoncharov commented Jun 22, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

IvanGoncharov Jun 22, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

IvanGoncharov Jun 22, 2022 •

edited

Loading

yaacovCR Jun 23, 2022

yaacovCR Jun 22, 2022

yaacovCR Jun 22, 2022

IvanGoncharov Jun 22, 2022

yaacovCR Jun 23, 2022

yaacovCR Jun 23, 2022

yaacovCR Jun 23, 2022


		type ResultOrGraphQLErrors<T> =

Separate validation of operation and coercion of variables into separate steps #3658

Are you sure you want to change the base?

Separate validation of operation and coercion of variables into separate steps #3658

Conversation

IvanGoncharov commented Jun 22, 2022

netlify bot commented Jun 22, 2022 • edited Loading

✅ Deploy Preview for compassionate-pike-271cb3 ready!

This comment has been minimized.

github-actions bot commented Jun 22, 2022

IvanGoncharov commented Jun 22, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

IvanGoncharov Jun 22, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

netlify bot commented Jun 22, 2022 •

edited

Loading

IvanGoncharov Jun 22, 2022 •

edited

Loading