WIP: try making cargo tree's Graph from a UnitGraph #11379

alsuren · 2022-11-15T12:00:22Z

What does this PR try to resolve?

This is an attempt at implementing step 1 from #9599 (comment) - a function which converts from a UnitGraph to a cargo::ops::tree::graph::Graph

This would make it easier to find edge cases where cargo tree disagrees with cargo build.

How should we test and review this PR?

TODO (this PR is currently WIP)

Demonstrate how you test this change and guide reviewers through your PR.
With a smooth review process, a pull request usually gets reviewed quicker.

If you don't know how to write and run your tests, please read the guide:
https://doc.crates.io/contrib/tests

Additional information

minimal test
- think a bit harder about how/whether to expose this code (currently hidden behind a CARGO_TREE_FROM_UNIT_GRAPH environment variable for simplicity)
actually implement Graph::from_bcx() (to create a Graph from a UnitGraph)
run everything in the current test suite using Graph::from_bcx() rather than graph::build() and fix any discrepancies
- think about how cargo::core::compiler::unit_dependencies::State::deps() should handle filtering for --target=all (it seems like tree has a copy-pasta version of it. It might be possible to reconcile the two, but it feels a bit meaningless)
get feedback on whether the team likes this approach (if not then that's fine - I need to build something like this for cargo quickbuild anyway, so it won't be wasted effort).

Failing tests to chip away at:

rustbot · 2022-11-15T12:00:26Z

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @weihanglo (or someone else) soon.

Please see the contribution instructions for more information.

* does not understand build dependencies * completely ignores dev dependencies

…ncies

alsuren · 2022-11-15T18:01:00Z

Progress report for today:

my simple_from_unit_graph test now passes (so top-level build-deps and dev-deps are now supported)

If I apply this patch:

diff --git a/crates/cargo-test-support/src/lib.rs b/crates/cargo-test-support/src/lib.rs
index 4ce0162bf..513d4f4f8 100644
--- a/crates/cargo-test-support/src/lib.rs
+++ b/crates/cargo-test-support/src/lib.rs
@@ -414,6 +414,7 @@ impl Project {
         let mut execs = self.process(&cargo);
         if let Some(ref mut p) = execs.process_builder {
             p.env("CARGO", cargo);
+            p.env("CARGO_TREE_FROM_UNIT_GRAPH", "1");
             p.arg_line(cmd);
         }
         execs

then it causes failures from the 25 tests

[edit: copied to the top comment to give a better indication of progress]

Something to work on tomorrow 😁 .

test tree::features now passes

This reverts commit df6cd75.

TODO: think about --target=all

…target=all'

…ypes

alsuren · 2022-11-20T19:45:58Z

Progress report for the weekend:

Apart from the tests that pass --target=all, I only have about 3 failing tests. I can probably chip away at these, but I'm not sure what to do about --target=all.

Questions:
a) Do you think that it's worth trying to teach cargo::core::compiler::unit_dependencies about --target=all or is it possible to deprecate it?
b) cargo tree -p $package seems to mean something slightly different from cargo build -p $package (cargo tree seems to use the whole workspace for calculating the dep tree, and then print a filtered tree by using $package as the root). Do I have this right, or do I just have bugs in my code/understanding? I'm tempted to make a cargo build --stop-after-package $package that does the same thing as cargo tree -p $package. Does this seem like a reasonable idea?
c) at the moment, I'm only adding complexity (because the existing code is useful as an oracle), but in theory this approach would let us remove a bunch of complexity by removing the existing Graph-building machinery. Is this a thing we want to do? Should I have a stab at it, and see what my diffstat looks like with the old code removed?
d) is this PR fleshed-out enough to determine whether we even want to go down this route? Is there anything else that would make it easier to make that decision?

I doubt that this PR will be in a reasonable state to merge before I get swallowed up by my new job, but it will hopefully serve as a reference of what is possible and what is easier/harder if you go down this route. I will therefore close this PR on Tuesday, whatever state it's in. I will leave the branch around in case someone wants to use it as inspiration.

weihanglo · 2022-11-21T12:18:51Z

Haven't really looked deeply into your change, but I do appreciate what you've done. The amount of code added so far are less than I expected. It is indeed a great exploration. Thank you!

a) Do you think that it's worth trying to teach cargo::core::compiler::unit_dependencies about --target=all or is it possible to deprecate it?

I don't think so, at least for the latter. They have different meanings. We already have ForceAllTargets to disable target platform filters. Maybe we can look into that without too many churns on the other parts of code?

b) cargo tree -p $package seems to mean something slightly different from cargo build -p $package (cargo tree seems to use the whole workspace for calculating the dep tree, and then print a filtered tree by using $package as the root). Do I have this right, or do I just have bugs in my code/understanding? I'm tempted to make a cargo build --stop-after-package $package that does the same thing as cargo tree -p $package. Does this seem like a reasonable idea?

Package filtering process happens here, so ws_resolve should resolve what is passed incargo tree -p. Did you find any abnormal behaviour? For --stop-after-package, I am not sure what it is used for, though usually Cargo team takes it serious when adding a new flag.

c) at the moment, I'm only adding complexity (because the existing code is useful as an oracle), but in theory this approach would let us remove a bunch of complexity by removing the existing Graph-building machinery. Is this a thing we want to do? Should I have a stab at it, and see what my diffstat looks like with the old code removed?

Go ahead, or put it into another module, so we can have a clear view of the diff :)

d) is this PR fleshed-out enough to determine whether we even want to go down this route? Is there anything else that would make it easier to make that decision?

Even the final result reduces the complexity, I'll still ask about the extensibility, such as handling an issue like #10593 (comment). I think ehuss has some concerns on sharing code, so having ehuss taking a look is better once it is in a relatively good shape.

alsuren · 2022-11-21T13:36:28Z

Haven't really looked deeply into your change, but I do appreciate what you've done. The amount of code added so far are less than I expected. It is indeed a great exploration. Thank you!

a) Do you think that it's worth trying to teach cargo::core::compiler::unit_dependencies about --target=all or is it possible to deprecate it?

I don't think so, at least for the latter. They have different meanings. We already have ForceAllTargets to disable target platform filters. Maybe we can look into that without too many churns on the other parts of code?

Ah. I think I might have misunderstood that part of the code. For some reason, I thought ForceAllTargets was about the --all-targets flag (equivalent to specifying --lib --bins --tests --benches --examples). Thanks. I'll have a go at using that machinery.

b) cargo tree -p $package seems to mean something slightly different from cargo build -p $package (cargo tree seems to use the whole workspace for calculating the dep tree, and then print a filtered tree by using $package as the root). Do I have this right, or do I just have bugs in my code/understanding? I'm tempted to make a cargo build --stop-after-package $package that does the same thing as cargo tree -p $package. Does this seem like a reasonable idea?

Package filtering process happens here, so ws_resolve should resolve what is passed incargo tree -p. Did you find any abnormal behaviour? For --stop-after-package, I am not sure what it is used for, though usually Cargo team takes it serious when adding a new flag.

I think this may be due to bugs in my understanding again. I was under the impression that it was possible to ask cargo tree to "show me how cargo would build this workspace, but only show me things below $package" in the tree.

dig into the the tests that I chalked up to this discrepancy and see if I can resolve them.

c) at the moment, I'm only adding complexity (because the existing code is useful as an oracle), but in theory this approach would let us remove a bunch of complexity by removing the existing Graph-building machinery. Is this a thing we want to do? Should I have a stab at it, and see what my diffstat looks like with the old code removed?

Go ahead, or put it into another module, so we can have a clear view of the diff :)

I will do the "another module" thing first, and then do "deleting the old code" as a separate PR against my own fork if I get time.

d) is this PR fleshed-out enough to determine whether we even want to go down this route? Is there anything else that would make it easier to make that decision?

Even the final result reduces the complexity, I'll still ask about the extensibility, such as handling an issue like #10593 (comment). I think ehuss has some concerns on sharing code, so having ehuss taking a look is better once it is in a relatively good shape.

I suppose this is a philosophical thing: is cargo tree a teaching/exploring tool (in which case, clarity and expressiveness might be more important) or is it a debugging tool (in which case correctness might be more important)?

Fundamentally, cargo tree does not work on the Unit level, and there will always be discrepancies between it and cargo build. This is something that I didn't appreciate when I started this project. It will certainly help when I next have time to work on cargo quickbuild.

alsuren · 2022-11-22T12:24:18Z

src/cargo/ops/tree/mod.rs

+    // FIXME: we're creating this here, but build_ctx() is also creating one.
+    // Can we refactor things so that they are shared?


Suggested change

// FIXME: we're creating this here, but build_ctx() is also creating one.

// Can we refactor things so that they are shared?

// FIXME: we're creating this here, but create_bcx() is also creating one.

// Can we refactor create_bcx() into multiple functions so that they can be shared?

alsuren · 2022-11-22T12:24:30Z

I've reached the end of my timebox for this.

I briefly tried propagating ForceAllTargets, but I ran out of time before I got any --target=all tests passing. I also didn't get time to do any cleaning up, so it's still a mess. Sorry.

I still think that this is a promising idea.

Advantages:

The Graph building code in graph::from_bcx() does not need to be recursive anymore. This means that it is easier to follow, and doesn't have to worry about infinite loops.
- It does a pass to add all of the Package nodes, then a pass to add all of the links (which may add Feature nodes if feature edges are enabled)
- The pass to add cli features at the end is currently still the old recursive add_internal_features() code, but it might be possible to flatten that out too (not sure).
We now match the behaviour of UnitGraph - build-dependencies are ignored if there is no build.rs unit.

As an aside, I wonder if we could use a similar approach to support displaying artifact dependencies and build-scripts. You would want a -e units or something, and a Node::Unit variant, a bit like how Node::Feature works (I have not thought about how it would interact with -e features). Going via UnitGraph for that feature would make things a lot easier, so if you are already going via UnitGraph for everything else, you might have an advantage.

Risks:

I didn't have time to look into the --target=all tests, and a few other things, so there are 6 failing tests. Any one of them might be difficult enough to solve that it kills this idea dead.
we still rely on calling out to the resolver to work out why each of the UnitDeps are there, because they don't contain enough information on their own. This could be considered to be a bad thing, but also may be considered to be a good thing, if we need extension points.
I'm still using a few of the old builder functions, like add_internal_features(), and they might be correcting for sloppiness in how I'm building the tree. It might be valuable to rewrite them in terms of UnitGraph to make sure we're not missing anything.
Please don't let my prototype-quality code convince you that it's a bad idea: it is my first attempt at hacking on cargo internals, so it's a bit of a mess.

Thanks for all of your support in this exploration. Hopefully it contains some useful ideas. Even if it doesn't, I have learned a lot.

Thanks again.

David.

alsuren added 3 commits November 15, 2022 11:12

initial failing test

c0a4e8b

skeleton Graph::from_bcx()

7cab4ab

fixme

532d075

rustbot assigned weihanglo Nov 15, 2022

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Nov 15, 2022

alsuren mentioned this pull request Nov 15, 2022

cargo tree doesn't handle transitive duplications properly #9599

Open

alsuren added 3 commits November 15, 2022 15:51

partially working graph::from_bcx()

b928ab5

* does not understand build dependencies * completely ignores dev dependencies

handle build dependencies by asking the resolver

c4486e0

use CompileMode::Test to convince the resolver to give us dev-depende…

8679e80

…ncies

alsuren added 19 commits November 18, 2022 17:44

also remove fixme about dev deps

cf7dab6

REVERTME: run tests with new Graph builder algorithm

1a11448

initial support for features

0677377

REVERTME: enable trace debugging for tests

df6cd75

identify what the problem is with my test

13c87f1

make test include build.rs to enable build-dependencies

099e351

REVERTME: debug for common panic in package_for_id()

c8d609f

filter out edge kinds that were not requested

d24448c

optional: set CompileMode based on HasDevUnits

6b671e9

fill in dep_name_map

4a40877

copy TreeOptions::cli_features onto CompileOptions

161d0fb

test tree::features now passes

Revert "REVERTME: enable trace debugging for tests"

9e48c3d

This reverts commit df6cd75.

only populate dep_name_map when needed

6ec86cb

forward packages from TreeOptions to CompileOptions

f605535

copy requested_kinds across, for --target

40f795e

fudge tree::filters_target test so that it passes

c25fa77

TODO: think about --target=all

add build.rs to tree::invert_with_build_dep test

f9e104f

add comments for remaining test failures

63446d0

note why the rest of the tests are failing. All '-e features' and '--…

e8ed11a

…target=all'

alsuren added 8 commits November 20, 2022 15:23

handle -e features - add default feature and disable all other link t…

fde5f77

…ypes

write up how features_namespaced::tree is failing

fde631d

do feature-adding properly; don't preallocate ::Feature nodes

8befb2a

don't panic on --features bar?/feat where bar is not activated

a3e0932

hack in a Normal dep if no features asked for

bde2a08

annotate another --target=all test

3499509

note where tree::host_Dep_feature might be going wrong

95dcd44

update fixmes in tests

f2c6540

WIP on supporting --target=all via ForceAllTargets (doesn't work yet)

bb9a4e2

alsuren commented Nov 22, 2022

View reviewed changes

alsuren closed this Nov 22, 2022

alsuren mentioned this pull request Nov 22, 2022

Fix builds involving proc-macro crates (brute-force the thing?) cargo-quick/cargo-quick#36

Open

11 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: try making cargo tree's Graph from a UnitGraph #11379

WIP: try making cargo tree's Graph from a UnitGraph #11379

alsuren commented Nov 15, 2022 •

edited

Loading

rustbot commented Nov 15, 2022

alsuren commented Nov 15, 2022 •

edited

Loading

alsuren commented Nov 20, 2022

weihanglo commented Nov 21, 2022

alsuren commented Nov 21, 2022 •

edited

Loading

alsuren Nov 22, 2022

alsuren commented Nov 22, 2022

		// FIXME: we're creating this here, but build_ctx() is also creating one.
		// Can we refactor things so that they are shared?

WIP: try making cargo tree's Graph from a UnitGraph #11379

WIP: try making cargo tree's Graph from a UnitGraph #11379

Conversation

alsuren commented Nov 15, 2022 • edited Loading

What does this PR try to resolve?

How should we test and review this PR?

Additional information

rustbot commented Nov 15, 2022

alsuren commented Nov 15, 2022 • edited Loading

alsuren commented Nov 20, 2022

weihanglo commented Nov 21, 2022

alsuren commented Nov 21, 2022 • edited Loading

alsuren Nov 22, 2022

Choose a reason for hiding this comment

alsuren commented Nov 22, 2022

alsuren commented Nov 15, 2022 •

edited

Loading

alsuren commented Nov 15, 2022 •

edited

Loading

alsuren commented Nov 21, 2022 •

edited

Loading