Other proc macros can break the soundness of our custom derives #388

joshlf · 2023-09-17T17:33:44Z

This issue tracks soundness holes in our custom derives introduced by other proc macros. Tasks:

Misc Notes

It might be the case that attribute evaluation order has to be guaranteed (whether that was intended in the past or not) because it's observable. In particular, the widely-used technique of a custom derive defining its own attributes (e.g., #[serde(...)]) seems to depend on attribute evaluation order.


djkoloski · 2023-09-27T17:36:17Z

I verified that attribute macros can change the definition of an item after derives on it have already run:

use transform::insert_field;

trait A {}
trait B {}

#[derive(Debug)]
#[insert_field(baz = "u32")]
struct Foo {
    bar: i32,
}

fn main() {
    println!("{:?}", Foo {
        bar: 10,
        baz: 100,
    });
}

prints:

Foo { bar: 10 }

This demonstrates that even though Foo had an extra baz field inserted by the attribute proc macro, the item is passed to the Debug derive before that modification takes place. Expanding the source with cargo expand shows the same thing. For completeness, the source of the insert_field proc macro is:

use proc_macro2::{Ident, Span};
use quote::quote;
use syn::{parse_macro_input, DeriveInput, Meta, Lit, Expr, Visibility, token::Colon, Data, Fields, Field, parse_quote};

#[proc_macro_attribute]
pub fn insert_field(
    attr: proc_macro::TokenStream,
    item: proc_macro::TokenStream,
) -> proc_macro::TokenStream {
    let meta = parse_macro_input!(attr as Meta);
    let Meta::NameValue(name_value) = meta else { panic!("expected name-value") };
    let name = name_value.path.get_ident().expect("name to be an identifier");
    let Expr::Lit(lit) = &name_value.value else { panic!("expected literal") };
    let Lit::Str(ty_name) = &lit.lit else { panic!("expected string") };
    let ident = Ident::new(&ty_name.value(), Span::call_site());

    let mut item = parse_macro_input!(item as DeriveInput);
    let Data::Struct(struct_) = &mut item.data else { panic!("expected struct") };
    let Fields::Named(fields) = &mut struct_.fields else { panic!("expected fields") };
    fields.named.push(Field {
        attrs: Vec::new(),
        vis: Visibility::Inherited,
        mutability: syn::FieldMutability::None,
        ident: Some(name.clone()),
        colon_token: Some(Colon { spans: [Span::call_site()]}),
        ty: parse_quote!(#ident),
    });

    let output = quote! {
        #item
    };

    output.into()
}

jswrenn · 2023-09-27T17:43:17Z

The status quo is that proc macros evaluate from the outside in. We should confirm this is specified, and do what we can to mitigate it.

We could defend against @djkoloski's example by also emitting code that destructures the annotated type, thus ensuring that there would be a compile error if the definition changed.
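A minimal sketch of that destructuring guard, written by hand for a stand-in Foo. The function name assert_foo_fields_unchanged is hypothetical, and this is not the code our derives actually emit; it only illustrates the idea that an exhaustive pattern (one with no `..`) stops compiling if a field is later added or removed.

```rust
// Stand-in for the type a user annotates with the derive.
struct Foo {
    bar: i32,
}

// Hypothetical guard that a derive could emit next to its trait impl.
// The pattern names every field the derive saw and contains no `..`,
// so if another macro later adds or removes a field, this function
// no longer compiles.
#[allow(dead_code)]
fn assert_foo_fields_unchanged(value: Foo) -> i32 {
    let Foo { bar } = value;
    bar
}
```

If insert_field added baz after the derive ran, the pattern above would be rejected because it does not mention baz. The guard only checks field names; it says nothing about attributes on the type.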

However, imagine a proc-macro attribute that only removed (or tampered with) #[repr(C)] on annotated definitions, but left the fields unchanged. For this, the only mitigation I can see is forbidding the presence of unknown attributes.
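A rough sketch of what "forbidding unknown attributes" could look like. It is simplified to plain strings so that it stands alone; a real derive would read the names from the parsed item instead. ALLOWED_ATTRS and unknown_attrs are hypothetical names, and the allowlist contents are an assumption for illustration only.

```rust
/// Attribute names the hypothetical derive is willing to see on an item.
/// This allowlist is an assumption for illustration only.
const ALLOWED_ATTRS: &[&str] = &["repr", "doc", "allow"];

/// Returns the attribute names that are not on the allowlist, in input order.
/// A derive using this approach would report a compile error whenever the
/// returned list is non-empty.
fn unknown_attrs<'a>(attr_names: &[&'a str]) -> Vec<&'a str> {
    attr_names
        .iter()
        .copied()
        .filter(|name| !ALLOWED_ATTRS.contains(name))
        .collect()
}
```

With the example from earlier in the thread, unknown_attrs(&["repr", "insert_field"]) would flag insert_field, and the derive could refuse to expand.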

joshlf · 2023-09-27T17:46:20Z

IIUC, guaranteeing evaluation order should be enough to mitigate the "unknown attribute" problem: We just ensure that we're placed in a location that evaluates after any attribute macros.

That still leaves open the question of shadowing attributes by name - e.g., introducing a proc macro attribute called repr that our custom derives mistakenly think is the built-in repr attribute.

Another thought: Does the token stream emitted by a proc macro attribute include the proc macro attribute annotation? If not, we should expect that any proc macro attributes which execute before us will no longer be present in the token stream that we see. This should mean that a proc macro attribute which shadows repr would be removed by the time it gets to us, and we'd only see "real" repr attributes.

jswrenn · 2023-09-27T17:50:50Z

I've confirmed that it's not possible to shadow repr attributes. Doing so produces an error:

`repr` is ambiguous
ambiguous because of a name conflict with a builtin attribute
use `crate::repr` to refer to this attribute macro unambiguously

joshlf · 2023-09-27T17:51:26Z

Phew

joshlf · 2024-01-31T16:35:13Z

cc @reinerp

joshlf added bug Something isn't working compatibility-breaking Changes that are (likely to be) breaking labels Sep 17, 2023

joshlf mentioned this issue Sep 20, 2023

Tracking issue for proving soundness, preventing regressions, and documenting security ethos #61

Open

joshlf changed the title ~~Could attributes cause unsoundness in our derives?~~ Other proc macros can break the soundness of our custom derives Sep 27, 2023
