[mono] Adding support for Vector128::ExtractMostSignificantBits intrinsics on amd64 #89997

matouskozak · 2023-08-04T07:06:19Z

Adding intrinsics support for Vector128::ExtractMostSignificantBits method on amd64 (miniJIT and llvm AOT).

Implementation

Extracting the most significant bits (MSBs) from Vector128 on amd64 is based on the use of _mm_movemask_epi8/ps/pd (SSE/SSE2).

Support for two new opcodes for vectors added:

sse_movmsk: Create mask from the most significant bit of each 8/32/64-bit element (_mm_movemask_epi8/ps/pd).
ssse3_shuffle: Shuffle 8-bit elements of vector according to shuffle control mask (_mm_shuffle_epi8).

`Short`/`UShort` element types

Since the _mm_movemask_epi8/ps/pd doesn't support Short/UShort element types, we first perform _mm_shuffle_epi8 (SSSE3) to shuffle odd bytes (most significant bytes of each Short/UShort) to the lower half of vector while zeroing out the upper half. Next, we use _mm_movemask_epi8 to extract MSBs from shuffled vector.

Other primitive element types

Based on the size of element type, the corresponding version of _mm_movemask_epi8/ps/pd is used to extract MSBs.

Future work

Emitting intrinsics for Vector128 of floating-point types is currently not supported in Mono. This PR adds the support for emitting it on amd64 platform but additional work must be done for arm64 and possibly for WASM before enabling it for Mono.

Contributes to #76025.

ivanpovazan

LGTM!

matouskozak · 2023-08-04T14:51:40Z

/azp run runtime-extra-platforms

azure-pipelines · 2023-08-04T14:52:05Z

Azure Pipelines successfully started running 1 pipeline(s).

matouskozak · 2023-08-07T07:18:05Z

maccatalyst-x64 Release AllSubsets_Mono build error (https://dev.azure.com/dnceng-public/public/_build/results?buildId=363584&view=logs&j=a2a39d6b-70c5-5040-933a-4b8b6e52bf19&t=7de46cbc-684f-5a34-7a8f-d5a9c81d3460) seems releated. I'll investigate further.

jandupej

LGTM. I cannot immediately see why LLVM would fail on the i16 case.

matouskozak · 2023-08-08T13:20:24Z

/azp run runtime-extra-platforms

azure-pipelines · 2023-08-08T13:20:48Z

Azure Pipelines successfully started running 1 pipeline(s).

matouskozak · 2023-08-09T08:18:18Z

LGTM

@jandupej looks like it was caused by missing SSSE3 check which is weird considering SSSE3 is more than 15 years old.

matouskozak · 2023-08-10T08:50:27Z

The failing CI lines are tracked on main and unrelated to this PR.

jandupej · 2023-08-14T09:38:00Z

LGTM

@jandupej looks like it was caused by missing SSSE3 check which is weird considering SSSE3 is more than 15 years old.

If memory serves, we require the CPU to support at least SSE4.1, so this should not be an issue. Still, it is a good practice to check for ISA extension support before using it.

Extract MSB amd64

300b981

ghost assigned matouskozak Aug 4, 2023

dotnet-issue-labeler bot added the area-Codegen-JIT-mono label Aug 4, 2023

vargaz approved these changes Aug 4, 2023

View reviewed changes

matouskozak marked this pull request as ready for review August 4, 2023 13:35

matouskozak requested review from lambdageek and SamMonoRT as code owners August 4, 2023 13:35

ivanpovazan requested a review from jandupej August 4, 2023 13:52

ivanpovazan approved these changes Aug 4, 2023

View reviewed changes

jandupej approved these changes Aug 7, 2023

View reviewed changes

matouskozak added the NO-MERGE The PR is not ready for merge yet (see discussion for detailed reasons) label Aug 7, 2023

add SSSE3 check

735a95a

matouskozak removed the NO-MERGE The PR is not ready for merge yet (see discussion for detailed reasons) label Aug 8, 2023

matouskozak requested a review from vargaz August 9, 2023 08:22

matouskozak merged commit f465d33 into dotnet:main Aug 10, 2023
134 of 156 checks passed

matouskozak mentioned this pull request Aug 10, 2023

[mono] Enable floats in Vector128::ExtractMostSignificantBits on arm64 and amd64 #90304

Merged

ghost locked as resolved and limited conversation to collaborators Sep 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[mono] Adding support for Vector128::ExtractMostSignificantBits intrinsics on amd64 #89997

[mono] Adding support for Vector128::ExtractMostSignificantBits intrinsics on amd64 #89997

matouskozak commented Aug 4, 2023

ivanpovazan left a comment

matouskozak commented Aug 4, 2023

azure-pipelines bot commented Aug 4, 2023

matouskozak commented Aug 7, 2023

jandupej left a comment

matouskozak commented Aug 8, 2023

azure-pipelines bot commented Aug 8, 2023

matouskozak commented Aug 9, 2023

matouskozak commented Aug 10, 2023

jandupej commented Aug 14, 2023

[mono] Adding support for Vector128::ExtractMostSignificantBits intrinsics on amd64 #89997

[mono] Adding support for Vector128::ExtractMostSignificantBits intrinsics on amd64 #89997

Conversation

matouskozak commented Aug 4, 2023

Implementation

Short/UShort element types

Other primitive element types

Future work

ivanpovazan left a comment

Choose a reason for hiding this comment

matouskozak commented Aug 4, 2023

azure-pipelines bot commented Aug 4, 2023

matouskozak commented Aug 7, 2023

jandupej left a comment

Choose a reason for hiding this comment

matouskozak commented Aug 8, 2023

azure-pipelines bot commented Aug 8, 2023

matouskozak commented Aug 9, 2023

matouskozak commented Aug 10, 2023

jandupej commented Aug 14, 2023

`Short`/`UShort` element types