@valassi
Owner

@valassi valassi commented Nov 14, 2025

No description provided.

Qubitol and others added 27 commits November 11, 2025 19:13
…ests on aarch64 (with DanieleM)

This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064)
(testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson)

It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
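The commit message does not spell out the workaround itself. As a generic illustration only of how a Newton-Raphson square root can be guaranteed to terminate, the sketch below adds an explicit iteration cap, so the recursion stops even if the iterate oscillates between two neighbouring values and never satisfies `curr == prev`. The names `sqrtNewtonRaphsonCapped` and `constexprSqrtSketch`, and the cap of 100 steps, are assumptions made for this example; they are not taken from the repository and may differ from what the PR actually does.

```cpp
// Hypothetical helper (not the repository's code): Newton-Raphson iteration
// for sqrt(x), with an explicit cap on the number of remaining steps.
constexpr long double sqrtNewtonRaphsonCapped( long double x, long double curr, long double prev, int stepsLeft )
{
  // Stop on exact convergence, or when the cap is exhausted; the cap is what
  // guarantees termination if curr and prev keep alternating forever.
  return ( curr == prev || stepsLeft <= 0 )
    ? curr
    : sqrtNewtonRaphsonCapped( x, 0.5L * ( curr + x / curr ), curr, stepsLeft - 1 );
}

// Hypothetical wrapper: returns 0 for non-positive input (assumed convention
// for this sketch), otherwise starts the iteration from the initial guess x.
constexpr long double constexprSqrtSketch( long double x )
{
  return x > 0 ? sqrtNewtonRaphsonCapped( x, x, 0, 100 ) : 0;
}

// Compile-time sanity check: the square of the result is close to the input.
static_assert( constexprSqrtSketch( 2.0L ) * constexprSqrtSketch( 2.0L ) > 1.999999L, "sqrt(2) too small" );
static_assert( constexprSqrtSketch( 2.0L ) * constexprSqrtSketch( 2.0L ) < 2.000001L, "sqrt(2) too large" );
```

A cap trades a possible loss of the last bit of precision for guaranteed termination, which is usually acceptable in a test helper.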
…ts on aarch64 (with DanieleM)

This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064)
(testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson)

It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…(with DanieleM)

Remove the custom __ARM_NEON__ with two extra underscores
Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check

Results on lxplus-arm:

for avx in none sse4; do ./build.${avx}_m_inl0_hrd0/check_cpp.exe -p 1024 256 1 \
  | \egrep '(EvtsPerSec\[MECalcOnly\]|MeanMatrixElemValue|fptype_sv)'; done

Internal loops fptype_sv    = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[MECalcOnly] (3a) = ( 2.425635e+05                 )  sec^-1
MeanMatrixElemValue         = ( 2.080788e+00 +- 6.803789e-03 )  GeV^0

Internal loops fptype_sv    = VECTOR[2] ('sse4': ARM NEON, 128bit) [cxtype_ref=YES]
EvtsPerSec[MECalcOnly] (3a) = ( 3.261666e+05                 )  sec^-1
MeanMatrixElemValue         = ( 2.080788e+00 +- 6.803789e-03 )  GeV^0
…ith DanieleM)

Remove the custom __ARM_NEON__ with two extra underscores
Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
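As a small illustration of relying on the compiler's built-in macro rather than a custom one, the sketch below branches on `__ARM_NEON`, which the compiler predefines when NEON is available (the `g++ ... -E -dM` command above lists such predefined macros). The function name `neonDoublesPerVector` is hypothetical and not part of the repository; the value 2 simply reflects two 64-bit doubles in a 128-bit NEON register, matching the `VECTOR[2]` label in the benchmark output.

```cpp
// Hypothetical helper (not the repository's code): number of doubles per
// SIMD vector that a NEON build could use, or 1 when NEON is not available.
constexpr int neonDoublesPerVector()
{
#ifdef __ARM_NEON
  return 2; // 128-bit NEON register holds two 64-bit doubles
#else
  return 1; // no NEON predefined by the compiler: scalar fallback
#endif
}

// The result is always one of the two values above, on any platform.
static_assert( neonDoublesPerVector() == 1 || neonDoublesPerVector() == 2, "unexpected vector width" );
```

On a non-ARM host this compiles to the scalar branch, so the same source builds everywhere without defining any macro by hand.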
…ple (with DanieleM)

Results on an Apple M1 (thanks Olivier!)

for avx in none sse4; do ./build.${avx}_m_inl0_hrd0/check_cpp.exe -p 1024 256 1 \
  | \egrep '(EvtsPerSec\[MECalcOnly\]|MeanMatrixElemValue|fptype_sv)'; done

Internal loops fptype_sv    = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[MECalcOnly] (3a) = ( 5.535410e+05                 )  sec^-1
MeanMatrixElemValue         = ( 2.080788e+00 +- 6.803789e-03 )  GeV^0

Internal loops fptype_sv    = VECTOR[2] ('sse4': ARM NEON, 128bit) [cxtype_ref=NO]
EvtsPerSec[MECalcOnly] (3a) = ( 8.327442e+05                 )  sec^-1
MeanMatrixElemValue         = ( 2.080788e+00 +- 6.803789e-03 )  GeV^0
…ts on aarch64 (with DanieleM)

This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064)
(testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson)

It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…ith DanieleM)

Remove the custom __ARM_NEON__ with two extra underscores
Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
…ODEGEN commits

Revert "[fix-arm-support] gg_tt.mad: undefine__ARM_NEON for cppnone on arm/apple (with DanieleM)"
This reverts commit d427fcf.

Revert "[fix-arm-support] gg_tt.mad: use builtin __ARM_NEON for aarch64 simd (with DanieleM)"
This reverts commit 82645d2.

Revert "[fix-arm-support] gg_tt.mad: use higher tolerance for constexpr_tan tests on aarch64 (with DanieleM)"
This reverts commit 625bca8.

Revert "Regenerate processes"
This reverts commit 1e1c899.
…using constexpr_sqrt on aarch64 (with DanieleM)
…l tests using constexpr_sqrt on aarch64 (with Danie>