Is there a best combination of GCC options for TX2?
Usually, with correct machine dependent options, it is possible to get better performance on CPU. For example, these are usually specified for building code against TK1
armhf / c++11 support / neon-vfpv4 / cortex-a15
On the other hand, what should be given to gcc for optimization on TX2? Especially, for Denver cores, what is the best combination?
[*]Use latest GCC toolchain 7.2
[*]Use CLANG llvm front end an alternative to GCC
[*]-march=armv8.a+crypto+simd, this enables SIMD, crypto and floating point instruction set and may help.
For Denver, we are checking internally. Will update information with you later.
1. For AArch64 mode: -O3 -ffast-math -flto -march=armv8-a+crypto -mcpu=cortex-a57+crypto 2. Other options like -funroll-loops and -fvect-cost-model=unlimited may have a benefit, but it’ll be application dependent 3. Some of the AArch32 options such as -mfloat-abi=hard might not be applicable, depending on the system libraries available.
To my knowledge there isn’t a way to specify which core an application is running on (except for disabling the Denver cores which isn’t desirable). So assuming all six cores are running, do the suggestions in comment #2:
Dear ,for Denver, you are checking internally. Do you have information or advice to update your answer(I mean the optimal compile options)?
Tks and good luck!