Создает ли этот простой код, не содержащий никаких циклов, цикл на ассемблере? - Цифровое Кемерово

Создает ли этот простой код, не содержащий никаких циклов, цикл на ассемблере? ⇐ C++

Ответить

1 сообщение • Страница 1 из 1

Гость

Создает ли этот простой код, не содержащий никаких циклов, цикл на ассемблере?

Цитата

Сообщение Гость » 04 мар 2024, 18:30

I was playing with this code to check if an integer is a power of 4:

// C++ version bool is_pow_4(unsigned a) { return (std::popcount(a) == 1) && (std::countr_zero(a) % 2 == 0); } // C version int is_pow_4(unsigned a) { return (__builtin_popcount(a) == 1) && (__builtin_ctz(a) % 2 == 0); } Basically I check that there is just one bit set and it is on an odd position.

I was expecting a branchless code, however I see two jumps and a rep instruction. From what I recall a rep is basically a loop, but on assembly level. It seems that std::countr_zero/__builtin_ctz generates the rep instruction.

C++ output:

is_pow_4(unsigned int): lea edx, [rdi-1] mov ecx, edi xor eax, eax xor ecx, edx cmp edx, ecx jb .L7 .L1: ret .L7: mov eax, 1 test edi, edi je .L1 xor eax, eax rep bsf eax, edi not eax and eax, 1 ret C output is similar.

I understand that the loop is bound by the width of the integer (32), so I think the complexity of the code is O(1), but I was still surprised to find a loop.

Is my understanding correct? Is this code a loop on x86? Is this because while there is a x86 popcount instruction, there is no count leading/trailing zeros x86 instruction?

Источник: https://stackoverflow.com/questions/781 ... n-assembly

1709566245

Гость


I was playing with this code to check if an integer is a power of 4:
 
// C++ version bool is_pow_4(unsigned a) {     return (std::popcount(a) == 1) && (std::countr_zero(a) % 2 == 0); }  // C version int is_pow_4(unsigned a) {    return (__builtin_popcount(a) == 1) && (__builtin_ctz(a) % 2 == 0); }  Basically I check that there is just one bit set and it is on an odd position.
 
I was expecting a branchless code, however I see two jumps and a rep instruction. From what I recall a rep is basically a loop, but on assembly level. It seems that std::countr_zero/__builtin_ctz generates the rep instruction.
 
C++ output:
 
is_pow_4(unsigned int):         lea     edx, [rdi-1]         mov     ecx, edi         xor     eax, eax         xor     ecx, edx         cmp     edx, ecx         jb      .L7 .L1:         ret .L7:         mov     eax, 1         test    edi, edi         je      .L1         xor     eax, eax         rep bsf eax, edi         not     eax         and     eax, 1         ret  C output is similar.
 
I understand that the loop is bound by the width of the integer (32), so I think the complexity of the code is O(1), but I was still surprised to find a loop.
 
Is my understanding correct? Is this code a loop on x86? Is this because while there is a x86 popcount instruction, there is no count leading/trailing zeros x86 instruction?
 

Источник: [url]https://stackoverflow.com/questions/78102412/does-this-simple-code-not-containing-any-loop-generate-a-loop-in-assembly[/url]

Ответить

1 сообщение • Страница 1 из 1

Быстрый ответ

Заголовок:

Имя пользователя:

Изменение регистра текста:

Смайлики

Ещё смайлики…

К этому ответу прикреплено по крайней мере одно вложение.

Если вы не хотите добавлять вложения, оставьте поля пустыми. Можно прикреплять файлы, перетаскивая их в окно сообщения.

Максимально разрешённый размер вложения: 15 МБ.

Имя файла:

Комментарий к файлу:

Имя файла	Комментарий к файлу	Размер	Статус

Вернуться в «C++»

Programmiererforum