Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

strstr() a lot slower with ASAN (Linux glibc x86-64) #1832

Open
fuhsnn opened this issue Dec 24, 2024 · 3 comments
Open

strstr() a lot slower with ASAN (Linux glibc x86-64) #1832

fuhsnn opened this issue Dec 24, 2024 · 3 comments

Comments

@fuhsnn
Copy link

fuhsnn commented Dec 24, 2024

After enabling ASAN in the compiler project I work on, I noticed a strstr() call in the tokenization loop is responsible for significant slowdown in ASAN builds.

I believe this is worth reporting since it's a simple change that caused a huge diff, which should make an interesting case for profiling.

In the worst case, ASAN builds are about 170x slower on Skylake, 150x on Zen4, after replacing the strstr() with a loop it's back to normal 2.8x / 2.5x.
The change is fuhsnn/slimcc@eb66a46.

A reproduce script testing ASAN on-off and strstr() with-without, should work on any recent-ish glibc docker:

set -e

git clone https://github.com/fuhsnn/c_files
git clone https://github.com/fuhsnn/slimcc

cd slimcc
git checkout eb66a4619f168af7ce5c34ecc8cc37c82988d51d

cc *.c -o slimcc
echo "Asan 0, strstr 0"
time ./slimcc ../c_files/gcc.c -E -o/dev/null

cc *.c -o slimcc -fsanitize=address
echo "Asan 1 strstr 0"
time ./slimcc ../c_files/gcc.c -E -o/dev/null

git checkout HEAD~1

cc *.c -o slimcc
echo "Asan 0, strstr 1"
time ./slimcc ../c_files/gcc.c -E -o/dev/null

cc *.c -o slimcc -fsanitize=address
echo "Asan 1, strstr 1"
time ./slimcc ../c_files/gcc.c -E -o/dev/null
@ramosian-glider
Copy link
Member

Does the behavior change if you run with and without ASAN_OPTIONS=intercept_strstr=0 ?

@fuhsnn
Copy link
Author

fuhsnn commented Jan 8, 2025

Does the behavior change if you run with and without ASAN_OPTIONS=intercept_strstr=0 ?

Yes, the slowdown is gone with intercept_strstr=0, thanks for the pointer!
So this issue is really about the intercept wrapper being slow for a simple short needle ("*/" in this case).

@ramosian-glider
Copy link
Member

In addition to calling the original strstr(haystack, needle), the interceptor does roughly the following:

  uptr len1 = internal_strlen(s1);
  uptr len2 = internal_strlen(s2);
  __asan_region_is_poisoned(s1, common_flags()->strict_string_checks ? (internal_strlen(s1)) + 1 : (r ? r - s1 + len2 : len1 + 1);)
  __asan_region_is_poisoned(s2, len2 + 1);

Maybe setting strict_string_checks=0 should speed the things up?

@vitalybuka I think there's an opportunity here to spare a call to internal_strlen(s1)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants