[MC] Explicitly use memcpy in emitBytes() (NFC) by nikic · Pull Request #177187 · llvm/llvm-project

nikic · 2026-01-21T15:56:58Z

We've observed a compile-time regression in LLVM 22 when including large blobs. The root cause was that emitBytes() was copying bytes one-by-one, which is much slower than using memcpy for large objects.

Optimization of std::copy to memmove is apparently much less reliable than one might think. In particular, when using a non-bleeding-edge libstdc++ (anything older than version 15), this does not happen if the types of the input and output iterators do not match (like here, where there is a signed/unsigned mismatch).

As this code is performance sensitive, I think it makes sense to directly use memcpy.
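To illustrate the mismatch described above, here is a minimal sketch that copies `char` input into a `uint8_t` buffer in two ways. The helper names `copyWithStdCopy` and `copyWithMemcpy` are hypothetical and not part of LLVM, and the `uint8_t` destination type is an assumption based on the "signed/unsigned mismatch" remark. Both helpers produce the same bytes; the difference is only in whether the standard library is relied on to turn the loop into a bulk copy.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: element-wise copy through std::copy. The source
// elements are `char` and the destination elements are `uint8_t`, so the
// iterator value types differ. Per the PR description, libstdc++ versions
// older than 15 may not lower this case to memmove.
inline void copyWithStdCopy(const char *Src, std::size_t Size,
                            std::uint8_t *Dst) {
  assert(Src && Dst && "this sketch expects valid pointers");
  std::copy(Src, Src + Size, Dst);
}

// Hypothetical helper: the same copy, but requesting a bulk byte copy
// explicitly instead of depending on a library or compiler optimization.
inline void copyWithMemcpy(const char *Src, std::size_t Size,
                           std::uint8_t *Dst) {
  assert(Src && Dst && "this sketch expects valid pointers");
  std::memcpy(Dst, Src, Size);
}
```

Comparing the generated code for the two helpers with different standard library versions is one way to check whether the `std::copy` variant is actually being lowered to a `memmove` call.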

Previously this code used SmallVector::append, which explicitly uses memcpy here:

llvm-project/llvm/include/llvm/ADT/SmallVector.h

Line 519 in 82d7a52

std::memcpy(reinterpret_cast<void *>(Dest), I, (E - I) * sizeof(T));
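For context, a container can select `memcpy` at compile time for element types where a byte copy is valid. The following is a simplified sketch of that idea, not the actual SmallVector implementation; the function name `uninitializedCopySketch` is hypothetical, and it assumes C++17 for `if constexpr`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <memory>
#include <type_traits>

// Hypothetical sketch: copy [I, E) into uninitialized storage at Dest.
// Trivially copyable element types take the memcpy path; all other types
// are copy-constructed element by element.
template <typename T>
void uninitializedCopySketch(const T *I, const T *E, T *Dest) {
  assert(I <= E && "invalid source range");
  if constexpr (std::is_trivially_copyable_v<T>) {
    // Skip the call for an empty range so memcpy never sees a null pointer.
    if (I != E)
      std::memcpy(reinterpret_cast<void *>(Dest), I,
                  static_cast<std::size_t>(E - I) * sizeof(T));
  } else {
    std::uninitialized_copy(I, E, Dest);
  }
}
```

Because the choice is made on the element type rather than on iterator types, a path like this does not depend on the standard library recognizing the copy as a `memmove` candidate.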

llvmbot · 2026-01-21T15:57:38Z

@llvm/pr-subscribers-llvm-mc

Author: Nikita Popov (nikic)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/177187.diff

1 Files Affected:

(modified) llvm/lib/MC/MCObjectStreamer.cpp (+3-1)

diff --git a/llvm/lib/MC/MCObjectStreamer.cpp b/llvm/lib/MC/MCObjectStreamer.cpp
index 261e9a37ecb55..d44e14a35cac8 100644
--- a/llvm/lib/MC/MCObjectStreamer.cpp
+++ b/llvm/lib/MC/MCObjectStreamer.cpp
@@ -109,7 +109,9 @@ void MCObjectStreamer::addSpecialFragment(MCFragment *Frag) {
 void MCObjectStreamer::appendContents(ArrayRef<char> Contents) {
   ensureHeadroom(Contents.size());
   assert(FragSpace >= Contents.size());
-  llvm::copy(Contents, getCurFragEnd());
+  // As this is performance-sensitive code, explicitly use std::memcpy.
+  // Optimization of std::copy to memmove is unreliable.
+  std::memcpy(getCurFragEnd(), Contents.begin(), Contents.size());
   CurFrag->FixedSize += Contents.size();
   FragSpace -= Contents.size();
 }

nikic · 2026-01-22T08:25:39Z

/cherry-pick 15e421d

llvmbot · 2026-01-22T08:32:42Z

/pull-request #177320

pcc · 2026-01-23T20:48:47Z

Looks like this caused a ubsan failure: https://lab.llvm.org/buildbot/#/builders/85/builds/17944 . Can you please take a look?

(Specifically the tools/llvm-dwarfutil/ELF/X86/dwarf4-macro-vendor-specific.test failure; the LLD failures are unrelated and were fixed by #177562.)

nikic · 2026-01-23T21:13:14Z

I've pushed a speculative fix at d064f39.
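The thread does not say what UBSan reported or what d064f39 changed, so the following is only a general illustration, not a description of that fix. One well-known hazard when replacing `std::copy` with `memcpy` is that passing a null pointer to `memcpy` is undefined behavior even when the size is zero, and an empty range often has a null data pointer. A minimal sketch of a guarded append, using a hypothetical `ToyBuffer` type that is not part of LLVM:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical stand-in for a growable byte buffer; not an LLVM type.
struct ToyBuffer {
  std::vector<std::uint8_t> Storage;
  std::size_t Used = 0;

  void append(const char *Data, std::size_t Size) {
    // Return early for empty input: Data may be null in that case, and
    // memcpy must not be called with a null pointer even for zero bytes.
    if (Size == 0)
      return;
    assert(Data != nullptr && "non-empty input must have a valid pointer");
    // Grow the storage if the remaining space is too small.
    if (Storage.size() - Used < Size)
      Storage.resize(Used + Size);
    std::memcpy(Storage.data() + Used, Data, Size);
    Used += Size;
  }
};
```

With the early return, calls such as `Buf.append(nullptr, 0)` are harmless no-ops, while non-empty input still takes the bulk-copy path.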

nikic · 2026-01-26T08:22:41Z

/cherry-pick d064f39

llvmbot · 2026-01-26T08:29:58Z

/pull-request #177907

Update to LLVM 22

Scheduled release date: Feb 24
1.94 becomes stable: Mar 5

Changes:
* Update to rc2, with one patch to work around our outdated illumos sysroot (rust-lang/llvm-project@41256ab).
* Update the host toolchain as well, otherwise we lose cross-language LTO, in particular for jemalloc.
* Adjust one loongarch assembly test. The split into r and s variants is based on the suggestion in rust-lang/rust#151134.

Depends on:
* [x] rust-lang/rust#151410
* [ ] rust-lang/rust#150756
* [x] llvm/llvm-project#175190
* [x] llvm/llvm-project#175912
* [x] llvm/llvm-project#175965
* [x] llvm/llvm-project#176195
* [x] llvm/llvm-project#157073
* [x] llvm/llvm-project#176421
* [x] llvm/llvm-project#176925
* [x] llvm/llvm-project#177187

nikic requested review from MaskRay and aengelke January 21, 2026 15:56

llvmbot added the llvm:mc Machine (object) code label Jan 21, 2026

aengelke approved these changes Jan 21, 2026

MaskRay approved these changes Jan 21, 2026

nikic merged commit 15e421d into llvm:main Jan 22, 2026
13 checks passed

nikic deleted the objectstreamer-memcpy branch January 22, 2026 08:24

nikic added this to the LLVM 22.x Release milestone Jan 22, 2026

github-project-automation bot added this to LLVM Release Status Jan 22, 2026

github-project-automation bot moved this to Done in LLVM Release Status Jan 22, 2026

nikic mentioned this pull request Jan 23, 2026

Update to LLVM 22 rust-lang/rust#150722

Merged

10 tasks

JDevlieghere added a commit to JDevlieghere/llvm-project that referenced this pull request Mar 26, 2026

Revert "[MC] Explicitly use memcpy in emitBytes() (NFC) (llvm#177187)"

752ddb6

This reverts commit 15e421d.

JDevlieghere mentioned this pull request Mar 26, 2026

Revert "MCFragment: Use trailing data for fixed-size part" #188779

Closed
