Commit Graph

373 Commits

Zalathar
bef8f646a6 Use LLVMDIBuilderCreateArrayType 2025-09-17 12:28:08 +10:00
Zalathar
2552deb9cd Use LLVMDIBuilderCreateUnionType 2025-09-17 12:28:08 +10:00
Zalathar
5419896111 Use LLVMDIBuilderCreateSubroutineType 2025-09-17 12:28:08 +10:00
bjorn3
f2933b34a8 Remove want_summary argument from prepare_thin
It is always false nowadays. ThinLTO summary writing is instead done by
llvm_optimize.
2025-09-06 18:37:23 +00:00
Daniel Paoliello
da8f230d5f Update to ar_archive_writer 0.5.1 2025-08-29 16:37:42 -07:00
bors
d36f964125 Auto merge of #145877 - nikic:capture-address, r=tmiasko
Use captures(address) instead of captures(none) for indirect args

While provenance cannot be captured through these arguments, the address / object identity can.

Fixes https://github.com/rust-lang/rust/issues/137668.

r? `@ghost`
2025-08-28 00:01:22 +00:00
Nikita Popov
c3ab409b4f Use captures(address) instead of captures(none) for indirect args
While provenance cannot be captured through these arguments, the
address / object identity can.
2025-08-26 16:16:23 +02:00
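A minimal Rust sketch (the type and names are invented for illustration) of why `captures(none)` was too strong: the callee receiving an indirectly passed argument can legitimately observe and store the argument slot's address, even though no provenance usable for later memory accesses escapes.

```rs
use std::sync::atomic::{AtomicUsize, Ordering};

static LAST_ADDR: AtomicUsize = AtomicUsize::new(0);

// Large enough that common ABIs pass it indirectly, via a hidden pointer.
pub struct Big([u64; 32]);

pub fn observe(b: Big) -> bool {
    // Capturing the address (object identity) of the argument slot is fine,
    // even though no pointer usable for later memory accesses escapes.
    // `captures(none)` would let LLVM assume even this cannot happen.
    let addr = &b as *const Big as usize;
    LAST_ADDR.swap(addr, Ordering::Relaxed) == addr
}
```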
Zalathar
fcff8f7f5a Assert that LLVM range-attribute values don't exceed 128 bits
The underlying implementation of `LLVMCreateConstantRangeAttribute` assumes
that each of `LowerWords` and `UpperWords` points to enough u64 values to
define an integer of the specified bit-length, and will encounter UB if that is
not the case.

Our safe wrapper function always passes pointers to `[u64; 2]` arrays,
regardless of the bit-length specified. That's fine in practice, because scalar
primitives never exceed 128 bits, but it is technically a soundness hole in a
safe function.

We can close the soundness hole by explicitly asserting `size_bits <= 128`.
This is effectively just a stricter version of the existing check that the
value must be small enough to fit in `c_uint`.
2025-08-26 13:07:19 +10:00
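A sketch of the wrapper shape described above, with hand-written stand-ins for rustc's actual `llvm` bindings (the opaque types and the wrapper signature are illustrative, not the real declarations):

```rs
use std::ffi::c_uint;

// Opaque stand-ins for rustc's real `llvm` handle types.
#[repr(C)]
pub struct Context {
    _private: [u8; 0],
}
#[repr(C)]
pub struct Attribute {
    _private: [u8; 0],
}

extern "C" {
    // Stand-in mirroring llvm-c's LLVMCreateConstantRangeAttribute;
    // LowerWords/UpperWords must each point at enough u64s to cover NumBits.
    fn LLVMCreateConstantRangeAttribute(
        cx: *const Context,
        kind_id: c_uint,
        num_bits: c_uint,
        lower_words: *const u64,
        upper_words: *const u64,
    ) -> *mut Attribute;
}

/// Safe-wrapper sketch: we always pass two words per bound, so any bit-width
/// above 128 must be rejected, or the callee would read past the arrays.
pub fn create_range_attr(
    cx: &Context,
    kind_id: c_uint,
    size_bits: u64,
    lower: u128,
    upper: u128,
) -> *mut Attribute {
    // Stricter version of the existing "fits in c_uint" check.
    assert!(size_bits <= 128, "range attribute value exceeds 128 bits");
    let lower_words = [lower as u64, (lower >> 64) as u64];
    let upper_words = [upper as u64, (upper >> 64) as u64];
    unsafe {
        LLVMCreateConstantRangeAttribute(
            cx,
            kind_id,
            size_bits as c_uint,
            lower_words.as_ptr(),
            upper_words.as_ptr(),
        )
    }
}
```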
Zalathar
b4e97e5d86 Rename llvm::Bool aliases to standard const case
This avoids the need for `#![allow(non_upper_case_globals)]`.
2025-08-24 23:09:54 +10:00
Zalathar
455a67bd4f Replace the llvm::Bool typedef with a proper newtype 2025-08-24 23:09:54 +10:00
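A minimal sketch of what such a newtype and its renamed constants can look like; rustc's actual definition may differ:

```rs
use std::ffi::c_int;

/// Sketch of a newtype over LLVM's C-level bool (really a C `int`).
#[repr(transparent)]
#[derive(Copy, Clone, PartialEq, Eq)]
pub struct Bool(c_int);

// Standard const case, so no `#![allow(non_upper_case_globals)]` is needed.
pub const TRUE: Bool = Bool(1);
pub const FALSE: Bool = Bool(0);

impl Bool {
    pub fn from_bool(b: bool) -> Bool {
        if b { TRUE } else { FALSE }
    }
    pub fn to_bool(self) -> bool {
        self != FALSE
    }
}
```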
Nikita Popov
d71ed8d19b Tell LLVM about read-only captures
`&Freeze` parameters are not only `readonly` within the function;
any captures of the pointer can also only be used for reads.
This can now be encoded using the `captures(address, read_provenance)`
attribute.
2025-08-20 19:08:16 +02:00
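An illustrative Rust pattern the attribute still has to permit (`STASH` and the function are invented): the pointer behind a `&u64` may escape, but everything later done with it can only be a read.

```rs
use std::cell::Cell;

thread_local! {
    // Invented escape hatch: the callee stashes the pointer for later reads.
    static STASH: Cell<*const u64> = const { Cell::new(std::ptr::null()) };
}

// `&u64` points to a `Freeze` type, so rustc can mark the argument `readonly`
// and, with LLVM 21, `captures(address, read_provenance)`: the pointer may
// escape through STASH, but nothing done with it can ever be a write.
pub fn read_and_stash(x: &u64) -> u64 {
    STASH.with(|s| s.set(x as *const u64));
    *x
}
```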
Stuart Cook
8748d8e7d5 Rollup merge of #145484 - Zalathar:archive-builder, r=bjorn3
Remove `LlvmArchiveBuilder` and supporting code/bindings

Switching over to the newer Rust-based `ArArchiveBuilder` happened in rust-lang/rust#128936, a year ago.

Per the comment in `new_archive_builder`, that seems like enough time to justify removing the older, unused `LlvmArchiveBuilder` implementation and its associated bindings.

Fixes rust-lang/rust#128955.
2025-08-19 14:18:25 +10:00
Stuart Cook
8945924d77 Rollup merge of #145432 - Zalathar:target-machine, r=wesleywiser
cg_llvm: Small cleanups to `owned_target_machine`

This PR contains a few tiny cleanups to the `owned_target_machine` code.

Each individual commit should be fairly straightforward.
2025-08-19 14:18:25 +10:00
Zalathar
cf8ec6798f Remove LlvmArchiveBuilder and supporting code/bindings 2025-08-16 16:38:12 +10:00
Zalathar
44f5ec7d56 Avoid an explicit cast from *const c_uchar to *const c_char
As noted in the `ffi` module docs, passing pointer/length byte strings from
Rust to C++ is easier if we declare them as `*const c_uchar` on the Rust side,
but `const char *` (possibly signed) on the C++ side. This is allowed because
both pointer types are ABI-compatible, regardless of char signedness.
2025-08-15 20:24:13 +10:00
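A sketch of that convention (the binding name is hypothetical): declaring the Rust side as `*const c_uchar` lets `&str`/`&[u8]` data flow through without a cast, while the C++ prototype keeps `const char *`. The same convention gets a searchable PTR_LEN_STR tag in a commit further down.

```rs
use std::ffi::c_uchar;

extern "C" {
    // Hypothetical binding; the C++ prototype would be
    // `void LLVMRustDemo(const char *Ptr, size_t Len);`.
    // `*const c_uchar` and `const char *` are ABI-compatible pointer types,
    // whatever the signedness of `char` on the target.
    fn LLVMRustDemo(ptr: *const c_uchar, len: usize);
}

pub fn demo(s: &str) {
    // `as_ptr()` already yields `*const u8`, so no cast is needed.
    unsafe { LLVMRustDemo(s.as_ptr(), s.len()) }
}
```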
Zalathar
e193b5342b Use LLVMGetTypeKind 2025-08-15 19:35:35 +10:00
Zalathar
c64c6d85e1 Use LLVMSetTailCallKind 2025-08-15 13:57:37 +10:00
Nikita Popov
ebef9d7f63 Set dead_on_return attribute for indirect arguments
Set the dead_on_return attribute (added in LLVM 21) for arguments
that are passed indirectly, but not byval.

This indicates that the value of the argument on return does not
matter, enabling additional dead store elimination.
2025-08-11 12:39:23 +02:00
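An illustrative function (name and size invented) showing the kind of store this lets LLVM delete:

```rs
// Invented type, large enough that Rust's ABI passes it indirectly
// (via a hidden pointer), and not byval.
pub struct Big([u64; 64]);

pub fn consume(mut b: Big) -> u64 {
    let sum: u64 = b.0.iter().sum();
    // This store goes to the caller-allocated argument slot. With
    // `dead_on_return` on the hidden pointer, LLVM knows the slot's value
    // no longer matters once `consume` returns and can drop the store.
    b.0[0] = 0;
    sum
}
```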
Zalathar
81ed042c8c coverage: Remove all unstable support for MC/DC instrumentation 2025-08-06 22:38:52 +10:00
Stuart Cook
8628b78f24 Rollup merge of #144232 - xacrimon:explicit-tail-call, r=WaffleLapkin
Implement support for `become` and explicit tail call codegen for the LLVM backend

This PR implements codegen of explicit tail calls via `become` in `rustc_codegen_ssa`, with support in the LLVM backend, completing a task on (https://github.com/rust-lang/rust/issues/112788). It implements all the necessary bits to make explicit tail calls usable; other backends have received stubs for now and will ICE if you use `become` on them. I suspect there is some bikeshedding to be done on how we should go about implementing this for other backends, but it should be relatively straightforward for GCC after this is merged.

During development I also put together a POC bytecode VM based on tail call dispatch to test these changes out and analyze the codegen to make sure it generates expected assembly. That is available [here](https://github.com/xacrimon/tcvm).
2025-07-31 15:42:00 +10:00
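A minimal sketch of `become` on a nightly toolchain; the feature gate name matches the tracking issue, but details may shift while the feature is unstable:

```rs
#![feature(explicit_tail_calls)]
#![allow(incomplete_features)]

// Mutual recursion that could overflow the stack with plain calls;
// `become` guarantees each call reuses the current stack frame.
fn even(n: u64) -> bool {
    if n == 0 { true } else { become odd(n - 1) }
}

fn odd(n: u64) -> bool {
    if n == 0 { false } else { become even(n - 1) }
}

fn main() {
    assert!(even(10_000_000));
}
```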
Joel Wejdenstål
a448837045 Implement support for explicit tail calls in the MIR block builders and the LLVM codegen backend. 2025-07-26 01:02:29 +02:00
bjorn3
fe2eeabe27 Use the object crate rather than LLVM for extracting bitcode sections 2025-07-25 11:21:28 +00:00
许杰友 Jieyou Xu (Joe)
5e3eb25125 Rollup merge of #142097 - ZuseZ4:offload-host1, r=oli-obk
gpu offload host code generation

r? ghost

This will generate most of the host-side code needed to use LLVM's offload feature.
The first PR will only handle automatic mem-transfers to and from the device.
So if a user calls a kernel, we will copy inputs back and forth, but we won't do the actual kernel launch.
Before merging, we will use LLVM's Info infrastructure to verify that the memcopies match what openmp offload generates in C++. `LIBOMPTARGET_INFO=-1 ./my_rust_binary` should print that a memcpy to and later from the device is happening.

A follow-up PR will generate the actual device-side kernel which will then do computations on the GPU.
A third PR will implement manual host2device and device2host functionality, but the goal is to minimize cases where a user has to override our default handling due to performance issues.

I'm trying to get a full MVP out first, so this just recognizes GPU functions based on magic names. The final frontend will obviously move this over to use proper macros, as I'm already doing for the autodiff work.
This work will also be compatible with std::autodiff, so one can differentiate GPU kernels.

Tracking:
- https://github.com/rust-lang/rust/issues/131513
2025-07-22 00:54:24 +08:00
Manuel Drehwald
5958ebe829 add various wrappers for gpu code generation 2025-07-18 16:24:12 -07:00
Nikita Popov
12b19be741 Pass wasm exception model to TargetOptions
This is no longer implied by -wasm-enable-eh.
2025-07-18 09:35:50 +02:00
Oli Scherer
e574fef728 Shrink some unsafe blocks in cg_llvm 2025-07-14 08:27:08 +00:00
Oli Scherer
d3d51b4fdb Avoid a bunch of unnecessary unsafe blocks in cg_llvm 2025-07-14 08:27:08 +00:00
bors
855e0fe46e Auto merge of #142911 - mejrs:unsized, r=compiler-errors
Remove support for dynamic allocas

Followup to rust-lang/rust#141811
2025-07-11 05:27:32 +00:00
Dillon Amburgey
0455577974 fix: correct parameter names in LLVMRustBuildMinNum and LLVMRustBuildMaxNum FFI declarations 2025-07-08 06:24:19 -05:00
mejrs
49421d1fa3 Remove support for dynamic allocas 2025-07-07 23:04:06 +02:00
sayantn
9415f3d8a6 Use LLVMIntrinsicGetDeclaration to completely remove the hardcoded intrinsics list 2025-06-15 22:15:16 +05:30
sayantn
d56fcd968d Simplify implementation of Rust intrinsics by using type parameters in the cache 2025-06-12 00:32:42 +05:30
Ralf Jung
a387c86a92 get rid of rustc_codegen_ssa::common::AtomicOrdering 2025-05-28 22:57:55 +02:00
Zalathar
b6300294a8 Make LLVMRustInlineAsmVerify take *const c_uchar
This avoids the need for an explicit `as_c_char_ptr` conversion.
2025-05-11 14:38:42 +10:00
Zalathar
b1094f6a0a Add a safe wrapper for LLVMAppendModuleInlineAsm
This patch also changes the Rust-side declaration to take `*const c_uchar`
instead of `*const c_char`, to avoid the need for `AsCCharPtr`.
2025-05-11 14:38:42 +10:00
Zalathar
d1bb310a7a Use LLVMGetInlineAsm
This LLVM-C binding replaces the existing `LLVMRustInlineAsm` function.
2025-05-11 14:37:54 +10:00
Zalathar
8764ecd0c1 Add a searchable tag PTR_LEN_STR to explain *const c_uchar bindings
This module comment describes why it's OK for LLVM bindings to declare a
parameter type of `*const c_uchar` for pointer/length strings, even though the
corresponding parameter on the C/C++ side uses `const char *`.

Adding a searchable term to each such parameter should make it easier for
future maintainers to understand why `*const c_uchar` is being used instead of
`*const c_char`.
2025-05-11 14:26:14 +10:00
Ralf Jung
79dfd0a472 remove 'unordered' atomic intrinsics 2025-05-09 17:39:52 +02:00
Manuel Drehwald
75f86e6e2e fix LooseTypes flag and PrintMod behaviour, add debug helper 2025-04-12 01:36:44 -04:00
Stuart Cook
c6bf3a01ef Rollup merge of #137880 - EnzymeAD:autodiff-batching, r=oli-obk
Autodiff batching

Enzyme supports batching, which is especially known from the ML side when training neural networks.
There we would normally have a training loop, where in each iteration we would pass in some data (e.g. an image) and a target vector. Based on how close our prediction is, we compute our loss, and then use backpropagation to compute the gradients and update our weights.
That's quite inefficient, so what you normally do is pass in a batch of 8/16/.. images and targets, and compute the gradients for those all at once, allowing better optimizations.

Enzyme supports batching in two ways; the first one (which I implemented here) just accepts a batch size,
and then each Dual/Duplicated argument has not one, but N shadow arguments. So instead of
```rs
for i in 0..100 {
    df(x[i], y[i], 1234);
}
```
You can now do
```rs
for i in (0..100).step_by(4) {
    df(x[i+0], x[i+1], x[i+2], x[i+3], y[i+0], y[i+1], y[i+2], y[i+3], 1234);
}
```
which gives the same results but allows better compiler optimizations. See the testcase for details.

There is a second variant, where we can mark certain arguments so that, instead of having to pass in N shadow arguments, Enzyme assumes that the argument is N times longer. I.e. instead of accepting 4 slices with 12 floats each, we would accept one slice with 48 floats (see the sketch after this entry). I'll implement this over the next few days.

I will also add more tests for both modes.

For anyone preferring a more interactive explanation, here's a video of Tim's LLVM dev talk, where he presents his work: https://www.youtube.com/watch?v=edvaLAL5RqU
I'll also add some other docs to the dev guide and user docs in another PR.

r? ghost

Tracking:

- https://github.com/rust-lang/rust/issues/124509
- https://github.com/rust-lang/rust/issues/135283
2025-04-05 13:18:13 +11:00
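A rough sketch of the two shadow layouts described above, for N = 4 lanes of 12 floats; the signatures are hypothetical, not what Enzyme actually generates:

```rs
// Variant one (implemented in this PR): one shadow argument per batch lane.
fn df_lanes(
    x: &[f32; 12],
    sx0: &mut [f32; 12],
    sx1: &mut [f32; 12],
    sx2: &mut [f32; 12],
    sx3: &mut [f32; 12],
) {
    // ...Enzyme writes the gradient for lane i into shadow argument i...
    let _ = (x, sx0, sx1, sx2, sx3);
}

// Variant two (planned): one shadow argument that is N times longer,
// with the four lanes' gradients laid out side by side.
fn df_contiguous(x: &[f32; 12], sx: &mut [f32; 48]) {
    let _ = (x, sx);
}
```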
Manuel Drehwald
b7c63a973f add autodiff batching backend 2025-04-04 14:24:23 -04:00
Daniel Paoliello
79b9664091 Reduce visibility of most items in rustc_codegen_llvm 2025-03-25 16:36:47 +11:00
Zalathar
d07ef5b0e1 coverage: Add LLVM plumbing for expansion regions
This is currently unused, but paves the way for future work on expansion
regions without having to worry about the FFI parts.
2025-03-20 12:40:36 +11:00
Matthias Krüger
63c548d82c Rollup merge of #137549 - oli-obk:llvm-ffi, r=davidtwco
Clean up various LLVM FFI things in codegen_llvm

cc `@ZuseZ4`: I touched some autodiff parts

The major change of this PR is [bfd88ce](bfd88cead0) which makes `CodegenCx` generic just like `GenericBuilder`

The other commits mostly took advantage of the new feature of making extern functions safe, but also just used some wrappers that were already there and shrunk unsafe blocks.

best reviewed commit-by-commit
2025-03-07 19:15:34 +01:00
Michael Goulet
a59a8f9e75 Revert "Auto merge of #135335 - oli-obk:push-zxwssomxxtnq, r=saethlin"
This reverts commit a7a6c64a65, reversing
changes made to ebbe63891f.
2025-03-02 18:52:48 +00:00
bors
0c72c0d11a Auto merge of #133250 - DianQK:embed-bitcode-pgo, r=nikic
The embedded bitcode should always be prepared for LTO/ThinLTO

Fixes #115344. Fixes #117220.

There are currently two methods for generating the bitcode that is used for LTO. One method involves using `-C linker-plugin-lto` to emit object files as bitcode, which is the typical setting used by cargo. The other method is through `-C embed-bitcode=yes`.

When using `-C embed-bitcode=yes -C lto=no`, we run a complete non-LTO LLVM pipeline to obtain the bitcode, and that bitcode is then used for LTO. As a result, we run the Call Graph Profile Pass twice on the same module.

This PR is doing something similar to LLVM's `buildFatLTODefaultPipeline`, obtaining the bitcode for embedding after running `buildThinLTOPreLinkDefaultPipeline`.

r? nikic
2025-03-01 08:22:18 +00:00
Oli Scherer
553828c6f4 Mark more LLVM FFI as safe 2025-02-24 15:11:29 +00:00
Oli Scherer
3565603d25 Use a safe wrapper around an LLVM FFI function 2025-02-24 15:11:29 +00:00
Oli Scherer
396baa750e Make allocator shim creation mostly use safe code 2025-02-24 15:11:29 +00:00
Oli Scherer
a54bfcf52b Use safe FFI for various functions in codegen_llvm 2025-02-24 15:05:56 +00:00
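A sketch of the two patterns these last few commits apply (all binding names invented): declaring an FFI function `safe` inside an `unsafe extern` block when it has no safety preconditions, and wrapping one that does have preconditions behind a safe Rust function.

```rs
use std::ffi::c_uint;

// Opaque stand-in for one of rustc's LLVM handle types.
#[repr(C)]
pub struct Module {
    _private: [u8; 0],
}

unsafe extern "C" {
    // No pointers, no preconditions: the declaration itself can be `safe`.
    safe fn LLVMRustExampleVersion() -> c_uint;

    // Has a validity precondition, so it stays unsafe at the FFI edge.
    fn LLVMRustExampleCount(m: *const Module) -> c_uint;
}

/// Safe wrapper: `&Module` guarantees a valid, live pointer for the call.
pub fn example_count(m: &Module) -> u32 {
    unsafe { LLVMRustExampleCount(m) }
}

pub fn example_version() -> u32 {
    LLVMRustExampleVersion() // callable without an `unsafe` block
}
```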