Implement Windows SemLock in _multiprocessing module by youknowone · Pull Request #7199 · RustPython/RustPython

youknowone · 2026-02-22T10:57:54Z

Summary by CodeRabbit

New Features
- Windows semaphore synchronization: new semaphore and handle types with ownership, recursive-locking, lifecycle hooks, and context-manager support.
- Windows-specific attributes and accessors exposed for semaphore objects.
Bug Fixes
- More consistent exception reduction on Windows (winerror and filename ordering/presence).
- Socket receive now returns trimmed byte buffers matching actual bytes read.

coderabbitai · 2026-02-22T10:58:03Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds a Windows-specific _multiprocessing implementation with SemLock/SemHandle types and full Win32 synchronization semantics, changes socket recv to return a truncated Vec<u8>, and updates PyOSError.reduce to handle optional filename2 and Windows winerror ordering.

Changes

Cohort / File(s)	Summary
Windows multiprocessing implementation `crates/stdlib/src/multiprocessing.rs`	Adds extensive Win32 imports and atomics; implements `SemLock` and `SemHandle` (creation, Drop, handle/name accessors), acquire/release semantics (blocking, timeout, recursive mutex), context-manager methods, `_rebuild`/`_after_fork`/`__reduce__`, `sem_unlink` noop, `flags` attr, and updates `recv` to return a truncated `Vec<u8>`.
OSError reduction logic `crates/vm/src/exceptions.rs`	Refactors `PyOSError::__reduce__` to treat `filename2` as optional and to include Windows `winerror` when present, with platform-gated argument ordering.

Sequence Diagram(s)

sequenceDiagram
    actor Py as Python caller
    participant SemLock as SemLock (Rust)
    participant Win32 as Win32 API
    participant Kernel as Kernel/Handles

    Py->>SemLock: acquire(timeout?)
    SemLock->>Win32: WaitForSingleObjectEx(handle, timeout)
    Win32->>Kernel: resolve wait on HANDLE
    Kernel-->>Win32: WAIT_OBJECT_0 / WAIT_TIMEOUT / WAIT_FAILED
    Win32-->>SemLock: result
    alt acquired
        SemLock-->>Py: True
    else timeout
        SemLock-->>Py: False
    else error
        SemLock-->>Py: raise OSError (include winerror)
    end

    Py->>SemLock: release()
    SemLock->>Win32: ReleaseSemaphore / ReleaseMutex
    Win32-->>SemLock: success / error
    SemLock-->>Py: return / raise OSError

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Fix winerror handling #6454 — Modifies crates/vm/src/exceptions.rs PyOSError handling; touches the same reduction logic.
reject SemLock reduce #6738 — Changes SemLock.__reduce__ and SemLock pickling behavior in multiprocessing; closely related to new SemLock methods.
new_last_{os,errno}_error #6381 — Adjusts Windows multiprocessing recv/WinSock handling; relates to the recv signature and behavior changes.

Poem

"I hop through code with twitchy paws,
I stitch a lock without a pause.
Semaphores and tiny springs,
Now threads may dance on quiet wings.
Hooray — this rabbit guards the cause!" 🐇

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Implement Windows SemLock in _multiprocessing module' accurately summarizes the main change—adding Windows-specific SemLock implementation to the multiprocessing module, which aligns with the comprehensive additions to crates/stdlib/src/multiprocessing.rs.
Docstring Coverage	✅ Passed	Docstring coverage is 88.46% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings (stacked PR)
📝 Generate docstrings (commit on current branch)

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-02-22T11:04:29Z

Code has been automatically formatted

The code in this PR has been formatted using:

cargo fmt --all
Please pull the latest changes before pushing again:

git pull origin win-semlock

Add SemLock class using Windows semaphore APIs (CreateSemaphoreW, WaitForSingleObjectEx, ReleaseSemaphore) so test_multiprocessing suites are no longer skipped with "lacks a functioning sem_open". Also add sem_unlink as no-op and flags dict for Windows.

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (1)

crates/stdlib/src/multiprocessing.rs (1)

179-185: Fast-path ignores WAIT_FAILED, causing a redundant system call

When the fast-path WaitForSingleObjectEx returns WAIT_FAILED (handle invalid, etc.), the result is silently ignored and the main blocking wait is called immediately after, almost certainly returning the same error. Checking the fast-path result for WAIT_FAILED saves the redundant call:

♻️ Proposed fix

-        if unsafe { WaitForSingleObjectEx(self.handle.as_raw(), 0, 0) } == WAIT_OBJECT_0 {
+        let fast = unsafe { WaitForSingleObjectEx(self.handle.as_raw(), 0, 0) };
+        if fast == WAIT_FAILED {
+            return Err(vm.new_last_os_error());
+        }
+        if fast == WAIT_OBJECT_0 {
             self.last_tid
                 .store(unsafe { GetCurrentThreadId() }, Ordering::Release);
             self.count.fetch_add(1, Ordering::Release);
             return Ok(true);
         }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/stdlib/src/multiprocessing.rs` around lines 179 - 185, The fast-path
call to WaitForSingleObjectEx in the block around self.handle currently ignores
WAIT_FAILED; change that fast-path to check for WAIT_FAILED and immediately
propagate the failure instead of falling through to the blocking wait.
Specifically, in the fast-path around
WaitForSingleObjectEx(self.handle.as_raw(), 0, 0) detect if the return equals
WAIT_FAILED and return an Err created from the OS error (e.g.,
std::io::Error::last_os_error()) so the code in the blocking path (and callers
relying on WaitForSingleObjectEx/WAIT_FAILED) aren't invoked redundantly;
otherwise keep the existing success path that stores GetCurrentThreadId() into
self.last_tid and increments self.count.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/stdlib/src/multiprocessing.rs`:
- Around line 327-329: The current validation allows maxvalue == 0 which leads
CreateSemaphoreW/WaitForSingleObjectEx to fail with an OS
ERROR_INVALID_PARAMETER; update the bounds check around args (the one that now
reads `if args.value < 0 || args.value > args.maxvalue`) to also reject
non‑positive maxvalue (e.g. `args.maxvalue <= 0`) and return a clear Python
ValueError via vm.new_value_error; mention
CreateSemaphoreW/WaitForSingleObjectEx in the comment if helpful and keep the
error message descriptive (e.g. "invalid maxvalue").
- Around line 78-83: Remove the stale GetLastError check block after
CreateSemaphoreW: the code that reads
windows_sys::Win32::Foundation::GetLastError(), compares to
0/ERROR_ALREADY_EXISTS, calls CloseHandle(handle) and then returns
vm.new_last_os_error() must be deleted because anonymous semaphores will not set
ERROR_ALREADY_EXISTS and a prior nonzero last-error can close a valid handle and
produce misleading errors; locate the sequence around CreateSemaphoreW,
GetLastError, CloseHandle and vm.new_last_os_error in multiprocessing.rs and
simply delete that entire branch; if named-semaphore support is added later,
implement the correct pattern: check only for last_err == ERROR_ALREADY_EXISTS,
save the error before CloseHandle, and restore it via SetLastError before
calling vm.new_last_os_error().

---

Nitpick comments:
In `@crates/stdlib/src/multiprocessing.rs`:
- Around line 179-185: The fast-path call to WaitForSingleObjectEx in the block
around self.handle currently ignores WAIT_FAILED; change that fast-path to check
for WAIT_FAILED and immediately propagate the failure instead of falling through
to the blocking wait. Specifically, in the fast-path around
WaitForSingleObjectEx(self.handle.as_raw(), 0, 0) detect if the return equals
WAIT_FAILED and return an Err created from the OS error (e.g.,
std::io::Error::last_os_error()) so the code in the blocking path (and callers
relying on WaitForSingleObjectEx/WAIT_FAILED) aren't invoked redundantly;
otherwise keep the existing success path that stores GetCurrentThreadId() into
self.last_tid and increments self.count.

crates/stdlib/src/multiprocessing.rs

…lity - recv() now returns bytes instead of int (matching CPython) - Remove spurious GetLastError() check after CreateSemaphoreW - Add signal checking during blocking SemLock acquire

github-actions · 2026-02-22T15:22:28Z

coderabbitai

🧹 Nitpick comments (1)

crates/stdlib/src/multiprocessing.rs (1)
21-24: Consider importing WAIT_TIMEOUT, WAIT_FAILED, and ERROR_TOO_MANY_POSTS from windows-sys instead of hardcoding.

WAIT_OBJECT_0 is imported from the crate, but WAIT_TIMEOUT, WAIT_FAILED, and ERROR_TOO_MANY_POSTS are hardcoded. Using the crate's constants avoids divergence if values change or types are redefined.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@crates/stdlib/src/multiprocessing.rs` around lines 21 - 24, Replace the
hardcoded constants with the corresponding exports from the windows-sys crate:
stop defining WAIT_TIMEOUT, WAIT_FAILED and ERROR_TOO_MANY_POSTS as local consts
and instead import the matching symbols from windows-sys (the same place
WAIT_OBJECT_0 is imported from) so types and values stay consistent; leave
INFINITE if it isn’t provided by your current import but prefer using the
windows-sys INFINITE if available, and update any references to WAIT_TIMEOUT,
WAIT_FAILED and ERROR_TOO_MANY_POSTS in the module (e.g., where WAIT_OBJECT_0 is
used) to use the imported names.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@crates/stdlib/src/multiprocessing.rs`:
- Around line 343-345: The current validation in multiprocessing.rs only checks
args.value against args.maxvalue but allows maxvalue == 0, which causes
CreateSemaphoreW to fail; update the validation that returns vm.new_value_error
to also reject non‑positive maxvalue (e.g., args.maxvalue <= 0) so that any call
where args.maxvalue is zero or negative triggers a clean ValueError before
calling CreateSemaphoreW; modify the existing check that uses args.value and
args.maxvalue to include this maxvalue > 0 requirement (referencing the same
args variable and the CreateSemaphoreW call).

---

Nitpick comments:
In `@crates/stdlib/src/multiprocessing.rs`:
- Around line 21-24: Replace the hardcoded constants with the corresponding
exports from the windows-sys crate: stop defining WAIT_TIMEOUT, WAIT_FAILED and
ERROR_TOO_MANY_POSTS as local consts and instead import the matching symbols
from windows-sys (the same place WAIT_OBJECT_0 is imported from) so types and
values stay consistent; leave INFINITE if it isn’t provided by your current
import but prefer using the windows-sys INFINITE if available, and update any
references to WAIT_TIMEOUT, WAIT_FAILED and ERROR_TOO_MANY_POSTS in the module
(e.g., where WAIT_OBJECT_0 is used) to use the imported names.

coderabbitai

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@crates/stdlib/src/multiprocessing.rs`:
- Around line 339-345: The constructor validation in py_new for the
multiprocessing primitive is missing a check that Args.maxvalue must be > 0
(CreateSemaphoreW requires lMaximumCount > 0); update the validation in py_new
(the function handling Args.kind, Args.value, Args.maxvalue against
RECURSIVE_MUTEX and SEMAPHORE) to explicitly reject maxvalue <= 0 by returning
vm.new_value_error("invalid maxvalue" or similar), and keep the existing value
vs maxvalue check (value < 0 || value > maxvalue) so invalid maxvalue is
rejected at Python level instead of provoking CreateSemaphoreW
ERROR_INVALID_PARAMETER.

When filename is None, __reduce__ was reconstructing a 2-element (errno, msg) tuple, dropping the winerror at position 3 in the original args. Use the original args tuple instead, matching CPython.

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/vm/src/exceptions.rs`:
- Around line 2064-2084: The Windows-specific branch that pushes winerror even
when filename2 is None (affecting args_reduced in OSError.__reduce__)
intentionally diverges from CPython to preserve winerror across pickle
round-trips; add a concise comment above the branch referencing filename2,
winerror, args_reduced and the TODO about errno/winerror to explain this
deliberate behavior (or, if you prefer CPython compatibility instead, remove the
winerror push in the else branch so the reduced tuple stays 3 elements when
filename2 is absent).

crates/vm/src/exceptions.rs

…ergence

coderabbitai

🧹 Nitpick comments (1)

crates/stdlib/src/multiprocessing.rs (1)
385-395: recv now correctly returns the actual bytes received — consider capping size to i32::MAX.

The truncation to n_read is correct, and returning Vec<u8> is the right abstraction. However, the pre-existing size as i32 cast on line 388 means a size > i32::MAX would wrap to a negative len for WinSock::recv while also allocating a multi-GB buffer. Consider clamping or validating:
🛡️ Suggested size guard
 fn recv(socket: usize, size: usize, vm: &VirtualMachine) -> PyResult<Vec<u8>> {
+    if size > i32::MAX as usize {
+        return Err(vm.new_overflow_error("recv size too large".to_owned()));
+    }
     let mut buf = vec![0u8; size];
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@crates/stdlib/src/multiprocessing.rs` around lines 385 - 395, The recv
function currently casts size to i32 and allocates a buffer of size bytes which
can overflow or allocate huge memory when size > i32::MAX; before allocating or
calling WinSock::recv (in function recv, referencing buf, size, and
WinSock::recv), validate or clamp size to i32::MAX (e.g., cap a local len =
size.min(i32::MAX as usize) or return an error for too-large sizes) and use that
clamped len for buffer allocation and for the i32 cast passed to WinSock::recv
so you never pass a negative length or allocate an oversized buffer.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@crates/stdlib/src/multiprocessing.rs`:
- Around line 385-395: The recv function currently casts size to i32 and
allocates a buffer of size bytes which can overflow or allocate huge memory when
size > i32::MAX; before allocating or calling WinSock::recv (in function recv,
referencing buf, size, and WinSock::recv), validate or clamp size to i32::MAX
(e.g., cap a local len = size.min(i32::MAX as usize) or return an error for
too-large sizes) and use that clamped len for buffer allocation and for the i32
cast passed to WinSock::recv so you never pass a negative length or allocate an
oversized buffer.

ℹ️ Review info

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f10bc82 and 1a5aa69.

📒 Files selected for processing (2)

crates/stdlib/src/multiprocessing.rs
crates/vm/src/exceptions.rs

youknowone force-pushed the win-semlock branch from bbdfb4c to 067ca98 Compare February 22, 2026 13:32

youknowone marked this pull request as ready for review February 22, 2026 13:45

coderabbitai bot reviewed Feb 22, 2026

View reviewed changes

crates/stdlib/src/multiprocessing.rs Outdated Show resolved Hide resolved

crates/stdlib/src/multiprocessing.rs Show resolved Hide resolved

youknowone added 2 commits February 22, 2026 23:27

Fix _multiprocessing recv to return bytes and improve SemLock reliabi…

64e740f

…lity - recv() now returns bytes instead of int (matching CPython) - Remove spurious GetLastError() check after CreateSemaphoreW - Add signal checking during blocking SemLock acquire

Include winerror in OSError.__reduce__ on Windows

effff24

coderabbitai bot reviewed Feb 22, 2026

View reviewed changes

Remove expectedFailure for tests now passing with Windows SemLock

801a5ed

youknowone force-pushed the win-semlock branch from 12e824e to 801a5ed Compare February 22, 2026 16:25

coderabbitai bot reviewed Feb 22, 2026

View reviewed changes

Fix OSError.__reduce__ to preserve winerror when filename is None

f10bc82

When filename is None, __reduce__ was reconstructing a 2-element (errno, msg) tuple, dropping the winerror at position 3 in the original args. Use the original args tuple instead, matching CPython.

coderabbitai bot reviewed Feb 22, 2026

View reviewed changes

crates/vm/src/exceptions.rs Show resolved Hide resolved

Validate maxvalue > 0 in SemLock and document winerror __reduce__ div…

1a5aa69

…ergence

coderabbitai bot reviewed Feb 23, 2026

View reviewed changes

Comments

Conversation

youknowone commented Feb 22, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

github-actions bot commented Feb 22, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Feb 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📦 Library Dependencies

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

youknowone commented Feb 22, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 22, 2026 •

edited

Loading

github-actions bot commented Feb 22, 2026 •

edited

Loading