Remove unnecessary condition in Barrier::wait() by twetzel59 · Pull Request #874... - JOYK Joy of Geek, Geek News, Link all geek

Copy link

Contributor

twetzel59 commented on Jul 24

This is my first pull request for Rust, so feel free to call me out if anything is amiss.

After some examination, I realized that the second condition of the "spurious-wakeup-handler" loop in std::sync::Barrier::wait() should always evaluate to true, making it redundant in the && expression.

Here is the affected function before the fix:

#[stable(feature = "rust1", since = "1.0.0")]
pub fn wait(&self) -> BarrierWaitResult {
    let mut lock = self.lock.lock().unwrap();
    let local_gen = lock.generation_id;
    lock.count += 1;
    if lock.count < self.num_threads {
        // We need a while loop to guard against spurious wakeups.
        // https://en.wikipedia.org/wiki/Spurious_wakeup
        while local_gen == lock.generation_id && lock.count < self.num_threads { // fixme
            lock = self.cvar.wait(lock).unwrap();
        }
        BarrierWaitResult(false)
    } else {
        lock.count = 0;
        lock.generation_id = lock.generation_id.wrapping_add(1);
        self.cvar.notify_all();
        BarrierWaitResult(true)
    }
}

At first glance, it seems that the check that lock.count < self.num_threads would be necessary in order for a thread A to detect when another thread B has caused the barrier to reach its thread count, making thread B the "leader".

However, the control flow implicitly results in an invariant that makes observing !(lock.count < self.num_threads), i.e. lock.count >= self.num_threads impossible from thread A.

When thread B, which will be the leader, calls .wait() on this shared instance of the Barrier, it locks the mutex in the first line and saves the MutexGuard in the lock variable. It then increments the value of lock.count. However, it then proceeds to check if lock.count < self.num_threads. Since it is the leader, it is the case that (after the increment of lock.count), the lock count is equal to the number of threads. Thus, the second branch is immediately taken and lock.count is zeroed. Additionally, the generation ID is incremented (with wrap). Then, the condition variable is signalled. But, the other threads are waiting at the line lock = self.cvar.wait(lock).unwrap();, so they cannot resume until thread B's call to Barrier::wait() returns, which drops the MutexGuard acquired in the first let statement and unlocks the mutex.

The order of events is thus:

A thread A calls .wait()
.wait() acquires the mutex, increments lock.count, and takes the first branch
Thread A enters the while loop since the generation ID has not changed and the count is less than the number of threads for the Barrier
Spurious wakeups occur, but both conditions hold, so the thread A waits on the condition variable
This process repeats for N - 2 additional times for non-leader threads A'
Meanwhile, Thread B calls Barrier::wait() on the same barrier that threads A, A', A'', etc. are waiting on. The thread count reaches the number of threads for the Barrier, so all threads should now proceed, with B being the leader. B acquires the mutex and increments the value lock.count only to find that it is not less than self.num_threads. Thus, it immediately clamps self.num_threads back down to 0 and increments the generation. Then, it signals the condvar to tell the A (prime) threads that they may continue.
The A, A', A''... threads wake up and attempt to re-acquire the lock as per the internal operation of a condition variable. When each A has exclusive access to the mutex, it finds that lock.generation_id no longer matches local_generation and the && expression short-circuits -- and even if it were to evaluate it, self.count is definitely less than self.num_threads because it has been reset to 0 by thread B before B dropped its MutexGuard.

Therefore, it my understanding that it would be impossible for the non-leader threads to ever see the second boolean expression evaluate to anything other than true. This PR simply removes that condition.

Any input would be appreciated. Sorry if this is terribly verbose. I'm new to the Rust community and concurrency can be hard to explain in words. Thanks!

Remove unnecessary condition in Barrier::wait() by twetzel59 · Pull Request #874...

twetzel59 commented on Jul 24

Recommend

Sayonara, C++, and hello to Rust!

Fix false positive of `implicit_saturating_sub` with `else` clause by narpfel ·...

Concordium’s DevX Initiative Launches Rust Maintainers Sponsorship Program

Rust Web Development with Bastian Gruber

SEO Clinic Episode II: Hands-on Landing Page Optimization Tips

Speedup int log10 branchless by falk-hueffner · Pull Request #88788 · rust-lang/...

Make Result::as_mut const by woppopo · Pull Request #89977 · rust-lang/rust · Gi...

Digital Millennium Copyright Act

Config Connector overview | Config Connector Documentation

Faster Shipping Speeds for Cross-Border Orders

About Joyk