- 
                Notifications
    You must be signed in to change notification settings 
- Fork 18.4k
Description
Please answer these questions before submitting your issue. Thanks!
- What version of Go are you using (go version)?
go version go1.6.2 freebsd/amd64
- What operating system and processor architecture are you using (go env)?
GOARCH="amd64"
GOBIN=""
GOEXE=""
GOHOSTARCH="amd64"
GOHOSTOS="freebsd"
GOOS="freebsd"
GOPATH=""
GORACE=""
GOROOT="/usr/local/go"
GOTOOLDIR="/usr/local/go/pkg/tool/freebsd_amd64"
GO15VENDOREXPERIMENT="1"
CC="cc"
GOGCCFLAGS="-fPIC -m64 -pthread -fmessage-length=0"
CXX="clang++"- What did you do?
package main
/* stdlib includes */
import (
        "fmt"
        "os/exec"
)
func run(done chan struct{}) {
        cmd := exec.Command("true")
        if err := cmd.Start(); err != nil {
                goto finished
        }
        cmd.Wait()
finished:
        done <- struct{}{}
        return
}
func main() {
        fmt.Println("Starting a bunch of goroutines...")
        // 8 & 16 are arbitrary
        done := make(chan struct{}, 16)
        for i := 0; i < 8; i++ {
                go run(done)
        }
        for {
                select {
                case <-done:
                        go run(done)
                }
        }
}- What did you expect to see?
I expect this strange program to spawn instances of /bin/true in parallel, until I stop it.
- What did you see instead?
Various types of panics caused by what looks to be corruption within the finalizer lists, caused by what I am assuming is based on race conditions. These panics can happen as quickly as 2 minutes, or much longer. 10 minutes seems a good round number.
Occasionally addspecial gets stuck in an infinite loop holding the lock, and the process wedges. This is illustrated in log 1462933614, with x.next pointing to x. This appears to be corruption of that data structure. I have seen processes in this state run for 22 hours.
I understand there is some trepidation expressed in issue #11485 around the locking of the data structures involved.
Here are some sample messages:
- fatal error: runtime.SetFinalizer: finalizer already set
- runtime: nonempty check fails b.log[0]= 0 b.log[1]= 0 b.log[2]= 0 b.log[3]= 0 fatal error: workbuf is empty
1462926841-SetFinalizer-ex1.txt
1462926969-SetFinalizer-ex2.txt
1462933295-nonempty-check-fails.txt
1462933614-wedged.txt
This was run on an 8-core processor, and a 4-core 8-thread processor with ECC RAM, similar results.
Additionally, while this example is an extreme, it also represents the core functionality of a project I've been working on part-time for many months. I'm happy to provide any further assistance diagnosing this issue - I'm very invested!