Fork join_none in for loop with multiple threads sharing semaphore

Hi, Mlsdx
In VCS i get following output during simulation.

               0 a=0
               0 Process_it: get key, a=0
               0 a=1
               0 a=2
               2 Process_it: put key back, a=0
               2 Process_it: get key, a=1
               4 Process_it: put key back, a=1
               4 Process_it: get key, a=2
               6 Process_it: put key back, a=2