I don't get the picture. Is the brown rectangle like a screen in a movie theatre? If it is, the SA-5000 sounds nothing like sitting 20 rows back in a movie theatre =/
Okay, for your benefit and for the Head-Fiers out there who don't understand Japanese, I'll try to explain as follows.
リスナー means the listener (in this case, you yourself).
ステージ means the stage (that is, where the singer and instruments would be performing).
Going by my own listening, with my Audio-Technica CK100 and Grado RS-1i for example, the vocals and instruments are placed just as described for the Z-series, while phones like the EX1000 and EX800ST are probably closer to what is shown for the EX-series. I'm not too sure about the MDR-SA5000, but from what I remember of a Japanese Phile-web article I read some time back, the chart was meant to show that the MDR-SA5000 has a more open, 'beyond-the-head' soundstage, rather than merely placing the listener 20 rows away from the stage.
Hope this answers your question.