What is it about their explanation that is causing confusion? In other words, how far can you get through their explanation before you hit a stumbling point?
Recall that the common-mode voltage is defined as the average of the two signals. If one of the signals is 0 V, then the average of the two is simply one-half of the other.
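Written out as a formula, it might look like the following. The names V_1 and V_2 are just labels I am assuming for the two signals; use whatever names your source uses.

```latex
V_{cm} = \frac{V_1 + V_2}{2}
\qquad\text{so with } V_2 = 0\,\mathrm{V}:\qquad
V_{cm} = \frac{V_1 + 0}{2} = \frac{V_1}{2}
```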
Does that help?
You can derive the common-mode waveform directly from the definition. Add the two waveforms together point by point, then divide by two to get the average.
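If it helps to see that numerically, here is a minimal sketch of the add-and-divide-by-two step. The function name `common_mode`, the 8-sample sine wave, and the 0 V second input are all made up for illustration; substitute samples of your actual waveforms.

```python
import math


def common_mode(v_a, v_b):
    """Return the point-by-point average of two equally sampled waveforms."""
    if len(v_a) != len(v_b):
        raise ValueError("waveforms must have the same number of samples")
    # Add the two waveforms sample by sample, then divide by two.
    return [(a + b) / 2 for a, b in zip(v_a, v_b)]


# Hypothetical example: a 1 V peak sine on one input, the other input held at 0 V.
n = 8
v_a = [math.sin(2 * math.pi * k / n) for k in range(n)]
v_b = [0.0] * n

v_cm = common_mode(v_a, v_b)

for a, b, cm in zip(v_a, v_b, v_cm):
    print(f"{a:+.3f} V  {b:+.3f} V  ->  common mode {cm:+.3f} V")
```

With one input at 0 V, every common-mode sample comes out as half of the corresponding sample on the other input, so the common-mode waveform has the same shape at half the amplitude.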