Trying models of similar MOSFETs from different manufacturers would give some clues about the former: can’t do much about the latter!Spice quality = (the models X the operator).
The quoted power handling is likely somewhat unrealistic, but I think the stated thermal-resistance values for the parts are accurate enough to determine proper heat-sinking (or if it is needed).For that matter, how accurate are the datasheet values of the power handling capability of 'typical' parts?
Yes, obviously the accuracy of the simulation is dependent on the accuracy of all the components, including strays.so the overall accuracy will depend on having accurate values for such things as supply impedance and transformer self-capacitance, or running the simulator with a span of values for them and seeing which make a difference.