307 | 05.12.2013 2:47 | Mmry ECC Sensor | Memory | Uncorrectable ECC. CPU: 1, DIMM: B1. - Asserted |
This is a hard failure on the DIMM.
It most likely produced the following error as a secondary message:
303 | 05.12.2013 2:43 | CATERR | Processor | reports it has been asserted |
These two are very strange. I have seen similar on early Engineering Sample processors, but not on production processors. It indicates the BMC can't read the CPU temperature, so the fans will all go to 100%.
'P2 Therm Ctrl %' sensor has failed and may not be providing a valid reading - Asserted
It might be related to the DIMM, if the DIMM is hanging the I2C bus, but it is still very strange.
I would recommend:
Replace DIMM B1, as this is the error that took the system down.
Update to the newest code stack release for BIOS, BMC, ME and FRUSDR (may fix the PSU messages)
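If you want to check a longer SEL export for the same kind of hard memory failure, a rough Python sketch like the one below could help. It assumes the pipe-separated layout seen in the two log lines quoted above (record ID, timestamp, sensor, sensor type, description); other tools or firmware versions may format entries differently. The names find_uncorrectable_ecc and EccEvent are made up for this illustration and are not part of any vendor tool.

```python
import re
from typing import Iterable, Iterator, NamedTuple


class EccEvent(NamedTuple):
    """One uncorrectable ECC entry pulled out of the SEL text (hypothetical type)."""
    record_id: str
    timestamp: str
    cpu: int
    dimm: str


# Assumption: the description text looks like the entry quoted in this thread,
# e.g. "Uncorrectable ECC. CPU: 1, DIMM: B1. - Asserted".
_ECC_RE = re.compile(r"Uncorrectable ECC\. CPU: (\d+), DIMM: (\w+)\.")


def find_uncorrectable_ecc(sel_lines: Iterable[str]) -> Iterator[EccEvent]:
    """Yield an EccEvent for every SEL line that reports an uncorrectable ECC error."""
    for line in sel_lines:
        # Assumed column order: id | timestamp | sensor | sensor type | description
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 5:
            continue  # not a SEL record in the assumed layout
        record_id, timestamp, _sensor, _sensor_type, description = fields[:5]
        match = _ECC_RE.search(description)
        if match:
            yield EccEvent(record_id, timestamp, int(match.group(1)), match.group(2))


if __name__ == "__main__":
    sample = [
        "307 | 05.12.2013 2:47 | Mmry ECC Sensor | Memory | Uncorrectable ECC. CPU: 1, DIMM: B1. - Asserted |",
        "303 | 05.12.2013 2:43 | CATERR | Processor | reports it has been asserted |",
    ]
    for event in find_uncorrectable_ecc(sample):
        print(f"Record {event.record_id}: CPU {event.cpu}, DIMM {event.dimm} at {event.timestamp}")
```

Any DIMM slot that shows up here is a candidate for replacement; the CATERR entry is skipped on purpose because it is most likely a secondary symptom.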