Hi,
It's a bit hard to understand what actually happened without looking at the full kernel log. but the first issue looks like a memory issue with QP registrations which was most likely caused by an issue previous to that. most commonly would be the firmware getting stuck, PCI issue etc...I would swap the HCA with another one to see if the issue follows the card or not.
as for upgrading, this is a really old HCA, so newer MFT versions will most likely not work with it.Are you still in that state even after the server is rebooted ? what does "mst status" show ?