Both companies' systems solve five out of six problems, achieving the result with general-purpose "reasoning" models