OpenAI releases o1, its first model with ‘reasoning’ abilities

@[email protected] · edit-2 2 months ago

OpenAI releases o1, its first model with ‘reasoning’ abilities

@[email protected] · edit-2 2 months ago

All signs point to this being a finetune of gpt4o with additional chain of thought steps before the final answer. It has exactly the same pitfalls as the existing model (9.11>9.8 tokenization error, failing simple riddles, being unable to assert that the user is wrong, etc.). It’s still a transformer and it’s still next token prediction. They hide the thought steps to mask this fact and to prevent others from benefiting from all of the finetuning data they paid for.

Communist · 2 months ago

It does not fail the 9.11 > 9.8 thing.