Will O1 Ever Escape ChatGPT's Old Training? OVERFIT: AI, Machine Learning, And Deep Learning Made Simple podcast

Artwork

Inhoud geleverd door Brian Carter. Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Brian Carter of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.

OVERFIT: AI, Machine Learning, and Deep Learning Made Simple « »
Will o1 Ever Escape ChatGPT's Old Training?

2d ago 7:41

Delen

MP3•Thuis aflevering

Inhoud geleverd door Brian Carter. Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Brian Carter of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.

This study investigates whether the reasoning abilities of large language models (LLMs) are still influenced by their origins in next-word prediction. The authors examine the performance of a new LLM from OpenAI called o1, which is specifically optimized for reasoning, on tasks that highlight the limitations of LLMs based on their autoregressive nature. While o1 shows significant improvements compared to previous LLMs, it still displays a sensitivity to the probability of both the task and the output, suggesting that reasoning optimization may not fully overcome the probabilistic biases ingrained during training. The study provides evidence for the "teleological perspective," which argues that understanding AI systems requires considering the pressures and optimizations that have shaped them.

Read more: https://arxiv.org/abs/2410.01792

… continue reading

21 afleveringen

Artwork

Will o1 Ever Escape ChatGPT's Old Training?

OVERFIT: AI, Machine Learning, and Deep Learning Made Simple

published 2d ago

Delen

MP3•Thuis aflevering

Inhoud geleverd door Brian Carter. Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Brian Carter of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.

This study investigates whether the reasoning abilities of large language models (LLMs) are still influenced by their origins in next-word prediction. The authors examine the performance of a new LLM from OpenAI called o1, which is specifically optimized for reasoning, on tasks that highlight the limitations of LLMs based on their autoregressive nature. While o1 shows significant improvements compared to previous LLMs, it still displays a sensitivity to the probability of both the task and the output, suggesting that reasoning optimization may not fully overcome the probabilistic biases ingrained during training. The study provides evidence for the "teleological perspective," which argues that understanding AI systems requires considering the pressures and optimizations that have shaped them.

Read more: https://arxiv.org/abs/2410.01792

… continue reading

21 afleveringen

Alle afleveringen

×

Welkom op Player FM!

Player FM scant het web op podcasts van hoge kwaliteit waarvan u nu kunt genieten. Het is de beste podcast-app en werkt op Android, iPhone en internet. Aanmelden om abonnementen op verschillende apparaten te synchroniseren.

Luister naar 500+ onderwerpen