Artwork

Inhoud geleverd door Yannic Kilcher. Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Yannic Kilcher of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.
Player FM - Podcast-app
Ga offline met de app Player FM !

ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)

31:54
 
Delen
 

Manage episode 351308803 series 2974171
Inhoud geleverd door Yannic Kilcher. Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Yannic Kilcher of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.

#chatgpt #ai #openai

ChatGPT, OpenAI's newest model is a GPT-3 variant that has been fine-tuned using Reinforcement Learning from Human Feedback, and it is taking the world by storm!

Sponsor: Weights & Biases

https://wandb.me/yannic

OUTLINE:

0:00 - Intro

0:40 - Sponsor: Weights & Biases

3:20 - ChatGPT: How does it work?

5:20 - Reinforcement Learning from Human Feedback

7:10 - ChatGPT Origins: The GPT-3.5 Series

8:20 - OpenAI's strategy: Iterative Refinement

9:10 - ChatGPT's amazing capabilities

14:10 - Internals: What we know so far

16:10 - Building a virtual machine in ChatGPT's imagination (insane)

20:15 - Jailbreaks: Circumventing the safety mechanisms

29:25 - How OpenAI sees the future

References:

https://openai.com/blog/chatgpt/

https://openai.com/blog/language-model-safety-and-misuse/

https://beta.openai.com/docs/model-index-for-researchers

https://scale.com/blog/gpt-3-davinci-003-comparison#Conclusion

https://twitter.com/johnvmcdonnell/status/1598470129121374209

https://twitter.com/blennon_/status/1597374826305318912

https://twitter.com/TimKietzmann/status/1598230759118376960/photo/1

https://twitter.com/_lewtun/status/1598056075672027137/photo/2

https://twitter.com/raphaelmilliere/status/1598469100535259136

https://twitter.com/CynthiaSavard/status/1598498138658070530/photo/1

https://twitter.com/tylerangert/status/1598389755997290507/photo/1

https://twitter.com/amasad/status/1598042665375105024/photo/1

https://twitter.com/goodside/status/1598129631609380864/photo/1

https://twitter.com/moyix/status/1598081204846489600/photo/2

https://twitter.com/JusticeRage/status/1598959136531546112

https://twitter.com/yoavgo/status/1598594145605636097

https://twitter.com/EladRichardson/status/1598333315764871174

https://twitter.com/charles_irl/status/1598319027327307785/photo/4

https://twitter.com/jasondebolt/status/1598243854343606273

https://twitter.com/mattshumer_/status/1598185710166896641/photo/1

https://twitter.com/i/web/status/1598246145171804161

https://twitter.com/bleedingedgeai/status/1598378564373471232

https://twitter.com/MasterScrat/status/1598830356115124224

https://twitter.com/Sentdex/status/1598803009844256769

https://twitter.com/harrison_ritz/status/1598828017446371329

https://twitter.com/parafactual/status/1598212029479026689

https://www.engraved.blog/building-a-virtual-machine-inside/

https://twitter.com/317070

https://twitter.com/zehavoc/status/1599193444043268096

https://twitter.com/yoavgo/status/1598360581496459265

https://twitter.com/yoavgo/status/1599037412411596800

https://twitter.com/yoavgo/status/1599045344863879168

https://twitter.com/natfriedman/status/1598477452661383168

https://twitter.com/conradev/status/1598487973351362561/photo/1

https://twitter.com/zswitten/status/1598100186605441024

https://twitter.com/CatEmbedded/status/1599141379879600128/photo/2

https://twitter.com/mattshumer_/status/1599175127148949505

https://twitter.com/vaibhavk97/status/1598930958769860608/photo/1

https://twitter.com/dan_abramov/status/1598800508160024588/photo/1

https://twitter.com/MinqiJiang/status/1598832656422432768/photo/2

https://twitter.com/zswitten/status/1598088280066920453

https://twitter.com/m1guelpf/status/1598203861294252033/photo/1

https://twitter.com/SilasAlberti/status/1598257908567117825/photo/1

https://twitter.com/gf_256/status/1598962842861899776/photo/1

https://twitter.com/zswitten/status/1598088267789787136

https://twitter.com/gf_256/status/1598178469955112961/photo/1

  continue reading

177 afleveringen

Artwork
iconDelen
 
Manage episode 351308803 series 2974171
Inhoud geleverd door Yannic Kilcher. Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Yannic Kilcher of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.

#chatgpt #ai #openai

ChatGPT, OpenAI's newest model is a GPT-3 variant that has been fine-tuned using Reinforcement Learning from Human Feedback, and it is taking the world by storm!

Sponsor: Weights & Biases

https://wandb.me/yannic

OUTLINE:

0:00 - Intro

0:40 - Sponsor: Weights & Biases

3:20 - ChatGPT: How does it work?

5:20 - Reinforcement Learning from Human Feedback

7:10 - ChatGPT Origins: The GPT-3.5 Series

8:20 - OpenAI's strategy: Iterative Refinement

9:10 - ChatGPT's amazing capabilities

14:10 - Internals: What we know so far

16:10 - Building a virtual machine in ChatGPT's imagination (insane)

20:15 - Jailbreaks: Circumventing the safety mechanisms

29:25 - How OpenAI sees the future

References:

https://openai.com/blog/chatgpt/

https://openai.com/blog/language-model-safety-and-misuse/

https://beta.openai.com/docs/model-index-for-researchers

https://scale.com/blog/gpt-3-davinci-003-comparison#Conclusion

https://twitter.com/johnvmcdonnell/status/1598470129121374209

https://twitter.com/blennon_/status/1597374826305318912

https://twitter.com/TimKietzmann/status/1598230759118376960/photo/1

https://twitter.com/_lewtun/status/1598056075672027137/photo/2

https://twitter.com/raphaelmilliere/status/1598469100535259136

https://twitter.com/CynthiaSavard/status/1598498138658070530/photo/1

https://twitter.com/tylerangert/status/1598389755997290507/photo/1

https://twitter.com/amasad/status/1598042665375105024/photo/1

https://twitter.com/goodside/status/1598129631609380864/photo/1

https://twitter.com/moyix/status/1598081204846489600/photo/2

https://twitter.com/JusticeRage/status/1598959136531546112

https://twitter.com/yoavgo/status/1598594145605636097

https://twitter.com/EladRichardson/status/1598333315764871174

https://twitter.com/charles_irl/status/1598319027327307785/photo/4

https://twitter.com/jasondebolt/status/1598243854343606273

https://twitter.com/mattshumer_/status/1598185710166896641/photo/1

https://twitter.com/i/web/status/1598246145171804161

https://twitter.com/bleedingedgeai/status/1598378564373471232

https://twitter.com/MasterScrat/status/1598830356115124224

https://twitter.com/Sentdex/status/1598803009844256769

https://twitter.com/harrison_ritz/status/1598828017446371329

https://twitter.com/parafactual/status/1598212029479026689

https://www.engraved.blog/building-a-virtual-machine-inside/

https://twitter.com/317070

https://twitter.com/zehavoc/status/1599193444043268096

https://twitter.com/yoavgo/status/1598360581496459265

https://twitter.com/yoavgo/status/1599037412411596800

https://twitter.com/yoavgo/status/1599045344863879168

https://twitter.com/natfriedman/status/1598477452661383168

https://twitter.com/conradev/status/1598487973351362561/photo/1

https://twitter.com/zswitten/status/1598100186605441024

https://twitter.com/CatEmbedded/status/1599141379879600128/photo/2

https://twitter.com/mattshumer_/status/1599175127148949505

https://twitter.com/vaibhavk97/status/1598930958769860608/photo/1

https://twitter.com/dan_abramov/status/1598800508160024588/photo/1

https://twitter.com/MinqiJiang/status/1598832656422432768/photo/2

https://twitter.com/zswitten/status/1598088280066920453

https://twitter.com/m1guelpf/status/1598203861294252033/photo/1

https://twitter.com/SilasAlberti/status/1598257908567117825/photo/1

https://twitter.com/gf_256/status/1598962842861899776/photo/1

https://twitter.com/zswitten/status/1598088267789787136

https://twitter.com/gf_256/status/1598178469955112961/photo/1

  continue reading

177 afleveringen

Όλα τα επεισόδια

×
 
Loading …

Welkom op Player FM!

Player FM scant het web op podcasts van hoge kwaliteit waarvan u nu kunt genieten. Het is de beste podcast-app en werkt op Android, iPhone en internet. Aanmelden om abonnementen op verschillende apparaten te synchroniseren.

 

Korte handleiding