[QA] Do Unlearning Methods Remove Information From Language Model Weights? Arxiv Papers podcast

Science Igor Melnyk

Content provided by Igor Melnyk. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Igor Melnyk or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://nl.player.fm/legal.

Arxiv Papers
[QA] Do Unlearning Methods Remove Information from Language Model Weights?

6d ago 8:03

The paper evaluates unlearning techniques for large language models and finds that current methods do not adequately remove sensitive information from the weights: attackers can recover a significant portion of the pre-unlearning accuracy.

https://arxiv.org/abs/2410.08827

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

1601 episodes

#Science #Igor Melnyk
