Artwork

Inhoud geleverd door Iain Bethune and Iain Bethune (ibethune@exseed.ed.ac.uk). Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Iain Bethune and Iain Bethune (ibethune@exseed.ed.ac.uk) of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.
Player FM - Podcast-app
Ga offline met de app Player FM !

Adventures in the Biology trade : Bioinformatics in the Petabyte era (60 mins, ~42 MB)

1:00:00
 
Delen
 

Manage episode 205984210 series 2307601
Inhoud geleverd door Iain Bethune and Iain Bethune (ibethune@exseed.ed.ac.uk). Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Iain Bethune and Iain Bethune (ibethune@exseed.ed.ac.uk) of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.
Bioinformatics and more widely Computational Biology is a largely data-driven Science. The array of high-throughput technology platforms in the last 10 years mean that the amount of data being generated in this field is likely to enter into Exabytes by 2020. The challenges associated with this are quite different from the data sets generated by High Energy Physics or Astrophysics in that they tend to gathered from a wide variety of different providers. Meta-analyses of these data sets can give startling new insights but come with many caveats - in particular that the quality of the data from each provider can be highly variable. I will spend some time talking about one set of experiences I have dealing with one specific technology platform and in particular how it is clear that the detection of bias in data sets is a key element of any high-throughput analysis. This talk was given as part of our MSc in HPC's 'HPC Ecosystem' course.
  continue reading

19 afleveringen

Artwork
iconDelen
 
Manage episode 205984210 series 2307601
Inhoud geleverd door Iain Bethune and Iain Bethune (ibethune@exseed.ed.ac.uk). Alle podcastinhoud, inclusief afleveringen, afbeeldingen en podcastbeschrijvingen, wordt rechtstreeks geüpload en geleverd door Iain Bethune and Iain Bethune (ibethune@exseed.ed.ac.uk) of hun podcastplatformpartner. Als u denkt dat iemand uw auteursrechtelijk beschermde werk zonder uw toestemming gebruikt, kunt u het hier beschreven proces https://nl.player.fm/legal volgen.
Bioinformatics and more widely Computational Biology is a largely data-driven Science. The array of high-throughput technology platforms in the last 10 years mean that the amount of data being generated in this field is likely to enter into Exabytes by 2020. The challenges associated with this are quite different from the data sets generated by High Energy Physics or Astrophysics in that they tend to gathered from a wide variety of different providers. Meta-analyses of these data sets can give startling new insights but come with many caveats - in particular that the quality of the data from each provider can be highly variable. I will spend some time talking about one set of experiences I have dealing with one specific technology platform and in particular how it is clear that the detection of bias in data sets is a key element of any high-throughput analysis. This talk was given as part of our MSc in HPC's 'HPC Ecosystem' course.
  continue reading

19 afleveringen

Alle afleveringen

×
 
Loading …

Welkom op Player FM!

Player FM scant het web op podcasts van hoge kwaliteit waarvan u nu kunt genieten. Het is de beste podcast-app en werkt op Android, iPhone en internet. Aanmelden om abonnementen op verschillende apparaten te synchroniseren.

 

Korte handleiding