Have you ever wondered how top companies are harnessing the power of data to drive innovation and stay ahead of the competition? In this podcast, we’ll be speaking to some of the best industry minds and unlocking the secrets to leveraging data like never before. Ready for a data deep dive?
…
continue reading
Welcome to Data Brew by Databricks with Denny and Brooke! In this series, we explore various topics in the data and AI community and interview subject matter experts in data engineering/data science. So join us with your morning brew in hand and get ready to dive deep into data + AI! For this first season, we will be focusing on lakehouses – combining the key features of data warehouses, such as ACID transactions, with the scalability of data lakes, directly against low-cost object stores.
…
continue reading
1
Why I Bet My Career on Databricks!
46:51
46:51
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
46:51
Whether you're an entry-level data engineer, analyst, or BI developer, mastering Databricks could significantly boost your career. Tune in to the second episode of the Databricks diaries, with special guest, Holly Smith - Staff Developer Advocate at Databricks! Learn why she bet her career on Databricks and how you can get started.…
…
continue reading
1
Databricks vs Microsoft Fabric!
1:06:09
1:06:09
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
1:06:09
Hosted by Daniel Thornton, this episode features industry professionals Michael Amadi and Giles Middleton as they delve into the world of data platforms. Join us as we compare Databricks and Microsoft Fabric, sharing their insights and preferences. Discover which platform is the best fit for different data needs and learn about the key factors to c…
…
continue reading
1
Kumo AI & Relational Deep Learning | Data Brew | Episode 34
43:27
43:27
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
43:27
In this episode, Jure Leskovec, Co-founder of Kumo AI and Professor of Comuter Science at Stanford University, discusses Relational Deep Learning (RDL) and its role in automating feature engineering. Highlights include: - How RDL enhances predictive modeling. - Applications in fraud detection and recommendation systems. - The use of graph neural ne…
…
continue reading
Welcome to The Databricks Diaries. Have you ever wondered how top companies are harnessing the power of data to drive innovation and stay ahead of the competition? In this podcast, we’ll be speaking to some of the best industry minds and unlocking the secrets to leveraging data like never before. In each episode of the Databricks diaries, we’re ope…
…
continue reading
1
LLMs: Internals, Hallucinations, and Applications | Data Brew | Episode 33
38:50
38:50
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
38:50
Our fifth season dives into large language models (LLMs), from understanding the internals to the risks of using them and everything in between. While we're at it, we'll be enjoying our morning brew. In this session, we interviewed Chengyin Eng (Senior Data Scientist, Databricks), Sam Raymond (Senior Data Scientist, Databricks), and Joseph Bradley …
…
continue reading
1
Demonstrate–Search–Predict Framework | Data Brew | Episode 32
33:14
33:14
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
33:14
We will dive into LLMs for our fifth season, from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew. In this session, we interviewed Omar Khattab - Computer Science Ph.D. Student at Stanford, creator of DSP (Demonstrate–Search–Predict Framework), to discuss DSP, c…
…
continue reading
1
Generative AI Risks | Data Brew | Episode 31
34:38
34:38
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
34:38
We will dive into LLMs for our fifth season, from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew. In this session, we interviewed Yaron Singer, CEO of Robust Intelligence, Professor of Computer Science at Harvard University, and guest of Data Brew Season 3 (our…
…
continue reading
1
John Snow Labs & SparkNLP | Data Brew | Episode 30
43:17
43:17
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
43:17
We are back and we will dive into LLMs from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew. In this session, we interviewed David Talby who is the CTO at John Snow Labs; they help healthcare & life science companies put AI to good use. David's interests include…
…
continue reading
1
Data Brew Season 4 Episode 6: Professional Athletes
35:49
35:49
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
35:49
For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew. Shayna Powless and Eli Ankou, professional cyclist for L39ion of Los Angeles and defensive tackle for the Buffalo Bills, respectively, provide valuable insight on how professional athlete…
…
continue reading
1
Data Brew Season 4 Episode 5: Public Health: Education, Access, and Policy
34:39
34:39
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
34:39
For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew. Matt Willis, Marin County Public Health Officer, shares the three pillars of public health: education, access, and policy, and the critical role data plays in addressing the COVID-19 pand…
…
continue reading
1
Data Brew Season 4 Episode 4: 1283 Days of Running (and Counting)
35:54
35:54
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
35:54
For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew. Running the length of the US every year, Alexandra Matthiesen shares her motivational secrets for running 1,283 consecutive days (and counting!) and redefining physical and mental limits.…
…
continue reading
1
Data Brew Season 4 Episode 3: Last Man Standing
41:20
41:20
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
41:20
For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew. Winner of the infamous Last Man Standing race (running 246 miles in 59 hours), Guillaume merges the world of competitive long-distance running with data science to push the boundaries of …
…
continue reading
1
Data Brew Season 4 Episode 2: NBA Analytics
30:16
30:16
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
30:16
For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew. Alexander Powell chronicles the evolution of sports analytics and how professional sports teams use data as a competitive advantage. See more at databricks.com/data-brew…
…
continue reading
1
Data Brew Season 4 Episode 1: Reducing Injury & Increasing Retention of Industrial Athletes
33:58
33:58
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
33:58
For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew. Globally, 38,000 people get hurt on the job every hour. In the United States alone, over $250 billion dollars is spent on workplace injury annually. Sean Petterson, founder and CEO of Str…
…
continue reading
1
Data Brew Season 3 Episode 6: Open Source
33:49
33:49
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
33:49
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. For our season 3 finale, Nithya Ruff discusses the open-source ecosystem, ways to contribute to open-source projects (hint: …
…
continue reading
1
Data Brew Season 3 Episode 5: Sustainability & Sake
32:26
32:26
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
32:26
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. We interview Junta Nakai in our most unique location yet - Brooklyn Kura - the first non-Japanese sake distillery in New Yor…
…
continue reading
1
Data Brew Season 3 Episode 4: Executive Education
38:46
38:46
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
38:46
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. Did you know that the average tenure of a board member is longer than the average tenure of a marriage in the United States?…
…
continue reading
1
Data Brew Season 3 Episode 3: 3 T’s to Securing AI Systems: Tests, tests, and more tests
35:01
35:01
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
35:01
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. What does it mean to make your machine learning system “production-ready”? Yaron Singer walks us through the infrastructure,…
…
continue reading
1
Data Brew Season 3 Episode 2: Data Culture Outside ‘The Valley’
35:39
35:39
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
35:39
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. Have you ever had a spam call automatically blocked for you? You can thank First Orion for that - in one day they blocked or…
…
continue reading
1
Data Brew Season 3 Episode 1: Disrupt: Challenge your Business Assumptions
29:45
29:45
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
29:45
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. In this season opener, Elena Donio shares her experience using data and domain knowledge to disrupt the traditional service …
…
continue reading
1
Data Brew Season 2 Episode 9: Data Driven Software
31:12
31:12
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
31:12
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. We branch, version, and test our code, but what if we treated data lik…
…
continue reading
1
Data Brew Season 2 Episode 8: Feature Engineering
31:17
31:17
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
31:17
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Is there ever a “one-size fits all” approach for feature engineering? …
…
continue reading
1
Data Brew Season 2 Episode 7: Interpretable Machine Learning
37:07
37:07
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
37:07
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. What does it mean for a model to be “interpretable”? Ameet Talwalkar s…
…
continue reading
1
Data Brew Season 2 Episode 6: AutoML
35:55
35:55
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
35:55
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Erin LeDell shares valuable insight on AutoML, what problems are best …
…
continue reading
1
Data Brew Season 2 Episode 5: ML Applications
32:40
32:40
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
32:40
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Good machine learning starts with high quality data. Irina Malkova sha…
…
continue reading
1
Data Brew Season 2 Episode 4: Hyperparameter and Neural Architecture Search
33:25
33:25
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
33:25
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Liam Li is a leading researcher in the fields of hyperparameter optimi…
…
continue reading
1
Data Brew Season 2 Episode 3: Infrastructure for ML
30:34
30:34
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
30:34
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Adam Oliner discusses how to design your infrastructure to support ML,…
…
continue reading
1
Data Brew Season 2 Episode 2: Data Ethics
25:47
25:47
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
25:47
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Have you ever wondered how your purchasing behavior may reveal protect…
…
continue reading
1
Data Brew Season 2 Episode 1: ML in Production
30:49
30:49
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
30:49
For our second season, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. In the season opener, Matei Zaharia discusses how he entered the field of ML, best …
…
continue reading
1
Data Brew Season 1 Episode 6: Journey of Big Data
40:16
40:16
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
40:16
Jules Damji and Tathagata Das guide us through their journey in big data and the evolution of data architecture in the past 30 years. They discuss some of the biggest changes in industry they’ve seen, as well as trends to look forward to in the coming years. This is a fun episode connecting all four authors of the Learning Spark, 2nd Edition book. …
…
continue reading
1
Data Brew Season 1 Episode 5: Combining Machine Learning and MLflow with your Lakehouse
36:00
36:00
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
36:00
Ellissa Verseput, ML Engineer at Quby, joins Denny and Brooke to discuss how Quby leverages ML to extract additional value from their data lake and how they manage this process. See more at databricks.com/data-brewDoor Databricks
…
continue reading
1
Data Brew Season 1 Episode 4: BI on Data Lakes - Making it Real for Retail
29:05
29:05
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
29:05
In this session, we discuss the lessons learned with Lara Minor, Senior Enterprise Data Manager at Columbia Sportswear, on how her team achieved a 70% reduction in pipeline creation time. This had reduced ETL workload times from four hours with previous data warehouses to minutes enabling near real-time analytics. Her team migrated from multiple le…
…
continue reading
1
Data Brew Season 1 Episode 3: Demystifying Delta Lake
25:51
25:51
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
25:51
Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake offers ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. It runs on top of your existing data lake and is fully compatible with Apache Spark APIs. For our “Demystifying Delta Lake” session, we will interview Mic…
…
continue reading
1
Data Brew Season 1 Episode 2: Welcome to Lakehouse
26:10
26:10
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
26:10
Legacy approaches have failed to deliver on the promise of a single data architecture that can support every downstream use case from BI to AI. Lakehouse aspires to address this by combining the best of data warehouses and data lakes. Ali Ghodsi, Co-Founder and CEO of Databricks, and David Meyer, SVP of Product at Databricks, explain how. See more …
…
continue reading
1
Data Brew Season 1 Episode 1: From data warehousing to data lakes in 40 minutes
44:48
44:48
Later Afspelen
Later Afspelen
Lijsten
Vind ik leuk
Leuk
44:48
In our inaugural episode, we’d like to welcome data warehouse luminaries Barry Devlin, Susan O’Connell, and Donald Farmer to discuss the evolution of data warehouses, data lakes, and lakehouses. See more at databricks.com/data-brewDoor Databricks
…
continue reading