Player FM - Internet Radio Done Right
Added thirteen weeks ago
Content provided by Jacob. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Jacob or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://nl.player.fm/legal
AI Wanna Talk
15-ish minute breakdowns about the latest in AI, productivity, and more delivered in a way that is easy to understand and implement into your daily life.
10 episodes
All episodes
In this episode of "AI Wanna Talk," host Jacob Norgord explores a wide range of AI developments, focusing on new tools, platforms, and applications. The episode discusses Perplexity's recent activities, including its bid for TikTok, its acquisition of read.cv, and its new sports feature. It also examines new AI models, such as OpenAI’s o3-mini, and Apple's AI-generated news summaries, and features a number of AI-powered tools, including Windsurf, Synthesis Tutor 2.0, FASHN AI, Krea AI, LumaLabs' Ray2, and Grok.

Key topics covered include:
• Perplexity's potential acquisition of TikTok and its purchase of read.cv, and how these might integrate with its search platform.
• A look at Perplexity's new real-time sports update feature, which can be displayed on a lock screen and provide in-depth game statistics, potentially replacing apps like ESPN.
• OpenAI’s o3-mini model and its advantages in speed and efficiency.
• A critique of Apple's AI-generated news summaries and the features of the new iPhone, such as emoji creation and Image Playground.
• An overview of Windsurf, an AI development platform that now has web-searching capabilities, which allow it to research and implement APIs, and that auto-generates memories to improve its programming efficiency.
• An introduction to Synthesis Tutor 2.0, an AI-driven tutor for kids that provides visual content and adapts to individual learning styles.
• A demonstration of FASHN AI’s technology, which allows users to see how clothing designs would look on a model.
• A look at Krea AI’s 3D layering tool, which modifies a static image in real time based on the orientation of a 3D object.
• An overview of LumaLabs' Ray2 video model.
• An update on Grok, which now has its own standalone website as well as an iOS app, and can search both the web and X (formerly Twitter).

The podcast emphasizes how AI is evolving, with Perplexity leading in innovation and companies constantly pushing the boundaries of AI tools and applications.

Links & Resources: • Potential Perplexity Acquisition • Read.cv • OpenAI's o3-mini • Apple Pulls Back Generative AI • Synthesis Tutor 2.0 Demo • FASHN AI • Krea 3D • LumaLabs Ray2 • Grok…
Correction: The Bee device records all participants in a conversation while being able to identify and distinguish your voice from other speakers.

In this episode of "AI Wanna Talk," host Jacob Norgord dives into the latest developments in AI, covering new apps, models, hardware, and even some thought-provoking ideas about the future of work and human interaction.

AI App and Model Updates:
• xAI's Grok App: A detailed look at xAI's Grok app, including its web-searching capabilities and integration with Twitter, offering a real-time information experience. The app allows image uploads and generation and is currently available on iPhone, with a web app coming soon.
• ByteDance AI Video Upscaler: Discussion of ByteDance’s new AI video upscaler, which uses stable diffusion techniques to enhance video quality, potentially impacting platforms like TikTok.
• Cohere’s North Platform: An overview of Cohere’s North platform, an all-in-one AI workspace designed for enterprises. It integrates LLMs, search, and AI agents into workplace tools like Google Drive, Gmail, and GitHub.
• Meta's Byte-Latent Transformer: Explanation of Meta's new byte-latent transformer, which processes data at the byte level, bypassing traditional tokenization. This improves performance, reduces computational resources, and handles multiple languages better.
• Microsoft's Phi Model: Introduction of Microsoft's smaller 14-billion-parameter Phi model, noting its ability to run locally and achieve high performance on benchmarks like MMLU.
• Kokoro 82M Model: Overview of the Kokoro 82M text-to-speech model, which can run locally with only 350MB of RAM, making it ideal for device integration.

AI Hardware and Wearables:
• OMI Device: An exploration of the OMI wearable, a necklace-like device that provides contextually relevant information based on conversations and aims to connect directly to the brain in the future.
• Bee Device: Discussion of the Bee wrist-worn device, which summarizes conversations, suggests to-dos, and creates daily memories while respecting privacy, priced at $50.

Future of AI and Society:
• AI-Driven Advertising: Insight into how advertising might shift to target AI agents rather than humans, as suggested by Perplexity's CEO.
• Changing Time Allocation: A discussion of Paul Graham's tweet about increased time spent at home since 2003, raising questions about the impact of AI on our lives and how we'll spend our time. The podcast also touches on the idea of intentional inconvenience in the future.

Links & Resources: Adobe TransPixar AI Aravind's Advertising Prediction Bytedance's STAR AI Cohere’s North Meta's Byte-Latent Transformer Time Spent at Home xAI Small Models Microsoft Phi-4 Kokoro-82M Wearables omi Bee…
In this episode of "AI Wanna Talk," host Jacob Norgord explores various new developments in AI, focusing on product updates and innovative applications from different companies. This episode emphasizes the evolving capabilities of AI and its potential impact on various industries.

AI Product and Feature Updates:
• OpenAI's Tasks Feature: Discussion of OpenAI's task automation feature, which lets users schedule the AI to perform specific tasks at specific times, such as a weekly weather forecast. This is designed to enhance the value of AI for users with frequent, recurring requests.
• tldraw computer: Introduction to tldraw computer, an online collaborative platform for brainstorming and creating AI and natural language processing workflows. Users can draw out workflows with components like text boxes, images, and audio clips to create complex automations.
• Google's NotebookLM UI: Overview of the new UI for Google's NotebookLM, featuring a layout with sources on the left and document generation tools on the right, including options for creating podcasts, study guides, FAQs, and timelines. The new interface also provides a central pane for source summaries and question prompts.

AI Model and Integration Developments:
• Perplexity's Acquisition of Carbon: Details on Perplexity's acquisition of Carbon, a retrieval engine that connects external data sources to large language models. This will allow Perplexity to integrate data from apps like Notion and Google Docs, creating a centralized hub for user information.
• ElevenLabs Flash Model: Announcement of ElevenLabs' new Flash model, which generates speech in 75 milliseconds, enabling more human-like interactions with AI voice models. This model aims for low latency and is being targeted for integration into various products, such as video games.
• ChatGPT Integration: Discussion of ChatGPT's new feature allowing it to work directly with apps like Apple Notes and Notion, accessing all data within these applications rather than just what is displayed on screen. This feature positions ChatGPT as a central interface for interacting with data across various applications.

AI Agents and Data:
• Firecrawl and AI Agents: Introduction to Firecrawl, a company focused on providing AI models with high-quality data by scraping the web for specific data sets. They are also hiring AI agents (not humans) to work within their system and are paying $10,000 to $15,000 for the use of these agents.
• Vertical AI Agents: Explanation of vertical AI agents, which are specialized AI systems designed for industry-specific tasks. Examples include agents for finance or law that can take action within their respective fields.
• AI Software Creation: Discussion of platforms like Windsurf, Bolt, and Cursor, which allow users without coding experience to create software using AI.
• Tempo Labs: Introduction to Tempo Labs, a code-first alternative to Figma, powered by AI. This platform generates functional code from prototyped user interfaces, allowing users to focus on core ideas rather than code.

AI and Human Cognition:
• Human Brain Processing Speed: Exploration of an article highlighting that the human brain processes information at a rate of only 10 bits per second. The podcast discusses how, despite this slow rate, humans are able to distill vast amounts of data efficiently.
• AI Constraints: Speculation that mimicking human constraints in AI data processing may be key to achieving more nuanced and contextual understanding in AI.

Other Interesting Points:
• Fake AI Band: A story about a person who created a fake band with AI and made $10 million by also creating fake AI fans. This is framed as a humorous example of how AI can be used in unexpected and potentially fraudulent ways.
• The focus on AI Agents: Going into 2025, there…
In this episode of "AI Wanna Talk," host Jacob Norgord dives into the latest advancements in AI, exploring practical applications and significant announcements from major tech companies. This episode covers a range of new AI products, features, and research.

OpenAI's Projects Feature:
• Discussion of OpenAI's "Projects" feature, which allows users to upload files and provide explicit instructions to maintain context throughout a conversation with AI models, addressing the common issue of AI forgetting earlier parts of a conversation. This feature is available for ChatGPT Plus or Pro members, or those with a Team account.

Google's Gemini 2.0 Model and AI Agents:
• An overview of Google's new Gemini 2.0 family of models, focusing on the "agentic era" where AI acts proactively on the user's behalf.
• Details on the experimental Gemini 2.0 Flash model, a smaller model designed for low latency and integration into various Google experiences, such as Google Sheets, Docs, and Search.
• Explanation of the "Whisk" experiment, which allows users to combine objects from multiple images.
• Information on Google's new state-of-the-art video model, Veo 2, a competitor to OpenAI’s Sora.

xAI's Grok and Mainframe's AI Agents:
• Announcement that xAI's Grok AI assistant is now free for all X users, highlighting its ability to access real-time data from the web and the X platform.
• An introduction to Mainframe, a company developing AI agents that work without human intervention, focusing on their first-stage rollout called "Cobbot," which will consist of a suite of AI agents to accelerate teams.

AI and Employment:
• Discussion of how the company CLA is using AI to boost productivity and potentially replace roles, and the potential implications for the job market and quality of output.

Particle News App:
• Highlight of the Particle News app, which uses a TikTok-like algorithm for personalized news feeds and includes features like article narration and AI-powered Q&A.

Social Media and Teen Depression:
• Exploration of data from Jonathan Haidt's "The Anxious Generation," revealing a correlation between the rise of smartphones and social media and the increase in teen depression.

Links: Google Labs Veo 2 Waitlist Grok Particle News App Twitter thread on "The Anxious Generation"…
ElevenLabs, Amazon’s “Nova” Model, ChatGPT Pro, Microsoft Copilot Vision, Llama 3.3 70B, OpenAI’s Sora, and More 21:11
(Definitely meant 25th power, not 25th degree, toward the end of the episode.)

In this episode of "AI Wanna Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. This episode covers several major developments in the AI landscape:

ElevenLabs' Innovation in Audio AI
• ElevenLabs has launched an AI Podcast Generator through their ElevenReader iOS app, enabling podcast creation from various text sources in 32 languages.
• The company has also introduced a platform for building custom AI agents with configurable voices and response styles.

Amazon and OpenAI's New Models
• Amazon has introduced Nova, their foundational model focused on math, science, coding, and reasoning tasks.
• OpenAI has launched a $200/month ChatGPT Pro tier, providing advanced access to GPT-4's capabilities.

Microsoft and Meta's Developments
• Microsoft's Copilot Vision enables screen-aware AI assistance within the Edge browser.
• Meta's Llama 3.3 70B demonstrates improved efficiency through quality training data.

Productivity and On-Device AI
• The Twos app introduces PAL (Personal Active List) for AI-powered task management.
• Apollo AI brings on-device AI capabilities to iOS devices.

Google's Advances
• Gemini exp-1206 features a 2 million token context window, surpassing ChatGPT 4.0 on LM Arena.
• The Illuminate experiment enables podcast creation with customizable styles.

Breakthrough Technologies
• OpenAI's Sora introduces advanced text-to-video generation capabilities.
• Google's Willow quantum computing chip achieves significant computational breakthroughs.
• Sundar Pichai proposes space-based quantum computing collaboration with Elon Musk.

Links: Apollo AI Copilot Vision ElevenLabs Google Illuminate ElevenReader (iOS and Android) OpenAI Sora Twos…
OpenAI vs. Anthropic, Claude's "Styles" and "MCP", Microsoft AI's "Long-Term Memory", and Bronze's Chroma Acquisition 21:32
In this episode of "AI Wanna Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. This episode focuses on four major developments in the AI landscape:

Anthropic AI's Claude New Features
• Anthropic AI has introduced two new features for its Claude AI chatbot: Styles and Model Context Protocol (MCP).
• Styles enables users to customize how Claude responds using presets like "concise," "explanatory," or "formal."
• Model Context Protocol (MCP) acts as a "universal translator" for AI and data sources, allowing Claude to connect with external sources like files or websites and interact with them.
• MCP enables Claude to perform complex tasks such as generating images based on user requests, writing code, and integrating images into websites.

Microsoft AI's Apparent Long-Term Memory Breakthrough
• Microsoft AI CEO Mustafa Suleyman believes long-term memory is the crucial missing element in current AI chatbots.
• Microsoft AI is working on incorporating long-term memory into its Copilot chatbot, enabling it to retain information from previous conversations and use it to provide more personalized and accurate responses.
• The goal is to eliminate the need for users to constantly re-explain context and make interactions with AI more natural and efficient.

Bronze AI Acquires Chroma
• Bronze AI, known for its innovative Bronze file format that creates dynamic music experiences, has acquired Chroma, a company specializing in audiovisual entertainment for mobile devices.
• The acquisition suggests potential for combining Bronze's evolving music with Chroma's visual expertise, creating a multisensory experience for listeners.
• The collaboration may push the boundaries of art by blending music and visuals in innovative ways.

Links: More info on Claude Styles More info on Model Context Protocol (MCP) Pi Mustafa Suleyman Interview Excerpt Dot by New Computer Bronze “Jasmine” by Jai Paul (Bronze Version)…
ChatGPT Now Works With Apps, Google's New Gemini App, Perplexity Shopping, and Hume AI's Storyteller 14:03
In this episode of "AI I Want To Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. The episode covers four major developments in the AI landscape:

ChatGPT App Integrations:
• OpenAI has introduced a new feature allowing ChatGPT to interact with external applications.
• Currently limited to coding apps like Xcode, TextEdit, iTerm2 Terminal, and VS Code.
• Provides ChatGPT with context from the user's active code, improving code-related responses.
• Although currently focused on coding, it has potential for wider application in the future.
• Future possibilities include integration with design tools like Figma and music software like Ableton.

The Google Gemini App:
• Google has released a new app called Gemini, featuring their AI model of the same name.
• The app allows users to message Gemini and request image generation.
• Includes Gemini Live, enabling real-time conversational interaction with the AI.
• Notable for high-quality voices, rapid response times, and internet search capability.
• However, the live chat feature might provide inaccurate information for niche queries without citing sources.

Perplexity Shopping:
• Perplexity introduces Perplexity Shopping, a streamlined platform for product research and purchase.
• Aggregates relevant product information for easy comparison and purchase without navigating multiple websites.
• Requires a Perplexity Pro membership for direct purchases through the platform.

Hume AI's Storyteller Feature:
• Hume AI is an AI company specializing in voice AI with an emphasis on emotional understanding.
• Their iPhone app features a storytelling AI that generates images to accompany its narratives.
• Highlights the potential of AI for innovative storytelling through the combination of voice and image generation.

Links: ChatGPT Work with Apps Google Gemini App (Apple) Google Gemini App (Android) Perplexity Shopping Hume AI…
ChatGPT Search, Claude Visual PDFs, Ideogram Canvas, Runway Act-One, xAI's API, and Google Learn About 11:26
In this episode of "AI I Want To Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. The episode covers six major developments in the AI landscape:

ChatGPT Search:
• Enables ChatGPT to search the web to inform its answers, compensating for its knowledge cutoff date.
• Offers faster search speeds compared to ChatGPT's previous web search feature.
• Comparable to Perplexity in speed now, and potentially faster in the future.
• Provides a Chrome extension to make it the default search engine.

Claude's PDF Capabilities:
• Introduces improved PDF handling, enabling Claude to understand PDFs with non-typed text, including handwritten notes.
• Overcomes limitations of previous text-extraction methods.
• Expands possibilities for using Claude with handwritten notes and other non-typed PDFs.

Ideogram Canvas:
• Launches a new feature called Canvas, described as an infinite creative board for organizing, generating, editing, and combining images.
• Offers a gridless board for uploading and manipulating multiple images to inform AI image creation.
• Potentially includes text functionality for creating custom fonts.

Runway Act-One:
• Unveils Act-One, a tool that allows users to record themselves and use their facial expressions to animate characters.
• Eliminates the need for CGI or motion capture for creating animated characters.
• Offers a quick and innovative way to create animated content for various media.

xAI's API for Grok 2:
• Releases an API for Grok 2, enabling developers to integrate xAI's Grok AI into their platforms.
• Offers access to Grok, the chatbot accessible through Twitter or X, for building unique applications.
• Provides developers with a more distinctive chatbot option for specific use cases.

Google Learn About:
• Introduces a new Google experiment called Learn About, designed for creating personalized learning curriculums.
• Allows users to input text and images to generate a customized learning path.
• Provides an AI-powered resource for effective learning, tailored to individual needs and preferences.

Links mentioned in the episode: Ideogram Canvas Video Ideogram Runway Act One Video Runway Google Learn About…
Anthropic's New Models, "Computer Use", Perplexity App, ProSearch, and ElevenLabs Voice Design 10:00
In this episode of "AI I Want To Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications and potential impact on productivity. The episode covers three major developments in the AI landscape:

Anthropic's Claude Update:
• A new version of Claude with improved reasoning and coding capabilities.
• Introduction of the "computer use" feature, allowing Claude to control a user's computer.
• Discussion of potential applications in various industries and companies.
• Acknowledgment of both the exciting possibilities and potential risks associated with this technology.

Perplexity Pro Search Improvement:
• Enhanced generative and multi-step reasoning capabilities.
• Demonstration of its power through an example from Perplexity's CEO, creating a comprehensive table of key takeaways from Jeff Bezos' shareholder letters.

ElevenLabs' Voice Design Feature:
• Introduction of a new tool allowing users to create custom AI voices using prompts.
• Brief overview of its potential applications and implications for voice technology.

Links mentioned in the episode: agent.exe Perplexity Pro Search Example…
In the debut episode of “AI I Want To Talk,” host Jacob Norgord delves into the realms of AI and productivity, exploring their fascinating intersection. His goal: to simplify AI advancements and make them accessible for everyday use.

Jacob introduces NotebookLM, Google’s AI-powered note-taking tool, along with its latest updates. Unlike ChatGPT or Claude, it integrates various information sources, enabling AI interaction with personalized context. NotebookLM’s “audio overview” feature creates insightful podcasts from user-provided information. Recent upgrades allow source-specific focus, enhancing learning experiences.

The episode covers Microsoft’s improved Copilot, developed by former Inflection co-founder Mustafa Suleyman. It shares similarities with Pi, known for concise responses and superior voice quality.

Jacob discusses prompt engineering’s evolution. He advocates for natural language communication with AI, moving away from rigid syntax. Generative AI’s probabilistic nature enables this shift. He emphasizes explaining tasks to AI as you would to a human, providing context for better results. Jacob cites Ideogram as an example of effective AI interaction, stressing the importance of holistic task delegation.

Links mentioned in the episode: • NotebookLM • Microsoft Copilot • Pi • Ideogram…