\u003C/p>\u003Cp>How AI can level the playing field between top performers and less experienced staff\u003C/p>\u003Cp>The potential for massive cost savings and efficiency gains across various industries\u003C/p>\u003Cp>The ethical implications of AI in the workplace - threat or opportunity?\u003C/p>\u003Cp>Real-world implementation strategies and challenges\u003C/p>\u003Cp>\u003Cbr />\u003C/p>\u003Cp>Whether you're a CEO looking to gain a competitive edge, an HR director aiming to optimize your workforce, or simply curious about the future of work, this episode is a must-listen. We'll separate hype from reality and give you actionable insights on how AI might transform your professional life.\u003C/p>\u003Cp>Tune in for a fascinating glimpse into a future where humans and AI work side by side. \u003C/p>\u003Cp>The workplace revolution is here - are you ready?\u003C/p>","episodic","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/images/bb0d16b6-e14e-4b9f-8a31-8f81469302e9.jpg",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},"storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/images/bb0d16b6-e14e-4b9f-8a31-8f81469302e9_80.jpg","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/images/bb0d16b6-e14e-4b9f-8a31-8f81469302e9_180.jpg","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/images/bb0d16b6-e14e-4b9f-8a31-8f81469302e9_240.jpg","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/images/bb0d16b6-e14e-4b9f-8a31-8f81469302e9_600.jpg","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/images/bb0d16b6-e14e-4b9f-8a31-8f81469302e9_1280.jpg","https://cloud.mave.digital/58641","Sergio Voropaev",false,32,2,{"rate":24,"count":22},5,[26,29,32],{"name":27,"subcategory":28,"is_main":20},"Образование","Самосовершенствование",{"name":30,"subcategory":31,"is_main":20},"Бизнес","Управление",{"name":33,"is_main":34},"Технологии",true,[36],1,"Lets connect","ceo@greatleveler.com",{"facebook":40,"twitter":41,"instagram":40,"telegram":42,"vk":40,"patreon":40,"boosty":40},null,"https://x.com/greatlevelercom","https://t.me/greatlevelercom",{"apple_id":44,"apple":45,"google":40,"spotify":46,"yandex":47,"vk":40,"castbox":48,"soundstream":40,"deezer":49,"overcast":50,"podcastAddict":50,"pocketCasts":50,"youtube":51,"soundcloud":40,"zvuk":50,"youtubeMusic":52,"myBook":40,"litres":53},1774183463,"https://podcasts.apple.com/ru/podcast/ai-synergy/id1774183463","https://open.spotify.com/show/2799vuVV6ZM7ipuxqHsEmM?si=LFkhdF-2QqWpMAE5xAC0FQ&nd=1&dlsi=0518d31c491e497b","https://music.yandex.ru/album/33938902","https://castbox.fm/channel/id6318548?country=ru","https://deezer.com/show/1001326571","","https://www.youtube.com/playlist?list=PLinPRXtk3-haYmjeEt_urdTKOji-r07l5","https://music.youtube.com/playlist?list=PLinPRXtk3-haYmjeEt_urdTKOji-r07l5","https://www.litres.ru/podcast/sergio-voropaev/ai-synergy-71218483/",[55],{"id":56,"podcast_id":7,"name":19,"info":57,"image":58,"createdAt":59,"updatedAt":60,"contact_id":40},"dba1999e-f8b8-4181-9f09-f7bd44a86280","Founder of Great Leveler AI - a platform helping tech leaders boost productivity by 43% through AI implementation. Former Swiss VC mentor, successful founder of multiple tech startups, and expert in AI business integration and scaling.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/contacts/0361e8d5-08a5-4aab-b563-7c950643919e.jpeg","2024-11-14T10:46:22.583Z","2024-11-14T10:46:22.727Z",{"id":62,"number":63,"season":36,"title":64,"description":65,"type":66,"image":11,"audio":67,"duration":68,"is_explicit":20,"code":63,"publish_date":69,"listenings":70,"is_transcription_hidden":20,"text":71,"is_private":20,"plans":40,"video":40,"images":72,"reactions":73,"chapters":79,"relevantEpisodes":80},"cc2a2fe8-8a86-4472-83c4-c7349da3c165",13,"SoundStorm Unleashed: Revolutionizing Audio Generation with Lightning Speed","Dive into the frontier of #audio innovation as we break down\u003Cp>This cutting-edge model generates audio at speeds \u003Cb>100x faster\u003C/b> than previous systems, redefining what's possible in \u003Cb>#music, #podcasts, #games\u003C/b>, and more. Join us as we explore the neural #codecs, parallel #decoding, and confidence-based sampling that make \u003Cb>SoundStorm\u003C/b> so powerful. From hyper-realistic #dialogues to adaptive #soundscapes, discover how this tech could transform #entertainment, #accessibility, and even #healthcare.\u003C/p>","full","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/cc2a2fe8-8a86-4472-83c4-c7349da3c165.mp3",1076,"2024-11-13T15:27:38.972Z",19,"Speaker 00 00:00:00\n\nHey, everyone. Welcome back. Ready to dive deep into some cutting edge audio generation. Always. Okay, perfect. Let's get into it then. Today, we're looking at a paper that's, well, hot off the press. Yeah, like submitted just last month, September 2023. It's called Soundstorm, Efficient Parallel Audio Generation. it's Catchy. Right. But not just the name that's impressive. This thing can generate 30 seconds of audio in just half a second.\n\nSpeaker 01 00:00:27\n\nHang on half a second for 30 seconds of audio. That's got to be, what, like 100 times faster than what we're used to?\n\nSpeaker 00 00:00:33\n\nYou got it a hundred times faster and hold on to your hats because this isn't just for, you know, the hardcore tech folks.\n\nSpeaker 01 00:00:39\n\nYeah, this has got to have implications way beyond that.\n\nSpeaker 00 00:00:41\n\nAbsolutely. Think about it. Anyone who listens to music, podcasts, audio books, even those, you know, robotic customer service voices we all love.\n\nSpeaker 01 00:00:50\n\nIt's true. This could really change how we interact with audio across the board.\n\nSpeaker 00 00:00:53\n\nFor sure. But before we get lost in the future, let's take a step back. Yeah. Why has generating this like high quality long form audio been so tricky until now?\n\nSpeaker 01 00:01:04\n\ngood point imagine you're trying to paint a super detailed picture the more intricate it is the longer it takes right audio is kind of the same makes sense high quality audio you know the kind that sounds natural rich it requires a ton of data older methods had to process all that data bit by bit like painstakingly filling in each little detail of that picture we were talking about Oh, so the longer the audio, the longer it took to make.\n\nSpeaker 00 00:01:31\n\nKind of like choosing between a gourmet meal that takes hours to prep or a quick microwave dinner.\n\nSpeaker 01 00:01:36\n\nThat's a great way to put it. You might get amazing quality with the gourmet meal, but who's got the time these days? Right, speed is everything And that's where neural codecs step in. Think of them as these super efficient translators. They convert that complex audio info into this like streamlined digital code and back again, of course.\n\nSpeaker 00 00:01:54\n\nSort of like .zip files for sound.\n\nSpeaker 01 00:01:56\n\nExactly. Soundstream and ENCODEC are examples of those. Definitely sped things up. But even with them, long, high-quality audio still took ages.\n\nSpeaker 00 00:02:05\n\nAnd that's where Soundstorm comes flying in to save the day. So what's so special about it? What's its secret weapon?\n\nSpeaker 01 00:02:10\n\nWell, Soundstorm's a game changer because it brings a bunch of new ideas to the table. First, it uses what's called a conformer model.\n\nSpeaker 00 00:02:17\n\nconformer. Okay, that sounds complicated.\n\nSpeaker 01 00:02:20\n\nIt's actually pretty cool. Think of it like a master code breaker specifically for audio. It's really good at understanding how these neural codecs we talked about structure their code.\n\nSpeaker 00 00:02:31\n\nSo it's like Soundstorm speaks the same language as the Codex, making communication smoother.\n\nSpeaker 01 00:02:36\n\nExactly. And that's just the start. It also uses a divide and conquer strategy. We call it parallel decoding.\n\nSpeaker 00 00:02:42\n\ndivide and conquer. Like when you're building a house, you've got different teams for the plumbing, the electrical framing, all working at the same time.\n\nSpeaker 01 00:02:50\n\nYou got it. Instead of predicting one tiny bit of audio info at a time, it tackles multiple bits all at once. It is. And there's one more trick up its sleeve. Confidence-based sampling. Imagine working on a puzzle, right? You wouldn't just try every piece randomly. You'd focus on the ones you were most sure about.\n\nSpeaker 00 00:03:10\n\nSo Soundstorm zeroes in on the parts of the audio where it's most confident. Makes sense.\n\nSpeaker 01 00:03:15\n\nExactly. And it refines the results over a few iterations, kind of like going back over those puzzle pieces to make sure they fit perfectly.\n\nSpeaker 00 00:03:22\n\nSo we've established that Soundstorm is fast, efficient, focused, but does it actually sound good? I mean, quality matters too.\n\nSpeaker 01 00:03:30\n\nOh, this is where it gets really exciting. Soundstorm isn't just about speed. It actually makes the audio better in a bunch of ways. The research paper highlights some key findings that are pretty mind-blowing, to be honest.\n\nSpeaker 00 00:03:42\n\nlay it on me. What kind of improvements are we talking about?\n\nSpeaker 01 00:03:45\n\nFirst off, speech generated with Soundstorm is way more intelligible. Think about those robotic voices you hear in like older text-to-speech systems.\n\nSpeaker 00 00:03:54\n\nOh yeah, this could be pretty rough.\n\nSpeaker 01 00:03:55\n\nSoundstorm moves us closer to natural sounding speech. So ideal for audio books, virtual assistants, things like that.\n\nSpeaker 00 00:04:02\n\nNo more robot voices. I'm sold already.\n\nSpeaker 01 00:04:05\n\nRight. Another win. Voice preservation. It's amazing at keeping a speaker's voice consistent, even over long stretches of audio. Imagine listening to an audiobook and the narrator's voice suddenly changes halfway through.\n\nSpeaker 00 00:04:19\n\nThat would be awful. Totally pull you out of the story.\n\nSpeaker 01 00:04:22\n\nExactly. And it's not just voices. Soundstorm is great at keeping the whole acoustic environment consistent. Background noise, textures, all that stuff stays consistent, creates a more immersive experience.\n\nSpeaker 00 00:04:34\n\nYeah, those background sounds can make a big difference.\n\nSpeaker 01 00:04:36\n\nHuge. Think movies, video games. Soundstorm makes the whole soundscape feel more believable and immersive.\n\nSpeaker 00 00:04:43\n\nSo we've got speed, intelligibility, voice consistency, a better soundscape. Sounds like a dream come true for anyone working with audio. Now, the paper also mentions Soundstorm being a big deal for something called AudioLM. What's that all about?\n\nSpeaker 01 00:04:57\n\nOh, that's interesting. You know how sometimes you have a tool that's great, but a bit slow? Audio LM is a bit like that, a powerful system for audio generation, but not the fastest. Well, Soundstorm can give it a turbo boost, make it lightning fast without losing any of that quality.\n\nSpeaker 00 00:05:11\n\nSo it's like the ultimate upgrade for AudioLM. Now, I'm super curious about this whole thing with Soundstorm generating entire dialogues. How does that even work?\n\nSpeaker 01 00:05:20\n\nAll right, picture this. You have a movie script with all the dialogue and who's speaking when, right? And you have short voice samples of each actor. You feed that into Soundstorm, along with some other AI models that handle stuff like text to speech and natural language processing. And guess what? Soundstorm can actually generate the whole dialogue with each actor's voice, natural pauses, even those ums and uhs that make it sound real.\n\nSpeaker 00 00:05:47\n\nThat's insane. Like something out of a sci-fi movie.\n\nSpeaker 01 00:05:49\n\nIt is pretty wild. And that brings up a crucial part. As this tech evolves, we got to think about both the amazing potential and the potential risks.\n\nSpeaker 00 00:05:59\n\nYou're talking about the misused side, right? Like making fake audio that sounds totally real.\n\nSpeaker 01 00:06:04\n\nExactly. It's super important to remember that any powerful tech can be used for good or bad. But the good news is researchers are already working on ways to safeguard against misuse, like audio watermarking.\n\nSpeaker 00 00:06:16\n\nAudio watermarking. What's that?\n\nSpeaker 01 00:06:18\n\nImagine embedding a hidden signal into the generated audio that lets you know it's not real. Like a digital fingerprint for audio.\n\nSpeaker 00 00:06:26\n\nSo a built-in light detector for sound.\n\nSpeaker 01 00:06:28\n\nExactly. We're talking about having the tools to tell the real from the fake.\n\nSpeaker 00 00:06:33\n\nThat's reassuring. So to sum up, Soundstorm is incredibly fast, can make high quality audio in all these different ways, and people are already working on safeguards against misuse. Feels like we're at the start of an audio revolution.\n\nSpeaker 01 00:06:47\n\nWe definitely are. But to really get the impact, we got to go a little deeper into the technical nitty gritty, ready to pop open the hood and see what makes Soundstorm tick.\n\nSpeaker 00 00:06:56\n\nYou bet. Let's get technical.\n\nSpeaker 01 00:06:58\n\nAll right, let's get under the hood and see what makes Soundstorm tick. Remember those neural codecs we talked about?\n\nSpeaker 00 00:07:02\n\nThe super efficient translators for audio. Yeah, they were pretty cool. Right.\n\nSpeaker 01 00:07:06\n\nWell, Soundstorm doesn't actually create sound waves directly. It generates instructions for this specific neural codec called SoundStream.\n\nSpeaker 00 00:07:14\n\nIt's on stream, okay.\n\nSpeaker 01 00:07:15\n\nAnd to understand Soundstream, we got to talk about something called residual vector quantization, or RVQ for short.\n\nSpeaker 00 00:07:22\n\nRVQ. Catchy.\n\nSpeaker 01 00:07:24\n\nI know, right? But it's not as scary as it sounds. Imagine packing for a trip. Instead of just chucking clothes in your suitcase, you organize everything neatly into different compartments, yeah? That's kind of what RVQ does with audio data. It divides the audio into these small chunks and then uses a series of tools called quantizers to like represent each chunk with a set of tokens.\n\nSpeaker 00 00:07:50\n\nSo like organizing all the audio information into neat little digital compartments.\n\nSpeaker 01 00:07:54\n\nExactly. And each quantizer in this series focuses on a different level of detail. Think of it like starting with a rough sketch and then adding finer details until you get a complete picture.\n\nSpeaker 00 00:08:04\n\nSo it's a step-by-step process getting more precise as it goes.\n\nSpeaker 01 00:08:07\n\nYou got it. And the result is this layered structure of tokens. The tokens from the early stages, they represent the big picture stuff, while the later ones capture those subtle nuances.\n\nSpeaker 00 00:08:20\n\nOkay, I think I'm following. So that's how SoundStream encodes the audio. Where does SoundSquarm come into this? How does it actually use that encoded information to make sound?\n\nSpeaker 01 00:08:28\n\nThat's where Soundstorm's special architecture comes in. It uses something called a conformer model, really good at handling sequential data, you know, like a string of words in a sentence. Right, right. Or in this case, a stream of those SoundStream tokens we were talking about.\n\nSpeaker 00 00:08:42\n\nSo the conformer analyzes the audio tokens and figures out how they fit together to create a coherent sound.\n\nSpeaker 01 00:08:48\n\nExactly. And remember how SoundStream has this layered structure? SoundStorm really takes advantage of that. It pays more attention to the tokens that represent the big picture information, kind of like focusing on the main points of a lecture rather than getting lost in the tiny details. It is. Allows Soundstorm to process info and generate tokens super quickly. And then we've got parallel decoding. That's what really kicks things into overdrive.\n\nSpeaker 00 00:09:15\n\nDivide and conquer, right. So how's that work in practice?\n\nSpeaker 01 00:09:18\n\nImagine a team of artists working on a huge mural. Instead of painting one section at a time, they divide it into smaller sections, and each artist works on their section simultaneously.\n\nSpeaker 00 00:09:29\n\nAh, so Soundstorm predicts multiple audio tokens at once, like those artists working on different parts of the mural.\n\nSpeaker 01 00:09:35\n\nPrecisely. And this parallel approach works because those finer details in the audio are actually somewhat independent of each other once you know the big picture stuff.\n\nSpeaker 00 00:09:45\n\nSo it can predict those details in parallel without messing things up.\n\nSpeaker 01 00:09:49\n\nExactly. That's a big reason why Soundstorm is so fast. But remember, it's not just randomly predicting tokens. It's using that confidence-based sampling we talked about.\n\nSpeaker 00 00:09:58\n\nRight, the picky eater, choosing the best bits.\n\nSpeaker 01 00:10:01\n\nYep. For each position in the audio sequence, Soundstorm predicts a bunch of possible tokens and then checks how confident it is about each one. Kind of like saying, I'm pretty sure this one's right, but not so sure about that one.\n\nSpeaker 00 00:10:13\n\nSo it's choosing its steps carefully, focusing on where it's most certain.\n\nSpeaker 01 00:10:17\n\nYes. And as it goes through those different levels of detail, predicting tokens for each level, it gets more and more confident. The result is a sequence of sound stream tokens that can then be decoded into, well, beautiful, high quality audio.\n\nSpeaker 00 00:10:30\n\nOkay. We've covered the neural codecs, RVQ, the conformer model, parallel decoding, confidence-based sampling, anything else going on behind the scenes, any other secret ingredients?\n\nSpeaker 01 00:10:43\n\nThere's one more key element, the conditioning signal. Think of it like the conductor of an orchestra. It gives Soundstorm directions on what kind of audio to create.\n\nSpeaker 00 00:10:54\n\nSo it tells Soundstorm what to play, kind of like giving a composer a score.\n\nSpeaker 01 00:10:58\n\nExactly. Without the conditioning signal, Soundstorm wouldn't have a clue what to generate. Speech, music, something else entirely. The signal gives it the context, telling it things like the desired voice, the emotional tone, specific sounds to include.\n\nSpeaker 00 00:11:14\n\nlike a set of instructions, a blueprint to follow.\n\nSpeaker 01 00:11:16\n\nPerfect analogy. Now, in the Soundstorm paper, they use what are called semantic tokens as the conditioning signals.\n\nSpeaker 00 00:11:22\n\nSemantic tokens, those are like super condensed summaries of information.\n\nSpeaker 01 00:11:25\n\nthe CliffsNotes for audio capturing the main ideas. But here's the cool part. Where do those semantic tokens come from?\n\nSpeaker 00 00:11:31\n\nHmm, good question.\n\nSpeaker 01 00:11:32\n\nAnother AI model gets involved here, one that's been trained on tons of audio and text.\n\nSpeaker 00 00:11:38\n\nWow, bringing in the big guns.\n\nSpeaker 01 00:11:40\n\nRight. This model can analyze both text and audio. It learns to connect the dots between the words we say and the sounds we make, like learning a new language by listening to someone speak while also reading the transcript.\n\nSpeaker 00 00:11:53\n\nSo the model learns to understand the relationship between written and spoken language.\n\nSpeaker 01 00:11:57\n\nExactly. And once it's got that down, it can generate those semantic tokens, summarizing the key parts of the audio in a way that Soundstorm can understand.\n\nSpeaker 00 00:12:06\n\nSo it's like a super smart AI assistant that listens to the audio and then writes instructions for Soundstorm.\n\nSpeaker 01 00:12:11\n\nExactly. And those Semante tokens act as a guide shaping the audio Soundstorm generates.\n\nSpeaker 00 00:12:17\n\nOK, so we've got the neural codec, the sound stream encoding, the conformer model, parallel decoding, confidence-based sampling, and the conditioning signal. That's a lot of moving parts.\n\nSpeaker 01 00:12:26\n\nIt is, but they all work together in this intricate dance to make Soundstorm so powerful.\n\nSpeaker 00 00:12:31\n\nIt's mind-boggling to think about.\n\nSpeaker 01 00:12:32\n\nIt really is. And it's a real testament to how far AI has come.\n\nSpeaker 00 00:12:36\n\nNow, what really blew me away was Soundstorm's ability to generate entire dialogues. Creating individual sounds or words is one thing, but stringing together a whole conversation, that seems like a whole other level.\n\nSpeaker 01 00:12:49\n\nIt is, and that's exactly what we'll be diving into next. We'll explore how Soundstorm uses all these amazing components to create those realistic sounding conversations.\n\nSpeaker 00 00:13:01\n\nWelcome back. We've dug into the technical side of Soundstorm. Now let's zoom out a bit and see how this tech could change how we experience audio like in our everyday lives. Let's start with entertainment. Imagine watching a movie, but the music isn't just a prerecorded track. It's generated in real time, shifting with the emotions and action on screen.\n\nSpeaker 01 00:13:20\n\nWait hold on the music would actually change based on what's happening in the movie. That's insane.\n\nSpeaker 00 00:13:24\n\nIt would be incredible, like having your own personal composer creating the score as you watch. Total game changer. Totally. And think about video games. Soundstorm could create these hyper-realistic soundscapes that react to everything you do in the game. You'd feel totally immersed in that virtual world. It'd be like the sounds are as dynamic and unpredictable as the game itself.\n\nSpeaker 01 00:13:44\n\nExactly. And not only could it create new experiences, it could also revamp old ones. Think about classic films dubbed in any language you want, but with the voices perfectly synced, full of emotion, like the original performances.\n\nSpeaker 00 00:13:58\n\nThat would be amazing. No more bad dubbing ruining the movie. You could experience any film in its full glory, no matter what language it was in originally.\n\nSpeaker 01 00:14:06\n\nRight. Speaking of language, Soundstorm could really shake things up for accessibility to like audio descriptions for people who are visually impaired. What if those descriptions weren't just, you know, dry facts, but something way more engaging, something that captures the emotions, the nuances of what's happening visually?\n\nSpeaker 00 00:14:24\n\nIt makes such a huge difference. Entertainment would be truly inclusive so everyone could enjoy it to the fullest.\n\nSpeaker 01 00:14:30\n\nExactly. And personalized learning. That's another area where Soundstorm could really shine. Imagine language learning apps where you practice conversations with a virtual tutor who sounds just like a native speaker, giving you real-time feedback, tailoring the lessons just for you.\n\nSpeaker 00 00:14:44\n\nThat would make language learning way more fun and effective, like having a personal language coach right in your pocket.\n\nSpeaker 01 00:14:50\n\nOkay, so we got entertainment, accessibility, education. What other areas could Soundstorm revolutionize?\n\nSpeaker 00 00:14:58\n\nThe possibilities seem endless.\n\nSpeaker 01 00:15:00\n\nThey do. Healthcare is a big one. Soundstorm could make these realistic simulations for medical training. Doctors and nurses could practice procedures, learn new techniques, all in a safe virtual environment.\n\nSpeaker 00 00:15:13\n\nlike a flight simulator but for surgeons.\n\nSpeaker 01 00:15:15\n\nExactly. They could get better and build confidence without any risks. And think about therapy. Imagine personalized soundscapes to reduce stress, help people relax, even manage chronic pain.\n\nSpeaker 00 00:15:26\n\nIt's like a sonic prescription tailored just for you.\n\nSpeaker 01 00:15:29\n\nAnd let's not forget the creative world. Musicians, sound-o-niners, anyone working with audio. Soundstorm could give them the power to create incredible soundscapes and experiences that we couldn't even dream of before.\n\nSpeaker 00 00:15:41\n\nlike a whole new set of instruments and an infinite range of sounds to play with. But we've got to be realistic. With all this potential, there are going to be challenges, right?\n\nSpeaker 01 00:15:50\n\nFor sure, any powerful tool can be misused. That's why researchers are already on it, creating safeguards to stop SoundSpawn from being used for harmful things, like, you know, generating fake audio to deceive or manipulate people.\n\nSpeaker 00 00:16:03\n\nSo like giving Soundstorm a set of ethics, making sure it's used responsibly. Yeah.\n\nSpeaker 01 00:16:08\n\nPrecisely. We need clear guidelines and ways to tell the difference between real and fake audio. And we got to think about jobs, too, you know?\n\nSpeaker 00 00:16:16\n\nYeah, the potential impact on the workforce.\n\nSpeaker 01 00:16:18\n\nAs Soundstorm gets more advanced, it's likely to automate some jobs that people currently do.\n\nSpeaker 00 00:16:23\n\nThat's a valid point. We need open conversations about those impacts and how to support workers who might be affected.\n\nSpeaker 01 00:16:28\n\nAbsolutely. It's about finding that balance, innovation and social responsibility. Now, looking ahead, what do you think the future holds for Soundstorm? Where do you see it going in the next few years?\n\nSpeaker 00 00:16:41\n\nWell, personalization seems like a big one. We're already seeing it, but I think it's just the beginning.\n\nSpeaker 01 00:16:45\n\nI agree. Imagine devices that learn what you like and create custom soundscapes for you, adapting to your moods, what you're doing, even your surroundings.\n\nSpeaker 00 00:16:54\n\nlike having your own personal DJ and sound designer everywhere you go.\n\nSpeaker 01 00:16:57\n\nAnd I think we'll see Soundstorm merging with other types of AI, the ones that understand images, text, you know, we could have mind-blowing multimedia experiences blurring the lines between what's real and virtual.\n\nSpeaker 00 00:17:10\n\nlike all your senses coming together in this amazing new way.\n\nSpeaker 01 00:17:13\n\nRight. And of course, those ethical guidelines, those safeguards, super crucial as Soundstorm evolves. We have to make sure it's used for good.\n\nSpeaker 00 00:17:21\n\nThis has been an awesome deep dive into Soundstorm. It's crazy to think how this tech could change how we make and experience audio. I'm super excited to see what happens next.\n\nSpeaker 01 00:17:32\n\nMe too. It's a reminder that we're living in this amazing time of tech advancements, and it's up to us to make sure those advancements benefit everyone.\n\nSpeaker 00 00:17:40\n\nSo next time you listen to music, watch a movie, play a game, think about the power of sound and all the possibilities that Soundstorm is opening up.\n\nSpeaker 01 00:17:47\n\nAnd don't forget to check out the show notes for links to audio samples made with Soundstorm. You'll be blown away.\n\nSpeaker 00 00:17:52\n\nUntil next time, keep exploring the world of sound.\n\nSpeaker 01 00:17:55\n\nAnd keep diving deep.",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},[74,76],{"type":75,"count":36},"like",{"type":77,"count":78},"dislike","0",[],[81,90,100,110,120,129,138,147,156],{"id":82,"number":83,"season":36,"title":84,"description":85,"type":66,"image":11,"audio":86,"duration":87,"is_explicit":20,"code":83,"publish_date":88,"listenings":70,"is_private":20,"plans":40,"video":40,"images":89},"216dbfed-d0f6-4b90-a774-b3a3a8931523",12,"The Great AI Chip Race: Tech Giants Break Free from NVIDIA","In this episode, we explore how \u003Cb>Amazon, Google\u003C/b>, and other tech behemoths are shaking up the #AI industry by developing their own custom chips. From Amazon's secretive \u003Cb>Annapurna Labs\u003C/b> to Google's powerful \u003Cb>Trillium\u003C/b> processor, discover how this shift could revolutionize AI accessibility and pricing. Learn why major companies are reducing their reliance on \u003Cb>NVIDIA\u003C/b>, the implications for consumers and startups, and what this means for the future of \u003Cb>artificial intelligence\u003C/b>. Join us for an insightful discussion about what might be the biggest power shift in tech since the personal computing revolution.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/216dbfed-d0f6-4b90-a774-b3a3a8931523.mp3",363,"2024-11-13T10:48:08.094Z",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":91,"number":92,"season":36,"title":93,"description":94,"type":66,"image":11,"audio":95,"duration":96,"is_explicit":20,"code":92,"publish_date":97,"listenings":98,"is_private":20,"plans":40,"video":40,"images":99},"708dbdd4-2e2b-4cd8-8491-f50f73a38dde",11,"The Next 18 Months: Anthropic’s Case for Urgent AI Regulation","Join us on \u003Cb>The Next 18 Months\u003C/b>, where we dive deep into \u003Cb>\u003Ca href=\"https://\">Anthropic\u003C/a>\u003C/b>'s compelling vision for the future of artificial intelligence and the crucial role that regulation will play in shaping it. Discover why experts believe we have just 18 months to get crucial safety measures in place. With AI's capabilities advancing at breakneck speed, Anthropic’s latest report warns that time is running out to establish guidelines that protect society without stifling innovation. In this episode, we explore why experts say the next 18 months could make or break AI’s future and discuss the steps Anthropic believes are necessary to responsibly harness this transformative technology. Tune in as we dissect the risks, the revolutionary potential, and the pressing need for policies that balance safety with progress.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/708dbdd4-2e2b-4cd8-8491-f50f73a38dde.mp3",1086,"2024-11-07T08:01:24.402Z",21,{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":101,"number":102,"season":36,"title":103,"description":104,"type":66,"image":11,"audio":105,"duration":106,"is_explicit":20,"code":102,"publish_date":107,"listenings":108,"is_private":20,"plans":40,"video":40,"images":109},"b6025c83-32ab-4144-8890-1e1c8256a0e9",10,"Building the Future of Gaming - AI Next-frame prediction.","Join us as we explore the mind-bending world of \u003Cb>AI-powered gaming\u003C/b>, where \u003Cb>next frame prediction technolog\u003C/b>y is revolutionizing how we interact with virtual worlds. We dive deep into groundbreaking projects from\u003Cb> Descartes and Etche\u003C/b>d, including an \u003Cb>AI version of Minecraft\u003C/b> that responds to players' imagination in real-time. Our expert guest breaks down the technology behind these innovations, from the specialized \u003Cb>Sohu chip\u003C/b> to the broader implications for education, healthcare, and creative expression. Discover how AI isn't just changing how we play games – it's reshaping how we interact with technology itself.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/b6025c83-32ab-4144-8890-1e1c8256a0e9.mp3",893,"2024-11-02T11:22:32.534Z",15,{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":111,"number":112,"season":36,"title":113,"description":114,"type":66,"image":11,"audio":115,"duration":116,"is_explicit":20,"code":112,"publish_date":117,"listenings":118,"is_private":20,"plans":40,"video":40,"images":119},"da0fa61b-1d59-47f4-83d0-7aa1f0e4edcc",9,"The Search Wars: ChatGPT's New Web Powers vs Google & Perplexity","In today’s episode, we're diving into the evolving world of search engines and how groundbreaking upgrades to \u003Cb>\u003Ca href=\"https://\">ChatGPT\u003C/a>\u003C/b>'s search capabilities could be changing the game. Imagine asking a question and getting a direct, sourced answer instead of endless scrolling. We'll explore the magic behind ChatGPT's new real-time web access, how it stacks up against \u003Cb>\u003Ca href=\"https://\">Google Search\u003C/a>\u003C/b> and \u003Cb>\u003Ca href=\"https://\">Perplexity\u003C/a>\u003C/b>, and why this tech revolution might reshape how we explore, learn, and connect with information. From travel tips to stock updates, join us as we break down this “search revolution”—and debate who might come out on top!","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/da0fa61b-1d59-47f4-83d0-7aa1f0e4edcc.mp3",564,"2024-10-31T17:34:05.300Z",18,{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":121,"number":122,"season":36,"title":123,"description":124,"type":66,"image":11,"audio":125,"duration":126,"is_explicit":20,"code":122,"publish_date":127,"listenings":118,"is_private":20,"plans":40,"video":40,"images":128},"5e9f1099-955a-476a-83da-d29d29b9062a",8,"AI Mediator: How Google DeepMind’s Habermas Could Transform Conflict Resolution","Imagine a world where AI doesn’t just mediate disagreements but actively helps prevent conflicts from escalating, both in person and online. In this episode, we explore Google DeepMind’s latest breakthrough, Habermas (\u003Ca href=\"https://\">Habermas Machine dataset\u003C/a>)—a powerful AI designed to resolve disputes by finding genuine common ground among diverse viewpoints. Joined by an expert guest, we’ll dive into how this technology works, the promising research behind it, and the vast implications for promoting peace and understanding.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/5e9f1099-955a-476a-83da-d29d29b9062a.mp3",774,"2024-10-30T08:51:40.551Z",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":130,"number":131,"season":36,"title":132,"description":133,"type":66,"image":11,"audio":134,"duration":135,"is_explicit":20,"code":131,"publish_date":136,"listenings":108,"is_private":20,"plans":40,"video":40,"images":137},"d7427338-815f-469d-b655-9cf0acf5747f",7,"The Centaur Conundrum. Cognitive AI model.","Dive into the fascinating world of \u003Cb>\u003Ca href=\"https://huggingface.co/marcelbinz/Llama-3.1-Centaur-70B\">Centaur\u003C/a>\u003C/b>, an ambitious AI model. \u003Cp>Explore how this groundbreaking technology is blurring the lines between artificial intelligence and cognitive science, and uncover the incredible potential it holds for unlocking the secrets of human behavior and cognition.\u003C/p>","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/d7427338-815f-469d-b655-9cf0acf5747f.mp3",1598,"2024-10-29T14:14:44.823Z",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":139,"number":140,"season":36,"title":141,"description":142,"type":66,"image":11,"audio":143,"duration":144,"is_explicit":20,"code":140,"publish_date":145,"listenings":98,"is_private":20,"plans":40,"video":40,"images":146},"54fcc72a-a8dc-468b-aeee-88e169dae27c",6,"Digital Minds at Work: The Revolution of Large Action Models","In this episode, we dive deep into the groundbreaking world of Large Action Models (LAMs), with a special focus on Anthropic's Claude 3.5 Haiku. We'll explore how this lightning-fast AI isn't just chatting anymore – it's actively using computers like a human would, opening files, navigating websites, and handling complex digital tasks through innovative pixel-based interaction. \u003Cp>\u003Cbr />\u003C/p>\u003Cp>Keywords: AI technology, Large Action Models, Anthropic, Claude 3.5 Haiku, computer automation, future of work, AI innovation\u003C/p>","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/54fcc72a-a8dc-468b-aeee-88e169dae27c.mp3",182,"2024-10-22T16:28:01.262Z",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":148,"number":24,"season":36,"title":149,"description":150,"type":66,"image":11,"audio":151,"duration":152,"is_explicit":20,"code":24,"publish_date":153,"listenings":154,"is_private":20,"plans":40,"video":40,"images":155},"a265ee3a-0825-4a6c-a7a0-89fba9eb267f","AI Dream Teams: How Multi-Agent Platforms Are Revolutionizing Business","Dive into the world of \u003Cb>Asilisc Scope\u003C/b> and multi-agent \u003Cb>AI platforms\u003C/b> that are transforming how businesses operate. Discover how interconnected\u003Cb> AI specialists\u003C/b> can streamline your company's workflow - from accounting to customer service and beyond. Learn how these \u003Cb>AI teams\u003C/b> collaborate under human supervision to tackle complex problems, potentially boosting efficiency and innovation across your entire organization. Join us as we explore the future of AI in business, where your next star employee might just be a team of artificial intelligences.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/a265ee3a-0825-4a6c-a7a0-89fba9eb267f.mp3",494,"2024-10-20T12:58:43.907Z",22,{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},{"id":157,"number":158,"season":36,"title":159,"description":160,"type":66,"image":11,"audio":161,"duration":162,"is_explicit":20,"code":158,"publish_date":163,"listenings":98,"is_private":20,"plans":40,"video":40,"images":164},"05137fef-b1e0-4222-9de1-844a074a8e08",4,"The Action AI Revolution: How Large Action Models Are","Explore the game-changing world of Large Action Models - AI that doesn't just advise, but acts. Learn how this cutting-edge technology is dramatically accelerating productivity by automating tasks across various business software platforms. We'll dive into the potential benefits, challenges, and ethical considerations of AI that works alongside humans, potentially reshaping the future of work as we know it.","storage/podcasts/a916dc01-1db2-4f42-aaf0-e30bf94c491d/episodes/05137fef-b1e0-4222-9de1-844a074a8e08.mp3",225,"2024-10-20T12:47:01.424Z",{"image_80":13,"image_180":14,"image_240":15,"image_600":16,"image_1280":17},["Reactive",166],{"$ssite-config":167},{"_priority":168,"env":172,"name":173,"url":174},{"name":169,"env":170,"url":171},-10,-15,-4,"production","podcast-website","https://greatleveler.mave.digital/",["Set"],["ShallowReactive",177],{"$63LOZx6kQb":-1},"/ep-13",{"common":180},{"activeTab":181,"isShareActive":20,"episodes":182,"contentPosition":20,"podcast":5,"podcastSlug":183,"showPlayer":20,"activeTrack":40,"pauseTrack":20,"activeEpisode":61,"titleHeight":184,"website":185,"listenUrl":40,"isMobileShareActive":20,"isDataLoaded":34,"favicon":50,"customDomain":40,"episodesCount":184},"listen",[],"greatleveler",0,{"button_text":37,"button_link":38,"is_indexing":34,"ym_id":-1,"gtm_id":-1}]