{"id":14826,"date":"2025-10-30T19:55:03","date_gmt":"2025-10-30T19:55:03","guid":{"rendered":"https:\/\/bitunikey.com\/news\/interview-big-tech-is-training-ai-on-junk-data-intuition\/"},"modified":"2025-10-30T19:55:12","modified_gmt":"2025-10-30T19:55:12","slug":"interview-big-tech-is-training-ai-on-junk-data-intuition","status":"publish","type":"post","link":"https:\/\/bitunikey.com\/news\/interview-big-tech-is-training-ai-on-junk-data-intuition\/","title":{"rendered":"Interview | Big tech is training AI on junk data: Intuition"},"content":{"rendered":"<p><\/p>\n<div class=\"post-detail__content blocks\">\n<p class=\"is-style-lead\">AI models are getting more powerful, but the data they\u2019re trained on is getting worse, says Intuition founder Billy Luedtke. <\/p>\n<div id=\"cn-block-summary-block_a454ec6116be6191756d067ea26ac6a2\" class=\"cn-block-summary\">\n<div class=\"cn-block-summary__nav tabs\">\n        <span class=\"tabs__item is-selected\">Summary<\/span>\n    <\/div>\n<div class=\"cn-block-summary__content\">\n<ul class=\"wp-block-list\">\n<li>AI is only as good as the data we feed it, says Billy Luedtke, founder of Intuition <\/li>\n<li>We\u2019re in a \u201cslop-in, slop-out\u201d era, as AI becomes recursive<\/li>\n<li>Decentralized models have the edge with tech and user experience<\/li>\n<\/ul><\/div>\n<\/div>\n<p><!-- .cn-block-summary --><\/p>\n<p>As AI systems grow more pervasive, users are increasingly running into limitations that are hard to fix. While the models improve, the underlying data these models are trained on remains the same. What is more, recursion, or AI models training on data generated by other AI, might actually make it worse.  <\/p>\n<p>To talk about the future of AI, crypto.news spoke to Billy Luedtke, founder of Intuition, a decentralized protocol focused on bringing verifiable attribution, reputation, and data ownership to AI. Luedtke explains why the current data sets for AI are fundamentally flawed and what can be done to fix it. <\/p>\n<p><strong>Crypto.news: Everyone right now is focused on AI infrastructure \u2014 GPUs, energy, data centers. Are people underestimating the importance of the trust layer in AI? Why is it important?<\/strong><\/p>\n<p>Billy Luedtke: 100%. People are definitely underestimating it \u2014 and it matters for several reasons.<\/p>\n<p>First, we\u2019re entering what I call a \u201cslop-in, slop-out\u201d era. AI is only as good as the data it consumes. But that data \u2014 especially from the open web \u2014 is largely polluted. It\u2019s not clean. It\u2019s not reflective of human intention. Much of it comes from gamified behavior online: likes, reviews, engagement hacks \u2014 all filtered through attention-optimized algorithms.<\/p>\n<p>So when AI scrapes the internet, what it sees isn\u2019t a holistic picture of who we are. It\u2019s seeing people playing the platform. I don\u2019t behave the same way on Twitter as I do in real life. None of us do. We\u2019re optimizing for the algorithm \u2014 not expressing genuine thought.<\/p>\n<p>It\u2019s recursive, too. The platforms train us, and we feed more distorted behavior back in. That creates a feedback loop \u2014 a spiral \u2014 that distorts AI\u2019s perception of humanity even more. We\u2019re not teaching it what we think; we\u2019re teaching it what we think will get likes.<\/p>\n<p>The average user isn\u2019t Googling, comparing sources, or thinking critically. They\u2019re just asking ChatGPT or another model and taking the response at face value.<\/p>\n<p>That\u2019s dangerous. If the model is opaque \u2014 a black box \u2014 and the company that controls it also controls what information you\u2019re shown or not shown, then that\u2019s total narrative control. It\u2019s centralized, unaccountable, and extremely powerful.<\/p>\n<p>Imagine asking Grok for the best podcast, and the answer is whoever paid Elon the most. That\u2019s not intelligence \u2014 it\u2019s just advertising in disguise.<\/p>\n<p><strong>CN: So how do we fix that? How do we build systems that prioritize truth and value instead of engagement?<\/strong><\/p>\n<p>BL: We need to flip the incentives. These systems should serve people \u2014 not institutions, not shareholders, not advertisers. That means building a new layer for the internet: identity and reputation primitives. That\u2019s what we\u2019re doing at Intuition.<\/p>\n<p>We need verifiable attribution: who said what, when, and in what context. And we need a portable, decentralized reputation that helps determine how much we can trust any given source of data \u2014 not based on vibe, but on actual contextual track record.<\/p>\n<p>Reddit is a perfect example. It\u2019s one of the <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.reddit.com\/r\/google\/comments\/1ax1nyh\/reddit_has_struck_a_60_million_deal_with_google\/\" target=\"_blank\" rel=\"nofollow\">largest sources<\/a> of training data for models. But if a user sarcastically says, \u201cJust k*** yourself,\u201d that can get scraped and show up in a model\u2019s recommendation to someone asking for medical advice.<\/p>\n<p>That\u2019s horrifying \u2014 and it\u2019s what happens when models don\u2019t have context, attribution, or reputation weighting. We need to know: Is this person credible in medicine? Are they reputable in finance? Is this a trusted source, or just another random comment?<\/p>\n<p>    <!-- .cn-block-related-link --><\/p>\n<p><strong>CN: When you talk about attribution and reputation, this data needs to be stored somewhere. How do you think about that in terms of infrastructure \u2014 especially with issues like copyright and compensation?<\/strong><\/p>\n<p>BL: That\u2019s exactly what we\u2019re solving at Intuition. Once you have verifiable attribution primitives, you know who created what data. That allows for tokenized ownership of knowledge \u2014 and with that, compensation.<\/p>\n<p>So instead of your data living on Google\u2019s servers or OpenAI\u2019s APIs, it lives on a decentralized knowledge graph. Everyone owns what they contribute. When your data gets traversed or used in an AI output, you get a share of the value it generates.<\/p>\n<p>That matters because right now we\u2019re digital serfs. We spend our most valuable resources \u2014 time, attention, and creativity \u2014 generating data that someone else monetizes. YouTube isn\u2019t valuable because it hosts videos; it\u2019s valuable because people curate it. Without likes, comments, or subscriptions, YouTube is worthless.<\/p>\n<p>So we want a world where everyone can earn from the value they generate \u2014 even if you\u2019re not an influencer or extrovert. If you\u2019re consistently early to finding new artists, for example, your taste has value. You should be able to build a reputation around that and monetize it.<\/p>\n<p><strong>CN: But even if we get transparency, these models are still really hard to interpret. OpenAI itself can\u2019t fully explain how its models make decisions. What happens then?<\/strong><\/p>\n<p>BL: Great point. We can\u2019t fully interpret model behavior \u2014 they\u2019re just too complex. But what we can control is the training data. That\u2019s our lever.<\/p>\n<p>I\u2019ll give you an example: I heard about a <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/2507.14805\" target=\"_blank\" rel=\"nofollow\">research paper<\/a> where one AI was obsessed with owls and another was great at math. They only trained together on math-related tasks. But at the end, the math AI also started loving owls \u2014 just by absorbing the pattern from the other.<\/p>\n<p>It\u2019s crazy how subliminal and subtle these patterns are. So the only real defense is intention. We need to be deliberate about what data we feed these models. We need to \u201cheal ourselves,\u201d in a way, to show up online in a more authentic, constructive way. Because AI will always reflect the values and distortions of its creators.<\/p>\n<p><strong>CN: Let\u2019s talk business. OpenAI is burning cash. Their infrastructure is extremely expensive. How can a decentralized system like Intuition compete \u2014 financially and technically?<\/strong><\/p>\n<p>BL: There are two core advantages we have: composability and coordination.<\/p>\n<p>Decentralized ecosystems \u2014 especially in crypto \u2014 are incredibly good at coordination. We\u2019ve got global, distributed teams all working on different components of the same larger problem. Instead of one company burning billions fighting the world, we\u2019ve got hundreds of aligned contributors building interoperable tools.<\/p>\n<p>It\u2019s like a mosaic. One team works on agent reputation, another on decentralized storage, another on identity primitives \u2014 and we can stitch those together.<\/p>\n<p>That\u2019s the superpower.<\/p>\n<p>The second advantage is user experience. OpenAI is locked into its moat. They can\u2019t let you port your context from ChatGPT to Grok or Anthropic \u2014 that would erode their defensibility. But we don\u2019t care about vendor lock-in.<\/p>\n<p>In our system, you\u2019ll be able to own your context, take it with you, and plug it into whichever agent you want. That makes for a better experience. People will choose it.<\/p>\n<p>    <!-- .cn-block-related-link --><\/p>\n<p><strong><strong>CN: <\/strong>What about infrastructure costs? Running large models is extremely expensive. Do you see a world where smaller models run locally?<\/strong><\/p>\n<p>BL: Yes, 100%. I actually think that\u2019s where we\u2019re headed \u2014 toward many small models running locally, connected like neurons in a distributed swarm.<\/p>\n<p>Instead of one big monolithic data center, you\u2019ve got billions of consumer devices contributing compute. If we can coordinate them \u2014 which is what crypto excels at \u2014 that becomes a superior architecture.<\/p>\n<p>And this is why we\u2019re also building agent reputation layers. Requests can be routed to the right specialized agent for the job. You don\u2019t need one massive model to do everything. You just need a smart system for task routing \u2014 like an API layer across millions of agents.<\/p>\n<p><strong>CN: What about determinism? LLMs aren\u2019t great for tasks like math, where you want exact answers. Can we combine deterministic code with AI?<\/strong><\/p>\n<p>BL: That\u2019s what I want. We need to bring back determinism into the loop.<\/p>\n<p>We started with symbolic reasoning \u2014 fully deterministic \u2014 and then we swung hard into deep learning, which is nondeterministic. That gave us the explosion we\u2019re seeing now. But the future is neurosymbolic \u2014 combining the best of both.<\/p>\n<p>Let the AI handle the fuzzy reasoning. But also let it trigger deterministic modules \u2014 scripts, functions, logic engines \u2014 where you need precision. Think: \u201cWhich of my friends likes this restaurant?\u201d That should be 100% deterministic.<\/p>\n<p><strong><strong>CN: <\/strong>Zooming out: we\u2019ve seen companies integrate AI across their operations. But results have been mixed. Do you think the current generation of LLMs truly boosts productivity?<\/strong><\/p>\n<p>BL: Absolutely. The singularity is already here \u2014 it\u2019s just unevenly distributed.<\/p>\n<p>If you\u2019re not using AI in your workflow, especially for code or content, you\u2019re working at a fraction of the speed others are. The tech is real, and the efficiency gains are massive. The disruption has already happened. People just haven\u2019t fully realized it yet.<\/p>\n<p><strong>CN: Final question. A lot of people are saying this is a <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.businessinsider.com\/bill-gates-ai-bubble-similar-dot-com-bubble-2025-10\" target=\"_blank\" rel=\"nofollow\">bubble<\/a>. Venture capital is drying up. OpenAI is burning money. Nvidia\u2019s financing its own customers. How does this end?<\/strong><\/p>\n<p>BL: Yes, there\u2019s a bubble \u2014 but the tech is real. Every bubble pops, but what\u2019s left afterward are the foundational technologies. AI is going to be one of them. The dumb money \u2014 all those wrapper apps with no real innovation \u2014 that\u2019s getting flushed. But deep infrastructure teams? They\u2019ll survive.<\/p>\n<p>In fact, this could go one of two ways: We get a soft correction and come back to reality, but progress continues. Or, productivity gains are so immense that AI becomes a deflationary force on the economy. GDP could 10x or 100x in output capacity. If that happens, the spending was worth it \u2014 we level up as a society.<\/p>\n<p>Either way, I\u2019m optimistic. There\u2019ll be chaos and job displacement, yes \u2014 but also the potential for an abundant, post-scarcity world if we build the right foundation.<\/p>\n<p>    <!-- .cn-block-related-link --><\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>AI models are getting more powerful, but the data they\u2019re trained on is getting worse, says Intuition founder Billy Luedtke. Summary AI is only as good as the data we&hellip;<\/p>\n","protected":false},"author":1,"featured_media":14827,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-14826","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cryptocurrency"],"_links":{"self":[{"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/posts\/14826","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/comments?post=14826"}],"version-history":[{"count":1,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/posts\/14826\/revisions"}],"predecessor-version":[{"id":14828,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/posts\/14826\/revisions\/14828"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/media\/14827"}],"wp:attachment":[{"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/media?parent=14826"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/categories?post=14826"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/bitunikey.com\/news\/wp-json\/wp\/v2\/tags?post=14826"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}