Spotify’s Shivam Verma talks LLM and personalization

Machine Learning


Shivam Verma, a staff machine learning engineer at Spotify, recently shared insights into how the music and podcast streaming giant is adapting its personalization strategy in the era of large-scale language models (LLMs). Speaking at the AI ​​Engineer Europe event, Verma detailed Spotify’s journey from traditional recommendation systems to leveraging LLM for a more nuanced and personalized user experience.

Spotify's Shivam Verma talks LLM and personalization - AI Engineer

Spotify’s Shivam Verma talks LLM and personalization — from an AI engineer

Visual TL;DR. Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users leads to actionable recommendations. Steerable recommendations enable personalized generation. Spotify’s Shivam Verma talks about Traditional Recs.

  1. Traditional Recs: Multi-stage pipeline for candidate generation, ranking, and scoring
  2. The LLM era: The arrival of large-scale language models opens new avenues for personalization
  3. Semantic ID: Leveraging semantic ID and vector representation of content
  4. Content/User Understanding: LLM helps you understand nuanced content and user preferences
  5. Actionable Recommendation: Move to actionable, context-aware content discovery
  6. Personalized generation: highly personalized generative recommendations
  7. Spotify’s Shivam Verma: Spotify’s Machine Learning Engineer Shares Insights

Visual TL;DR
Visual TL;DR—startuphub.ai Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users gives you actionable recommendations evolve into enable enable leads to traditional rec

LLM era

Semantic ID

Understand your content/users

Actionable recommendations

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users gives you actionable recommendations evolve into enable enable leads to traditional rec

LLM era

Semantic ID

understandContent/User

ManeuverableRecommendations

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users gives you actionable recommendations evolve into enable enable leads to traditional rec Multi-stage pipeline for candidatesGeneration, ranking, score LLM era The emergence of large-scale language models opens new doorsThe path to personalization Semantic ID Leverage semantic IDs and vectorsExpression of content Understand your content/users LLM helps you understand the nuances.User settings Actionable recommendations Going steerable and context-awarecontent discovery

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users gives you actionable recommendations evolve into enable enable leads to traditional rec multi-stagepipelineCandidate… LLM era The advent of the Rajlanguage modelNew opening… Semantic ID Leverage semanticsID and vectorexpression for… understandContent/User LLM supportsunderstand the subtle nuancesContent and users… ManeuverableRecommendations is moving towardsManeuverable,Context-aware…

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users leads to actionable recommendations. Steerable recommendations enable personalized generation. Spotify’s Shivam Verma talks about Traditional Recs evolve into enable enable leads to as a result discuss traditional rec Multi-stage pipeline for candidatesGeneration, ranking, score LLM era The emergence of large-scale language models opens new doorsThe path to personalization Semantic ID Leverage semantic IDs and vectorsExpression of content Understand your content/users LLM helps you understand the nuances.User settings Actionable recommendations Going steerable and context-awarecontent discovery personalized generative Very good generative recommendationspersonalized Shivam Verma on Spotify Staff Machine Learning Engineer at Spotifyshare your insights

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai Traditional Recs evolve in the LLM era. LLM Era enables Semantic ID. Semantic ID enables content/user understanding. Understanding your content/users leads to actionable recommendations. Steerable recommendations enable personalized generation. Spotify’s Shivam Verma talks about Traditional Recs evolve into enable enable leads to as a result discuss traditional rec multi-stagepipelineCandidate… LLM era The advent of the Rajlanguage modelNew opening… Semantic ID Leverage semanticsID and vectorexpression for… understandContent/User LLM supportsunderstand the subtle nuancesContent and users… ManeuverableRecommendations is moving towardsManeuverable,Context-aware… personalizedgenerative generativeRecommendationsThat’s very… Shivam on SpotifyVelma stuff machinelearning engineerWith Spotify sharing…

From startuphub.ai · Publishers behind this format

From traditional personalization to personalization using LLM

Verma explained that Spotify’s existing recommendation system, called “TradRecs,” has long relied on a multi-step pipeline that includes candidate generation, ranking, and scoring. These systems help deliver personalized playlists, search results, and content feeds across a variety of media types, including music, podcasts, and audiobooks. However, the advent of LLM has opened new avenues for personalization, allowing for a more fluid and context-aware approach.

At the heart of this evolution is how Spotify represents its users and their vast catalog of content. Verma highlighted the use of user embeddings, which are a series of numbers that represent a user’s likes and dislikes. These embeds are the basis for many of Spotify’s personalized products. To bridge the gap between these user expressions and LLM’s language understanding, Spotify employs techniques such as semantic ID and vector embedding.

Leverage semantic ID and vector representations

This process involves creating a vector representation of the content, allowing the LLM to understand not just the words, but the underlying meaning and context. Similarly, user history is converted into semantic IDs and fed to LLM. This approach allows the model to process complex user context, including listening history, explicit prompts, and other implicit signals, to generate more relevant and actionable recommendations.

Verma illustrated this with an example where LLM, with user context such as country, age, and viewing history, can process prompts like “What episode can I listen to next?” and generate personalized recommendations. This differs from traditional systems in that it allows for a more conversational and interactive way to discover content.

The role of the LLM in understanding content and users

Verma emphasized that LLM has been fine-tuned to understand Spotify’s specific catalog and user data. This involves training a model based on Spotify’s vast internal data, including content vectors and user interaction logs. The goal is to enable LLMs to not only understand the meaning of content, but also to more effectively interpret user preferences and context.

It is a shift from a strictly analytical approach to one that incorporates generative capabilities. By transforming user behavior and content metadata into a common semantic space, LLM can generate more creative and personalized recommendations. This includes features such as ‘taste profiles’ where users can provide explicit feedback to further improve the model’s understanding of their preferences.

From “Trad-Recs” to actionable and personalized generated recommendations

Verma concluded by summarizing the transition from traditional recommendation systems to a new era of “tradition-generated” recommendations. He emphasized the key points:

  • Embedding and semantic ID are important building blocks for generative LLM-native recommender systems.
  • Soft token approaches show great potential in personalizing LLMs.
  • Traditional recommenders and sequential modeling remain important for real-world, real-time rankings that complement LLM capabilities.

This evolution aims to give users more control and transparency in their content discovery journey, making the Spotify experience more engaging and personalized than ever before.

© 2026 StartupHub.ai. Unauthorized reproduction is prohibited. Please do not type, scrape, copy, reproduce or republish this article in whole or in part. Use for AI training, fine-tuning, search enhancement generation, or as input to any machine learning system is prohibited without a written license. Substantially similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer abuse laws. See our Clause.



Source link