Collaborative Filtering at Spotify

Jan 25, 2013 • 281 likes • 93,393 views

From the NYC Machine Learning meetup on Jan 17, 2013: http://www.meetup.com/NYC-Machine-Learning/events/97871782/
Video is available here: http://vimeo.com/57900625

Music recommendations at Spotify

Erik Bernhardsson

erikbern@spotify.com

Collaborative filtering

Idea:
- If two movies x, y get similar ratings then they are probably similar
- If a lot of users all listen to tracks x, y, z, then those tracks are probably similar

Aggregate data

Throw away temporal information and just look at the number of times each user played each track

Supervised collaborative filtering is pretty much matrix completion

Unsupervised learning

- Trying to estimate the density
- i.e. predict probability of future events

Try to predict the future given the past

We can calculate a correlation coefficient and use it as an item similarity

- Use something like Pearson, Jaccard, …
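A minimal sketch of both measures on aggregated play counts may help here. The `plays` data and the function names are hypothetical, made up for illustration; this is not the pipeline described in the talk.

```python
from math import sqrt

# Hypothetical toy data: play counts per user and track.
plays = {
    "alice": {"x": 10, "y": 8, "z": 1},
    "bob": {"x": 4, "y": 5},
    "carol": {"y": 2, "z": 7},
    "dave": {"z": 3},
}

def listeners(track):
    """Set of users who played the track at least once."""
    return {u for u, counts in plays.items() if counts.get(track, 0) > 0}

def jaccard(a, b):
    """Jaccard similarity of the two tracks' listener sets."""
    la, lb = listeners(a), listeners(b)
    union = la | lb
    return len(la & lb) / len(union) if union else 0.0

def pearson(a, b):
    """Pearson correlation of the two tracks' play counts across all users."""
    users = sorted(plays)
    xs = [plays[u].get(a, 0) for u in users]
    ys = [plays[u].get(b, 0) for u in users]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0

print(jaccard("x", "y"), jaccard("x", "z"))
print(pearson("x", "y"), pearson("x", "z"))
```

In this toy data, tracks x and y share most of their listeners, so both measures rate them as more similar than x and z.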

Amazon did this for “customers who bought this also bought”

- US patent 7113917

Can speed this up using various LSH tricks

- Twitter: Dimension Independent Similarity Computation (DISCO)
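As one illustration of the kind of trick meant here, MinHash signatures give a fast estimate of Jaccard similarity without comparing full listener sets. This is a generic sketch of MinHash only, not of DISCO and not of the system used in the talk; the function names and the choice of SHA-1 as the hash are assumptions.

```python
import hashlib

def _hash(seed, item):
    """Deterministic 64-bit hash of an item under a given seed (assumed scheme)."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.sha1(data).digest()[:8], "big")

def signature(items, num_hashes=128):
    """MinHash signature: the minimum hash value per seed. items must be non-empty."""
    return [min(_hash(seed, it) for it in items) for seed in range(num_hashes)]

def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature positions that agree, an estimate of Jaccard similarity."""
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)

# Hypothetical listener sets for two tracks.
track_x = {"alice", "bob", "carol", "dave"}
track_y = {"bob", "carol", "dave", "erin"}
print(estimate_jaccard(signature(track_x), signature(track_y)))
```

A full LSH index would additionally split each signature into bands and bucket tracks whose bands match, so that only tracks sharing a bucket are compared.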

Natural Language Processing has a lot of similar problems

…matrix factorization is one idea

Matrix factorization

- Want to get user vectors and item vectors
- Assume f latent factors (dimensions) for each user/item
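To make the shapes concrete, here is a small sketch with one f-dimensional vector per user and per item, where every cell of the user-item matrix is approximated by a dot product. The sizes and the random initialization are assumptions chosen for illustration.

```python
import random

random.seed(0)

num_users, num_items, f = 4, 6, 2  # hypothetical sizes

# One vector of f latent factors per user and per item.
user_vectors = [[random.gauss(0, 0.1) for _ in range(f)] for _ in range(num_users)]
item_vectors = [[random.gauss(0, 0.1) for _ in range(f)] for _ in range(num_items)]

def predict(u, i):
    """Approximate cell (u, i) of the user-item matrix by a dot product."""
    return sum(a * b for a, b in zip(user_vectors[u], item_vectors[i]))

# The full num_users x num_items matrix is implied by the two sets of vectors.
approx = [[predict(u, i) for i in range(num_items)] for u in range(num_users)]

# Parameters to learn: (users + items) * f instead of users * items cells.
params = (num_users + num_items) * f
print(params, "parameters for", num_users * num_items, "cells")
```

The saving is small at this toy size, but with millions of users and tracks the factorized form is far smaller than the full matrix.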

Probabilistic Latent Semantic Analysis (PLSA)

- Hofmann, 1999
- Also called PLSI

PLSA, cont.

Optimization problem: maximize log-likelihood
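For reference, the log-likelihood in Hofmann's formulation of PLSA can be written as below. The symbols u (user), i (item), z (latent class) and n_ui (number of times user u played item i) are naming assumptions; the talk's own notation is not preserved in this text, and the talk may parameterize the model differently.

```latex
\[
P(u, i) = \sum_{z} P(z)\, P(u \mid z)\, P(i \mid z),
\qquad
\mathcal{L} = \sum_{u, i} n_{ui} \log P(u, i)
\]
```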

“Collaborative Filtering for Implicit Feedback Datasets”

- Hu, Koren, Volinsky (2008)
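The objective from that paper can be summarized as follows, using the paper's usual symbols: r_ui is the observed count, p_ui a binary preference, c_ui a confidence weight, x_u and y_i the user and item vectors, and alpha and lambda hyperparameters. It is given here as background; the slide's own rendering is not preserved in this text.

```latex
\[
p_{ui} = \begin{cases} 1 & r_{ui} > 0 \\ 0 & r_{ui} = 0 \end{cases},
\qquad
c_{ui} = 1 + \alpha\, r_{ui}
\]
\[
\min_{x_*, y_*} \sum_{u, i} c_{ui} \left( p_{ui} - x_u^{\top} y_i \right)^2
  + \lambda \left( \sum_u \lVert x_u \rVert^2 + \sum_i \lVert y_i \rVert^2 \right)
\]
```

The paper minimizes this with alternating least squares: with the item vectors held fixed, each user vector has a closed-form solution, and vice versa.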

“Collaborative Filtering for Implicit Feedback Datasets”, cont.

What happens each iteration

- Assign all latent vectors small random values
- Perform gradient ascent to optimize log-likelihood
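The two steps above can be sketched as follows. The model is an assumption made for illustration, P(i | u) as a softmax over dot products of user and item vectors, and is not necessarily the exact parameterization used in the talk. The toy counts, learning rate and iteration count are also hypothetical.

```python
import math
import random

random.seed(0)

# Hypothetical toy play counts: counts[u][i] = times user u played item i.
counts = [
    [5, 3, 0, 0],
    [4, 0, 0, 1],
    [0, 0, 4, 5],
    [0, 1, 3, 4],
]
num_users, num_items, f = len(counts), len(counts[0]), 2

# Step 1: assign all latent vectors small random values.
A = [[random.gauss(0, 0.1) for _ in range(f)] for _ in range(num_users)]
B = [[random.gauss(0, 0.1) for _ in range(f)] for _ in range(num_items)]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def probabilities(u):
    """Assumed model: P(i | u) is a softmax over items of A[u] . B[i]."""
    logits = [dot(A[u], B[i]) for i in range(num_items)]
    top = max(logits)
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def log_likelihood():
    ll = 0.0
    for u in range(num_users):
        p = probabilities(u)
        ll += sum(n * math.log(p[i]) for i, n in enumerate(counts[u]) if n)
    return ll

def gradient_step(lr=0.005):
    """Step 2: one gradient ascent update of all user and item vectors."""
    grad_A = [[0.0] * f for _ in range(num_users)]
    grad_B = [[0.0] * f for _ in range(num_items)]
    for u in range(num_users):
        p = probabilities(u)
        n_u = sum(counts[u])
        for i in range(num_items):
            g = counts[u][i] - n_u * p[i]  # derivative w.r.t. the (u, i) logit
            for k in range(f):
                grad_A[u][k] += g * B[i][k]
                grad_B[i][k] += g * A[u][k]
    for u in range(num_users):
        for k in range(f):
            A[u][k] += lr * grad_A[u][k]
    for i in range(num_items):
        for k in range(f):
            B[i][k] += lr * grad_B[i][k]

initial_ll = log_likelihood()
for _ in range(1000):
    gradient_step()
final_ll = log_likelihood()
print(initial_ll, final_ll)
```

The small random start matters: if every vector were exactly zero, both gradients would vanish and the updates would never move.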

Calculate derivative and do gradient ascent


Vectors are pretty nice because things are now super fast

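- User-item score is a dot product: score(u, i) = u · v_i, for user vector u and item vector v_i
- Item-item similarity score is a cosine similarity: cos(v_i, v_j) = (v_i · v_j) / (|v_i| |v_j|)
- Both cases have trivial complexity in the number of factors f: O(f) per pair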
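Both scores are a few lines of code. The sketch below uses plain Python lists as vectors; the function names are illustrative. Each call does one pass over the f factors.

```python
from math import sqrt

def dot(u, v):
    """User-item score: sum of element-wise products, O(f)."""
    return sum(a * b for a, b in zip(u, v))

def cosine(v, w):
    """Item-item similarity: dot product divided by the vector lengths, O(f)."""
    norm = sqrt(dot(v, v)) * sqrt(dot(w, w))
    return dot(v, w) / norm if norm else 0.0

print(dot([1, 2], [3, 4]))       # user-item score
print(cosine([1, 0], [0, 1]))    # orthogonal item vectors
print(cosine([1, 2], [2, 4]))    # parallel item vectors
```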

Example: item similarity as a cosine of vectors

We can rank all tracks by the user’s vector
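Ranking amounts to scoring every track against the user's vector and sorting. A minimal sketch, with a hypothetical user vector and three hypothetical track vectors:

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# Hypothetical vectors with f = 2 latent factors.
user_vector = [1.0, 0.0]
track_vectors = {
    "a": [0.9, 0.1],
    "b": [0.1, 0.9],
    "c": [0.5, 0.5],
}

def rank_tracks(user, tracks):
    """Track ids sorted by descending dot product with the user vector."""
    return sorted(tracks, key=lambda t: dot(user, tracks[t]), reverse=True)

ranking = rank_tracks(user_vector, track_vectors)
print(ranking)
```

This brute-force version scores every track; at catalog scale an approximate nearest neighbor index is the usual way to avoid a full scan.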

One iteration of a matrix factorization algorithm

“Google News personalization: scalable online collaborative filtering”

So now we solved the problem of recommendations, right?

Actually what we really want is to apply it to other domains

Radio

- Artist radio: find related tracks
- Optimize ensemble model based on skip/thumbs data

Learning from feedback is actually pretty hard


FashiondatascSuman Bhattacharya, PhD

35 slides•834 views

Models for Information Retrieval and RecommendationArjen de Vries

91 slides•2.5K views

Apache Mahout Tutorial - Recommendation - 2013/2014 Cataldo Musto

117 slides•37.9K views

Tag And Tag Based Recommendergu wendong

26 slides•3.5K views

Recsys 2018 overview and highlightsSandra Garcia

42 slides•154 views

Lecture Notes on Recommender System IntroductionPerumalPitchandi

101 slides•405 views

Intelligent SearchTed Dunning

43 slides•838 views


