SlideShare a Scribd company logo
1 of 19
Download to read offline
Users and Noise: The Magic Barrier of Recommender Systems



 Alan Said, Brijnesh J. Jain, Sascha Narr, Till Plumbaum
  Competence Center Information Retrieval & Machine Learning


 @alansaid, @saschanarr, @matip
Outline

β–Ί The Magic Barrier
β–Ί Empirical Risk Minimization

β–Ί Deriving the Magic Barrier

β–Ί User Study

β–Ί Conclusion




           20 July 2012   The Magic Barrier   2
The Magic Barrier




         20 July 2012   The Magic Barrier   3
The Magic Barrier

β–Ί No magic involved....
β–Ί Coined by Herlocker et al. in 2004

      β€œ...an algorithm cannot be more accurate than the variance in
       a user’s ratings for the same item.”
      The maximum level of prediction that a recommender
       algorithm can attain.



β–Ί   What does this mean?




            20 July 2012   The Magic Barrier                       4
The Magic Barrier




         20 July 2012   The Magic Barrier   5
The Magic Barrier

β–Ί   Even a β€œperfect” recommender should not reach RMSE = 0 or
    Precision @ N = 1

β–Ί   Why?
       People are inconsistent and noisy in their ratings
       β€œperfect” accuracy is not perfect

β–Ί   So?
       Knowing the highest possible level of accuracy, we can stop
        optimizing our algorithms at β€œperfect” (before overfitting)




              20 July 2012    The Magic Barrier                       6
The Magic Barrier




So – how do we find the magic barrier?

We employ the Empirical Risk Minimization principle and a
 statistical model for user inconsistencies




           20 July 2012   The Magic Barrier                 7
The Magic Barrier – User Inconsistencies

Assumption:
    If a user were to re-rate all previously rated items, keeping in
     mind the inconsistency, the ratings would differ, i.e.
            π‘Ÿ 𝑒𝑖 = πœ‡ 𝑒𝑖 + πœ€ 𝑒𝑖

        where
          ο€­ πœ‡ 𝑒𝑖 is the expected rating, and
          ο€­ πœ€ 𝑒𝑖 the rating error (has zero mean)




            20 July 2012          The Magic Barrier                     8
Empirical Risk Minimization

β–Ί   … is a principle in statistical learning theory which defines a
    family of learning algorithms and is used to give theoretical
    bounds on the performance of learning
    algorithms.[Wikipedia]




              20 July 2012   The Magic Barrier                        9
Empirical Risk Minimization

β–Ί   We formulate our risk function as
       𝑅 𝑓 = 𝑒,𝑖,π‘Ÿ 𝑝 𝑒, 𝑖, π‘Ÿ 𝑓 𝑒, 𝑖 βˆ’ π‘Ÿ 2                     The prediction error

        The probability of user u rating item i with score r

β–Ί   Keeping the assumption in mind, we formulate the risk for a
    true, unknown, rating function as the sum of the noise
    variance, i.e.
        𝑅 π‘“βˆ— = 𝑒,𝑖 𝑝 𝑒, 𝑖 𝕍 πœ€ 𝑒𝑖
           where 𝕍 πœ€ 𝑒𝑖 is the noise variance




               20 July 2012         The Magic Barrier                                 10
Deriving the Magic Barrier

β–Ί   We want to express the risk function in terms of a magic barrier
    for RMSE – we take the root of the risk function

       ℬ 𝒰×ℐ =              𝑒,𝑖   𝑝 𝑒, 𝑖 𝕍 πœ€ 𝑒𝑖

       RMSE=0 iff πœ€ 𝑒𝑖 = 0 over all ratings users and items
β–Ί In terms of RMSE we can express this as

       𝐸 𝑅𝑀𝑆𝐸 𝑓 = ℬ 𝒰×ℐ + 𝐸 𝑓 > ℬ 𝒰×ℐ
       where 𝐸 𝑓 is the error




              20 July 2012             The Magic Barrier          11
Estimating the Magic Barrier

1.   For each user-item pair in our population
      a) Sample ratings on a regular basis, i.e. re-ratings
      b) Estimate the expected value of ratings
                                                     π‘š
                                                1
                                     πœ‡ 𝑒𝑖     =           π‘Ÿ 𝑑 𝑒𝑖
                                                π‘š
                                                    𝑑=1

     c. Estimate the rating variance
                                          π‘š
                                     1                                  2
                        πœ€ 𝑒𝑖   2
                                   =
                                     π‘š
                                               πœ‡ 𝑒𝑖 βˆ’        π‘Ÿπ‘‘    𝑒𝑖
                                         𝑑=1

2.   Estimate the magic barrier by taking the average
                                                1
                                    ℬ=                                  πœ€ 𝑒𝑖 2
                                                𝒳
                                                          𝑒𝑖 βˆˆπ’³




             20 July 2012                The Magic Barrier                       12
A real-world user study




     20 July 2012   The Magic Barrier   13
A User Study

β–Ί We teamed up with moviepilot.de
      Germany’s largest online movie recommendation community
      Ratings scale 1-10 stars (Netflix: 1-5 stars)
β–Ί Created a re-rating UI

      Users were asked to re-rate at least 20 movies
        1 new rating (so-called opinions) per movie
     Collected data:
        306 users
        6,299 new opinions
        2,329 movies




           20 July 2012       The Magic Barrier              14
A User Study




      User study                             moviepilot




          20 July 2012   The Magic Barrier                15
A User Study


                    ~4 ratings steps          Room for improvement

                                        ~1 rating steps




 Predictions vs               Ratings above                    Ratings below
    Ratings                   user’s average                   user’s average
                    Overall                   Opinions above                Opinions below
                  Magic Barrier               user’s average                user’s average

              20 July 2012         The Magic Barrier                                  16
Conclusion

β–Ί We created a mathematical characterization of the magic
  barrier
β–Ί We performed a user study on a commercial movie

  recommendation website and estimated its magic barrier
β–Ί We concluded the commercial recommender engine still has

  room for improvement

β–Ί   No magic




               20 July 2012   The Magic Barrier              17
More?

β–Ί   Estimating the Magic Barrier of Recommender Systems: A User Study
         SIGIR 2012

β–Ί   Magic Barrier explained
       http://irml.dailab.de

β–Ί   Movie rating and explanation user study
       http://j.mp/ratingexplain

β–Ί   Recommender Systems Wiki
        www.recsyswiki.com

β–Ί   Recommender Systems Challenge
        www.recsyschallenge.com


               20 July 2012     The Magic Barrier                       18
Questions?




β–Ί   Thank You for Listening!




              20 July 2012     The Magic Barrier   19

More Related Content

More from Alan Said

Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Alan Said
Β 
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsThe Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsAlan Said
Β 
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed SystemsA Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed SystemsAlan Said
Β 
Information Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationInformation Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationAlan Said
Β 
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...Alan Said
Β 
A 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System EvaluationA 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System EvaluationAlan Said
Β 
State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012Alan Said
Β 
RecSysChallenge Opening
RecSysChallenge OpeningRecSysChallenge Opening
RecSysChallenge OpeningAlan Said
Β 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
Β 
Estimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyEstimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyAlan Said
Β 
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Alan Said
Β 
CaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationCaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationAlan Said
Β 
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesPersonalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesAlan Said
Β 
Inferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceInferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceAlan Said
Β 
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation QualityUsing Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation QualityAlan Said
Β 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender SystemsAlan Said
Β 

More from Alan Said (16)

Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Β 
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsThe Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
Β 
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed SystemsA Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
Β 
Information Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationInformation Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System Evaluation
Β 
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
Β 
A 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System EvaluationA 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System Evaluation
Β 
State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012
Β 
RecSysChallenge Opening
RecSysChallenge OpeningRecSysChallenge Opening
RecSysChallenge Opening
Β 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
Β 
Estimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyEstimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User Study
Β 
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Β 
CaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationCaRR 2012 Opening Presentation
CaRR 2012 Opening Presentation
Β 
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesPersonalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
Β 
Inferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceInferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender Performance
Β 
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation QualityUsing Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
Β 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Β 

Recently uploaded

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
Β 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
Β 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
Β 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
Β 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
Β 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
Β 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
Β 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
Β 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
Β 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
Β 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
Β 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
Β 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
Β 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
Β 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
Β 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
Β 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
Β 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
Β 

Recently uploaded (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
Β 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
Β 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
Β 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
Β 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
Β 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
Β 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Β 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
Β 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Β 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Β 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
Β 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
Β 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
Β 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Β 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Β 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
Β 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
Β 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
Β 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Β 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
Β 

Users and Noise: The Magic Barrier of Recommender Systems

  • 1. Users and Noise: The Magic Barrier of Recommender Systems Alan Said, Brijnesh J. Jain, Sascha Narr, Till Plumbaum Competence Center Information Retrieval & Machine Learning @alansaid, @saschanarr, @matip
  • 2. Outline β–Ί The Magic Barrier β–Ί Empirical Risk Minimization β–Ί Deriving the Magic Barrier β–Ί User Study β–Ί Conclusion 20 July 2012 The Magic Barrier 2
  • 3. The Magic Barrier 20 July 2012 The Magic Barrier 3
  • 4. The Magic Barrier β–Ί No magic involved.... β–Ί Coined by Herlocker et al. in 2004  β€œ...an algorithm cannot be more accurate than the variance in a user’s ratings for the same item.”  The maximum level of prediction that a recommender algorithm can attain. β–Ί What does this mean? 20 July 2012 The Magic Barrier 4
  • 5. The Magic Barrier 20 July 2012 The Magic Barrier 5
  • 6. The Magic Barrier β–Ί Even a β€œperfect” recommender should not reach RMSE = 0 or Precision @ N = 1 β–Ί Why?  People are inconsistent and noisy in their ratings  β€œperfect” accuracy is not perfect β–Ί So?  Knowing the highest possible level of accuracy, we can stop optimizing our algorithms at β€œperfect” (before overfitting) 20 July 2012 The Magic Barrier 6
  • 7. The Magic Barrier So – how do we find the magic barrier? We employ the Empirical Risk Minimization principle and a statistical model for user inconsistencies 20 July 2012 The Magic Barrier 7
  • 8. The Magic Barrier – User Inconsistencies Assumption:  If a user were to re-rate all previously rated items, keeping in mind the inconsistency, the ratings would differ, i.e. π‘Ÿ 𝑒𝑖 = πœ‡ 𝑒𝑖 + πœ€ 𝑒𝑖  where ο€­ πœ‡ 𝑒𝑖 is the expected rating, and ο€­ πœ€ 𝑒𝑖 the rating error (has zero mean) 20 July 2012 The Magic Barrier 8
  • 9. Empirical Risk Minimization β–Ί … is a principle in statistical learning theory which defines a family of learning algorithms and is used to give theoretical bounds on the performance of learning algorithms.[Wikipedia] 20 July 2012 The Magic Barrier 9
  • 10. Empirical Risk Minimization β–Ί We formulate our risk function as  𝑅 𝑓 = 𝑒,𝑖,π‘Ÿ 𝑝 𝑒, 𝑖, π‘Ÿ 𝑓 𝑒, 𝑖 βˆ’ π‘Ÿ 2 The prediction error The probability of user u rating item i with score r β–Ί Keeping the assumption in mind, we formulate the risk for a true, unknown, rating function as the sum of the noise variance, i.e.  𝑅 π‘“βˆ— = 𝑒,𝑖 𝑝 𝑒, 𝑖 𝕍 πœ€ 𝑒𝑖  where 𝕍 πœ€ 𝑒𝑖 is the noise variance 20 July 2012 The Magic Barrier 10
  • 11. Deriving the Magic Barrier β–Ί We want to express the risk function in terms of a magic barrier for RMSE – we take the root of the risk function  ℬ 𝒰×ℐ = 𝑒,𝑖 𝑝 𝑒, 𝑖 𝕍 πœ€ 𝑒𝑖  RMSE=0 iff πœ€ 𝑒𝑖 = 0 over all ratings users and items β–Ί In terms of RMSE we can express this as  𝐸 𝑅𝑀𝑆𝐸 𝑓 = ℬ 𝒰×ℐ + 𝐸 𝑓 > ℬ 𝒰×ℐ  where 𝐸 𝑓 is the error 20 July 2012 The Magic Barrier 11
  • 12. Estimating the Magic Barrier 1. For each user-item pair in our population a) Sample ratings on a regular basis, i.e. re-ratings b) Estimate the expected value of ratings π‘š 1 πœ‡ 𝑒𝑖 = π‘Ÿ 𝑑 𝑒𝑖 π‘š 𝑑=1 c. Estimate the rating variance π‘š 1 2 πœ€ 𝑒𝑖 2 = π‘š πœ‡ 𝑒𝑖 βˆ’ π‘Ÿπ‘‘ 𝑒𝑖 𝑑=1 2. Estimate the magic barrier by taking the average 1 ℬ= πœ€ 𝑒𝑖 2 𝒳 𝑒𝑖 βˆˆπ’³ 20 July 2012 The Magic Barrier 12
  • 13. A real-world user study 20 July 2012 The Magic Barrier 13
  • 14. A User Study β–Ί We teamed up with moviepilot.de  Germany’s largest online movie recommendation community  Ratings scale 1-10 stars (Netflix: 1-5 stars) β–Ί Created a re-rating UI  Users were asked to re-rate at least 20 movies  1 new rating (so-called opinions) per movie  Collected data:  306 users  6,299 new opinions  2,329 movies 20 July 2012 The Magic Barrier 14
  • 15. A User Study User study moviepilot 20 July 2012 The Magic Barrier 15
  • 16. A User Study ~4 ratings steps Room for improvement ~1 rating steps Predictions vs Ratings above Ratings below Ratings user’s average user’s average Overall Opinions above Opinions below Magic Barrier user’s average user’s average 20 July 2012 The Magic Barrier 16
  • 17. Conclusion β–Ί We created a mathematical characterization of the magic barrier β–Ί We performed a user study on a commercial movie recommendation website and estimated its magic barrier β–Ί We concluded the commercial recommender engine still has room for improvement β–Ί No magic 20 July 2012 The Magic Barrier 17
  • 18. More? β–Ί Estimating the Magic Barrier of Recommender Systems: A User Study  SIGIR 2012 β–Ί Magic Barrier explained  http://irml.dailab.de β–Ί Movie rating and explanation user study  http://j.mp/ratingexplain β–Ί Recommender Systems Wiki  www.recsyswiki.com β–Ί Recommender Systems Challenge  www.recsyschallenge.com 20 July 2012 The Magic Barrier 18
  • 19. Questions? β–Ί Thank You for Listening! 20 July 2012 The Magic Barrier 19