Víctor Codina’s PhD thesis defended under 1000001 Labs’ supervision

Victor 1

Víctor Codina addresses current issues related to how the distributional semantics of concepts describing the entities of the recommendation space can be exploited to mitigate the data-sparsity problem and improve the prediction accuracy with respect to state-of-the-art recommendation techniques. Víctor also throws light on two novel semantically-enhanced prediction models that address the sparsity-related limitations:

(1) a content-based approach, which exploits the distributional semantics of item’s attributes during item and user-profile matching;

(2) a context-aware recommendation approach that exploits the distributional semantics of contextual conditions during context modeling.

Víctor Codina, Exploiting Distributional Semantics for Content-Based and Context-Aware Recommendation

13 June 2014

11:00 – 13:00

Exploiting distributional semantics for recommendation

Artificial Intelligence

PhD defence ceremony

Supervisor

Luigi Ceccaroni, 1000001 Labs

Entrance

This event is open to the public.

The summary:

During the last decade, the use of recommender systems has been increasingly growing to the point that, nowadays, the success of many well-known services depends on these technologies. Recommenders Systems help people to tackle the choice overload problem by effectively presenting new content adapted to the user’s preferences. However, current recommendation algorithms commonly suffer from data sparsity, which refers to the incapability of producing acceptable recommendations until a minimum amount of users’ ratings are available for training the prediction models.

This thesis investigates how the distributional semantics of concepts describing the entities of the recommendation space can be exploited to mitigate the data-sparsity problem and improve the prediction accuracy with respect to state-of-the-art recommendation techniques. The fundamental idea behind distributional semantics is that concepts repeatedly co-occurring in the same context or usage tend to be related. In this thesis, we propose and evaluate two novel semantically-enhanced prediction models that address the sparsity-related limitations: (1) a content-based approach, which exploits the distributional semantics of item’s attributes during item and user-profile matching, and (2) a context-aware recommendation approach that exploits the distributional semantics of contextual conditions during context modeling. We demonstrate in an exhaustive experimental evaluation that the proposed algorithms outperform state-of-the-art ones, especially when data are sparse.

Finally, this thesis presents a recommendation framework, which extends the widespread machine learning library Apache Mahout, including all the proposed and evaluated recommendation algorithms as well as a tool for offline evaluation and meta-parameter optimization. The framework has been developed to allow other researchers to reproduce the described evaluation experiments and make new progress on the Recommender Systems field easier.

Published by Technical University of Catalonia (UPC)

PhD evaluation committee:

Andrej Košir, Associate professor at University of Ljubljana
Javier Vazquez, Associate Professor at the Technical University of Catalonia
Francesco Ricci, Associate professor at Free University of Bolzano
Linas Baltrunas, Researcher at Telefonica Research
Marc Torrens, Chief Technology Officer at Strands