Optimization techniques for Kernel Logistic Regression on large-scale datasets: A comparative study

Name: Optimization techniques for Kernel Logistic Regression on large-scale datasets: A comparative study
Start: 2023-06-28T00:00:00Z
End: 2023-06-30T00:00:00Z
Location: Sevilla, Spain

José Ángel Martín-Baos, Ricardo García-Ródenas, Luis Rodriguez-Benitez

Code Project Slides

Date

Jun 28, 2023 — Jun 30, 2023

Event

5th International Conference on Advanced Research Methods and Analytics (CARMA 2023)

Location

Sevilla, Spain

Abstract

In recent years, machine learning techniques have been increasingly applied to modelling the decision-making processes of individuals. One technique that has shown good results in the literature for modelling complex behaviours is the Kernel Logistic Regression (KLR). However, standard KLR implementations have a time complexity of 𝒪(𝑛^3), which is not feasible for large datasets. To overcome this limitation, one of the purposed alternatives is to approximate the kernel matrix using the Nyström method. The aim of this work is to evaluate the Nyström KLR model on large-scale datasets and to study, at the experimental level, which of the optimisation techniques that allow training this model is the most efficient. As results, the authors show that the Nyström method efficiently computes the objective function and its gradient, enabling the training of KLR models with up to 10^5 parameters. Then, it is evaluated the performance of several optimisation methods, including gradient descend, Momentum, Adam, and L-BFGS-B. It can be concluded that L-BFGS-B is the most efficient method for training the Nyström KLR model. However, given enough computational time and proper hyperparameter tuning, the Adam method can also yield good results.