Posts

TiDE - Forecasting in the lodging world

Note that the development of this article was stopped after I noticed a problem in the official TiDE implementation. After noticing the bug, I realized an issue had already been opened in the repo: LINK. You can still read the article, as it provides some key insights into how to formalize the forecasting problem for lodging signals, but I'm not using this model in the end and won't provide any additional details for the implementation. I'm currently writing an article on how to implement the SOFTS paper in tensorflow for this same use case; it's a pretty cool paper (I made sure that there were no blatant issues this time 👀) and also quite similar to TiDE, as it mostly uses simple dense layers in an encoder-decoder style. So stay tuned! If you take the time to read the code and the paper carefully, you'll actually realize that the TiDE model is nothing but... a linear regression using the target lags! All of the deep learning technicalities, the encoder and the d...
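To make the "linear regression on the target lags" remark concrete, here is a minimal sketch (mine, not the official TiDE or SOFTS code) contrasting a plain linear map from lags to horizon with the same mapping routed through dense encoder/decoder blocks; the layer sizes and horizon are made up for illustration.

```python
import tensorflow as tf

# Hypothetical shapes for a lodging-style signal: predict the next
# `horizon` values from the last `lookback` lags of the target.
lookback, horizon, hidden = 64, 14, 32

# The baseline alluded to above: a plain linear regression on the target lags.
linear_on_lags = tf.keras.Sequential([
    tf.keras.Input(shape=(lookback,)),
    tf.keras.layers.Dense(horizon),  # one weight per (lag, step) pair + bias
])

# A TiDE/SOFTS-flavoured variant: the same lags-to-horizon mapping, but
# routed through dense "encoder"/"decoder" blocks (no attention, no recurrence).
dense_encoder_decoder = tf.keras.Sequential([
    tf.keras.Input(shape=(lookback,)),
    tf.keras.layers.Dense(hidden, activation="relu"),  # encoder
    tf.keras.layers.Dense(hidden, activation="relu"),  # decoder
    tf.keras.layers.Dense(horizon),                    # projection to the horizon
])

linear_on_lags.compile(optimizer="adam", loss="mse")
dense_encoder_decoder.compile(optimizer="adam", loss="mse")
```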

An Embedding Learning Framework for Numerical Features in CTR Prediction

In this post we're going to review a paper written by our Chinese friends, for which I couldn't find an implementation out there (at least in PyTorch or TF; you can find one in MindSpore, but I felt quite discouraged going through their code as it uses a lot of wrappers). This paper is perfect for beginner- to intermediate-level Machine Learning coders to get their hands on implementing NN papers. The story behind me reading this paper is pretty straightforward: I was doing some research on building an expressive deep learning model for a kind of CTR (click-through rate, meaning the number of clicks divided by the number of impressions) prediction model at work, and it turned out that DeepFM (Deep Factorisation Machines) was state-of-the-art (at least at the time of reading). Building a CTR model usually means you're working with some kind of search data, which usually comes flavoured with high-dimensional categorical features, with each feature...
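As a rough illustration of the setting (my own sketch, not the paper's mechanism), here is how a toy CTR model could embed a high-cardinality categorical feature with a standard lookup table, and give a numerical feature a learned embedding by softly mixing a small set of meta-embeddings; all names and sizes are hypothetical.

```python
import tensorflow as tf

# Toy setup: one categorical feature with a large vocabulary and one
# numerical feature, both mapped to the same embedding dimension.
vocab_size, emb_dim, n_meta = 10_000, 16, 8

# Categorical feature: the usual embedding lookup used in CTR models.
cat_ids = tf.keras.Input(shape=(1,), dtype="int32")
cat_emb = tf.keras.layers.Embedding(vocab_size, emb_dim)(cat_ids)
cat_emb = tf.keras.layers.Flatten()(cat_emb)

# Numerical feature: instead of feeding the raw float, learn a soft
# assignment over a small set of "meta" embeddings (a rough sketch of the
# general idea of embedding numerical features, not the paper's exact method).
num_val = tf.keras.Input(shape=(1,), dtype="float32")
logits = tf.keras.layers.Dense(n_meta)(num_val)               # score per meta-embedding
weights = tf.keras.layers.Softmax()(logits)                   # soft "bucket" assignment
num_emb = tf.keras.layers.Dense(emb_dim, use_bias=False)(weights)  # weighted mix

# Concatenate both embeddings and predict a click probability (deep tower only).
x = tf.keras.layers.Concatenate()([cat_emb, num_emb])
x = tf.keras.layers.Dense(32, activation="relu")(x)
ctr = tf.keras.layers.Dense(1, activation="sigmoid")(x)

model = tf.keras.Model(inputs=[cat_ids, num_val], outputs=ctr)
model.compile(optimizer="adam", loss="binary_crossentropy")
```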

Estimation of conditional mixture Weibull distribution with right-censored data using neural network for time-to-event analysis

Paper link. This paper extends classical parametric time-to-event models (a branch of Survival Analysis (SA)) with a neural network (NN) architecture. While providing the implementation, we will review the following concepts. Theoretical concepts: survival analysis; parametric models; maximum likelihood estimation; drawing random numbers from a given distribution. Implementation details (colab, p=1 and colab, any p): custom loss and architecture with Tensorflow; how to use Tensorflow Probability as an alternative way to build the model architecture and its loss; time-to-event prediction with simulated data (the ones discussed in the paper, cf. Figure 4). The key takeaway of this article is how to leverage Tensorflow Probability for survival analysis. Using the framework removes a lot of pain, typically yielding a code base that is easy to modify and that can accommodate every setting described in this article (i.e. using a Mixture or not, using the Weibull or some other distribution). ...
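As a hedged sketch of the kind of loss Tensorflow Probability makes easy to write (my own shapes and helper name, not the paper's or the notebooks' code), here is a right-censored negative log-likelihood for a mixture of p Weibull components, assuming a network head that outputs per-sample mixture logits and positive concentration/scale parameters.

```python
import tensorflow as tf
import tensorflow_probability as tfp

tfd = tfp.distributions

def censored_nll(t, event, logits, concentration, scale):
    """Right-censored negative log-likelihood for a Weibull mixture.

    t: observed times, shape (batch,)
    event: 1.0 if the event occurred, 0.0 if right-censored, shape (batch,)
    logits, concentration, scale: per-component parameters, shape (batch, p)
    """
    components = tfd.Weibull(concentration=concentration, scale=scale)
    mixture = tfd.MixtureSameFamily(
        mixture_distribution=tfd.Categorical(logits=logits),
        components_distribution=components,
    )
    # Uncensored samples contribute log f(t); censored ones contribute log S(t).
    log_f = mixture.log_prob(t)
    # Mixture survival built from the components:
    # log S(t) = logsumexp_k [ log pi_k + log S_k(t) ].
    log_pi = tf.nn.log_softmax(logits, axis=-1)
    log_s_k = components.log_survival_function(t[..., tf.newaxis])
    log_s = tf.reduce_logsumexp(log_pi + log_s_k, axis=-1)
    return -tf.reduce_mean(event * log_f + (1.0 - event) * log_s)
```

Setting p=1 recovers the single-Weibull case discussed in the first colab, and swapping `tfd.Weibull` for another positive-support distribution changes only one line, which is the "easy to modify" point made above.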