
Visualizing Syracuse Series

We naturally plot (x, y) points on a plane, but what about more complex data, such as series? For this kind of data, a traditional plot does not help. In this article, taking the Syracuse series as an example, I describe how to process the data to get a nice 2D plot on which some analysis can be performed.

Jul 22, 23

Tags: machine learning graphs mathematics series Syracuse
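
For readers unfamiliar with the Syracuse series mentioned above, here is a minimal sketch of how one such series can be generated. It assumes the usual Collatz rule (halve even numbers, map odd n to 3n + 1, stop at 1); the function name `syracuse` is only illustrative and is not taken from the article.

```python
def syracuse(n):
    """Return the Syracuse (Collatz) series starting at n and stopping at 1."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    series = [n]
    while n != 1:
        # Even terms are halved, odd terms are mapped to 3n + 1.
        n = n // 2 if n % 2 == 0 else 3 * n + 1
        series.append(n)
    return series


print(syracuse(6))  # [6, 3, 10, 5, 16, 8, 4, 2, 1]
```

Each starting value gives a series of a different length, which is why a plain (x, y) plot is not directly applicable.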

Embedding Bokeh Plots with Jekyll

This website is generated with Jekyll. Embedding a Bokeh plot as an HTML file was easy. However, this approach prevents adding multiple plots. In this article, I describe one possible solution.

Jun 21, 23

Tags: programming bokeh jekyll

Ensemble Clustering experiments.

Experiments against NMI evaluation.

Jun 15, 23

Tags: machine learning unsupervised clustering ensemble

Ensemble Clustering Evaluation Issues.

Ensemble clustering (EClust) methods are often evaluated using NMI and ARI. If these were good measures, you would get a score of 1 for a perfect consensus and 0 for a very bad one. Very simple experiments show that they do not behave this way. This article shows the problem and describes where it comes from. Next, we propose alternative measures for evaluating consensus functions.

Jun 01, 23

Tags: clustering ensemble machine learning scoring unsupervised
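
As a reminder of what NMI computes between two labelings, here is a minimal pure-Python sketch. It assumes the geometric-mean normalization, which is only one of several conventions, and the helper names (`entropy`, `mutual_information`, `nmi`) are illustrative. It is not the evaluation code used in the article.

```python
from collections import Counter
from math import log, sqrt


def entropy(labels):
    """Shannon entropy (in nats) of a labeling."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information (in nats) between two labelings of the same items."""
    n = len(a)
    count_a, count_b, count_ab = Counter(a), Counter(b), Counter(zip(a, b))
    return sum(
        (n_ij / n) * log(n_ij * n / (count_a[i] * count_b[j]))
        for (i, j), n_ij in count_ab.items()
    )


def nmi(a, b):
    """NMI with geometric-mean normalization (an assumed convention)."""
    h_a, h_b = entropy(a), entropy(b)
    if h_a == 0 or h_b == 0:
        # Degenerate case: at least one labeling has a single cluster.
        return 1.0 if h_a == h_b else 0.0
    return mutual_information(a, b) / sqrt(h_a * h_b)


# Same partition with permuted label names, then two independent partitions.
print(nmi([0, 0, 1, 1], [1, 1, 0, 0]))
print(nmi([0, 0, 1, 1], [0, 1, 0, 1]))
```

The first call compares identical partitions and the second compares statistically independent ones, the two extremes that the score is meant to separate.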

Ensemble Clustering.

Ensemble clustering aims to combine multiple clusterings to get a better consensus clustering. Here, I present the main approaches and how evaluation is performed. I also introduce some reflections on scoring and diversity.

May 23, 23

Tags: machine learning unsupervised clustering ensemble

Clustering algorithms

A short introduction to clustering algorithms and their related problems. Disclaimer: this is not an introduction to KMeans or DBSCAN; only the main concepts and evaluation strategies are presented.

May 23, 23

Tags: machine learning clustering black box understanding

Information Theory Cheat Sheet

Here we (try to) explain and illustrate the common formulas of information theory.

Apr 01, 23

Tags: entropy information theory Shannon

Distance, Divergence, and Similarity - A Cheat Sheet

When you read around a thousand papers a year, you become aware that there are many ways to measure distance or similarity between things. They do not all apply to the same objects or the same data structures, so this variety is necessary.

Mar 01, 23

Tags: distance metric loss similarity divergence scoring

Writing is not that easy

Writing is hard. It is not as easy as pressing the "enter" button to publish your post. It requires more than writing down a bunch of ideas. People cannot read our minds, so we need to organize ideas coherently, around a common thread or a story. Additionally, finishing a post requires commitment: checking everything and arranging things nicely. Nevertheless, even if it is really time-consuming, you learn plenty along the way.

Feb 02, 23

Tags: tech writing blog being famous

Napoleon X Challenge - Final presentation

I got invited to the Collège de France to present the results. Here is the recorded video.

Jan 01, 23

Tags: Machine Learning Time Series Cryptocurrencies



