We naturally plot (x, y) on a plane, but what about complex data ? Series for instance. For this kind, traditional plot does not help. In this article, taking Syracuse series as an example, I describe how to process the data to get a nice 2D plot where some analysis can be performed.
Tags: machine learning graphs mathematics series Syracuse
This website is generated thanks to Jekyll. Embedding a Bokeh plot as an html file was easy. However, this way prevents from adding multiple plot. In this article I describe one possible solution.
Tags: programming bokeh jekyll
Experiments against NMI evaluation.
Tags: machine learning unsupervised clustering ensemble
EClust are often evaluated using NMI and ARI. If they were good measure, you would get a score of one for a perfect consensus, and 0 for a very bad one. There are very simple experiments showing they do not behave this way. This article will show the problem and describe where it comes from. Next, we will propose alternative measure this consensus functions.
Tags: clustering ensemble machine learning scoring unsupervised
Ensemble clustering aims to combine multiple clustering together to get a better consensus clustering. Here, I will present the main approaches, and how evaluation is performed. I will introduce some reflection about scoring and diversity.
Tags: machine learning unsupervised clustering ensemble
A short introduction to clustering algorithms and their related problems. Disclaimer - this is not an introduction to KMeans or DBSCAN, here, the main concept and evaluation strategies are presented.
Tags: machine learning clustering black box understanding
Here we (try to) explain and illustrate information theory common formula.
Tags: entropy information theory Shannon
When you read around thousand of papers a year, you become aware that there are many ways to measure distance or similarity between stuff. They do not apply to the same object, the same data-structure, so this variety is necessary.
Tags: distance metric loss similarity divergence scoring
Writing is hard. It is not as easy as pressing the "enter" button to publish your post. It requires more than writing down a bunch of ideas. People cannot read in our mind, therefore we need to organize ideas in a coherent way, around a red line or a story. Additionally, finishing everything requires commitment, checking everything, arranging stuff in a nice way. Nevertheless, even if it is really time consuming, you learn plenty of stuff.
Tags: tech writting blog being famous
I got invited to the Collège de France to present the results. Here is the recorded video.
Tags: Machine Learning Time Series Cryptocurrencies