Menu

Vincent MICHEL

Paris

En résumé

I am currently a Data Scientist / Senior Developer at PriceMinister/Rakuten, working on machine learning and data mining, Recommendations engines and Record Linkage.
I develop mostly in Python (pandas, scikit-learn), using big-data technologies (such as Hadoop, Cassandra, GlusterFS, Couchbase, among others), and works on statistical analysis and machine learning.

Formerly, I worked as a Data Scientist / R&D engineer at Logilab, especially in the context of Web semantic. I developed tools for the CubicWeb framework, and I contributed to Scikit-learn, a Python module for machine learning.

I hold a PhD in Computer Science, and more especially in Machine Learning, Data Mining, applied to neuroimaging. I am an ESPCI (Ecole Supérieure de Physique et de Chimie Industrielles de Paris) engineer, and I hold a Master 2 in Applied Mathematics, Computer Vision and Machine Learning, from the Ecole Normale Supérieure de Cachan (Cachan). I also followed the teaching Unit in Anatomy and Imagery of the central nervous system at the Faculty of Medicine Pitié-Salpétrière (Paris).


Sofware skills:
* Python (development, trainer), with more than 6 years of experience.
* Hadoop/HDFS, parallel computing, data streaming (ZMQ).
* SQL databases (PostgreSQL, Postgis, SQLite).
* NoSQL databases (Redis, Cassandra, Couchbase).
* Agile development and software engineering.
* Web (HTML, CSS, Javascript, CubicWeb famework)..
* Mercurial, Git.
* Debian/CentOS, Windows, IOs.


Data engineering skills:
* Import and data structuration (SQL, NoSQL, Semantic Web, record linkage).
* Datamining and machine learning (Scikit-learn, NLTK, Pandas).
* Statistical learning (feature selection, predictive model, regularization and Bayesian methods).
* Data visualization.
* Deep-learning for images classification.

Research interests / Application fields:
* E-commerce.
* Catalogues/Bibliographical data.
* Medical data.
* OpenData

Mes compétences :
Gestion de projet
Web Sémantique
Python

Entreprises

  • Priceminister - Datascientist / Software engineer

    Paris 2014 - maintenant I am currently a Data Scientist / Senior Developer at PriceMinister/Rakuten, working on machine learning and data mining, Recommendations engines and Record Linkage.
    I develop mostly in Python (pandas, scikit-learn), using big-data technologies (such as Hadoop, Cassandra, GlusterFS, Couchbase, among others), and works on statistical analysis and machine learning.
  • Logilab - Ingénieur R&D

    PARIS 2011 - 2014 Ingénieur R&D chez Logilab (http://www.logilab.fr/), un des spécialistes dans l'informatique scientifique et la gestion de connaissances.
    Expert en apprentissage statistique, je travaille sur des thématiques de fouilles de données dans des bases de très grandes dimensions, de prédiction et de classification automatique de documents.
    Je développe aussi en Python des interfaces Web (connaissances en HTML, Javascript, CSS) et Web sémantique pour des bases de données (SQLLite, PostgreSQL), en utilisant CubicWeb (http://www.cubicweb.org/).
  • Carnegie Mellon University (Pittsburgh - USA) - Machine Learning department - Invited student

    2009 - 2009 Invited student for two months.
    Machine learning for brain imaging data.
  • CEA - Neurospin - Doctorant INRIA

    2007 - 2010 I began my PhD in October 2007 under the supervision of Gilles Celeux , Christine Keribin (Dept. de mathématiques, Université Paris Sud, Select Team) and Bertrand Thirion. I have a funding from the LRI (Laboratoire de Recherche en Informatique, Université Paris Sud), and from the INRIA. During this thesis, I have developed some statistical learning methods for the study of fMRI data.

    Research interests

    * fMRI data analysis.
    * Statistical learning, feature selection, predictive model, regularization.
    * Spatial information, clustering.
    * Bayesian methods.
    * Participation in the development of Scikit-learn, a library of statistical learning in Python.


    http://parietal.saclay.inria.fr/Members/vincent-michel
  • Sony CSL Paris - Stagiaire

    2006 - 2006 Projet de recherche en robotique : recherche de modèles biologiquement plausibles d'apprentissage dans le cervelet.
    Application à la robotique.

Formations

Réseau

Annuaire des membres :