Cathy MAUGIS-RABUSSEAU

Professor

Institut de Mathématique de Toulouse / INSA Toulouse

About me

Professor at INSA Toulouse
Member of the Institut de Mathématique de Toulouse (IMT)
Team: Statistics and Optimization
Member of the Mathematics, Biology and Health
Leader of the Biostatistics Platform Toulouse

Interests

Mixture models
Unsupervised classification
Testing procedures
Statistical methods for genomic data analysis
Application to microarray, RNA-seq, single-cell RNAseq, spatial transcriptomic data analysis

Education

HDR in Applied Mathematics, 2022

Université Paul Sabatier, Toulouse, France.
PhD in Applied Mathematics, 2008

Université Paris-Sud 11, France.
MS in Mathematics, 2005

Université Paris-Sud 11, France.

Projects

DEFIANT

DEFIANT - An interdisciplinary approach to the design of effective nanoparticle-based antimicrobials

DDisc

DDisc - Double-dipping in single-cell RNAseq (2021-2024).

Single Cell

Single Cell - Projet TTIL 2018 INP-INSA-ISAE (2018-2019).

MixStatSeq

Mixture-based procedures for statistical analysis of RNA-seq data (ANR JCJC)

Softwares

ASTEC-sc = A Shiny application To Explore Clusterings of single cell RNA seq data.
ASTEC-sc is an interactive single-cell RNA-seq application. Written with the R package Shiny, this application allows you to upload a SingleCellExperiment (SCE) object containing count, normcount and logcount data sets, different cell clusterings, coordinates dimensionality reduction methods … This application allows you to visualize cells in dimensionality reduction methods and expressed genes, to compare cell clusterings and to determine marker genes. It is maintained by Nicolas Enjalbert-Courrech

R package maskmeans

This package is devoted to perform an aggregation / splitting multi-view K-means algorithm, starting with an initial clustering partition or matrix of posterior probabilities. The goal is to refine/improve the clustering obtained on the first, primary view by using additional data views; in addition, views which contain only noise or partially concordant information are down-weighted by the algorithm.

The Bioconductor package coseq

This package is devoted to the co-expression analysis of sequencing data. It contains the Poisson mixture models developed in HTSCluster (see below), the strategy based on Gaussian mixture models on transformed profiles (see Rau and Maugis-Rabusseau, 2016 for more details) and the use of the K-means algorithm for RNA-seq profiles after transformation via the centered log ratio (CLR) or log centered log ratio (logCLR) transformation (see Godichon-Baggioni et al, 2017).

R package SelvarMix

The R-package SelvarMix for variable selection in model-based clustering and discriminant analysis with a regularization approach

The R package HTSCluster

This package implements two parameterizations of a Poisson mixture model to cluster observations (e.g., genes) in high throughput sequencing data. Parameter estimation is performed using either the EM or CEM algorithm, and the BIC or ICL criteria are used for model selection (i.e., to choose the number of clusters).

Publications

IGLOO: An Iterative Global Exploration and Local Optimization Algorithm to Find Diverse Low-Energy Conformations of Flexible Molecules

William MARGERIT, Antoine CHARPENTIER, Cathy MAUGIS-RABUSSEAU, Johann Christian SCHON, Nathalie TARRAT, Juan CORTES

Selective inference after convex clustering with l1 penalization

François BACHOC, Cathy MAUGIS-RABUSSEAU, Pierre NEUVIAL

Quelques contributions autour des modèles de mélanges et pour l'analyse de données transcriptomiques

Cathy MAUGIS-RABUSSEAU

The DendrisCHIP® Technology as a New, Rapid and Reliable Molecular Method for the Diagnosis of Osteoarticular Infections

Elodie BERNARD, Thomas PEYRET, Mathilde PLINET, Yohan CONTIE, Thomas CAZAUDARRE, Yannick ROUQUET, Matthieu BERNIER, Stéphanie PESANT, Richard FABRE, Aurore ANTON, Cathy MAUGIS-RABUSSEAU, Jean-Marie FRANCOIS

Insights on the control of yeast single-cell growth variability by members of the Trehalose Phosphate Synthase (TPS) complex

Sevan ARABACIYAN, Michael SAINT-ANTOINE, Cathy MAUGIS-RABUSSEAU, Jean-Marie FRANCOIS, Abhyudai SINGH, Jean-Luc PARROU, Jean-Pascal CAPP

Supermix : sparse regularization for mixtures.

Yohann De CASTRO, Sébastien GADAT, Clément MARTEAU, Cathy MAUGIS-RABUSSEAU

Multiview cluster aggregation and splitting, with an application to multiomic breast cancer data

Antoine GODICHON-BAGGIONI, Cathy MAUGIS-RABUSSEAU, Andréa RAU

Parameter recovery in two-component contamination mixtures: The $L^2$ strategy

Sébastien GADAT, Jonas KAHN, Clément MARTEAU, Cathy MAUGIS-RABUSSEAU

Variable selection in model-based clustering and discriminant analysis with a regularization approach

Gilles CELEUX, Cathy MAUGIS-RABUSSEAU, Mohammed SEDKI

Clustering transformed compositional data using K-means, with applications in gene expression and bicycle sharing system data

Antoine GODICHON-BAGGIONI, Cathy MAUGIS-RABUSSEAU, Andréa RAU

Multidimensional two-component Gaussian mixtures detection

Béatrice LAURENT-BONNEAU, Clément MARTEAU, Cathy MAUGIS-RABUSSEAU

Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis

Guillem RIGAILL, Sandrine BALZERGUE, Veronique BRUNAUD, Eddy BLONDET, Andréa RAU, Odile ROGIER, Jose CAIUS, Cathy MAUGIS-RABUSSEAU, Ludivine SOUBIGOU-TACONNAT, Sebastien AUBOURG

Transformation and model choice for RNA-seq co-expression analysis

Andréa RAU, Cathy MAUGIS-RABUSSEAU

Chapter 10 : Clustering of co-expressed genes.

Marie-Laure MARTIN-MAGNIETTE, Cathy MAUGIS-RABUSSEAU, Andréa RAU

Chapter 9: High-dimensional clustering.

Christophe BIERNACKI, Cathy MAUGIS-RABUSSEAU

On the estimation of mixtures of Poisson regression models with large number of components

Panagiotis PAPASTAMOULIS, Marie-Laure MARTIN-MAGNIETTE, Cathy MAUGIS-RABUSSEAU

Non-asymptotic detection of two-component mixtures with unknown means

Béatrice LAURENT-BONNEAU, Clément MARTEAU, Cathy MAUGIS-RABUSSEAU

Co-expression analysis of high-throughput transcriptome sequencing data with Poisson mixture models

Andréa RAU, Cathy MAUGIS-RABUSSEAU, Marie-Laure MARTIN-MAGNIETTE, Gilles CELEUX

Comparing model selection and regularization approaches to variable selection in model-based clustering

Gilles CELEUX, Marie-Laure MARTIN-MAGNIETTE, Cathy MAUGIS-RABUSSEAU, Adrian E RAFTERY

Adaptive density estimation for clustering with Gaussian mixtures

Cathy MAUGIS-RABUSSEAU, Bertrand MICHEL

A sparse variable selection procedure in model-based clustering

Caroline MEYNET, Cathy MAUGIS-RABUSSEAU

SelvarClustMV: Variable selection approach in model-based clustering allowing for missing values

Cathy MAUGIS-RABUSSEAU, Marie-Laure MARTIN-MAGNIETTE, Sandra PELLETIER

Slope heuristics: overview and implementation

Jean-Patrick BAUDRY, Cathy MAUGIS, Bertrand MICHEL

A non asymptotic penalized criterion for Gaussian mixture model selection

Cathy MAUGIS, Bertrand MICHEL

Data-driven penalty calibration: a case study for Gaussian mixture model selection

Cathy MAUGIS, Bertrand MICHEL

Variable selection in model-based discriminant analysis

Cathy MAUGIS, Gilles CELEUX, Marie-Laure MARTIN-MAGNIETTE

Sélection de variables pour la classification par mélanges gaussiens pour prédire la fonction des gènes orphelins

Cathy MAUGIS, Marie-Laure MARTIN-MAGNIETTE, Jean-Philippe TAMBY, Jean-Pierre RENOU, Alain LECHARNY, Sebastien AUBOURG, Gilles CELEUX

Variable selection for clustering with Gaussian mixture models

Cathy MAUGIS, Gilles CELEUX, Marie-Laure MARTIN-MAGNIETTE

Variable selection in model-based clustering: A general variable role modeling

Cathy MAUGIS, Gilles CELEUX, Marie-Laure MARTIN-MAGNIETTE

Variable selection for model-based clustering. Application for transcriptome data analysis

Cathy MAUGIS

Contact

cathy.maugis@insa-toulouse.fr
+33 5 61 55 92 30 (INSA Bur. 116) / +33 5 61 55 86 48 (UPS-1R1 Bur 208)
INSA Toulouse
Département Génie Mathématique et Modélisation (GMM)
Bat 12
135 avenue de Rangueil
31007 Toulouse Cedex,

Cathy MAUGIS-RABUSSEAU

Professor

Institut de Mathématique de Toulouse / INSA Toulouse

About me

Projects

DEFIANT

DDisc

Single Cell

MixStatSeq

RNA-Seq et Stat

Softwares

ASTEC-sc (Shiny app.)

R package maskmeans

The Bioconductor package coseq

R package SelvarMix

The R package HTSCluster

R package Capushe

R package HTSDiff

SelvarClust - SelvarClustIndep - SelvarClustMV

Publications

Contact