## SCHEDULE

All lectures are in room 11-13 in Casa Convalescencia.

*** The schedule may still slightly change ***

###### TUESDAY, 29 June

###### WEDNESDAY, 30 June

08:30-09:20 REGISTRATION

09:10-09:20 Welcome

09:20-10:00 Nicolò Cesa-Bianchi (University of Milan)

short break

10:10-10:50 Alessandro Rudi (INRIA - Paris)

10:50-11:30 COFFEE BREAK

11:30-12:10 Steffen Lauritzen (University of Copenhagen): Locally associated graphical models and mixed convex exponential families

short break

12:20-13:00 Anna Ben-Hamou (Sorbonne Université)

13:00-15:00 LUNCH BREAK

15:00-15:40 TBA

Short break

15:50-16:30 TBA

16:30-17:00 AFTERNOON COFFEE

09:20-10:00 Judith Rousseau (Oxford)

short break

10:10-10:50 Silvio Lattanzi (Google)

10:50-11:30 COFFEE BREAK

11:30-12:10 Andreas Maurer (Istituto Italiano di Technologia): Useful bounds for U-statistics

short break

12:20-13:00 Rémi Bardenet (CNRS)

13:00-15:00 LUNCH BREAK

15:00-15:40 TBA

Short break

15:50-16:30 TBA

16:30-17:00 AFTERNOON COFFEE

18:00-DRINKS-SNACKS.

###### THURSDAY, 1 July

###### FRIDAY, 2 July

09:20-10:00 Pradeep Ravikumar (Carnegie Mellon University)

short break

10:10-10:50 Mohamed Ndaoud (University of Southern California)

10:50-11:30 COFFEE BREAK

11:30-12:10 Wicher Bergsma (London School of Economics): Regression modelling with I-priors

Short break

12:20-13:00 Lorenzo Rosasco (University of Genoa)

12:40-15:00 LUNCH BREAK

15:00-15:40 Alessandro Rinaldo (Carnegie Mellon University)

Short break

15:50-16:30 TBA

16:30-17:00 AFTERNOON COFFEE

09:30-10:10 Benjamin Recht (UC Berkeley)

10:10-11:00 COFFEE BREAK

11:00-11:40 Samory Kpotufe (Columbia University)

Short break

12:00-12:40 Jianqing Fan (Princeton University)

*** FREE AFTERNOON

## ABSTRACTS

Wicher Bergsma (London School of Economics and Political Science)

Regression modelling with I-priors

Standard objective priors, such as the Jeffreys prior, g-prior, or reference prior, have the drawback that they can only be used for finite dimensional parameters. We introduce a new objective prior based on the Fisher information, which can be used for both finite and infinite dimensional parameters. The posterior mean or mode then provides a regularized estimator of the parameter. The I-prior is generally defined as a maximum entropy prior and has the intuitively appealing property that the more information is available on a linear functional of the parameter, the larger the prior variance, and the smaller the influence of the prior mean on the posterior distribution.

The I-prior methodology is particularly attractive from a computational point of view for estimating a parametric or nonparametric regression function. The I-prior for a regression function is very simple, namely Gaussian with covariance kernel proportional to the Fisher information on the regression function. We use the I-prior methodology to give a unifying framework for estimating a variety of regression models, including varying coefficient, multilevel, longitudinal models, and models with functional covariates and responses.

Advantages compared to competing methods, such as Gaussian process regression or Tikhonov regularization, are ease of estimation and model comparison. In particular, we develop an EM algorithm with a simple E and M step for estimating hyperparameters, facilitating estimation for complex models. We also propose a novel parsimonious model formulation, requiring a single scale parameter for each (possibly multidimensional) covariate and no further parameters for interaction effects. This simplifies estimation because fewer hyperparameters need to be estimated, and also simplifies model comparison of models with the same covariates but different interaction effects; in this case, the model with the highest estimated likelihood can be selected.

Using a number of widely analysed real data sets we show that predictive performance of our methodology is competitive. An R-package implementing the methodology is available (Jamil, 2020).

Steffen Lauritzen (University of Copenhagen)

Locally associated graphical models and mixed convex exponential families

Abstract: The notion of multivariate total positivity has proved to be useful in finance and psychology but may be too restrictive in other applications. We propose a concept of local association, where highly connected components in a graphical model are positively associated and study its properties. Our main motivation comes from gene expression data, where graphical models have become a popular exploratory tool. The models are instances of what we term mixed convex exponential families and we show that a mixed dual likelihood estimator has simple exact properties for such families as well as asymptotic properties similar to the maximum likelihood estimator. We further relax the positivity assumption by penalizing negative partial correlations in what we term the positive graphical lasso. Finally, we develop a GOLAZO algorithm based on block-coordinate descent that applies to a number of optimization procedures that arise in the context of graphical models, including the estimation problems described above. We derive results on existence of the optimum for such problems. (based on the article arXiv:2008.04688)

Andreas Maurer (Istituto Italiano di Technologia)

Useful bounds for U-statistics

Abstract: I will present a Bernstein inequality for nondegenerate U-statistics with bounded kernels. The inequality reduces to the standard Bernstein-inequality for order one, it gives the correct variance term and avoids the combinatorial catastrophy, which the decoupling techniques incur for U-statistics of higher order. The bound can be made data-dependent by estimation of the variance term, and it can be equally applied to computationally more attractive incomplete U-statistics. Time permitting I will also give a bound for the suprema of U-processes in terms of Gaussian width.