Louis Tiao loo-EE tee-OW

Research Scientist

Meta - Central Applied Science (CAS) Team, NYC

Bio

Hi. I am Louis Tiao, a machine learning (ML) research scientist with a broad interest in probabilistic ML and a particular focus on approximate Bayesian inference and Gaussian processes, and their applications to Bayesian optimization and graph representation learning. I obtained my PhD from the University of Sydney, where I worked with Edwin Bonilla and Fabio Ramos. Our research has garnered recognition at premier conferences like NeurIPS and ICML, where our work has routinely been selected as Oral/Spotlight talks.

Starting in August, I will be joining Meta in New York City as a Research Scientist in the Adaptive Experimentation (AE) arm of the Central Applied Science (CAS) Team. In this role, I will continue to advance the frontiers of Bayesian optimization, Gaussian processes, and more broadly, sample-efficient decision-making and probabilistic modeling, with a focus on applications in automated machine learning (AutoML) and deep learning.

Before pursuing my doctorate, I obtained a BSc in Computer Science with First Class Honours from the University of New South Wales (UNSW Sydney), where I majored in artificial intelligence (AI) and minored in mathematics. I began my professional career in ML as a software engineer at National ICT Australia (NICTA), which later merged into CSIRO’s Data61—the AI research division of Australia’s national science agency.

During my PhD, I’ve had the privilege of collaborating with exceptional people through multiple rewarding industrial research internships. In the Summer and Fall of 2019, I completed a research internship at Amazon in Berlin, Germany, where I worked with Matthias Seeger, Cédric Archambeau, and Aaron Klein. Between Fall 2021 and Spring 2022, I worked with Vincent Dutordoir and Victor Picheny at Secondmind Labs in Cambridge, UK. More recently, in Summer 2022, I extended my stay in Cambridge and returned to Amazon, reuniting virtually with my former Berlin team.

Download my resumé .

Interests

Probabilistic Machine Learning
Approximate Bayesian Inference
Gaussian Processes
Bayesian Optimization

Education

Ph.D. in Computer Science (Machine Learning), 2023
University of Sydney
B.Sc. (Honours Class 1) in Computer Science (Artificial Intelligence and Mathematics), 2015
University of New South Wales

News

One paper accepted to ICML2023

Our paper “Spherical Inducing Features for Orthogonally-Decoupled Gaussian Processes” was accepted to ICML2023 as an Oral Presentation!

Last updated on Apr 26, 2023 1 min read News

One paper accepted to NeurIPS2022

Our paper “Batch Bayesian Optimisation via Density-ratio Estimation with Guarantees”, led by Rafael Oliveira, was paper accepted to NeurIPS2022!

Last updated on Oct 20, 2022 0 min read News

Employment

Research Experience

Research Scientist

Meta

August 2024 – Present New York, NY, United States

Applied Scientist Intern

Amazon Web Services

May 2022 – September 2022 Cambridge, United Kingdom

As an applied scientist intern at Amazon Web Services (AWS), I led an explorative research project focused on addressing the challenges of hyperparameter optimization for large language models (LLMs). Our primary objective was to gain a comprehensive understanding of the scaling behavior of LLMs and investigate the feasibility of extrapolating optimal hyperparameters from smaller LLMs to their massive counterparts. This hands-on work involved orchestrating the parallel training of multiple LLMs from scratch across numerous GPU cloud instances to gain insights into their scaling dynamics.

During this internship, I was fortunate to be reunited with Aaron Klein, Matthias Seeger, and Cédric Archambeau, with whom I had previously collaborated during an earlier internship at AWS Berlin.

Doctoral Student Researcher

Secondmind

September 2021 – April 2022 Cambridge, United Kingdom

As a student researcher at Secondmind (formerly Prowler.io), a research-intensive AI startup renowned for its innovations in Bayesian optimization (BO) and Gaussian processes (GPs), I contributed impactful research and open-source code aligned with their focus on advancing probabilistic ML. Specifically, I developed open-source software to facilitate sampling efficiently from GPs, substantially improving their accessibility and functionality. Additionally, I led a research initiative to improve the integration of neural networks (NNs) with GP approximations, bridging a critical gap between probabilistic methods and deep learning. These efforts culminated in a research paper that was selected for an oral presentation at the International Conference on Machine Learning (ICML).

I had the privilege of collaborating closely with Vincent Dutordoir and Victor Picheny during this period.

Applied Scientist Intern

Amazon Web Services

June 2019 – December 2019 Berlin, Germany

As an applied scientist intern at Amazon Web Services (AWS), I contributed to the development of the Automatic Model Tuning functionality in AWS SageMaker. My primary focus was on advancing AutoML and hyperparameter optimization, particularly Bayesian optimization (BO) methods. I spearheaded a research project aimed at integrating multi-fidelity BO with asynchronous parallelism to significantly improve the efficiency and scalability of model tuning. This initiative led to the development of a research paper and the release of open-source code within the AutoGluon, subsequently forming the basis of the SyneTune library.

I had the privilege of working closely with Matthias Seeger, Cédric Archambeau, and Aaron Klein during this internship.

Software Engineer

CSIRO’s Data61

July 2016 – April 2019 Sydney, Australia

As a machine learning (ML) software engineer at CSIRO’s Data61, the AI research division of Australia’s national science agency, I was an integral part of the Inference Systems Engineering Team, specializing in probabilistic ML for diverse problem domains. Our focus encompassed areas such as spatial inference and Bayesian experimental design, with a primary emphasis on scalability. I led the development of new microservices and contributed to the development of open-source libraries for large-scale Bayesian deep learning. I also had a stint with the Graph Analytics Engineering Team, where my contributions to research on graph representation learning led to a research paper selected for a spotlight presentation at the Conference on Neural Information Processing Systems (NeurIPS).

Software Engineer

National ICT Australia (NICTA)

May 2015 – June 2016 Sydney, Australia

As a machine learning (ML) software engineer at NICTA, I was part of an interdisciplinary ML research team contributing to the Big Data Knowledge Discovery initiative, which engaged with leading scientists across various natural sciences domains to develop Bayesian ML software frameworks to support Australia’s evolving scientific research landscape. During this time, I led the development and release of numerous open-source libraries for applying Bayesian ML at scale.

Research Intern

Commonwealth Scientific and Industrial Research Organisation (CSIRO)

November 2013 – February 2014 Sydney, Australia

As a summer vacation scholar at CSIRO’s Language and Social Computing team, I applied cutting-edge machine learning (ML) and natural language processing (NLP) techniques to build a robust text classification system for automated sentiment analysis.

Select Publications

Louis Tiao

September, 2023

Probabilistic Machine Learning in the Age of Deep Learning: New Perspectives for Gaussian Processes, Bayesian Optimization and Beyond (PhD Thesis)

This thesis explores the intersection of deep learning and probabilistic machine learning to enhance the capabilities of artificial intelligence. It addresses the limitations of Gaussian processes (GPs) in practical applications, particularly in comparison to neural networks (NNs), and proposes advancements such as improved approximations and a novel formulation of Bayesian optimization (BO) that seamlessly integrates deep learning methods. The contributions aim to enrich the interplay between deep learning and probabilistic ML, advancing the foundations of AI and fostering the development of more capable and reliable automated decision-making systems.

Louis Tiao, Vincent Dutordoir, Victor Picheny

April, 2023 In ICML2023. Accepted as Oral Presentation

Spherical Inducing Features for Orthogonally-Decoupled Gaussian Processes

Despite their many desirable properties, Gaussian processes (GPs) are often compared unfavorably to deep neural networks (NNs) for lacking the ability to learn representations. Recent efforts to bridge the gap between GPs and deep NNs have yielded a new class of inter-domain variational GPs in which the inducing variables correspond to hidden units of a feedforward NN. In this work, we examine some practical issues associated with this approach and propose an extension that leverages the orthogonal decomposition of GPs to mitigate these limitations. In particular, we introduce spherical inter-domain features to construct more flexible data-dependent basis functions for both the principal and orthogonal components of the GP approximation and show that incorporating NN activation features under this framework not only alleviates these shortcomings but is more scalable than alternative strategies. Experiments on multiple benchmark datasets demonstrate the effectiveness of our approach.

Louis Tiao, Aaron Klein, Matthias Seeger, Edwin v. Bonilla, Cédric Archambeau, Fabio Ramos

May, 2021 In ICML2021. Accepted as Long Presentation (Awarded to Top 3% of Papers)

BORE: Bayesian Optimization by Density-Ratio Estimation

We reformulate the computation of the acquisition function in Bayesian optimization (BO) as a probabilistic classification problem, providing advantages in scalability, flexibility, and representational capacity, while casting aside the limitations of tractability constraints on the model.

Pantelis Elinas, Edwin v. Bonilla, Louis Tiao

June, 2020 In NeurIPS2020. Accepted as Spotlight Presentation (Awarded to Top 3% of Papers)

Variational Inference for Graph Convolutional Networks in the Absence of Graph Data and Adversarial Settings

Our proposed framework uses a joint probabilistic model and stochastic variational inference to improve the performance and robustness of graph convolutional networks (GCNs) in scenarios without input graph data, outperforming state-of-the-art algorithms on semi-supervised classification tasks.

Aaron Klein, Louis Tiao, Thibaut Lienart, Cédric Archambeau, Matthias Seeger

March, 2020

Model-based Asynchronous Hyperparameter and Neural Architecture Search

We introduce a model-based method for asynchronous multi-fidelity hyperparameter and neural architecture search that combines the strengths of asynchronous Hyperband and Gaussian process-based Bayesian optimization, achieving substantial speed-ups over current state-of-the-art methods on challenging benchmarks for tabular data, image classification, and language modeling.

Projects

Last updated on Dec 5, 2023

Fourier decomposition of Gaussian processes III

An anatomy of samples from a Gaussian process posterior

Recent & Upcoming Talks

Contributed Talk: Cycle-Consistent Adversarial Learning as Approximate Bayesian Inference

Jul 14, 2018 3:20 PM — 3:30 PM Stockholm, Sweden

Louis Tiao

Contributed Talk: Cycle-Consistent Adversarial Learning as Approximate Bayesian Inference

Teaching

Courses

COMP9418: Advanced Topics in Statistical Machine Learning (UNSW Sydney)

This course has a primary focus on probabilistic machine learning methods, covering the topics of exact and approximate inference in directed and undirected probabilistic graphical models – continuous latent variable models, structured prediction models, and non-parametric models based on Gaussian processes.

Lab exercise on Gaussian Process Regression, running in JupyterLab.

This course emphasized maintaining a good balance between theory and practice. As the teaching assistant (TA) for this course, my primary responsibility was to create lab exercises that aid students in gaining hands-on experience with these methods, specifically applying them to real-world data using the most current tools and libraries. The labs were Python-based, and relied heavily on the Python scientific computing and data analysis stack (NumPy, SciPy, Matplotlib, Seaborn, Pandas, IPython/Jupyter notebooks), and the popular machine learning libraries scikit-learn and TensorFlow.

Students were given the chance to experiment with a broad range of methods on various problems, such as Markov chain Monte Carlo (MCMC) for Bayesian logistic regression, probabilistic PCA (PPCA), factor analysis (FA) and independent component analysis (ICA) for dimensionality reduction, hidden Markov models (HMMs) for speech recognition, conditional random fields (CRFs) for named-entity recognition, and Gaussian processes (GPs) for regression and classification.

Publications

Quickly discover relevant content by filtering publications.

Louis Tiao (2023). Probabilistic Machine Learning in the Age of Deep Learning: New Perspectives for Gaussian Processes, Bayesian Optimization and Beyond (PhD Thesis).

PDF Preprint Full Acknowledgements

Louis Tiao, Vincent Dutordoir, Victor Picheny (2023). Spherical Inducing Features for Orthogonally-Decoupled Gaussian Processes. In ICML2023. Accepted as Oral Presentation.

PDF Cite Code Poster Slides Conference Proceeding

Rafael Oliveira, Louis Tiao, Fabio Ramos (2022). Batch Bayesian Optimisation via Density-ratio Estimation with Guarantees. In NeurIPS2022.

PDF Code

Louis Tiao, Aaron Klein, Matthias Seeger, Edwin v. Bonilla, Cédric Archambeau, Fabio Ramos (2021). BORE: Bayesian Optimization by Density-Ratio Estimation. In ICML2021. Accepted as Long Presentation (Awarded to Top 3% of Papers).

PDF Cite Code Poster Slides Video Conference Proceeding Supplementary material

Pantelis Elinas, Edwin v. Bonilla, Louis Tiao (2020). Variational Inference for Graph Convolutional Networks in the Absence of Graph Data and Adversarial Settings. In NeurIPS2020. Accepted as Spotlight Presentation (Awarded to Top 3% of Papers).

Louis Tiao loo-EE tee-OW

Research Scientist

Meta - Central Applied Science (CAS) Team, NYC

Bio

News

Employment

Select Publications

Recent Posts

Projects

Recent & Upcoming Talks

Teaching

COMP9418: Advanced Topics in Statistical Machine Learning (UNSW Sydney)

Publications

Contact