Preference Alignment with Flow Matching
Minu Kim, Yongsik Lee, Sehyeok Kang, Jihwan Oh, Song Chong, Se-Young Yun
A big trend at NeurIPS this year was generative modelling, in particular diffusion and flow matching methods.
This paper applies some of the advances in flow matching to reinforcement learning from human feedback, a form of preference alignment whose aim is to align the behaviour of a given model with human (or AI-proxy) preferences.
Whereas some of the previous techniques in that field either require access to the model weights (and possibly significant computing power) for fine-tuning, or learn a reward model that can be prone to overfitting, Preference Flow Matching (PFM) only requires black-box access to the inference model and a way of determining which of two model samples is preferred for a given conditioning input, without learning any reward model.
Given these and a distribution of inputs, one can define the distributions of less preferred data and of more preferred data in the sample space by comparing outputs pairwise. PFM then learns a time-dependent flow from the former to the latter. At inference time, given a sample from the base model, one can simply flow it towards the more preferred distribution to obtain a better sample.
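To make this concrete, here is a minimal sketch of the idea rather than the authors' implementation: it assumes a hypothetical velocity network (`VelocityField`), toy Gaussians standing in for the less- and more-preferred samples, and the standard conditional flow matching recipe (straight-line interpolation paths with a squared-error velocity target), with new samples improved at inference via simple Euler integration.

```python
import torch
import torch.nn as nn

# Hypothetical velocity network v_theta(x, t); architecture and sizes are
# illustrative, not the paper's.
class VelocityField(nn.Module):
    def __init__(self, dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, dim),
        )

    def forward(self, x: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        return self.net(torch.cat([x, t], dim=-1))

def cfm_loss(v_theta: nn.Module, x_neg: torch.Tensor, x_pos: torch.Tensor) -> torch.Tensor:
    """Conditional flow matching loss for a flow that carries a less-preferred
    sample x_neg (time 0) to a more-preferred sample x_pos (time 1)."""
    t = torch.rand(x_neg.size(0), 1)
    x_t = (1 - t) * x_neg + t * x_pos   # point on the straight-line path
    target = x_pos - x_neg              # constant velocity of that path
    return ((v_theta(x_t, t) - target) ** 2).mean()

@torch.no_grad()
def improve(v_theta: nn.Module, x: torch.Tensor, steps: int = 20) -> torch.Tensor:
    """Push a base-model sample towards the preferred distribution by
    integrating the learned flow with a simple Euler scheme."""
    dt = 1.0 / steps
    for i in range(steps):
        t = torch.full((x.size(0), 1), i * dt)
        x = x + dt * v_theta(x, t)
    return x

# Toy training loop: the shifted Gaussians stand in for less-/more-preferred
# samples, which in PFM would come from pairwise preference comparisons of the
# black-box model's outputs for the same conditioning input.
dim = 8
v_theta = VelocityField(dim)
opt = torch.optim.Adam(v_theta.parameters(), lr=1e-3)
for _ in range(1000):
    x_neg = torch.randn(64, dim)
    x_pos = torch.randn(64, dim) + 2.0
    loss = cfm_loss(v_theta, x_neg, x_pos)
    opt.zero_grad()
    loss.backward()
    opt.step()

better = improve(v_theta, torch.randn(64, dim))  # flow new base samples forward
```

Note that only the velocity network is trained; the base model is never touched, which is what makes the approach applicable to black-box models.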
The authors apply their technique to several datasets, notably MNIST and IMDB, where the preference is given by the logits of a CNN or a sentiment classifier respectively, as well as various offline reinforcement learning tasks from D4RL, demonstrating that the preference objective is attained.
They also include several theoretical results, showing that PFM indeed “narrows” the base model distribution towards the points where the preference is increasing. Finally, they note that an iterative application of PFM is possible and can be beneficial.
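The iterative variant could look something like the following sketch, which reuses the `VelocityField`, `cfm_loss`, and `improve` pieces above and assumes two hypothetical callables: `base_samples(n)` for drawing base-model outputs and `prefers(a, b)` returning a boolean mask from the preference oracle. This is an illustration of the loop, not the authors' code.

```python
def iterate_pfm(base_samples, prefers, dim: int, rounds: int = 3, iters: int = 1000):
    """Hypothetical sketch of iterative PFM: each round draws samples from the
    current (already flowed) distribution, compares them pairwise with the
    preference oracle, and fits a new flow on top of the previous ones."""
    flows = []

    def current(n: int) -> torch.Tensor:
        x = base_samples(n)
        for v in flows:              # apply earlier flows in training order
            x = improve(v, x)
        return x

    for _ in range(rounds):
        v_theta = VelocityField(dim)
        opt = torch.optim.Adam(v_theta.parameters(), lr=1e-3)
        for _ in range(iters):
            a, b = current(64), current(64)
            a_wins = prefers(a, b)   # boolean mask: is row of `a` preferred over `b`?
            x_pos = torch.where(a_wins.unsqueeze(-1), a, b)
            x_neg = torch.where(a_wins.unsqueeze(-1), b, a)
            loss = cfm_loss(v_theta, x_neg, x_pos)
            opt.zero_grad()
            loss.backward()
            opt.step()
        flows.append(v_theta)
    return flows
```

Each round trains its flow on samples that have already been pushed by the earlier flows, which is why repeated application can move the distribution further towards the preferred region.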