We propose a differentiable vertex fitting algorithm that can be used for secondary vertex fitting, and that can be seamlessly integrated into neural networks for jet flavour tagging. Vertex fitting is formulated as an optimization problem where gradients of the optimized solution vertex are defined through implicit differentiation and can be passed to upstream or downstream neural network...
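As a rough illustration of the implicit-differentiation idea sketched in this abstract (not the authors' implementation), the following PyTorch snippet fits a toy 2D vertex to straight-line tracks by least squares and then obtains the gradient of the fitted vertex with respect to the track parameters via the implicit function theorem; the track model, the objective, and all names are assumptions made for this example.

    # Hypothetical sketch: differentiable least-squares vertex fit (2D, straight-line tracks).
    import torch

    def vertex_loss(v, points, dirs):
        # Sum of squared perpendicular distances from vertex v to each track line.
        diff = v - points                                   # (n_tracks, 2)
        along = (diff * dirs).sum(-1, keepdim=True) * dirs  # component along the track
        return ((diff - along) ** 2).sum()

    def fit_vertex(points, dirs, n_steps=300, lr=0.1):
        v = torch.zeros(2, requires_grad=True)
        opt = torch.optim.Adam([v], lr=lr)
        for _ in range(n_steps):
            opt.zero_grad()
            vertex_loss(v, points, dirs).backward()
            opt.step()
        return v.detach()

    def implicit_vertex_grad(v_star, points, dirs):
        # Implicit function theorem at the optimum (dL/dv = 0):
        #   dv*/dtheta = -(d2L/dv2)^(-1) d2L/(dv dtheta), with theta = track points here.
        points = points.clone().requires_grad_(True)
        v = v_star.clone().requires_grad_(True)
        g = torch.autograd.grad(vertex_loss(v, points, dirs), v, create_graph=True)[0]
        H = torch.stack([torch.autograd.grad(g[i], v, retain_graph=True)[0] for i in range(2)])
        J = torch.stack([torch.autograd.grad(g[i], points, retain_graph=True)[0].flatten()
                         for i in range(2)])
        return -torch.linalg.solve(H, J)                    # (2, n_track_parameters)

    points = torch.tensor([[1.0, 0.2], [0.8, -0.1], [1.2, 0.0]])
    dirs = torch.nn.functional.normalize(torch.tensor([[1.0, 0.3], [1.0, -0.2], [1.0, 0.1]]), dim=-1)
    v_star = fit_vertex(points, dirs)
    print(v_star, implicit_vertex_grad(v_star, points, dirs).shape)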
We investigate the possibility of applying quantum machine learning techniques for data analysis, with particular regard to an interesting use-case in high-energy physics. We propose an anomaly detection algorithm based on a parametrized quantum circuit. This algorithm was trained on a classical computer and tested with simulations as well as on real quantum hardware. Tests on NISQ devices were...
Markov chain Monte Carlo (MCMC) simulation is a very powerful approach to tackling a large variety of problems across computational science. The recent advances in machine learning techniques have provided new ideas in the domain of Monte Carlo simulations. The ability of artificial neural networks to model a very wide class of probability distributions through the Variational Autoregressive...
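As a minimal, hypothetical sketch of the variational autoregressive idea mentioned above (assuming a 1D chain of Ising-like spins and simple masked-linear conditionals, not the architecture of the talk), the model below samples spin configurations and evaluates their exact log-probability, which is what allows variational estimates of quantities such as the free energy.

    # Hypothetical sketch: tiny autoregressive model over spin configurations.
    import torch

    class AutoregressiveSpins(torch.nn.Module):
        def __init__(self, n_spins):
            super().__init__()
            self.n = n_spins
            # Conditional p(s_i = +1 | s_<i) parameterized by a masked linear map.
            self.w = torch.nn.Parameter(torch.zeros(n_spins, n_spins))
            self.b = torch.nn.Parameter(torch.zeros(n_spins))
            self.register_buffer("mask", torch.tril(torch.ones(n_spins, n_spins), -1))

        def cond_prob(self, s):
            # Probability that each spin is +1, given only the preceding spins.
            return torch.sigmoid(s @ (self.w * self.mask).T + self.b)

        @torch.no_grad()
        def sample(self, batch):
            s = torch.zeros(batch, self.n)
            for i in range(self.n):
                p = self.cond_prob(s)[:, i]
                s[:, i] = 2 * torch.bernoulli(p) - 1      # spins in {-1, +1}
            return s

        def log_prob(self, s):
            p = self.cond_prob(s)
            up = (s + 1) / 2
            return (up * torch.log(p + 1e-9) + (1 - up) * torch.log(1 - p + 1e-9)).sum(-1)

    model = AutoregressiveSpins(8)
    samples = model.sample(4)
    print(samples, model.log_prob(samples))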
The lack of new physics discoveries at the LHC calls for an effort to go beyond model-driven analyses. In this talk I will present the New Physics Learning Machine, a methodology powered by machine learning to perform a signal-agnostic and multivariate likelihood ratio test (arXiv:2305.14137). I will focus on an implementation based on kernel methods, which is efficient and scalable while...
Estimating unknown parameters of open quantum systems is an important task that is common to many branches of quantum technologies, from metrology to computing. When open quantum systems are monitored and a signal is continuously acquired, this signal can be used to efficiently extract information about the interactions in the system. Previous works have demonstrated a Bayesian framework for...
Detected Gravitational Waves are goldmines of information on the compact binary systems that emit them. MCMC techniques usually infer parameter values in a 15-dimensional parameter space accurately, but they are very time-consuming. On the other hand, Physics-Informed Neural Networks (PINNs) are a rapidly emerging branch of Supervised Machine Learning, devoted precisely to solving physical...
Generative models, particularly normalizing flows, have recently been proposed to speed up lattice field theory sample generation. We have explored the role that symmetry considerations and ML concepts like transfer learning may play, by applying novel continuous normalizing flows to a scalar field theory. Beyond that, interesting connections exist between renormalization group theory and...
Particle physics detectors introduce distortions in the observed data due to their finite resolution and other experimental factors; the task of correcting for these effects is known as unfolding. While traditional unfolding methods are restricted to binned distributions of a single observable, recently proposed ML-based methods enable unbinned, high-dimensional unfolding over the entire phase...
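For orientation only, here is a toy sketch of one classifier-based reweighting step in the spirit of ML-based unbinned unfolding (the likelihood-ratio trick); the 1D toy data, the choice of GradientBoostingClassifier, and all variable names are assumptions for illustration, not the method of this abstract.

    # Hypothetical sketch: classifier-based reweighting of simulation towards data.
    import numpy as np
    from sklearn.ensemble import GradientBoostingClassifier

    rng = np.random.default_rng(0)
    sim  = rng.normal(0.0, 1.0, size=(20000, 1))   # detector-level simulation
    data = rng.normal(0.3, 1.1, size=(20000, 1))   # detector-level "data"

    X = np.vstack([sim, data])
    y = np.concatenate([np.zeros(len(sim)), np.ones(len(data))])

    clf = GradientBoostingClassifier().fit(X, y)
    p = clf.predict_proba(sim)[:, 1]
    weights = p / (1.0 - p)                         # w(x) ~ p_data(x) / p_sim(x)

    print("reweighted sim mean:", np.average(sim[:, 0], weights=weights),
          " data mean:", data.mean())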
We currently find ourselves in the era of noisy intermediate-scale quantum (NISQ) computing, where quantum computing applications are limited yet promising. In this work I will overview two algorithms for computing the ground state and dynamics of the transverse field Ising model as a testbed for more complex models. The Variational Quantum Eigensolver (VQE) algorithm leverages quantum...
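As a hedged illustration of the VQE idea for the transverse-field Ising model, the following sketch simulates a two-qubit state vector with NumPy, uses a simple RY-CNOT-RY ansatz, and minimizes the energy classically with SciPy; the ansatz, couplings, and system size are illustrative assumptions, not those of the talk.

    # Hypothetical sketch: classical state-vector VQE for a 2-qubit transverse-field Ising model.
    import numpy as np
    from scipy.optimize import minimize

    I2 = np.eye(2)
    X = np.array([[0, 1], [1, 0]])
    Z = np.diag([1, -1])

    def kron(*ops):
        out = np.array([[1.0]])
        for op in ops:
            out = np.kron(out, op)
        return out

    J, h = 1.0, 0.5
    H = -J * kron(Z, Z) - h * (kron(X, I2) + kron(I2, X))

    CNOT = np.array([[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]], dtype=complex)
    def ry(t):
        return np.array([[np.cos(t / 2), -np.sin(t / 2)], [np.sin(t / 2), np.cos(t / 2)]])

    def ansatz(theta):
        psi = np.zeros(4, dtype=complex)
        psi[0] = 1.0                                  # start from |00>
        psi = kron(ry(theta[0]), ry(theta[1])) @ psi  # single-qubit rotations
        psi = CNOT @ psi                              # entangling gate
        psi = kron(ry(theta[2]), ry(theta[3])) @ psi
        return psi

    def energy(theta):
        psi = ansatz(theta)
        return np.real(psi.conj() @ H @ psi)

    res = minimize(energy, x0=np.array([0.1, 0.2, 0.3, 0.4]), method="COBYLA")
    print("VQE energy:", res.fun, " exact ground state:", np.linalg.eigvalsh(H)[0])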
The early inspiral from stellar-mass black hole binaries can emit milli-Hertz gravitational wave signals, making them detectable sources for space-borne gravitational wave missions like TianQin. However, the traditional matched filtering technique poses a significant challenge for analyzing these kinds of signals, as it requires an impractically high number of templates ranging from 10^31 to...
Statistical anomaly detection empowered by AI is a subject of growing interest in high-energy physics and astrophysics. AI provides a multidimensional and highly automated solution to enable signal-agnostic data validation and new physics searches.
The unsupervised nature of the anomaly detection task combined with the highly complex nature of the LHC and astrophysical data gives rise to a...
Tracking charged particles in high-energy physics experiments is one of the most computationally demanding steps in the data analysis pipeline.
As we approach the High Luminosity LHC era, with an estimated increase in the number of proton-proton interactions per beam collision by a factor of 3-5 (from 50 to 140-200 primary interactions per collision on average), particle tracking will become even...
Nested sampling has become an important tool for inference in astronomical data analysis. However, it is often computationally expensive to run. This poses a challenge for certain applications, such as gravitational-wave inference. To address this, we previously introduced nessai, a nested sampling algorithm that incorporates normalizing flows to accelerate gravitational-wave inference by up...
Due to poor observational constraints on the low-mass end of the subhalo mass function, the detection of dark matter (DM) subhalos lacking a visible counterpart on sub-galactic scales would provide valuable information about the nature of DM. Novel indirect probes for DM substructure within the Milky Way (MW) are stellar wakes, which are perturbations of the stellar medium induced by DM...
As a new era of gravitational wave detections rapidly unfolds, having accurate models for their signals becomes increasingly important.
The best models for gravitational waves are the fully-fledged simulations of General Relativity, although their daunting cost makes them prohibitive for data analysis. To alleviate this, the community has developed a variety of...
This presentation will highlight the impactful role of machine learning (ML) in high energy nuclear physics, particularly in studying QCD matter under extreme conditions. The presentation will focus on three key applications: analyzing heavy ion collisions, reconstructing the neutron star Equation of State (EoS), and advancing lattice field theory studies.
In heavy ion collisions, ML techniques...
Tensor Networks (TNs) are a computational paradigm used for representing quantum many-body systems. Recent works show how TNs can be applied to perform Machine Learning (ML) tasks, yielding comparable results to standard supervised learning techniques. In particular, [1] leveraged Tree Tensor Networks (TTNs) to achieve the classification of particle flavor state in the context of High Energy...
Projects such as the imminent Vera C. Rubin Observatory are critical tools for understanding cosmological questions like the nature of dark energy. By observing huge numbers of galaxies, they enable us to map the large scale structure of the Universe. To do this, however, we need reliable ways of estimating galaxy redshifts from only photometry. I will present an overview of our pop-cosmos...
The formation mechanism of supermassive black holes is yet unknown, despite their presence in nearly every galaxy, including the Milky Way. As stellar evolution predicts that stars cannot collapse to black holes $\gtrsim 50 - 130\, \text{M}_{\odot}$ due to pair-instability, plausible formation mechanisms include the hierarchical mergers of intermediate-mass black holes (IMBHs). The direct...
Quantifying tension between different experimental efforts aiming to constrain the same physical models is essential for validating our understanding of the Universe. A commonly used metric of tension is the ratio, R, of the joint Bayesian evidence to the product of the individual evidences for two experimental datasets under some common model. R can be interpreted as a measure of our relative...
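For concreteness, the R statistic described above reduces to a simple combination of log-evidences; the sketch below uses placeholder numbers (in practice the log-evidences would come from, e.g., nested sampling runs), with R > 1 commonly read as concordance and R < 1 as tension.

    # Illustrative computation of the tension statistic R from log-evidences (placeholder values).
    import numpy as np

    logZ_A  = -104.2   # evidence for dataset A alone
    logZ_B  = -230.7   # evidence for dataset B alone
    logZ_AB = -335.4   # joint evidence for A and B under the common model

    log_R = logZ_AB - (logZ_A + logZ_B)
    print("log R =", log_R, " R =", np.exp(log_R))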
The Large-Sized Telescope (LST) is one of three telescope types being built as part of the Cherenkov Telescope Array Observatory (CTAO) to cover the lower energy range between 20 GeV and 200 GeV. The Large-Sized Telescope prototype (LST-1), installed at the Roque de los Muchachos Observatory on La Palma, is currently being commissioned and has successfully taken data since November 2019. The...
Recently, machine learning has become a popular tool in lattice field theory. Here I will report on some applications of (lattice) field theory methods to further understand ML, illustrated using the Restricted Boltzmann Machine and stochastic quantisation as simple examples.
We introduce an innovative approach to combinatorial optimization problems through Physics-Informed Graph Neural Networks (GNNs). We combine the structural advantages of GNNs with physics-based algorithms, enhancing solution accuracy and computational efficiency. Compared to the available literature, we were able to design and train a deep graph neural network model able to solve the graph...
In this talk we propose a physics-based AI framework for precise radiometer calibration in global 21cm cosmology. These experiments aim to study the formation of the first stars and galaxies by detecting the faint 21-cm radio emission from neutral hydrogen. The global or sky-averaged signal is predicted to be five orders of magnitude dimmer than the foregrounds. Therefore, detection of the signal...
Resource utilization plays a crucial role in the successful implementation of fast real-time inference for deep neural networks on the latest generation of hardware accelerators (FPGAs, SoCs, ACAPs, GPUs). To fulfil the needs of the triggers in development for the upgraded LHC detectors, we have developed a multi-stage compression approach based on conventional compression strategies...
We propose a quantum version of a generative diffusion model. In this algorithm, artificial neural networks are replaced with parameterized quantum circuits, in order to directly generate quantum states. We present both a full quantum and a latent quantum version of the algorithm; we also present a conditioned version of these models. The models' performances have been evaluated using...
The ongoing search for physics beyond the Standard Model imposes a growing demand for highly sensitive anomaly detection methods. Various approaches to anomaly detection exist, and prominent techniques include semi-supervised and unsupervised training of neural networks. While semi-supervised approaches often require sophisticated methods for precise background estimation, unsupervised methods...
Traditionally, machine-learning methods have mostly focused on making predictions without providing explicit probability distributions. However, predicting probability distributions is important because it conveys the model’s level of confidence and the range of potential outcomes. Unlike point estimates, which offer a single value, probability distributions offer a range of...
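A minimal sketch of the distinction drawn here, assuming a simple heteroscedastic regression task: the network below predicts a mean and a variance and is trained with a Gaussian negative log-likelihood, so each prediction carries its own uncertainty; the data and architecture are illustrative only.

    # Hypothetical sketch: predicting a distribution (mean + variance) instead of a point estimate.
    import torch

    torch.manual_seed(0)
    x = torch.linspace(-2, 2, 512).unsqueeze(1)
    y = torch.sin(2 * x) + torch.randn_like(x) * (0.1 + 0.2 * x.abs())   # noise grows with |x|

    net = torch.nn.Sequential(
        torch.nn.Linear(1, 64), torch.nn.Tanh(),
        torch.nn.Linear(64, 2),                 # outputs: mean and log-variance
    )
    opt = torch.optim.Adam(net.parameters(), lr=1e-2)

    for step in range(2000):
        mu, log_var = net(x).chunk(2, dim=1)
        nll = 0.5 * (log_var + (y - mu) ** 2 / log_var.exp()).mean()     # Gaussian NLL
        opt.zero_grad()
        nll.backward()
        opt.step()

    with torch.no_grad():
        mu, log_var = net(torch.tensor([[1.5]])).chunk(2, dim=1)
        print("predicted mean:", mu.item(), " predicted std:", log_var.exp().sqrt().item())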
In the realm of high-energy physics, the advent of machine learning has revolutionized data analysis, especially in managing the vast volumes of data produced by particle detectors.
In the face of the challenge of analyzing unlabelled, high-volume detector data, advanced machine learning solutions become indispensable.
Our research introduces a machine learning approach that effectively bridges the...
The High Luminosity upgrade for the Large Hadron Collider (HL-LHC) is due to come online in 2029. This will result in an unprecedented throughput of collision event data. Identifying and analysing meaningful signals within this information poses a formidable challenge in the search for new physics. The demand for automatic tools capable of physically-aware and data-driven inference, which can...
Off-shell effects in large LHC backgrounds are crucial for precision predictions and, at the same time, challenging to simulate. We show how a generative diffusion network learns off-shell kinematics given the much simpler on-shell process. It generates off-shell configurations fast and precisely, while reproducing even challenging on-shell features.
PolySwyft is an implementation of a sequential simulation-based nested sampler by merging two algorithms that are commonly used for Bayesian inference: PolyChord and swyft. PolySwyft uses the NRE functionality of swyft and generates a new joint training dataset with PolyChord to iteratively estimate more accurate posterior distributions. PolySwyft can be terminated using pre-defined rounds...
The Data-Directed paradigm (DDP) represents an innovative approach to efficiently investigate new physics across diverse spectra with smoothly falling Standard Model (SM) backgrounds. Diverging from the conventional analysis employed in collider particle physics, DDP eliminates the necessity for a simulated or functionally derived background estimate. Instead, it...
New radio telescopes, such as the SKA, will revolutionise our understanding of the Universe. They can detect the faintest distant galaxies and provide high-resolution observations of nearby galaxies. This allows for detailed statistical studies and insights into the formation and evolution of galaxies across cosmic time. These telescopes also play a crucial role in unravelling the physical...
The Galactic centre serves as a laboratory for fundamental physics, particularly in the context of indirect dark matter searches. This study explores the potential of the James Webb Space Telescope to shed light on self-annihilating, sub-GeV dark matter candidates by examining their influence on exoplanet overheating and providing sensitivity estimates via probabilistic programming languages.
Large-scale physics experiments generating high data rates impose significant demands on the data acquisition system (DAQ). The Deep Underground Neutrino Experiment (DUNE) is a next-generation experiment for neutrino science at the Fermi National Accelerator Laboratory in Batavia, Illinois. It will consist of a massive detector operating continually for over a decade, resulting in...
The GeV gamma-ray sky, as observed by the Fermi Large Area Telescope (Fermi LAT), harbours a plethora of localised point-like sources. At high latitudes ($|b| >30^{\circ}$), most of these sources are of extragalactic origin. The source-count distribution as a function of their flux, $\mathrm{d}N/\mathrm{d}S$, is a well-established quantity to summarise this population. We employ sequential...
The Pierre Auger Observatory, located in the Argentinian Pampa, is the world's largest cosmic-ray experiment. It offers the most precise measurements of cosmic particles at ultra-high energies by measuring their induced air showers. The centerpiece of the Observatory is the surface detector (SD) consisting of over 1,660 water-Cherenkov detectors that cover an area of 3,000 km$^2$ and measure...
We have been studying the use of deep neural networks (DNNs) to identify and locate primary vertices (PVs) in proton-proton collisions at the LHC. Previously reported results demonstrate that a hybrid architecture, using a fully connected network (FCN) as the first stage and a convolutional neural network (CNN) as the second stage, provides better efficiency than the default heuristic...
Mutual information is one of the basic information-theoretic measures of correlations between different subsystems. It may carry interesting physical information about the phase of the system. It is notoriously difficult to estimate as it involves sums over all possible system and subsystem states. In this talk, I describe a direct approach to estimate the bipartite mutual information using...
Type Ia supernovae (SNae Ia) are instrumental in constraining cosmological parameters, particularly dark energy. State-of-the-art likelihood-based analyses scale poorly to future large datasets, are limited to simplified probabilistic descriptions of e.g. peculiar velocities, photometric redshift uncertainties, instrumental noise, and selection effects, and must explicitly sample a...
The efficient simulation of particle propagation and interaction within the detectors of the Large Hadron Collider (LHC) is of primary importance for precision measurements and new physics searches. The most computationally expensive step of the simulation pipeline is the generation of calorimeter showers, and will become ever more costly and high-dimensional as the LHC moves into its high...
Continuing from our prior work \citep{10.1093/mnras/stac3797}, where single-detector data from the Einstein Telescope (ET) were evaluated for the detection of binary black holes (BBHs) using deep learning (DL), in this work we explored the detection efficiency of BBHs using data combined from all three proposed detectors of ET, with five different lower frequency cutoffs ($F_{low}$): 5 Hz, 10...
Successfully and accurately inferring the properties of compact binary mergers observed by facilities including Virgo and LIGO requires accurate and fast waveform models. Direct calculation from general relativity is not currently feasible, and approximations that are used to produce tractable models necessarily induce errors.
Using Gaussian process regression (GPR), we have developed a...
Simulation is the crucial connection between particle physics theory and experiment. Our ability to simulate particle collisions from first principles allows us to analyze and understand the vast amount of data of the Large Hadron Collider (LHC) experiments. This, however, comes at a cost: a lot of computational resources are needed to simulate all necessary interactions to the required...
The next-generation ground-based gamma-ray observatory, the Cherenkov Telescope Array Observatory (CTAO), will consist of two arrays of tens of imaging atmospheric Cherenkov telescopes (IACTs) to be built in the Northern and Southern Hemispheres, aiming to improve the sensitivity of current-generation instruments by a factor of five to ten. Three different sizes of IACTs are proposed to cover...
Machine learning (ML) plays a significant role in data mining at High Energy Physics experiments. An overview of ML applications at the ATLAS experiment will be shown, with highlights in searches for physics Beyond the Standard Model using anomaly detection and active learning. Additionally, advances in object reconstruction and improvements in simulation using ML will be shown.
In some sense, the detection of a stochastic gravitational wave background (SGWB) is one of the most subtle GW analysis challenges facing the community in the next-generation detector era. For example, at an experiment such as LISA, to extract the SGWB contributions, we must simultaneously: detect and analyse thousands of highly overlapping sources including massive binary black hole mergers...
COSMOPOWER is a state-of-the-art Machine Learning framework adopted by all major Large-Scale Structure (LSS) and Cosmic Microwave Background (CMB) international collaborations for acceleration of their cosmological inference pipelines. It achieves orders-of-magnitude acceleration by replacing the expensive computation of cosmological power spectra, traditionally performed with a Boltzmann...
In particle collider experiments, such as the ATLAS and CMS experiments at CERN, high-energy particles collide and shatter into a plethora of charged particles traversing a silicon detector and leaving energy deposits, or hits, on the detector modules. The reconstruction of charged-particle trajectories (tracks) from these hits, an integral part in any physics program at the Large Hadron...
Theory predictions for the LHC require precise numerical phase-space integration and generation of unweighted events. We combine machine-learned multi-channel weights with a normalizing flow for importance sampling to improve classical methods for numerical integration. By integrating buffered training for potentially expensive integrands, VEGAS initialization, symmetry-aware channels, and...
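As a stripped-down illustration of importance sampling and unweighting (with a fixed analytic proposal standing in for the learned multi-channel weights and normalizing flow), consider the toy 1D integral below; the integrand, proposal, and sample sizes are assumptions for the example.

    # Hypothetical sketch: importance sampling of a toy integrand plus simple unweighting.
    import numpy as np

    rng = np.random.default_rng(0)

    def integrand(x):                      # toy "matrix element" on [0, inf); exact integral is 2
        return x**2 * np.exp(-x)

    # Proposal q(x): exponential with rate 0.5, standing in for a trained flow.
    lam = 0.5
    x = rng.exponential(1.0 / lam, size=100_000)
    q = lam * np.exp(-lam * x)

    w = integrand(x) / q                   # importance weights
    print("integral estimate:", w.mean(), "+/-", w.std() / np.sqrt(len(w)))

    # Unweighting: accept events with probability w / w_max to obtain unit-weight events.
    keep = rng.uniform(size=len(w)) < w / w.max()
    print("unweighting efficiency:", keep.mean())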
In-beam gamma-ray spectroscopy, particularly with high-velocity recoil nuclei, necessitates precise Doppler correction. The Advanced GAmma Tracking Array (AGATA) represents a groundbreaking development in gamma-ray spectrometers, boasting the ability to track gamma-rays within the detector. This capability leads to exceptional position resolution which ensures optimal Doppler...
Research on Universe and Matter (ErUM) at major infrastructures such as CERN or large observatories, jointly conducted with university groups, is an important driver for the digital transformation. In Germany, about 20,000 scientists are working on ErUM-related sciences and can benefit from modern methods of artificial intelligence. The central networking and transfer office ErUM-Data-Hub...
Traditional physics simulations are fundamental in the field of particle physics. Common simulation tools, like Geant4, are very precise but comparatively slow. Generative machine learning can be used to speed up such simulations.
Calorimeter data can be represented either as images or as point clouds, i.e. permutation-invariant lists of measurements.
We advance the generative models for...
This contribution presents the ERC-funded project NuRadioOpt, which aims to substantially increase the detection rate of ultra-high-energy (UHE) cosmic neutrinos for large in-ice radio arrays such as the Radio Neutrino Observatory Greenland (RNO-G, under construction) and the envisioned IceCube-Gen2 project. These detectors consist of autonomous compact detector stations with very limited...
With new astronomical surveys, we are entering a data-driven era in cosmology. Modern machine learning methods are up to the task of optimally learning about the Universe from low to high redshift. In 3D, tomography of the large-scale structure (LSS) via the 21cm line of hydrogen targeted by the SKA (Square Kilometre Array) can both teach us about the properties of sources and the gaseous media between them, while...
The simulation of calorimeter showers is computationally intensive, leading to the development of generative models as substitutes. We propose a framework for designing generative models for calorimeter showers that combines the strengths of voxel and point cloud approaches to improve both accuracy and computational efficiency. Our approach employs a pyramid-shaped design, where the base of...
The diffusion model has demonstrated promising results in image generation, recently becoming mainstream and representing a notable advancement for many generative modeling tasks. Prior applications of the diffusion model for both fast event and detector simulation in high energy physics have shown exceptional performance, providing a viable solution to generate sufficient statistics within a...
PolyChord was originally advertised encouraging users to experiment with their own clustering algorithms. Identifying clusters of nested sampling live points is critical for PolyChord to perform nested sampling correctly. We have updated the Python interface of PolyChordLite to allow straightforward substitution of different clustering methods. Recent reconstructions of the...
"Data deluge" refers to the situation where the sheer volume of new data generated overwhelms the capacity of institutions to manage it and researchers to use it[1]. Data Deluge is becoming a common problem in industry and big science facilities like the synchrotron laboratory MAX IV and the Large Hadron Collider at CERN[2].
As a novel solution to this problem, a small cross-disciplinary...
Nested sampling is a tool for posterior estimation and model comparison across a wide variety of cross-disciplinary fields, and is used in Simulation Based Inference and AI emulation. This talk explores the performance and accuracy gains to be made in high dimensional nested sampling by rescuing the discarded likelihood evaluations available in present nested sampling runs, and is thus useful...
Sampling techniques are a stalwart of reliable inference in the physical sciences, with the nested sampling paradigm emerging in the last decade as a ubiquitous tool for model fitting and comparison. Parallel developments in the field of generative machine learning have enabled advances in many applications of sampling methods in scientific inference pipelines.
This work explores the...
This study explores the inference of BSM models and their parameters from kinematic distributions of collider signals through an n-channel 1D-Convolutional Neural Network (n1D-CNN). Our approach enables simultaneous inference from distributions of any fixed number of observables. As our training data are computationally expensive simulations, we also introduce a novel data augmentation...
Quantum entanglement, a fundamental concept for understanding physics at atomic and subatomic scales, is explored in this presentation. We introduce a novel technique for computing quantum entanglement (Rényi) entropy, grounded on the replica trick and leveraging the abilities of generative neural networks for accurate partition function calculations. The approach is demonstrated on the...
Phenomenological analyses in beyond the Standard Model (BSM) theories assess the viability of BSM models by testing them against current experimental data, aiming to explain new physics signals. However, these analyses face significant challenges. The parameter spaces of BSM models are commonly large and high-dimensional. The regions capable of accommodating a combination of experimental...
Sensitivity forecasts inform the design of experiments and the direction of theoretical efforts. To arrive at representative results, Bayesian forecasts should marginalize their conclusions over uncertain parameters and noise realizations rather than picking fiducial values. However, this is typically computationally infeasible with current methods for forecasts of an experiment’s ability to...
Analyses in HEP experiments often rely on large MC simulated datasets. These datasets are usually produced with full-simulation approaches based on Geant4, or with parametric “fast” simulations that introduce approximations and reduce the computational cost.
In the present work, we discuss a prototype of a fast simulation framework that we call “FlashSim” targeting analysis level data...
The matrix element method is the LHC inference method of choice for limited statistics. We present a dedicated machine learning framework based on efficient phase-space integration and a learned acceptance and transfer function. It uses a choice of INN and diffusion networks, and a transformer to solve the jet combinatorics. We showcase this setup for the CP-phase of the top Yukawa coupling...
The Dark Matter Particle Explorer (DAMPE) is the largest calorimeter-based space-borne experiment. Since its launch in December 2015, DAMPE has been detecting electrons, positrons and gamma rays from a few GeV to 10 TeV, as well as protons and heavier nuclei from 10 GeV to 100 TeV. The study of galactic and extragalactic gamma-ray sources and diffuse emissions as well as the search for dark-matter...
Presented is a novel method for analyzing particle identification (PID) by incorporating machine learning techniques, applied to a physics case within the fixed-target program at the LHCb experiment at CERN. Typically, a PID classifier is constructed by integrating responses from specialized subdetectors, utilizing diverse techniques to ensure redundancy and broad kinematic coverage. The...
Uncertainty quantification (UQ) is crucial for reliable predictions in inverse problems, where the model parameters are inferred from limited and noisy data. Monte Carlo methods offer a powerful approach to quantifying uncertainty in inverse problems, but their effectiveness hinges on the accuracy of the input data. This talk explores the robustness of an inverse problem methodology that...
The Cherenkov Telescope Array (CTA) is entering its production phase and the upcoming data will drastically improve the point source sensitivity compared to previous imaging atmospheric Cherenkov telescopes. The Galactic Plane Survey (GPS), proposed as one of the Key Science Projects for CTA observation, will focus on the observation of the inner galactic region ($|b| < 6^{\circ}$).
Here we discuss...
The problem of comparing two high-dimensional samples to test the null hypothesis that they are drawn from the same distribution is a fundamental question in statistical hypothesis testing. This study presents a comprehensive comparison of various non-parametric two-sample tests, specifically focusing on their statistical power in high-dimensional settings. The tests are built from univariate...
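One simple member of this family of tests, shown here purely as an illustration and not necessarily among those studied, combines a random 1D projection, the Kolmogorov-Smirnov statistic, and a permutation estimate of the p-value.

    # Illustrative two-sample permutation test on a 1D random projection (hypothetical construction).
    import numpy as np
    from scipy.stats import ks_2samp

    rng = np.random.default_rng(1)

    def projected_ks(x, y, direction):
        # Univariate KS statistic after projecting both samples onto one direction.
        return ks_2samp(x @ direction, y @ direction).statistic

    def permutation_test(x, y, n_perm=500):
        direction = rng.normal(size=x.shape[1])
        direction /= np.linalg.norm(direction)
        observed = projected_ks(x, y, direction)
        pooled = np.vstack([x, y])
        n = len(x)
        null = []
        for _ in range(n_perm):
            idx = rng.permutation(len(pooled))
            null.append(projected_ks(pooled[idx[:n]], pooled[idx[n:]], direction))
        return observed, np.mean(np.array(null) >= observed)   # statistic, p-value

    x = rng.normal(0.0, 1.0, size=(300, 5))
    y = rng.normal(0.2, 1.0, size=(300, 5))     # slightly shifted sample
    print(permutation_test(x, y))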
Generative networks are promising tools for fast event generation for the LHC, yet struggle to meet the required precision when scaling up to particles in the final state. We employ the flexibility of autoregressive transformers to tackle this challenge, focusing on Z and top quark pair production with additional jets. We demonstrate the use of classifiers in combination with the...
Within the Compact Muon Solenoid (CMS) Collaboration, various Deep Neural Network (DNN) and Machine Learning (ML) approaches have been employed to investigate the production of a new massive particle that decays into Higgs boson pairs (HH), which further decay into a pair of b-quarks and a pair of tau leptons, and to discriminate the HH signal from the backgrounds.
However, these...
Particle physics experiments entail the collection of large data samples of complex information. In order to produce and detect low-probability processes of interest (signal), a huge number of particle collisions must be carried out. This type of experiment produces huge sets of observations, most of which are of no interest (background). For this reason, a mechanism able to differentiate...
Particle track reconstruction is a fundamental aspect of experimental analysis in high-energy particle physics. Conventional methodologies for track reconstruction are suboptimal in terms of efficiency in anticipation of the High Luminosity phase of the Large Hadron Collider. This has motivated researchers to explore the latest developments in deep learning for their scalability and potential...
The intracluster medium (ICM) holds signatures of the dynamical history of the galaxy cluster, including the dark matter density profile, mergers with other clusters, and energetic activity (from supernovae and supermassive black holes) in its member galaxies. For all but the most relaxed galaxy clusters observed at high spatial resolution by instruments such as the Chandra and...
Supervised learning has been used successfully for jet classification and to predict a range of jet properties, such as mass and energy. Each model learns to encode jet features, resulting in a representation that is tailored to its specific task. But could the common elements underlying such tasks be combined in a single foundation model to extract features generically? To address this...
Blazars are among the most powerful extragalactic sources, emitting across the entire electromagnetic spectrum, from radio to very high energy gamma-ray bands. As powerful sources of non-thermal radiation, blazars are frequently monitored using various telescopes, leading to the accumulation of substantial multi-wavelength data over different time periods. Also, over the years, the complexity...
Adversarial deep learning techniques are based on changing input distributions (adversaries), with the goal of causing false classifications when input to a deep neural network classifier. Adversaries aim to maximize the output error while only exerting minimal perturbations to the input data. Moreover, various techniques to defend against such attacks have been developed in the past. While...
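As a minimal sketch of how such adversaries are constructed (using the standard fast gradient sign method on a toy classifier, not the setup of this contribution), the perturbation is a small step in the direction that increases the classification loss.

    # Illustrative FGSM adversarial perturbation on a toy classifier and random input.
    import torch

    torch.manual_seed(0)
    model = torch.nn.Sequential(torch.nn.Linear(10, 32), torch.nn.ReLU(), torch.nn.Linear(32, 2))
    x = torch.randn(1, 10, requires_grad=True)
    label = torch.tensor([1])

    loss = torch.nn.functional.cross_entropy(model(x), label)
    loss.backward()

    epsilon = 0.1
    x_adv = x + epsilon * x.grad.sign()      # FGSM: small step that increases the loss

    print("clean prediction:      ", model(x).argmax(dim=1).item())
    print("adversarial prediction:", model(x_adv).argmax(dim=1).item())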
The Ring Imaging Cherenkov (RICH) detector is integral to the CBM experiment's electron identification process, aiming to distinguish electrons and suppress pions in the study of dielectronic decay channels of vector mesons. This study is crucial for exploring the phase diagram of strongly interacting matter under conditions of high net baryon densities and moderate temperatures, as...
The Advanced Virgo interferometer is a complex machine constantly monitored by a vast array of sensors, producing the auxiliary channels datastream. Many analytical tools aid in the task of navigating the information contained in the $\sim 10^5$ channels, but the limitations of linear algorithms can hinder their capability to correctly assess the health of the instrument. In this...
A major task in particle physics is the measurement of rare signal processes. These measurements are highly dependent on the classification accuracy of these events in relation to the huge background of other Standard Model processes. Reducing the background by a few tens of percent with the same signal efficiency can already increase the sensitivity considerably.
This study demonstrates...
The quantum-chromodynamic substructure of hadrons at the smallest scales relies critically on the accurate interpretation of abundant experimental data generated by large-scale infrastructures such as the Large Hadron Collider. Comparing a multitude of measured cross sections with the latest higher-order theory predictions, we probe the validity of the standard model of particles with...
This research introduces a physics-driven graph neural network (GNN) [1] tailored for the identification and reconstruction of $\Lambda$ hyperons in the WASA-FRS [2] experiment. The reconstructed $\Lambda$ hyperons serve as calibration processes, essential for the primary objective of the experiment, namely to detect hypertritons. This GNN is based upon successfully developed machine learning...
The upcoming silicon-based sampling calorimeters, such as the high-granularity calorimeter of the CMS experiment, will have unprecedented granularity in both the lateral and longitudinal dimensions. We expect these calorimeters to greatly benefit from machine learning-based reconstruction techniques. With the novel idea of interpreting the multiple sampling layers of calorimeters in the $\eta$...
Recent experiments with high-energy heavy ion beams challenge the current understanding of light hypernuclei (sub-atomic nuclei exhibiting strangeness), particularly the hypertriton [1,2,3,4,5,6,7,8]. This perplexing situation, known as the "hypertriton puzzle," is the focal point of our European-Japanese collaboration between CSIC – Spain, GSI-FAIR – Germany and RIKEN – Japan within the...
In 2015, the first gravitational wave from a binary black hole merger was detected and since then, the LIGO-Virgo-KAGRA collaborations have observed many binary black hole mergers. However, identifying these cosmic events is computationally expensive. Therefore, fast data analysis will be essential in order to make future gravitational-wave observations a success. Template banks are used to identify potential...
The Alpha Magnetic Spectrometer-02 (AMS-02) experiment is a magnetic spectrometer on the International Space Station (ISS) that can measure the flux of particles from cosmic sources in a rigidity window ranging from GVs to a few TVs and up to at least Nickel (charge Z=28). High-precision measurements of fluxes of rare nuclei, such as Sc, Ti, and Mn, provide unique constraints to models of...
One of the most important challenges in High Energy Physics today is to find rare new physics signals among an abundance of Standard Model proton-proton collisions, also known as anomaly detection. Deep Learning (DL) based techniques for this anomaly detection problem are increasing in popularity [1]. One such DL technique is the Deep SVDD model [2], which shows great results when applied to...
We report progress in using transformer models to generate particle theory Lagrangians. By treating Lagrangians as complex, rule-based constructs similar to linguistic expressions, we employ transformer architectures, proven in language processing tasks, to model and predict Lagrangians. A dedicated dataset, which includes the Standard Model and a variety of its extensions featuring various...
In the LHCb experiment, during Run 2, more than 90% of the computing resources available to the Collaboration were used for detector simulation. The detector and trigger upgrades introduced for Run 3 allow the collection of larger datasets that, in turn, will require larger simulated samples. Despite the use of a variety of fast simulation options, the demands for simulations will far exceed the...
The Compressed Baryonic Matter (CBM) experiment, located at the Facility for Antiproton and Ion Research (FAIR) accelerator complex in Darmstadt, Germany, aims to study the phase diagram of strongly interacting matter in the realm of high net baryon densities and moderate temperatures. The SIS-100 accelerator ring at FAIR produces accelerated beams up to the energies of about 30 GeV for...
With metallic-magnetic calorimeters (MMCs) - like the maXs-detector series developed within this collaboration - promising new tools for high-precision x-ray spectroscopy applications have become available. Because of their unique working principles, MMCs combine several advantages over conventional energy- and wavelength-dispersive photon detectors. They can reach spectral resolving powers of...
Machine Learning (ML) techniques have been employed by the high energy physics (HEP) community since the early 1980s to deal with a broad spectrum of problems. This work explores the prospects of using Deep Learning techniques to estimate elliptic flow (v2) in heavy-ion collisions at the RHIC and LHC energies. A novel method is developed to process the input observables from track-level...
The next generation of observatories such as the Vera C. Rubin Observatory and Euclid are posing a massive data challenge. An obstacle we need to overcome is the inference of accurate redshifts from photometric observations that can be limited to a handful of bands. We addressed this challenge with a forward modeling framework, pop-COSMOS, calibrated by fitting a population model to...
Strong gravitational lensing has become one of the most important tools for investigating the nature of dark matter (DM). With a technique called gravitational imaging, the number and mass of dark matter subhaloes can be measured in strong lenses, constraining the underlying DM model.
Gravitational imaging however is an expensive method and requires adaptation in astronomy's current "big...
Foundation models are increasingly prominent in various physics subfields. Moreover, the application of supervised machine learning methods in astronomy suffers from scarce training data. We explore computer vision foundation models, focusing on their application to radio astronomical image data.
Specifically, we explore the unsupervised, morphological classification of radio sources through...
A particularly interesting application of autoencoders (AE) for High Energy Physics is their use as anomaly detection (AD) algorithms to perform a signal-agnostic search for new physics. This is achieved by training AEs on standard model physics and tagging potential new physics events as anomalies. The use of an AE as an AD algorithm relies on the assumption that the network better...
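A minimal sketch of the AE-based anomaly detection recipe described above, assuming toy tabular data and a small dense autoencoder: the network is trained to reconstruct background-like events only, and the per-event reconstruction error is then used as the anomaly score.

    # Hypothetical sketch: autoencoder trained on background only, reconstruction error as anomaly score.
    import torch

    torch.manual_seed(0)
    background = torch.randn(5000, 8)                 # stand-in for SM-like events
    anomalies  = torch.randn(100, 8) * 0.5 + 3.0      # stand-in for signal-like events

    model = torch.nn.Sequential(
        torch.nn.Linear(8, 4), torch.nn.ReLU(),
        torch.nn.Linear(4, 2), torch.nn.ReLU(),       # bottleneck
        torch.nn.Linear(2, 4), torch.nn.ReLU(),
        torch.nn.Linear(4, 8),
    )
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)

    for epoch in range(200):                          # train on background only
        opt.zero_grad()
        loss = torch.nn.functional.mse_loss(model(background), background)
        loss.backward()
        opt.step()

    def anomaly_score(x):
        with torch.no_grad():
            return ((model(x) - x) ** 2).mean(dim=1)

    print("background score:", anomaly_score(background).mean().item())
    print("anomaly score:   ", anomaly_score(anomalies).mean().item())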
Machine learning can be a powerful tool to discover new signal types in astronomical data. In our recent study, we have applied it for the first time to search for long-duration transient gravitational waves triggered by pulsar glitches, which could yield physical insight into the mostly unknown depths of the pulsar. Other methods previously applied to search for such signals rely on matched...
In this work we demonstrate that significant gains in performance and data efficiency can be achieved moving beyond the standard paradigm of sequential optimization in High Energy Physics (HEP). We conceptually connect HEP reconstruction and analysis to modern machine learning workflows such as pretraining, finetuning, domain adaptation and high-dimensional embedding spaces and quantify the...
Catalogs of astronomical sources contain many sources of unknown physical nature. In particular, in Fermi-LAT catalogs of gamma-ray sources about one third of the sources have unknown multi-wavelength counterparts. Some of the gamma-ray sources may be visible only in gamma rays, such as distant pulsars with radio jets not pointing at the observer. Machine learning algorithms provide a tool to perform a...
Methods for training jet taggers directly on real data are well motivated due to both the ambiguity of parton labels and the potential for mismodelled jet substructure in Monte Carlo. This talk presents a study of weakly-supervised learning applied to Z+jet and dijet events in CMS Open Data. In order to measure the discrimination power in real data, we consider three different estimates of the...
Timepix4 is a hybrid pixel detector readout ASIC developed by the Medipix4 Collaboration at CERN. It consists of a matrix of about 230\,k pixels, each equipped with amplifier, discriminator and time-to-digital converter with 195 ps bin size that allows measurement of both the time-of-arrival and time-over-threshold of the hits. Due to its characteristics, it can be exploited in a wide range of fields,...
Modern simulation-based inference techniques leverage neural networks to solve inverse problems efficiently. One notable strategy is neural posterior estimation (NPE), wherein a neural network parameterizes a distribution to approximate the posterior. This approach is particularly advantageous for tackling low-latency or high-volume inverse problems. However, the accuracy of NPE varies...
In any lattice QCD based study, gauge configurations have to be generated using some form of Monte Carlo simulations. These are then used to compute physical observables. In these measurements, physical observables (like the chiral condensate or baryon number density) can be expressed as a trace of a combination of products of the inverse fermion matrix. These traces are usually estimated...
Semivisible jets are a novel signature arising in Hidden Valley (HV) extensions of the SM with a confining interaction [1]. Originating from a double shower and hadronization process and containing undetectable dark bound states, semivisible jets are expected to have a substantially different radiation pattern compared to SM jets.
Unsupervised...
In the field of nuclear physics, multi-neutron detection plays a critical role in revealing specific nuclear properties (e.g. the structure of light exotic nuclei or four-neutron resonance states). However, one neutron can interact several times in different bars of a neutron detector array, since it will likely pass through the detectors without losing all its energy. The phenomenon commonly...
The BERT pretraining paradigm has proven to be highly effective in many domains including natural language processing, image processing and biology. To apply the BERT paradigm the data needs to be described as a set of tokens, and each token needs to be labelled. To date the BERT paradigm has not been explored in the context of HEP. The samples that form the data used in HEP can be described...
We present a Machine Learning approach to perform fully Bayesian inference of the neutron star equation of state given results from parameter estimation from gravitational wave signals of binary neutron star (BNS) mergers. The detection of gravitational waves from the BNS merger GW170817 during the second observing run of the ground-based gravitational wave detector network provided a new...
I will present an explainable deep learning framework for extracting new knowledge about the underlying physics of cosmological structure formation. I will focus on an application to dark matter halos, which form the building blocks of the cosmic large-scale structure and wherein galaxy formation takes place. The goal is to use an interpretable neural network to generate a compressed, “latent”...
The application of modern Machine Learning (ML) techniques for anomaly detection in collider physics is a very active and prolific field, with use cases that include the exploration of physics beyond the Standard Model and the detection of faults in the experimental setup. Our primary focus is on data-quality monitoring. Within large experimental collaborations, this anomaly detection task...
Strong gravitational lenses are a singular probe of the Universe's small-scale structure --- they are sensitive to the gravitational effects of low-mass ($<10^{10} M_\odot$) halos even without a luminous counterpart. Recent strong-lensing analyses of dark matter structure rely on simulation-based inference (SBI). Modern SBI methods, which leverage neural networks as density estimators, have...
We present a newly developed code, JERALD - JAX Enhanced Resolution Approximate Lagrangian Dynamics -, that builds on the Lagrangian Deep Learning method (LDL) of Dai and Seljak (2021), improving on the time and the memory requirements of the original code. JERALD takes as input DM particle positions from a low-resolution, computationally inexpensive run of the approximate N-body simulator...
The Hubble function entirely characterizes a given Friedmann-Robertson-Walker spacetime as a consequence of homogeneity and isotropy on cosmological scales. In conjunction with the gravitational field equation, it can be related to the densities of the cosmological fluids and their respective equations of state. Type Ia supernovae allow us to constrain the evolution of the luminosity distance...
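For reference, the link between the Hubble function and the supernova observables can be written down in a few lines; the sketch below assumes a flat LCDM form for $H(z)$ with illustrative fiducial parameters and computes the luminosity distance and distance modulus by direct integration.

    # Illustrative flat-LCDM luminosity distance from the Hubble function (fiducial parameters only).
    import numpy as np
    from scipy.integrate import quad

    c = 299792.458            # speed of light in km/s
    H0, Omega_m = 70.0, 0.3   # illustrative fiducial values

    def H(z):
        return H0 * np.sqrt(Omega_m * (1 + z) ** 3 + (1 - Omega_m))

    def luminosity_distance(z):
        # d_L(z) = (1 + z) * c * integral_0^z dz' / H(z') for a flat universe, in Mpc
        comoving, _ = quad(lambda zp: c / H(zp), 0.0, z)
        return (1 + z) * comoving

    def distance_modulus(z):
        return 5 * np.log10(luminosity_distance(z)) + 25

    for z in (0.1, 0.5, 1.0):
        print(z, round(luminosity_distance(z), 1), round(distance_modulus(z), 2))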
Weakly supervised machine learning has emerged as a powerful tool in particle physics, enabling the classification of data without relying on extensive labeled examples. This approach holds immense potential for the identification of exotic objects in the gamma-ray sky, particularly those arising from dark matter annihilation. In this contribution, we present our methodology for exploring this...
Traditionally, searches for new physics use complex computer simulations to reproduce what Standard Model processes should look like in collisions recorded by the LHC experiments. These are then compared to simulations of new-physics models (e.g. dark matter, supersymmetry, etc.).
The lack of evidence for new interactions and particles since the Higgs boson’s discovery has motivated the...
Modern machine learning will allow for simulation-based inference from reionization-era 21cm observations at the Square Kilometre Array. Our framework combines a convolutional summary network and a conditional invertible network through a physics-inspired latent representation. It allows for an optimal and extremely fast determination of the posteriors of astrophysical and cosmological...
Accelerator-based experiments in particle physics and medical experiments in neuroscience generate petabytes of data, where well-defined questions can be answered by intense computational analysis; however, new correlations may remain hidden in the huge data-sea. On the other hand, physics/neuroscience-informed AI/ML can help to discover new connections, integrating seamlessly data and...
Foundation models are multi-dataset and multi-task machine learning methods that once pre-trained can be fine-tuned for a large variety of downstream applications. The successful development of such general-purpose models for physics data would be a major breakthrough as they could improve the achievable physics performance while at the same time drastically reduce the required amount of...
Cosmic Dawn (CD) and Epoch of Reionization (EoR) are epochs of the Universe which host invaluable information about the cosmology and astrophysics of X-ray heating and hydrogen reionization. Radio interferometric observations of the 21-cm line at high redshifts have the potential to revolutionize our understanding of the Universe during this time. However, modelling the evolution of these...
A recent proposal suggests using autoregressive neural networks to approximate multi-dimensional probability distributions found in lattice field theories or statistical mechanics. Unlike Monte Carlo algorithms, these networks can serve as variational approximators to evaluate extensive properties of statistical systems, such as free energy.
In the case of two-dimensional systems, the...
Recently, conditional normalizing flows have shown promise to directly approximate the posterior distribution via amortized stochastic variational inference from raw simulation data without resorting to likelihood modelling.
In this contribution, I will discuss an open-source GitHub package, "jammy_flows", a pytorch-based project which comes with many state of the art normalizing flows out of...
Weakly supervised methods have emerged as a powerful tool for anomaly detection at the LHC. While these methods have shown remarkable performance on specific signatures, their application in an even more model-agnostic manner requires using higher dimensional feature spaces compared to the first publications on this topic. We present two directions towards more model agnosticity, either by...
Machine learning, in its conventional form, has often been criticised for being a black box, providing outputs without a clear rationale. To obtain more interpretable results we can make use of symbolic regression (SR) which, as opposed to traditional regression techniques, goes beyond curve-fitting and attempts to determine the underlying mathematical equations that best describe the data. In...
Dark energy has ushered in a golden age of astronomical galaxy surveys, allowing for the meticulous mapping of galaxy distributions to constrain models of dark energy and dark matter. The majority of these surveys rely on measuring galaxy redshifts through a limited set of observations in broad optical bands. While determining redshift is theoretically a straightforward machine learning...
We present a pipeline to infer the equation of state of neutron stars from observations based on deep neural networks. In particular, using the standard (deterministic), as well as Bayesian (probabilistic) deep networks, we explore how one can infer the interior speed of sound of the star given a set of mock observations of total stellar mass, stellar radius and tidal deformability. We...
Physics-Informed Neural Networks (PINNs) have gained significant attention in the field of deep learning for their ability to tackle physical scenarios, attracting growing interest since their inception in the scientific literature. These networks optimize neural architectures by incorporating inductive biases derived from knowledge of physics. To embed the underlying physics, a suitable loss...
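A minimal sketch of the PINN recipe mentioned above, assuming the toy ODE $y' = -y$ with $y(0) = 1$: the loss combines the squared residual of the differential equation at random collocation points with a boundary term, so the physics enters directly through the training objective.

    # Hypothetical sketch: PINN for the toy ODE y' = -y, y(0) = 1.
    import torch

    torch.manual_seed(0)
    net = torch.nn.Sequential(
        torch.nn.Linear(1, 32), torch.nn.Tanh(),
        torch.nn.Linear(32, 32), torch.nn.Tanh(),
        torch.nn.Linear(32, 1),
    )
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)

    for step in range(2000):
        t = torch.rand(64, 1, requires_grad=True)          # random collocation points in [0, 1]
        y = net(t)
        dy_dt = torch.autograd.grad(y, t, torch.ones_like(y), create_graph=True)[0]
        residual = dy_dt + y                                # enforce y' = -y
        boundary = net(torch.zeros(1, 1)) - 1.0             # enforce y(0) = 1
        loss = (residual ** 2).mean() + (boundary ** 2).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()

    with torch.no_grad():
        print(net(torch.tensor([[0.5]])).item(), "vs exact", torch.exp(torch.tensor(-0.5)).item())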
Whilst gravitational waves from compact binary signals are well modelled, other transient signals do not necessarily have a clearly defined waveform. Searches for these kinds of signals are often unmodelled and so say little about the system that produced the gravitational wave. Having a method that can extract some information on the structure and dynamics of the system could be crucial...
The Fair Universe project is building a large-compute-scale AI ecosystem for sharing datasets, training large models and hosting challenges and benchmarks. Furthermore, the project is exploiting this ecosystem for an AI challenge series focused on minimizing the effects of systematic uncertainties in High-Energy Physics (HEP), and on predicting accurate confidence intervals.
This talk will...
Non-Gaussian transient noise artifacts, commonly referred to as glitches, are one of the most challenging limitations in the study of gravitational-wave interferometer data due to their similarities with astrophysical source signals in the time and frequency domains. Therefore, exploring novel methods to recover physical information from data corrupted by glitches is essential. In our work,...
Gauge symmetry is fundamental to describing quantum chromodynamics on a lattice. While the local nature of gauge symmetry presents challenges for machine learning due to the vast and intricate parameter space, which involves distinct group transformations at each spacetime point, it remains a fundamental and indispensable prior in physics. Lattice gauge equivariant convolutional neural...
Anomaly detection at the LHC broadens the search for BSM effects by making no assumptions about the signal hypothesis. We employ ML to perform density estimation on raw data and use the density estimate for anomaly detection. A neural network can learn the physics content of the raw data. However, the gain in sensitivity to features of interest can be hindered by redundant information already...
The LHCb experiment at the Large Hadron Collider (LHC) is designed to perform high-precision measurements of heavy-hadron decays, which requires the collection of large data samples and a good understanding and suppression of multiple background sources. Both factors are challenged by a five-fold increase in the average number of proton-proton collisions per bunch crossing, corresponding to a...
In recent years, deep learning algorithms have excelled in various domains, including Astronomy. Despite this success, few deep learning models are planned for online deployment in the O4 data collection run of the LIGO-Virgo-KAGRA collaboration. This is partly due to a lack of standardized software tools for quick implementation and deployment of novel ideas with confidence in production...
Knowledge of the primordial matter density field from which the present non-linear observations formed is of fundamental importance for cosmology, as it contains an immense wealth of information about the physics, evolution, and initial conditions of the universe. Reconstructing this density field from the galaxy survey data is a notoriously difficult task, requiring sophisticated statistical...
How can we gain physical intuition in real-world datasets using ‘black-box’ machine learning? In this talk, I will discuss how ordered component analyses can be used to separate, identify, and understand physical signals in astronomical datasets. We introduce Information Ordered Bottlenecks (IOBs), a neural layer designed to adaptively compress data into latent variables organized by...
A dedicated experimental search for a muon electric dipole moment (EDM) is being set up at PSI. This experiment will search for a muon EDM signal with a final precision of \SI{6e-23}{e \cdot cm} using the frozen-spin technique. This will be the most stringent test of the muon EDM to date, improving the current experimental limit by 3 orders of magnitude. A crucial component of the experiment...
Investigating the properties of QCD matter at extreme temperatures and densities is a fundamental objective of high energy nuclear physics. Such matter can be created in facilities like CERN and FAIR for short periods of time through heavy-ion collisions. Particularly interesting are the intermediate energy heavy-ion collision experiments such as CBM@FAIR, STAR-FXT@RHIC and experiments at NICA...
The Multi-disciplinary Use Cases for Convergent new Approaches to AI explainability (MUCCA) project is pioneering efforts to enhance the transparency and interpretability of AI algorithms in complex scientific endeavours. The presented study focuses on the role of Explainable AI (xAI) in the domain of high-energy physics (HEP). Approaches based on Machine Learning (ML) methodologies, from...
The theory of the strong force, quantum chromodynamics, describes the proton in terms of its constituents, the quarks and gluons. A major conundrum since the formulation of QCD five decades ago has been whether heavy quarks also exist as a part of the proton wavefunction determined by non-perturbative dynamics: so-called intrinsic heavy quarks. Innumerable efforts to establish intrinsic charm...
Gravitational wave parameter estimation plays a crucial role in understanding astrophysical phenomena, yet it is often challenged by real-world noise inherent in the detection process. In this work, we use the simulation-based-inference pipeline PEREGRINE to do robust parameter estimation and tailor it to address the complexities of real noise in gravitational wave data analysis. We aim to...
Accurately reconstructing particles from detector data is a critical challenge in experimental particle physics. The detector's spatial resolution, specifically the calorimeter's granularity, plays a crucial role in determining the quality of the particle reconstruction. It also sets the upper limit for the algorithm's theoretical capabilities. Super-resolution techniques can be explored as a...
The space-time picture of hadron formation in high-energy collisions with nuclear targets is still poorly known. Tests of hadron formation were suggested for the first stage of SPD running. They will require measuring charged pion and proton spectra with a precision better than $10\%$. A study has been carried out to check the feasibility of such measurements at SPD. In this work,...
A key challenge in the field of AI is to make machine-assisted discovery interpretable, enabling it not only to uncover correlations but also to enhance our physical understanding of the world. A nascent branch of machine learning -- Symbolic Regression (SR) -- aims to discover the optimal functional representations of datasets, producing perfectly interpretable outputs (equations) by...
Accurate particle reconstruction from detector data is a fundamental task in experimental particle physics. Traditional methods are becoming sub-optimal in the face of the increasing demands of the High Lumi phase of the LHC, making machine learning-based approaches more relevant.
Incorporating physics knowledge into machine learning-based reconstruction can enhance performance and provide...
Large language models (LLMs) have revolutionized how we interact with knowledge, serving as a critical link that allows humans to interact with large datasets through natural language. This study explores how the scientific community, especially in fundamental physics, can harness the power of LLMs to augment research processes. A significant challenge in science is keeping pace with the...
Simulation-based inference is undergoing a renaissance in statistics and machine learning. With several packages implementing the state-of-the-art in expressive AI [mackelab/sbi] [undark-lab/swyft], it is now being effectively applied to a wide range of problems in the physical sciences, biology, and beyond.
Given the rapid pace of AI, there is little expectation that the implementations...
We deploy an advanced Machine Learning environment, leveraging a multi-scale cross-attention encoder for event classification. Our multi-modal network can extract information from the jet substructure and the kinematics of the final state particles through self-attention transformer layers. The diverse learned information is subsequently integrated to improve classification performance using...
Addressing the challenge of Out-of-Distribution (OOD) multi-set generation, we introduce YonedaVAE, a novel equivariant deep generative model inspired by Category Theory, motivating the Yoneda-Pooling mechanism. This approach presents a learnable Yoneda Embedding to encode the relationships between objects in a category, providing a dynamic and generalizable representation of complex...
Hamiltonian and Lagrangian equations of motion are the workhorse of Theoretical Physics. The behaviour of physical systems is analytically described by a set of, usually complex, PDEs and ODEs. Consequently, the time evolution of such systems requires numerical integrators and in the case of, e.g., black hole binary evolution, this is, in most cases, computationally expensive or even...
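As a small worked example of the kind of numerical integrator these equations of motion require, the sketch below evolves a 1D harmonic oscillator with the symplectic leapfrog scheme, which conserves energy well over long times; the Hamiltonian and step size are illustrative choices.

    # Illustrative leapfrog integration of a 1D harmonic oscillator, H = p^2/2 + q^2/2.
    import numpy as np

    def leapfrog(q, p, dq_dt, dp_dt, dt, n_steps):
        # Hamilton's equations: dq/dt = dH/dp, dp/dt = -dH/dq
        for _ in range(n_steps):
            p += 0.5 * dt * dp_dt(q)     # half kick
            q += dt * dq_dt(p)           # drift
            p += 0.5 * dt * dp_dt(q)     # half kick
        return q, p

    q, p = 1.0, 0.0
    q, p = leapfrog(q, p, dq_dt=lambda p: p, dp_dt=lambda q: -q,
                    dt=0.01, n_steps=int(2 * np.pi / 0.01))
    print("after one period:", q, p, " energy:", 0.5 * (p**2 + q**2))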