Home

Matteo Lionello

AI / ML Lead for Applied Product Innovation

Machine learning researcher and applied AI consultant with 8+ years of experience designing AI systems, model pipelines, and data-driven products across audio, multimodal analysis, and perception modelling.

Curriculum


I help teams translate AI capabilities into practical product opportunities: defining use cases, designing ML architectures, building evaluation workflows, and connecting technical feasibility with user and business value. My work spans deep learning, signal processing, real-time audio enhancement, multimodal modelling, and applied research systems.

Education and Job Experience


The Neuro – Montreal Neurological Institute (MNI), McGill, Canada April – October 2025
Visiting at The Auditory Cognitive Neuroscience Laboratory, Dr. Robert Zatorre
Part of a project in: "EEG and TMS for Music Perception and Cognition"
IMT Lucca, Italy November 2022 – thesis submitted November 2025
PhD student in Cognitive, Computational and Social Neuroscience at Social and Affective Neuroscience (SANe)
"Emotions in Music: How Naturalistic Music Listening Encodes Acoustic and Emotional Features in the Brain and Is Shaped by Extra-Musical Context"
Machine Learning research consultancy for audio , UK 2021 - 2022
The Bartlett Institute, University College London (UCL), London October 2018 - February 2021
MPhil Student at the Bartlett Institute
Thesis title: A new methodology for modelling urban soundscapes: a psychometric revisitation of the current standard and a Bayesian approach for individual response prediction
Universitat Pompeu Fabra, Barcelona September - December 2017
Visiting student at Music Technology Group, MTG - TELMI ERC project
Project title: A Machine Learning Approach to Violin Vibrato Modelling in Audio Performances and a Didactic Application for Mobile Devices
Aalborg University, Copenhagen October 2016 - June 2018
MSc in Sound and Music Computing, SMC
Thesis title: Deep Learning for Sounds Representation and Generation
University of Padova, Padova September 2011 - November 2015
BSc in Information Engineering
Thesis title: Interactive Soundscapes: Design of a physical space augmented by dynamic sound rendering.

Reviewer activity for Journals:


  • The Journal of the Acoustical Society of America (JASA), Acoustical Society of America
  • European Physical Journal, Springer
  • Neurocomputing, Elsevier
  • Quality and Quantity, Springer Science+Business Media
  • Building Simulation Journal, Springer

Conferences and Workshops


  • Git and Github Tutorial and Workshop at BrainHack Lucca 2025* December 2 to 5, 2025 Lucca
  • BrainHack Lucca 2025 (BHL25)* December 2 to 5, 2025 Lucca
  • Git and Github Tutorial and Workshop at BrainHack Lucca 2024* December 2 to 5, 2024 Lucca
  • BrainHack Lucca 2024 (BHL24)* December 2 to 5, 2024 Lucca
  • Audio Mostly (AUDIOMOSTLY 2024) September 18 to 20, 2024 Milan
  • Organization for Human Brain Mapping 2024 (OHBM24) July 23 to 27, 2024 Seoul
  • Società Italiana di Psicofisiologia e Neuroscienze Cognitive (SIPF) November 9 to 11, 2023 Siena
  • International Conference on Acoustics, Speech, and Signal Processing (ICASSP) May 4 to 8, 2020 Barcelona
  • International Congress on Acoustics (ICA2019) September 9 to 13, 2019 Aachen
  • Machine Learning for Acoustics Summer School (UKANSS19) August 5 to 9, 2019 Gregynog Hall, Tregynon
  • Soundscape Workshop (IOA) June 25, 2019 London
  • Sound and Music Computing Conference (SMC Conference) July 4 to 7, 2018 Limassol
  • International Workshop on Machine Learning and Music (MML) October 6, 2017 Barcelona
  • New Interfaces for Musical Expression (NIME) May 15 to 19, 2017 Copenhagen
  • Sound and Music Computing Summer School August 26 to 30, 2016 Hamburg

Prizes and Awards


  • IMT Lucca - Fully funded PhD scholarship in Cognitive, Computational and Social Neuroscience
  • UKAN Acoustics Network Summer School Grant
  • Pounds 500,00 Young Scientist Conference Attendance Award
  • Studentship in urban sound environment, EU ERC.
  • Awarded grant of DKK 6.963,00 by OTTO MØNSTEDS FOND.

Technical Knowledge


Programming Languages:
Java, Python, Matlab, Unix, basic knowledge of C++ and PHP
Libraries:
Keras, TensorFlow, Pytorch, Pytorch Lightning
Protocols:
OpenSoundsControl (OSC) and TCP/IP
Databases:
basic knowledge of SQL and MySQL
Tools:
Claude Code, MCP, Agentic Engineering, Git & GitHub, GoogleCloud, PureData, LaTeX and basic knowledge of PhP

Language Skills


Italian: Native language; English: C1 (7.5 IELTS, October 2018); Spanish: B\'asico

Projects


Applied ML system design · AI product discovery · Model evaluation · Real-time ML pipelines · Multimodal data · Audio AI · LLM-assisted workflows · Technical mentoring · Research-to-product translation · Cross-functional communication

Past and Ongoing Collaborations


Collaboration on an EEG multisensory integration project (a project by Martina Battista at SEED, IMT Lucca)

address: IMT, Lucca

Collaboration on a TMS/EEG data acquisition project (a project by Giorgio Lazzari at MusiCognition Lab, Universita di Pavia)

address: Neuro, McGill, Montreal

Collaboration on NeuroInfer: software for automated reverse inference (a project by Davide Coraci at MIND, IMT Lucca)

address: IMT, Lucca

Research at the Social and Affective Neuroscience Group (SANE)


Emotions in Music: How Naturalistic Music Listening Encodes Acoustic and Emotional Features in the Brain and Is Shaped by Extra-Musical Context

address: IMT, Lucca

A review of theoretical and empirical work on emotion in music, predictive coding and reward, acoustic and extra-musical cues, and the neuroimaging of

A review of theoretical and empirical work on emotion in music, predictive coding and reward, acoustic and extra-musical cues, and the neuroimaging of music and emotion. Motivates the use of naturalistic, extended musical stimuli and lays out the multi-level framework (encoding, connectivity, context) that structures the thesis. show less

NaPuCcoL Non-Parametric Combination for Group-Level Inference

address: IMT, Lucca
link: GitHub Repo

Introduces and validates NaPuCco, a Non-Parametric Combination framework for one-sample, unsigned fMRI statistics that preserves idiographic voxel-wise encoding

Introduces and validates NaPuCco, a Non-Parametric Combination framework for one-sample, unsigned fMRI statistics that preserves idiographic voxel-wise encoding patterns at the group level. Simulation studies show robust control of false positives and good power for voxel- and cluster-wise inference, and an application to naturalistic music listening recovers auditory cortices and right-hemisphere regions sensitive to low-level acoustic features. show less

Music and Emotion in Functional Connectivity

address: IMT, Lucca
link: GitHub Repo

Presents BOSS (Brain Orchestra Synchronization Study), a modular toolbox that uses searchlight-based canonical correlation analysis to disentangle

Presents BOSS (Brain Orchestra Synchronization Study), a modular toolbox that uses searchlight-based canonical correlation analysis to disentangle acoustic- vs emotion-related feature contributions to functional connectivity. During naturalistic music listening, the work reveals distinct connectivity fingerprints for acoustic and emotional features, including emotion-sensitive modulation of midbrain–thalamic pathways and amygdala-centered networks linked to valuation and affect. show less

A Listening Behavioural Study With Contextual Information Manipulation

address: IMT, Lucca
link: website

Uses large language models to generate and manipulate program-note style descriptions with different affective valence before symphonic listening

Uses large language models to generate and manipulate program-note style descriptions with different affective valence before symphonic listening in an online, naturalistic paradigm. Shows that contextual information systematically and bidirectionally biases continuous emotion ratings, categorical judgments, and textual feedback, demonstrating that programme notes act as an interpretative lens on the emotional trajectory of the music. show less

Segmentation and analysis of face expressions during naturalistic affective stimuli

Audio Enhancement - real time


Audio quality enhancement

Designed low-latency deep learning pipelines for real-time audio enhancement, including preprocessing, model architecture, evaluation workflows, and deployment-oriented constraints. Relevant to AI product development where latency, reliability, and user-facing quality matter.

Real time audio denoising

Designed low-latency deep learning pipelines for real-time audio denoising, including preprocessing, model architecture, and evaluation workflows.

Urban Soundscape index modelling


Bayesian-NN for individual soundscape assessment predictions and a perceptual index design, development, evaluation

A Bayesian modelling approach has been implemented to represent and analyse the uncertainty highlighted in the psychometric revisitation of the current standard soundscape ISO.

According to some commonly agreed criteria to u

Psychometric revisitation of ISO protocol for soundscape measuring and data collection protocol design

address: University College London, London
link: Pyscometric revistation of ISO protocol
link: Data collection protocol design

Likert scales are useful for collecting data on attitudes and perceptions from large samples of people. In particular, they have become a well-established tool in soundscape studies for conducting in situ surveys to determine how people experience urban public spaces. However, it is still

Likert scales are useful for collecting data on attitudes and perceptions from large samples of people. In particular, they have become a well-established tool in soundscape studies for conducting in situ surveys to determine how people experience urban public spaces. However, it is still unclear whether the metrics of the scales are consistently interpreted during a typical assessment task. The current work aims at identifying some general trends in the interpretation of Likert scale metrics and introducing a procedure for the derivation of metric corrections by analyzing a case study dataset of 984 soundscape assessments across 11 urban locations in London. According to ISO/TS 12913-2:2018, soundscapes can be assessed through the scaling of 8 dimensions: pleasant, annoying, vibrant, monotonous, eventful, uneventful, calm, and chaotic. The hypothesis underlying this study is that a link exists between correlations across the percentage of assessments falling in each Likert scale category and a dilation/compression factor affecting the interpretation of the scales metric. The outcome of this metric correction value derivation is introduced for soundscape, and a new projection of the London soundscapes according to the corrected circumplex space is compared with the initial ISO circumplex space. The overall results show a general non-equidistant interpretation of the scales, particularly on the vibrant-monotonous direction. The implications of this correction have been demonstrated through a Linear Ridge Classifier task for predicting the London soundscape responses using objective acoustic parameters, which shows significant improvement when applied to the corrected data. The results suggest that the corrected values account for the non-equidistant interpretation of the Likert metrics, thereby allowing mathematical operations to be viable when applied to the data. - From the abstract show less

Music recommendation system


Parametric t-SNE (ANN) for music recommendation system, playlist generation and browser GUI for online music streaming providers

address: Aalborg University, Copenhagen
link: VIMEO

This project presented the development of the model and a user interface for a music-space exploration based on the t-SNE dimension reduction technique, aiming at

This project presented the development of the model and a user interface for a music-space exploration based on the t-SNE dimension reduction technique, aiming at preserving the shapes and structure of a high dimensional dataset of songs, dictated by N-dimensional features vector, to its projection onto a plane. We investigate different models obtained from using different structures of hidden layers, pre-training technique, features selection, and data pre-processing. The resulting output model has been used to build a music-space of 20000 songs, visually rendered for browser interaction, providing the user a certain degree of freedom to explore it by changing the features to highlight, offering an immersive experience for music exploration and playlist generation. show less

Deep Learning for Music


Violin Vibrato modelling (SVM) and GUI on android rendering with real time pitch detection

address: MTG Pompeu Fabra Universitat, Barcellona
link: A Machine Learning Approach to Violin Vibrato Modelling in Audio Performances and a Didactic Application for Mobile Devices
link VIMEO

We present a machine learning approach to model vibrato in classical music violin audio performances. A set of descriptors have been extracted from the music scores of the performed pieces and used to train a model for classifying notes into vibrato or non-vibrato, as well as for predicting the performed vibrato amplitude and frequency.

We present a machine learning approach to model vibrato in classical music violin audio performances. A set of descriptors have been extracted from the music scores of the performed pieces and used to train a model for classifying notes into vibrato or non-vibrato, as well as for predicting the performed vibrato amplitude and frequency. In addition to score features we have included a feature regarding the fingering used in the performance. The results show that the fingering feature affects consistently the prediction of the vibrato amplitude. Finally, an implementation of the resulting models is proposed as a didactic real-time feedback system to assist violin students in performing pieces using vibrato as an expressive resource. show less

LSTM modelling for melody and rhythmic structure extraction and generation

address: Aalborg University, Copenhagen

One of the most suggestive concept in artificial intelligence applied to music, shown in several recent studies, is the concept of style. Even if the definition of style is hard to be explicated without considering also historical and social contexts, from a basic overview we can think about it as the sequence of patterns composing the structure of a music piece. The style could be thought indeed as a particular pattern hidden

One of the most suggestive concept in artificial intelligence applied to music, shown in several recent studies, is the concept of style. Even if the definition of style is hard to be explicated without considering also historical and social contexts, from a basic overview we can think about it as the sequence of patterns composing the structure of a music piece. The style could be thought indeed as a particular pattern hidden in a sequence of symbols describing a music work, doing so it will be easy to manage the information represented by the structure containing the symbols, using them in order to manipulate the style itself or to replicate it. Making a deeper step into this subject we can separate the main goal in two different problems. One first problem is, given an audio support from which extract the style information, find an automatic way to reduce the original audio to a stream of symbols in which the style is encoded. The second problem concerns using this stream of symbols to decode the style they are carrying; in order to do that it is introduced a support structure capable to decode and memorize the piece structure that will be re-used to compose new music sequences referring to the same style of the original one. The purpose of this study is the evaluation of Long Short Term Memory network, for music generation from a percussive sequence as an example. The sequence segmented and symbolized through a single-linkage algorithm based on MFCC analysis, and the fed into the network. Then the network is trained with the analyzed data and gains the ability to generate new percussive sequences, according to the example. The results are compared with the previously implemented method of Variable Length Markov Chain models for music generation. show less

Variational Autoencoder for sounds morphing

address: Aalborg university
link: End-To-End_Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing

This project was developed with tensorflow 1.4 where probabilistic inference was not yet available. Two strategies for end-to-end variational autoencoders (VAE) for sound morphing are compared: VAE with

This project was developed with tensorflow 1.4 where probabilistic inference was not yet available. Two strategies for end-to-end variational autoencoders (VAE) for sound morphing are compared: VAE with dilation layers (DC-VAE) and VAE only with regular convolutional layers (CC-VAE). The training strategy used a combination of the following loss functions: 1) the time-domain mean-squared error for reconstructing the input signal, 2) the Kullback-Leibler divergence to the standard normal distribution in the bottleneck layer, and 3) the classification loss calculated from the bottleneck representation. On a database of spoken digits, 1-nearest neighbor classification was used to show that the sound classes separate in the bottleneck layer. We introduce the Mel-frequency cepstrum coefficient dynamic time warping (MFCC-DTW) deviation as a measure of how well the VAE decoder projects the class center in the latent (bottleneck) layer to the center of the sounds of that class in the audio domain. In terms of MFCC-DTW deviation and 1-NN classification, DC-VAE outperforms CC-VAE. These results limited to the current parametrization and the dataset indicate that DC-VAE is more suitable for sound morphing than CC-VAE, since the DC-VAE decoder better preserves the topology when mapping from the audio domain to the latent space. show less

Artistic augmented space


An Interactive Soundscape augmented space

Address: University of Padua
link: VIMEO

A sonic augmented physical space through granular synthesis: this installation was inspired by Truax's Entrance to the Harbour, from The Vancouver Soundscape 1973 and Pacific Fanfare from The Vancouver Soundscape 1996. It reproduced a collage of sounds referring to a real soundscape. The sounds were spacialized in

A sonic augmented physical space through granular synthesis: this installation was inspired by Truax's Entrance to the Harbour, from The Vancouver Soundscape 1973 and Pacific Fanfare from The Vancouver Soundscape 1996. It reproduced a collage of sounds referring to a real soundscape. The sounds were spacialized in a room according to the users's detected position by a camera set on the top of the room. The user had the possibility to explore both the spatial structure of the soundscape and the acoustic structure of its sounds by entering in a central area of the room, where a granular decomposition of the sounds was applied. Launched in the late '70s at the Simon Fraser University, Soundscape Composition is a set of composite strategies working on sound environment. Traditionally linked with granular synthesis, it uses electronic music tools to elaborate environmental sounds, preserving their original contexts. The current state of the art technologies allowed the design and the development of an interactive environment inspired by soundscape composition, in which a user can explore a sound augmented reality referring to a real soundscape. The soundscape exploration occurs on two different layers: on a first layer the user can spatially explore the soundscape, while on a second layer an exploration of the structural composition of the sounds that build the soundscape takes place. This second kind of exploration happens through a granular analysis of the sounds. The user moving through the installation modifies the synthesis and sound dynamic parameters, building a cognitive structure of the augmented environment. The sound feedback of the environment modifies user's awareness and, consequently, his decisions on how to move within it. show less

Environmental Sonification

address: Aalborg University, Copenhagen
link: YOUTUBE

Competitive workshop for developing solutions to the new Lighting-Sound system at the AAU University bridge - winning project

Publications


2026:

M. Lionello (2026), "Music-evoked affect and causal approaches to listening", Manuscript in preparation for Music Perception.

M. Lionello (2026), "Feature-conditioned multivariate functional connectivity during naturalistic music listening dissociates acoustic and emotion contributions", Manuscript in preparation for Cortex.

M. Lionello, L. Cecchetti (2026), "Contextual framing via program notes modulates felt emotion and listeners synchrony during full-length symphonic music", Under review at Music Perception.

M. Lionello, L. Cecchetti (2026), "Non-Parametric Combination for Group-Level Inference: From Simulations to Validation", Manuscript in preparation for NeuroImage.

2021:

M. Lionello, F. Aletta, J. Kang, (2021) "Introducing a Method for Intervals Correction on Multiple Likert Scales: A Case Study on an Urban Soundscape Data Collection Instrument" Frontiers in Psychology

A. Mitchell, T. Oberman, F. Aletta, M. Kachlicka, M. Lionello, M. Erfanian, J. Kang, (2021). "Investigating urban soundscapes of the COVID-19 lockdown: A predictive soundscape modeling approach" The Journal of the Acoustical Society of America

2020:

M. Lionello, F. Aletta, J. Kang, (2020) "A systematic review of prediction models for the experience of urban soundscapes" in Applied Acoustics

A. Mitchell, T. Oberman, F. Aletta, M. Erfanian, M. Kachlicka, M. Lionello, J. Kang, (2020) "The Soundscape Indices (SSID) Protocol: A Method for Urban Soundscape Surveys—Questionnaires with Acoustical and Contextual Information" in Applied Sciences

M. Lionello, F. Aletta, J. Kang. (2019) "On the dimension and scaling analysis of soundscape assessment tools: a case study about the “Method A” of ISO/TS 12913-2:2018"

2019:

F. Aletta, T. Oberman, J. Kang, M. Erfanian, M. Kachlicka, M. Lionello, A. Mitchell, (2019) "Associations between soundscape experience and self-reported wellbeing in open public urban spaces: a field study", The LANCET,

M. Lionello, H. Purwins (2019) "End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing - A Preliminary Study" DOI: 10.13140/RG.2.2.21572.58240/1.

2018:

M. Lionello, L. Pietrogrande, H. Purwins, M. Abou-Zleikha (2018) "Exploration of Musical Space with Parametric t-SNE in a Browser Interface" Proceedings to the 15th Sound and Music Computing Conference, Limassol, Cyprus.

M. Lionello, R. Ramirez, (2018) "A Machine Learning Approach to Violin Vibrato Modelling in Audio Performances and a Didactic Application for Mobile Devices" Proceedings to the 15th Sound and Music Computing Conference, Limassol, Cyprus.

M. Lionello, H. Purwins "Deep Learning for Sounds Representation and Generation" Master thesis, 2018. Aalborg University, Copenhagen available here

2017:

M. Lionello, M. Mandanici, S. Canazza, E. Micheloni, (2017) Interactive Soundscapes: Developing a Physical Space Augmented through Dynamic Sound Rendering and Granular Synthesis" Proceedings of the 14th Sound and Music Computing Conference, Espoo, Finland.

Complete list of publications available on my personal Research Gate page

Summary


Professional Profile:

I am an applied Machine Learning researcher and AI engineer with 10 years of experience designing AI systems across audio, signal processing, multimodal data, and human perception. My work spans deep learning, real-time audio enhancement, representation learning, recommendation systems, Bayesian modelling, and neuroimaging applications. My background combines engineering, industry-facing ML consulting, and academic research at institutions including UCL, the School for Advanced Studies Lucca, and the Montreal Neurological Institute. I have worked across the full AI development lifecycle: problem definition, data preprocessing, feature extraction, model architecture, evaluation workflows, and deployment-oriented pipelines. I am particularly interested in translating advanced AI capabilities into practical, reliable, and product-relevant systems. My strength lies in connecting technical depth with structured thinking, helping teams identify feasible AI opportunities, design robust ML workflows, and communicate complex ideas clearly across technical and non-technical stakeholders.

Educational Background:

My academic path began with a Bachelor’s degree in Information Engineering at the University of Padua, followed by specialized training in Sound and Music Computing at Aalborg University, with industry collaborations and a visiting research period at the Music Technology Group in Barcelona. I later completed an MPhil at University College London, where I developed machine learning methods for urban soundscape prediction as part of the ERC-funded Soundscape Indices project. Beyond technical pursuits, my scholarly motivation extends to ecological preservation and the confluence of engineering with artistic and humanistic domains. Today, my work sits at the intersection of applied AI, human perception, audio technologies, and data-driven product innovation.

Contacts


News