Algorithms Begin to Listen: ICASSP 2026 Launches a New Phase in Musical Perception

Author: Inna Horoshkina One

In 2026, the automated evaluation of musical aesthetics reached a new milestone with the launch of the Automatic Song Aesthetics Evaluation Challenge at ICASSP, the IEEE International Conference on Acoustics, Speech and Signal Processing.

Its objective is to train algorithms to predict human aesthetic judgments of musical compositions. This is not about music generation.

It is an attempt to understand what makes music beautiful.


When Music Becomes an Object of Machine Perception

In recent years, artificial intelligence has focused heavily on learning to generate sound:

melodies
harmonies
timbres
voices

But the new challenge is different: can an algorithm evaluate musicality the way a human listener does?

This is precisely the focus of the ICASSP 2026 competition.

Models are provided with audio clips and tasked with predicting human aesthetic scores across several perceptual dimensions:

sonic cohesion
naturalness
memorability
clarity
musicality

These parameters reflect human perception rather than the technical quality of the recording.
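To make the task concrete, here is a minimal sketch of the kind of model such a challenge invites: a network that maps an audio clip to one scalar score per perceptual dimension. Everything in it, including the architecture, the mel-spectrogram front end, and the sample rate, is my own illustrative assumption, not the challenge's actual baseline or API.

```python
import torch
import torch.nn as nn
import torchaudio

# The article's five perceptual dimensions, used here as output labels.
DIMENSIONS = ["cohesion", "naturalness", "memorability", "clarity", "musicality"]

class AestheticScorer(nn.Module):
    """Hypothetical model: raw audio in, five aesthetic scores out."""

    def __init__(self, n_mels: int = 64, hidden: int = 128):
        super().__init__()
        # Log-mel spectrogram front end; 16 kHz mono audio is assumed.
        self.melspec = torchaudio.transforms.MelSpectrogram(
            sample_rate=16_000, n_mels=n_mels
        )
        # A small recurrent encoder stands in for whatever backbone a real entry would use.
        self.encoder = nn.GRU(input_size=n_mels, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, len(DIMENSIONS))  # one scalar score per dimension

    def forward(self, waveform: torch.Tensor) -> torch.Tensor:
        # waveform: (batch, samples) -> mel: (batch, n_mels, frames)
        mel = self.melspec(waveform).clamp(min=1e-5).log()
        _, h = self.encoder(mel.transpose(1, 2))  # summarize the clip via the final GRU state
        return self.head(h.squeeze(0))            # (batch, 5) predicted aesthetic scores

model = AestheticScorer()
clip = torch.randn(1, 16_000 * 10)  # ten seconds of dummy audio in place of a real excerpt
scores = model(clip).squeeze(0)
print(dict(zip(DIMENSIONS, scores.tolist())))
```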


From Generation to Perception

This marks a significant turning point in the evolution of musical AI.

While previous systems focused on creating sound, newer ones are learning to interpret its aesthetic value.

Essentially, this represents a shift from synthesis to understanding: from the algorithm as a performer to the algorithm as a listener.


Musical Aesthetics Becomes a Measurable Field of Research

The ICASSP 2026 Challenge builds upon work initiated with the SongEval 2025 dataset, which established the first large-scale database of human aesthetic evaluations for musical excerpts.

Now, this field is maturing into an international competition for AI models.
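How might such a competition rank its entries? Challenges of this kind commonly score submissions by how well a model's predictions correlate with human ratings. The snippet below is a hedged sketch of that idea using a Spearman rank correlation on synthetic data; the metric choice is my assumption, not the challenge's confirmed protocol.

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
human = rng.uniform(1, 5, size=200)               # hypothetical 1-5 ratings for 200 clips
predicted = human + rng.normal(0, 0.5, size=200)  # an imperfect model's predictions

# Rank correlation rewards getting the ordering of clips right,
# not matching the raw rating values exactly.
rho, p_value = spearmanr(human, predicted)
print(f"Spearman rho = {rho:.3f} (p = {p_value:.2e})")
```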

This signifies that musical beauty is increasingly becoming a subject of computational analysis while simultaneously remaining a uniquely human experience.


A Space for Collective Listening

Meanwhile, research groups at Queen Mary University of London and Imperial College London continue to investigate musical perception and the interaction between listeners and technologically generated music.

Listening to music is increasingly becoming a shared space where humans and algorithms interact.

Not a replacement, but a collaboration.


What This Adds to the Sound of the Planet

Today, for the first time, algorithms are learning to do more than produce sound.

They are learning to listen. And through this evolution, music is gradually becoming not only a performing art but also a science of perception, where humans and technology begin to hear as one.
