Skip to main content
UCL Logo Navigate back to homepage

Main navigation

  • Home
  • Study

    Study

    • Study at UCL
    • Prospective students
    • Current students
    • Accommodation
    • Careers
    • Doctoral School
    • Immigration and visas
    • Student finances
    • Support and wellbeing
  • Research

    Research

    • Research at UCL
    • Engage with us
    • Explore our Research
    • Initiatives and networks
    • Research news
  • Engage

    Engage

    • Engage with UCL
    • Alumni
    • Business partnerships and collaboration
    • Global engagement
    • News and Media relations
    • Policy and political engagement
    • Schools and priority groups
    • Give to UCL
  • About

    About

    • About UCL
    • Who we are
    • Faculties
    • Governance
    • President and Provost
    • Strategy
    • UCL's Bicentenary
  • UCL Logo Active parent page: Brain Sciences
    • Study
    • Research
    • About the Faculty
    • Institutes and Divisions
    • Active parent page: News and Events
    • Contact

Computer-generated voices are easier to understand than human ones, research suggests

UCL and University of Roehampton researchers have discovered that AI-generated voice clones are easier to understand than human speakers in noisy environments.

29 April 2026

Photo of a robot and human

Breadcrumb trail

  • UCL Faculty of Brain Sciences

Faculty menu

  • Current page: Faculty news
  • Events
  • PG Open Events

Breadcrumb trail

  • UCL Faculty of Brain Sciences
  • News and Events
  • Computer-generated voices are easier to understand than human ones, research suggests

From digital assistants like Siri and Alexa to automated telemarketers and answering machines, synthetic voices are increasingly a part of our lives. With the expansion of generative AI, a new type of synthetic voice has been developed: voice clones, which can duplicate a person’s voice from only a few seconds of recorded speech.

In a paper published in The Journal of the Acoustical Society of America, researchers from UCL and the University of Roehampton evaluated the intelligibility of humans and voice clones. They found that voice clones are easier than humans to understand in noisy environments.

Voice clones differ from traditional synthetic voices in the amount of sampling they require. Synthetic voices like Siri require a voice actor to spend hours in a recording booth. In contrast, a voice clone can be made from as little as 10 seconds of speech, significantly expanding the number of potential voices as well as the number of potential applications.

Professor Patti Adank (UCL Division of Psychology and Language Sciences) and Dr Han Wang (University of Roehampton) specialise in studying human perception of unclear speech and were fascinated by the idea of machine-replicated speech. A key question they were looking to answer was just how easy voice clones are for the average person to understand. They suspected that voice clones would simply be poor representations of actual human voices and that people would struggle to understand them. What they found could not be more different.

“I thought initially that voice clones would be less intelligible because they were unfamiliar,” said Professor Adank, “I found they were up to 20% more intelligible, which was quite shocking. A small part of our paper is talking about that experiment, and then a large part is me and my collaborator frantically trying to find out what it is that makes those voice clones more intelligible.”

The duo initially presented volunteers with human voices and voice clones, asking them to rate their intelligibility. After finding that voice clones were consistently rated easier to understand, they repeated the experiment with elderly volunteers to determine if being hard-of-hearing alters the effect; with American volunteers to judge if the accent plays a role; and with a filter designed to mimic cochlear implants. In every case, voice clones emerged victorious.

After examining over 100 acoustic measurements, Professor Adank believes the only way to solve the mystery is to work with collaborators who specialise in text-to-speech systems to adapt an existing open-source cloning system.

“I am now going to try and recreate [the effect] by studying how synthesizers work and how they use digital signal processing to generate those voices, just to get a bit of a handle on this,” said Professor Adank.

Related:

  • Paper in JASA
  • Professor Patti Adank’s academic profile
  • UCL Division of Psychology and Language Sciences

UCL footer

Visit

  • Bloomsbury Theatre and Studio
  • Library, Museums and Collections
  • UCL Maps
  • UCL Shop
  • Contact UCL

Students

  • Accommodation
  • Current Students
  • Moodle
  • Students' Union

Staff

  • Inside UCL
  • Staff Intranet
  • Work at UCL
  • Human Resources
UCL Logo

University College London, Gower Street, London, WC1E 6BT

Tel: +44 (0) 20 7679 2000

UCL social media menu

  • Link to Instagram
  • Link to LinkedIn
  • Link to Youtube
  • Link to TikTok
  • Link to Facebook
  • Link to Bluesky
  • Link to Threads
  • Link to Soundcloud
Here, it can happen.
Back to top

Essential

  • Disclaimer
  • Freedom of Information
  • Accessibility
  • Cookies
  • Privacy
  • Slavery statement
  • Log in

© 2026 UCL