PhD student position at the Department of Speech, Music and Hearing at KTH Royal Institute of Technology.

Project description

We are looking for a PhD student  interested in Artificial Intelligence, Natural Language Processing and Speech Technology, that will work in a newly funded project at the Department of Speech, Music and Hearing within the School of Electrical Engineering and Computer Science at KTH. The project is financed by the Swedish AI-program WASP (Wallenberg AI, Autonomous Systems and Software Program)), which offers a graduate school with research visits, partner universities, and visiting lecturers.

The newly started project is titled “Thinking Fast and Slow: Real-time Speech Generation for Conversational AI”.  The aim of the project is to develop AI-models capable of generating spoken responses in an incremental fashion, mirroring the nuanced and dynamic nature of human conversation. Our approach is inspired by our previous pioneering efforts in the realm of incremental and predictive models for dialogue, which have laid the groundwork for this project. We aim to construct a dual-component system, based on large language models, consisting of a ‘System I’ module for the rapid generation of response prefixes and a ‘System II’ for crafting more considered and detailed responses. This will be complemented by the development of an incremental speech synthesizer, designed to modulate speech rate and prosody in real-time, in response to the unfolding dialogue context. The models will be evaluated in both offline and online settings, employing both simulated interactions to refine our models under controlled conditions and real-world scenarios to validate their effectiveness in practical applications, for example in human-robot interaction.

The position is mainly a research position, with a small fraction of departmental duties (e.g. teaching).

Supervision: Professor Gabriel Skantze and Assoc. Prof. Gustav Eje Henter

View all positions
We use cookies to personalise content and ads, to provide social media features and to analyse our traffic. We also share information about your use of our site with our social media, advertising and analytics partners. View more
Cookies settings
Privacy & Cookie policy
Privacy & Cookies policy
Cookie name Active
The WASP website uses cookies. Cookies are small text files that are stored on a visitor’s computer and can be used to follow the visitor’s actions on the website. There are two types of cookie:
  • permanent cookies, which remain on a visitor’s computer for a certain, pre-determined duration,
  • session cookies, which are stored temporarily in the computer memory during the period under which a visitor views the website. Session cookies disappear when the visitor closes the web browser.
Permanent cookies are used to store any personal settings that are used. If you do not want cookies to be used, you can switch them off in the security settings of the web browser. It is also possible to set the security of the web browser such that the computer asks you each time a website wants to store a cookie on your computer. The web browser can also delete previously stored cookies: the help function for the web browser contains more information about this. The Swedish Post and Telecom Authority is the supervisory authority in this field. It provides further information about cookies on its website,
Save settings
Cookies settings