01
PERSONA DESIGN
Defining the voice characteristics of an assistant
PERSONA CARACHTERISTICS
Microsoft Cognitive Service’s standard voices will be used by both 1st party and 3rd party apps in different applications and scenarios. Voice is one of the interactive ways between products/services and end users, so it should have the following qualifications:
THE PERFECT ASSISTANT
- The voice sounds youthful, energising and positive
- The voice should sound reliable and pleasant, as a close friend, a guide, a favourite teacher who you love to talk with
- The voice should be able to convey the information in a natural conversational style, but with a bit of professional confidence of knowledge so that users can fully trust the information
- The voice should be a mix of traditional and modern, simple and closer to the root of the languages, but also catching up with the moving world with a right balance of seriousness and kindness
- The voice definitely should not sound robotic but very humane, easy-going, warm and approachable, sensitive to boundaries and likes to keep the conversations to the point and crisp, sensitive to contexts and situations and skilfully adjusts demeanour and behaviour accordingly, can be humorous and funny at times but knows when to be serious.
- The voice should sound proactive, intuitive and understands the user’s needs and offer multiple solutions and alternatives to choose from
VOICE QUALITY REQUIREMENTS
Pitch: medium, neither too soft nor too loud. The pitch should be such that it doesn’t indicate a boring and dull voice.
Accent: should be a common, standard national wide accent
Pace: medium, neither too fast nor too slow. Too fast means that one eats words and becomes incoherent, too slow will sounds dragged as well as boring and cannot hold one’s attention Articulation: more conversational and easier to understand Speaking style: should blend with casualness, professionalism and empathy, should not sound mechanical or speaking from a rehearsed script.
Timbre: clear, smooth and melodious, smiling voice which indicates cheerfulness and positivity. Empathy and warm for content which need to show understanding