[CS logo]

CSPP 56553 - Artificial Intelligence
Winter 2004
Homework #6: Due March 3, 2004


Goals

Through this assignment you will:

Non-Programming Alternative

Modeling Pronunciation

In lecture we discussed a model of the pronunciation of the word about that had been extracted from the Switchboard corpus, a collection of conversational telephone speech. Here we consider pronunciation from a different data source - TIMIT - a phonetically structured corpus of read speech. For TIMIT, participants were asked to a read back a set of sentence prompts. These sentences were constructed to cause each phoneme to appear in as many contexts as possible. Below, you will see a set of pronunciations for the word "permanent" automatically extracted from close manual phonetic transcriptions of the TIMIT recordings. We will construct a weighted automaton model of this word and use it to perform some calculations.

Part A

Identify the states and the legal transitions between states.

Part B

Compute the weights (transition probabilities) for each transition in your automaton, based on the small corpus of pronunciations.

Part C

What is the probability of the pronunciation "pcl p er m ax nx ix n q" according to the model? ('q' represents a glottal stop; it's not a typo)

Part D

Based on this automaton, what is the most probable pronunciation? What is its probability?

Programming Alternative

Problem 1

Implement the Viterbi algorithm.

Apply your implementation to either the "tomato" or "about" automata. Demonstrate the option of the algorithm on two pronunciations.

Note: You only need to return the maximum probability; you do not need to return the path (unless you want the extra challenge).