Text this: Audio-visual automatic speech recognition using Dynamic Bayesian Networks