Describe: Proactive spoken dialogue interaction in multi-party environments