Vision and Text: Search, Generation and Translation