The NLP Cypher | 07.18.21
Sometimes… cool things happen. Facebook AI released a new chatbot this Friday with remarkable features. This chatbot, BlenderBot 2.0, improves on their bot from last year: it has better long-term memory and can search the internet for information during a conversation! This is a convenient improvement over traditional bots, since information is no longer statically “memorized” but can instead be dynamic and stay up to date via the internet. 🤯
I recently tested the model, trying out the smaller 400M variant. Currently, there are two variants:
- BlenderBot 2.0 400M:
--model-file zoo:blenderbot2/blenderbot2_400M/model
- BlenderBot 2.0 2.7B:
--model-file zoo:blenderbot2/blenderbot2_3B/model
Getting started with ParlAI is straightforward. For this model, you need to specify two arguments: a model file and a search server (for the internet queries). FYI, I have yet to figure out the search server argument. I tried different approaches, such as specifying a normal search URL, e.g. ‘https://www.google.com/search?q=’, and even an API URL endpoint (the Bing API, in my case), but both were unsuccessful. The ParlAI documentation doesn’t give an example URL to use, so right now I’m still debugging this issue. You can still download and interact with the bot, except it won’t leverage the internet for queries and will instead rely on its memory. At least we are in prime position to get the model up and running once the search server issue gets resolved.
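To make the two arguments concrete, here is a minimal sketch that only assembles the launch command as a string. It assumes the `parlai interactive` entry point and that the search server flag is spelled `--search_server`; that flag name is my assumption, so check it against your ParlAI version. The helper `build_interactive_command` is hypothetical and not part of ParlAI.

```python
import shlex

# Model files for the two variants listed above.
MODEL_FILES = {
    "400M": "zoo:blenderbot2/blenderbot2_400M/model",
    "2.7B": "zoo:blenderbot2/blenderbot2_3B/model",
}


def build_interactive_command(variant, search_server=None):
    """Return the argv list for an interactive BlenderBot 2.0 session.

    Hypothetical helper. The `--search_server` flag name is an assumption;
    when no server is given the flag is omitted entirely.
    """
    cmd = ["parlai", "interactive", "--model-file", MODEL_FILES[variant]]
    if search_server:
        cmd += ["--search_server", search_server]
    return cmd


if __name__ == "__main__":
    # Print a shell-ready command for the 400M variant without a search server.
    print(shlex.join(build_interactive_command("400M")))
```

Leaving `search_server` unset mirrors the memory-only setup described above; once you have a working server address, pass it as the second argument.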
To test the 400M variant, you’ll need a minimum of 20–25GB of RAM for the model to fit in memory and a minimum of 6GB of GPU VRAM at inference time (my testing used a V100 🥶). You’ll need a virtual machine instance in the cloud to get it running, since you’ll most likely run into issues if you choose to use Colab (this is also confirmed in ParlAI’s GitHub issues).
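If you want a quick sanity check before downloading anything, the sketch below compares a machine against the numbers quoted above. The thresholds come straight from this post (using the lower 20GB bound); the function names are hypothetical, the RAM lookup relies on `os.sysconf` and so only works on Unix-like systems, and the VRAM figure is something you supply yourself.

```python
import os

MIN_RAM_GB = 20.0   # lower bound of the 20-25GB quoted above
MIN_VRAM_GB = 6.0   # minimum GPU memory quoted above


def total_ram_gb():
    """Return total physical RAM in GB, or None if the platform hides it."""
    try:
        page_size = os.sysconf("SC_PAGE_SIZE")
        page_count = os.sysconf("SC_PHYS_PAGES")
    except (AttributeError, ValueError, OSError):
        return None
    return page_size * page_count / 1024 ** 3


def can_run_400m(ram_gb, vram_gb):
    """True if both figures meet the minimums reported for the 400M variant."""
    return ram_gb >= MIN_RAM_GB and vram_gb >= MIN_VRAM_GB


if __name__ == "__main__":
    ram = total_ram_gb()
    if ram is None:
        print("Could not detect RAM on this platform.")
    else:
        # 16.0 is a placeholder VRAM value; replace it with your GPU's memory.
        print(f"RAM: {ram:.1f} GB, OK for 400M: {can_run_400m(ram, 16.0)}")
```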
Most bots thus far have been pretty boring, but I think this new internet search feature will make them a lot more useful to users. Using an AI model that queries the internet for information will most likely be the most scalable solution for open-ended dialogue in the near future.
…Talkin’ bout Search…