Share Your Science: Training a Machine to Answer Questions About Images

Aishwarya Agrawal, a PhD student at Virginia Tech, shares how her team is using NVIDIA GPUs and deep learning to automatically answer a wide range of questions about arbitrary images.

According to Agrawal and her collaborators, the system may one day help visually impaired users navigate real-world environments, for example by telling them when it is safe to cross the street.

To learn more, try the online demo or read their research paper “VQA: Visual Question Answering”.

Share your GPU-accelerated science with us at http://nvda.ly/Vpjxr and with the world on #ShareYourScience.

Watch more scientists and researchers share how accelerated computing is benefiting their work at http://nvda.ly/X7WpH

 

  • Barry Myers

    The “VQA: Visual Question Answering” link above is broken.

    ERROR

    The requested URL could not be retrieved

    The following error was encountered while trying to retrieve the URL: http://arxiv.org/pdf/1505.00468v6.pdf

    Unable to determine IP address from host name arxiv.org

    The DNS server returned:

    Server Failure: The name server was unable to process this query.

    This means that the cache was not able to resolve the hostname presented in the URL. Check that the address is correct.

    Your cache administrator is webmaster.