YorkSpace
    • English
    • français
  • English 
    • English
    • français
  • Login
View Item 
  •   YorkSpace Home
  • York University Libraries
  • YUL research and professional contributions
  • View Item
  •   YorkSpace Home
  • York University Libraries
  • YUL research and professional contributions
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit

Thumbnail
View/Open
Main article (10.47Mb)
JCDL Poster (25.56Mb)
Date
2019
Author
Yang, Hsiu-Wei
Liu, Linqing
Milligan, Ian
Ruest, Nick
Lin, Jimmy


Metadata
Show full item record
Abstract
We demonstrate the integration of the Archives Unleashed Toolkit, a scalable platform for exploring web archives, with Google's TensorFlow deep learning toolkit to provide scholars with content-based image analysis capabilities. By applying pretrained deep neural networks for object detection, we are able to extract images of common objects from a 4TB web archive of GeoCities, which we then compile into browsable collages. This case study illustrates the types of interesting analyses enabled by combining big data and deep learning capabilities.
Citation
Hsiu-Wei Yang, Linqing Liu, Ian Milligan, Nick Ruest, and Jimmy Lin. “Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019).
Hsiu-Wei Yang, Linqing Liu, Ian Milligan, Nick Ruest, and Jimmy Lin. “Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019).
URI
https://yorkspace-new.library.yorku.ca/xmlui/handle/10315/36161
https://doi.org/10.1109/JCDL.2019.00107
Collections
  • YUL research and professional contributions

All items in the YorkSpace institutional repository are protected by copyright, with all rights reserved except where explicitly noted.

YorkU LogoContact Us | Send Feedback
link to sitemap

 

Browse

All of YorkSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister

Statistics

View Usage Statistics

All items in the YorkSpace institutional repository are protected by copyright, with all rights reserved except where explicitly noted.

YorkU LogoContact Us | Send Feedback
link to sitemap