EMA: Brazilian Cultural Heritage Image Dataset - Towards AI-based metadata annotation of digital collections

Abstract

Metadata annotation in digital collections is typically conducted by several specialized professionals, configuring a complex, labor-intensive, and time-consuming activity, leading to human failure, high costs, and problems in retrieving information accordingly. Recent advances in artificial intelligence, particularly Deep Learning techniques, have shown their potential in performing visual recognition and interpretation of objects on images. In this context, the present work introduces EMA, a Brazilian cultural heritage image dataset with over 11,000 labeled images of objects from seventeen Brazilian museums. EMA dataset is a contribution towards the development of automated metadata annotation tools. The paper also presents baseline ResNet50 results for the dataset, resulting in an over 86% recognition rate.

Details

Creators
Vagner de Oliveira; Dalton Martins; Paula Costa
Institutions
University of Campinas
Date
Keywords
thesaurus; automatic annotation; machine learning

Publication Type
short paper
License
CC-BY 4.0 International
Download
(unknown) bytes
Video Stream
here
Collaborative Notes
here

View This Publication