EMA: Brazilian Cultural Heritage Image Dataset - Towards AI-based metadata annotation of digital collections

EMA: Brazilian Cultural Heritage Image Dataset - Towards AI-based metadata annotation of digital collections

Abstract

Metadata annotation in digital collections is typically conducted by several specialized professionals, configuring a complex, labor-intensive, and time-consuming activity, leading to human failure, high costs, and problems in retrieving information accordingly. Recent advances in artificial intelligence, particularly Deep Learning techniques, have shown their potential in performing visual recognition and interpretation of objects on images. In this context, the present work introduces EMA, a Brazilian cultural heritage image dataset with over 11,000 labeled images of objects from seventeen Brazilian museums. EMA dataset is a contribution towards the development of automated metadata annotation tools. The paper also presents baseline ResNet50 results for the dataset, resulting in an over 86% recognition rate.

Details

Creators: Vagner de Oliveira; Dalton Martins; Paula Costa
Institutions: University of Campinas
Date
Keywords: thesaurus; automatic annotation; machine learning
Publication Type: short paper
License: CC-BY 4.0 International
Download: (unknown) bytes
Video Stream: here
Collaborative Notes: here

View This Publication