Heuristic once learning for image & text duality information processing

Li, Weigang; Martins, Luiz; Ferreira, Nikson; Miranda, Christian; Althoff, Lucas; Pessoa, Walner; Farias, Mylenè; Jacobi, Ricardo; Rincon, Mauricio

Use este identificador para citar ou linkar para este item: http://repositorio.unb.br/handle/10482/52392

Arquivos associados a este item:

Não existem arquivos associados a este item.

Título:	Heuristic once learning for image & text duality information processing
Autor(es):	Li, Weigang Martins, Luiz Ferreira, Nikson Miranda, Christian Althoff, Lucas Pessoa, Walner Farias, Mylenè Jacobi, Ricardo Rincon, Mauricio
ORCID:	https://orcid.org/0000-0003-1826-1850 https://orcid.org/0000-0003-0089-3905
Afiliação do autor:	University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science University of Brasilia, Department of Computer Science
Assunto:	Heurística Rede Neurais Convolucionais (CNNs) Visão computacional Aprendizagem profunda Imagem
Data de publicação:	Dez-2022
Editora:	IEEE
Referência:	WEIGANG, Li; MARTINS, Luiz; FERREIRA, Nikson; MIRANDA, Christian; ALTHOFF, Lucas; PESSOA, Walner; FARIAS, Mylenè; JACOBI, Ricardo; RINCON, Mauricio. Heuristic once learning for image & text duality information processing. In: IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, DIGITAL TWIN, PRIVACY COMPUTING, METAVERSE, AUTONOMOUS & TRUSTED VEHICLE – SMARTWORLD/UIC/SCALCOM/DIGITALTWIN/PRICOMP/META, 2022, Haikou. Proceedings [...]. Haikou: IEEE, 2022. p. 1353-1359. DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00195. Disponível em: https://ieeexplore.ieee.org/document/10189581. Acesso em: 6 ago. 2025.
Abstract:	Few-shot learning is an important mechanism to minimize the need for the labeling of large amounts of data and taking advantage of transfer learning. To identify image/text input with duality property, this research proposes a “Heuristic once learning (HOL)” mechanism to investigate multi-modal input processing similar to human-like behavior. First, we create an image/text data set of big Latin letters composed of small letters and another data set composed of Arabic, Chinese and Roman numerals. Secondly, we use Convolutional Neural Networks (CNN) for pre-training the dataset of letters to get structural features. Thirdly, using the acquired knowledge, a Self-organizing Map (SOM) and Contrastive Language-Image Pretraining (CLIP) are tested separately using zero-shot learning. Siamese Networks and Vision Transformer (ViT) are also tested using one-shot learning by knowledge transfer to identify the features of unknown characters. The research results show the potential and challenges to realize HOL and make a useful attempt for the development of general agents.
Unidade Acadêmica:	Instituto de Ciências Exatas (IE) Departamento de Ciência da Computação (IE CIC)
Programa de pós-graduação:	Programa de Pós-Graduação em Informática
DOI:	10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00195
Versão da editora:	https://ieeexplore.ieee.org/document/10189581
Aparece nas coleções:	Trabalhos apresentados em evento

Mostrar registro completo do item Visualizar estatísticas