Prototype for Document Retrieval for the Military Police of Paraná with Retrieval Augmented Generation and Gemini AI in a Dockerized Environment

Prototype for Document Retrieval for the Military Police of Paraná with Retrieval Augmented Generation and Gemini AI in a Dockerized Environment

Authors

  • Cleiton Giacomelli da Silva Polícia Militar do Paraná Author

DOI:

https://doi.org/10.51473/rcmos.v1i2.2024.736

Abstract

This paper presents the development of a prototype for document retrieval for the Military Police of Paraná (PMPR) using the Retrieval Augmented Generation (RAG) strategy and the Gemini AI Flash 1.5 language model. The prototype was implemented in a containerized environment with Docker, aiming to ensure portability and reproducibility. The RAG strategy combines traditional search with advanced language models to generate more accurate and complete answers to user queries. The prototype was tested with real questions and preliminary results demonstrate the system's ability to understand the questions and provide relevant answers, based on the information contained in the PMPR documents. The paper discusses the potential of the prototype to assist military police officers in accessing relevant information, overcoming the limitations of the current form of document retrieval in the institution.

Downloads

Download data is not yet available.

References

BAUMANN, P. et al. Large Language Models for Retrieval Augmented Generation: A Comprehensive Survey. arXiv preprint arXiv:2308.01186, 2023.

BOETTIGER, C. An introduction to Docker for reproducible research. ACM SIGOPS Operating Systems Review, v. 49, n. 1, p. 71-79, 2015.

BRASIL. Ministério da Justiça e Segurança Pública. Portaria nº 332, de 27 de abril de 2023. Dispõe sobre o Manual de Uso da Força Policial. Diário Oficial da União, Brasília, DF, 28 abr. 2023. Seção 1, p. 47.

LEWIS, P. et al. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, v. 33, p. 9459-9474, 2020.

N8N. n8n - Workflow Automation. Disponível em: https://n8n.io/. Acesso em: 06 nov. 2024.

QDRANT. Qdrant - Vector Database. Disponível em: https://qdrant.tech/. Acesso em: 06 nov. 2024.

Published

2024-11-12

How to Cite

GIACOMELLI DA SILVA, Cleiton. Prototype for Document Retrieval for the Military Police of Paraná with Retrieval Augmented Generation and Gemini AI in a Dockerized Environment: Prototype for Document Retrieval for the Military Police of Paraná with Retrieval Augmented Generation and Gemini AI in a Dockerized Environment. Multidisciplinary Scientific Journal The Knowledge, Brasil, v. 1, n. 2, 2024. DOI: 10.51473/rcmos.v1i2.2024.736. Disponível em: https://submissoesrevistacientificaosaber.com/index.php/rcmos/article/view/736.. Acesso em: 21 nov. 2024.