Hello, I'm

João Manoel Herrera
Pinheiro

Senior Data Engineer & PhD Candidate

0 Citations
0 Publications
João Manoel Herrera Pinheiro
SP, Brazil

About Me

Senior Data Engineer and Data Scientist with over 5 years of experience in the banking and technology sectors, including significant tenures at major institutions in LATAM like Itaú Unibanco and Cielo. I specialize in architecting scalable machine learning systems and high-performance data engineering pipelines within cloud-native environments. My professional background is defined by a proven track record of delivering high-impact data products, optimizing large-scale infrastructure, and deploying production-ready ML models.

Also serves as a lecturer in artificial intelligence, data science and computer vision. Holds an M.Sc. in Engineering and an MBA in Software Engineering, with publications in peer-reviewed journals, including IEEE and Nature. Currently pursuing a PhD in Electrical Engineering, with research focused on computer vision, machine learning, and image processing.

T

Research Interests

Computer Vision Deep Learning Machine Learning Image Preprocessing Robotics

Professional Experience

Senior Data Engineer

Cielo S.A Jan 2025 — Present

Led Data Engineering and Architecture for Customer Service, owning a large-scale AWS and Databricks platform supporting IVR, WhatsApp, and ML-driven analytics.

Data Engineer & Data Scientist

Itaú Unibanco S.A Jun 2021 — Jan 2026

Led migration of a high-volume financial data platform from Teradata to AWS (EMR, Glue, Lambda, Step Functions), reducing processing time by 78% and saving $3.2M annually.

Developed ML models for CRM and financial product recommendation at 100M+ scale, increasing conversion by 250% and generating $9.3M/year.

Researcher

USP Jul 2023 — Present

Leading research on projects with a budget exceeding R$48 million acting as Specialist in Machine Learning, Computer Vision and Cloud Computing.

Education

Master of Business Administration — MBA, Specialization in Software Engineering

University of Sao Paulo (USP)

Oct 2024 — June 2026

GPA 4.0

Specialization in Education

UNIVESP

July 2024 — July 2026

Bachelor of Engineering — BE in Mechatronics, Robotics

University of Sao Paulo (USP)

Feb 2016 — Dec 2022

Selected Publications

Google Scholar · ORCID — 77+ citations · h-index: 4 · i10-index: 1

2026

Automated identification of Ichneumonoidea wasps via YOLO-based deep learning: Integrating HiresCam for Explainable AI

Joao Manoel Herrera Pinheiro, Gabriela Do Nascimento Herrera, Alvaro Doria Dos Santos, Luciana Bueno Dos Reis Fernandes, Ricardo V Godoy, Eduardo AB Almeida, Helena Carolina Onody, Marcelo Andrade Da Costa Vieira, Angelica Maria Penteado-Dias, Marcelo Becker

ArXiv preprint

2026

Descriptor: Parasitoid Wasps and Associated Hymenoptera Dataset (DAPWH)

Joao Manoel Herrera Pinheiro, Gabriela Do Nascimento Herrera, Luciana Bueno Dos Reis Fernandes, Alvaro Doria Dos Santos, Ricardo V Godoy, Eduardo AB Almeida, Helena Carolina Onody, Marcelo Andrade Da Costa Vieira, Angelica Maria Penteado-Dias, Marcelo Becker

IEEE Data Descriptions

0
2026

The impact of feature scaling in machine learning: Effects on regression and classification tasks

João Manoel Herrera Pinheiro, Suzana Vilas Boas de Oliveira, Thiago Henrique Segreto Silva, Pedro Antonio Rabelo Saraiva, Enzo Ferreira de Souza, Ricardo V Godoy, Leonardo André Ambrosio, Marcelo Becker

Nature Scientific Data

5
2025

The impact of feature scaling in machine learning: Effects on regression and classification tasks

João Manoel Herrera Pinheiro, Suzana Vilas Boas de Oliveira, Thiago Henrique Segreto Silva, Pedro Antonio Rabelo Saraiva, Enzo Ferreira de Souza, Ricardo V Godoy, Leonardo André Ambrosio, Marcelo Becker

IEEE Access

60

Technical Skills

Data Science & ML

Python
TensorFlow & PyTorch
scikit-learn & NumPy
OpenCV & TorchVision
Pandas, SciPy & Optuna

Data Engineering

PySpark & Apache Spark
SQL (Presto & Hive)
ETL & Data Pipelines
Data Lake & DWH Architectures
Scala & Data Products

Cloud & Infrastructure

AWS (EMR, Glue, Lambda)
Databricks & Snowflake
DevOps (Docker, CI/CD, Git)
Terraform & Linux
LaTeX & Markdown

Research Domains & Architectures

Artificial Intelligence Deep Learning Computer Vision Image Processing Object Detection (YOLO) Semantic Segmentation (U-NET) Image Classification Convolutional Neural Networks (CNNs) Vision Transformers (ViT) Transfer Learning & Fine-tuning Mask R-CNN ResNet & ConvNeXT EfficientNet Biomedical Engineering Robotics Statistics

Projects

NA

2001 Engenharia

2001

Get in Touch

Open to international opportunities and research collaborations.