CV
Professional Experience
Cielo, Remote — Senior Data Scientist
01/2025 - Current
Led the development of a cloud data infrastructure on AWS and Databricks, using Git to automate deployments and achieving a 67% improvement in data processing speed.
Reduced AWS costs by 75% through monitoring and optimization, saving R$1.3M annually.
Provided technical leadership in data productization, improving operational KPIs by over 45%.
Improved ETL process documentation in AWS by 50% through the use of Git for version control.
Implemented machine learning models to optimize the customer service journey, increasing KPIs by 45%.
Utilized AWS Athena to analyze customer data, which contributed to a 10-point increase in NPS.
Developed PySpark data pipelines and data quality solutions, improving daily indicators by 120%.
Promoted best practices in data engineering and machine learning by leading internal technical training.
Itau Unibanco, Remote — Data Engineering
04/2023 - 01/2025
Led the migration of a legacy system to a modern AWS data platform, engineering an infrastructure with EMR, Lambda, Glue, and Step Functions and implementing CI/CD practices that resulted in a 65% reduction in processing time and annual savings of R$15.7M.
Automated manual table loads using Glue, Step Functions, and Lambda, reducing load times by 95%.
Integrated data environments into the public cloud using Linux and SQL, cutting legacy table usage by 93%.
Enhanced data quality with scalable solutions using Glue and PySpark, improving integrity and control by 375%.
Developed and maintained ETL pipelines for 270+ tables using Glue, EMR, Lambda and Step Functions.
Acted as a technical reference for data architecture design, contributing to the development of a strong data-driven culture within the bank.
Itau Unibanco, Remote — Data Scientist
06/2021 - 04/2023
Increased product conversion by 250% through classification models, generating R$50M in annual revenue.
Implemented clustering techniques, resulting in a 20% improvement in campaign personalization.
Performed bias-variance analysis, model explainability, and overfitting mitigation for credit models.
Developed and maintained ETL pipelines for 15+ machine learning models using EMR, Glue, and SageMaker.
Automated data and machine learning pipelines using Python and PySpark, reducing operational effort by 75%.
Education
University of São Paulo, Machine Learning, Computer Vision - MSc, 2026
University of São Paulo, Software Engineering - MBA, 2026
Virtual University of the State of São Paulo, Education - Specialization, 2026
University of São Paulo, Mechatronics Engineering - BE, 2022