oh_long_johnson

Pedro Mendes

https://github.com/phrmendes
https://www.linkedin.com/in/pedrohrmendes
📧 pedrohrmendes@proton.me

👤 About Me

I am a Data Engineer with expertise in designing and scaling high-performance data infrastructure and BI platforms. My experience spans large-scale orchestration, event-driven and microservices architectures, Kubernetes management, backend development (Python, Go), and distributed data processing. I specialize in building scalable ETL/ELT frameworks and data pipelines, implementing DevOps practices for data infrastructure, and driving initiatives around data quality, observability, and governance. I have hands-on experience with modern data stacks and data warehouses (BigQuery, dbt, Prefect/Airflow), data modeling and normalization, REST API design, comprehensive testing practices, and AI applications including LLMs and forecasting models. I thrive in collaborative, cross-functional environments, and my passion lies in building reliable, scalable data systems that enable data-driven decision-making across organizations.

💼 Work Experience

Data Engineer

I manage Kubernetes clusters and Prefect orchestration for ETL/ELT workloads that process hundreds of GB daily for Rio de Janeiro’s public data lake. I develop microservices in Python and Go with comprehensive testing practices, manage CI/CD pipelines via GitHub Actions and ArgoCD, and implement observability solutions using OpenTelemetry. I built chatbots and AI agents using LLMs to automate internal processes across the city’s public administration. I also manage the full infrastructure stack using IaC tools (Terraform, Ansible), including HA databases (MongoDB, PostgreSQL), Kubernetes clusters, and on-premises VMs, while supporting teams with deployment debugging and troubleshooting.

DevOps / Site Reliability Engineer

I managed Kubernetes clusters and CI/CD pipelines via Azure DevOps and ArgoCD, and implemented observability solutions using New Relic. I developed microservices in Python and managed the full infrastructure stack using IaC tools (Terraform, Ansible, Helm), including Kubernetes clusters and cloud infrastructure. I defined and monitored SLAs, SLOs, and SLIs, managed error budgets, and supported teams with deployment debugging and troubleshooting to ensure system reliability and performance.

Data Analyst

My main work was extracting and processing data on Brazil's private health sector. I built ETL pipelines in R and Python to process large datasets and used exploratory data analysis to produce reports and dashboards for public use. I also managed PostgreSQL databases and worked with in-process analytical tools such as DuckDB and Apache Arrow. In addition, I forecast relevant indicators using time series econometric models (mainly VAR-related models).

🛠️ Skills

Skill | Level | Tools
Python | Advanced | pandas, scikit-learn, Django, FastAPI, PyTorch, Streamlit, dbt
Infrastructure as Code | Advanced | Ansible, Terraform
Orchestration and Data Engineering | Advanced | Prefect, Apache Airflow, Airbyte
Continuous Deployment | Advanced | ArgoCD, FluxCD
Containers | Advanced | Docker, Kubernetes
Continuous Integration | Advanced | GitHub Actions, GitLab CI
Google Cloud | Advanced | BigQuery, Compute Engine, GKE
Observability | Advanced | Prometheus, Grafana, OpenTelemetry
AWS | Intermediate | Lambda, EC2, RDS
Azure | Intermediate | CosmosDB, Azure DevOps, Azure Functions
Go | Intermediate |
JS/TS | Intermediate | Node.js
Databases | Intermediate | PostgreSQL, MariaDB, MongoDB
Caching & Messaging | Intermediate | Redis, RabbitMQ, Kafka
Distributed Processing | Intermediate | Apache Spark

🎓 Education

Federal University of ABC

BA degree in Economics

BA degree in Sciences and Humanities (Basic cycle for Economics)

Fatec São Caetano do Sul

Information Security

🌐 Language Proficiency

Language | Level
Portuguese | Native
English | Professional working proficiency
Spanish | Professional working proficiency
