Walter Gonzalez
Development
San Salvador, El Salvador
Skills
Data Engineering
About
The candidate's skills align with Database Specialists (Information and Communication Technology). The candidate also has skills associated with System Developers and Analysts (Information and Communication Technology). The candidate has 9 years of work experience.
View more
Work Experience
Data Engineer
Applaudo Studios (Walmart)
August 2020 - Present
- Creating data integration processes for legacy mainframe files, transferring them to Azure ADLS and Google Cloud Storage (Data Lake) using Spark and Scala. Utilizing a custom ETL tool based on apache Airflow for scheduling Spark application jobs. Optimizing and tuning the performance of ETL pipelines with Scala/Spark. Creating tables and views in Google BigQuery using GCS as a source. Developing complex queries and stored procedures in BigQuery, adhering to best practices for handling large datasets and achieving optimal performance. Creating CI/CD pipelines to build and deploy ETL processes automatically using Concord. Extracting and transforming data from Zoho Rest API to ingest the data into Google Cloud Storage and Bigquery using Google Cloud Functions with python and pandas dataframes Conducting data profiling and data discovery to identify outliers in the datasets.
Google BigQuery
January 2024 - January 2024
Spark Performance Tuning
August 2023 - August 2023
Scala Language
July 2021 - July 2021
- Professional-Lightbend
Apache Spark with Scala
September 2020 - September 2020
ETL Developer
Telus International
July 2016 - August 2020
- Using Apache Airflow for the creation of ETL pipelines to extract data from different sources like databases, plain files, XML files, JSON, etc., to the data lake in the Google Cloud Platform. Creating integration processes to facilitate communication between the client's data sources (FTP, CMS system, files, Google Spreadsheet, etc.) and their various platforms for the purpose of creating BI dashboards. Using SQL (Oracle and SQL Server) and Google BigQuery to load data from any kind of source. Using the ETL tool Talend and custom Java code to create data integration processes to ingest data into the data lake (Google Cloud Storage). Creation of datamarts using Kimball dimensional modeling approach for different business areas for the creation of BI dashboards. Creating processes with PySpark, Dataproc, and Apache Airflow to migrate data between SQL Server database and Google BigQuery.
Talend Data Integration
November 2019 - November 2019
- Udemy
Sense Administration & Developer
Qlik
November 2018 - November 2018
- Qlik
Google BigQuery
August 2017 - August 2017
- Data to Insights -Coursera
PlSql Developer
Tigo El Salvador
July 2015 - July 2016
- Creation of pipeline processes (procedures, functions) in Oracle PL/SQL to load the data from different platforms like AS400, Oracle EBS, as well as delimited format files to the tables of ODS, and then to the data warehouse tables. Creation of Excel reports connected to the data warehouse tables for various departments of the business, such as finance and treasury, to provide them with the ability to make informed decisions. Creating complex SQL statements using PL/SQL to efficiently ingest data into the ODS insta Courses 2017-07 SCRUM Fundamentals Certification Scrum Study