Close this
Close this

Walter Gonzalez

Development
San Salvador, El Salvador

Skills

Data Engineering

About

The candidate's skills align with Database Specialists (Information and Communication Technology). The candidate also has skills associated with System Developers and Analysts (Information and Communication Technology). The candidate has 9 years of work experience.
View more

Work Experience

Data Engineer

Applaudo Studios (Walmart)
August 2020 - Present
  • Creating data integration processes for legacy mainframe files, transferring them to Azure ADLS and Google Cloud Storage (Data Lake) using Spark and Scala. Utilizing a custom ETL tool based on apache Airflow for scheduling Spark application jobs. Optimizing and tuning the performance of ETL pipelines with Scala/Spark. Creating tables and views in Google BigQuery using GCS as a source. Developing complex queries and stored procedures in BigQuery, adhering to best practices for handling large datasets and achieving optimal performance. Creating CI/CD pipelines to build and deploy ETL processes automatically using Concord. Extracting and transforming data from Zoho Rest API to ingest the data into Google Cloud Storage and Bigquery using Google Cloud Functions with python and pandas dataframes Conducting data profiling and data discovery to identify outliers in the datasets.

Google BigQuery
January 2024 - January 2024

Spark Performance Tuning
August 2023 - August 2023

Scala Language
July 2021 - July 2021
  • Professional-Lightbend

Apache Spark with Scala
September 2020 - September 2020

ETL Developer

Telus International
July 2016 - August 2020
  • Using Apache Airflow for the creation of ETL pipelines to extract data from different sources like databases, plain files, XML files, JSON, etc., to the data lake in the Google Cloud Platform. Creating integration processes to facilitate communication between the client's data sources (FTP, CMS system, files, Google Spreadsheet, etc.) and their various platforms for the purpose of creating BI dashboards. Using SQL (Oracle and SQL Server) and Google BigQuery to load data from any kind of source. Using the ETL tool Talend and custom Java code to create data integration processes to ingest data into the data lake (Google Cloud Storage). Creation of datamarts using Kimball dimensional modeling approach for different business areas for the creation of BI dashboards. Creating processes with PySpark, Dataproc, and Apache Airflow to migrate data between SQL Server database and Google BigQuery.

Talend Data Integration
November 2019 - November 2019
  • Udemy

Sense Administration & Developer

Qlik
November 2018 - November 2018
  • Qlik

Google BigQuery
August 2017 - August 2017
  • Data to Insights -Coursera

PlSql Developer

Tigo El Salvador
July 2015 - July 2016
  • Creation of pipeline processes (procedures, functions) in Oracle PL/SQL to load the data from different platforms like AS400, Oracle EBS, as well as delimited format files to the tables of ODS, and then to the data warehouse tables. Creation of Excel reports connected to the data warehouse tables for various departments of the business, such as finance and treasury, to provide them with the ability to make informed decisions. Creating complex SQL statements using PL/SQL to efficiently ingest data into the ODS insta Courses 2017-07 SCRUM Fundamentals Certification Scrum Study

Education

Universidad De El Salvador

Bachelor of Science