Close this
Close this

Amber Pande

Development
Madhya Pradesh, India

Skills

Data Engineering

About

Amber Pande's skills align with System Developers and Analysts (Information and Communication Technology). Amber also has skills associated with Database Specialists (Information and Communication Technology). Amber Pande appears to be a low-to-mid level candidate, with 3 years of experience.
View more

Work Experience

Data Engineer

Data.ai
August 2023 - Present
  • Remote • Created and managed ETL pipelines utilizing Python, PostgreSQL, Snowflake, PySpark and AWS tools. • Amplified data processing speed by 90% and data quality by 70% through a redesigned Apache Spark workflow.

Data Engineering Associate

Accenture
January 2022 - August 2023
  • Remote • Engineered ETL pipelines using Python, PostgreSQL, Apache Kafka and AWS (Cloudwatch, S3, Lambda, Events), facilitating data ingestion from XML, JSON, and CSV data formats into a centralised data lake. • Undertook seamless Python-scripted data migration from on-premise MS SQL server to AWS RDS PostgreSQL cloud instance. • Devised an automated Python emailing tool, integrated with data scraping pipelines, distributing updated database content hourly. • Reduced AWS RDS storage cost by 65% by developing a pipeline to compress historical time-series data from AWS RDS PostgreSQL database to Parquet file format and load it in AWS S3 data lake without data loss. • Utilised technologies such as Python, Pandas, PostgreSQL, Apache Kafka, AWS Lambda, AWS CloudWatch, AWS S3 and Git. for processing time-series financial data.

Data Science Intern

Netlink Software Group America Inc
March 2021 - January 2022
  • Bhopal, MP • Developed a face recognition system to log attendance, incorportaing Python, OpenCV, Tensorflow, and Keras. • Conceived efficient face detection, feature extraction and data cleaning algorithms, enhancing model predictive accuracy. • System tested in real-world environment, achieving over 95% accuracy, reducing attendance recording time to 1 ms. • Utilised technologies such as Django, Python, OpenCV, TensorFlow, SQLite, Git. Projects

Iris Data Analytics
January 2021 - December 2021
  • | Python, scikit-learn, Pandas, Git 01/2021 - 01/2021 • Conducted exploratory data analysis(EDA) on the Iris dataset, using pandas to read the data and analyze the structure and content of the dataset. • Preprocessed the data by encoding the categorical variable "Species" using LabelEncoder from scikit-learn library. • Built a Logistic Regression model to predict the species of Iris flowers using the independent variables sepal length, sepal width, petal length, and petal width. • View Project

Handwritten Digit Classifier
June 2020 - July 2020
  • | Python, Kotlin, TensorFlow, Keras, Git 06/2020 - 07/2020 • Developed an Android application to use a deep learning model to recognize hand-drawn digits between 0-9 • Used Keras API to build a TensorFlow model to classify the digits images and achieved 99.4% accuracy score on test data. • View Project

Education

Senior Secondary

CBSE

Patel College of Science and Technology

Bachelor of Technology in Computer Science