Muhammad Mubashar Siddique
Development
Islamabad Capital Territory, Pakistan
Skills
Data Science
About
Muhammad Mubashar Siddique's skills align with IT R&D Professionals (Information and Communication Technology) and with System Developers and Analysts (Information and Communication Technology). He has 6 years of work experience.
Work Experience
Research Officer as Data Engineer & AI Engineer
January 2022 - Present
- Country: Pakistan
1. Designing and implementing efficient data pipelines.
2. Developing and maintaining ETL workflows using Python.
3. Integrating data sources such as Kafka and Flume with Spark Streaming.
4. Developing custom code and scripts to process and transform data as it streams through the pipeline.
5. Architecting, building, and implementing cloud data architecture on AWS, Azure, and Google Cloud to process, store, and analyze large volumes of data.
6. Ensuring data quality and consistency across all data sources by developing and implementing data validation and cleansing processes.
7. Continuously monitoring and optimizing data pipelines for performance, reliability, and scalability.
8. Fine-tuned an LLM with C-RLFT, using OpenChat, on a mixed-quality dataset, giving appropriate weight to the higher-quality data.
9. Deployed the LLM with Hugging Face Text Generation Inference (TGI) for token streaming and scaled it using Kubernetes (k8s).
10. Working with ML and DL models.
Research Associate as NLP & ML Engineer
National Center for Cyber Security (NCCS)
January 2021 - May 2022
- Country: Pakistan
• Audio multi-anomaly detection project.
• Mobile forensics using Natural Language Processing.
• Extracted information from mobile application databases using NLP.
• Worked on data cleaning, data integration, ML models, and DL models to build applications in Python.
• Built a forensic AI application to investigate cybercrime committed through mobile devices.
• Retrained previously deployed models to counter data drift and achieved 99.97% precision and 97.89% recall through feature engineering and by acquiring more data with text summarization and document similarity.
• Developed document similarity and text summarization using GloVe, word2vec, LDA, and cosine similarity to extract and filter relevant data from the entire dataset.
• Acquired data from various sources and merged it into a central data repository using SQL, performed various analyses, and presented findings to relevant stakeholders.
Data Scientist & Analyst
PIN-UP GLOBAL
January 2020 - February 2021
• Responsible for assisting with data collection, ingestion, and wrangling.
• Conducting exploratory analysis to extract insights and uncover important groups, trends, and relationships in data.
• Developing functional prototypes of data products incorporating analytics, machine learning, and deep learning.
• Extracting data from databases, then pre-processing it and applying models.
Jr. Python Developer
Visit a bit technologies Inc
January 2019 - May 2020
- Country: Pakistan
• Utilized Python with the Django and Flask frameworks to design server applications.
• Performed research to explore and identify new technological platforms.
• Worked with PostgreSQL and MSSQL databases.
• Created predictive models for AI, ML, and DL-based features.
Education