About
Education
University of Minnesota - Twin Cities
Major: Management Information Systems
Minors: Computer Science, Statistics, Business Analytics
Work Experience
Data Scientist | 2022.06 - Present | Coyote Logistics (UPS Company)
- Migrated on-prem model trainer to cloud by dockerizing and deploying pricing model trainer via Azure CI/CD pipeline for seamless deployment and easier back-testing in cloud environment. Integrated Airflow to trigger model trainer.
- Developed event-triggered ETL pipeline to ingest daily forecasts stored in CosmosDB into a cloud data lake and push latest forecast to on-prem SQL database using Pyspark, Databricks, Azure Synapse, Data Factory, and stored procedure.
- Developed multiple databricks ETL pipeline and multi-page streamlit dashboard to evaluate pricing model’s performance over time.
- Created an Airflow DAG for automated data quality tests with failure alerts.
Data Analyst Intern | 2021. 07 – 2021. 08 | Electronic Arts (EA)
- Developed a web application that finds similar bugs in the past using TF-IDF text mining technique – expected to reduce 40% in bug search time and 10% less duplicate bug tickets
- Extracted 15,000 bug history data and created a Power BI dashboard to evaluate software engineer’s bug fix performance based on four KPIs: number of bugs fixed, bug fix time taken, bug criticality, and bug fix fail rate.
- Presented my projects to 50+ stakeholders from multiple levels of technical expertise in Asia and Korea studio analytics meeting.
Data Analyst Intern | 2019. 05 - 2019. 06 | LINE Corporation
- Built and managed dashboards to support overall game health, user experience and provided insights into historical, current, and projected sales.
- Saved 30 minutes of writing manual daily reports by automating query execution in HiveSQL and Python
- Communicated daily with game planners for user analysis and improvements for a newly published game.
Data Scientist Intern | 2016. 05 - 2016. 08 | Coyote Logistics (UPS Company)
- Developed an end-to-end Carrier Recommendation web application integrated with a predictive model on cancelled loads by using Python, C# .NET, and ZeroMQ.
- Built a random forest model to predict cancelled loads that sales representatives can refer to before making a contract with carriers, resulting in expected savings of $140,000 per week.
- Streamlined server and client communication through ZeroMQ and handled > 3.5 million records of data.
BI Engineer Intern | 2015. 06 - 2015. 08 | Coyote Logistics (UPS Company)
- Developed a Self-Reporting Dashboard Service tool using Elasticsearch ELK Stack, which allowed non-technical users to create their own KPI dashboards – expected to reduce 20% of report requests to the BI team.
- Built a web application with end-to-end data pipelines using Elasticsearch ELK Stack and RabbitMQ
- Conducted interviews and surveys to find out the end users’ experience with the existing reporting service and reflected their opinions on the project.
Skills and Certification
Language : Python, R, SQL
DB: SQL Server, NoSQL (MongoDB, Elasticsearch)
Visualization: Tableau, Power BI
Misc. Tools: Elasticsearch (ELK Stack), MongoDB, Message Queues (RabbitMQ, ZeroMQ), API, Hadoop
Certificates : Microsoft Azure Data Engineer Associate, SQL Deverloper Certificate, Linux Professional Institute Certificate, Big Data Analytics Management Certificate
Contact Information
Email: jiehwany@gmail.com
Linkedin : https://linkedin.com/jiehwan_yang
Github : https://github.com/jiehwan94
Out of curiosity and for self-improvement, I like to experiment my hypotheses on seemingly petty and yet fascinating phenomena and challenge myself learning new technologies by blogging and building side projects using various tools. I am constantly evolving my interests to pursue a career in Data Science.
My current interests lie in the intersection of data science and logistics/transportation.