
KYOOSIK KIM


Data Science - Machine Learning


   

PROJECT


The projects below are listed from newest to oldest. I recommend starting with the recent ones, as they best represent my current skills. Browsing my earlier work, however, may help you see how quickly I learned data science techniques. All of my projects, including these, can be found on my GitHub. You can also view my data visualizations in my Tableau Gallery.

Mar 2019

Conducted text analysis (POS tagging, sentiment analysis, and NMF) on 500k+ Amazon food reviews using spaCy and NLTK, and built a grade prediction model using the Keras Sequential and functional APIs

See Details

Dec 2018

Accessed IBRD loan data via API, examined its features, and built a predictive model to classify loan cancellations

See Details

Dec 2018

Queried HMDA data, analyzed how a range of features relate to mortgage approval, and predicted approval classes using ML models with RFE, PCA, and ensemble methods

See Details

Nov 2018

Analyzed D.C. house prices and selected features for a multiple linear regression (MLR) model, using adjusted R-squared, AIC, and DFBETAS to check that the model meets its underlying assumptions

See Details

Oct 2018

Evaluated multiple machine learning models for rain vs. dry prediction using accuracy, precision, recall, and ROC-AUC

See Details

Aug 2018

Merged two datasets to investigate the relationship between heart disease mortality and farmers' markets using fundamental analytical techniques

See Details

Jul 2018

Used geographical libraries to visualize New York City MTA popularity

See Details

PUBLICATION


JAN 2019: Ridge Regression for Better Usage on Towards Data Science

APR 2019: Decision Tree for Better Usage on Towards Data Science

SKILL SET


Data Science
  • Python (numpy, pandas, matplotlib, seaborn)
  • R (dplyr, tidyr, ggplot2, car, caret, ggmap)
  • SAS
Machine Learning/Deep Learning
  • Scikit-Learn
  • Keras
Database
  • SQL
  • MongoDB
Others
  • Excel, Git, AWS, Hadoop
Visualization
  • Tableau
  • Plotly

Natural Language Processing
  • spaCy
  • NLTK
Programming
  • Java
  • C++

EXPERIENCE


Arkansas State University - USA

Research Assistant at Statistics and Biology Department (Current)
  • Helped with R programming for statistics and biology joint projects
Master's Degree in Computer Science & Graduate Certificate in Data Science
  • Machine Learning, Data Science, Data Mining, Statistical Methods, Data Analysis (GPA 3.9)
  • Selected as an honor student for exceptional academic performance

Government Employee Pension Service - South Korea

Operations Analyst at Financial Investment Division
  • Led a business process reengineering project with Deloitte Consulting that reduced the estimated cost of developing a new financial system from over $10 million to $5 million
  • Analyzed the business processes of internal users, such as traders and analysts, in collaboration with the IT department
  • Queried and reported business performance data regularly and on request
  • Managed quantitative data supplies from companies including Bloomberg
Intern at Management & Administration Department
  • Assisted with maintaining and reporting accounting data

Other experiences
  • Sergeant in the US Army in South Korea: Managed administration at the company headquarters

CONTACT


I am currently looking for a team where I can contribute my data science skills. While I am interested in finance and economics, I am open to working with any type of data. If you would like to know more about me, you are welcome to contact me. Email is the fastest way to reach me.