public bigdata

Kaggle 필사 관리 페이지 본문

데이터분석/Kaggle Study

Kaggle 필사 관리 페이지

public bigdata 2020. 2. 9. 17:21

커리큘럼 : 캐글 코리아 블로그

커리큘럼 참여 방법

  • 필사적으로 필사하세요
  • 커널의 A 부터 Z 까지 다 똑같이 따라 적기!
  • 똑같이 3번적고 다음 커널로 넘어가시면 됩니다.

Binary classification : Tabular data

1st level. Titanic: Machine Learning from Disaster

 

 

2nd level. Porto Seguro’s Safe Driver Prediction

3rd level. Home Credit Default Risk

Multi-class classification : Tabular data

1st level. Costa Rican Household Poverty Level Prediction

  • A Complete Introduction and Walkthrough
  • 3250feats->532 feats using shap[LB: 0.436]
  • XGBoost

Binary classification : Image classification

1st level. Statoil/C-CORE Iceberg Classifier Challenge

  • Keras Model for Beginners (0.210 on LB)+EDA+R&D
  • Transfer Learning with VGG-16 CNN+AUG LB 0.1712
  • Submarineering.EVEN BETTER PUBLIC SCORE until now.
  • Keras+TF LB 0.18

Multi-class classification : Image classification

1st level. TensorFlow Speech Recognition Challenge

  • Speech representation and data exploration
  • Light-Weight CNN LB 0.74
  • WavCeption V1: a 1-D Inception approach (LB 0.76)

Regression : Tabular data

1st level. New York City Taxi Trip Duration

  • Dynamics of New York city - Animation
  • EDA + Baseline Model
  • Beat the benchmark!

2nd level. Zillow Prize: Zillow’s Home Value Prediction (Zestimate)

  • Simple Exploration Notebook - Zillow Prize
  • Simple XGBoost Starter (~0.0655)
  • Zillow EDA On Missing Values & Multicollinearity
  • XGBoost, LightGBM, and OLS and NN

Object segmentation : Deep learning

1st level. 2018 Data Science Bowl

  • Teaching notebook for total imaging newbies
  • Keras U-Net starter - LB 0.277
  • Nuclei Overview to Submission

Natural language processing : classification, regression

1st level. Spooky Author Identification

  • Spooky NLP and Topic Modelling tutorial
  • Approaching (Almost) Any NLP Problem on Kaggle
  • Simple Feature Engg Notebook - Spooky Author

2nd level. Mercari Price Suggestion Challenge

  • Mercari Interactive EDA + Topic Modelling
  • A simple nn solution with Keras (~0.48611 PL)
  • Ridge (LB 0.41943)
  • LGB and FM [18th Place - 0.40604]

3rd level. Toxic Comment Classification Challenge

  • [For Beginners] Tackling Toxic Using Keras
  • Stop the S@#$ - Toxic Comments EDA
  • Logistic regression with words and char n-grams
  • Classifying multi-label comments (0.9741 lb)

Other dataset : anomaly detection, visualization

1st level. Credit Card Fraud Detection

  • In depth skewed data classif. (93% recall acc now)
  • Anomaly Detection - Credit Card Fraud Analysis
  • Semi-Supervised Anomaly Detection Survey

2nd level. Kaggle Machine Learning & Data Science Survey 2017

  • Novice to Grandmaster
  • What do Kagglers say about Data Science ?
  • PLOTLY TUTORIAL - 1