Data Pre-processing using Scikit-learn

Data pre-processing is one technique of data mining using that you can convert your raw data into an understandable format. In his practical, we will take one dataset and performing the following task.

  1. Data Encoding
  2. Normalization
  3. Standardization
  4. Imputing the Missing Values
  5. Discretization
  • Rescaling Data
  • Standardizing Data
  • Normalizing Data
  • Binarizing Data
  • Mean Removal
  • One Hot Encoding
  • Label Encoding
  • Describing the dataset
  • Shape of the dataset
  • Extracting data from the dataset
  • Performing operations around a variable




Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

APS Failure Detection in Scania Trucks

Autonomous car with Reinforcement Learning — part 2: track following

Auto-Correction and Suggestion using LSTM based Char2Vec Model

Productionizing Object Detection Models with Dash Enterprise

Breast Cancer Classification with Deep learning

Bosch Traffic Sign Recognition: Silver Medal Approach

Summary of Intro to NLP Training

What’s better than a GAN? Two GANs!

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store


More from Medium

Soybean Crop Yield Prediction with ML Regression Techniques— Part 1: Tabular data

End-to-End Light-Weight Machine learning model deployment (Using Statistics, Python, Streamlit…

Comprehensive Guide to Decision Tree Learning for Classification

Recognizing Handwritten Digits with scikit-learn