alternative
  • Home (current)
  • About
  • Tutorial
    Technologies
    C#
    Deep Learning
    Statistics for AIML
    Natural Language Processing
    Machine Learning
    SQL -Structured Query Language
    Python
    Ethical Hacking
    Placement Preparation
    Quantitative Aptitude
    View All Tutorial
  • Quiz
    C#
    SQL -Structured Query Language
    Quantitative Aptitude
    Java
    View All Quiz Course
  • Q & A
    C#
    Quantitative Aptitude
    Java
    View All Q & A course
  • Programs
  • Articles
    Identity And Access Management
    Artificial Intelligence & Machine Learning Project
    How to publish your local website on github pages with a custom domain name?
    How to download and install Xampp on Window Operating System ?
    How To Download And Install MySql Workbench
    How to install Pycharm ?
    How to install Python ?
    How to download and install Visual Studio IDE taking an example of C# (C Sharp)
    View All Post
  • Tools
    Program Compiler
    Sql Compiler
    Replace Multiple Text
    Meta Data From Multiple Url
  • Contact
  • User
    Login
    Register

Machine Learning - Machine Learning Development Life Cycle - Data Gathering Tutorial

  • Working with CSV file
  • Working with JSON/SQL (JavaScript Object Notation/ Structured Query Language)
  • Fetch API
  • Web scraping

 

  • Working with CSV file

 

Import pandas

# Opening a local CSV file

df = pd.read_csv(‘file.csv’)

 

# Opening a CSV file from URL using the request module

# use sep=’\t’ to open files separated by tab i.e .tsv

# use names=[‘col1name’,’col2name’,….] to give column name to dataset

# To convert any unique numeric column to an index column, then write index_col=’emp_id’

# To use the first row as the header, then write header=1

# To filter out specific or particular columns from the whole dataset, then use usecols = [‘col1name’,’col2name’,….]

# use skip row to skip a particular row and nrows to show only n number of row

# By default encoding is utf-8. If the dataset is of another encoding, then use the Encoding parameter i.e encoding=’latin-1’, etc.

# Skip bad lines, ex- some row has 5 columns, and some have 6 columns, it will throw a parser error. So, in this case, use error_bad_lines = False, it will skip bad lines

# dtype parameter, to convert datatype use e.g dtype={‘target’:int}

# Handling Dates – to convert string to date use parse_dates =[‘col_name’]

# Convertors – use to apply transformation or function on a particular column

# na_values parameter – To make a particular value to nan. E.g  _ to NaN

# Loading a huge dataset in chunks, use chunk size



 

  • Working with JSON/SQL (JavaScript Object Notation/ Structured Query Language)

 

  • Fetch API
  • Web Scraping
Machine Learning

Machine Learning

  • Introduction
  • Overview
    • Type Of Machine Learning
    • Batch Vs Online Machine Learning
    • Instance Vs Model Based Learning
    • Challenges in Machine Learning
    • Machine Learning Development Life Cycle
  • Machine Learning Development Life Cycle
    • Framing the Problem
    • Data Gathering
    • Understanding your Data
    • Exploratory Data Analysis (EDA)
    • Feature Engineering
    • Principal Component Analysis
    • Column Transformer
    • Machine Learning Pipelines
    • Mathematical Transformation
    • Binning and Binarization | Discretization | Quantile Binning | KMeans Binning
  • Supervised Learning
    • Overview
    • Linear Regression [Regression]
    • Multiple Linear Regression
    • Polynomial Linear Regression [Regression]
    • Bias Variance Trade Off
    • Regularization
    • LOGISTIC REGRESSION [Regression & Classification]
    • Polynomial Logistic Regression
    • Support Vector Machines / Support Vector Regressor
    • Naïve Bayes Classifier [classification]
    • Decision Tree
    • Entropy
    • Information Gain
    • K Nearest Neighbor (KNN)
    • Neural Network (MultiLayer Perceptron)
  • Ensemble Learning
    • Introduction to Ensemble Learning
    • Basic Ensemble Techniques
    • Advanced Ensemble Techniques
    • Random Forest Classifier
    • Boosting
  • UnSupervised Learning
    • Overview
    • K Mean Clustering

About Fresherbell

Best learning portal that provides you great learning experience of various technologies with modern compilation tools and technique

Important Links

Don't hesitate to give us a call or send us a contact form message

Terms & Conditions
Privacy Policy
Contact Us

Social Media

© Untitled. All rights reserved. Demo Images: Unsplash. Design: HTML5 UP.

Toggle