Hi There! Welcome to Data 101.
I'll be using Google's Colab and Python in the article.
To learn how to read files, refer to the previous article in this series :
In Data Exploration, we look into finding the number of instances of records, number of columns and rows, the columns and data types in each column...
I will be using pandas library throughout.
To import the data:
import pandas as pd airlines = pd.read_csv('airlines.csv') airports = pd.read_csv('airports.csv')
Viewing First and Last few rows
To view the first 5 rows on the data set, we use
.head() from pandas.
You can use
.head(x) to get x number of rows from the top
To view the last 5 rows on the data set, we use
.tail() from pandas.
Viewing Rows and Columns
To know the number of columns and rows in your data frame, use
.shape. It returns the dimensions of your data set:
To view the columns in the data frame, use
Viewing Data Types
To view datatypes of each data in each column, use
Other tricks you could use are:
x.rows x.info() x.describe() x.unique() x.describe()