Introduction to Pandas. PPT

Public account: You and the cabin by: Peter Editor: Peter

Hello, I’m Peter

I recently condensed my introductory articles for Pandas into a simple POWERPOINT presentation. Pandas is a panda library. It is a panda library. It is a panda library.

Pandas’ two data structures
11 ways to create a DataFrame
DataFrame number fetching technique
Data processing cornerstone: Pandas Data Exploration
Pandas Data type
Pandas Focuses on the groupby, rank, and sort_values mechanisms
Pandas Handles the missing and repeated values
Pandas Merges data: Merge and concat
Axis rotation operations: unstack and stack
Pandas pivot table

It is very helpful to learn Pandas. At the end of the article, there are specific ways to obtain PPT

Content of the PPT

Two big data structures

Pandas has two data structures: Series and DataFrame.

Series data: consists of name, index, and values
DataFrame data: Can be viewed as multiple Series data

Pandas’ data processing is more often associated with dataframes. The following chart highlights 11 different ways to generate DataFrame type data.

After we get a copy of the data and read it as a DataFrame, we need to find the data we need. How do we view or find the data we want? The following is a variety of methods to take the number, which can be roughly divided into:

Expression fetch
Index/attribute fetch
Slice access
Use functions to take numbers and so on

With a piece of data, before processing, we must check some basic information of the data: data size, dimension, field type, missing value, etc., we call this work: data exploration

Pandas, Python native, and NUMPY are compared in the following:

The following pages describe the functions commonly used by Pandas: groupby, rank, sort_VALUES, DROP_DUPLICATED, merge, and concat. They are mainly used in data pretreatment, data combination, data deduplication, PivotTable production and other aspects