Category: Database, Data

Pandas is a third-party library for numerical computing based on NumPy.

NumPy is a third-party library for numerical computing, optimized for working with single- and multi-dimensional arrays.

In order to work with data we need to create coherent data structures to store it, or read them from an external source.

We can specify the index this way: We can create Dataframe from different ways but three of the most used are: If we might like to name each row with a non numeric index, we might want to specify it in this attribute.

In order to read an external file in CSV format we can do it calling to read_csv method inside pandas: We can specify with index attribute which one of the columns we want to be the row name.

Related Articles