DataFrame (pandas)

Applies to: Python

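A pandas DataFrame is a labeled two-dimensional table: rows are identified by an index and columns by name. It is the workhorse of data analysis in Python. A single column is a Series. Filtering, grouping, and aggregation are expressed as operations on whole columns rather than as explicit Python loops.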

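import pandas as pd
# Load the data; flights.csv is expected to have "airline" and "delay" columns
df = pd.read_csv("flights.csv")
# Mean delay per airline, counting only flights that were late (delay > 0)
df[df["delay"] > 0].groupby("airline")["delay"].mean()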
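
The snippet above depends on a file on disk. As a minimal sketch that runs on its own, the same filter, group, and aggregate steps can be tried on a small in-memory table. The airline codes and delay values here are made up for illustration, and the variable names (delays, late, mean_delay) are not part of pandas.

```python
import pandas as pd

# Hypothetical data standing in for flights.csv
df = pd.DataFrame({
    "airline": ["AA", "AA", "UA", "UA", "DL"],
    "delay": [10, -5, 30, 20, 0],
})

delays = df["delay"]        # selecting one column yields a Series
late = df[df["delay"] > 0]  # boolean mask keeps only rows with delay > 0

# Group the remaining rows by airline and average the delay column
mean_delay = late.groupby("airline")["delay"].mean()
print(mean_delay)
```

With this data the result has two entries, AA at 10.0 and UA at 25.0. DL does not appear because its only flight has a delay of 0 and is removed by the filter before grouping.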

See also: numpy-array