pandas Python Library: Difference between revisions

Latest revision as of 15:36, 24 July 2023

Context:
- It can (typically) support a pandas Data Structure, such as pandas.DataFrame and pandas.Series.
Example(s):
- pandas v0.14.1.
- …
Counter-Example(s):
- numpy Library.
- SciPy Library.
- R Dataframe.
See: PyData, Tabular Data, OLAP Aggregation, OLAP Drill Down, Moving Window Function, Rolling Regression.

(Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Pandas_(software) Retrieved:2017-6-5.
- pandas is a software library written for the Python programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series. Pandas is free software released under the three-clause BSD license. The name is derived from the term “panel data”, an econometrics term for multidimensional structured data sets.
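
The time-series support described above can be illustrated with a minimal sketch of a moving-window mean. The dates and values are made up for illustration, and the 3-day window is an arbitrary choice, not something the quoted source specifies.

```python
import pandas as pd

# Hypothetical daily observations; the dates and values are illustrative only.
dates = pd.date_range("2023-01-01", periods=5, freq="D")
prices = pd.Series([10.0, 11.0, 12.0, 13.0, 14.0], index=dates)

# 3-day moving-window mean. The first two entries are NaN because
# the window does not yet contain three observations.
rolling_mean = prices.rolling(window=3).mean()
print(rolling_mean)
```

The result is a Series that keeps the same date index as the input, so the smoothed values stay aligned with the original observations.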

http://pandas.pydata.org/
- pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

(McKinney, 2012) ⇒ Wes McKinney. (2012). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython.” O'Reilly Media. ISBN:9781449323615

@@ Line 37: / Line 37: @@
 *** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]].
 *** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]].
-*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s
+*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s.
 *** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
 ** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas Python Library|pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.
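
As a rough sketch of the two structures described above, the following builds a labeled DataFrame with heterogeneously typed columns, pulls out one column as a Series, and hands the values to NumPy. The column names, row labels, and numbers are hypothetical and chosen only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical table with heterogeneously typed columns
# (string, float, integer) and explicit row labels.
df = pd.DataFrame(
    {
        "ticker": ["AAA", "BBB", "CCC"],
        "price": [10.5, 20.0, 7.25],
        "shares": [100, 50, 200],
    },
    index=["r1", "r2", "r3"],
)

# Selecting a single column yields a 1-dimensional Series
# that shares the DataFrame's row labels.
price = df["price"]

# Data need not be labeled: without an explicit index,
# pandas assigns default integer positions 0, 1, 2, ...
unlabeled = pd.Series([1, 2, 3])

# Columns convert to NumPy arrays, so NumPy functions apply directly.
total = float(np.dot(price.to_numpy(), df["shares"].to_numpy()))
print(df.dtypes)
print(total)
```

Each column keeps its own dtype, which is what distinguishes a DataFrame from a plain homogeneously typed NumPy array.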