pandas Python Library: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
m (Text replacement - "]] *** " to "]]. *** ")
m (Text replacement - "]]s **" to "]]s. **")
 
Line 37: Line 37:
*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]].
*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]].
*** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]].
*** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]].
*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s
*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s.
*** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
*** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas Python Library|pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.
** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas Python Library|pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.

Latest revision as of 15:36, 24 July 2023

A pandas Python Library is a Python data transformation library and a Python data analysis library.



References

2017

2017b

2013a

  • http://pandas.pydata.org/
    • pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

2013b

2012

  • (McKinney, 2012) ⇒ Wes McKinney. (2012). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615