pandas Python Library: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
 
m (Text replacement - "]]s **" to "]]s. **")
 
(31 intermediate revisions by 3 users not shown)
Line 1: Line 1:
A [[Pandas Python Library]] is a [[Python Data Transformation Library]] and a [[Python Data Analysis Library]].
A [[pandas Python Library]] is a [[Python data transformation library]] and a [[Python data analysis library]].
* <B>See:</B> [[Python Library]], [[SciPy]], [[NymPy]].
* <B>Context:</B>
** It can (typically) support a [[pandas Data Structure]], such as [[pandas.DataFrame]] and [[pandas.Series]].
* <B>Example(s)</B>
** [[pandas v0.14.1]].
** …
* <B>Counter-Example(s):</B>
** [[numpy Library]].
** [[SciPy Library]].
** [[R Dataframe]].
* <B>See:</B> [[PyData]], [[Tabular Data]], [[OLAP Aggregation]], [[OLAP Drill Down]], [[Moving Window Function]], [[Rolling Regression]].
 
----
----
----
----
==References==
 
== References ==
 
* http://pandas.pydata.org/
* http://pandas.pydata.org/
* http://pandas.pydata.org/pandas-docs/stable/10min.html


===2013===
=== 2017 ===
* (Wikipedia, 2013) &rArr; http://en.wikipedia.org/wiki/Pandas (software) Retrieved:2013-10-29.
* (Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Pandas_(software) Retrieved:2017-6-5.
** '''Pandas''' is a software library written for the [[Python (programming language)|Python]] programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and [[time series]]. Pandas is [[free software]] released under the 3-clause [[BSD license]]. <ref> http://pandas.pydata.org/pandas-docs/stable/overview.html#license </ref>
** '''pandas''' is a [[software library]] written for the [[Python (programming language)|Python programming language]] for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and [[time series]]. Pandas is [[free software]] released under the three-clause [[BSD license]]. <ref> http://pandas.pydata.org/pandas-docs/stable/overview.html#license </ref> The name is derived from the term “[[panel data]]", an [[econometrics]] term for multidimensional structured data sets.
<references/>
<references/>
=== 2017b ===
* https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
** QUOTE: <HTML><IMG WIDTH=700 SRC=https://cdn-images-1.medium.com/max/1000/1*o_CO_8Plpi2Ac3s-Gs5IQg.png></HTML>
=== 2013a ===
* http://pandas.pydata.org/
* http://pandas.pydata.org/
** pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
** pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
=== 2013b ===
* http://pandas.pydata.org/pandas-docs/stable/
* http://pandas.pydata.org/pandas-docs/stable/
** [[pandas]] is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, [[pandas|it]] has the broader goal of becoming the most powerful and [[flexible system|flexible]] [[open source]] [[data analysis / manipulation tool]] available in any language. It is already well on its way toward this goal. <P> pandas is well suited for many different kinds of data:
** [[pandas Python Library|pandas]] is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, [[pandas Python Library|it]] has the broader goal of becoming the most powerful and [[flexible system|flexible]] [[open source]] [[data analysis / manipulation tool]] available in any language. It is already well on its way toward this goal.       <P>         pandas is well suited for many different kinds of data:
*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]]
*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]].
*** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]].
*** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]].
*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s
*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s.
*** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
*** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.
** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas Python Library|pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.
 
=== 2012 ===
* ([[2012_PythonforDataAnalysisDataWrangl|McKinney, 2012]]) ⇒ [[Wes McKinney]]. ([[2012]]). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615


----
----
__NOTOC__
[[Category:Concept]]
[[Category:Concept]]
__NOTOC__

Latest revision as of 15:36, 24 July 2023

A pandas Python Library is a Python data transformation library and a Python data analysis library.



References

2017

2017b

2013a

  • http://pandas.pydata.org/
    • pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

2013b

2012

  • (McKinney, 2012) ⇒ Wes McKinney. (2012). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615