pandas Python Library: Difference between revisions

Latest revision as of 15:36, 24 July 2023

A pandas Python Library is a Python data transformation library and a Python data analysis library.

Context:
- It can (typically) support a pandas Data Structure, such as pandas.DataFrame and pandas.Series.
Example(s)
- pandas v0.14.1.
- …
Counter-Example(s):
- numpy Library.
- SciPy Library.
- R Dataframe.
See: PyData, Tabular Data, OLAP Aggregation, OLAP Drill Down, Moving Window Function, Rolling Regression.

References

2017

(Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Pandas_(software) Retrieved:2017-6-5.
- pandas is a software library written for the Python programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series. Pandas is free software released under the three-clause BSD license. ^[1] The name is derived from the term “panel data", an econometrics term for multidimensional structured data sets.

↑ http://pandas.pydata.org/pandas-docs/stable/overview.html#license

2017b

https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
- QUOTE:

2013a

http://pandas.pydata.org/
- pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

2013b

http://pandas.pydata.org/pandas-docs/stable/
- pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.
  pandas is well suited for many different kinds of data:
  - Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet.
  - Ordered and unordered (not necessarily fixed-frequency) time series data.
  - Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels.
  - Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
- The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other 3rd party libraries.

2012

(McKinney, 2012) ⇒ Wes McKinney. (2012). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615

[1] ttp://pandas.pydata.org/pandas-docs/stable/overview.html#license

[1]

@@ Line 1: / Line 1: @@
-A [[Pandas Python Library]] is a [[Python Data Transformation Library]] and a [[Python Data Analysis Library]].
+A [[pandas Python Library]] is a [[Python data transformation library]] and a [[Python data analysis library]].
-* <B>See:</B> [[Python Library]], [[SciPy]], [[NymPy]].
+* <B>Context:</B>
+** It can (typically) support a [[pandas Data Structure]], such as [[pandas.DataFrame]] and [[pandas.Series]].
+* <B>Example(s)</B>
+** [[pandas v0.14.1]].
+** …
+* <B>Counter-Example(s):</B>
+** [[numpy Library]].
+** [[SciPy Library]].
+** [[R Dataframe]].
+* <B>See:</B> [[PyData]], [[Tabular Data]], [[OLAP Aggregation]], [[OLAP Drill Down]], [[Moving Window Function]], [[Rolling Regression]].
 ----
 ----
-==References==
+== References ==
 * http://pandas.pydata.org/
+* http://pandas.pydata.org/pandas-docs/stable/10min.html
-===2013===
+=== 2017 ===
-* (Wikipedia, 2013) &rArr; http://en.wikipedia.org/wiki/Pandas (software) Retrieved:2013-10-29.
+* (Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Pandas_(software) Retrieved:2017-6-5.
-** '''Pandas''' is a software library written for the [[Python (programming language)|Python]] programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and [[time series]]. Pandas is [[free software]] released under the 3-clause [[BSD license]]. <ref> http://pandas.pydata.org/pandas-docs/stable/overview.html#license </ref>
+** '''pandas''' is a [[software library]] written for the [[Python (programming language)|Python programming language]] for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and [[time series]]. Pandas is [[free software]] released under the three-clause [[BSD license]]. <ref> http://pandas.pydata.org/pandas-docs/stable/overview.html#license </ref> The name is derived from the term “[[panel data]]", an [[econometrics]] term for multidimensional structured data sets.
 <references/>
+=== 2017b ===
+* https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
+** QUOTE: <HTML><IMG WIDTH=700 SRC=https://cdn-images-1.medium.com/max/1000/1*o_CO_8Plpi2Ac3s-Gs5IQg.png></HTML>
+=== 2013a ===
 * http://pandas.pydata.org/
 ** pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
+=== 2013b ===
 * http://pandas.pydata.org/pandas-docs/stable/
-** [[pandas]] is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, [[pandas|it]] has the broader goal of becoming the most powerful and [[flexible system|flexible]] [[open source]] [[data analysis / manipulation tool]] available in any language. It is already well on its way toward this goal.  <P> pandas is well suited for many different kinds of data:
+** [[pandas Python Library|pandas]] is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, [[pandas Python Library|it]] has the broader goal of becoming the most powerful and [[flexible system|flexible]] [[open source]] [[data analysis / manipulation tool]] available in any language. It is already well on its way toward this goal.        <P>         pandas is well suited for many different kinds of data:
-*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]]
+*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]].
 *** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]].
-*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s
+*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s.
 *** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
-** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.
+** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas Python Library|pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es.
+=== 2012 ===
+* ([[2012_PythonforDataAnalysisDataWrangl|McKinney, 2012]]) ⇒ [[Wes McKinney]]. ([[2012]]). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615
 ----
+__NOTOC__
 [[Category:Concept]]
-__NOTOC__

pandas Python Library: Difference between revisions

Latest revision as of 15:36, 24 July 2023

References

2017

2017b

2013a

2013b

2012

Navigation menu

Search