pandas Python Library: Difference between revisions
Jump to navigation
Jump to search
(Created page with "A Pandas Python Library is a Python Data Transformation Library and a Python Data Analysis Library. * <B>See:</B> Python Library, SciPy, NymPy. ---- --...") |
m (Text replacement - "]]s **" to "]]s. **") |
||
(31 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
A [[ | A [[pandas Python Library]] is a [[Python data transformation library]] and a [[Python data analysis library]]. | ||
* <B>See:</B> [[ | * <B>Context:</B> | ||
** It can (typically) support a [[pandas Data Structure]], such as [[pandas.DataFrame]] and [[pandas.Series]]. | |||
* <B>Example(s)</B> | |||
** [[pandas v0.14.1]]. | |||
** … | |||
* <B>Counter-Example(s):</B> | |||
** [[numpy Library]]. | |||
** [[SciPy Library]]. | |||
** [[R Dataframe]]. | |||
* <B>See:</B> [[PyData]], [[Tabular Data]], [[OLAP Aggregation]], [[OLAP Drill Down]], [[Moving Window Function]], [[Rolling Regression]]. | |||
---- | ---- | ||
---- | ---- | ||
==References== | |||
== References == | |||
* http://pandas.pydata.org/ | * http://pandas.pydata.org/ | ||
* http://pandas.pydata.org/pandas-docs/stable/10min.html | |||
=== | === 2017 === | ||
* (Wikipedia, | * (Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Pandas_(software) Retrieved:2017-6-5. | ||
** ''' | ** '''pandas''' is a [[software library]] written for the [[Python (programming language)|Python programming language]] for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and [[time series]]. Pandas is [[free software]] released under the three-clause [[BSD license]]. <ref> http://pandas.pydata.org/pandas-docs/stable/overview.html#license </ref> The name is derived from the term “[[panel data]]", an [[econometrics]] term for multidimensional structured data sets. | ||
<references/> | <references/> | ||
=== 2017b === | |||
* https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5 | |||
** QUOTE: <HTML><IMG WIDTH=700 SRC=https://cdn-images-1.medium.com/max/1000/1*o_CO_8Plpi2Ac3s-Gs5IQg.png></HTML> | |||
=== 2013a === | |||
* http://pandas.pydata.org/ | * http://pandas.pydata.org/ | ||
** pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language. | ** pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language. | ||
=== 2013b === | |||
* http://pandas.pydata.org/pandas-docs/stable/ | * http://pandas.pydata.org/pandas-docs/stable/ | ||
** [[pandas]] is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, [[pandas|it]] has the broader goal of becoming the most powerful and [[flexible system|flexible]] [[open source]] [[data analysis / manipulation tool]] available in any language. It is already well on its way toward this goal. | ** [[pandas Python Library|pandas]] is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, [[pandas Python Library|it]] has the broader goal of becoming the most powerful and [[flexible system|flexible]] [[open source]] [[data analysis / manipulation tool]] available in any language. It is already well on its way toward this goal. <P> pandas is well suited for many different kinds of data: | ||
*** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]] | *** [[Tabular data with heterogeneously-typed columns]], as in an [[SQL table]] or [[Excel spreadsheet]]. | ||
*** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]]. | *** [[Ordered time series data|Ordered]] and [[unordered time series data|unordered]] (not necessarily [[fixed-frequency]]) [[time series data]]. | ||
*** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s | *** [[Arbitrary matrix data]] ([[homogeneously typed matrix|homogeneously typed]] or [[heterogeneously typed matrix |heterogeneous]]) with [[row label|row]] and [[column label]]s. | ||
*** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure | *** Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure | ||
** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es. | ** The two primary [[data structures of pandas]], [[Series (1-dimensional)]] and [[DataFrame (2-dimensional)]], handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For [[R user]]s, [[DataFrame]] provides everything that [[R’s data.frame]] provides and much more. [[pandas Python Library|pandas]] is built on top of [[NumPy]] and is intended to integrate well within a [[scientific computing environment]] with many [[other 3rd party librari]]es. | ||
=== 2012 === | |||
* ([[2012_PythonforDataAnalysisDataWrangl|McKinney, 2012]]) ⇒ [[Wes McKinney]]. ([[2012]]). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615 | |||
---- | ---- | ||
__NOTOC__ | |||
[[Category:Concept]] | [[Category:Concept]] | ||
Latest revision as of 15:36, 24 July 2023
A pandas Python Library is a Python data transformation library and a Python data analysis library.
- Context:
- It can (typically) support a pandas Data Structure, such as pandas.DataFrame and pandas.Series.
- Example(s)
- Counter-Example(s):
- See: PyData, Tabular Data, OLAP Aggregation, OLAP Drill Down, Moving Window Function, Rolling Regression.
References
2017
- (Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Pandas_(software) Retrieved:2017-6-5.
- pandas is a software library written for the Python programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series. Pandas is free software released under the three-clause BSD license. [1] The name is derived from the term “panel data", an econometrics term for multidimensional structured data sets.
2017b
2013a
- http://pandas.pydata.org/
- pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
2013b
- http://pandas.pydata.org/pandas-docs/stable/
- pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.
pandas is well suited for many different kinds of data:
- Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet.
- Ordered and unordered (not necessarily fixed-frequency) time series data.
- Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels.
- Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure
- The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other 3rd party libraries.
- pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.
2012
- (McKinney, 2012) ⇒ Wes McKinney. (2012). “Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython." O'Reilly Media. ISBN:9781449323615