Data Scientist

Jump to: navigation, search

A Data Scientist is an applied scientist who can perform data science tasks (using data science systems).



  • (Wikipedia, 2018) ⇒ Retrieved:2018-7-25.
    • Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured, similar to data mining. Data science is a "concept to unify statistics, data analysis, machine learning and their related methods" in order to "understand and analyze actual phenomena" with data.[1] It employs techniques and theories drawn from many fields within the context of mathematics, statistics, information science, and computer science. Turing award winner Jim Gray imagined data science as a "fourth paradigm" of science (empirical, theoretical, computational and now data-driven) and asserted that "everything about science is changing because of the impact of information technology" and the data deluge.[2] [3] When Harvard Business Review called it "The Sexiest Job of the 21st Century",[4] the term "data science" became a buzzword, and is now often applied to business analytics,[5] business intelligence, predictive modeling, or any arbitrary use of data, or used as a glamorized term for statistics.[6] In many cases, earlier approaches and solutions are now simply rebranded as "data science" to be more attractive, which can cause the term to become "dilute[d] beyond usefulness." While many university programs now offer a data science degree, there exists no consensus on a definition or suitable curriculum contents.[5] To its discredit, however, many data science and big data projects fail to deliver useful results, often as a result of poor management and utilization of resources.


    • QUOTE: ... We decided to restructure data science along three tracks. These described what we were looking for and are areas we want to attract talent. The Analytics track is ideal for those who are skilled at asking a great question, exploring cuts of the data in a revealing way, automating analysis through dashboards and visualizations, and driving changes in the business as a result of recommendations. The Algorithms track would be the home for those with expertise in machine learning, passionate about creating business value by infusing data in our product and processes. And the Inference track would be perfect for our statisticians, economists, and social scientists using statistics to improve our decision making and measure the impact of our work. ...




    • … A practitioner of data science is called a data scientist. The term was coined by DJ Patil and Jeff Hammerbacher.[7] Data scientists solve complex data problems through employing deep expertise in some scientific discipline. It is generally expected that data scientists are able to work with various elements of mathematics, statistics and computer science, although expertise in these subjects are not required. However, a data scientist is most likely to be an expert in only one or two of these disciplines and proficient in another two or three. There is probably no living person who is an expert in all of these disciplines - if so they would be extremely rare. This means that data science must be practiced as a team, where across the membership of the team there is expertise and proficiency across all the disciplines.

      Good data scientists are able to apply their skills to achieve a broad spectrum of end results. Some of these include the ability to find and interpret rich data sources, manage large amounts of data despite hardware, software and bandwidth constraints, merge data sources together, ensure consistency of data-sets, create visualizations to aid in understanding data and building rich tools that enable others to work effectively. The skill-sets and competencies that data scientists employ vary widely. Data scientists are an integral part of competitive intelligence, a newly emerging field that encompasses a number of activities, such as data mining and analysis, that can help businesses gain a competitive edge.[8]

      A major goal of data science is to make it easier for others to find and coalesce data with greater ease. Data science technologies impact how we access data and conduct research across various domains, including the biological sciences, medical informatics, social sciences and the humanities.

  1. Cite error: Invalid <ref> tag; no text was provided for refs named Hayashi
  2. Cite error: Invalid <ref> tag; no text was provided for refs named TansleyTolle2009
  3. Cite error: Invalid <ref> tag; no text was provided for refs named BellHey2009
  4. Cite error: Invalid <ref> tag; no text was provided for refs named Harvard
  5. 5.0 5.1 Cite error: Invalid <ref> tag; no text was provided for refs named GilPress
  6. Cite error: Invalid <ref> tag; no text was provided for refs named NateSilver
  7. Template:Cite news
  8. LaPonsie, Maryalene. "Data scientists: The Hottest Job You Haven't Heard Of". Retrieved 7 October 2012.