Data Science

in #data7 years ago

This diagram shows the data science process.

  • Data is collected from sensors in the environment, represented by the globe.
  • Data is "cleaned" or otherwise processed to produce a data set (typically a data table) usable for processing.
  • Exploratory data analysis and statistical modeling may then be performed.
  • A "data product" is a program such as retailers use to suggest new purchases based on purchase history. It can also create data and feed it back into the environment.

18341930_1481722371877798_343902385719500029_n.png

Sort:  

In general terms, the science of data is the extraction of knowledge of data sets1,2. It uses techniques and theories derived from several other broader fields of mathematics, mainly statistics, information theory and information technology, including signal processing, probabilistic models, automatic learning, Statistical learning, computer programming, data engineering, pattern recognition and learning, visualization, prophetic analytics, uncertainty modeling, data storage, data compression and computing High performance. Methods that adapt to mass data are particularly interesting in the science of data, although discipline is generally not considered to be limited to this data.