freeCodeCamp/guide/chinese/data-science-tools/detail/index.md

1.1 KiB

title localeTitle
detail 详情

什么是数据科学

数据科学是一个多学科领域,结合了技能

    software engineering and statistics with domain experience to 
    support the end-to-end analysis of large and diverse data sets, 
    ultimately uncovering value for an organization and then 
    communicating that value to stakeholders as actionable results. 

数据科学家

Person who is better at statistics than any software engineer and 
    better at software engineering than any statistician. 

你需要什么技能?

* Mathematics - Calculus, Linear Algebra 
 * Statistics - Hypothesis, Testing, Regression 
 * Programming - SQL, R/Python 
 * Machine Learning - Supervised and Unsupervised Learning, Model Fitting 
 * Business/Product Intuition - Interpret and communicate results to non-technical audience 

生命周期

1 - Identify or Formulate Problem 
 2 - Data Preparation 
 3 - Data Exploration 
 4 - Transform and Select 
 5 - Build Model 
 6 - Validate Model 
 7 - Deploy Model 
 8 - Evalute or Monitor Results