---
title: detail
localeTitle: detalle
---

## What is data science?

Data science is a multidisciplinary field that combines skills in software engineering and statistics with domain experience to support the end-to-end analysis of large and diverse data sets, ultimately uncovering value for an organization and then communicating that value to stakeholders as actionable results.

## Data scientist

A person who is better at statistics than any software engineer and better at software engineering than any statistician.

## What skills are needed?

* Mathematics - Calculus, Linear Algebra
* Statistics - Hypothesis Testing, Regression
* Programming - SQL, R/Python
* Machine Learning - Supervised and Unsupervised Learning, Model Fitting
* Business/Product Intuition - Interpreting and communicating results to a non-technical audience

## Life cycle

1. Identify or Formulate Problem
2. Data Preparation
3. Data Exploration
4. Transform and Select
5. Build Model
6. Validate Model
7. Deploy Model
8. Evaluate or Monitor Results

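
The life cycle steps above can be sketched as a very small end-to-end run. This is only an illustration under stated assumptions: the data is synthetic, every name (`fit_line`, `rmse`, `predict`, the 20% holdout, the monitoring threshold) is hypothetical, and a one-feature least-squares line stands in for whatever model a real project would use.

```python
import random
import statistics

# Step 1 - Problem (hypothetical): predict y from x.
rng = random.Random(0)

# Step 2 - Data preparation: synthetic records (assumed y = 3x + 2 plus noise),
# with one record that has a missing target and is dropped.
raw = [(x, 3.0 * x + 2.0 + rng.gauss(0, 1.0)) for x in range(50)]
raw.append((50, None))
data = [(x, y) for x, y in raw if y is not None]

# Step 3 - Data exploration: basic summary statistics.
print("rows:", len(data))
print("mean y:", round(statistics.mean(y for _, y in data), 2))

# Step 4 - Transform and select: shuffle, then hold out 20% for validation.
rng.shuffle(data)
split = int(0.8 * len(data))
train, holdout = data[:split], data[split:]


# Step 5 - Build model: ordinary least squares for a single feature.
def fit_line(points):
    mx = statistics.mean(x for x, _ in points)
    my = statistics.mean(y for _, y in points)
    sxx = sum((x - mx) ** 2 for x, _ in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    slope = sxy / sxx
    return slope, my - slope * mx


slope, intercept = fit_line(train)


# Step 6 - Validate model: root mean squared error on the held-out records.
def rmse(points, slope, intercept):
    errors = [(y - (slope * x + intercept)) ** 2 for x, y in points]
    return (sum(errors) / len(errors)) ** 0.5


holdout_rmse = rmse(holdout, slope, intercept)


# Step 7 - Deploy model: here, just a function other code could call.
def predict(x):
    return slope * x + intercept


# Step 8 - Evaluate or monitor: flag the model if the error drifts
# above an assumed threshold.
needs_review = holdout_rmse > 3.0
print("slope:", round(slope, 2), "holdout RMSE:", round(holdout_rmse, 2))
```

In practice each step is far larger than one or two lines, and validation results often send the work back to an earlier step such as data preparation or feature selection.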