How is data science not just statistics, rebranded?
When I studied statistics in university we studied clustering, regression, hypothesis testing, confidence intervals, visualisations, charting, general linear models... how is this different from data science?
Is it just because we have more computing power, so we rebranded it? Instead of doing frequentist analyses because those are easy to do with smaller computers, now we do more Bayesian stats, on bigger computers, and therefore call it data science?
From the one data science course I took: compared to a traditional stats class, there is a bit more focus on data acquisition and pre processing, and more usage of our of the box tools, a bit less on some of theory (I don’t recall hearing about type I and type II errors).
Of course, since it was a professional course there is probably the assumption that students already know these things…
I’ve periodically looked into it the differences and mostly it seems to come down to priorities. Statistics prioritizes mathematical rigor all the way through the process from collection to analysis. ML / Data science has more algorithmic concerns. Even the theoretical aspects of the two fields emphasize that distinction.
When I studied statistics in university we studied clustering, regression, hypothesis testing, confidence intervals, visualisations, charting, general linear models... how is this different from data science?
Is it just because we have more computing power, so we rebranded it? Instead of doing frequentist analyses because those are easy to do with smaller computers, now we do more Bayesian stats, on bigger computers, and therefore call it data science?