The best R package for learning to “think about visualization”

As a beginning data scientist, you’ll have quite a few subject areas that you need to learn (and eventually master). While you’ll certainly need to learn some math and statistics, math and stats are not the first things I recommend to most beginners.

Almost always, I recommend that people start with data visualization.

The reason is that data visualization is critical to almost every part of getting things done as a data scientist: reporting, analysis, and exploratory analysis (e.g., EDA prior to machine learning). You need data visualization constantly, and that holds for nearly every data scientist at every level.

Furthermore, I’ve argued that at the junior levels of a data team’s job hierarchy, data visualization (when combined with data manipulation) is sufficient for being productive. If you’re a junior member of a data team, your core responsibilities may revolve exclusively around visualization (e.g., reporting and analysis).

Because it’s necessary (and in some cases, sufficient) for productivity, it’s a skill that you need to master early.

ggplot2 is the visualization tool I recommend
