Who is a Data Scientist? How do they differ from a Data Analyst?

Data Science is the practice of collecting, analyzing, and interpreting large amounts of data with advanced technologies in order to derive insights that help a business improve. According to DJ Patil, he and Jeff Hammerbacher coined the term in 2008 to describe their jobs at LinkedIn and Facebook.

A Data Scientist is someone who is data-driven and has high-level technical skills, capable of building complex algorithms on large amounts of data to answer questions and drive strategy in the organization. The responsibilities and main tasks of a Data Scientist may vary from one company to another, depending on its size, its type, etc. The role can include Data Collection, Cleansing, Analytics, and Data Visualization in a small company, just ML algorithms in a big company, and all of the above in a medium-sized company.


But how is a Data Analyst different?

Job titles are a grey area: many do not accurately reflect a person's actual job and responsibilities. A Data Analyst may handle many of the same tasks as a Data Scientist (working with data, writing queries, analyzing and deriving information, etc.), and their skills may overlap (SQL, R/Python, Tableau, mathematics, algorithms, data visualization, etc.). However, the two are not the same:

The Data Analyst works with SQL and BI tools to answer questions from the business team by extracting insights from data. The Data Scientist builds statistical models, works with Machine Learning, and formulates the questions that will help the business by predicting the future from past patterns.
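To make the contrast concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `SALES` figures are invented, and the function names `analyst_average_revenue` and `scientist_forecast` are hypothetical. The first function answers a descriptive question with a SQL query, roughly what an analyst might do. The second fits a simple least-squares line to past values and extrapolates it, a toy stand-in for the predictive models a Data Scientist would build.

```python
import sqlite3

# Hypothetical monthly sales (month index, revenue); invented for illustration.
SALES = [(1, 100.0), (2, 120.0), (3, 140.0), (4, 160.0)]


def analyst_average_revenue(rows):
    """Analyst-style question: what was the average monthly revenue?"""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (month INTEGER, revenue REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    (avg,) = conn.execute("SELECT AVG(revenue) FROM sales").fetchone()
    conn.close()
    return avg


def scientist_forecast(rows, future_month):
    """Scientist-style question: what revenue should we expect in a future month?

    Fits a straight line to past data by ordinary least squares and
    extrapolates it. Real models are usually far richer than this.
    """
    n = len(rows)
    mean_x = sum(month for month, _ in rows) / n
    mean_y = sum(revenue for _, revenue in rows) / n
    covariance = sum((m - mean_x) * (r - mean_y) for m, r in rows)
    variance = sum((m - mean_x) ** 2 for m, _ in rows)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return intercept + slope * future_month


if __name__ == "__main__":
    print("Average revenue so far:", analyst_average_revenue(SALES))
    print("Forecast for month 5:", scientist_forecast(SALES, 5))
```

The first function summarizes what already happened, while the second uses the past pattern to estimate what has not happened yet.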


A Data Scientist is someone who is better at statistics than any Data/Software Engineer, and better at engineering than any Analyst/Statistician.

 


Here are 9 must-have skills for a Data Scientist, according to KDnuggets:

  1. Education
  2. R Programming
  3. Python Coding
  4. Hadoop Platform
  5. SQL Database/Coding
  6. Apache Spark
  7. Machine Learning and AI
  8. Data Visualization
  9. Unstructured Data


mostlyfad

Computer Engineer • Entrepreneur • Blogger
