Guest blog post by Laetitia Van Cauwenberge
Salary mostly depends on experience, education, location, industry, and unfortunately, factors such as gender. Also, most data scientists have all the three skills and more (R + Python + SQL), so it is hard to assess which one is the most valuable.
Source for picture: click here (numbers are from 2014)
You could do a survey, asking data scientists which skills they were hired for, and break the results down into 8 categories:
- no R, no Python, no SQL
- R, no Python, no SQL
- no R, Python, no SQL
- no R, no Python, SQL
- R, Python, no SQL
- R, no Python, SQL
- no R, Python, SQL
- R, Python, SQL
And then compute the average salary in each category, maybe broken down by education / experience / location. Or you can think logically, concluding that
- Python can do everything (web development, data science, and so on), in particular in production mode, has great data science libraries, and thus commands the highest salary boost
- R is somewhat specialized and limited to statistical analysis, thus commanding a lower salary boost
- SQL is extensively used and a very popular skill; in addition SQL queries are much easier to automate or outsource than Python coding, thus commanding the lowest salary boost. Though not knowing SQL might mean no job offer even if you are an expert in Python or R.
Then you can do some search using Indeed.com, and get salary per location for these three skills. Below are numbers for San Francisco, as of April 2016 (big contrast with the above picture corresponding to 2014, but the numbers below are for all job ads including architects and software engineers, not just data scientists). The number in parentheses represents the number of job openings.
- $90,000+ (3776)
- $100,000+ (3316)
- $110,000+ (2451)
- $120,000+ (1492)
- $130,000+ (744)
- $40,000+ (1437)
- $65,000+ (1138)
- $85,000+ (894)
- $105,000+ (583)
- $120,000+ (338)
- $75,000+ (5084)
- $90,000+ (4068)
- $100,000+ (3152)
- $110,000+ (2042)
- $120,000+ (1201)
It seems, if this data is correct, that SQL commands higher salaries than R, contrarily to my intuition. These numbers, based on recent job ads, can be found here (you can select the skill or location). Do you know why SQL commands higher earnings than R?
Other interesting search results for San Francisco include the following:
Data Science
- $65,000+ (6743)
- $90,000+ (5288)
- $105,000+ (3972)
- $115,000+ (2843)
- $125,000+ (1667)
Python + R
- $75,000+ (70)
- $95,000+ (54)
- $110,000+ (44)
- $120,000+ (30)
- $130,000+ (16)
Python + SQL
- $95,000+ (22)
- $110,000+ (17)
- $115,000+ (16)
- $125,000+ (9)
- $140,000+ (5)
R + SQL
- $80,000+ (19)
- $85,000+ (17)
- $100,000+ (13)
- $110,000+ (10)
- $125,000+ (4)
Sounds like Python + SQL might be the best combination. What is weird is that Python + R seems to command a lower salary than just Python alone. And if you know SQL, it gives you a nice boost if you are an R programmer.
DSC Resources
- Career: Training | Books | Cheat Sheet | Apprenticeship | Certification | Salary Surveys | Jobs
- Knowledge: Research | Competitions | Webinars | Our Book | Members Only | Search DSC
- Buzz: Business News | Announcements | Events | RSS Feeds
- Misc: Top Links | Code Snippets | External Resources | Best Blogs | Subscribe | For Bloggers
Additional Reading
- What statisticians think about data scientists
- Data Science Compared to 16 Analytic Disciplines
- 10 types of data scientists
- 91 job interview questions for data scientists
- 50 Questions to Test True Data Science Knowledge
- 24 Uses of Statistical Modeling
- 21 data science systems used by Amazon to operate its business
- Top 20 Big Data Experts to Follow (Includes Scoring Algorithm)
- 5 Data Science Leaders Share their Predictions for 2016 and Beyond
- 50 Articles about Hadoop and Related Topics
- 10 Modern Statistical Concepts Discovered by Data Scientists
- Top data science keywords on DSC
- 4 easy steps to becoming a data scientist
- 22 tips for better data science
- How to detect spurious correlations, and how to find the real ones
- 17 short tutorials all data scientists should read (and practice)
- High versus low-level data science
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge
Comments