Data Science

Data Scientists: What They Do, How Much They Make

Data Scientists: What They Do, How Much They Make
Data science isn't one of those fields where you can fake it until you make it. In this role, you better know what you're doing. Image from Unsplash
Christa Terry profile
Christa Terry March 13, 2023

Quant nerds can make a lot of money in data science, provided they have the skills and the credentials to prove they know what they're doing. Here's why data scientists make bank and what you need to do to become one.

Data Science Programs You Should Consider

Advertisement
Article continues here

Most of the data scientists in the world, if LinkedIn co-founder Allen Blue is to be believed, are already gainfully employed. “There are very few data scientists out there passing out their resumes,” he told the Knowledge at Wharton blog, adding that data science and machine learning jobs represent five of the top 15 fastest-growing jobs in America. The Bureau of Labor Statistics confirms that, setting the job market growth rate between 2021 and 2031 for data scientists in the United States at 36 percent, seven times faster than the rate for the overall job market.

The takeaway? There are more data science jobs than there are data scientists to fill them. If you’re looking for job security (and who isn’t), you can find it in this highly technical, highly specialized field.

Just because demand for data scientists is high doesn’t mean you should become one, however. If you’re researching careers in data science because you’re operating under the assumption that it’s an easy path to big money, you need to take another look at the skill set data scientists have and what data scientists actually do. Data scientists need to be experts in math, statistics, programming, communication, and problem-solving. But more importantly, they need to love data. Harvard Business Review called data scientist the sexiest job of the 21st century, but the reality is that this is a job for hardcore numbers nerds.

Massachusetts Institute of Technology graduate Kay Aull put it like this in a Quora thread about choosing a data science career: “There’s data everywhere. Have you ever said ‘Hmm, I wonder…’ and then lost hours of your life trying to kludge together an analysis? If so, you might be the right kind of weirdo for data science. If you just want to learn a few buzzwords and then stroll into a six-figure job, there are other fields better suited for that.”

Let’s assume you’re not here for the buzzwords or because you want us to confirm that data scientists make good money (they do). In this article, we’re going to dive deep into the question “What does a data scientist do?” and cover the following ground:

  • What are data scientists responsible for?
  • Do data scientists and data analysts do the same things?
  • What kind of experience is important for a data scientist?
  • What training do data scientists typically have?
  • What companies hire data scientists?
  • What kinds of projects can data scientists work on?
  • What tools do data scientists use?
  • How much money can you make as a data scientist?

What are data scientists responsible for?

Figuring out what data scientists do isn’t always easy. As you research the question, you’ll probably come across answers like this: Data scientists develop predictive and prescriptive algorithms built on clean data sets. Unless you’re already a data scientist, that explanation may be head-scratchily vague. This description of what data scientists do is easier to understand: Data scientists gather and organize large amounts of data to solve process and strategy problems in business and in other enterprises.

We can even break it down further. Data scientists explore data with powerful tools (which they sometimes design themselves) to discover meaning. It’s still a vague answer, but purposefully so, because data science is useful across industries. Whether a data scientist works in pharmaceuticals or finance doesn’t matter. They’re still using huge sets of information to solve abstract problems or predict future events.

Here’s an example of data science in action: An insurance company wants to reduce costs by catching a specific cancer earlier when treatment is less expensive. It uses analysis of data related to patient screenings, treatments, and outcomes to determine which subscribers will benefit most from which screenings.

Here’s another: An e-commerce company wants to increase the amount customers spend in a single shopping trip. It uses analysis of past customer behavior to generate product recommendations related to what’s already in a customer’s virtual shopping cart.

Data scientists do more than just make money and save money for corporations, however. Nonprofit organizations, academic institutions, and governments all use data science to optimize processes, answer strategy questions, and identify trends. Sometimes data scientists are given a complex problem, and they have to decide which information they’ll need to solve it. Sometimes they’re given information and asked to extract as much meaning from it as possible. In both cases, math, statistics, and programming will come into play.

Advertisement

“I’M READY FOR A DEGREE!”

University and Program Name Learn More

Do data scientists and data analysts do the same things?

Yes and no. Data analysts work with information in many of the same ways data scientists do. And the work of data scientists and data analysts can overlap. The biggest differences between these two professionals can be found in the base level of technical expertise each possesses. Data science is a lot more technical and hands-on, and data scientists are much more likely to have coding skills, machine learning expertise, an understanding of artificial intelligence, experience with predictive analytics, and mathematical modeling expertise. They’re also more likely to have advanced degrees. In fact, data analysts sometimes advance into data scientist positions after earning a master’s degree in data analytics.

What kind of experience is important for a data scientist?

To do what they do with large data sets, data scientists need to be proficient in:

  • Machine learning concepts and techniques like neural networks, reinforcement learning, supervised machine learning, deep learning, decision trees, logistic regression, predictive models, and adversarial learning. These are the kinds of skills that allow data scientists to drive business decisions and solve organizational problems.
  • Working with unstructured data, which is any data that doesn’t fit neatly into database tables. Tracking structured data like customer purchase information is relatively simple. Factoring in data sources like customer reviews, social media posts, product photos, and other unstructured data is a lot more difficult. Data scientists have to be comfortable organizing data and manipulating this kind of information.
  • Data visualization, because most people can’t understand why information matters until it’s translated into an easy-to-understand visual format. In data science, a picture is worth a million number sets, so data scientists need experience working with data visualization tools like ggplot, Matplotlib, and Tableau.

Getting experience before working in a data scientist role or even a degree is pretty easy nowadays. Just look at what data scientists do and try to do it yourself. There are lots of free data sets you play around with—Kaggle has 19,000 public datasets—and plenty of free data manipulation tools and programming tutorials. You can learn so much about what data scientists do (and whether you want to devote your career to doing those things) by spending your free time visualizing data and building machine learning models.

You may also land a job more quickly if you build up a portfolio of personal projects that highlight your technical skills and knack for communicating results in a way non-techy folks can understand.

What training do data scientists typically have?

The answer to this question isn’t particularly straightforward. Unsurprisingly, most data scientists have both bachelor’s degrees and advanced degrees (47 percent hold master’s degrees and 47 percent hold doctorate degrees), and most specialize in computer science, math, statistics, or engineering for their entire academic careers. But data scientists also come from undergraduate backgrounds like electrical engineering, physics, and biology, and many data scientists are at least partially self-taught.

When aspiring data scientists are ready to enroll in a master’s degree or doctoral program, they typically opt to study subjects like data analytics, data engineering, applied statistics, and business intelligence because there still aren’t that many schools offering data science degrees. Some high-profile colleges and universities offer master’s degrees in data science master; they include Harvard University, Columbia University, and Tufts University. You’ll also find relevant graduate degree programs at:

  • Boston University
  • Carnegie Mellon University
  • Drexel University
  • Johns Hopkins University
  • New York University
  • Northwestern University
  • Stevens Institute of Technology

Data science training doesn’t end at graduation because earning an advanced degree won’t turn you into a data scientist. The reason there are so many self-taught data scientists is that degree programs don’t necessarily cover the ins and outs of things like Hadoop data lakes or Big Data querying—and if they do, the information they give students may be out of date by the time those students get their diplomas. New data science techniques and tools are being developed all the time. One thing you’ll never stop doing if you become a data scientist is learning.

Do I need a master’s in data science?

Yes, and not only because 90 percent of data science professionals have advanced degrees. Nearly half of all job postings for data scientists specify that candidates without master’s degrees need not apply.

There’s still a talent shortage in the world of data science, but innovations in automation are shifting the barriers to entry in a big way. “Entry-level” data science jobs go to analysts who know their way around DataRobot and autoML. The top-paying data science jobs are reserved for those with deep domain knowledge, sophisticated tech skills, and advanced degrees. Competition for these positions is fierce.

Do I need a PhD in data science?

The frustrating answer is maybe. Close to 50 percent of data scientists have doctoral degrees, and this is a rare field where PhDs aren’t funneled into academic positions. That doesn’t mean you need one to launch a successful career in data science, however. More job openings in data science call for candidates with master’s degrees than doctorates, but some companies—especially those that specialize in data science and high-profile tech firms—do hire data science PhDs in greater numbers. If your dream is to work for a big-name technology company or in research and development, a data science doctorate can help smooth your path.

Master’s in data science: related degrees

Something to consider as you research degree options is that the Master of Science in Data Science, or MSDS, is just one possible academic pathway for data scientists. People enter this field with graduate degrees in disciplines like:

  • Applied statistics: Data science heavily relies on statistical methods to analyze data, make inferences, and build predictive models. Applied statistics provides the foundational techniques for data collection, hypothesis testing, and data interpretation.
  • Business intelligence: Data science enhances business intelligence by providing advanced analytical techniques to extract insights from data. It supports decision-making processes by developing predictive models and performing complex data analyses.
  • Computational mathematics: Data science uses computational mathematics for algorithm development and numerical analysis. Techniques such as optimization, numerical linear algebra, and complex system modeling are essential for processing and analyzing large datasets.
  • Computer engineering: Data science benefits from computer engineering in terms of hardware and software optimization for data processing tasks. Efficient storage, data retrieval, and high-performance computing solutions developed by computer engineers are crucial for handling big data.
  • Computer science: Data science builds on computer science principles, including algorithms, data structures, and software engineering. It incorporates machine learning, artificial intelligence, and database management to develop and deploy data-driven solutions.
  • Cybersecurity: Data science applies machine learning and statistical analysis to detect anomalies and threats in cybersecurity. Techniques such as pattern recognition and predictive modeling help in identifying and mitigating security risks.
  • Data analytics: Data science encompasses data analytics, using advanced techniques to analyze and interpret complex datasets. It extends data analytics by applying machine learning, predictive modeling, and statistical methods to uncover deeper insights.
  • Data engineering: Data science relies on data engineering to create robust data pipelines, ensuring the efficient collection, storage, and processing of data. Data engineers build the infrastructure that supports data science activities, enabling scalable and reliable data workflows.
  • Information systems: Data science integrates with information systems to enhance data management and decision support. It uses data from various information systems to perform analyses, build models, and generate actionable insights for business and organizational strategies.

Data science master’s programs like UVA’s MSDS are arguably the best option for aspiring data scientists, but you should also look into degrees like the:

Master’s in data science: on-campus vs. online

The differences between online MSDS programs and data science master’s programs on campus vary substantially from school to school. Some colleges and universities make no distinction between online and on-campus programs. Students choose from among the same specialization options, take the same core courses and elective classes, follow the same course schedule, study under the same professors, and complete the same number of credit hours of work. They may even pay the same tuition rate and have access to the same financial aid packages.

Other schools develop entirely new online master’s programs in data science designed to accommodate distance learners’ needs. These programs typically offer students more flexibility in the form of asynchronous courses, a relaxed course schedule, or more generous project deadlines.

This is why reading guides for specific programs (not just schools) is so important. Consider two programs: Stevens Institute of Technology‘s online MS in Data Science and the school’s MSDS delivered on campus. Students in both programs take many of the same core classes, but the on-campus data science master’s offers concentrations while the online MSDS offers a much more streamlined experience. Whether you’ll get more out of one versus the other has less to do with delivery formats than with the content of individual programs.

Is online as good as on-campus?

At schools with strong programs, the answer is an unequivocal yes, for two reasons. First, students graduate with the same knowledge and skills regardless of which delivery format they choose. Students may not take the same courses or have to meet identical thesis, capstone project, practicum, or research requirements, but the goals of online MS in Data Science and on-campus data science master’s programs are the same.

Second, the best online MSDS programs take steps to replicate the on-campus experience by giving students lots of facetime with faculty members and coordinating opportunities for students to connect virtually with one another and with industry leaders. Some higher education institutions host optional or mandatory immersion weekends or campus residencies for online MSDS students. Others encourage distance learners to come to campus for recruitment and other networking events. And many schools offer online students the same pre- and post-graduation career support that on-campus learners receive.

Ultimately, the value of any degree program is up to you. You can maximize the ROI of an online data science degree by taking advantage of all the networking opportunities, mentorship programs, recruiting initiatives, and career services support available—the same way you would in a program delivered on campus.

Master’s in data science admissions

The baseline prerequisites you’ll need to meet when applying to online MSDS programs will likely include:

  • A four-year bachelor’s degree in a discipline like mathematics, computer science, analytics, or business. Many data science master’s programs don’t require that students have majored in a specific field but do expect applicants to have high GPAs, be proficient in statistics, and have some experience working with datasets and data structures.
  • Competitive GRE and/or GMAT scores. Some programs allow prospective students to apply without standardized test scores but set the academic and professional bar for incoming students higher.
  • Relevant work experience in fields like analytics, computer science, or mathematics. Many MSDS programs are geared toward seasoned professionals with backgrounds that already include some data science work or research credits.

Be aware that because data science master’s programs are still relatively new and often quite small, admission can be very competitive. Admission requirements for online master’s programs in data science vary by institution. Still, it’s not unusual for students in programs at top colleges and universities to already be data scientists with letters of recommendation that speak to their professional skills.

Master’s in data science curriculum

Core courses and electives can vary significantly from school to school, but the curricula in most full-time and part-time MSDS programs share several common features. Students in nearly all data science master’s programs tackle coursework focused on artificial intelligence, advanced programming languages, algorithm design, and data mining. Most programs are project-based and involve copious hands-on work.

The best way to get a sense of what you’ll learn when you pursue a master’s in data science online is to compare the courses offered in different programs.

Students in the University of Virginia’s online Master of Science in Data Science take courses that include:

  • Bayesian Machine Learning
  • Big Data Analytics
  • Data Mining
  • Ethics of Big Data
  • Exploratory Text Analytics
  • Foundations of Computer Science
  • Linear Models for Data Science
  • Machine Learning
  • Practice and Application of Data Science
  • Programming and Systems for Data Science

Meanwhile, students pursuing Stevens Institute of Technology’s online MSDS take:

  • Advanced Optimization Methods
  • Applied Machine Learning
  • Database Management Systems
  • Deep Learning
  • Distributed Systems and Cloud Computing
  • Dynamic Programming and Reinforcement Learning
  • Natural Language Processing
  • Numerical Linear Algebra for Big Data
  • Time Series Analysis

And students at Drexel University pursuing the school’s online Master of Science in Data Science take core and elective courses like:

  • Advanced Programming Techniques
  • Applied Cloud Computing
  • Applied Machine Learning for Data Science
  • Data Acquisition and Pre-Processing
  • Data Analysis and Interpretation
  • Data and Digital Stewardship
  • Database Management Systems
  • Deep Learning
  • Machine Learning
  • Natural Language Processing with Deep Learning
  • Social Network Analytics

As you do more research into what it’s like to study data science online, where to pursue master’s degrees for data science professionals, and how going to graduate school can improve your prospects, keep your career goals in mind. Some programs are designed for generalists, while others focus on specialty areas of data science. There are even MSDS programs for data scientists who want to transition into management. The right program for you will be the one that supports your aspirations.

What companies hire data scientists?

Take a guess, and you might say that big tech companies like IBM, Amazon, and Microsoft are employing the most data scientists. You wouldn’t be wrong. Diffbot’s 2019 State of Data Science, Engineering & AI Report identified the companies with the largest data-related workforces:

  • IBM (2,563 data workers)
  • Amazon (1,846 data workers)
  • Microsoft (1,800 data workers)
  • Facebook (1,220 data workers)
  • Oracle (1,210 data workers)
  • Google (904 data workers)
  • Apple (568 data workers)

But the tech giants aren’t the only companies that can see value in what data scientists do. Financial companies like Fidelity Investments and Bank of America hire data scientists to interpret data. So do pharmaceutical companies like Bristol-Myers Squibb. Communications companies, retailers like Target and PetSmart, hospitals, marketing firms, and even fashion houses find applications for data science. Data scientists can be found doing what they do just about everywhere because data science can answer so many kinds of questions—and boost profits.

What kinds of projects can data scientists work on?

  • Reporting and dashboarding: Data scientists help stakeholders understand the significance of large amounts of data by building auto-updating dashboards that present data in an easy-to-grasp way. A reporting dashboard might report sales numbers, sales trends, and sales projections. They help non-tech managers and executives analyze data to drive decision-making.
  • Spam filters and other filters: Companies need a way to filter out malicious data, fake data, and other information that is of low or no value. Data scientists can use historical data to analyze new data and get rid of junk.
  • Machine learning models: Data scientists can build machine learning models to classify, cluster, or correlate different kinds of data.
  • Robotics: Predicting how learning robots like self-driving cars will behave given changing conditions is a job for data science.
  • Fraud identification: Companies of all sorts need to be able to identify fraud before it affects customers. Data science can be used to ID everything from fake coupons and coupon misuse to account theft and identity theft.
  • Market analysis and sentiment analysis: Sales data, social media data, demographic data, and more can be leveraged by data scientists to help brands better understand their existing customers and target hard-to-reach potential customers
  • Manufacturing analysis: Data science can be used to optimize manufacturing processes, make production cycles shorter, and decrease the cost of production.
  • Safety analysis: Some data scientists use historic and current information to predict when and why accidents are likely to happen so manufacturers of things like cars and planes can make corrections before vehicles go to market.

What tools do data scientists use?

The list of tools data scientists use is long, but it’s important to remember that computer technology changes rapidly. By the time you read this article, there may be newer, more effective data science tools that are making it easier to clean, sort, and analyze information. Right now, data scientists rely on tools like:

  • Frameworks like Hadoop, Mahout, Apache, Hive and Pig
  • Programming languages, such as R, Java, Python, and SQL
  • Git/GitHub
  • Programming language interfaces like Jupyter Notebooks
  • Orange, IBM Watson and other automated machine learning architecture building frameworks
  • Data visualization tools like D3.js and Tableau
  • Databases like NoSQL, MongoDB, Cassandra, and MySQL
  • Python programming language packages like Pandas, Numpy, Scipy, and Matplotlib
  • Large-language model features such as ChatGPT’s Code Interpreter

Data science is about more than just using tools, however. Data scientists also have to know when to use different tools. Small data sets can be cleaned and analyzed using Excel, for example. Given how many sophisticated data science tools there are, it’s surprisingly useful. When time is of the essence and there’s a lot of data, R language is the best choice.

How much money can you make as a data scientist?

A lot, if you know what you’re doing and have an advanced quantitative degree. The Bureau of Labor Statistics sets median data scientist annual pay at just over $100,000. Those in top-paying states fare substantially better:

  • Washington ($148,000)
  • California ($140,000)
  • Virginia ($139,000)
  • New York ($134,000)
  • New Jersey ($134,000)

Top-paying sectors include:

  • Taxi and Limousine Service ($172,000)
  • Computer and Peripheral Equipment Manufacturing ($171,000)
  • Media Streaming, Distribution, and Social Media ($164,000)
  • Search Engines and Information Services ($163,000)
  • Semiconductor ($157,000)

(Updated on July 3, 2024)

How useful is this page?

Click on a star to rate it!

Since you found this page useful...mind sharing it?

We are sorry this page was not useful for you!

Please help us improve it

How can this content be more valuable?

Questions or feedback? Email editor@noodle.com

About the Editor

Tom Meltzer spent over 20 years writing and teaching for The Princeton Review, where he was lead author of the company's popular guide to colleges, before joining Noodle.

To learn more about our editorial standards, you can click here.


Share

Data Science Programs You Should Consider

Advertisement

You May Also Like To Read


Categorized as: Data ScienceInformation Technology & Engineering