in Technology by
Of which steps is the data analysis process typically composed?

1 Answer

0 votes
by
  1. Finding a relevant business problem to solve: Often neglected, this is the most important step of the process, since generating business value is the end goal of any data analyst. Having a clear objective and restricting the data space to be explored is paramount to avoiding wasting resources. Since it requires deep knowledge of the problem domain, this step may be executed by a domain expert other than the data analyst.
  2. Data extraction: The next step is to collect data for analysis. It could be as simple as loading a CSV file, but more often than not it involves gathering data from multiple sources and formats.
  3. Data cleansing: After gathering the data, the dataset needs to be prepared for processing. Likely the most time-consuming step, data cleansing can include handling missing fields, corrupt data, outliers, and duplicate entries.
  4. Data exploration: This is often what comes to mind when thinking of data analysis. Data exploration involves generating statistics, features, and visualizations from the data to better understand its underlying patterns. This then leads to insights that might generate business value.
  5. Data modeling and model validation (optional): Training a statistical or machine learning model is not always required, as a data analyst usually generates value through insights found in the data exploration step, but it may uncover additional information. Easily interpretable models, like linear or tree-based models, and clustering techniques often expose patterns that would be otherwise difficult to detect with data visualization alone.
  6. Storytelling: This last step encompasses every bit of information uncovered previously to finally present a solution to—or at least a path to continue exploring—the business problem proposed in the first step. It’s all about being able to clearly communicate findings to stakeholders and convincing them to take a course of action that will lead to creating business value.

These are the most common steps of data analysis. Although they have been presented as a list, more often than not they are not executed sequentially and some steps may require several iterations as new data sources are added and information is uncovered.

Related questions

0 votes
    In any collaboration, data ownership is typically determined by:...
asked Nov 29, 2020 in Technology by Editorial Staff
0 votes
    Which of the following process involves structuring datasets to facilitate analysis? (a) Data tidying (b) Data ... and answers pdf, Data Science interview questions for beginners...
asked Oct 30, 2021 in Education by JackTerrance
0 votes
    Which of the following analysis is a statistical process for estimating the relationships among variables? (a) ... and answers pdf, Data Science interview questions for beginners...
asked Oct 29, 2021 in Education by JackTerrance
0 votes
    The application program typically communicates with a database server, through ___________ or other protocols, ... in portion Database Programming Techniques of Database Management...
asked Oct 10, 2021 in Education by JackTerrance
0 votes
    What are the stages in the compilation process? Select one: a. Feasibility study system design and testing b. ... of the above? Select the correct answer from above options...
asked Nov 27, 2021 in Education by JackTerrance
0 votes
    Which steps are used in the RFQ process?...
asked Mar 15, 2021 by JackTerrance
0 votes
    fill in the blank:_______folder contains the mails which are composed but not yet set. Select the correct answer from above options...
asked Nov 29, 2021 in Education by JackTerrance
0 votes
    Assuming a six-person team composed of three developers, two testers, and one PO, with no vacations, then the estimated initial velocity before time off...
asked Nov 30, 2020 in Technology by Editorial Staff
0 votes
    Name the assembly in which the MVC framework is typically defined....
asked Dec 8, 2020 in Technology by JackTerrance
0 votes
    What is the Supreme Court Collegium composed of? Please answer the above question....
asked Aug 3, 2022 in Education by JackTerrance
0 votes
    Order List Priority of Research Education was Composed by? Bohem Henrry Burman Jonh Select the correct answer from above options...
asked Dec 28, 2021 in Education by JackTerrance
0 votes
    An element is composed of: a. ON tag and OFF tag b. The content between the opening and closing tag c. Both a and b Select the correct answer from above options...
asked Nov 27, 2021 in Education by JackTerrance
0 votes
    Dietary fibres are composed of (a) cellulose (b) proteins (c) amylase (d) unsaturated fats Select ... ,Science proposed by,electromagnetic theory engineering physics,Science nptel...
asked Nov 8, 2021 in Education by JackTerrance
...