in Technology by
You are given a data set consisting of variables with more than 30 percent missing values. How will you deal with them?

1 Answer

0 votes
by

The following are ways to handle missing data values:

If the data set is large, we can just simply remove the rows with missing data values. It is the quickest way; we use the rest of the data to predict the values.

For smaller data sets, we can substitute missing values with the mean or average of the rest of the data using the pandas' data frame in python. There are different ways to do so, such as df.mean(), df.fillna(mean).

Related questions

0 votes
0 votes
0 votes
    A number of cats got together and decided to kill between them 999919 rats. Every cat killed an equal number of rats. Each cat ... cat killed ? A) 1009 B) 991 C) 2000 D) 1000...
asked Feb 12, 2021 in Education by JackTerrance
0 votes
    You are given a dataset on cancer detection. You have built a classification model and achieved an accuracy of 96 percent ... with your model performance? What can you do about it?...
asked Apr 27, 2021 in Technology by JackTerrance
0 votes
    Which of the following machine learning algorithms can be used for inputting missing values of both categorical and continuous variables?...
asked Apr 27, 2021 in Technology by JackTerrance
0 votes
    When there are more than one independent variables in the model, then the linear model is termed as _______ (a ... of R Programming Select the correct answer from above options...
asked Feb 10, 2022 in Education by JackTerrance
0 votes
    You are solving a data science problem that is ridden with many missing values. You want to clean the data before ... do to replace these missing values? Develop your own function....
asked Oct 16, 2020 in Technology by JackTerrance
0 votes
    are set to control the pro .. are used to input information in a more user friendly way than tables. fa database ... in a . table Select the correct answer from above options...
asked Nov 28, 2021 in Education by JackTerrance
0 votes
    In each of the following pairs, which will conuct more electricity? a. Cu wire at 30∘C and 60∘C . b. ... 1M propanioc acid solutions. Select the correct answer from above options...
asked Jan 5, 2022 in Education by JackTerrance
0 votes
    A gas at 350 K and 15 bar has molar volume 20 percent smaller than that for an ideal gas under the ... attractive forces are dominant Select the correct answer from above options...
asked Jan 2, 2022 in Education by JackTerrance
0 votes
    A csv file consists of missing values that are represented by hashtags (“#”) and ampersands (“&”). How can you read this type of a csv file in R?...
asked Oct 16, 2020 in Technology by JackTerrance
0 votes
    A _____________ constraint requires that an entity belong to no more than one lower-level entity set. ... chapter Database Systems Design and Implementation of Database Management...
asked Oct 10, 2021 in Education by JackTerrance
0 votes
    If an entity set is a lower-level entity set in more than one ISA relationship, then the entity ... in section Database Systems Design and Implementation of Database Management...
asked Oct 10, 2021 in Education by JackTerrance
0 votes
    Want to remove the lines from data frame from that : Have NAs across all columns a b c d e f 1 YASH00000206234 0 ... 0 1 2 3 2 Select the correct answer from above options...
asked Jan 22, 2022 in Education by JackTerrance
0 votes
    Consider the employee work-team example, and assume that certain employees participate in more than one ... Database Interview Questions and Answers for Freshers and Experience...
asked Oct 11, 2021 in Education by JackTerrance
...