Fully discuss whether we should omit a predictor from the modeling stage if it does not reflects any connection with the target variable in the EDA stage, and why.

foass77W 2020-12-30 Answered
Fully discuss whether we should omit a predictor from the modeling stage if it does not reflects any connection with the target variable in the EDA stage, and why.
You can still ask an expert for help

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Solve your problem for the price of one coffee

  • Available 24/7
  • Math expert for every subject
  • Pay only if we can solve it
Ask Question

Expert Answer

Nichole Watt
Answered 2020-12-31 Author has 100 answers

Step 1 In simple linear regression method we find the relationship between two variables that is dependent variable and independent variable by using scatter diagram is the graphical method to check the relation between two variables. The simple linear regression equation is given by , Y=β0+β1X Where Y is dependent variable X is inependent variable β0 is intercept of regression line β1 is the slope of the regression line In machine learning we called dependent variable (Y-variable) as target variable and independent (or Predictor)variable(X-variable) as feature vector.

Step 2 Yes ,we should omit a predictor from the modeling stage if it does not reflects any connection with the target variable in the EDA stage. Exploratory data analysis is the method of analyzing data sets to summarize their main characteristics within data visualization .This method was discovered by John Tukey. This step is important before you starting the machine learning or modelling of your data. In exploratory data analysis many graphical methods are available to check the relationship between two variables for example scatter plot, multi-vari chart, run chart ,pareto chart using the techniques we check the relationship between two variables if it does not show any relationship then we omit that predictor variable . Model specification is the method to determine which independent variables are included and excluded from the regression equation. Sometimes investigator measure too many variables but include some of them only and omit the variable that does not show any relationship with dependent or target variable .If investigator omits important variable from model the estimates for the variables that included can be biased and this is known as omitted variable bias . and it increase the bias in our model. To avoid bias in regression we omit variable that does not show any reflect connection with the target variable.

Not exactly what you’re looking for?
Ask My Question

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Relevant Questions

asked 2021-09-17
The function y=3.5x+2.8 represents the cost y (in dollars) of a taxi ride of x miles.
a. Identify the independent and dependent variables.
b. You have enough money to travel at most 20 miles in the taxi. Find the domain and range of the function.
asked 2021-09-07
A city water department is proposing the construction of a new water pipe, as shown. The new pipe will be perpendicular to the old pipe. Write an equation that represents the new pipe.
asked 2021-06-22
Describe mathematical modeling in you r own words.
asked 2021-05-28
Use the strategy for solving word problems, modeling the verbal conditions of the problem with a linear inequality. A company manufactures and sells blank audiocassette tapes. The weekly fixed cost is $10,000 and it costs $0.40 to produce each tape. The selling price is $2.00 per tape. How many tapes must be produced and sold each week for the company to generate a profit?
asked 2021-09-22
The expression 728
asked 2021-09-22
What term describes the use of mathematical equations in the modeling of linear aspects of ecosystems? a. analytical modeling, b. simulation modeling, c. conceptual modeling, d. individual-based modeling.
asked 2021-09-20
The area (in square centimeters) of a square coaster can be represented by d2+8d+16.
a. Write and expression that represents the side length of the coaster.
b. Write and expression for the perimeter of the coaster.