Perform some initial analysis and create visualizations using Tableau Public. Create some visual plots and charts describing the data and information it is trying to give out.

DS 510 W – Introduction to Data Science

Final Project

In this project you will investigate the impact of a number of automobile engine factors on the vehicle’s mpg. The dataset auto-mpg.csv contains information for 398 different automobile models. Information regarding the number of cylinders, displacement, horsepower, weight, acceleration, model year, origin, and car name as well as mpg are contained in the file.

Perform some initial analysis and create visualizations using Tableau Public (reference will be available in week 9).

Create some visual plots and charts describing the data and information it is trying to give out.

Using the first 300 samples in the auto-mpg.csv, run a simple linear regression and multiple linear regression to determine the relationship between mpg and appropriate independent variable/(s).  Report all the appropriate information regarding your regression.

  • Multiple R-squared
  • Adjusted R-squared
  • Complete Linear Regression equation

Maintain a log of above values for all models.

For the remaining 98 samples in the dataset, use your best linear model(s) to predict each automobile’s mpg and report how your predictions compare to the car’s actual reported mpg.

  • Residual Plot

As a part of submission, share the code and report explaining the research. You can submit your code by compiling the report on RStudio. Directions to save complete code on word / PDF file is as below.

What are the strengths and advantages of Bardach’s eight-fold path? How can Bardach’s model help with the review and understanding of a policy problem? Does the model have any shortcomings?

Bardach’s eightfold path

  • What are the strengths and advantages of Bardach’s eight-fold path? (you have to search what Bardach’s eight fold path is)
  • How can Bardach’s model help with the review and understanding of a policy problem?
  • Does the model have any shortcomings?

Why do some gangs become a danger to their broader communities?

GANGS DF#6

In Discussion Forum 6, post your response to the following discussion question. Reply to at least two classmates’ responses by the date indicated in the Course Calendar.

Why do some gangs become a danger to their broader communities?

The researchers broke these results down further using body shape as a third factor. Describe why the inclusion of this factor complicates the analysis. In other words, why is this not a standard 2×2×3 experiment?

DS-520 DATA ANALYSIS AND DECISION MAKING

FINAL EXAM

NAME:

SPIRIT ID:

 

  1. Where are your eyes?

The objectifying gaze, often referred to as “ogling” or “checking out,” can have many adverse consequences. A group of researchers used eye-tracking technology to better understand the nature and causes for this gaze. They asked 29 women and 36 men to look at images of college-aged women. Each woman had the same clothes and neutral expression but varied in body shape (ideal, average, and below average). Prior to looking at the images, each participant was told to focus on either the appearances or personalities of the women. Here is a summary of the amount of time (in milliseconds) the eyes focused on the chest of the women:

  1. Plot the means. Do you think there is an interaction? Explain your answer.
  2. Do you think the marginal means would be useful for understanding the results of this study? Explain why or why not.
  3. The researchers broke these results down further using body shape as a third factor. Describe why the inclusion of this factor complicates the analysis. In other words, why is this not a standard 2×2×3 experiment?

 

 

 

  1. A study reported the following results for data analyzed using a two-way ANOVA at the 5% significance level:
  2. What can you conclude from the information given?
  3. What additional information would you need to write a summary of the results for this study?

 

  1. 3. The effects of two stimulant drugs. An experimenter was interested in investigating the effects of two stimulant drugs (labeled A and B). She divided 25 rats equally into five groups (placebo, Drug A low, Drug A high, Drug B low, and Drug B high) and, 20 minutes after injection of the drug, recorded each rat’s activity level (higher score is more active). The following table summarizes the results:
  2. Plot the means versus the type of treatment. Does there appear to be a difference in the activity level? Explain.
  3. Is it reasonable to assume that the variances are equal? Explain your answer and, if reasonable, compute sp.
  4. Give the degrees of freedom for the F statistic.
  5. The F statistic is 2.64. Find the associated P-value and state your conclusions.

 

 

  1. The National Survey of Student Engagement found that 87% of students report that their peers at least “sometimes” copy information from the Internet in their papers without reporting the source. Assume that the sample size is 430,000.
  2. Find the margin of error for 99% confidence.
  3. Here are some items from the report that summarizes the survey. More than 430,000 students from 730 four-year colleges and universities participated. The average response rate was 43% and ranged from 15% to 89%. Institutions pay a participation fee of between $3000 and $7500 based on the size of their undergraduate enrollment. Discuss these facts as possible sources of error in this study. How do you think these errors would compare with the error that you calculated in part (a)?

 

  1. 5. Food neophobia is a personality trait associated with avoiding unfamiliar foods. In one study of 564 children who were two to six years of age, the degree of food neophobia and the frequency of consumption of different types of food were measured. Here is a summary of the correlations:

Perform the significance test for each correlation and write a summary about food neophobia and the consumption of different types of food.

 

 

BONUS:

Laptops and other digital technologies with wireless access to the Internet are becoming more and more common in the classroom. While numerous studies have shown that these technologies can be used effectively as part of teaching, there is concern that these technologies can also distract learners if used for off-task behaviors.

In one study that looked at the effects of off-task multitasking with digital technologies in the classroom, a total of 145 undergraduates were randomly assigned to one of seven conditions.12 Each condition involved performing a task simultaneously during lecture. The study consisted of three 20-minute lectures, each followed by a 15-item quiz. The following table summarizes the conditions and quiz results (mean proportion correct):

  1. For this analysis, let’s consider the average of the three quizzes as the response. Compute this mean for each condition.
  2. The analysis of these average scores results in SSG=0.22178 and SSE=2.00238. Test the null hypothesis that the mean scores across all conditions are equal.
  3. Using the means from part (a) and the Bonferroni method, determine which pairs of means differ significantly at the 0.05 significance level. (Hint: There are 21 pairwise comparisons, so the critical t-value is 3.095. Also, it is best to order the means from smallest to largest to help with pairwise comparisons.)
  4. Summarize your results from parts (b) and (c) in a short report.

Python’s pow function returns the result of raising a number to a given power. Define a function expo that performs this task, and state its computational complexity using big-0 notation.

Complete questions only 1- 3

Projects

1. A sequential search of a sorted list can halt when the target is less than a given element in the list. Define a modified version of this algorithm, and state the computational complexity, using big-0 notation, of its best-, worst-, and average-case performances.

2. The list method reverse reverses the elements in the list. Define a function amed reverse that reverses the elements in its list argument (without using the method reverse!). Try to make this function as efficient as possible, and state its computational complexity using big-0 notation.

3. Python’s pow function returns the result of raising a number to a given power. Define a function expo that performs this task, and state its computational complexity using big-0 notation. The first argument of this function is the number, and the second argument is the exponent (nonnegative numbers only). You may use either a loop or a recursive function in your implementation. Caution: do not use Python’s ** operator or pow function in this exercise!

4. An alternative strategy for the expo function uses the following recursive definition:

expo(number, exponent)

= 1, when exponent

= 0 = number * expo(number, exponent – 1), when exponent is odd

= (expo(number, exponent // 2))2, when exponent is even

Define a recursive function expo that uses this strategy, and state its computational complexity using big-0 notation.

5. Python’s 1 i st method sort includes the keyword argument reverse, whose default value is False. The programmer can override this value to sort a list in descending order. Modify the selection Sort function discussed in this chapter so that it allows the programmer to supply this additional argument to redirect the sort.

For problem size n, algorithms A and B perform n2 and 1/2 n2 + n instructions, respectively. Which algorithm does more work? Are there particular problem si for which one algorithm performs significantly better than the other? Are the particular problem sizes for which both algorithms perform approximatel e same amount of work?

Search Algorithms

Assume that each of the following expressions indicates the number of operations performed by an algorithm for a problem size of n. Point out the dominant term of each algorithm, and use big-0 notation to classify it.

  • a. 2″ — 4n2 + 5n
  • b. 3n2 + 6
  • c. n3 +n2 —n

For problem size n, algorithms A and B perform n2 and 1/2 n2 + n instructions, respectively. Which algorithm does more work? Are there particular problem si for which one algorithm performs significantly better than the other? Are the particular problem sizes for which both algorithms perform approximatel e same amount of work?

At what point does an n4 algorithm begin to perform better than a Ti algorithm?

Discuss social factors that influence people or groups to conform to the actions of others. Indicate how behaviors and motivation are impacted by the presence of others.

Instructions According to the World Health Organization (WHO), in 2019

Discuss social factors that influence people or groups to conform to the actions of others.

Indicate how behaviors and motivation are impacted by the presence of others. (How does this apply to COVID-19?)

Indicate the structures of the brain that are involved in emotion and motivation. (How could a person’s emotions related to fear drive their behaviors during this pandemic?)

Examine the article’s generalizability to various areas of psychology.

Why would some people choose to follow the orders to avoid social contact and others allow desire for human interaction to be their driving force?

Describe what a VLAN is and why it is more prevalent today in networks. What are the advantages of a VLAN? Are there any disadvantages? What are the security issues, if any with a VLAN?

Programming Question

Write a paper. It should be double-spaced and be between 9 – 12 pages. The paper will need to be double spaced and comply with either the APA or MLA styles required by the college.

Use a minimum of 7 sources (not including the book). And, your last slide (or page if you choose the paper) should contain the list of sources.

Do not plagiarize. If you use a source, show that on the slide or page. Document your sources at the end.

VLAN’s

    • Describe what a VLAN is and why it is more prevalent today in networks.
    • What are the advantages of a VLAN? Are there any disadvantages?
    • What are the security issues, if any with a VLAN?
    • Describe what trunking is. Why would you use it?
    • Create a drawing showing 2 or 3 VLAN’s and the concept of trunking. Ensure your drawing shows hosts connected to the VLAN as well as a router at the beginning of the network. Optionally, find a diagram on the Internet with this information (and provide the source).
    • For each of the VLAN’s, label the purpose of the VLAN (i.e., Accounting Department, Payroll, etc.)
    • Provide examples (from the Internet) of equipment (to include switches) that can be used with VLAN’s. Specify the model of switch that you would use.
    • Describe how to create a Wireless VLAN
    • Describe how a VLAN compares (or does not compare) to a subnet. What are the differences? What are the similarities?

What are the biggest risks and threats to the Province of New Brunswick, Canada from climate change to 2050. What information did you use to determine that?

Report on New Brunswick Province (CLIMATE CHANGE)

1. What are the biggest risks and threats to the Province of New Brunswick, Canada from climate change to 2050. What information did you use to determine that?

2. How ready is New Brunswick to manage those threats today? Is it working to become better at managing those threats and risks? How so? What is their timeline? Do you have any sense of the costs involved? Are the approaches short-term or long-term? What difference might that make?

3. What, briefly, is your thinking so far on how to address the insufficiencies? Is there a major policy shift you can suggest? Or some smaller policy changes?

4. How would you speed up their adaptation? What kinds of policy shifts are required to move thing more quickly? What are the most important initiatives to be taken in New Brunswick?

5. Can you suggest ways of educating residents about real risks and vulnerabilities, and possible adaptations? How would you get buy in to difficult decisions?

Explain the interrelationship between innovation and entrepreneurship. Compare the risks and benefits of a social-organization as it relates to change and innovation.

Innovation, Technology, and Risk

Write a 2–3 page paper in which the following are addressed:

  1. Explain the interrelationship between innovation and entrepreneurship.
  2. Compare the risks and benefits of a social-organization as it relates to change and innovation.
  3. Speculate how artificial intelligence (AI) and robotics will influence organizations in the next 10 years. Provide specific examples of how this new technology has encouraged new business creation or growth.
  4. Describe at least two risks that new technologies may present to existing industry models and economic systems.

Include at least two references from professional and reputable sources to support your assertions or arguments.