Posts

Showing posts from December, 2019

Project: Building a Predictive Model

You are a data scientist working for the University of South Florida. Your boss wants a predictive model that automatically predicts students' graduation rates from several factors (variables). You have the College dataset (College.csv), which is also available in the ISLR package. R code Studio
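As a minimal sketch of getting the data into R, the helper below uses the copy shipped with the ISLR package when it is installed and otherwise reads a local CSV. The function name `load_college` and the default file path are assumptions made for illustration, as is the layout of the CSV (college names in the first column).

```r
# Hypothetical helper: return the College data as a data frame.
# Uses ISLR::College when the ISLR package is installed; otherwise reads a
# local CSV whose first column is assumed to hold the college names.
load_college <- function(csv_path = "College.csv") {
  if (requireNamespace("ISLR", quietly = TRUE)) {
    return(ISLR::College)
  }
  read.csv(csv_path, row.names = 1, stringsAsFactors = TRUE)
}

# Example use (needs either the ISLR package or a College.csv file):
# college <- load_college()
# str(college)                 # variable names and types
# summary(college$Grad.Rate)   # distribution of the graduation rate
```

Calling `str()` first is a quick way to confirm that `Grad.Rate` and the candidate predictors are present before any model is fitted.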

Final Project

Final Project

Step 1 Data set: College.csv - Statistics for a large number of US colleges from the 1995 issue of US News and World Report. This dataset was taken from the StatLib library, which is maintained at Carnegie Mellon University. It was used in the ASA Statistical Graphics Section's 1995 Data Analysis Exposition. Project goal: based on the College data set given in the ISLR package, I want to be able to predict students' graduation rates from several factors (variables).

Step 2 Hypothesis - The fraction of new students from the top 10% of their high school class predicts the graduation rate better than the fraction from the top 25%. Null Hypothesis - The fraction of new students from the top 10% of their high school class does not predict the graduation rate better than the fraction from the top 25%.

Step 3 R Codes I'm going to be using only the public schools from the College data set:...
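One way the hypothesis could be examined is sketched below; the post's own R code is cut off in this excerpt, so this is not the author's method. It assumes the ISLR column names `Private`, `Top10perc`, `Top25perc` and `Grad.Rate`, keeps only the public schools, fits one simple linear regression per predictor, and compares their R-squared values. The function name `compare_top_predictors` is hypothetical, and comparing R-squared is a descriptive check rather than a formal test of the null hypothesis.

```r
# Hypothetical helper: compare how well Top10perc and Top25perc each
# explain Grad.Rate among public schools (Private == "No").
compare_top_predictors <- function(college) {
  # Keep only public institutions
  public <- college[college$Private == "No", ]

  # One simple linear regression per candidate predictor
  fit10 <- lm(Grad.Rate ~ Top10perc, data = public)
  fit25 <- lm(Grad.Rate ~ Top25perc, data = public)

  # Share of variance in Grad.Rate explained by each model
  c(top10_r2 = summary(fit10)$r.squared,
    top25_r2 = summary(fit25)$r.squared)
}

# Example use (requires the ISLR package):
# library(ISLR)
# compare_top_predictors(College)
```

A larger `top10_r2` would be consistent with the stated hypothesis. Because both models have a single predictor and use the same rows, their R-squared values are directly comparable.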