About the course
This course is focussed more on practical learning rather than theoretical. Dealing with concrete datasets and analysis of data has made this course unique. In this course, then you will be able to learn to use datasets and create analytical charts like Bar graphs, pie charts, histograms, box plots, 3D scatter plots etc.
After completing this course, you will be able to:
- Become proficient in dealing the graphics libraries in R.
- Efficiently work with datasets.
- Generate your own analysis reports.
- Boost your hireability through innovative and independent learning.
- Get a certificate on successful completion of the course.
The course can be taken by:
Students: All students who are pursuing professional graduate/post-graduate courses in Data and statistical analysis.
Teachers/Faculties: All teachers/faculties who want to acquire new skills.
Professionals: All IT professionals from the data analysis industry.
Why learn R Programming?
R is the most useful programming language and has a very big area of application. R Programming is the best mechanism for statistics, data analysis, and machine learning. R Programming is the best approach to create reproducible, excessive-quality analysis. R programming makes applying algorithms to your data easy, and there are many packages you can use to implement your algorithmic methods and analysis. R is also expanding on another purpose as reporting and web development. There is a definite shortage of trained resources in India who can do analytics with R. The few who do have the right skills find themselves in great demand as organizations look to ramp up their R capabilities. So, it’s worth learning R Programming, if you are looking for a successful career in data science.
- 24X7 Access: You can view lectures as per your own convenience.
- Online lectures: 8 hours of online lectures with high-quality videos.
- Hands-on practice: Includes source code files for hands-on practice.
- Updated Quality content: Content is latest and gets updated regularly to meet the current industry demands.
Test & Evaluation
Each lecture will have a quiz containing a set of multiple choice questions. Apart from that, there will be a final test based on multiple choice questions.
Your evaluation will include the overall scores achieved in each lecture quiz and the final test.
Certification requires you to complete all the lectures, quizzes, and the final test. Your certificate will be generated online after successful completion of course.
Topics to be covered
- R- Basics
In this Chapter, we have started with step-by-step Installation of R and the R Studio which is a GUI based IDE for R language. We have also explained package installation on R, built in datasets in R, manual data entry, data importing, tabular to row data conversion. We have also looked at the default colors present in the data and a more elaborate color options named "Colorbrewer".
- Steps to Install 'R'
In this lesson, you will learn some of the steps to install R in your system.
- R-Studio Installation
In this lesson, you will learn to install R Studio, which is a GUI based Integrated Development Environment (IDE).
- Using R Materials
In this lesson, you will learn to download R materials and then use these R Materials in R Studio.
- R-Studio Interface
In this lesson, you will learn about the different interfaces of the R Studio such as R Script, Console Section, R Environment and Graphical Output Section.
- Steps to Install Packages
In this lesson, you will learn about various packages that are available in R and how to use them.
- Default Data-Sets in R
In this lesson, you will learn about the default datasets which are already installed in R. These are those packages which are by default installed and loaded in R.
- Manual Data Entry
R Programming provides different ways to enter the data manually. In this lesson, you will learn about manual data entry in R.
- Data Importing
In R, there are different cases in which the data is required to be imported in order to use it. In this lesson, you will learn to import the data.
- Tabular to Row Data Conversion
In R, the data has to be stored in a specific format so that it can be easily understood and used. In this lesson, you will learn to arrange the data in rows and columns.
- R - Colors
We use R color for R data manipulation, with the help of R Color, our graphical output looks a lot better. In this lesson, you will learn about R Color.
- Overview - 'Colorbrewer'
In this lesson, you will learn about an external package named RColorbrewer. By installing this package we can use R color brewer.
- Colors in R: Summary
In this lesson, we will summarize what we have learned so far in R Color, and will discuss about other applications of R Color.
- Steps to Install 'R'
- Introduction to Charts
This chapter covers the details about various charts in R. R programming has multiple libraries which can be used to create charts like Bar charts, pie charts, histograms, box-plots etc. A bar graph or a bar chart is the representation of data in bars. On other hands, a pie chart is the representation of data or values as sectors within the circle each represented with a different color to distinguish them. Box-plot is used for getting information about possible outliers in the data sample. Various ways to save plots as images has also been explained in the unit.
- Bar Charts
R language is mostly famous for graphical representation. A Bar Chart is a very good example of this. In this lesson, you will learn about the Bar Charts.
- Pie Charts
In this lesson, you will learn about Pie Charts for graphical representation. A Pie Chart is also a very good source of data representation.
The histogram is suitable for visualizing distribution of numerical data over a continuous interval, or a certain time period. In this lesson, you will learn about Histograms.
When there is a requirement of possible outliers while analyzing the data, then Box plots are used. In this lesson, you will learn about Box plots.
- Customized Graphs
In this lesson, you will learn to customize the graphs and also see the effect of customization on your graph.
In this lesson, you will learn to present the data in the image format. In order to do that, you will first import the data in the image form and then present the data through the image.
- Layering Plots: Summary
In this lesson, you will get the summary for the plotted datasets by using the Layered Plots.
- Bar Charts
- Introduction to Statistics
This chapter covers the basic concept of statistics viz frequencies, descriptive, hypothesis testing and chi-square testing in R programming. The frequency distribution of a data variable is the count of data that is occurring within a collection of non-repeated categories. Descriptive statistics gives summary statistics of the data and is the basis of advanced analysis of data. We then had a look on inferential statistics methods. In this unit, we have explained single proportion testing, single mean testing and Chi-square testing, which is used to infer results based on the sample data characteristics and hypothesized values. We have also done a univariate analysis to find patterns in the data.
In this lesson, you will now learn to calculate the frequency of data and analyze the data after changing it from frequency to density.
In this lesson, you will now learn about the descriptive statistics. These are those figures which are used for summarizing the data.
- Single Proportion Testing
In this lesson, you will learn something about inferential statistics, for which you will now be making use of the single proportion testing.
- Single Mean Testing
In this lesson, you will learn about single mean testing. Single mean testing performs mean test for a sample in comparison to an aim value.
- Chi-Square Test
In this lesson, you will learn about the Chi-Square test. This is the test which is used to determine the goodness for fit for the categorical variable.
- Univariate Analysis
In this lesson, you will learn about Univariate Analysis Data, which is used to present the sample of a variable in a amazing way.
- Descriptive Statistics: Summary
In this lesson, you will get the summary about Descriptive Statistics through a dataset.
- Manipulating Data
This chapter covers the details of working with data. We can have outliers in the data and its treatment is explained. Outliers are those observations which occur very infrequently and might be the result of errors while observing. Proper treatment of data is necessary for the unbiased result. This might includes subsetting, sorting, extracting unique observations renaming variables, creating new variables etc. Each of these tasks can be accomplished using the set of newly introduced packages.
In this lesson, you will learn to treat the present outliers in the data. In order to do this, you will use a categorical data to understand the outliers.
- Transformation of Variables
In this lesson, you will learn to transform the variable to fit better in the assumption of data analysis.
- Composite Variables
In this lesson, you will demonstrate the functionality of the composite variable using the random variable.
- Working with Missing Data
In this lesson, you will learn to deal with the missing data which are also seen often in your data. You need to treat them in such a way that your figures are not biased.
- Working with Outliers: Summary
In this lesson, you will get the complete summary of whatever you have learned so far in outliers.
- Managing Huge Data
In this chapter, we have worked with cases, subgroups and files. Any data set is like an enclosed or shelled collection. It consists of cases which are exactly the objects in the same collection with each case having one or more attributes or qualities known as variables. This lesson covers working with subgroups and merging files. Merging means that different datasets or files are combined together within a single dataset or file. R programming includes the method to merge the files.
- Working with Cases
In this lesson, you will learn the method of customizing your analysis for a particular parameter in a set of data.
- Working with Subgroups
In this lesson, you will look at the demonstration of by which you can obtain all the descriptive calculations of all the values of a particular variable at a time.
- Working with Files - Merging
In this lesson, you will learn about a very useful method of combining the different data in a same unit. This method is called as Merging.
- Working with Subgroups: Summary
In this lesson, you get a complete summary of all the analysis of this section with subgrouping.
- Working with Cases
- Association: Presentation
In this chapter, the Bar charts, Box plots and scatter plots have been demonstrated. A bar chart or a bar graph represents the data with the help of bars or rectangles. The values of the variables are determined by the height or length of the rectangle be it vertical or horizontal. A box plot is an exploratory graphic which enables us to encapsulate the features of quantitative variables. A scatter plot pairs up the values of two quantitative variables in a dataset and represent them as geometric points in the Cartesian diagram.
- Bar Charts
In this lesson you will learn about the different ways to analyze your data with the help of Bar charts.
- Box Plots
In this lesson, you will make use of the iris dataset to summarize and present data with the help of Box Plots.
- Scatter Plots
In this lesson, you will explore the quantitative relation between the variables with the help of a scatter plot using the iris dataset and the swiss dataset.
- Working with Plots: Summary
In this chapter we will summarize all the information about the appropriate section.
- Bar Charts
- Associations: Statistics
In this chapter, the statistical concepts like correlation, regression, proportions etc have been covered in detail. A correlation is a statistical method or technique to display if there is a relation between pairs of variables or how strongly the pairs of variables are related. Regression is the most critical fundamental tool for statistical analysis frequently used in various research fields. Bivariate regression is the simplest linear regression procedure. Then we also demonstrated a few tests as well in the later part of the chapter such as T-test, one-factor analysis of variance, proportions etc.
In this lesson, you will learn about the correlation. In mathematical terms, Correlation is equivalent to the covariance of the two variables divided by the product of the standard deviation of each data sample.
- Bivariate Regression
In this lesson, you will explore Bivariate Regression with the help of appropriate line regression and vector equations.
In this lesson, you will learn about T-Statistics by comparing the calculated values of two samples with the T-test using the Iris Dataset.
- Paired T-Test
In this lesson, you will learn to examine the difference between two samples by creating two Random Variables, through Paired T-test.
In this lesson, you will learn to test the similarities of the content from two populations or groups with ANOVA test.
In this lesson, you will learn to compare the categorical groups with the help of proportion.
- Chi-Square Test
In this lesson, you will learn to make use of the Chi-Square test to perform independent testing between the two specific variables.
- Statistics for Bivariate Associations
In this lesson, you will learn about the statistics of Bivariate Associations using some packages available in R.
- Association Stats: Summary
In this lesson, you will get the summary of all the testing and other statistics of the appropriate section which you have used in this section.
- Advanced Charts
In this chapter, the method of creating bar charts for mean, scatter plots for grouped data, scatter plot matrices and a very interesting and visual 3D scatter plot have been covered in detail..
- Bar Charts for Mean
In this lesson, you will learn about drawing the bar charts for multiple variables defined by different categories.
- Scatter Plots for Grouped Data
In this lesson, you will learn to plot a Scatter Plot for multiple variables by loading the CSV file in R.
- Scatter Plot Matrices
In this lesson, you will learn to plot a Scatter Plot by loading the matrix data.
- 3D Scatter Plots
In this lesson, you will learn to plot a grouped data with the help of 3D Scatter Plot.
- Charts for Multiple Variables: Summary
In this lesson, you will get the complete summary of various plotting techniques and Bar Charts.
- Bar Charts for Mean
- Multiple Variable Statistics
In this chapter, some relatively advanced topics such as multiple regression, two factor ANOVA, cluster analysis and principal component & factor analysis have been covered in detail. These topics are very important specially multiple regression which is used very extensively in research papers and industry to establish the relationship between variables.
- Multiple Regression
In this lesson, you will learn about Multiple Variable Statistics, which is the most common tool of Multiple Regression with the help of an inbuilt database. This tool is used to analyze the data.
- Two-Factor ANOVA
In this lesson, you will learn to analyze two-factor ANOVA with the help of toothgrowth, through which you will learn to interact between two-categorical terms.
- Cluster Analysis
In this lesson, you will learn about cluster analysis which creates clusters or groups based on the values of variables.
- Principal Component / Factor Analysis
In this lesson, you will learn to search the components using Principal component analysis which will explain the most viewed variations in the data.
- Multiple Variable Statistics: Summary
In this lesson, the course will be concluded with the summary of whatever has been covered in this section so far.
- R Programming Final Quiz
In this lesson, there is a Quiz containing a set of questions. This test is for self evaluation of the candidate of the overall understanding of the course content. The course will not be considered successfully completed if this quiz is omitted or ignored.
- Multiple Regression