Active Outline
General Information
- Course ID (CB01A and CB01B)
- CISD344H
- Course Title (CB02)
- R Programming
- Course Credit Status
- Non-Credit
- Effective Term
- Fall 2024
- Course Description
- This course is an introduction to the R programming language and its utility in big data analytics. Topics covered include data objects, data cleansing, merging and sorting, statistical analysis of data, data graphics, and visualization, and working with R-Studio.
- Faculty Requirements
- Discipline 1
- [Computer Science]
- FSA
- [FHDA FSA - COMPUTER SCIENCE]
- Course Family
- Not Applicable
Course Justification
This is a noncredit enhanced course that belongs on the certificate of completion in Database Development Practitioner. An introduction to the R programming language will be provided, which is a free software environment for statistical computing and graphics and is widely used among statisticians and data miners for developing statistical software and data analysis.
Foothill Equivalency
- Does the course have a Foothill equivalent?
- No
- Foothill Course ID
Formerly Statement
Course Development Options
- Basic Skill Status (CB08)
- Course is not a basic skills course.
- Grade Options
- Letter Grade
- Pass/No Pass
- Repeat Limit
- 99
Transferability & Gen. Ed. Options
- Transferability
- Not transferable
Units and Hours
Summary
- Minimum Credit Units
- 0.0
- Maximum Credit Units
- 0.0
Weekly Student Hours
Type | In Class | Out of Class |
---|---|---|
Lecture Hours | 4.0 | 8.0 |
Laboratory Hours | 1.5 | 0.0 |
Course Student Hours
- Course Duration (Weeks)
- 12.0
- Hours per unit divisor
- 36.0
Course In-Class (Contact) Hours
- Lecture
- 48.0
- Laboratory
- 18.0
- Total
- 66.0
Course Out-of-Class Hours
- Lecture
- 96.0
- Laboratory
- 0.0
- NA
- 0.0
- Total
- 96.0
Prerequisite(s)
Corequisite(s)
Advisory(ies)
ESL D272. and ESL D273., or ESL D472. and ESL D473., or eligibility for EWRT D001A or EWRT D01AH or ESL D005.
CIS D022A or CIS D036A or CIS D040.
Limitation(s) on Enrollment
Entrance Skill(s)
General Course Statement(s)
NONCREDIT: (This is a noncredit enhanced, CTE course.)
Methods of Instruction
Lecture and visual aids
Discussion of assigned reading
Discussion and problem solving performed in class
Collaborative learning and small group exercises
Collaborative projects
Assignments
- Reading: Required reading from the textbook and class notes
- Programs: 7-10 programming homework assignments.
- Group Project: Data exploration and visualization of assigned datasets.
Methods of Evaluation
- One or two midterm examinations requiring some programming, concepts clarification and exhibiting mastery of R programming constructs presented in the course.
- A final examination requiring concepts clarification and exhibiting mastery of data exploration, analysis and visualization principles.
- Evaluation of programming assignments and group project, based on correctness, documentation, code quality, and test plan executions.
Essential Student Materials/Essential College Facilities
Essential Student Materials:
- None
- None
Examples of Primary Texts and References
Author | Title | Publisher | Date/Edition | ISBN |
---|---|---|---|---|
Wickham, Hadley and Grolemund, Garrett: | R for Data Science: Import, Tidy, Transform, Visualize, and Model Data | O'Reilly. | January 31, 2017/1st Edition. | 978-1491910399 |
Campbell, Matthew | Learn RStudio IDE: Quick, Effective, and Productive Data Science | Apress | 2019/1st edition | 978-1484245101 |
Examples of Supporting Texts and References
None.
Learning Outcomes and Objectives
Course Objectives
- Describe R basics
- Exhibit understanding of R data objects.
- Illustrate basic data transformation concepts.
- Demonstrate extracting data from various sources.
- Perform data manipulations to enable analysis.
- Analyze data to derive patterns and hypotheses.
- Design data visualizations to demonstrate analyses.
CSLOs
- Design, implement and debug R programs to process data from various sources for data analysis.
- Use R-graphics to display and visualize data.
Outline
- Describe R basics
- What is R?
- Introduction to R and RStudio
- Installing and using R packages
- Working with R workspaces
- Exhibit understanding of R data objects.
- Vectors
- Matrices
- Data Frames
- Lists
- Local data import/export
- Illustrate basic data transformation concepts.
- Variables
- Character and String Manipulation
- Dates and Timestamps
- Regular Expressions
- Control Statements
- Functions
- Demonstrate extracting data from various sources.
- Web data capture
- API data sources
- Connecting to external data sources
- Data in single and distributed environments
- Perform data manipulations to enable analysis.
- Using 'dplyr'
- Reshaping data
- Cleansing data
- Merging data
- Splitting data
- Conversion of data
- Analyze data to derive patterns and hypotheses.
- Data architecture patterns
- Correlation clustering
- Predictive analysis
- Groupwise operations
- Data redundancy
- Descriptive statistics
- Regression
- Hypothesis testing
- Design data visualizations to demonstrate analyses.
- Core concepts of data graphics and visualization
- R graphics engines
- Base
- Grid
- Lattice
- ggplot2
- Customizing graphics with 'ggplot2'
- Titles
- Coordinate systems
- Scales
- Themes
- Axis labels
- Legends
Lab Topics
- Data types and data structures
- Flow control and looping
- Writing and calling functions
- Split/apply/combine pattern
- Working with character data and regular expressions
- Regular expressions and web scraping
- Reshaping data and database access
- Simulation
- Optimization
- Data and predictive analysis