Getting started with Machine Learning in R (PCA) + BYO Data

Getting started with Machine Learning in R (PCA) + BYO Data

This is the registration page for Getting started with Machine Learning in R (PCA) + BYO Data, run by the Sydney Informatics Hub

By Sydney Informatics Hub, Core Research Facilities, DVCR, The University of Sydney

Date and time

Thu, 30 May 2024 9:30 AM - 12:00 PM AEST

Location

Function Room 277 (and online)

Merewether Building H04 The University of Sydney Camperdown, NSW 2006 Australia

About this event

  • 2 hours 30 minutes

This is the third and last of three interactive training sessions designed for you to learn more about supervised and unsupervised machine learning in R.

The content of these 3 sessions is linked and participants are encouraged to attend all three sessions to gain the most value and insight from the training series.


Lead Trainer: Dr Giorgia Mori, Data Science Trainer, Sydney Informatics Hub (SIH)


Format: This hybrid workshop will take place over a 2.5h morning session and it builds on this Machine Learning video and on the training sessions on regression and classification.


Learning outcomes: By the end of the workshop you should be able to:

  • use EDA techniques to understand the structure and characteristics of datasets;
  • use skills in data preprocessing and feature engineering to prepare data for predictive modeling tasks;
  • learn how to select appropriate modeling techniques based on the nature of the data;
  • explore dimensionality reduction methods such as principal component analysis (PCA);
  • understand the concept of feature selection and feature extraction to reduce the complexity of datasets.
  • Practice implementing dimensionality reduction techniques to address issues associated with high-dimensional data.


Who the workshop is for: This workshop is for Academic and Professional Staff, research students, and affiliates of The University of Sydney (with a valid UniKey). Please use your University of Sydney email address to register i.e. @sydney.edu.au, @uni.sydney.edu.au, etc


This workshop requires you to have:

Some knowledge of R. You should be familiar with;

  • some of the core packages of the tidyverse, including dplyr and its functions for data manipulation;
  • the magrittr pipe operator (%>%);
  • the ggplot2 package for data visualization.

You will need a laptop with R and RStudio installed.


If you have any question please contact the trainer giorgia.mori@sydney.edu.au or the training team sih.training@sydney.edu.au.


This workshop is part of a series of data science training events. If you'd like to hear when registrations open for other events, please subscribe to Sydney Informatics Hub newsletters.

Tickets

Organised by

The Sydney Informatics Hub provides support, training, and advice on research data, analyses and computing. Talk to us about your computing infrastructure, data science, digital tools and data governance needs. We can also assist you in choosing the best platforms to facilitate your workflow and collaboration. See https://www.sydney.edu.au/research/facilities/sydney-informatics-hub.html for more information.