Advanced Machine Learning Assignment — 2022

Submission

Please submit your solution electronically via vUWS. (1) Submit a report as PDF via Turnitin. (2) Create a zip file with your code (use zip, do not use rar), and any other file you want to submit, and upload it to vUWS (to where you got this assignment text), and please include the signed and completed cover sheet that you can find at the end of the document.

Submission is due on 2 Nov 2022, 11:59pm.

Miniracer

Figure 1: 4 frames from miniracer. There are three possible values for a pixel: +2 for the car (the dark 2  x 2 square), 0 for drivable track segments (1   6 pixels, in white), and +1 for non-drivable terrain (here in grey). When the front of the car bumps into non-drivable terrain, the episode finishes. The rear of the car is allowed to go off road.

In this assignment we work with data and a simulation of a simple racing game. The car is represented by the black square in the screenshots above.

In this game, the car remains at the bottom of the screen, and can either move left, right, or keep the current position. At every step, the track scrolls down by one, simulating the driving car. The size of the screenshot is 16 × 16 pixels.

Preparation Download the minirace.py and sprites.py python files. The class Minirace implements the racing game simulation. Running sprites.py will create datasets of screenshots for your first task.

A new racing game can be created like here:

from minirace import Minirace therace = Minirace(level=1)

×  

In this, level sets the information a RL agent gets from the environment. The car is 2 2 pixels, and cannot leave the field. The track segments are 6 pixels wide, and have positions from 1 (left) to 5 (right), and the car has 7 different positions (from 0 to 6). The front of the car (in the second row from the bottom, row 1) must remain on

drivable terrain at all times. The rear of the car (in the first row from the bottom, row 0) is allowed to come off road with no penalty.

At each step during a race, the agent will get a reward of +1. Once the front of the car comes off road, the episode finishes.

Task 1: Train a CNN to predict a clear road ahead                                                                                                                                   15 points The python program sprites.py creates a training and test set of “minirace” scenes,

×  

trainingpix.csv (1024 examples) and testingpix.csv (256 examples). Each row represents a 16 16 screenshot (flattened in row-major order), plus an extra value of either 0 or 1 that indicates if the car can safely drive straight without going off-road in the immediate next step (i.e., there are 257 columns).

Steps

  1. Create the datasets by running the sprites.py code.
  2. Create a CNN that predicts the whether the car can safely remain on the current position (i.e., drive straight) without crashing into non-drivable terrain.
    • Describe (no programming): what is a good loss function for this problem?
    • Implement and train the CNN on the training set.
    • Compute the accuracy of your model on the test data set.
  3. Your are free to choose the architecture of your network, but there should be at least one convolutional layer.
  4. You can normalise/standardise the data if it helps improve the training.

What to submit:

  • A description of your CNN and the training. Calculate the size of each layer, and include it in the description.
  • Include the explanation for the loss function in your description.
  • For how long did you train your model (number of epochs, time taken)? What is the performance on the test set?
  • Submit the python code for your solution (either as .py or .ipynb).

Task 2: Train a convolutional autoencoder                                                                                                                                     10 points

Create a convolutional autoencoder that compresses the racing game screenshots to a small number of bytes (the encoder), and transforms them back to original (in the de- coder part).

Steps

  1. Create and train an undercomplete convolutional autoencoder and train it using the training data set from the first task.
  2. You can choose the architecture of the network and size of the representation h = f (x). The goal is to learn a representation that is smaller than the original, and still leads to recognisable reconstructions of the original.
  3. (No programming): Explain the difference between an undercomplete and a de- noising autoencoder.
×  

(No programming): The input images are 16 16 = 256 pixels. What is the size of your hidden representation h = f (x) (the middle layer size of your autoencoder). Include your calculation in your report.

What to submit:

  • Submit the python code of your undercomplete autoencoder (either as .py or

.ipynb).

  • For your report, write a brief description of your steps to create the model and your prediction. Include the description undercomplete vs. denoising autoencoder, and your calculations. How do you measure the quality of your model?
  • Include screenshots of 1-2 output images next to the original inputs (e.g., select a good and a bad example).

Task 3: Create a RL agent for Minirace (level 1)                                                                                                                                   15 points The code in minirace.py provides an environment to create an agent that can be

trained with reinforcement learning (a complete description at the end of this sheet). The following is a description of the environment dynamics:

  • The square represents the car, it is 2 pixels wide. The car always appears in the bottom row, and at each step of the simulation the track scrolls by one row below the car.
  • The agent can control the steering of the car, by moving it two pixels to the left or right. The agent can also choose to do nothing, in which case the car drives straight. The car cannot be moved outside the boundaries.
  • The agent will receive a positive reward at each step where the front part of the car is still on track.
  • An episode is finished when the front of the car hits non-drivable terrain.

In a level 1 version of the game, the observed state (the information made available to the agent after each step) consists of one number: dx. It is the relative position of the middle of the track right in front of the car (i.e., the piece of track in the third row from the bottom of the image). When the track turns left in front of the car, this value will be negative, and when the track turns right, dx is positive. As the track is six pixels wide, the car can drive either on the left, middle, or right of a piece of track (it does not need to drive in the middle of the road).

For this task, you should initialise the simulation like this:

therace = Minirace(level=1)

When you run the simulation, step() returns dx (…, 2, 1, 0, 1, 2, …) for the state.

Steps

  1. Manually create a policy (no RL) that successfully plays drives the car, just se- lecting actions based on the state information. The minirace.py code contains a function mypolicy() that you should modify for this task.
  2. (No programming) How many different values for dx are possible in theory (if you ignore that the car may crash)? If you were to create a tabular reinforcement learning agent, what size is your table for this problem (number of rows and columns)?
  3. Create a (tabular or deep) TD agent that learns to drive. If you decide to use ϵ– greedy action selection, set ϵ = 1, initially, and reduce it during your training to a minimum of 0.01. Keep your training going until you are either happy with the result or the performance does not improve1.

When you run your training, reset the environment after every episode. Store the sum of rewards. After or during the training, plot the total sum of rewards per episode. This plot — the Training Reward plot — indicates the extent to which your agent is learning to improve his cumulative reward. It is your decision when

1This means: do not stop just because ϵ reached 0.01 – you may want to stop earlier, or you may want to keep going, just do not reduce ϵ any further.

to stop training. It is not required to submit a perfectly performing agent, but show how it learns.

  • After you decide the training to be completed, run 50 test episodes using your trained policy, but with ϵ = 0.0 for all 50 episodes. Again, reset the environment at the beginning of each episode. Calculate the average over sum-of-rewards-per- episode (call this the Test-Average), and the standard deviation (the Test-Standard- Deviation). These values indicate how your trained agent performs.

What to submit:

  • Submit the python code of your solutions (both the manual strategy, and the code of your RL learner).
  • For your report, describe the solution, mention the Test-Average and Test-Standard- Deviation, and include the Training Reward plot described above. After how many episodes did you decide to stop training, and how long did it take?

Task 4: Create a RL agent for Minirace (level 2)                                                                                                                                   10 points In a level 2 version of the game, the observed state (the information made available to

the agent after each step) consists of two numbers: dx1, dx2. The first value (dx1) is the same as dx in level 1 – the relative position of the (middle of the) track in front of the car. The second value (dx2) is the position of the subsequent track (in row 4), relative to the track in front of the car (in row 3).

A second difference is that the track can be more curved: sometimes the track will only overlap on the left or right edge. This means the agent cannot always drive in the middle of the track, because the car can only move one step to the left or right at a time.

For this task, you can initialise like this:

therace = Minirace(level=2)

In the level, step() returns two unnormalised pixel difference values (i.e., two values from …, 2, 1, 0, 1, 2, …).

Steps

  1. Create a RL agent (using a RL method of your choice) that finds a policy using (all) level 2 state information. A suggested discount factor is γ = 0.95.
  2. You can choose the algorithm (a tabular approach, deep TD or deep policy gradi- ent).
  3. Try to train an agent that achieves a running reward > 50 (the minirace.py

file has an example for how to calculate this).

  • If you use a neural network, not go overboard with the number of hidden layers as this will significantly increase training time. Try one hidden layer.
  • Write a description explaining how your approach works, and how it performs. If some (or all) of your attempts are unsuccessful, also describe some of the things that did not work, and which changes made a difference.

What to submit:

  • Submit the python code of your solutions.
  • For your report, describe the solution, mention the Test-Average and Test-Standard- Deviation, and include the Training Reward plot described above.

Tips

  1. For the RL-tasks, it often takes some time until the learning picks up, but they should not take hours. If the agent doesn’t learn, explore different learning rates. For Adam, try values between 5e-3 (faster) and 1e-4 (slower).
  2. Even if the learning does not work, remember that we would like to see that you understood the ideas behind the code. Describe the ideas that you tried, and still submit your code but say what the problem was.

Minirace python code

If you put the minirace.py file into your working directory, you can import the class like this:

from  minirace import  Minirace therace = Minirace( level=1)

The Minirace class has several functions that you will have to use. The file contains an example and an explanation for many of the functions (check it out), but here is a brief list:

therace = Minirace( level=level)

n = therace. observationspace() state = therace. state()

state = therace. transition( action) done = therace. terminal()

r = therace. reward()

state , r, done = therace. step( action) state = therace. reset()

action = therace. sampleaction() therace. render( text = False , reward=r) x, z, d = therace.s1

pix = therace. to pix(x, z)

You can ask or answer questions about how to use the files provided with this assign- ment on discord, as long as they are general python / programming questions, for exam- ple if the code provided does not work for you as expected. You must not ask or answer questions to the machine learning questions in this assignment anywhere, including dis- cord. If in doubt, ask your friendly lecturers or tutor first.

Assignment Cover Sheet

School of Computer, Data, and Mathematical Sciences

Student Name 
Student Number 
Unit Name and NumberINFO7001: Advanced Machine Learning
Title of AssignmentAssignment 1
Due Date2 Nov 2022
Date Submitted 
DECLARATION I hold a copy of this assignment that I can produce if the original is lost or damaged.   I hereby certify that no part of this assignment/product has been copied from any other student’s work or from any other source except where due acknowledgement is made in the assignment. No part of this assignment/product has been written/pro- duced for me by another person except where such collaboration has been authorised by the subject lecturer/tutor concerned. Signature: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . (Note: An examiner or lecturer/tutor has the right not to mark this assignment if the above declaration has not been signed)

Task 1    Task 2    Task 3    Task 4    Total

Mark

Possible       15          10          15          10         50

The maximum points possible for this assignment is 50.

Order Now

Get expert help for Advanced Machine Learning Assignment and many more. 24X7 help, plag-free solution. Order online now!

Universal Assignment (April 20, 2024) Advanced Machine Learning Assignment — 2022. Retrieved from https://universalassignment.com/advanced-machine-learning-assignment-2022/.
"Advanced Machine Learning Assignment — 2022." Universal Assignment - April 20, 2024, https://universalassignment.com/advanced-machine-learning-assignment-2022/
Universal Assignment November 4, 2022 Advanced Machine Learning Assignment — 2022., viewed April 20, 2024,<https://universalassignment.com/advanced-machine-learning-assignment-2022/>
Universal Assignment - Advanced Machine Learning Assignment — 2022. [Internet]. [Accessed April 20, 2024]. Available from: https://universalassignment.com/advanced-machine-learning-assignment-2022/
"Advanced Machine Learning Assignment — 2022." Universal Assignment - Accessed April 20, 2024. https://universalassignment.com/advanced-machine-learning-assignment-2022/
"Advanced Machine Learning Assignment — 2022." Universal Assignment [Online]. Available: https://universalassignment.com/advanced-machine-learning-assignment-2022/. [Accessed: April 20, 2024]

Please note along with our service, we will provide you with the following deliverables:

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

Categories

Get 90%* Discount on Assignment Help

Most Frequent Questions & Answers

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

Popular Assignments

Assignment: Implement five dangerous software errors

Due: Monday, 6 May 2024, 3:00 PM The requirements for assessment 1: Too many developers are prioritising functionality and performance over security. Either that, or they just don’t come from a security background, so they don’t have security in mind when they are developing the application, therefore leaving the business

Read More »

LNDN08003 DATA ANALYTICS FINAL PROJECT

Business School                                                                 London campus Session 2023-24                                                                   Trimester 2 Module Code: LNDN08003 DATA ANALYTICS FINAL PROJECT Due Date: 12th APRIL 2024 Answer ALL questions. LNDN08003–Data Analytics Group Empirical Research Project Question 2-The project (2500 maximum word limit) The datasets for this assignment should be downloaded from the World Development Indicators (WDI)

Read More »

Microprocessor Based Systems: Embedded Burglar Alarm System

ASSIGNMENT BRIEF 2023/24 Microprocessor Based Systems   Embedded Burglar Alarm System Learning Outcomes This assignment achieves the following learning outcomes:   LO 2 -Use software for developing embedded systems in ‘C’ and testing microcontroller systems including the use of design tools such as Integrated Development Environments and In Circuit Debugger.

Read More »

Imagine you are an IT professional and your manager asked you to give a presentation about various financial tools used to help with decisions for investing in IT and/or security

Part 1, scenario: Imagine you are an IT professional and your manager asked you to give a presentation about various financial tools used to help with decisions for investing in IT and/or security. The presentation will be given to entry-level IT and security employees to understand financial investing. To simulate

Read More »

DX5600 Digital Artefact and Research Report

COLLEGE OF ENGINEERING, DESIGN AND PHYSICAL SCIENCES BRUNEL DESIGN SCHOOL DIGITAL MEDIA MSC DIGITAL DESIGN AND BRANDING MSC DIGITAL DESIGN (3D ANIMTION) MSC DIGITAL DESIGN (MOTION GRAPHICS) MSC DIGITAL DESIGN (IMMERSIVE MIXED REALITY) DIGITAL ARTEFACT AND RESEARCH REPORT                                                                 Module Code: DX5600 Module Title: MSc Dissertation Module Leader: XXXXXXXXXXXXXXXXX Assessment Title:

Read More »

Bsc Public Health and Health Promotion (Top up) LSC LONDON

Health and Work Assignment Brief.                 Assessment brief: A case study of 4,000 words (weighted at 100%) Students will present a series of complementary pieces of written work that:   a) analyse the key workplace issues; b) evaluate current or proposed strategies for managing them from a public health/health promotion perspective

Read More »

6HW109 Environmental Management and Sustainable Health

ASSESSMENT BRIEF MODULE CODE: 6HW109 MODULE TITLE: Environmental Management and Sustainable Health MODULE LEADER: XXXXXXXXX ACADEMIC YEAR: 2022-23 1        Demonstrate a critical awareness of the concept of Environmental Management linked to Health 2        Critically analyse climate change and health public policies. 3        Demonstrate a critical awareness of the concept of

Read More »

PROFESSIONAL SECURE NETWORKS COCS71196

PROFESSIONAL SECURE NETWORKS– Case Study Assessment Information Module Title: PROFESSIONAL SECURE NETWORKS   Module Code: COCS71196 Submission Deadline: 10th May 2024 by 3:30pm Instructions to candidates This assignment is one of two parts of the formal assessment for COCS71196 and is therefore compulsory. The assignment is weighted at 50% of

Read More »

CYBERCRIME FORENSIC ANALYSIS – COCS71193

CYBERCRIME FORENSIC ANALYSIS – COCS71193 Assignment Specification Weighted at 100% of the module mark. Learning Outcomes being assessed by this portfolio. Submission Deadline: Monday 6th May 2024, 1600Hrs. Requirements & Marking Scheme General Guidelines: This is an individual assessment comprised of four parts and is weighted at 100% of the

Read More »

Social Media Campaigns (SMC) Spring 2024 – Winter 2024

Unit: Dynamic Websites Assignment title: Social Media Campaigns (SMC) Spring 2024 – Winter 2024 Students must not use templates that they have not designed or created in this module assessment. This includes website building applications, free HTML5 website templates, or any software that is available to them to help with

Read More »

ABCJ3103 NEWS WRITING AND REPORTING Assignment

ASSIGNMENT/ TUGASAN _________________________________________________________________________ ABCJ3103 NEWS WRITING AND REPORTING PENULISAN DAN PELAPORAN BERITA JANUARY 2024 SEMESTER SPECIFIC INSTRUCTION / ARAHAN KHUSUS Jawab dalam bahasa Melayu atau bahasa Inggeris. Jumlah patah perkataan: 2500 – 3000 patah perkataan tidak termasuk rujukan. Hantar tugasan SEKALI sahaja dalam PELBAGAIfail. Tugasan ini dihantar secara ONLINE. Tarikh

Read More »

ABCM2103 INFORMATION TECHNOLOGY, MEDIA AND SOCIETY Assignment

ASSIGNMENT/ TUGASAN _________________________________________________________________________ ABCM2103 INFORMATION TECHNOLOGY, MEDIA AND SOCIETY TEKNOLOGI MAKLUMAT, MEDIA DAN MASYARAKAT JANUARY 2021 SPECIFIC INSTRUCTION / ARAHAN KHUSUS Jawab dalam Bahasa Melayu atau Bahasa Inggeris. Jumlah patah perkataan : 2500 – 3000 patah perkataan tidak termasuk rujukan. Hantar tugasan SEKALI sahaja dalam SATU fail. Tugasan ini dihantar

Read More »

ABCR3203 COMMUNICATION LAW Assignment

ASSIGNMENT/ TUGASAN _________________________________________________________________________ ABCR3203 COMMUNICATION LAW UNDANG-UNDANG KOMUNIKASI JANUARY 2024 SEMESTER SPECIFIC INSTRUCTION / ARAHAN KHUSUS Jawab dalam Bahasa Melayu atau Bahasa Inggeris. Jumlah patah perkataan : 2500 – 3000 patah perkataan tidak termasuk rujukan. Hantar tugasan SEKALI sahaja dalam SATU fail. Tugasan ini dihantar secara ONLINE. Tarikh penghantaran        :

Read More »

ORGANISATIONAL STRATEGY PLANNING AND MANAGEMENT ASSIGNMENT

POSTGRADUATE DIPLOMA IN BUSINESS MANAGEMENT ORGANISATIONAL STRATEGY PLANNING AND MANAGEMENT ASSIGNMENT NOTE: At postgraduate level, you are expected to substantiate your answers with evidence from independent research. INTRODUCTION TO THE ASSIGNMENT • This assignment consists of FOUR compulsory questions. Please answer all of them. • When you answer, preferably use

Read More »

Solution: Scenario 1, Mirror therapy in patients post stroke

Title: Scenario 1, Mirror therapy in patients post stroke Part 1 : Summary Ramachandran and colleagues developed mirror therapy to treat amputees’ agony from phantom limbs. Patients were able to feel their amputated limb without experiencing any pain by presenting them a mirror image of their healthy arm. Since then,

Read More »

Solution: Exploring the Dominance of Silence

Slide 1: Title – Exploring the Dominance of Silence The title, “Exploring the Dominance of Silence,” sets the stage for a deep dive into the portrayal of silence in Philip K. Dick’s “Do Androids Dream of Electric Sheep?” Our presentation will dissect the literary techniques used by the author to

Read More »

Solution: Assessment: Critical Reflection S2 2023

The policies that hampered the cultural survival of Indigenous groups have a major effect on their health (Coffin, 2007). Cultural isolation can cause an identity crisis and a sense of loss, which can exacerbate mental health problems. Indigenous people have greater rates of chronic illness and impairment due to historical

Read More »

Solution: The Market – Product and Competition Analysis

Section 1: The Market – Product and Competition Analysis Industry and Competition Analysis: The baking mix market is very competitive, but My Better Batch is entering it anyhow. The prepackaged baking mixes sold in this market allow busy people to have bakery-quality products on the table quickly without sacrificing quality

Read More »

Solution: PDCA model for Riot

Student Name: Student ID: University Name: Date: Learning Outcome 1: Engage actively in recognizing a new product/service for Riot and detect the vital tasks required for its effective growth. In this comprehensive learning outcome, Riot’s progress towards innovation superiority is characterized by a deliberate scheme that draws on components from

Read More »

Solution: EDEN 100 – ASSIGNMENT 1

Part 1: Reflections on the Register Variables Use the questions in Column 1 and analyse the sample oral interactions provided under the assessment tile. The transcript for Viv’s conversation is provided on pages 4-5. Probe Questions  Link to readings and theory Interaction 1 Interaction 2 PART 1 – ANALYSING THE

Read More »

Solution: TCP/IP Questions

Table of Contents Question 1. 1 1. IPSec datagram protocol 1 2. Source and destination IP addresses in original IP datagram.. 1 3. Source and destination IP addresses in new IP header 2 4. Protocol number in the protocol field of the new IP header 2 5. Information and Bob.

Read More »

Solution: Fundamentals of Employment Assistance Program and Counselling

ASSESSMENT 3 Subject: Fundamentals of Employment Assistance Program and Counselling Case study Question 1 a)     Major Issues for Theo that could be addressed in counselling: b)    Issues to Address First in Short-Term Counselling:             The cognitive processes of memory, focus, and decision-making are all impacted by insufficient sleep. Such cognitive

Read More »

Solution: EQUITY AND INCLUSION IN EARLY CHILDHOOD IN AUSTRALIA

Written Policy Recommendation Name: Student Number: Email: Date: Introduction: The early years of a child’s life are important for their holistic development, making early childhood education a foundation for their future accomplishments. Nevertheless, guaranteeing equality and inclusion in early childhood education stays a major problem in our society. This policy

Read More »

Solution: Report Health Issue

Table of Contents Executive Summary                                                                                                   3 Introduction                                                                                                                5 Examination of the Chosen Health Issue in the Context of Lambeth                        5 Application of Health Inequality Framework and Analysis of Determinants: Psychotropic Drug Use in Lambeth                                                                           6 Exploration and Discussion of Strategies to Manage Psychotropic Drug Use in Lambeth                                                                                                                        7 Conclusion                                                                                                                  8

Read More »

Solution: Section III: Marketing

Section III: Marketing Channels for Advertising: Understanding Who Makes Baking Product Purchase Decisions is Crucial for My Better Batch’s Business Success (Sampson et al, 2017). Home bakers may make up a disproportionate share of the decision-makers in the UK. As a result, My Better Batch has to target people, especially

Read More »

Can't Find Your Assignment?

Open chat
1
Free Assistance
Universal Assignment
Hello 👋
How can we help you?