
Background
Early Incident Identification
Disc Consulting Enterprises (DCE) has identified some potentially suspicious attacks on their network and computer systems. The attacks are thought to be a new type of attack from a skilled threat actor. To date, the attacks have only been identified ‘after the fact’ by examining post-exploitation activities of the attacker on compromised systems.
Unfortunately, the attackers are skilled enough to evade detection and the exact mechanisms of their exploits have not been identified.
The incident response team, including IT services, security operations, security architecture, risk management, the CISO (Chief Information Security Officer), and the CTO (Chief Technology Officer) have been meeting regularly to determine next steps.
It has been suggested that the security architecture and operations teams could try to implement some real-time threat detection using machine learning models that build on earlier consultancy your firm has completed (i.e., building upon your Assessment 1 work).
Data description
The data have already been provided (in Assessment 1), and the ML team (you) have undertaken some initial cleaning and analysis.
Things to keep in mind:
- Each event record is a snapshot triggered by an individual network ‘packet’. The exact triggering conditions for the snapshot are unknown. But it is known that multiple packets are exchanged in a ‘TCP conversation’ between the source and the target before an event is triggered and a record created. It is also known that each event record is anomalous in some way (the SIEM logs many events that may be suspicious).
- The malicious events account for a very small amount of data. As such, your training needs to consider the “imbalanced” data and the effect these data may have on accuracy (both specificity and sensitivity).
A very small proportion of the data are known to be corrupted by their source systems and some data are incomplete or incorrectly tagged. The incident response team indicated this is likely to be less than a few hundred records.
Assembled Payload Size (continuous) | The total size of the inbound suspicious payload. Note: This would contain the data sent by the attacker in the “TCP conversation” up until the event was triggered |
DYNRiskA Score (continuous) | An un-tested in-built risk score assigned by a new SIEM plug-in |
IPV6 Traffic (binary) | A flag indicating whether the triggering packet was using IPV6 or IPV4 protocols (True = IPV6) |
Response Size (continuous) | The total size of the reply data in the TCP conversation prior to the triggering packet |
Source Ping Time (ms) (continuous) | The ‘ping’ time to the IP address which triggered the event record. This is affected by network structure, number of ‘hops’ and even physical distances. E.g.: < 1 ms is typically local to the device1-5ms is usually located in the local network5-50ms is often geographically local to a country~100-250ms is trans-continental to servers250+ may be trans-continental to a small network. Note, these are estimates only and many factors can influence ping times. |
Operating System (Categorical) | A limited ‘guess’ as to the operating system that generated the inbound suspicious connection. This is not accurate, but it should be somewhat consistent for each ‘connection’ |
Connection State (Categorical) | An indication of the TCP connection state at the time the packet was triggered. |
Connection Rate (continuous) | The number of connections per second by the inbound suspicious connection made prior to the event record creation |
Ingress Router (Binary) | DCE has two main network connections to the ‘world’. This field indicates which connection the events arrived through |
Server Response Packet Time (ms) (continuous) | An estimation of the time from when the payload was sent to when the reply |
packet was generated. This may indicate server processing time/load for the event | |
Packet Size (continuous) | The size of the triggering packet |
Packet TTL (continuous) | The time-to-live of the previous inbound packet. TTL can be a measure of how many ‘hops’ (routers) a packet has traversed before arriving at our network. |
Source IP Concurrent Connection (Continuous) | How many concurrent connections were open from the source IP at the time the event was triggered |
Class (Binary) | Indicates if the event was confirmed malicious, i.e. 0 = Non-malicious, 1 = Malicious |
The needle in the haystack
The data were gathered over a period of time and processed by several systems in order to associate specific events with confirmed malicious activities. However, the number of confirmed malicious events was very low, with these events accounting for less than 1% of all logged network events.
Because the events associated with malicious traffic are quite rare, rate of ‘false negatives’ and ‘false positives’ are important.
Scenario
Following the meetings of the security incident response team, it has been decided to try to make an ‘early warning’ system that extends the functionality of their current SIEM. It has been proposed that DCE engage 3rd party developers to create a ‘smart detection plugin’ for the SIEM.
The goal is to have a plug-in that would extract data from real-time network events, send it to an external system (your R script) and receive a classification in return.
However, for the plugin to be effective it must consider the alert-fatigue experienced by security operations teams as excessive false-positives can lead to the team ignoring real incidents. But, because the impact of a successful attack is very high, false negatives could result in attackers overtaking the whole network.
You job
Your job is to develop the detection algorithms that will provide the most accurate incident detection. You do not need to concern yourself about the specifics of the SIEM plugin or software integration, i.e., your task is to focus on accurate classification of malicious events using R.
You are to test and evaluate two machine learning algorithms to determine which supervised learning model is best for the task as described.
Task
You are to import and clean the same MLData2023.csv, that was used in the previous assignment. Then run, tune and evaluate two supervised ML algorithms (each with two types of training data) to identify the most accurate way of classifying malicious events.
Part 1 – General data preparation and cleaning
- Import the MLData2023.csv into R Studio. This version is the same as Assignment 1.
- Write the appropriate code in R Studio to prepare and clean the MLData2023 dataset as follows:
- Clean the whole dataset based on what you have suggested / feedback received for Assignment 1.
- Filter the data to only include cases labelled with Class = 0 or 1.
- For the feature Operating.System, merge the three Windows categories together to form a new category, say Windows_All. Furthermore, merge iOS, Linux (Unknown), and Other to form the new category named Others. Hint: use the forcats:: fct_collapse(.) function.
- Similarly, for the feature Connection.State, merge INVALID, NEW and RELATED for form the new category named Others.
- Select only the complete cases using the na.omit(.) function, and name the dataset MLData2023_cleaned.
Briefly outline the preparation and cleaning process in your report and why you believe the above steps were necessary.
- Use the code below to generated two training datasets (one unbalanced mydata.ub.train and one balanced mydata.b.train) along with the testing set (mydata.test). Make sure you enter your student ID into the command set.seed(.).
For each of your two ML modelling approaches, you will need to:
statistics (i.e. CV results, tables and plots), where appropriate. If you are using repeated CVs, a minimum of 2 repeats are required.
For the precision, recall and F-score metrics, you will need to do a bit of research as to how they can be calculated. Make sure you define each of the above metrics in the context of the study.
What to submitGather your findings into a report (maximum of 5 pages) and citing relevant sources, if necessary. Present how and why the data was ‘cleaned and prepared’, how the ML models were tuned and provide the relevant CV results. Lastly, present how they performed to each other in both the unbalanced and balanced scenarios. You may use graphs, tables and images where appropriate to help your reader understand your findings. All tables and figures should be appropriately captioned, and referenced in-text. Make a final recommendation on which ML modelling approach is the best for this task. Your final report should look professional, include appropriate headings and subheadings, should cite facts and reference source materials in APA-7th format. Your submission must include the following:
Note that no marks will be given if the results you have provided cannot be confirmed by your code. No more than 20% of your code can be from online resources, including ChatGPT. Furthermore, all pages exceeding the 5-page limit will not be read or examined. Marking Criteria
Academic Misconduct Edith Cowan University regards academic misconduct of any form as unacceptable. Academic misconduct, which includes but is not limited to, plagiarism; unauthorised collaboration; cheating in examinations; theft of other student’s work; collusion; inadequate and incorrect referencing; will be dealt with in accordance with the ECU Rule 40 Academic Misconduct (including Plagiarism) Policy. Ensure that you are familiar with the Academic Misconduct Rules. Assignment ExtensionsInstructions to apply for extensions are available on the ECU Online Extension Request and Tracking System to formally lodge your assignment extension request. The link is also available on Canvas in the Assignment section. Normal work commitments, family commitments and extra-curricular activities are not accepted as grounds for granting you an extension of time because you are expected to plan ahead for your assessment due dates. Where the assignment is submitted not more than 7 days late, the penalty shall, for each day that it is late, be 5% of the maximum assessment available for the assignment. Where the assignment is more than 7 days late, a mark of zero shall be awarded. ![]() Get expert help for Machine Learning Modelling and many more. 24X7 help, plag free solution. Order online now! Universal Assignment (September 1, 2025) Assignment 2 Machine Learning Modelling. Retrieved from https://universalassignment.com/assignment-2-machine-learning-modelling/. "Assignment 2 Machine Learning Modelling." Universal Assignment - September 1, 2025, https://universalassignment.com/assignment-2-machine-learning-modelling/ Universal Assignment May 22, 2023 Assignment 2 Machine Learning Modelling., viewed September 1, 2025,<https://universalassignment.com/assignment-2-machine-learning-modelling/> Universal Assignment - Assignment 2 Machine Learning Modelling. [Internet]. [Accessed September 1, 2025]. Available from: https://universalassignment.com/assignment-2-machine-learning-modelling/ "Assignment 2 Machine Learning Modelling." Universal Assignment - Accessed September 1, 2025. https://universalassignment.com/assignment-2-machine-learning-modelling/ "Assignment 2 Machine Learning Modelling." Universal Assignment [Online]. Available: https://universalassignment.com/assignment-2-machine-learning-modelling/. [Accessed: September 1, 2025] Please note along with our service, we will provide you with the following deliverables:
Please do not hesitate to put forward any queries regarding the service provision. We look forward to having you on board with us. ![]() Recent AssignmentsCategoriesGet 90%* Discount on Assignment HelpMost Frequent Questions & AnswersUniversal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed. We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on. We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible. We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States. We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us. After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline. Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost. No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution. The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts. We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also. Popular AssignmentsAssignment Help in St Albans, MelbourneIntroduction St Albans is a multicultural suburb located approximately 17 km north-west of Melbourne’s CBD. Known for its diverse community, vibrant shopping precincts, and residential charm, St Albans is a popular choice for students seeking affordable living with good access to educational institutions. Nearby universities and TAFE campuses include Victoria Assignment Help in Springvale South, MelbourneIntroduction Springvale South is a peaceful suburb located approximately 27 km south-east of Melbourne’s CBD. Known for its residential charm, green spaces, and family-friendly environment, Springvale South is ideal for students seeking a quieter lifestyle while remaining connected to educational hubs. Nearby universities and TAFE campuses include Monash University (Clayton Assignment Help in Springvale, MelbourneIntroduction Springvale is a bustling multicultural suburb located approximately 22 km south-east of Melbourne’s CBD. Known for its vibrant community, diverse cuisines, and commercial precincts, Springvale attracts students seeking both cultural experiences and proximity to educational institutions. Nearby universities and TAFE campuses include Monash University (Clayton Campus), Deakin University (Burwood Assignment Help in Spotswood, MelbourneIntroduction Spotswood is a peaceful suburb located approximately 8 km south-west of Melbourne’s CBD. Known for its residential charm, local parks, and family-friendly atmosphere, Spotswood offers students a quiet and supportive environment for studying. Its proximity to educational institutions such as Victoria University (Footscray Park Campus), RMIT University, and TAFE Assignment Help in South Yarra, MelbourneIntroduction South Yarra is a prestigious inner-city suburb located just 4 km south-east of Melbourne’s CBD. Known for its upscale shopping precincts, cafés, cultural attractions, and vibrant nightlife, South Yarra attracts students seeking a dynamic urban lifestyle with excellent access to universities and TAFE campuses. Nearby institutions include RMIT University, Assignment Help in Southbank, MelbourneIntroduction Southbank is a vibrant inner-city suburb located just across the Yarra River from Melbourne’s CBD. Known for its arts and entertainment precinct, high-rise apartments, and cultural hubs, Southbank is home to students seeking a dynamic lifestyle close to universities and TAFE campuses. Nearby institutions include RMIT University, University of Assignment Help in South Morang, MelbourneAssignment Help in South Morang, Melbourne Introduction South Morang is a thriving suburb located approximately 21 km north-east of Melbourne’s CBD. Known for its family-friendly atmosphere, modern residential developments, and excellent amenities, South Morang has become a popular choice for students seeking a peaceful yet well-connected study environment. With convenient Assignment Help in South Melbourne, MelbourneIntroduction South Melbourne is a vibrant inner-city suburb located just 2 km south of Melbourne’s CBD. Known for its historic architecture, bustling markets, trendy cafés, and proximity to the Arts Precinct, South Melbourne attracts students who want a lively urban lifestyle with easy access to universities and TAFE campuses. Nearby Assignment Help in South Kingsville, MelbourneIntroduction South Kingsville is a charming inner-west suburb located approximately 7 km south-west of Melbourne’s CBD. Known for its tree-lined streets, historic homes, and a close-knit community, South Kingsville offers students a calm and welcoming environment while remaining close to major educational institutions. Students in this suburb have easy access Assignment Help in Somerton, MelbourneIntroduction Somerton is an industrial and semi-residential suburb located approximately 22 km north of Melbourne’s CBD. Known for its convenient access to the Hume Freeway and Melbourne Airport, Somerton is ideal for students who prefer a quieter environment while being well connected to educational institutions across the city. Nearby universities Assignment Help in Skye, MelbourneIntroduction Skye is a growing suburb located approximately 38 km south-east of Melbourne’s CBD. Known for its residential estates, open green spaces, and family-friendly atmosphere, Skye is ideal for students who prefer a quieter lifestyle while remaining connected to the city and educational institutions. The suburb is well-linked by major Assignment Help in Seaholme, MelbourneIntroduction Seaholme is a charming bayside suburb located approximately 14 km south-west of Melbourne’s CBD. Known for its quiet streets, coastal parks, and proximity to Altona Beach, Seaholme offers students a peaceful study environment while remaining connected to the city. With easy access via public transport and major roads, students Assignment Help in Seddon, MelbourneIntroduction Seddon is a vibrant inner-west suburb located approximately 7 km west of Melbourne’s CBD. Known for its lively café culture, boutique shops, and historic architecture, Seddon combines a charming village atmosphere with convenient access to the city. Its proximity to Victoria University (Footscray Campus), RMIT University, and other TAFE Assignment Help in Seabrook, MelbourneIntroduction Seabrook is a coastal suburb located about 23 km south-west of Melbourne’s CBD. Known for its serene beaches, family-friendly environment, and green spaces, Seabrook offers students a peaceful setting while remaining well connected to educational institutions across Melbourne. Public transport and road access via the Princes Highway and nearby Assignment Help in Scoresby, MelbourneIntroduction Scoresby is a well-established suburb located about 28 km east of Melbourne’s CBD. Known for its mix of residential and commercial areas, Scoresby offers a peaceful and convenient lifestyle for students. The suburb is in close proximity to educational institutions such as Monash University (Clayton Campus), Swinburne University, and Assignment Help in Sandringham, MelbourneIntroduction Sandringham is a picturesque bayside suburb located approximately 16 km south-east of Melbourne’s CBD. Famous for its beautiful beaches, Sandringham Yacht Club, and relaxed coastal lifestyle, Sandringham is ideal for students seeking a serene study environment while remaining close to the city. The suburb has convenient transport links via Assignment Help in Sandhurst, MelbourneIntroduction Sandhurst is an upscale suburb located approximately 38 km south-east of Melbourne’s CBD. Known for its leafy streets, modern housing estates, and peaceful environment, Sandhurst is ideal for students seeking a calm residential lifestyle while maintaining access to educational institutions in Melbourne’s south-eastern corridor. With nearby transport links and Assignment Help in Roxburgh Park, MelbourneIntroduction Roxburgh Park is a vibrant and fast-growing suburb located around 23 km north of Melbourne’s CBD. Known for its multicultural community, modern residential developments, and family-friendly environment, Roxburgh Park has become a popular choice for students and young professionals alike. With excellent transport connections via the Roxburgh Park Train Assignment Help in Rowville, MelbourneIntroduction Rowville is a well-established suburb located around 27 km south-east of Melbourne’s CBD. Known for its family-friendly vibe, spacious parks like Stud Park Reserve, and excellent community facilities, Rowville offers students a balanced mix of suburban comfort and easy access to education hubs. With major roads such as the Assignment Help in Rosanna, MelbourneIntroduction Rosanna is a leafy and family-friendly suburb located about 12 km north-east of Melbourne’s CBD. Known for its green streets, Rosanna Parklands, and strong community atmosphere, it provides students with a calm environment while staying close to the city. The suburb is well connected by Rosanna Station and major Assignment Help in Ripponlea, MelbourneIntroduction Ripponlea is a charming inner-city suburb located just 8 km south-east of Melbourne’s CBD. Famous for the heritage-listed Rippon Lea Estate and its leafy residential streets, Ripponlea offers students a mix of cultural richness and convenient city living. With easy tram and train connections, students can travel effortlessly to Assignment Help in Ringwood North, MelbourneIntroduction Ringwood North is a leafy and family-friendly suburb located about 28 km east of Melbourne’s CBD. Known for its green streets, parks, and welcoming community, Ringwood North offers a peaceful residential lifestyle while remaining close to Melbourne’s educational and business hubs. With strong transport connections via nearby Ringwood Station Assignment Help in Ringwood East, MelbourneIntroduction Ringwood East is a peaceful residential suburb located around 25 km east of Melbourne’s CBD. Known for its leafy streets, friendly community, and excellent access to parks and schools, Ringwood East is ideal for students seeking a quieter lifestyle while still being close to Melbourne’s educational hubs. With its Assignment Help in Ringwood, MelbourneIntroduction Ringwood is a bustling suburb located around 25 km east of Melbourne’s CBD. Known for its leafy residential streets, modern shopping centres like Eastland, and easy access to the scenic Dandenong Ranges, Ringwood offers a great lifestyle for students and professionals. The suburb is well connected via Ringwood Station Assignment Help in Richmond, MelbourneIntroduction Richmond is one of Melbourne’s most iconic inner-city suburbs, located just 3 km east of the CBD. Famous for its lively café culture, bustling shopping streets like Bridge Road, and its thriving arts and sports scene, Richmond blends lifestyle and convenience perfectly. With excellent tram and train connections, students Assignment Help in Reservoir, MelbourneAssignment Help in Reservoir, MelbourneIntroductionReservoir is a thriving suburb located just 12 kilometres north of Melbourne’s CBD. Known for its multicultural community, leafy parks, and welcoming neighbourhoods, Reservoir strikes the perfect balance between suburban comfort and city convenience. It is well-connected through trains, buses, and trams, making travel easy for Assignment Help in Parkville – Universal AssignmentParkville, located just 3 km north of Melbourne CBD, is one of Melbourne’s most prestigious academic hubs. It is home to The University of Melbourne, Monash University (Pharmacy campus), and major research institutes like the Walter and Eliza Hall Institute and the Royal Melbourne Hospital precinct. With such a concentration Assignment Help in Oakleigh East – Universal AssignmentOakleigh East, located about 17 km south-east of Melbourne CBD, is a peaceful and residential suburb in the City of Monash, known for its family-friendly environment and convenient access to shopping and educational facilities. With nearby institutions like Monash University, Holmesglen Institute, and TAFE Victoria, many students living in Oakleigh Assignment Help in Preston – Universal AssignmentPreston, located about 9 km north of Melbourne’s CBD, is a thriving multicultural suburb that attracts many university and TAFE students. With the presence of Melbourne Polytechnic (Preston Campus) and close proximity to La Trobe University (Bundoora campus) and RMIT University, Preston has a large student population. While the suburb Assignment Help in Point Cook – Universal AssignmentPoint Cook, located around 25 km southwest of Melbourne’s CBD, is a fast-growing residential suburb in the City of Wyndham. Known for its modern housing estates, coastal beauty, and cultural diversity, Point Cook is also home to many university and TAFE students. With Victoria University, RMIT, Deakin University, and the Can't Find Your Assignment?Free Assistance 1
Universal Assignment
Hello 👋 How can we help you? Open chat
|