Project Conditions
• Do not provide or show your assignment work to any other person, other than the teaching staff of COMP4336/9336.
• Do not publish your assignment solution (data/code) via the Internet. For example, do not place your assignment in a public GitHub repository. Sharing, publishing, or distributing your assignment work even after the completion of COMP4336/9336 is not permitted.
Project Topic: Data-Driven WiFi Enhancement
Objective
This project is designed to deepen your understanding of the factors that influence retransmissions in WiFi networks, with a focus on how these retransmissions impact network performance. Through this project, you will gain handsAdd -on experience in data collection, performance analysis, and applying basic machine learning techniques to real-world network data.
Learning Outcomes:
• Practical Data Collection: You will learn how to effectively capture and analyse real-world WiFi traffic data, identifying key patterns related to retransmissions.
• Performance Insight: By analysing the impact of retransmissions on network performance, you will develop a deeper understanding of the challenges in WiFi network management.
• Machine Learning Application: You will apply machine learning to predict retransmission events, gaining insight into the predictive power of various network factors.
• Critical Thinking and Innovation: The project will culminate in designing protocol enhancements that address the identified issues, fostering your ability to innovate within existing network frameworks.
• Real-World Application: The skills and knowledge gained from this project will be directly applicable to solving real-world networking challenges, preparing you for careers in network engineering, cybersecurity, or research and development.
Fig 1. Project Tasks
Project Tasks and Components
1. Data Collection:
You will use Wireshark or other protocol analysers to collect WiFi traffic data from operational WiFi networks. Your goal will be to gather a rich dataset that includes both normal and retransmitted packets. Pay attention to the following:
• Time and Location Diversity: You should capture data from at least two crowded locations where you can observe significant WiFi retransmissions. Note that in light-traffic scenarios, there would be not many WiFi collisions and retransmissions, making data collection not particularly useful for your project. Therefore, it is crucial to choose locations with high network activity and high density of WiFi Access Points (APs), such as libraries, UNSW CSE buildings, or similar busy areas. For a given location, collect data across different times of the day and over multiple days spanning different weeks. You have the flexibility to choose the locations and times for your data collection.
• Data Volume: Collect enough data to ensure that your conclusions are statistically valid and robust enough for training machine learning models. Adequate data will help the model generalize well and minimize the risk of overfitting.Add
2. Performance Analysis:
Once the data is collected, you will analyze the impact of WiFi collisions and retransmissions on network performance. This involves identifying retransmitted packets and calculating the delay caused by these retransmissions. You will explore how retransmission frequency correlates with overall network latency, throughput, and efficiency, and visualize these relationships to uncover patterns and insights.
Task details:
wlan.fc.retry == 1
• Retransmission Analysis:
To analyze the retransmissions in your dataset, use Wireshark or an equivalent packet sniffing tool to identify and filter retransmitted packets. Focus on the 802.11 frame retransmission indicators. For each packet, track how many times it has been retransmitted (e.g., once, twice, etc.). Present the retransmission distribution in your dataset, showing how often packets are retransmitted. You can display this using a bar chart or histogram, where the x-axis represents the number of retransmissions and the y-axis represents the number of occurrences.
• Network Performance Analysis:
Do the network performance analysis by choosing one of the most congested networks in your dataset. Next, calculate key network performance metrics such as latency, throughput, and efficiency.
o Latency: Locate a data packet and the corresponding ACK frame. Calculate the latency as the difference between the ACK packet’s timestamp and the data packet’s timestamp.
o Throughput: Calculate the throughput as the total amount of data successfully transmitted over a given period. You can compute this by summing up the data packet sizes filtered for a given device (MAC address) and dividing by the duration of the capture.
o Efficiency: Evaluate the efficiency by comparing the amount of useful data transmitted to the total number of transmissions, factoring in retransmissions. To present your findings, create graphs and charts that show the relationship between retransmission frequency and network performance metrics such as latency, throughput, and efficiency. Use visualization tools like Matplotlib or Seaborn to create plots that clearly depict these relationships.
3. Machine Learning Evaluation:
You will apply machine learning techniques to evaluate the factors that influence retransmissions. By extracting features such as signal strength, packet size, and channel utilization, you will train a machine learning model to predict the likelihood of pachttps://.comket retransmission. This model will help
you understand which factors are most critical in determining whether a packet is successfully
transmitted or needs to be retransmitted.
• Feature Extraction: Extract relevant features from your dataset, such as signal strength, packet size, and channel utilization, to serve as inputs for your machine Add learning model.
• Model Training and Evaluation: Train some basic machine learning models e.g., kNN, Random Forest, and SVM, to predict packet retransmissions. Evaluate the models’ performance using a portion of your dataset and analyze the results.
• Model Interpretation: Discuss which factors/features are most influential in predicting retransmissions and the significance of these factors for network performance.
Resources for machine learning:
• Books & Online Documents
▪ Scikit-learn Documentation: The official documentation is comprehensive, providing guides, examples, and references that are invaluable for both beginners and advanced users.
▪ Python Machine Learning by Sebastian Raschka
• Online Courses
▪ Scikit-learn in Python for Machine Learning
▪ Scikit-learn for Beginners
4. Protocol Enhancement Design:
o Based on the insights gained from your machine learning analysis, you will propose enhancements to the WiFi protocol aimed at reducing retransmissions and improving overall network performance. While implementing your proposed enhancement is not required, you should demonstrate your proposal’s feasibility through analysis and discussions. Here are some example enhancements you might consider (but don’t feel limited to these):
▪ Adaptive Retransmission Mechanisms: Design mechanisms that adjust retransmission strategies based on real-time network conditions, such as varying the backoff time or retransmission count based on signal strength or congestion levels.
▪ Dynamic Channel Allocation: Propose a method for the AP to dynamically switch the channel that reduces the likelihood of retransmissions.
▪ Error Control Enhancements: Suggest improvements to error detection and correction mechanisms to prevent the need for retransmissions under certain conditions.
o Critical Thinking: Explain the rationale behind your proposed enhancements, considering trade-offs such as complexity, scalability, and potential impacts on existing WiFi standards. Evaluate how these changes might be implemented in real-world networks and what challenges might arise.
Submission Requirements
1. Collected DataseAssignment Project Exam Helpt: Submit the dataset in a zip file, organized according to the template to be
2. PDF Report: Submit a single PDF report (maximum 10 pages) that includes:
• A detailed explanation of the data collection process and detailed metadata of your datasets.
• Analysis of the impact of retransmissions on network performance, including visualizations and Add insights derived from the analysis.
• A discussion of the machine learning evaluation, covering feature extraction, model training, and model performance comparison.
• The design and critical analysis of proposed protocol enhancements, including the rationale behind your decisions, trade-offs considered, and feasibility for real-world implementation.
3. Jupyter Notebook and Code:
Submit all code used for data analysis and performance evaluation, either as standalone scripts or within the Jupyter notebook. Include a Jupyter notebook (Python 3) that demonstrates the machine learning process, from importing the dataset to training and evaluating models using scikit-learn (or other machine learning libraries). Ensure that the notebook is well-documented with explanations of each step.
Submission Process:
Marking Rubic:
Task Excellent (85-100%) Average (50-84%) Poor (0-49%)
Data
Collection
Data
Analysis
(25%) Professional analysis with detailed procedures.
Visualizations are unclear or missing. The report is difficult to follow and lacks connection to course content.
Machine
Learning
(25%) Dataset is preprocessed correctly. At least two machine learning
Assignme
algorithms are compared. Perform analysis to prove has adequate data
collection. Clear examples https:/
of model usage and thorough analysis of performance are
provided. Add Basic preprocessing is done, but comparison of models is
nt Project Exlimited or
not wellexplained. Some examples are unclear or missing, with limited analysis of model
am
Helpmodel
s used without comparison, or analysis is severely lacking or
omincorrect.
coder
Protocol Enhancem
ent Design and
Discusion
(25%) Demonstrates strong understanding of wireless protocols and proposes innovative enhancements with clear feasibility and benefit analysis. Some understanding of wireless protocols is demonstrated, proposed enhancements appear to have some benefit, but lacks comprehensive discussion. Limited or no understanding of wireless protocols is demonstrated. Proposed enhancements are vague or unfeasible.
Required Knowledge and Skills
• Understanding of WiFi Protocols: Knowledge of the 802.11 WiFi standard, including packet structure and transmission mechanisms, and the ability to think creatively about protocol improvements.
• Wireshark Proficiency: Familiarity with Wireshark or similar tools for WiFi network data capture and analysis.
• Basic Python Programming: Ability to use Python for data analysis and machine learning, including libraries such as Pandas, Scikit-learn, and Matplotlib.
Important Notes
• Adopt a passive approach to data collection and limit your data collection to data that do not have any user privacy issues associated with it.
End of Project Specs

![[SOLVED] Comp4336 – individual term project/assignment p0](https://assignmentchef.com/wp-content/uploads/2022/08/downloadzip.jpg)

![[SOLVED] COMM1100 2024 Term 3 Assessment 3 Case Study Analysis SQL](https://assignmentchef.com/wp-content/uploads/2022/08/downloadzip-1200x1200.jpg)
Reviews
There are no reviews yet.