91精品国产91久久久久久_国产精品二区一区二区aⅴ污介绍_一本久久a久久精品vr综合_亚洲视频一区二区三区

合肥生活安徽新聞合肥交通合肥房產(chǎn)生活服務(wù)合肥教育合肥招聘合肥旅游文化藝術(shù)合肥美食合肥地圖合肥社保合肥醫(yī)院企業(yè)服務(wù)合肥法律

TCS3393 DATA MINING代做、代寫Python/Java編程

時(shí)間:2024-03-24  來(lái)源:合肥網(wǎng)hfw.cc  作者:hfw.cc 我要糾錯(cuò)



FACULTY OF ENGINEERING, BUILT-ENVIRONMENT, AND INFORMATION
TECHNOLOGY (FOEBEIT)
BACHELOR OF INFORMATION TECHNOLOGY (HONS)
JANUARY-MAY 2024 INTAKE
TCS3393 DATA MINING
GROUP ASSIGNMENT [2-3 members per group]
This assignment is worth 25% of the overall marks available for this module. This assignment
aims to help the student explore and analyse a set of data and reconstruct it into meaningful
representations for decision-making.
The online landscape is ever-evolving, with websites serving as crucial assets for businesses,
organizations, and individuals. As the internet continues to grow, the need for accurate and
efficient website classification becomes paramount. Understanding the nature of websites, their
content, and the user experience they provide is vital for various purposes, including online
security, marketing strategies, and content filtering.
Embarking on a data science project, you collaborate with a cybersecurity firm dedicated to
enhancing web security measures. The firm provides you with a rich dataset encompassing
various attributes of websites, including their URLs, user comments, and assigned categories.
Your objective is to develop a classification model capable of accurately categorizing websites
based on these variables.
The dataset includes information on the URLs of different websites, user comments associated
with those websites, and pre-existing categories assigned to them. The challenge lies in creating
a model that not only accurately classifies websites but also adapts to the dynamic nature of the
online environment, where new types of websites constantly emerge.
Introduction
2
Your goal is to implement advanced data analysis techniques to train a model that enhances the
efficiency of web classification.
Techniques
The techniques used to explore the dataset using various data exploration, manipulation,
transformation, and visualization techniques are covered in the course. As an additional feature,
you must explore further concepts which can improve the retrieval effects. The datasetprovided
for this assignment is related to the website classification.
Dataset
This dataset contains information on 1407 websites URL. It includes 3 variables that describe
various categories of websites. The dataset will be analyzed using subsets of these variables for
descriptive and quantitative analyses, depending on the specific models used.
Objective:
Develop a classification model to categorize websitesusing advanced data science techniques.The
model should robustly classify the website based on comments stated in the dataset.
Tasks:
1. Data Exploration:
• Conduct an initial exploration of the dataset to understand its structure, size, and
variables.
• Examine the distribution of website categories to identify any imbalances in the
dataset.
• Explore the distribution of URLs and user comments length to gain insights into the
data.
Assignment Task: Websites Classification
3
2. Descriptive Analysis:
A. Basic Exploration:
• Describe the structure of the dataset. How many observations and variables
does it contain?
• What are the data types of the variables in the dataset?
B. Statistical Summary:
• Provide a statistical summary of the 'Category' variable. What are the most
common website categories?
• Calculate basic descriptive statistics (mean, median, standard deviation) for
relevant numeric variables.
C. URL Analysis:
• Analyze the distribution of website URLs. Are there any patterns or
commonalities?
• Are there any outlier URLs that need special attention?
3. Data Preprocessing:
A. Cleaning Text Data:
• Explore the 'cleaned_website_text' variable. What preprocessing steps would
you take to clean text data for analysis?
• Implement text cleaning techniques and explain their importance in preparing
data for text-based analysis.
B. Handling Missing Values:
• Identify if there are any missing values in the dataset. Propose strategies for
handling missing values, specifically in the 'cleaned_website_text' column.
4. Visualization:
A. Category Distribution Visualization:
• Create a bar chart or pie chart to visually represent the distribution of website
categories.
• How does the visualization help in understanding the balance or imbalance of
the dataset?
B. Text Data Visualization:
• Generate word clouds or frequency plots for the 'cleaned_website_text'
variable. What insights can be gained from these visualizations?
4
5. Model Development
A. Data Mining Analysis:
• Split the dataset into training and testing sets for model evaluation.
• Implement various machine learning algorithms for classification, such as logistic
regression, decision trees, or random forests.
B. Training and Evaluation
• Evaluate the performance of each model using metrics like accuracy, precision, recall,
and F**score.
• Discuss the challenges and considerations specific to evaluating a model for website
classification.
6. Advanced Techniques:
i. Feature Engineering:
• Propose additional features that could enhance the model's performance.
How might these features capture more nuanced information about websites?
ii.Dynamic Nature of Websites:
• Given the dynamic nature of the online environment, how could the model
adapt to newly emerging website types? Discuss strategies for model
adaptation.
7. Create Dashboard, Report and Conclusions:
• Summarize the findings, including insights gained from exploratory data analysis and
the performance of the classification model.
• How interpretable is the chosen model? Can you explain the decision-making process
of the model in the context of website classification?
• Provide recommendations for further improvements or considerations in the dynamic
landscape of web classification.
• Reflect on the challenges encountered during the analysis. What potential
improvements or future work would you recommend to enhance the model's
performance?
This assignment allows students to apply knowledge of data exploration, preprocessing, data
modelling, and model building to solve a real-world problem in the business domain. It also
encourages them to explore additional concepts for improving model performance.
5
• The complete Python program (source code (ipynb)) and report must be submitted to
Blackboard.
• Python Script (Program Code):
o Name the file under your name and SUKD number.
o Start the first two lines in your program by typing your name and SUKD
number. For example:
# Nor Anis Sulaiman
#SUKD20231234
o For each question, give an ID and explain what you want to discover. For example:
a. Explore the distribution of website categories in the dataset. Are there any specific
categories that are more prevalent than others?
b. Visualize the distribution of URL lengths and user comments lengths. Are there patterns
or outliers that could be informative for the classification model?
c. What steps would you take to clean and preprocess the URLs and user comments for
effective analysis?
d. How might you handle any missing values in the dataset, and what impact could they
have on the classification model?
e. Provide descriptive statistics for key variables such as URL lengths and user comments
lengths. What insights can be derived from these statistics?
f. Explore potential additional features that could enhance the model's ability to classify
websites accurately.
g. How might the inclusion of features derived from URLs or user comments contribute
to the overall model performance?
h. Choose a classification algorithm suitable for website classification. Explain your
choice.
i. Implement the chosen algorithm using Python and relevant libraries. What
considerations should be taken into account during the model implementation phase?
j. Split the dataset into training and testing sets. How would you assess the performance
of the model using metrics like accuracy, precision, recall, and F**score?
k. Discuss potential challenges in evaluating the model's effectiveness and generalization
to new websites.
l. Create visualizations to interpret the model's predictions and showcase its classification
performance.
Deliverables
6
As part of the assessment, you must submit the project report in printed and softcopy form,
which should have the following format:
A) Cover Page:
All reports must be prepared with a front cover. A protective transparent plastic sheet can be
placed in front of the report to protect the front cover. The front cover should be presented with
the following details:
o Module
o Coursework Title
o Intake
o Student name and ID
o Date Assigned (the date the report was handed out).
o Date Completed (the date the report is due to be handed in).
B) Contents:
• Introduction and assumptions (if any)
• Data import / Cleaning / pre-processing / transformation
• Each question must start in a separate page and contains:
o Analysis Techniques - data exploration / manipulation / visualization
o Screenshot of source code with the explanation.
o Screenshot of output/plot with the explanation.
o Outline the findings based on the results obtained.
• The extra feature explanation must be on a separate page and contain:
Documents: Coursework Report
7
o Screenshot of source code with the explanation.
o Screenshot of output/plot with the explanation.
o Explain how adding this extra feature can improve the results.
C) Conclusion
• Depth and breadth of analysis
• Quality and depth of feedback on the analysis process
• Reflection on learning and areas for improvement
D) References
• The font size used in the report must be 12pt, and the font is Times New Roman. Full
source code is not allowed to be included in the report. The report must be typed and
clearly printed.
• You may source algorithms and information from the Internet or books. Proper
referencing of the resources should be evident in the document.
• All references must be made using the APA (American Psychological Association)
referencing style as shown below:
o The theory was first propounded in 1970 (Larsen, A.E. 1971), but since then has
been refuted; M.K. Larsen (1983) is among those most energetic in their
opposition……….
o /**Following source code obtained from (Danang, S.N. 2002)*/
int noshape=2;
noshape=GetShape();
• A list of references at the end of your document or source code must be specified in the
following format:
Larsen, A.E. 1971, A Guide to the Aquatic Science Literature, McGraw-Hill, London.
Larsen, M.K. 1983, British Medical Journal [Online], Available from
http://libinfor.ume.maine.edu/acquatic.htm (Accessed 19 November 1995)
Danang, S.N., 2002, Finding Similar Images [Online], The Code Project, *Available
from http://www.codeproject.com/bitmap/cbir.asp, [Accessed 14th *September 2006]
Further information on other types of citation is available in Petrie, A., 2003, UWE
Library Services Study Skills: How to reference [online], England, University of
請(qǐng)加QQ:99515681  郵箱:99515681@qq.com   WX:codehelp 

掃一掃在手機(jī)打開(kāi)當(dāng)前頁(yè)
  • 上一篇:ECM1410代做、代寫java編程設(shè)計(jì)
  • 下一篇:代做CS 550、代寫c++,Java編程語(yǔ)言
  • 無(wú)相關(guān)信息
    合肥生活資訊

    合肥圖文信息
    2025年10月份更新拼多多改銷助手小象助手多多出評(píng)軟件
    2025年10月份更新拼多多改銷助手小象助手多
    有限元分析 CAE仿真分析服務(wù)-企業(yè)/產(chǎn)品研發(fā)/客戶要求/設(shè)計(jì)優(yōu)化
    有限元分析 CAE仿真分析服務(wù)-企業(yè)/產(chǎn)品研發(fā)
    急尋熱仿真分析?代做熱仿真服務(wù)+熱設(shè)計(jì)優(yōu)化
    急尋熱仿真分析?代做熱仿真服務(wù)+熱設(shè)計(jì)優(yōu)化
    出評(píng) 開(kāi)團(tuán)工具
    出評(píng) 開(kāi)團(tuán)工具
    挖掘機(jī)濾芯提升發(fā)動(dòng)機(jī)性能
    挖掘機(jī)濾芯提升發(fā)動(dòng)機(jī)性能
    海信羅馬假日洗衣機(jī)亮相AWE  復(fù)古美學(xué)與現(xiàn)代科技完美結(jié)合
    海信羅馬假日洗衣機(jī)亮相AWE 復(fù)古美學(xué)與現(xiàn)代
    合肥機(jī)場(chǎng)巴士4號(hào)線
    合肥機(jī)場(chǎng)巴士4號(hào)線
    合肥機(jī)場(chǎng)巴士3號(hào)線
    合肥機(jī)場(chǎng)巴士3號(hào)線
  • 短信驗(yàn)證碼 目錄網(wǎng) 排行網(wǎng)

    關(guān)于我們 | 打賞支持 | 廣告服務(wù) | 聯(lián)系我們 | 網(wǎng)站地圖 | 免責(zé)聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網(wǎng) 版權(quán)所有
    ICP備06013414號(hào)-3 公安備 42010502001045

    91精品国产91久久久久久_国产精品二区一区二区aⅴ污介绍_一本久久a久久精品vr综合_亚洲视频一区二区三区
    久久精品国产第一区二区三区最新章节| 自拍偷拍亚洲综合| 99视频+国产日韩欧美| 精品亚洲porn| 亚洲欧美自拍偷拍| 午夜不卡在线视频| 欧美白人最猛性xxxxx69交| 亚洲精品专区| 成人激情文学综合网| 亚洲成年人影院| 国产欧美一区二区精品性色超碰| 色婷婷精品久久二区二区蜜臂av | 欧美日韩成人一区二区| 国产在线欧美| 国产精品 欧美精品| 亚洲一区二区三区三| 久久综合狠狠综合久久激情 | 久久久精品黄色| 欧美丝袜自拍制服另类| 亚洲人成人一区二区三区| 懂色av一区二区三区蜜臀| 亚洲综合视频在线观看| 久久亚洲二区三区| 欧美日免费三级在线| 国产精品一区视频| 牛牛国产精品| 国产成人精品影视| 免费看黄色91| 亚洲制服丝袜av| 国产精品久久三| 日韩精品一区二区三区中文精品| 色天天综合久久久久综合片| 亚洲国产精品123| 99久久综合国产精品| 精品亚洲国产成人av制服丝袜| 亚洲综合在线视频| 中文字幕亚洲欧美在线不卡| 日韩欧美国产综合一区| 欧美日韩一区二区在线视频| 亚洲影视在线| 亚洲精品乱码久久久久久蜜桃麻豆| 96av麻豆蜜桃一区二区| 国产一区二区三区四| 秋霞国产午夜精品免费视频| 一区二区在线观看视频| 国产精品国产三级国产a| 久久综合久久综合九色| 日韩欧美亚洲一区二区| 欧美日本在线观看| 在线欧美小视频| 色呦呦国产精品| 久久久久久网| 亚洲一区成人| 亚洲免费观看视频| 久久久99精品久久| 2021中文字幕一区亚洲| 欧美成人综合网站| 日韩欧美三级在线| 日韩午夜激情视频| 欧美一区日韩一区| 欧美一区二区精品久久911| 欧美日韩精品免费观看视频| 欧美性猛片xxxx免费看久爱| 久久综合中文| 久久在线精品| 欧美亚洲一区二区在线观看| 91久久香蕉国产日韩欧美9色| 久久久久高清| 色偷偷成人一区二区三区91| 久久蜜桃精品| 欧美在线高清视频| 欧美日韩在线免费视频| 在线不卡免费av| 日韩小视频在线观看专区| 精品日韩一区二区三区免费视频| 欧美成人午夜电影| 欧美精品一区二区三区蜜臀| 久久九九久久九九| 欧美国产精品中文字幕| 亚洲视频狠狠干| 亚洲一区自拍偷拍| 日本在线观看不卡视频| 加勒比av一区二区| 东方aⅴ免费观看久久av| 9i看片成人免费高清| 欧美日韩大片一区二区三区| 亚洲国产精品日韩| 亚洲欧美日韩另类精品一区二区三区| 久久综合网络一区二区| 欧美日韩在线三级| 欧美成人vr18sexvr| 国产日韩欧美高清在线| **性色生活片久久毛片| 夜夜嗨av一区二区三区| 免费看欧美女人艹b| 国产传媒日韩欧美成人| 91女神在线视频| 99视频日韩| 在线影视一区二区三区| 日韩欧美国产一区二区在线播放| 国产欧美日韩不卡免费| 亚洲欧美激情在线| 老鸭窝一区二区久久精品| 国产成人综合自拍| 国产精品分类| 久久综合精品一区| 日韩亚洲欧美一区| 中文字幕日韩精品一区| 日韩精品每日更新| 成人国产在线观看| 亚洲人成在线影院| 欧美日韩精品一区二区| 久久香蕉国产线看观看99| 亚洲黄网站在线观看| 久久91精品久久久久久秒播| 日韩午夜激情av| 国产精品久久久久久久蜜臀 | 国产三级精品在线不卡| 欧美理论在线播放| 欧美激情一区二区三区在线| 午夜私人影院久久久久| 成人在线一区二区三区| 一区二区三区高清视频在线观看| 欧洲av一区二区嗯嗯嗯啊| 久久综合久久久久88| 亚洲不卡一区二区三区| 国产91富婆露脸刺激对白| 亚洲精品裸体| 欧美一级一级性生活免费录像| 国产精品不卡一区二区三区| 男人的天堂亚洲一区| 欧美aⅴ99久久黑人专区| 久久天天狠狠| 国产亚洲综合av| 青青草国产精品亚洲专区无| 91麻豆.com| 91国偷自产一区二区三区成为亚洲经典 | 成人动漫一区二区在线| 国产精品日韩一区二区三区| 欧美大片在线观看一区二区| 怡红院av一区二区三区| 成人激情黄色小说| 久久精品观看| 国产精品无码永久免费888| 老司机精品视频线观看86| 亚洲无线一线二线三线区别av| 欧美日韩国产综合久久| 伊人婷婷欧美激情| 91在线精品一区二区三区| 91官网在线观看| 亚洲婷婷在线视频| 成人伦理片在线| 欧美亚洲动漫制服丝袜| 亚洲欧美电影一区二区| 成人免费视频视频在线观看免费 | 久久久不卡网国产精品一区| 美脚の诱脚舐め脚责91| 最新亚洲一区| 久久久精品中文字幕麻豆发布| 久久99久久99小草精品免视看| 亚洲人成免费| 国产视频视频一区| 国内精品久久久久影院薰衣草| 国产午夜精品一区二区三区欧美 | 97久久超碰国产精品电影| 欧美日韩在线免费视频| 亚洲一二三区在线观看| 欧美日韩在线一二三| 国产一区二区h| 亚洲欧美不卡| ㊣最新国产の精品bt伙计久久| 国产69精品久久久久777| 色哦色哦哦色天天综合| 一区二区三区在线视频播放| 色综合网色综合| 日韩欧美国产一区二区三区 | 红杏aⅴ成人免费视频| 2023国产精品| 成人一区在线观看| 欧美日韩高清一区二区| 三级一区在线视频先锋 | 亚洲美女免费视频| 欧美日韩天天操| 26uuu国产在线精品一区二区| 国产美女一区二区| 欧美性猛交xxxx乱大交退制版| 亚洲国产日产av| 国产一区二区三区免费不卡| 一区在线播放视频| 午夜视频一区| 中文字幕第一区第二区| 91免费在线视频观看| www激情久久| 波多野结衣中文字幕一区| 日韩欧美不卡在线观看视频| 国产精品一卡二卡在线观看| 欧美喷水一区二区| 加勒比av一区二区| 在线电影一区二区三区| 国产精品一级黄|