Web and Text Analytics

This subject is available under ICMS undergraduate degrees. Please click the button below to find an undergraduate course for you.

BROWSE UNDERGRADUATE DEGREES WITH THIS SUBJECT

Subject Code:

DAT302A

Subject Type:

Specialisation

Credit Points:

3 credit points

Pre-requisite/Co-requisite:

DAT203A Big Data Systems

Course level of study pre-requisite:

A total of 24 credit points (15 credit points from level 100, including ICT101A, ICT102A, ICT103A, and DAT101A, and 9 credit points from level 200 core subjects) prior to enrolling in level 300 core and specialisation subjects.

Subject Level:

300

Subject Rationale:

With the emergence of the Semantic Web and advances in related technologies, more companies are investing in extracting value from growing web-based content. Organisations now pay closer attention to web data and analytics as a driver of competitive advantage. This has led to heavy reliance on tools and technologies that analyse web-based and web-generated data, much of which is unstructured text. As a result, over the last few years, web and text analytics have become an essential part of business intelligence and data analytics, helping businesses understand how users interact with websites, make more informed decisions, and advance their strategic planning.

This subject equips students with the knowledge and skills required to perform web and text analytics. It covers key topics such as extracting and processing web-related data; similarity, association, and classification analyses; topic modelling; and semantic and sentiment analyses. Privacy and ethics in web analytics will also be discussed.

Students will gain the skills needed to help organisations across industries tap into the power of web and text analytics, improve their decision making and, in turn, their overall performance.

Learning Outcomes:

a) Assess different types of data embedded in web applications including textual data.

b) Critically evaluate and apply appropriate methods and analytical techniques to extract and process web-based data and integrate them ethically with organisational datasets.

c) Analyse different patterns and hidden relationships in web-based data using relevant web analytics techniques.

d) Design and implement web analytics pipelines to perform sentiment and semantic analyses to extract insights for organisations.

e) Formulate and present insights and recommendations to various stakeholders to translate website and textual data into valuable digital assets.

Student Assessment:


No	Assessment Task	Weighting	Learning Outcomes
1	Data Extraction and Preparation Project	30%	a, b, d
2	Data Processing Project	30%	b, c, d
3	Case Study, Part A: Report	25%	a, b, c, d, e
3	Case Study, Part B: Video Presentation	15%

Broad Topics to be Covered:

Topic:

Week 1: Introduction to Web-Based Data

Types of web-based data
Extracting web-based data from APIs
Social webs and their content
Natural language and its structure
Natural language, text, and textual data
Stop words

Week 2: Extracting Web-Based Data

Web scraping and data extraction
Finding relevant URLs (e.g., sitemap.xml)
Implications of Web 3.0/Semantic Web
Introduction to web digging with Python
Privacy and ethical web analytics
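
As an illustration of finding relevant URLs through a sitemap, the following is a minimal sketch in Python using only the standard library. The sitemap content, the example.com addresses, and the helper names `extract_urls` and `filter_urls` are assumptions made for illustration and are not taken from the subject materials. In practice the XML would be fetched from a site's sitemap.xml, subject to the site's terms of use and robots.txt.

```python
import xml.etree.ElementTree as ET

# Hypothetical sitemap content, inlined so the sketch runs without network access.
SITEMAP_XML = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/blog/post-1</loc></url>
  <url><loc>https://example.com/blog/post-2</loc></url>
</urlset>"""

# Namespace used by the sitemap protocol; needed for element lookups.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_urls(sitemap_xml):
    """Return every <loc> value listed in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [loc.text.strip() for loc in root.findall("sm:url/sm:loc", NS) if loc.text]


def filter_urls(urls, prefix):
    """Keep only the URLs that start with the given prefix."""
    return [url for url in urls if url.startswith(prefix)]


all_urls = extract_urls(SITEMAP_XML)
blog_urls = filter_urls(all_urls, "https://example.com/blog/")
print(blog_urls)
```

Filtering by prefix is only one way to narrow a crawl; a real project might also filter on the optional `lastmod` element.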

Week 3: Preparing Web-Based Data for Analysis

Data pre-processing pipeline
Attribute standardisation
Noise and regular expressions
Character normalisation
Tokenisation algorithms
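
The Week 3 topics can be pictured as a small pipeline: remove noise with regular expressions, normalise characters, tokenise, and drop stop words. The sketch below is one possible arrangement using only the Python standard library. The stop-word list, the sample sentence, and the function names are hypothetical, and the regular-expression tokeniser is a deliberately simple stand-in for the tokenisation algorithms covered in class.

```python
import re
import unicodedata

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"the", "a", "an", "is", "and", "of", "to", "at"}


def remove_noise(text):
    """Strip HTML tags and URLs, two common kinds of noise in scraped text."""
    text = re.sub(r"<[^>]+>", " ", text)
    text = re.sub(r"https?://\S+", " ", text)
    return text


def normalise_characters(text):
    """Decompose accented characters and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def tokenise(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def preprocess(text):
    """Run the full pipeline and return the remaining tokens."""
    cleaned = normalise_characters(remove_noise(text))
    return [token for token in tokenise(cleaned) if token not in STOP_WORDS]


sample = "<p>The café is GREAT!</p> See https://example.com/menu"
print(preprocess(sample))
```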

Week 4: Feature Engineering and Syntactic Similarity

Vectorising documents
Document-term matrix (DTM)
Similarity matrix (SM)
Bag of words
Models of TF-IDF
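
To make the Week 4 terms concrete, the sketch below builds a document-term matrix from bag-of-words counts, weights it with TF-IDF, and derives a cosine similarity matrix. It uses only the Python standard library. The three toy documents and all function names are assumptions, and the smoothed weighting idf = ln((1 + N) / (1 + df)) + 1 is just one common TF-IDF variant among several.

```python
import math
from collections import Counter

# Hypothetical, already tokenised documents.
docs = [
    ["web", "data", "analytics"],
    ["text", "data", "analytics"],
    ["sports", "news"],
]


def build_vocabulary(documents):
    """Sorted list of the distinct terms across all documents."""
    return sorted({term for doc in documents for term in doc})


def document_term_matrix(documents, vocab):
    """One row per document, one column per term, cells hold raw counts."""
    rows = []
    for doc in documents:
        counts = Counter(doc)
        rows.append([counts[term] for term in vocab])
    return rows


def tf_idf(dtm):
    """Weight raw counts by a smoothed inverse document frequency."""
    n_docs, n_terms = len(dtm), len(dtm[0])
    df = [sum(1 for row in dtm if row[j] > 0) for j in range(n_terms)]
    idf = [math.log((1 + n_docs) / (1 + d)) + 1 for d in df]
    return [[row[j] * idf[j] for j in range(n_terms)] for row in dtm]


def cosine(u, v):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_matrix(vectors):
    """Pairwise cosine similarities between all document vectors."""
    return [[cosine(u, v) for v in vectors] for u in vectors]


vocab = build_vocabulary(docs)
dtm = document_term_matrix(docs, vocab)
sim = similarity_matrix(tf_idf(dtm))
```

In this toy corpus the first two documents share two terms and so receive a similarity above zero, while the third shares none with either and scores zero.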

Week 5: Text Classification Algorithms

Train-Test Split
Overview of web and text analytics algorithms

Supervised learning algorithms
Unsupervised learning algorithms

Selecting the model
Training the model
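
A train-test split holds back part of the labelled data so that a model is evaluated on examples it has not seen. The following is a minimal sketch in standard-library Python. The function name, the default ratio of 0.25, the seed, and the toy labels are all assumptions chosen for illustration.

```python
import random


def train_test_split(items, labels, test_ratio=0.25, seed=42):
    """Shuffle indices reproducibly, then split items and labels together."""
    if len(items) != len(labels):
        raise ValueError("items and labels must have the same length")
    indices = list(range(len(items)))
    random.Random(seed).shuffle(indices)  # seeded so the split is repeatable
    n_test = max(1, round(len(items) * test_ratio))
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    return (
        [items[i] for i in train_idx],
        [items[i] for i in test_idx],
        [labels[i] for i in train_idx],
        [labels[i] for i in test_idx],
    )


# Hypothetical labelled documents.
texts = [f"document {i}" for i in range(8)]
targets = ["spam" if i % 2 else "ham" for i in range(8)]
x_train, x_test, y_train, y_test = train_test_split(texts, targets)
```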

Week 6: Operation and Evaluation of Text Classification Algorithms

Model validation and accuracy metrics
Parameter and hyperparameter tuning
Classification confidence
Feature importance
Predictive text modelling
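
Common accuracy metrics for a binary text classifier can be computed directly from the confusion counts. The sketch below is a plain-Python illustration. The label values, the example predictions, and the function names are hypothetical and stand in for the output of any trained classifier.

```python
def confusion_counts(y_true, y_pred, positive):
    """Count true/false positives and negatives for one positive class."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def classification_metrics(y_true, y_pred, positive):
    """Return accuracy, precision, recall, and F1 as a dictionary."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical true labels and classifier predictions.
y_true = ["spam", "spam", "ham", "ham", "spam", "ham"]
y_pred = ["spam", "ham", "ham", "spam", "spam", "ham"]
metrics = classification_metrics(y_true, y_pred, positive="spam")
```

Accuracy alone can mislead when classes are imbalanced, which is why precision, recall, and F1 are usually reported alongside it.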

Week 7: Topic Modelling

Corpus parameters
Non-negative Matrix Factorisation (NMF)
Latent semantic analysis (LSA)
Latent Dirichlet allocation (LDA)
Visualising topic models

Week 8: Text Summarisation

Extractive methods
Topic representation modelling
Distributional semantics

Week 9: Semantic Relationships

Word embeddings
Similarity queries
Dimensionality reduction
Constructing a similarity tree

Week 10: Sentiment Analysis

Lexicon-based approaches
Supervised learning approach
Transfer learning approach
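
Of the three approaches, the lexicon-based one is the simplest to sketch: each word carries a polarity, and the polarities are summed over the text. The example below is a minimal illustration in standard-library Python. The six-word lexicon, the negator list, and the rule that a negator flips only the word immediately after it are simplifying assumptions, not a description of any published lexicon.

```python
import re

# Hypothetical polarity lexicon: +1 for positive words, -1 for negative words.
LEXICON = {"great": 1, "good": 1, "love": 1, "bad": -1, "poor": -1, "terrible": -1}
# Hypothetical negators; each flips the polarity of the next token only.
NEGATORS = {"not", "never", "no"}


def sentiment_score(text):
    """Sum word polarities, flipping a word that directly follows a negator."""
    tokens = re.findall(r"[a-z']+", text.lower())
    score = 0
    negate = False
    for token in tokens:
        if token in NEGATORS:
            negate = True
            continue
        polarity = LEXICON.get(token, 0)
        score += -polarity if negate else polarity
        negate = False  # negation applies to the immediately following token only
    return score


def sentiment_label(score):
    """Map a numeric score to a coarse label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


for review in ["The service was great", "The food was not good", "It was a table"]:
    print(review, "->", sentiment_label(sentiment_score(review)))
```

A rule this simple misses cases such as "not very good", which is one reason supervised and transfer learning approaches are also covered.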

Week 11: Review and Reflection

Limitations of web and text analytics
Future of web and text analytics
Implications and potential of Deep Learning in the context of web and text analytics
Review and reflection

Please note that these topics are often refined and subject to change. For up-to-date weekly topics and suggested reading resources, please refer to the Moodle subject page.