Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Exploring the application of Word2Vec to basket transaction data in the grocery retail industry

In this thesis, we explore the application of Word2vec to basket transaction data provided by a large grocery retailer in South Africa. Word2vec is an algorithm based on representation learning. The objective of the exploration is to establish whether the application of Word2vec to basket transactio...

Full description

Saved in:
Bibliographic Details
Main Author: De Swardt, Gideon Jacobus
Other Authors: Nyirenda, Juwa Chiza
Format: Thesis
Language:English
Published: Department of Statistical Sciences 2022
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613159492157440
access_status_str Open Access
author De Swardt, Gideon Jacobus
author2 Nyirenda, Juwa Chiza
author_browse De Swardt, Gideon Jacobus
Nyirenda, Juwa Chiza
author_facet Nyirenda, Juwa Chiza
De Swardt, Gideon Jacobus
author_sort De Swardt, Gideon Jacobus
collection Thesis
description In this thesis, we explore the application of Word2vec to basket transaction data provided by a large grocery retailer in South Africa. Word2vec is an algorithm based on representation learning. The objective of the exploration is to establish whether the application of Word2vec to basket transaction data would generate product embeddings that represent a useful relationship between products. Furthermore, we compareWord2vec's outputs and performance to traditional methods for studying product relationships which include Association Rules Mining (ARM) and Recommendation Systems. The results from the experiments showed that indeed product embeddings created by Word2vec on transaction data are meaningful and useful. It was clear that the idea of using transactions in the place of sentences to the neural network, provides analogous results to that of a natural language task. Word2vec clearly demonstrated its ability to cluster products that are homogeneous or fulfill similar needs. Furthermore this sort of product relationship was not provided by any other traditional methods, which was clear when comparing the outputs to that of ARM and Recommendation Systems. We also show that usingWord2vec could potentially provide insight on truly complementary products that ARM perhaps fails to do. Word2vec also proved to be incredibly scalable, taking input data of 20 times the size of what traditional methods could handle on a local computer. We end with a description of a potential application of the ideas learnt during the course of this study, with a real business problem, that we believe could lead to an enhanced customer shopping experience and in turn increase revenue and profits for the retailer.
format Thesis
id oai:open.uct.ac.za:11427/36434
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:31:43.046Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2022
publishDateRange 2022
publishDateSort 2022
publisher Department of Statistical Sciences
publisherStr Department of Statistical Sciences
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/36434 Exploring the application of Word2Vec to basket transaction data in the grocery retail industry De Swardt, Gideon Jacobus Nyirenda, Juwa Chiza statistical sciences In this thesis, we explore the application of Word2vec to basket transaction data provided by a large grocery retailer in South Africa. Word2vec is an algorithm based on representation learning. The objective of the exploration is to establish whether the application of Word2vec to basket transaction data would generate product embeddings that represent a useful relationship between products. Furthermore, we compareWord2vec's outputs and performance to traditional methods for studying product relationships which include Association Rules Mining (ARM) and Recommendation Systems. The results from the experiments showed that indeed product embeddings created by Word2vec on transaction data are meaningful and useful. It was clear that the idea of using transactions in the place of sentences to the neural network, provides analogous results to that of a natural language task. Word2vec clearly demonstrated its ability to cluster products that are homogeneous or fulfill similar needs. Furthermore this sort of product relationship was not provided by any other traditional methods, which was clear when comparing the outputs to that of ARM and Recommendation Systems. We also show that usingWord2vec could potentially provide insight on truly complementary products that ARM perhaps fails to do. Word2vec also proved to be incredibly scalable, taking input data of 20 times the size of what traditional methods could handle on a local computer. We end with a description of a potential application of the ideas learnt during the course of this study, with a real business problem, that we believe could lead to an enhanced customer shopping experience and in turn increase revenue and profits for the retailer. 2022-05-30T10:33:38Z 2022-05-30T10:33:38Z 2022 2022-05-30T10:29:42Z Master Thesis Masters MSc http://hdl.handle.net/11427/36434 eng application/pdf Department of Statistical Sciences Faculty of Science
spellingShingle statistical sciences
De Swardt, Gideon Jacobus
Exploring the application of Word2Vec to basket transaction data in the grocery retail industry
thesis_degree_str Master's
title Exploring the application of Word2Vec to basket transaction data in the grocery retail industry
title_full Exploring the application of Word2Vec to basket transaction data in the grocery retail industry
title_fullStr Exploring the application of Word2Vec to basket transaction data in the grocery retail industry
title_full_unstemmed Exploring the application of Word2Vec to basket transaction data in the grocery retail industry
title_short Exploring the application of Word2Vec to basket transaction data in the grocery retail industry
title_sort exploring the application of word2vec to basket transaction data in the grocery retail industry
topic statistical sciences
url http://hdl.handle.net/11427/36434
work_keys_str_mv AT deswardtgideonjacobus exploringtheapplicationofword2vectobaskettransactiondatainthegroceryretailindustry