Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Investigating the effect of paralogs on microarray gene-set analysis

Includes abstract.

Saved in:
Bibliographic Details
Main Author: Faure, André
Other Authors: Mulder, Nicola
Format: Thesis
Language:English
Published: Department of Molecular and Cell Biology 2014
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613224516452352
access_status_str Open Access
author Faure, André
author2 Mulder, Nicola
author_browse Faure, André
Mulder, Nicola
author_facet Mulder, Nicola
Faure, André
author_sort Faure, André
collection Thesis
description Includes abstract.
format Thesis
id oai:open.uct.ac.za:11427/4260
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:32:44.899Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2014
publishDateRange 2014
publishDateSort 2014
publisher Department of Molecular and Cell Biology
publisherStr Department of Molecular and Cell Biology
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/4260 Investigating the effect of paralogs on microarray gene-set analysis Faure, André Mulder, Nicola Seoighe, Cathal Cell Biology Includes abstract. Includes bibliographical references. In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge from databases such as the Gene Ontology (GO) or KEGG to group genes into sets based on their annotations. They aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. The objective is that this approach reveals sets of genes with subtle but coordinated behaviour implicating specific biological processes or pathways in the response under study. Several GSA methods have been proposed and debates have ensued on the statistical foundations of the different approaches and the various hypothesis tests used. In particular, criticism has been directed at methods that rely on a strict cut-off to determine significant genes and those that assume genes are expressed independently. We show that paralogs, which typically have high sequence identity and similar molecular functions also exhibit high correlation in their expression patterns. This, together with the fact that the calculation of gene-set significance by all GSA methods is influenced by the number of genes in the gene set, means that sets with high numbers of paralogs are ranked in a biased manner that reflects more the redundant and dependent nature of para logs than any biological phenomenon. 2014-07-30T17:37:33Z 2014-07-30T17:37:33Z 2008 Master Thesis Masters MSc http://hdl.handle.net/11427/4260 eng application/pdf Department of Molecular and Cell Biology Faculty of Science University of Cape Town
spellingShingle Cell Biology
Faure, André
Investigating the effect of paralogs on microarray gene-set analysis
thesis_degree_str Master's
title Investigating the effect of paralogs on microarray gene-set analysis
title_full Investigating the effect of paralogs on microarray gene-set analysis
title_fullStr Investigating the effect of paralogs on microarray gene-set analysis
title_full_unstemmed Investigating the effect of paralogs on microarray gene-set analysis
title_short Investigating the effect of paralogs on microarray gene-set analysis
title_sort investigating the effect of paralogs on microarray gene set analysis
topic Cell Biology
url http://hdl.handle.net/11427/4260
work_keys_str_mv AT faureandre investigatingtheeffectofparalogsonmicroarraygenesetanalysis