English   español  
Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/8383
Share/Impact:
Statistics
logo share SHARE   Add this article to your Mendeley library MendeleyBASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL
Exportar a otros formatos:

Title

Computational approaches to study transcriptional regulation in the human genome

AuthorsVaquerizas Erdocia, Juan Manuel
AdvisorLuscombe, Nicholas M.; Gómez Puertas, Paulino
KeywordsTranscripción genética
Modelos computacionales
Issue Date2008
PublisherUniversidad Autónoma de Madrid
AbstractIt is essential for an organism’s viability to ensure that the correct sets of genes are expressed in the right place and at the right time. There are several mechanisms by which cells regulate the amount of protein produced from genes under different conditions. One of the most basic is transcriptional regulation. By controlling the recruitment of RNA polymerase and associated factors to gene promoters, and the assembly of the transcription initiation complex, transcription factors regulate the transcriptional process, and therefore the expression of particular genes. A large number of human diseases are caused by malfunctions in transcriptional regulation, highlighting the importance of this system. Here I present a computational study of transcriptional regulation in the human genome. First I identify and analyse the properties of 1,369 sequence-specific DNA-binding transcription factors in the human genome. We show that: (i) 80% of transcription factors belong to just three protein families, with the C2H2-Zn finger family being the most common; ii) 40% of factors are spatially clustered in specific chromosomal regions, and as a result may function in a co-ordinated manner; iii) transcription factors either function specifically in one or two tissues or ubiquitously across the whole body, giving rise to a two-tier organisation of global and local regulators; and iv) groups of transcription factors have arisen in the human lineage at key events during evolution (such as the appearance of mammalian organisms). Secondly, I examine how sequence variation in the human genome, and in particular single nucleotide polymorphisms (SNPs), disrupt the normal function of the transcriptional regulatory system. I predict functional nucleotide sequence motifs (such as transcription factor binding sites and exonic splicing enhancers) inside or in the proximity of genes, and identify SNPs that overlap with them. Despite the simplicity of the approach, many of the predicted disruptive SNPs have been validated experimentally and have been associated with diseases.
Finally, none of the above results could have been obtained without the development of methods and tools required to perform a robust analysis of the data. In the past ten years the tandem development of high-throughput technology along with the sequencing of numerous genomes have produced a flood of data describing biological systems from a global perspective. These new data types often require special statistical or mathematical treatment in order to interpret them. I have devoted a large part of this dissertation towards creating methods and web-tools to analyse genomic data. These include approaches for: (i) cDNA microarray normalisation and quality control; (ii) identifying differentially expressed genes; (iii) building sets of genes with class prediction properties; (iv) performing transcription factor annotation of microarray experiments; (v) assessing the sensitivity and specificity of gene level measurements for Affymetrix GeneChips; (vi) detecting tissue-specific expression from microarray data; and (vii) detecting binding signal for ChIPchip tiling arrays experiments.
DescriptionTesis Doctoral inédita leída en la Universidad Autónoma de Madrid, Facultad de Ciencias, Departamento de Biología Molecular. Fecha de lectura: 22-02-2008
URIhttp://hdl.handle.net/10261/8383
Appears in Collections:(CBM) Tesis
Files in This Item:
File Description SizeFormat 
Juan Manuel Vaquerizas.pdf11,44 MBAdobe PDFThumbnail
View/Open
Show full item record
Review this work
 


WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.