English   español  
Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/3260
logo share SHARE logo core CORE   Add this article to your Mendeley library MendeleyBASE

Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE
Exportar a otros formatos:


Fast comparison of DNA sequences by oligonucleotide profiling

AuthorsArnau, Vicente; Gallach, Miguel; Marín, Ignacio
KeywordsDNA sequence comparison
Oligonucleotide Profiling
Issue Date28-Feb-2008
PublisherBioMed Central
CitationBMC Research Notes 2008, 1:5
Abstract[Background] The comparison of DNA sequences is a traditional problem in genomics and bioinformatics. Many new opportunities emerge due to the improvement of personal computers, allowing the implementation of novel strategies of analysis.
[Findings] We describe a new program, called UVWORD, which determines the number of times that each DNA word present in a sequence (target) is found in a second sequence (source), a procedure that we have called oligonucleotide profiling. On a standard computer, the user may search for words of a size ranging from k = 1 to k = 14 nucleotides. Average counts for groups of contiguous words may also be established. The rate of analysis on standard computers is from 3.4 (k = 14) to 16 millions of words per second (when k = 1 - 8). This makes feasible the fast screening of even the longest known DNA molecules.
[Discussion] We show that the combination of the ability of analyzing words of relatively long size, which occur very rarely by chance, and the fast speed of the program allows to perform novel types of screenings, complementary to those provided by standard programs such as BLAST. This method can be used to determine oligonucleotide content, to characterize the distribution of repetitive sequences in chromosomes, to determine the evolutionary conservation of sequences in different species, to establish regions of similar DNA among chromosomes or genomes, etc.
DescriptionProvisional abstact and full-text PDF files correspond to the article as it appeared upon acceptance. Fully formatted PDF and final abstract will be made available soon.
Technical Note
Appears in Collections:(IBV) Artículos
Files in This Item:
File Description SizeFormat 
Fast_comparison_provisional.pdfMain text of the paper153,94 kBAdobe PDFThumbnail
Fast_comparison_provisional_supp1.pdfAdditional file: Summary of biological examples402,85 kBAdobe PDFThumbnail
Show full item record
Review this work

WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.