El carácter lingüístico y la coherencia textual presentes en el ADN ‘no codificante’
DOI:
https://doi.org/10.22201/iia.14055066p.2007.22542Keywords:
genomics, Statistical linguistics, intronicDNAAbstract
In this work it is theoreticized that, much in the same manner that one can notice that words are not randomly distributed in a coherently ordered text, it is also possible to see that the order in the nucleotide disposition in a genomic fragment is a proof of coherence. And it is also, a proof of codification (in the sense of information content) of the entire genome. It is intended to show through statistical methods that the so-called non-coding DNA (nc-DNA) has a nonrandom probability distribution in a similar fashion to a coherently ordered text. We studied the probability distributions of “words” associated with nc-DNA by means of statistical methods in order to dilucidate its linguistic properties. We briefly discuss the implications of the results we found, as well as some possible applications. In order to study a wide range of species, we analyze a complete bacterial genome (Mycoplasma pneumoniae), as well as fragments of the fruit fly (Drosophila melanogaster), a kind of domestic cat (Felis catus), a tree
Downloads
Downloads
How to Cite
Issue
Section
License

http://creativecommons.org/licenses/by-nc-nd/4.0/