Google matrix analysis of DNA sequences

Vivek Kandiah; Dima Shepelyansky

doi:10.1371/journal.pone.0061519

Article Dans Une Revue PLoS ONE Année : 2013

Google matrix analysis of DNA sequences

(1) , (1)

Vivek Kandiah

Fonction : Auteur

Cohérence Quantique (LPT)

Dima Shepelyansky

Fonction : Auteur
PersonId : 828690
ORCID : 0000-0002-2752-0765

Cohérence Quantique (LPT)

Résumé

For DNA sequences of various species we construct the Google matrix G of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of G is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

Mots clés

PageRank Google matrix DNA sequences

Domaines

Recherche d'information [cs.IR] Physique et Société [physics.soc-ph]

Dima Shepelyansky : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00772069

Soumis le : mercredi 9 janvier 2013-19:59:21

Dernière modification le : jeudi 23 novembre 2023-10:42:03

Dates et versions

hal-00772069 , version 1 (09-01-2013)

Identifiants

HAL Id : hal-00772069 , version 1
ARXIV : 1301.1626
DOI : 10.1371/journal.pone.0061519
PUBMEDCENTRAL : PMC3650020

Citer

Vivek Kandiah, Dima Shepelyansky. Google matrix analysis of DNA sequences. PLoS ONE, 2013, 8(5), pp.e61519. ⟨10.1371/journal.pone.0061519⟩. ⟨hal-00772069⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IRSAMC LPT CNRS LPT_ICQ UNIV-UT3 UT3-TOULOUSEINP

96 Consultations

1 Téléchargements

Google matrix analysis of DNA sequences

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager