untitled

<OAI-PMH schemaLocation=http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd> <responseDate>2018-01-17T12:07:41Z</responseDate> <request identifier=oai:HAL:hal-01544795v1 verb=GetRecord metadataPrefix=oai_dc>http://api.archives-ouvertes.fr/oai/hal/</request> <GetRecord> <record> <header> <identifier>oai:HAL:hal-01544795v1</identifier> <datestamp>2018-01-11</datestamp> <setSpec>type:ART</setSpec> <setSpec>subject:sdv</setSpec> <setSpec>collection:EVOL_PARIS_SEINE-AIRE</setSpec> <setSpec>collection:EVOLUTION_PARIS_SEINE</setSpec> <setSpec>collection:UPMC</setSpec> <setSpec>collection:CNRS</setSpec> <setSpec>collection:UNIV-AG</setSpec> <setSpec>collection:UNICE</setSpec> <setSpec>collection:SAE</setSpec> <setSpec>collection:GIP-BE</setSpec> <setSpec>collection:UPMC_POLE_4</setSpec> <setSpec>collection:IBPS</setSpec> <setSpec>collection:UCA-TEST</setSpec> <setSpec>collection:UNIV-COTEDAZUR</setSpec> </header> <metadata><dc> <publisher>HAL CCSD</publisher> <title lang=en>MosaicFinder: identification of fused gene families in sequence similarity networks</title> <creator>Jachiet, Pierre-Alain</creator> <creator>Pogorelcnik, Romain</creator> <creator>Berry, Anne</creator> <creator>Lopez, Philippe</creator> <creator>Bapteste, Eric</creator> <contributor>Adaptation, Intégration, Réticulation et Evolution (AIRE) ; Systématique, adaptation, évolution (SAE) ; Université Pierre et Marie Curie - Paris 6 (UPMC) - Centre National de la Recherche Scientifique (CNRS) - Université Pierre et Marie Curie - Paris 6 (UPMC) - Centre National de la Recherche Scientifique (CNRS) - Evolution Paris Seine ; Université Nice Sophia Antipolis (UNS) ; Université Côte d'Azur (UCA) - Université Côte d'Azur (UCA) - Centre National de la Recherche Scientifique (CNRS) - Université des Antilles et de la Guyane (UAG) - Université Pierre et Marie Curie - Paris 6 (UPMC) - Université Nice Sophia Antipolis (UNS) ; Université Côte d'Azur (UCA) - Université Côte d'Azur (UCA) - Université des Antilles et de la Guyane (UAG)</contributor> <contributor>AMX PhD grant</contributor> <contributor> French Agency for Research (DEFIS program TODO)</contributor> <contributor> [ANR-09-EMER-010]</contributor> <description>International audience</description> <source>ISSN: 1367-4803</source> <source>EISSN: 1460-2059</source> <source>Bioinformatics</source> <publisher>Oxford University Press (OUP)</publisher> <identifier>hal-01544795</identifier> <identifier>https://hal.archives-ouvertes.fr/hal-01544795</identifier> <source>https://hal.archives-ouvertes.fr/hal-01544795</source> <source>Bioinformatics, Oxford University Press (OUP), 2013, 29 (7), pp.837-844. 〈10.1093/bioinformatics/btt049〉</source> <identifier>DOI : 10.1093/bioinformatics/btt049</identifier> <relation>info:eu-repo/semantics/altIdentifier/doi/10.1093/bioinformatics/btt049</relation> <language>en</language> <subject>[SDV.BID] Life Sciences [q-bio]/Biodiversity</subject> <type>info:eu-repo/semantics/article</type> <type>Journal articles</type> <description lang=en>Motivation: Gene fusion is an important evolutionary process. It can yield valuable information to infer the interactions and functions of proteins. Fused genes have been identified as non-transitive patterns of similarity in triplets of genes. To be computationally tractable, this approach usually imposes an a priori distinction between a dataset in which fused genes are searched for, and a dataset that may have provided genetic material for fusion. This reduces the `genetic space' in which fusion can be discovered, as only a subset of triplets of genes is investigated. Moreover, this approach may have a high-false-positive rate, and it does not identify gene families descending from a common fusion event. Results: We represent similarities between sequences as a network. This leads to an efficient formulation of previous methods of fused gene identification, which we implemented in the Python program FusedTriplets. Furthermore, we propose a new characterization of families of fused genes, as clique minimal separators of the sequence similarity network. This well-studied graph topology provides a robust and fast method of detection, well suited for automatic analyses of big datasets. We implemented this method in the C++ program MosaicFinder, which additionally uses local alignments to discard false-positive candidates and indicates potential fusion points. The grouping into families will help distinguish sequencing or prediction errors from real biological fusions, and it will yield additional insight into the function and history of fused genes.</description> <date>2013-04</date> </dc> </metadata> </record> </GetRecord> </OAI-PMH>