Publications

association rules astrostatistics big data bioinformatics clustering deep learning education energy feature selection forecasting IoT natural disasters pattern recognition precision agriculture time series transfer learning XAI

Show all

68 entries « ‹ 1 of 2 › »

2023

O. Cardozo and V. Ojeda and R. Parra and J. C. Mello-Román and J. L. Noguera Vázquez and M. García-Torres and F. Divina and S. Grillo and C. Villalba and J. Facon

Dataset of fundus images for the diagnosis of ocular toxoplasmosis Journal Article

In: Data in Brief, pp. 109056, 2023.

Abstract | Links | BibTeX | Tags: bioinformatics

M. Vázquez-Marrufo and E. Sarrias-Arrabal and M. García-Torres and R. Martín-Clemente and G. Izquierdo

A systematic review of the application of machine-learning algorithms in multiple sclerosis Journal Article

In: Neurología (English Edition), 2023.

Abstract | Links | BibTeX | Tags: bioinformatics

2022

F. Delgado-Chaves and P. M. Martínez-García and A. Herrero-Ruiz and F. Gómez-Vela and F. Divina and S. Jimeno-González and F. Cortés-Ledesma

Data of transcriptional effects of the merbarone-mediated inhibition of TOP2 Journal Article

In: Data in Brief, vol. 44, pp. 108499, 2022.

Abstract | Links | BibTeX | Tags: bioinformatics

D. Aquino-Brítez and J.A. Gómez and J.L. Vázquez Noguera and M. García-Torres and J.C. Mello Román and P.E. Gardel-Sotomayor and V.E. Castillo Benitez and I. Castro Matto and D.P. Pinto-Roa and J. Facon and S.A. Grillo

Automatic Diagnosis of Diabetic Retinopathy from Fundus Images Using Neuro-Evolutionary Algorithms Journal Article

In: Studies in Health Technology and Informatics, vol. 290, pp. 689–693, 2022.

Abstract | Links | BibTeX | Tags: bioinformatics

2021

A. J. Pérez-Pulido and G. Asencio-Cortés and A. M. Brokate-Llanos and G. Brea-Calvo and M. R. Rodríguez-Griñolo and A. Garzón and M. J. Muñoz

Serial co-expression analysis of host factors from SARS-CoV viruses highly converges with former high-throughput screenings and proposes key regulators Journal Article

In: Briefings in Bioinformatics, vol. 22, no. 2, pp. 1038–1052, 2021.

Abstract | Links | BibTeX | Tags: bioinformatics

@article{pulido2021,

title = {Serial co-expression analysis of host factors from SARS-CoV viruses highly converges with former high-throughput screenings and proposes key regulators},

author = {A. J. Pérez-Pulido and G. Asencio-Cortés and A. M. Brokate-Llanos and G. Brea-Calvo and M. R. Rodríguez-Griñolo and A. Garzón and M. J. Muñoz},

url = {https://academic.oup.com/bib/article/22/2/1038/6103172},

doi = {10.1093/bib/bbaa419},

year  = {2021},

date = {2021-01-01},

journal = {Briefings in Bioinformatics},

volume = {22},

number = {2},

pages = {1038--1052},

abstract = {The current genomics era is bringing an unprecedented growth in the amount of gene expression data, only comparable to the exponential growth of sequences in databases during the last decades. This data allow the design of secondary analyses that take advantage of this information to create new knowledge. One of these feasible analyses is the evaluation of the expression level for a gene through a series of different conditions or cell types. Based on this idea, we have developed Automatic and Serial Analysis of CO-expression, which performs expression profiles for a given gene along hundreds of heterogeneous and normalized transcriptomics experiments and discover other genes that show either a similar or an inverse behavior. It might help to discover co-regulated genes, and common transcriptional regulators in any biological model. The present severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic is an opportunity to test this novel approach due to the wealth of data that are being generated, which could be used for validating results. Thus, we have identified 35 host factors in the literature putatively involved in the infectious cycle of SARS-CoV viruses and searched for genes tightly co-expressed with them. We have found 1899 co-expressed genes whose assigned functions are strongly related to viral cycles. Moreover, this set of genes heavily overlaps with those identified by former laboratory.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

R. Parra and V. Ojeda and J.L. Vázquez Noguera and M. García-Torres and J.C. Mello-Román and C. Villalba and J. Facon and F. Divina and O. Cardozo and V. Castillo

A Trust-Based Methodology to Evaluate Deep Learning Models for Automatic Diagnosis of Ocular Toxoplasmosis from Fundus Images Journal Article

In: Diagnostics, vol. 11, no. 11, pp. 1951, 2021.

Links | BibTeX | Tags: bioinformatics, deep learning, pattern recognition

P.M. Martínez-García and M. García-Torres and F. Divina and J. Terrón-Bautista and I. Delgado-Sainz and F. Gómez-Vela and F. Cortés-Ledesma

Genome-wide prediction of topoisomerase II $beta$ binding by architectural factors and chromatin accessibility Journal Article

In: PLoS computational biology, vol. 17, no. 1, pp. e1007814, 2021.

Links | BibTeX | Tags: bioinformatics

A. Lopez-Fernandez and D. Rodriguez-Baena and F. Gomez-Vela and F. Divina and M. Garcia-Torres

A multi-GPU biclustering algorithm for binary datasets Journal Article

In: Journal of Parallel and Distributed Computing, vol. 147, pp. 209–219, 2021.

Links | BibTeX | Tags: bioinformatics, pattern recognition

V.E. Castillo Benítez and I. Castro Matto and J.C. Mello Román and J.L. Vázquez Noguera and M. García-Torres and J. Ayala and D.P. Pinto-Roa and P.E. Gardel-Sotomayor and J. Facon and S.A. Grillo

Dataset from fundus images for the study of diabetic retinopathy Journal Article

In: Data in Brief, vol. 36, pp. 107068, 2021.

Abstract | Links | BibTeX | Tags: bioinformatics

H. Ho Shin and C. Sauer Ayala and P. Pérez-Estigarribia and S.A. Grillo and L. Segovia-Cabrera and M. García-Torres and C. Gaona and S. Irala and M.E. Pedrozo and G. Sequera and J.L. Vázquez Noguera and E. De Los Santos

A Mathematical Model for COVID-19 with Variable Transmissibility and Hospitalizations: A Case Study in Paraguay Journal Article

In: Applied Sciences, vol. 11, no. 20, pp. 9726, 2021.

Abstract | Links | BibTeX | Tags: bioinformatics

2020

L. Melgar-García and D. Gutiérrez-Avilés and C. Rubio-Escudero and A. Troncoso

High-content screening images streaming analysis using the STriGen methodology Conference

SAC 35th Annual ACM Symposium on Applied Computing, 2020.

Links | BibTeX | Tags: bioinformatics

F. M. Delgado-Chaves and F. Gómez-Vela and F. Divina and M. García-Torres and D. S. Rodríguez-Baena

Computational Analysis of the Global Effects of Ly6E in the Immune Response to Coronavirus Infection Using Gene Networks Journal Article

In: Genes, vol. 11, no. 7, pp. 831-864, 2020.

Abstract | BibTeX | Tags: bioinformatics

@article{Delgado-Chaves20,

title = {Computational Analysis of the Global Effects of Ly6E in the Immune Response to Coronavirus Infection Using Gene Networks},

author = {F. M. Delgado-Chaves and F. Gómez-Vela and F. Divina and M. García-Torres and D. S. Rodríguez-Baena},

year  = {2020},

date = {2020-01-01},

journal = {Genes},

volume = {11},

number = {7},

pages = {831-864},

abstract = {Gene networks have arisen as a promising tool in the comprehensive modeling and analysis of complex diseases. Particularly in viral infections, the understanding of the host-pathogen mechanisms, and the immune response to these, is considered a major goal for the rational design of appropriate therapies. For this reason, the use of gene networks may well encourage therapy-associated research in the context of the coronavirus pandemic, orchestrating experimental scrutiny and reducing costs. In this work, gene co-expression networks were reconstructed from RNA-Seq expression data with the aim of analyzing the time-resolved effects of gene Ly6E in the immune response against the coronavirus responsible for murine hepatitis (MHV). Through the integration of differential expression analyses and reconstructed networks exploration, significant differences in the immune response to virus were observed in Ly6E∆HSC compared to wild type animals. Results show that Ly6E ablation at hematopoietic stem cells (HSCs) leads to a progressive impaired immune response in both liver and spleen. Specifically, depletion of the normal leukocyte mediated immunity and chemokine signaling is observed in the liver of Ly6E∆HSC mice. On the other hand, the immune response in the spleen, which seemed to be mediated by an intense chromatin activity in the normal situation, is replaced by ECM remodeling in Ly6E∆HSC mice. These findings, which require further experimental characterization, could be extrapolated to other coronaviruses and motivate the efforts towards novel antiviral approaches.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

T. Vanhaeren and F. Divina and M. García-Torres and F. Gómez-Vela and W. Vanhoof and P. M. Martínez-García

A Comparative Study of Supervised Machine Learning Algorithms for the Prediction of Long-Range Chromatin Interactions Journal Article

In: Genes, vol. 11, no. 9, pp. 985, 2020.

Abstract | BibTeX | Tags: bioinformatics

@article{Vanhaeren20,

title = {A Comparative Study of Supervised Machine Learning Algorithms for the Prediction of Long-Range Chromatin Interactions},

author = {T. Vanhaeren and

F. Divina and

M. García-Torres and

F. Gómez-Vela and

W. Vanhoof and

P. M. Martínez-García},

year = {2020},

date = {2020-01-01},

journal = {Genes},

volume = {11},

number = {9},

pages = {985},

abstract = {The role of three-dimensional genome organization as a critical regulator of gene expression has become increasingly clear over the last decade. Most of our understanding of this association comes from the study of long range chromatin interaction maps provided by Chromatin Conformation Capture-based techniques, which have greatly improved in recent years. Since these procedures are experimentally laborious and expensive, in silico prediction has emerged as an alternative strategy to generate virtual maps in cell types and conditions for which experimental data of chromatin interactions is not available. Several methods have been based on predictive models trained on one-dimensional (1D) sequencing features, yielding promising results. However, different approaches vary both in the way they model chromatin interactions and in the machine learning-based strategy they rely on, making it challenging to carry out performance comparison of existing methods. In this study, we use publicly available 1D sequencing signals to model cohesin-mediated chromatin interactions in two human cell lines and evaluate the prediction performance of six popular machine learning algorithms: decision trees, random forests, gradient boosting, support vector machines, multi-layer perceptron and deep learning. Our approach accurately predicts long-range interactions and reveals that gradient boosting significantly outperforms the other five methods, yielding accuracies of about 95%. We show that chromatin features in close genomic proximity to the anchors cover most of the predictive information, as has been previously reported. Moreover, we demonstrate that gradient boosting models trained with different subsets of chromatin features, unlike the other methods tested, are able to produce accurate predictions. In this regard, and besides architectural proteins, transcription factors are shown to be highly informative. Our study provides a framework for the systematic prediction of long-range chromatin interactions, identifies gradient boosting as the best suited algorithm for this task and highlights cell-type specific binding of transcription factors at the anchors as important determinants of chromatin wiring mediated by cohesin},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

The role of three-dimensional genome organization as a critical regulator of gene expression has become increasingly clear over the last decade. Most of our understanding of this association comes from the study of long range chromatin interaction maps provided by Chromatin Conformation Capture-based techniques, which have greatly improved in recent years. Since these procedures are experimentally laborious and expensive, in silico prediction has emerged as an alternative strategy to generate virtual maps in cell types and conditions for which experimental data of chromatin interactions is not available. Several methods have been based on predictive models trained on one-dimensional (1D) sequencing features, yielding promising results. However, different approaches vary both in the way they model chromatin interactions and in the machine learning-based strategy they rely on, making it challenging to carry out performance comparison of existing methods. In this study, we use publicly available 1D sequencing signals to model cohesin-mediated chromatin interactions in two human cell lines and evaluate the prediction performance of six popular machine learning algorithms: decision trees, random forests, gradient boosting, support vector machines, multi-layer perceptron and deep learning. Our approach accurately predicts long-range interactions and reveals that gradient boosting significantly outperforms the other five methods, yielding accuracies of about 95%. We show that chromatin features in close genomic proximity to the anchors cover most of the predictive information, as has been previously reported. Moreover, we demonstrate that gradient boosting models trained with different subsets of chromatin features, unlike the other methods tested, are able to produce accurate predictions. In this regard, and besides architectural proteins, transcription factors are shown to be highly informative. Our study provides a framework for the systematic prediction of long-range chromatin interactions, identifies gradient boosting as the best suited algorithm for this task and highlights cell-type specific binding of transcription factors at the anchors as important determinants of chromatin wiring mediated by cohesin

2019

F. M Delgado-Chaves and F. Gómez-Vela and M. García-Torres and F. Divina and J.L. Vázquez Noguera

Computational Inference of Gene Co-Expression Networks for the identification of Lung Carcinoma Biomarkers: An Ensemble Approach Journal Article

In: Genes, vol. 10, no. 12, pp. 962, 2019.

Abstract | Links | BibTeX | Tags: bioinformatics

F. Gómez-Vela and F. M Delgado-Chaves and D.S. Rodríguez-Baena and M. García-Torres and F. Divina

Ensemble and Greedy Approach for the Reconstruction of Large Gene Co-Expression Networks Journal Article

In: Entropy, vol. 21, no. 12, pp. 1139, 2019.

Abstract | Links | BibTeX | Tags: bioinformatics

E.L. Mangas and A. Rubio and R. Álvarez-Marín and G. Labrador-Herrera and J. Pachón and M. Eugenia Pachón-Ibáñez and F. Divina and A.J. Pérez-Pulido

Pangenome of Acinetobacter baumannii uncovers two groups of genomes, one of them with genes involved in CRISPR/Cas defence systems associated with the absence of plasmids and exclusive genes for biofilm formation Journal Article

In: Microbial Genomics, pp. mgen000309, 2019.

Abstract | Links | BibTeX | Tags: bioinformatics

@article{MG2019,

title = {Pangenome of Acinetobacter baumannii uncovers two groups of genomes, one of them with genes involved in CRISPR/Cas defence systems associated with the absence of plasmids and exclusive genes for biofilm formation},

author = {E.L. Mangas and A. Rubio and R. Álvarez-Marín and G. Labrador-Herrera and J. Pachón and M. Eugenia Pachón-Ibáñez and F. Divina and A.J. Pérez-Pulido},

url = {https://www.microbiologyresearch.org/content/journal/mgen/10.1099/mgen.0.000309},

doi = {https://doi.org/10.1099/mgen.0.000309},

year  = {2019},

date = {2019-01-01},

journal = {Microbial Genomics},

pages = {mgen000309},

abstract = {Acinetobacter baumannii is an opportunistic bacterium that causes hospital-acquired infections with a high mortality and morbidity, since there are strains resistant to virtually any kind of antibiotic. The chase to find novel strategies to fight against this microbe can be favoured by knowledge of the complete catalogue of genes of the species, and their relationship with the specific characteristics of different isolates. In this work, we performed a genomics analysis of almost 2500 strains. Two different groups of genomes were found based on the number of shared genes. One of these groups rarely has plasmids, and bears clustered regularly interspaced short palindromic repeat (CRISPR) sequences, in addition to CRISPR-associated genes (cas genes) or restriction-modification system genes. This fact strongly supports the lack of plasmids. Furthermore, the scarce plasmids in this group also bear CRISPR sequences, and specifically contain genes involved in prokaryotic toxin–antitoxin systems that could either act as the still little known CRISPR type IV system or be the precursors of other novel CRISPR/Cas systems. In addition, a limited set of strains present a new cas9-like gene, which may complement the other cas genes in inhibiting the entrance of new plasmids into the bacteria. Finally, this group has exclusive genes involved in biofilm formation, which would connect CRISPR systems to the biogenesis of these bacterial resistance structures.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

2018

D. Gutiérrez-Avilés and R. Giráldez and F. J. Gil-Cumbreras and C. Rubio-Escudero

TRIQ: a new method to evaluate triclusters Journal Article

In: BioData Mining, vol. 11, no. 1, pp. 15, 2018.

Abstract | Links | BibTeX | Tags: bioinformatics, time series

J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz

Pairwise gene GO-based measures for biclustering of high-dimensional expression data Journal Article

In: BioData Mining, vol. 11, no. 4, 2018.

Abstract | Links | BibTeX | Tags: bioinformatics

@article{BIODM2018,

title = {Pairwise gene GO-based measures for biclustering of high-dimensional expression data},

author = {J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz},

url = {https://www.ncbi.nlm.nih.gov/pubmed/29610579},

doi = {10.1186/s13040-018-0165-9},

year  = {2018},

date = {2018-01-01},

journal = {BioData Mining},

volume = {11},

number = {4},

abstract = {BACKGROUND: Biclustering algorithms search for groups of genes that share the same behavior under a subset of samples in gene expression data. Nowadays, the biological knowledge available in public repositories can be used to drive these algorithms to find biclusters composed of groups of genes functionally coherent. On the other hand, a distance among genes can be defined according to their information stored in Gene Ontology (GO). Gene pairwise GO semantic similarity measures report a value for each pair of genes which establishes their functional similarity. A scatter search-based algorithm that optimizes a merit function that integrates GO information is studied in this paper. This merit function uses a term that addresses the information through a GO measure. RESULTS: The effect of two possible different gene pairwise GO measures on the performance of the algorithm is analyzed. Firstly, three well known yeast datasets with approximately one thousand of genes are studied. Secondly, a group of human datasets related to clinical data of cancer is also explored by the algorithm. Most of these data are high-dimensional datasets composed of a huge number of genes. The resultant biclusters reveal groups of genes linked by a same functionality when the search procedure is driven by one of the proposed GO measures. Furthermore, a qualitative biological study of a group of biclusters show their relevance from a cancer disease perspective. CONCLUSIONS: It can be concluded that the integration of biological information improves the performance of the biclustering process. The two different GO measures studied show an improvement in the results obtained for the yeast dataset. However, if datasets are composed of a huge number of genes, only one of them really improves the algorithm performance. This second case constitutes a clear option to explore interesting datasets from a clinical point of view.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

P. Manuel Martínez-García and M. García-Torres and F. Divina and F. Gómez-Vela and F. Cortés-Ledesma

Analysis of Relevance and Redundance on Topoisomerase 2b (TOP2B) Binding Sites: A Feature Selection Approach Conference

International Conference on the Applications of Evolutionary Computation, 2018.

Links | BibTeX | Tags: bioinformatics

E. Pereda and M. García-Torres and B. Melián and S. Ma~nas and L. Méndez and J. González

The Blessing of Dimensionality: Feature Selection Outperforms Functional Connectivity-based Feature Transformation to Classify ADHD Subjects from EEG Patterns of Phase Synchronisation Journal Article

In: PLoS ONE, vol. 13, no. 8, 2018.

Links | BibTeX | Tags: bioinformatics

2016

D. Gutiérrez-Avilés and C. Rubio-Escudero

TRIQ: A Comprehensive Evaluation Measure for Triclustering Algorithms Conference

Hybrid Artificial Intelligent Systems: 11th International Conference, HAIS 2016, Seville, Spain, April 18-20, 2016, Proceedings, Lecture Notes in Computer Science 2016.

Links | BibTeX | Tags: bioinformatics, time series

J. A. Nepomuceno and A. Troncoso and I. Nepomuceno and J. S. Aguilar-Ruiz

Biclustering of gene expression data based on SimUI semantic similarity measure Conference

HAIS 11th International Conference on Hybrid Artificial Intelligence Systems, Lecture Notes in Computer Science 2016.

Links | BibTeX | Tags: bioinformatics

2015

D. Gutiérrez-Avilés and C. Rubio-Escudero

MSL: A Measure to Evaluate Three-dimensional Patterns in Gene Expression Data Journal Article

In: Evolutionary Bioinformatics, vol. 11, pp. 121—135, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, time series

J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz

Scatter Search-based identification of local patterns with positive and negative correlations in gene expression data Journal Article

In: Applied Soft Computing, vol. 35, pp. 637-651, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics

A. E. Marquez-Chamorro and G. Asencio-Cortes and C. E. Santiesteban-Toca and J. S. Aguilar-Ruiz

Soft computing methods for the prediction of protein tertiary structures: A survey Journal Article

In: Applied Soft Computing, no. 35, pp. 398-410, 2015, ISSN: 1568-4946.

Abstract | Links | BibTeX | Tags: bioinformatics

G. Asencio-Cortes and J. S. Aguilar-Ruiz and A. E. Marquez-Chamorro

An Efficient Nearest Neighbor Method for Protein Contact Prediction Conference

Hybrid Artificial Intelligent Systems, 2015, ISBN: 978-3-319-19644-2.

Abstract | BibTeX | Tags: bioinformatics

J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz

Integrating biological knowledge based on functional annotations for biclustering of gene expression data Journal Article

In: Computers Methods and Programs in Biomedicine, vol. 119, no. 3, pp. 163-180, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics

@article{CMPB2015,

title = {Integrating biological knowledge based on functional annotations for biclustering of gene expression data},

author = {J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz},

url = {https://www.sciencedirect.com/science/article/pii/S0169260715000450},

doi = {10.1016/j.cmpb.2015.02.010},

year  = {2015},

date = {2015-00-00},

journal = {Computers Methods and Programs in Biomedicine},

volume = {119},

number = {3},

pages = {163-180},

abstract = {Gene expression data analysis is based on the assumption that co-expressed genes imply co-regulated genes. This assumption is being reformulated because the co-expression of a group of genes may be the result of an independent activation with respect to the same experimental condition and not due to the same regulatory regime. For this reason, traditional techniques are recently being improved with the use of prior biological knowledge from open-access repositories together with gene expression data. Biclustering is an unsupervised machine learning technique that searches patterns in gene expression data matrices. A scatter search-based biclustering algorithm that integrates biological information is proposed in this paper. In addition to the gene expression data matrix, the input of the algorithm is only a direct annotation file that relates each gene to a set of terms from a biological repository where genes are annotated. Two different biological measures, FracGO and SimNTO, are proposed to integrate this information by means of its addition to-be-optimized fitness function in the scatter search scheme. The measure FracGO is based on the biological enrichment and SimNTO is based on the overlapping among GO annotations of pairs of genes. Experimental results evaluate the proposed algorithm for two datasets and show the algorithm performs better when biological knowledge is integrated. Moreover, the analysis and comparison between the two different biological measures is presented and it is concluded that the differences depend on both the data source and how the annotation file has been built in the case GO is used. It is also shown that the proposed algorithm obtains a greater number of enriched biclusters than other classical biclustering algorithms typically used as benchmark and an analysis of the overlapping among biclusters reveals that the biclusters obtained present a low overlapping. The proposed methodology is a general-purpose algorithm which allows the integration of biological information from several sources and can be extended to other biclustering algorithms based on the optimization of a merit function.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

Gene expression data analysis is based on the assumption that co-expressed genes imply co-regulated genes. This assumption is being reformulated because the co-expression of a group of genes may be the result of an independent activation with respect to the same experimental condition and not due to the same regulatory regime. For this reason, traditional techniques are recently being improved with the use of prior biological knowledge from open-access repositories together with gene expression data. Biclustering is an unsupervised machine learning technique that searches patterns in gene expression data matrices. A scatter search-based biclustering algorithm that integrates biological information is proposed in this paper. In addition to the gene expression data matrix, the input of the algorithm is only a direct annotation file that relates each gene to a set of terms from a biological repository where genes are annotated. Two different biological measures, FracGO and SimNTO, are proposed to integrate this information by means of its addition to-be-optimized fitness function in the scatter search scheme. The measure FracGO is based on the biological enrichment and SimNTO is based on the overlapping among GO annotations of pairs of genes. Experimental results evaluate the proposed algorithm for two datasets and show the algorithm performs better when biological knowledge is integrated. Moreover, the analysis and comparison between the two different biological measures is presented and it is concluded that the differences depend on both the data source and how the annotation file has been built in the case GO is used. It is also shown that the proposed algorithm obtains a greater number of enriched biclusters than other classical biclustering algorithms typically used as benchmark and an analysis of the overlapping among biclusters reveals that the biclusters obtained present a low overlapping. The proposed methodology is a general-purpose algorithm which allows the integration of biological information from several sources and can be extended to other biclustering algorithms based on the optimization of a merit function.

2014

D. Gutiérrez-Avilés and C. Rubio-Escudero

Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure Journal Article

In: The Scientific World Journal, vol. 2014, pp. 1-16, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, time series

D. Gutiérrez-Avilés and C. Rubio-Escudero

LSL: A new measure to evaluate triclusters Conference

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2014.

Links | BibTeX | Tags: bioinformatics, time series

A. E. Marquez-Chamorro and G. Asencio-Cortes and F. Divina and J. S. Aguilar-Ruiz

Evolutionary decision rules for predicting protein contact maps Journal Article

In: Pattern Analysis and Applications, vol. 4, no. 17, pp. 725-737, 2014, ISSN: 1433-7541.

Abstract | Links | BibTeX | Tags: bioinformatics

D. Gutiérrez-Avilés and C. Rubio-Escudero and F. Martínez-Álvarez and J.C. Riquelme

TriGen: A genetic algorithm to mine triclusters in temporal gene expression data Journal Article

In: Neurocomputing, vol. 132, pp. 42-53, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics

2013

M. García-Torres and R. Arma~nanzas and C. Bielza and P. Larra~naga

Comparison of metaheuristic strategies for peakbin selection in proteomic mass spectrometry data Journal Article

In: Information Sciences, vol. 222, pp. 229-246, 2013.

Links | BibTeX | Tags: bioinformatics, feature selection

2012

G. Asencio-Cortes and J. S. Aguilar-Ruiz and A. E. Marquez-Chamorro and R. Ruiz and C. E. Santiesteban-Toca

Prediction of Mitochondrial Matrix Protein Structures Based on Feature Selection and Fragment Assembly Conference

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, Springer Berlin Heidelberg, Berlin, Heidelberg, 2012, ISBN: 978-3-642-29066-4.

Abstract | BibTeX | Tags: bioinformatics

@conference{10.1007/978-3-642-29066-4_14b,

title = {Prediction of Mitochondrial Matrix Protein Structures Based on Feature Selection and Fragment Assembly},

author = {G. Asencio-Cortes and J. S. Aguilar-Ruiz and A. E. Marquez-Chamorro and R. Ruiz and C. E. Santiesteban-Toca},

editor = {Giacobini, Mario and Vanneschi, Leonardo and Bush, William S.},

isbn = {978-3-642-29066-4},

year  = {2012},

date = {2012-01-01},

booktitle = {Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics},

pages = {156-167},

publisher = {Springer Berlin Heidelberg},

address = {Berlin, Heidelberg},

abstract = {Protein structure prediction consists in determining the thre-e-dimensional conformation of a protein based only on its amino acid sequence. This is currently a difficult and significant challenge in structural bioinformatics because these structures are necessary for drug designing. This work proposes a method that reconstructs protein structures from protein fragments assembled according to their physico-chemical similarities, using information extracted from known protein structures. Our prediction system produces distance maps to represent protein structures, which provides more information than contact maps, which are predicted by many proposals in the literature. Most commonly used amino acid physico-chemical properties are hydrophobicity, polarity and charge. In our method, we performed a feature selection on the 544 properties of the AAindex repository, resulting in 16 properties which were used to predictions. We tested our proposal on 74 mitochondrial matrix proteins with a maximum sequence identity of 30% obtained from the Protein Data Bank. We achieved a recall of 0.80 and a precision of 0.79 with an 8-angstrom cut-off and a minimum sequence separation of 7 amino acids. Finally, we compared our system with other relevant proposal on the same benchmark and we achieved a recall improvement of 50.82%. Therefore, for the studied proteins, our method provides a notable improvement in terms of recall.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {conference}

}

A. E. Marquez-Chamorro and F. Divina and J. S. Aguilar-Ruiz and J. Bacardit and G. Asencio-Cortes and C. E. Santiesteban-Toca

A NSGA-II Algorithm for the Residue-Residue Contact Prediction Conference

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, Springer Berlin Heidelberg, Berlin, Heidelberg, 2012, ISBN: 978-3-642-29066-4.

Abstract | BibTeX | Tags: bioinformatics

C. E. Santiesteban-Toca and G. Asencio-Cortes and A. E. Marquez-Chamorro and J. S. Aguilar-Ruiz

Short-Range Interactions and Decision Tree-Based Protein Contact Map Predictor Conference

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, Springer Berlin Heidelberg, Berlin, Heidelberg, 2012, ISBN: 978-3-642-29066-4.

Abstract | BibTeX | Tags: bioinformatics

D. Gutiérrez-Avilés and F. Martínez-Álvarez and C. Rubio-Escudero and J. C. Riquelme

Finding motifs in DNA sequences Workshop

Spanish Conference on Technologies and Fuzzy Logic (ESTYLF'12), 2012.

BibTeX | Tags: bioinformatics

2011

A. E. Marquez-Chamorro and F. Divina and J. S. Aguilar-Ruiz and G. Asencio-Cortes

A multi-objective genetic algorithm for the Protein Structure Prediction Conference

2011 11th International Conference on Intelligent Systems Design and Applications, 2011, ISSN: 2164-7151.

Links | BibTeX | Tags: bioinformatics

D. Gutiérrez-Avilés and C. Rubio-Escudero and J. C. Riquelme

Revisiting the yeast cell cycle problem with the improved TriGen algorithm Conference

2011 Third World Congress on Nature and Biologically Inspired Computing, 2011.

Links | BibTeX | Tags: bioinformatics, time series

D. Gutiérrez-Avilés and C. Rubio-Escudero and J. C. Riquelme

Unravelling the Yeast Cell Cycle Using the TriGen Algorithm Conference

Advances in Artificial Intelligence, 2011.

Links | BibTeX | Tags: bioinformatics, time series

J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz

Biclustering of Gene Expression Data by Correlation-Based Scatter Search Journal Article

In: BioData Mining, vol. 4, no. 3, 2011.

Abstract | Links | BibTeX | Tags: bioinformatics

@article{BIODM2011,

title = {Biclustering of Gene Expression Data by Correlation-Based Scatter Search},

author = {J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz},

url = {https://link.springer.com/article/10.1186/1756-0381-4-3},

doi = {10.1186/1756-0381-4-3},

year  = {2011},

date = {2011-01-01},

journal = {BioData Mining},

volume = {4},

number = {3},

abstract = {Background: The analysis of data generated by microarray technology is very useful to understand how the genetic information becomes functional gene products. Biclustering algorithms can determine a group of genes which are co-expressed under a set of experimental conditions. Recently, new biclustering methods based on metaheuristics have been proposed. Most of them use the Mean Squared Residue as merit function but interesting and relevant patterns from a biological point of view such as shifting and scaling patterns may not be detected using this measure. However, it is important to discover this type of patterns since commonly the genes can present a similar behavior although their expression levels vary in different ranges or magnitudes. Methods: Scatter Search is an evolutionary technique that is based on the evolution of a small set of solutions which are chosen according to quality and diversity criteria. This paper presents a Scatter Search with the aim of finding biclusters from gene expression data. In this algorithm the proposed fitness function is based on the linear correlation among genes to detect shifting and scaling patterns from genes and an improvement method is included in order to select just positively correlated genes. Results: The proposed algorithm has been tested with three real data sets such as Yeast Cell Cycle dataset, human B-cells lymphoma dataset and Yeast Stress dataset, finding a remarkable number of biclusters with shifting and scaling patterns. In addition, the performance of the proposed method and fitness function are compared to that of CC, OPSM, ISA, BiMax, xMotifs and Samba using Gene the Ontology Database.},

keywords = {bioinformatics},

pubstate = {published},

tppubtype = {article}

}

J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz

Inferring Genes Coexpression Networks with Biclustering Based on Scatter Search Conference

ISDA 11th International Conference on Intelligent Systems Design and Applications, 2011.

Links | BibTeX | Tags: bioinformatics

J. A. Nepomuceno and A. Troncoso and J. S. Aguilar-Ruiz

A Local Search in Scatter Search for Improving Biclusters Conference

NABIC 3th Congress on Natural and Biologically Inspired Computing, 2011.

Links | BibTeX | Tags: bioinformatics

G. Asencio-Cortes and J. S. Aguilar-Ruiz

Predicting protein distance maps according to physicochemical properties Conference

vol. 8, 2011, ISSN: 1613-4516.

Links | BibTeX | Tags: bioinformatics

A. E. Marquez-Chamorro and F. Divina and J. S. Aguilar-Ruiz and G. Asencio-Cortes

Residue-Residue Contact Prediction Based on Evolutionary Computation Conference

5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011), Springer Berlin Heidelberg, 2011, ISBN: 978-3-642-19914-1.

Abstract | BibTeX | Tags: bioinformatics

G. Asencio-Cortes and J. S. Aguilar-Ruiz and A. E. Marquez-Chamorro

Prediction of Protein Distance Maps by Assembling Fragments According to Physicochemical Similarities Conference

5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011), 2011, ISBN: 978-3-642-19914-1.

Abstract | BibTeX | Tags: bioinformatics

A. E. Marquez-Chamorro and F. Divina and J. S. Aguilar-Ruiz and G. Asencio-Cortes

An Evolutionary Approach for Protein Contact Map Prediction Conference

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, 2011, ISBN: 978-3-642-20389-3.

Abstract | BibTeX | Tags: bioinformatics

G. Asencio-Cortes and J. S. Aguilar-Ruiz and A. E. Marquez-Chamorro

A Nearest Neighbour-Based Approach for Viral Protein Structure Prediction Conference

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, 2011, ISBN: 978-3-642-20389-3.

Abstract | BibTeX | Tags: bioinformatics

C. E. Santiesteban-Toca and A. E. Marquez-Chamorro and G. Asencio-Cortes and J. S. Aguilar-Ruiz

A Decision Tree-Based Method for Protein Contact Map Prediction Conference

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, 2011, ISBN: 978-3-642-20389-3.

Abstract | BibTeX | Tags: bioinformatics

C. Rubio-Escudero and F. Martínez-Álvarez and M. Martínez-Ballesteros and J. C. Riquelme

On the use of algorithms to discover motifs in DNA sequences Conference

IEEE International Conference on Intelligent Systems Design and Applications (ISDA'11), 2011.

Links | BibTeX | Tags: bioinformatics

F. Gómez-Vela and F. Martínez-Álvarez and C. D. Barranco and N. Díaz-Díaz and D. S. Rodríguez-Baena and J. S. Aguilar-Ruiz

Pattern recognition in biological time series Conference

Conference of the Spanish Association for Artificial Intelligence (CAEPIA'11), Lecture Notes in Artificial Intelligence 2011.

Links | BibTeX | Tags: bioinformatics, time series

68 entries « ‹ 1 of 2 › »