Difference between revisions of "Template:Article of the week"

From LIMSWiki
Jump to navigationJump to search
(Updated article of the week text.)
(Updated article of the week text)
 
(285 intermediate revisions by the same user not shown)
Line 1: Line 1:
<div style="float: left; margin: 0.5em 0.9em 0.4em 0em;">[[File:Fig3 Panahiazar JofBiomedInformatics2017 72-8.jpg|240px]]</div>
<div style="float: left; margin: 0.5em 0.9em 0.4em 0em;">[[File:Fig1 Bispo-Silva Geosciences23 13-11.png|240px]]</div>
'''"[[Journal:Predicting biomedical metadata in CEDAR: A study of Gene Expression Omnibus (GEO)|Predicting biomedical metadata in CEDAR: A study of Gene Expression Omnibus (GEO)]]"'''
'''"[[Journal:Geochemical biodegraded oil classification using a machine learning approach|Geochemical biodegraded oil classification using a machine learning approach]]"'''


A crucial and limiting factor in data reuse is the lack of accurate, structured, and complete descriptions of data, known as metadata. Towards improving the quantity and quality of metadata, we propose a novel metadata prediction framework to learn associations from existing metadata that can be used to predict metadata values. We evaluate our framework in the context of experimental metadata from the Gene Expression Omnibus (GEO). We applied four rule mining algorithms to the most common structured metadata elements (sample type, molecular type, platform, label type and organism) from over 1.3 million GEO records. We examined the quality of well supported rules from each algorithm and visualized the dependencies among metadata elements. Finally, we evaluated the performance of the algorithms in terms of accuracy, precision, recall, and F-measure. We found that PART is the best algorithm outperforming Apriori, Predictive Apriori, and Decision Table.
[[Chromatography|Chromatographic]] oil analysis is an important step for the identification of biodegraded petroleum via peak visualization and interpretation of phenomena that explain the oil geochemistry. However, analyses of chromatogram components by geochemists are comparative, visual, and consequently slow. This article aims to improve the chromatogram analysis process performed during geochemical interpretation by proposing the use of [[convolutional neural network]]s (CNN), which are deep learning techniques widely used by big tech companies. Two hundred and twenty-one (221) chromatographic oil images from different worldwide basins (Brazil, USA, Portugal, Angola, and Venezuela) were used. The [[open-source software]] Orange Data Mining was used to process images by CNN. The CNN algorithm extracts, pixel by pixel, recurring features from the images through convolutional operations ... ('''[[Journal:Geochemical biodegraded oil classification using a machine learning approach|Full article...]]''')<br />
 
''Recently featured'':
All algorithms perform significantly better in predicting class values than the majority vote classifier. We found that the performance of the algorithms is related to the dimensionality of the GEO elements. ('''[[Journal:Predicting biomedical metadata in CEDAR: A study of Gene Expression Omnibus (GEO)|Full article...]]''')<br />
{{flowlist |
<br />
* [[Journal:Knowledge of internal quality control for laboratory tests among laboratory personnel working in a biochemistry department of a tertiary care center: A descriptive cross-sectional study|Knowledge of internal quality control for laboratory tests among laboratory personnel working in a biochemistry department of a tertiary care center: A descriptive cross-sectional study]]
''Recently featured'':  
* [[Journal:Sigma metrics as a valuable tool for effective analytical performance and quality control planning in the clinical laboratory: A retrospective study|Sigma metrics as a valuable tool for effective analytical performance and quality control planning in the clinical laboratory: A retrospective study]]
: ▪ [[Journal:Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces|Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces]]
* [[Journal:Why do we need food systems informatics? Introduction to this special collection on smart and connected regional food systems|Why do we need food systems informatics? Introduction to this special collection on smart and connected regional food systems]]
: ▪ [[Journal:Bioinformatics education in pathology training: Current scope and future direction|Bioinformatics education in pathology training: Current scope and future direction]]
}}
: ▪ [[Journal:FluxCTTX: A LIMS-based tool for management and analysis of cytotoxicity assays data|FluxCTTX: A LIMS-based tool for management and analysis of cytotoxicity assays data]]

Latest revision as of 13:37, 13 May 2024

Fig1 Bispo-Silva Geosciences23 13-11.png

"Geochemical biodegraded oil classification using a machine learning approach"

Chromatographic oil analysis is an important step for the identification of biodegraded petroleum via peak visualization and interpretation of phenomena that explain the oil geochemistry. However, analyses of chromatogram components by geochemists are comparative, visual, and consequently slow. This article aims to improve the chromatogram analysis process performed during geochemical interpretation by proposing the use of convolutional neural networks (CNN), which are deep learning techniques widely used by big tech companies. Two hundred and twenty-one (221) chromatographic oil images from different worldwide basins (Brazil, USA, Portugal, Angola, and Venezuela) were used. The open-source software Orange Data Mining was used to process images by CNN. The CNN algorithm extracts, pixel by pixel, recurring features from the images through convolutional operations ... (Full article...)
Recently featured: