10.3791/66098-v 06:45 min

A Mouse Model of the Associating Liver Partition and Portal Vein Ligation for Staged Hepatectomy Procedure Aided by Microscopy

10.3791/63083-v 07:53 min

Interphase Fluorescence in situ Hybridization of Bone Marrow Smears of Multiple Myeloma

10.3791/69257-v 03:05 min

Influence of Emotional Factors on the Efficacy of Acupuncture Treatment for Overweight Complicated with Hyperlipidemia: A Retrospective Cohort Study

10.3791/67133-v 09:16 min

U-Shaped Horizontal Swimming Technique for Preparing High-Quality Sperm with Low DNA Fragmentation Index

10.3791/66564-v 10:21 min

Protein Target Prediction and Validation of Small Molecule Compound

10.3791/66336-v 03:47 min

Enhancement of Facial Rejuvenation Through a Combination of 1565 nm Non-Ablative Fractional Laser with 30% Supramolecular Salicylic Acid

10.3791/66274-v 04:33 min

Association Between Sleep Quality and Cognitive Symptoms in Patients with Major Depressive Disorder

10.3791/63772-v 05:57 min

Platelet-Rich Plasma Lysate for Treatment of Eye Surface Diseases

10.3791/63719-v 08:35 min

Improved Renal Denervation Mitigated Hypertension Induced by Angiotensin II Infusion

10.3791/63619-v 14:15 min

A Heterotopic Mouse Model for Studying Laryngeal Transplantation

10.3791/61547-v 06:35 min

Sodium Taurocholate Induced Severe Acute Pancreatitis in C57BL/6 Mice

10.3791/60733-v

Cooling or Warming the Esophagus to Reduce Esophageal Injury During Left Atrial Ablation in the Treatment of Atrial Fibrillation

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

作者： Dibakar Sigdel*1,2, Vincent Kyi*1,2, Aiden Zhang*1, Shaun P. Setty3, David A. Liem1,2,4, Yu Shi5, Xuan Wang5, Jiaming Shen5, Wei Wang1,6,7, JiaWei Han5, Peipei Ping1,2,4,6 1The NIH BD2K Center of Excellence in Biomedical Computing,University of California, Los Angeles, 2Department of Physiology,University of California, Los Angeles, 3Department of Pediatric and Adult Congenital Heart Surgery,Miller Children's and Women's Hospital and Long Beach Memorial Hospital, 4Department of Medicine/Cardiology,University of California, Los Angeles, 5NIH BD2K Program Centers of Excellence for Big Data Computing -- KnowEng Center, Department of Computer Science,University of Illinois at Urbana-Champaign (UIUC), 6Scalable Analytics Institute (ScAi),University of California, Los Angeles, 7Department of Computer Science,University of California, Los Angeles

简介：

Overview

This article presents a protocol for building a cloud-based phrase mining platform that facilitates the association of biomedical entities with specific diseases. The approach enhances efficiency and accessibility in biomedical research.

Key Study Components

Area of Science

Biomedical literature analysis
Text mining techniques
Entity-category association

Background

Manual evaluation of entity-category associations is time-consuming.
Phrase mining tools can improve research efficiency.
Cloud-based platforms enable broader access to text mining resources.
Protocols can guide new users in implementing these tools.

Purpose of Study

To automate the identification of phrase-category associations.
To provide a systematic approach for analyzing biomedical literature.
To enhance the usability of phrase mining tools for researchers.

Methods Used

Step-by-step protocol for creating a text-cube from biomedical publications.
Use of medical subject headings (MeSH) for defining categories.
Implementation of Python scripts for data processing and analysis.
Logging and debugging mechanisms to ensure process reliability.

Main Results

Successful creation of a text-cube for document categorization.
Automated mapping of entities to categories using MeSH descriptors.
Generation of metadata and statistics for various age groups.
Comparison of document counts across different subcategories.

Conclusions

The protocol significantly streamlines the process of entity-category association.
Cloud-based tools enhance accessibility for biomedical researchers.
Future applications may include broader analyses across various biomedical domains.

Frequently Asked Questions

What is the main advantage of the proposed protocol?

The protocol improves efficiency in evaluating entity-category associations compared to manual methods.

How can new users implement this protocol?

New users can follow the step-by-step instructions provided in the article and utilize the references.

What tools are required to use the phrase mining platform?

Users need access to a cloud environment and must ensure that the Elasticsearch server is running.

What types of entities can be analyzed using this method?

The method can analyze proteins, genomes, and chemicals associated with specific diseases.

How is the text-cube created?

The text-cube is created by running a specific Python script after preparing the necessary input files.

What is the significance of the metadata generated?

The metadata allows for context-aware analysis and comparison across different biomedical categories.

We present a protocol and associated programming code as well as metadata samples to support a cloud-based automated identification of phrases-category association representing unique concepts in user selected knowledge domain in biomedical literature. The phrase-category association quantified by this protocol can facilitate in depth analysis in the selected knowledge domain.

标签: Cloud-based Phrase Mining, Entity Category Association, Biomedical Publications, Proteins, Genomes, Chemicals, Diseases, Text Mining Tools, Medical Subject Headings, Text-cube, Mesh Descriptors, Algorithm, PMID Table, Data Directory, JSON Files,