Using IBM Content Manager for genomic data annotation and quality assurance tasks

ABSTRACT As the amount of heterogeneous genomic data and related annotations continues to grow, a flexible and easy-to-access data management solution is required to integrate such data and diverse annotation tasks. This preliminary report describes the benefits of using IBM DB2® Content Manager software by conducting task-oriented grape genome annotations, along with data quality-assurance checks throughout the annotation process. To demonstrate the usability of this application, we describe the implementation of two real-life content-based genome annotation case scenarios: 1) expressed sequence tags annotation; and 2) sequence annotation related to simple sequence repeat markers. The IBM DB2 Content Manager allows users to easily construct content-based genomic information applications as rapidly built and readily adapted customized content documents with attributes within an easy-to-use interface system. Users can simultaneously conduct the annotation quality checks while making annotations by utilizing a built-in standardized data quality-control assurance procedure referred to as annotation “routing.” The system provides search features or cross-links with different annotation contents or data formats. The data quality workflow and procedure within the system also resulted in accuracy and consistency in the data annotation and curation lifecycle.

KEYWORDS

SHARE & LIKE

COMMENTS

ABOUT THE AUTHOR

IBM journal of research and development

0 Following 2 Fans 0 Projects 70 Articles

SIMILAR ARTICLES

ABSTRACT The IBM Blue Gene®/Q supercomputer is designed for highly efficient computing for problems dominated by floating-point computation. Its tar

Read More

ABSTRACT Enterprise adoption of cloud computing often requires a significant transformation of existing information technology (IT) systems and proc

Read More

ABSTRACT The heart of a Blue Gene®/Q system is the Blue Gene/Q Compute (BQC) chip, which combines processors, memory, and communication functions on

Read More

ABSTRACT The heart of a Blue Gene®/Q system is the Blue Gene/Q Compute (BQC) chip, which combines processors, memory, and communication functions on

Read More

ABSTRACT The IBM Blue Gene®/Q supercomputer is designed for highly efficient computing for problems dominated by floating-point computation. Its tar

Read More

In this paper, we explain the techniques used in IBM Blue Gene®/Q Compute chips to achieve high energy efficiency. Architectural techniques include the

Read More

In this paper, we explain the techniques used in IBM Blue Gene®/Q Compute chips to achieve high energy efficiency. Architectural techniques include the

Read More

ABSTRACT In order to understand application-level power/performance tradeoffs on current computer systems, runtime monitoring capabilities are neede

Read More

ABSTRACT In order to understand application-level power/performance tradeoffs on current computer systems, runtime monitoring capabilities are neede

Read More

ABSTRACT The principal focus areas for system software on the IBM Blue Gene®/Q include ultrascalability and high reliability while delivering the fu

Read More