A proposal for a drug information database and text templates for generating package inserts

Ryo Okuya, Masaomi Kimura, Michiko Ohkura, Fumito Tsuchiya

Research output: Contribution to journal › Article

4 Citations (Scopus)

Abstract

To prevent prescription errors caused by information systems, a database that stores complete and accurate drug information in a user-friendly format is needed. In previous studies, the primary method for populating such a database has been to extract drug information from package inserts by pattern matching or by more sophisticated methods such as text mining. However, it is difficult to obtain a complete database this way because there is no strict rule governing the expressions used to describe drug information in package inserts. The authors' strategy was to build the database first and then generate package inserts automatically by embedding data from the database in templates. Creating this database requires the support of pharmaceutical companies, which must input accurate data. The system is expected to work because these companies also benefit: for newly developed drugs, it reduces the effort of creating package inserts from scratch. This study designed the table schemata for the database and the text templates used to generate the package inserts. To handle the variety of drug-specific information in the package inserts, the drug-specific information in drug composition descriptions was replaced with labels, and the resulting descriptions were analyzed using cluster analysis. To improve how frequently repeated ingredient information and/or supplementary information is stored, the method was modified by introducing repeat tags in the templates to indicate repetition and by improving the insertion of data into the database. The validity of the method was confirmed by inputting the drug information described in existing package inserts and checking that the method could regenerate the descriptions in the original package inserts. In future research, the table schemata and text templates will be extended to regenerate other information in the package inserts.
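
The template idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the tag syntax or the schema, so everything here is an assumption made for illustration: the `{name}` placeholder form, the `<repeat:key>...</repeat>` repeat tag, the `render` function, and the sample record are all hypothetical and are not the authors' implementation.

```python
import re

# Hypothetical syntax (not from the paper):
#   {field}                      -> replaced by a single value from the record
#   <repeat:key>body</repeat>    -> body is rendered once per row in record[key]
TOKEN = re.compile(r"<repeat:(\w+)>(.*?)</repeat>|\{(\w+)\}", re.DOTALL)
PLACEHOLDER = re.compile(r"\{(\w+)\}")


def fill(text, values):
    """Replace each {field} in text with the matching value."""
    return PLACEHOLDER.sub(lambda m: str(values[m.group(1)]), text)


def render(template, record):
    """Generate a description from a template and one database record."""
    def substitute(match):
        if match.group(1):  # a repeat block: emit the body once per row
            rows = record[match.group(1)]
            return "".join(fill(match.group(2), row) for row in rows)
        return str(record[match.group(3)])  # a plain placeholder

    # re.sub makes a single pass, so inserted values are never re-scanned.
    return TOKEN.sub(substitute, template)


# Illustrative data only; the product and ingredients are invented.
template = (
    "Each tablet of {product} contains "
    "<repeat:ingredients>{amount} mg of {name}; </repeat>"
    "as active ingredients."
)
record = {
    "product": "Exampletab",
    "ingredients": [
        {"name": "ingredient A", "amount": 10},
        {"name": "ingredient B", "amount": 5},
    ],
}
original = (
    "Each tablet of Exampletab contains 10 mg of ingredient A; "
    "5 mg of ingredient B; as active ingredients."
)

# Validation in the spirit of the abstract: the generated text should
# reproduce the original description exactly.
print(render(template, record) == original)
```

The final comparison mirrors the validation step the abstract describes: data taken from an existing description is stored, and the template is judged by whether it regenerates that description.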

Original language: English
Pages (from-to): 161-169
Number of pages: 9
Journal: Drug, Healthcare and Patient Safety
Volume: 5
Issue number: 1
DOI: 10.2147/DHPS.S43303
Publication status: Published - 2013 Jul 26

Keywords

  • Cluster analysis
  • Drug database
  • Drug information
  • Medical safety
  • Package insert

ASJC Scopus subject areas

  • Health Policy
  • Pharmacology

Cite this

A proposal for a drug information database and text templates for generating package inserts. / Okuya, Ryo; Kimura, Masaomi; Ohkura, Michiko; Tsuchiya, Fumito.

In: Drug, Healthcare and Patient Safety, Vol. 5, No. 1, 26.07.2013, p. 161-169.

@article{3169085d47b64795bf2cfffa967f9393,
title = "A proposal for a drug information database and text templates for generating package inserts",
abstract = "To prevent prescription errors caused by information systems, a database to store complete and accurate drug information in a user-friendly format is needed. In previous studies, the primary method for obtaining data stored in a database is to extract drug information from package inserts by employing pattern matching or more sophisticated methods such as text mining. However, it is difficult to obtain a complete database because there is no strict rule concerning expressions used to describe drug information in package inserts. The authors' strategy was to first build a database and then automatically generate package inserts by embedding data in the database using templates. To create this database, the support of pharmaceutical companies to input accurate data is required. It is expected that this system will work, because these companies can earn merit for newly developed drugs to decrease the effort to create package inserts from scratch. This study designed the table schemata for the database and text templates to generate the package inserts. To handle the variety of drug-specific information in the package inserts, this information in drug composition descriptions was replaced with labels and the replacement descriptions utilizing cluster analysis were analyzed. To improve the method by which frequently repeated ingredient information and/or supplementary information are stored, the method was modified by introducing repeat tags in the templates to indicate repetition and improving the insertion of data into the database. The validity of this method was confirmed by inputting the drug information described in existing package inserts and checking that the method could regenerate the descriptions in the original package insert. In future research, the table schemata and text templates will be extended to regenerate other information in the package inserts.",
keywords = "Cluster analysis, Drug database, Drug information, Medical safety, Package insert",
author = "Ryo Okuya and Masaomi Kimura and Michiko Ohkura and Fumito Tsuchiya",
year = "2013",
month = "7",
day = "26",
doi = "10.2147/DHPS.S43303",
language = "English",
volume = "5",
pages = "161--169",
journal = "Drug, Healthcare and Patient Safety",
issn = "1179-1365",
publisher = "Dove Medical Press Limited",
number = "1",

}

TY  - JOUR
T1  - A proposal for a drug information database and text templates for generating package inserts
AU  - Okuya, Ryo
AU  - Kimura, Masaomi
AU  - Ohkura, Michiko
AU  - Tsuchiya, Fumito
PY  - 2013/7/26
Y1  - 2013/7/26
N2  - To prevent prescription errors caused by information systems, a database to store complete and accurate drug information in a user-friendly format is needed. In previous studies, the primary method for obtaining data stored in a database is to extract drug information from package inserts by employing pattern matching or more sophisticated methods such as text mining. However, it is difficult to obtain a complete database because there is no strict rule concerning expressions used to describe drug information in package inserts. The authors' strategy was to first build a database and then automatically generate package inserts by embedding data in the database using templates. To create this database, the support of pharmaceutical companies to input accurate data is required. It is expected that this system will work, because these companies can earn merit for newly developed drugs to decrease the effort to create package inserts from scratch. This study designed the table schemata for the database and text templates to generate the package inserts. To handle the variety of drug-specific information in the package inserts, this information in drug composition descriptions was replaced with labels and the replacement descriptions utilizing cluster analysis were analyzed. To improve the method by which frequently repeated ingredient information and/or supplementary information are stored, the method was modified by introducing repeat tags in the templates to indicate repetition and improving the insertion of data into the database. The validity of this method was confirmed by inputting the drug information described in existing package inserts and checking that the method could regenerate the descriptions in the original package insert. In future research, the table schemata and text templates will be extended to regenerate other information in the package inserts.
KW  - Cluster analysis
KW  - Drug database
KW  - Drug information
KW  - Medical safety
KW  - Package insert
UR  - http://www.scopus.com/inward/record.url?scp=84880795803&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84880795803&partnerID=8YFLogxK
U2  - 10.2147/DHPS.S43303
DO  - 10.2147/DHPS.S43303
M3  - Article
C2  - 23930079
AN  - SCOPUS:84880795803
VL  - 5
SP  - 161
EP  - 169
JO  - Drug, Healthcare and Patient Safety
JF  - Drug, Healthcare and Patient Safety
SN  - 1179-1365
IS  - 1
ER  -