US20090076847A1 - Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data


Info

Publication number
US20090076847A1
Authority
US
United States
Prior art keywords
drug
data
patient
drugs
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/271,224
Inventor
Victor Gogolak
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Druglogic Inc
Original Assignee
Victor Gogolak
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Victor Gogolak
Priority to US12/271,224
Publication of US20090076847A1
Priority to US15/059,997 (US20170061080A1)
Assigned to DRUGLOGIC, INC. Assignment of assignors interest (see document for details). Assignor: GOGOLAK, VICTOR
Current legal status: Abandoned

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0637Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/40Population genetics; Linkage disequilibrium
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/40ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/20ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99944Object-oriented database structure
    • Y10S707/99945Object-oriented database structure processing

Definitions

  • The present invention relates to a drug safety database. More particularly, the invention relates to a method and system for creating and utilizing a database that relates drugs, adverse events, patient characteristics, and in particular, genetic information.
  • Pharmacogenetics is the study of individual response to drugs as a function of genetic differences. These responses relate to how a drug functions in any given individual, how it is metabolized, its toxicity and dosage requirements. With the human genome project, pharmacogenetics has expanded into pharmacogenomics. Pharmacogenomics goes beyond pharmacogenetics, with the potential to find uses from drug discovery and development, target discovery and validation, and clinical trials; and to get that information into the doctor's office so that the right medicine is given to the right patient at the right time.
  • SNP single-nucleotide polymorphism
  • Pharmacogenomics is being applied to pharmacodynamics, how a drug affects a disease. Additionally, pharmacogenomics is being applied to pharmacokinetics, or how the body processes a drug. While the pathway for drug intervention is usually well known, there are two important mechanisms to consider in pharmacokinetics—the pathway that metabolizes the drug itself, and other pathways that drugs and their metabolites may inadvertently and adversely affect. It is in these two areas that drug safety comes into play. In the first, there are distinctions among human genotypes in the ability to metabolize drugs. If a drug is not metabolized as predicted in the clinical trials, it could potentially build up to toxic levels. Different segments of the population metabolize drugs differently, providing a variety of potential reactions to a drug which can impact the dosage, safety, and efficacy of that drug and its usefulness for an individual patient.
  • In the second, a drug and its metabolites may affect other pathways for varying genotypes (or phenotypes).
  • The present invention addresses the needs in the above area of drug safety by providing a system of relating drug safety data, adverse reactions, pharmacogenetics, pharmacogenomics and demographic data.
  • The system described herein is in the area of genomic drug safety, and accordingly provides a method to evaluate existing and/or potential drugs from a genomic point of view.
  • This method takes the drug dimension, with its different characteristics (e.g. chemical class) and adds information regarding the metabolic pathway of drugs (including current and historical drugs), thereby creating genetically relevant taxonomies of drugs.
  • The method also includes demographic information such as the phenotype and genotype of a particular patient involved with a particular reaction (case). These extensions allow the resulting application to correlate drugs with the metabolizing characteristics of specific patient genotypes. As such, the method shows how these drug/genotype interactions lead to increased or decreased chances for particular adverse reactions.
  • The method creates and utilizes a database which correlates drugs, adverse events and patient characteristics.
  • The method focuses primarily on genetic information, particularly SNPs and gene variants that relate to drugs or drug classes and adverse reactions. Whereas drugs generally metabolize in one or two pathways, adverse events can occur in any of the body systems or pathways, even those not originally believed to be impacted by the drug, leading to extreme complexity.
  • Data is collected, for example, from a multitude of sources, and is then analyzed and stored in a standardized data structure.
  • A method of developing “mappings” allows data to be consistently compared and analyzed even if the original sources use incompatible language.
  • The method takes existing drug data on efficacy and safety, such as is found in drug labels, monographs, and post-market information, and solicits new data, such as clinical trial data in the literature or data from patient and medical histories, and then associates those relationships with patient characteristics including genetic and environmental factors (e.g., diet, age, sex, race).
  • The resulting databases can be used in drug research: the analytic techniques provided herein can be used to sort among lead compounds to determine those with the lowest possible side effects based on population genetics. They can also be used in prescribing drugs, matching a patient not only to the most efficacious drug, but also to the most effective drug with the least side effects and the lowest required dosage based on the patient's specific genetic and environmental markers.
  • An additional benefit of the present invention is the capability to prepare a database of known relationships and apply them prior to or at the point-of-care (POC). This allows the physician to check a particular patient profile against the drug to be prescribed to assess the risk of an adverse drug reaction for that patient.
  • The method described herein provides risk analysis of the use of particular drugs based on chemical, proteomic, genomic, and demographic information for both diagnostic and therapeutic purposes.
  • The method uses an innovative complex of tools that associate historical drug safety data with both drug characteristics and patient genetic make-up.
  • One embodiment of the method includes summary data on populations that have had adverse drug reactions along with their genetic profile.
  • Another embodiment provides broad population genetics data for prescribing decisions made on drugs and therapeutics, based on the likelihood of adverse drug reactions.
  • A further embodiment compares individuals with the drug reaction profile of a given population. For example, if a patient profile matches the genetic profile of those prone to liver disorders, then the physicians/pharmacists will avoid prescribing or dispensing drugs with potential adverse liver effects to that patient.
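A minimal sketch of the comparison described in this embodiment, assuming that a patient's genetic profile and a population's risk profile can each be represented as a set of marker identifiers. The class and function names, the marker identifiers, and the `min_overlap` threshold are hypothetical illustrations and do not come from the source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RiskProfile:
    """Genetic markers shared by a population prone to one class of reaction."""
    reaction_class: str   # e.g. "liver disorders"
    markers: frozenset    # hypothetical marker identifiers


def flagged_reaction_classes(patient_markers, profiles, min_overlap=1.0):
    """Return reaction classes whose risk profile the patient matches.

    A profile matches when the fraction of its markers present in the
    patient is at least min_overlap (1.0 means every marker is required).
    """
    flagged = []
    for profile in profiles:
        if not profile.markers:
            continue
        overlap = len(profile.markers & patient_markers) / len(profile.markers)
        if overlap >= min_overlap:
            flagged.append(profile.reaction_class)
    return flagged


def drugs_to_avoid(patient_markers, profiles, drug_effects):
    """List drugs whose known adverse effects fall in a flagged class.

    drug_effects maps a drug name to the reaction classes it may cause.
    """
    flagged = set(flagged_reaction_classes(patient_markers, profiles))
    return sorted(d for d, effects in drug_effects.items() if flagged & set(effects))
```

With a profile for "liver disorders" that the patient fully matches, `drugs_to_avoid` would list every drug whose effects include that class, which mirrors the prescribing check in the example.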
  • FIG. 1 is a schematic block diagram illustrating one embodiment of a method of mapping various data sources.
  • FIG. 2 is a block diagram illustrating one embodiment of the steps utilized in performing the method of the present invention.
  • FIG. 3 is a block diagram illustrating one embodiment of the present invention integrating information on drugs, demographics and adverse reactions.
  • The present invention comprises a system and method for creating, storing and using patient-specific and population-based genomic drug safety data, including: one or more integrated databases; a selector for selecting at least one drug for analysis (based on the generic name, brand name or therapeutic category); a profiler for displaying statistics that describe one or more behaviors of the drug in multiple dimensions; a series of at least two filters and the means to control the filters individually and in combination; at least one data mining engine, where the data mining engine is a correlator, a proportional analysis engine, and a comparator; and a graphical user interface for displaying the results of the analysis.
  • Dimensions such as age, sex, weight, diet, dates, reactions, doses, outcomes, illnesses, report source, and concomitant drugs can be analyzed in combinations of two dimensions, in combinations of three dimensions, in combinations of four dimensions, and other combinations.
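As an illustration of analyzing dimensions in combinations of two, three, or more, the sketch below counts cases for every combination of a chosen size. It assumes each case is a plain dictionary keyed by dimension name; the function name and the field names used in the usage note are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cross_tabulate(cases, dimensions, order):
    """Count cases for every combination of `order` dimensions.

    Returns a dict mapping each tuple of dimension names to a Counter
    keyed by the corresponding tuple of values.
    """
    tables = {}
    for dims in combinations(dimensions, order):
        tables[dims] = Counter(tuple(case.get(d) for d in dims) for case in cases)
    return tables
```

Calling `cross_tabulate(cases, ["sex", "age_band", "reaction"], 2)` would produce three two-way tables, one per pair of dimensions.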
  • the method described herein also permits analysis of the association between outcomes and other dimensions.
  • FIG. 1 shows the relationship among the various data sources and how data is drawn into a consistent data structure, one that is referenced to standard dictionaries, regardless of which dictionaries were used in the source.
  • Source data is gathered to create a composite genomic relational drug safety database 10.
  • This source data is collected from databases covering three general areas: adverse event database 20, drug information database 30, and patient or genomic database 40.
  • Adverse event source database 20 includes data on drugs, reactions, demographics and outcomes.
  • Sources of this data are databases such as, but not limited to, the FDA's Spontaneous Reporting System (“SRS”), the FDA's Adverse Event Reporting System (“AERS”), the World Health Organization adverse event database, or other country-specific regulatory or epidemiological databases, such as the UK Advert system and the General Practice Research Database (GPRD), as well as other databases and sources, which may be domestic, foreign and/or international in scope.
  • Unexpected or previously unrecognized adverse drug effects can take the form of single reactions, groups of reactions, or increases in a labeled reaction. These adverse reactions may be due to the exposure of a greater number of people to the drug, or the reactions of a particular demographic group. Such information is continually updated as new cases are released and can be added to the information for use in the present method.
  • Adverse event data 20 may also be provided, for example, from pharmaceutical corporations, hospitals, physicians, health insurers, and state, federal and international agencies.
  • A primary source of pharmaceutical industry data is the individual adverse event databases of the various pharmaceutical corporation safety departments. In each case, source data may be focused on clinical trials, post-market surveillance, research databases, or the like. Unedited data in each source database is referred to as “verbatim.” Clinical trial data available in literature includes safety data. Other information is collected and can be accessed from the World Health Organization (WHO), PEM, the General Practice Research Database (GPRD), and so forth.
  • Drug information source database 30 may include the type or class of drug, metabolic pathways, and drug pharmacokinetics and pharmacodynamics. Such drug source database 30 provides drug taxonomies that offer characteristics of drugs including metabolites, clearance rates, peak serum levels, pharmacodynamics, therapeutic category, chemical structure, or a way to group drugs and explore the relationship to both reactions and genotypes.
  • Patient source database 40 may include genetic data, demographic, phenotypic, and drug and patient adverse drug reaction (“ADR”) history.
  • This source data may be acquired by accessing, soliciting, or assembling data on patients experiencing adverse drug reactions, and comparing the data against data from a control set of a broad population who are not taking the drug/drugs in question in order to see the relationship between certain reactions and genotype/phenotype.
  • For example, light-skinned people (a kind of phenotype with a genotypic background) are prone to sunburn and may additionally be particularly sensitive to certain drugs.
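The source does not say which statistic is used to compare patients who experienced adverse drug reactions against a control set. As one hedged illustration, the sketch below computes an odds ratio from a two-by-two table, a standard epidemiological measure chosen here as an assumption. The function name and the zero-cell correction are likewise illustrative choices.

```python
def odds_ratio(cases_with, cases_without, controls_with, controls_without):
    """Odds ratio for carrying a genotype among ADR cases versus controls.

    Adds 0.5 to every cell (Haldane-Anscombe correction) when any cell
    is zero, so that the ratio stays defined.
    """
    cells = [cases_with, cases_without, controls_with, controls_without]
    if any(c == 0 for c in cells):
        cells = [c + 0.5 for c in cells]
    a, b, c, d = cells
    return (a * d) / (b * c)
```

A value well above 1 would suggest that the genotype is more common among patients with the reaction than in the control population; it does not by itself establish causation.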
  • Population genetics information includes a wide variety of sources including DNA samples solicited directly from people who have had documented adverse reactions to certain drugs.
  • Reference data 50, from various accepted canonical references including dictionaries, thesauruses, taxonomies, and hierarchies, is gathered and used in the genomic drug safety database 10.
  • Examples of such references include the Medical Dictionary for Regulatory Activities (“MedDRA™”), National Drug Code Directory (“NDCD”), and the FDA Orange Book.
  • In step 100, data is assembled from sources that relate drugs to reactions and characteristics, and patients to medical history and genotype, as described above with respect to FIG. 1.
  • In step 200 of the method, data is implanted into a data structure: information that has been broken down to its most fundamental level is mapped to reference dictionaries, and source data is parsed into a relational database structure.
  • Transformation from raw source data to a relational structure preferably includes parsing each data source into an image with fields tailored to its corresponding source. Subsequently, the images are consolidated into a single safety table space.
  • The method provides the ability to add many “dimensions” such as age, sex, dates, reactions, doses, outcomes, report source, and concomitant drugs. These dimensions can be entered as structured, narrative, numerical, or categorical variables. Hierarchies in all dimensions (in both preferred and custom paths) are defined as required by the particular end user.
  • The safety table space provides a common set of fields for the parsed source data.
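The two-stage transformation (per-source images, then one safety table space with common fields) can be sketched as follows. This is an assumption-laden illustration: the common field names, the source names `source_a` and `source_b`, and their field mappings are all hypothetical and are not taken from any real database layout.

```python
# Hypothetical common field set for the consolidated safety table space.
COMMON_FIELDS = ("source", "case_id", "drug", "reaction", "outcome")

# Hypothetical per-source mappings: image field name -> common field name.
FIELD_MAPS = {
    "source_a": {"ISR": "case_id", "DRUGNAME": "drug", "PT": "reaction", "OUTC": "outcome"},
    "source_b": {"report_no": "case_id", "medicine": "drug", "event": "reaction"},
}


def to_safety_row(source, image_row):
    """Translate one row of a source-specific image into the common fields."""
    row = dict.fromkeys(COMMON_FIELDS)  # fields a source lacks stay None
    row["source"] = source
    for src_field, common_field in FIELD_MAPS[source].items():
        row[common_field] = image_row.get(src_field)
    return row


def consolidate(images):
    """Merge all images (source name -> list of rows) into one table."""
    return [to_safety_row(source, r) for source, rows in images.items() for r in rows]
```

Keeping the mapping in a table rather than in code means a new source can be added by describing its fields, without touching the consolidation logic.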
  • An information management system integrates data from a plurality of interconnected local databases, providing users with access to a virtual database that ties drug, reaction and patient parameters together.
  • A record repository is provided for accepting data from a medical service provider and linking it to a patient database, allowing documents for a patient to be retrieved through demographic data.
  • An additional method provides an information extraction system wherein users ask questions about documents in a database and the system responds with information extracted from the documents.
  • Copious amounts of data regarding adverse events associated with a particular product are received and analyzed in view of known adverse events associated with the product, providing new uses for the product as well as a catalog of adverse event information for a large number of population sub-groups.
  • FIG. 3 shows the relationship of three information categories and their various parameters (characteristics).
  • The category entitled “Reactions” or “Adverse Reactions” (20) has been structured by various means: COSTART, WHOART, and most recently MedDRA™. “Reactions” are additionally characterized by their seriousness (e.g. the FDA's “Designated Medical Events,” or DME).
  • The method provides a means to integrate these various dictionaries.
  • The second entity is the product or “drug”. It is classified by therapeutic class, chemical class, metabolic pathway, metabolites or structure. The method provides a system to integrate these parameters.
  • The last entity, named “demographics,” has typically rested on either broad parameters such as age and sex, or on the more specific characteristics of weight, diet, climate, phenotype, proteome, genotype and environment. Recent work in genetics has identified a critical SNP or gene, a defining characteristic for an individual: the genotype.
  • Data cleanup may be performed independently of parsing source data into a safety database. This allows cleanup to be continual, ongoing, and iterative, either before or after one or more source databases are processed into the pharmacovigilance database, which determines if the events are or are not due to the drug.
  • Source database cleanup is an incremental process, proceeding from automated cleanup of certain errors, through human-assisted cleanup of ambiguous entries, to human correction of identified gross errors.
  • Specific cleanup tasks include noise reduction (e.g., suppression of non-alpha characters, noise words, and combination words), adjustment for misspellings, adjustment for dislocations and transpositions, and resolution of possible redundant entries.
  • Reactions, drugs, and counts of the occurrence (by case and absolute) of each are extracted from the parsed source data.
  • The counts are then grouped. In this embodiment, grouping is by order of magnitude of the count.
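A small sketch of the counting and grouping step, assuming parsed source data is available as `(case_id, term)` pairs. The function names are hypothetical. "Absolute" is read here as the number of occurrences of a term, and "by case" as the number of distinct cases mentioning it.

```python
from collections import Counter, defaultdict


def count_terms(records):
    """Return (absolute, by_case) counts for (case_id, term) pairs."""
    records = list(records)
    absolute = Counter(term for _, term in records)
    # A set drops repeated mentions of the same term within one case.
    by_case = Counter(term for _, term in set(records))
    return absolute, by_case


def group_by_magnitude(counts):
    """Group terms by order of magnitude of their count.

    Key 0 holds counts 1-9, key 1 holds 10-99, and so on. Counts are
    assumed to be positive integers.
    """
    groups = defaultdict(list)
    for term, n in counts.items():
        groups[len(str(n)) - 1].append(term)
    return dict(groups)
```

Grouping by magnitude lets cleanup effort start with the highest-volume terms, where a single correction affects the most records.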
  • The bulk of data cleanup is performed on a computing platform separate from database storage.
  • A spreadsheet application such as Microsoft® Excel® is used to track cleanup operations.
  • The first column in such a spreadsheet may contain the verbatim term; the second column may contain a noise-suppressed verbatim term; the fourth column may contain the spell-checked verbatim term, and so on.
  • Other data cleanup applications such as Metaphone (discussed below), also reside on this separate computing platform. However, cleanup applications need not reside on a separate computing platform, or may be accessible via the Internet or other computer network.
  • Noise reduction involves suppression of words and characters that are typically unnecessary in determining the correct name for drug or reaction verbatim.
  • Noise words and characters include, but are not limited to, non-alpha characters (such as numbers, diacriticals, brackets, and control characters), words (e.g., “mg” or “tablet”), and combination words (e.g., “20 mg” with no space). For example, both “Tylenol (500 mg)” and “Tylenol Capsules” would be reduced to “Tylenol.”
  • A list of noise words and noise punctuation is stored in database tables associated with lexical processing. Non-alpha characters, such as control characters, are also suppressed at this stage.
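The noise-reduction step can be sketched with a regular expression and a noise-word list. This is a minimal illustration rather than the method's actual implementation: the `NOISE_WORDS` set and the function name are assumptions, and in the described system the list would live in database tables rather than in code.

```python
import re

# Hypothetical noise-word list; the source stores such lists in database tables.
NOISE_WORDS = {"mg", "tablet", "tablets", "capsule", "capsules"}


def suppress_noise(verbatim):
    """Strip non-alpha characters and noise words from a verbatim term."""
    # Digits, brackets, punctuation and control characters become spaces,
    # which also splits combination words such as "20mg" into "mg".
    # Note: this ASCII-only pattern drops accented letters entirely.
    text = re.sub(r"[^A-Za-z]+", " ", verbatim)
    words = [w for w in text.split() if w.lower() not in NOISE_WORDS]
    return " ".join(words)
```

Applied to the examples in the text, both "Tylenol (500 mg)" and "Tylenol Capsules" reduce to "Tylenol".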
  • Misspellings are detected and corrected using known tools such as spell checkers, sound-alike suggestion programs, a verbatim replacement table, and human inspection.
  • A preferred spell checker operates on noise-suppressed verbatim terms, making a series of spelling variations on terms not found in the reference sources. These variations are used as the basis for searching reference sources and suggesting candidate canonical terms.
  • Reference sources include standard and special-purpose dictionaries. The variations introduced include:
  • Metaphone is a published phonetic code algorithm similar to Soundex, which is a sound based indexing system. Every word has a four-letter Metaphone value that can be calculated.
  • The Metaphone suggester calculates the Metaphone value for each entry in the reference sources and for each unresolved verbatim term. Those reference source terms having a Metaphone value matching that of an unresolved verbatim term will be offered as a suggestion to a database developer for resolution.
  • For example, the Metaphone value for both “prosac” and “prozack” is “PRSK”; the Metaphone value for both “Claritin” and “Klariton” is “KLRT.” Where no candidates satisfy the developer, an option is provided for accepting a surrogate term from the developer.
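The suggester's structure (index reference terms by phonetic value, then look up each unresolved term) can be sketched as below. Note the loud assumption: `crude_phonetic_key` is a deliberately simplified stand-in written for this illustration and is not the published Metaphone algorithm, although it yields the same four-letter values for the examples in the text. A production version would substitute a real Metaphone implementation through the `key` parameter.

```python
from collections import defaultdict

# Hypothetical sound-alike substitutions for the simplified key.
_SOUND_ALIKE = str.maketrans({"C": "K", "Q": "K", "Z": "S"})


def crude_phonetic_key(word, length=4):
    """Simplified stand-in for a phonetic code; NOT real Metaphone."""
    letters = [c for c in word.upper() if c.isalpha()]
    key = []
    for i, c in enumerate(letters):
        c = c.translate(_SOUND_ALIKE)
        if i > 0 and c in "AEIOUY":
            continue              # drop vowels after the first letter
        if key and key[-1] == c:
            continue              # collapse repeated sounds
        key.append(c)
    return "".join(key)[:length]


def suggest(unresolved, reference_terms, key=crude_phonetic_key):
    """Offer reference terms whose phonetic value matches each verbatim."""
    index = defaultdict(list)
    for term in reference_terms:
        index[key(term)].append(term)
    return {v: index.get(key(v), []) for v in unresolved}
```

An empty suggestion list corresponds to the case where the developer must supply a surrogate term.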
  • Various embodiments of the method described herein include steps for capturing and using domain specific lexical knowledge not easily applied through noise reduction or spell checking. At the basic level, this amounts to use of a replacement table, containing mappings from known errors to corrected canonical terms. On a more sophisticated level, as domain-specific knowledge is accumulated, autocoders are employed to capture human decision-making experience regarding cleanup.
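At the basic level described here, a replacement table is a lookup from known errors to corrected canonical terms. The sketch below assumes case-insensitive matching and adds a `learn` method as a very small stand-in for recording human decisions. The class name, method names, and sample entries are hypothetical, and a real autocoder would be considerably richer.

```python
class ReplacementTable:
    """Maps known erroneous terms to corrected canonical terms."""

    def __init__(self, known=None):
        self._map = {k.strip().lower(): v for k, v in (known or {}).items()}

    def resolve(self, term):
        """Return the canonical term, or None if the error is not known."""
        return self._map.get(term.strip().lower())

    def learn(self, term, canonical):
        """Record a human cleanup decision so it is applied automatically next time."""
        self._map[term.strip().lower()] = canonical
```

Each decision a reviewer makes is stored once and then reused, which is how domain-specific knowledge accumulates over successive cleanup passes.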
  • Human interaction is particularly useful in identification and correction of dislocation errors, i.e., where a term valid in one field (e.g., headache/reaction) appears in a field where it is not valid (e.g., headache/drug). Dislocation errors are identified in preferred embodiments where a term does not fit the type of the field it is found in, but nonetheless exists in reference sources outside the scope of the particular field.
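The identification rule for dislocation errors translates directly into a check against per-field reference sets. The sketch assumes records are dictionaries of field name to term and that `reference` maps each field to a set of lowercase valid terms; these structures and the function name are hypothetical.

```python
def find_dislocations(records, reference):
    """Flag terms that are invalid in their field but valid in another.

    Returns a list of (field, term, fields_where_valid) tuples for
    presentation to a human reviewer.
    """
    issues = []
    for record in records:
        for field, term in record.items():
            if field not in reference:
                continue                      # field has no reference source
            normalized = term.lower()
            if normalized in reference[field]:
                continue                      # term fits its field
            elsewhere = sorted(
                other for other, terms in reference.items()
                if other != field and normalized in terms
            )
            if elsewhere:
                issues.append((field, term, elsewhere))
    return issues
```

A term found in no reference source at all is not reported here, since that is a spelling or mapping problem rather than a dislocation.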
  • Redundant entries are identified and removed with operator assistance.
  • A “case” may include all data regarding the adverse events experienced by one person taking a drug.
  • A sequence of events regarding a single individual taking a drug should not be recorded as separate cases (potentially duplicating the adverse events associated with the case). This is important for correct statistical views of the data.
  • The method provides tools to operators to allow identification and consolidation of redundant cases. In preferred embodiments of the present invention, multiple cases involving the same person over a contiguous period are presented to an operator for determination as to whether or not such entries actually represent one case with multiple (or possibly single-occurrence, multiple-reported) events.
  • For example, if a case concerning an “eye pain” reaction is amended fifteen times, only one instance of eye pain should be aggregated for this individual case.
  • Preferred embodiments of the method match successor reports with their predecessors using data inherent in the records, and compare other information in the records to gauge the quality of the match. For example, two cases may match on the “case identification” field, or a “drug manufacturer identification” field, or a “report date.” Those cases known to be redundant, and those cases showing a link between records, are presented to researchers for resolution. In alternate embodiments, resolution between likely redundant cases is accomplished via an expert system.
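One way to surface likely redundant cases for review is to score each pair of reports by how many identifying fields agree. This is a sketch under stated assumptions: the field names in `MATCH_FIELDS`, the equal weighting of fields, and the `min_score` threshold are all hypothetical, and the pairwise comparison is quadratic, so it suits only small batches.

```python
from itertools import combinations

# Hypothetical names for the matching fields mentioned in the text.
MATCH_FIELDS = ("case_id", "manufacturer_id", "report_date")


def match_score(a, b):
    """Count matching fields; a missing value never counts as a match."""
    return sum(
        1 for f in MATCH_FIELDS
        if a.get(f) is not None and a.get(f) == b.get(f)
    )


def candidate_duplicates(reports, min_score=1):
    """Return (score, i, j) for report pairs worth showing to a reviewer,
    strongest matches first."""
    pairs = []
    for (i, a), (j, b) in combinations(enumerate(reports), 2):
        score = match_score(a, b)
        if score >= min_score:
            pairs.append((score, i, j))
    return sorted(pairs, key=lambda p: (-p[0], p[1], p[2]))
```

The score only gauges match quality; the decision to consolidate stays with the researcher or, in alternate embodiments, an expert system.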
  • Verbatim terms, such as drug and reaction terms, that have been parsed into a safety database and cleaned are then mapped to “tokens” from the reference data sources.
  • The word “token” refers to the specific term(s), from one or more of the reference sources, that is associated with one or more of the verbatim terms in a manner that allows a search for the token to return results containing the verbatim term(s) linked to the token.
  • Where an exact match exists between a verbatim term and a reference term, the verbatim term is mapped to the reference term as token.
  • Where no exact match is found between verbatim (cleaned or otherwise) and reference data terms, the present invention presents a series of steps for resolving such unmatched terms.
  • Valid variations in terminology may also be resolved through mapping to reference data tokens.
  • “PROZAC” and other trade names for fluoxetine are preferably mapped to the generic “fluoxetine.”
  • Luliberin, gonadotropin releasing hormone, GnRH, gonadotropin releasing factor, luteinizing hormone releasing hormone, LHRH, and LH-FSH RH are equivalents and may be considered as such for analyzing adverse effects.
  • Different chemical derivatives, such as esters, salts, or acidic or basic forms of the same drug, may be grouped together, where a reference data term exists, under the same token in order to analyze adverse drug events.
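The defining property of a token, that searching for it returns records containing any verbatim term linked to it, can be illustrated with a two-way index. The class and method names are hypothetical, and matching is assumed to be case-insensitive.

```python
from collections import defaultdict


class TokenIndex:
    """Links verbatim terms to reference tokens in both directions."""

    def __init__(self):
        self._token_of = {}                   # verbatim -> token
        self._verbatims = defaultdict(set)    # token -> verbatims

    def map(self, verbatim, token):
        """Link one verbatim term (trade name, salt form, synonym) to a token."""
        key = verbatim.lower()
        self._token_of[key] = token
        self._verbatims[token].add(key)

    def token_for(self, verbatim):
        """Return the token for a verbatim term, or None if unmapped."""
        return self._token_of.get(verbatim.lower())

    def search(self, token, records, field):
        """Return records whose field holds any verbatim linked to the token."""
        terms = self._verbatims.get(token, set())
        return [r for r in records if r[field].lower() in terms]
```

Mapping both "PROZAC" and "fluoxetine hydrochloride" to the token "fluoxetine" means a single search on the token retrieves cases reported under either name.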
  • Source data verbatim terms may be nominated as token candidates; frequency of occurrence and absolute count are typical bases for nominating a term as a token candidate.
  • Verbatim drug, patient and reaction terms may be grouped by order of magnitude of absolute count.
  • Token candidates are chosen from accepted reference sources such as MedDRA™, Coding Symbols for a Thesaurus of Adverse Reaction Terms (“COSTART”), GPRD, and World Health Organization Adverse Drug Reaction Terminology (“WHOART”).
  • Token candidates are chosen from corresponding canonical sources such as the National Drug Code Directory (“NDCD”), WHO Drug Dictionaries for drugs, and the FDA's Orange Book.
  • this process may be used for multiple database dimensions in addition to drug and reaction, e.g., outcomes, where the definition of “serious” outcomes can differ over time and between reference sources.
  • This mapping enables those searches of the database focused on tokenized fields, e.g., drug, patient, and reaction fields, to be executed with greater confidence.
  • variability in source event data entry typically a difficult-to-control aspect of data collection on a large scale, is mitigated as a source of error.
  • Corrected verbatim is mapped to reference canonical terms and structures. As noted earlier, where an exact match exists between a verbatim term (source or cleaned) and a reference term, the verbatim term is mapped to the reference term as token. Where no exact match is found between verbatim (cleaned or otherwise) and reference data terms, the method presents a series of steps for resolving such unmatched terms. In those instances where a user is presented with a number of assigned unresolved entries, the method presents the user with suggestions identified by lexical processing (e.g., Metaphone, fixed list) for each unresolved verbatim term. The user may then select from this list or enter a surrogate term.
  • Cleaned source data reaction terms may be mapped to standardized hierarchies such as WHOART, COSTART, and MedDRA. Specifically, cleaned source data reaction terms are mapped to multiple levels (and possibly multiple entries within a level) of the hierarchy. In preferred embodiments, mapping of cleaned verbatim reaction terms proceeds in a fashion similar to mapping of drug terms. While the preferred embodiments perform mapping on cleaned source data, it should be understood that mapping may be performed on uncleaned, or even unparsed, source data.
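The mapping flow described above (exact match first, then suggestions for unresolved terms) can be sketched as follows. This is a minimal illustration, not the patented implementation: the `REFERENCE_TERMS` vocabulary, the function names, and the 0.8 similarity cutoff are assumptions, and Python's `difflib` string similarity is used as a stand-in for the Metaphone-style lexical processing named in the text.

```python
import difflib

# Hypothetical reference vocabulary: token -> known synonyms (illustrative only).
REFERENCE_TERMS = {
    "fluoxetine": ["prozac", "fluoxetine hydrochloride"],
    "headache": ["cephalalgia"],
}

def build_index(reference_terms):
    """Map every lower-cased reference term or synonym to its token."""
    index = {}
    for token, synonyms in reference_terms.items():
        index[token.lower()] = token
        for synonym in synonyms:
            index[synonym.lower()] = token
    return index

def map_verbatim(verbatim, index, max_suggestions=3):
    """Return (token, suggestions); token is None when no exact match exists."""
    key = verbatim.strip().lower()
    if key in index:
        return index[key], []
    # No exact match: offer tokens whose terms are spelled similarly,
    # so that a user can pick one or enter a surrogate term instead.
    close = difflib.get_close_matches(key, list(index), n=max_suggestions, cutoff=0.8)
    suggestions = []
    for term in close:
        token = index[term]
        if token not in suggestions:
            suggestions.append(token)
    return None, suggestions

INDEX = build_index(REFERENCE_TERMS)
```

A trade name such as "PROZAC" resolves directly to its generic token, while a misspelling such as "flouxetine" is left unresolved and returned with candidate tokens for review.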
  • Transparency in the process of moving from source data verbatim terms to a cleaned safety database with verbatim terms mapped to tokens is important to both database developers/operators and to end users.
  • the present method captures the way source data terms have been cleaned and mapped as the “pedigree” of each term.
  • the pedigree of a term is the link between the mapped term and the decisions made during data cleanup. End users typically wish to verify the pedigree of the data they use.
  • retained data includes one or more of the following as appropriate: verbatim term, token mapped to, source of the verbatim term, number of occurrences of the verbatim term, number of cases in which the verbatim term appears, which type of cleanup (if any) was performed, a cross-reference to where the token is defined, and dates of the earliest and latest reported occurrence.
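The retained pedigree fields listed above could be held in a record along the following lines. This is a sketch only: the class name, field names, and the `record_occurrence` helper are hypothetical, chosen to mirror the list of retained data.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

@dataclass
class TermPedigree:
    """One pedigree record per verbatim term (field names are assumptions)."""
    verbatim_term: str            # term as it appeared in the source data
    token: str                    # reference token the term was mapped to
    source: str                   # source of the verbatim term
    occurrence_count: int         # number of occurrences of the verbatim term
    case_count: int               # number of cases in which the term appears
    cleanup_type: Optional[str]   # type of cleanup performed, if any
    token_reference: str          # cross-reference to where the token is defined
    earliest_reported: date       # earliest reported occurrence
    latest_reported: date         # latest reported occurrence

    def record_occurrence(self, reported: date, new_case: bool) -> None:
        """Update counts and the date range when the term is seen again."""
        self.occurrence_count += 1
        if new_case:
            self.case_count += 1
        self.earliest_reported = min(self.earliest_reported, reported)
        self.latest_reported = max(self.latest_reported, reported)
```

Keeping the cleanup type and token cross-reference on the same record is what lets an end user trace a mapped term back to the decisions made during data cleanup.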
  • the method may be implemented on a single computer or across a network of computers, e.g., a local area network or on or across the Internet.
  • Preferred embodiments include implementations on computer-readable media storing a computer program product performing one or more of the steps described herein.
  • Such a computer program product contains modules implementing the steps as inter-related functions as described herein.
  • the databases, data management software and analytical software may reside in any combination of one or more local workstations and one or more network servers.
  • the database used in the present method sets up different information sources in a composite relational data structure, as illustrated in FIG. 3 .
  • This composition correlates patient information 40 (demographics, genetic make-up, environment), drug information 30 (class, therapeutic use, structure), and events of adverse reactions 20 (at different levels of detail).
  • the information in these cases includes both the verbatim data from the original sources as well as a standard reference dictionary term for each element of data. The latter is critical to ensure the ability to compare cases across drugs, patients and events.
  • step 300 of the method described herein, statistical associations are analyzed.
  • the present method looks at cases and determines two outcomes:
  • the method provides a way to apply these techniques, in a novel way, to categorical variables, that is, variables that take non-numeric values. Categorical values such as reactions (e.g., headache, rash), genotype, and drug chemical class are the key parameters.
  • the database provides a classification taxonomy structure for drug and genomic data. As taxonomies develop over time (typically based on new ways of grouping drugs or new ways of grouping SNPs and gene variants), the method provides parameters and hierarchies to enable richer correlations. As drugs or genes are grouped, they provide the basis for “coherent processing” in signal detection, i.e., they have the ability to group information to reinforce the “signal” that a particular drug or genotype is, in fact, a possible cause of a reaction.
  • Similar structures are used for reactions and outcomes (e.g., hospitalization or death) as well as a drug classification template that allows mapping between drug and pathway, proteome, drug classification and others, in search of potential associations at the drug level; and a SNP and gene classification template that allows a variety of high/medium/low levels of detail (e.g., from a broad SNP region for certain reactions to specific SNPs related to Stevens-Johnson Syndrome).
  • the present method analyzes the multivariate relationships of drug safety, therefore making the database more usable to researchers and clinicians.
  • Many aspects of the use of drug, reaction, and genetic/environmental information depend on having the ability to consolidate data across many different databases, with specific applications of standard categories and standard dictionaries allowing the consolidated data to provide meaning.
  • These applications include:
  • a database on demographics, drugs, reactions, and outcomes linked to standard dictionaries;
  • a selector/profiler/filter that allows a researcher to form hypotheses (e.g., about groups of reactions or symptoms) and to filter out confounding elements (e.g., known pathway effects) in order to enhance the ability to uncover unwanted effects by analyzing chemically similar drugs or genetically related pathways;
  • a set of analytical engines that each analyze a different aspect of the database, which is organized into historical or current “cases” (i.e., a patient with a certain drug or drugs and with certain reactions and outcomes), including:
  • proportional engines (e.g., proportional reporting ratio (PRR) and odds ratio (OR))
  • correlators that look for associations
  • differing engines that compare different sets of data (e.g., one population or another) in search of variations
  • Neural Network and other learning machine paradigms that apply heuristics to large databases in order to model and classify, e.g. by comparing weights of associations
  • a set of data display, viewing, and visualization tools that helps the researcher use his/her insight and pattern recognition capabilities to see, for example, genetic patterns or patterns of pathways that are involved for certain genotypes.
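The proportional engines named in the list above can be illustrated with the measures as they are commonly defined in pharmacovigilance. The text does not give formulas, so the 2x2 layout and function names below are assumptions: `a` is the number of cases with the drug of interest and the reaction of interest, `b` the drug with other reactions, `c` other drugs with the reaction, and `d` other drugs with other reactions.

```python
def prr(a, b, c, d):
    """Proportional reporting ratio: reaction share for the drug of interest
    divided by the reaction share for all other drugs."""
    if a + b == 0 or c + d == 0 or c == 0:
        raise ValueError("PRR is undefined for this table")
    return (a / (a + b)) / (c / (c + d))

def odds_ratio(a, b, c, d):
    """Odds ratio: odds of the reaction with the drug of interest divided by
    the odds of the reaction with all other drugs."""
    if b == 0 or c == 0:
        raise ValueError("odds ratio is undefined for this table")
    return (a * d) / (b * c)
```

Values well above 1 suggest the reaction is reported disproportionately often for the drug of interest; they indicate a signal worth reviewing, not proof of causation.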
  • the present method uses both automated and semi-automated techniques to blend and create the best data mining and hypothesis testing.
  • the present method combines detection as well as association algorithms to help identify the relationships among a drug, a given patient genotype, and the reactions and outcomes that could result.
  • the present method accesses the system database, wherein a set of cases may be selected for analysis. Having selected the case-set of interest, the method then preferably proceeds to a profile, which preferably displays statistically-derived values that describe the behavior of the drug of interest based upon patient genotype. From the profile, the method can then preferably proceed to employ one or more filters that permit recalculation of the statistics by selecting among available variables.
  • a set of cases is determined, for example, by the use of one or more filters, the cases can then preferably be submitted to one or more data mining engines.
  • data mining engines may include a correlator engine, which provides information on analyses that have been previously completed—including date and time, task number, and generic drug. Each listing ends with a hyperlink that a user can employ to view the results of a search.
  • a “delete” function is preferably provided to manage this list.
  • Step 400 of the method provides the data mining and extraction capabilities in which the results of statistical analysis in step 300 are compared against selected thresholds or criteria to extract the data of interest.
  • each dimension in the present invention will have a natural “filter” framework based on the number of parameter (dimension) variables and their number in the database.
  • one or more of a combination of filters can be used to select “cases” and perform a restricted analysis of step 300 .
  • This interaction between steps 300 and 400 allows an individual researcher to use his/her hypothesis to adjust the analysis. For example, a set of adverse events may really be a reflection of non-efficacy of a certain drug, such as a reported adverse event of “depression” for a patient taking an anti-depressant. These reactions could be filtered out as part of step 300.
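A filter of the kind described above, which removes reactions that merely reflect non-efficacy before the cases are re-analyzed, might look like the following sketch. The case layout (dictionaries with `drug_class` and `reactions` keys) and the `non_efficacy` lookup table are hypothetical.

```python
def drop_non_efficacy(cases, non_efficacy):
    """Remove reactions that reflect lack of efficacy for the drug's class.

    cases: list of dicts with 'drug_class' and 'reactions' keys (assumed layout).
    non_efficacy: dict mapping a drug class to the set of reactions to exclude.
    Cases left with no reactions are dropped from the filtered set.
    """
    filtered = []
    for case in cases:
        excluded = non_efficacy.get(case["drug_class"], set())
        kept = [r for r in case["reactions"] if r not in excluded]
        if kept:
            filtered.append({**case, "reactions": kept})
    return filtered
```

The filtered case set can then be passed back to the statistical analysis, so that "depression" reported for an anti-depressant does not inflate the signal for that drug while the same reaction is still counted for other drug classes.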
  • Output from the data mining engines is preferably displayed using a graphical viewer, which permits the user to present the data in a variety of formats, including, but not limited to, a sortable table, a sortable line listing, and a radar screen, thus allowing rapid identification of signals and providing the user the ability to drill down to individual case details.
  • the method of the present invention permits choosing a profile, applying one or more filters, processing the set of cases using the data mining engines, and displaying the results for a user or viewer.
  • the present invention therefore includes a means to assemble data, create a database, and finally produce a summary and individual outputs based on the implied parameter “triplet” consisting of patient, drug, and adverse event data.
  • the result is a structural database that combines a variety of drug characteristics, drug class and pathway characteristics, and population as well as individual genetics.
  • the present invention provides a method for applying genomic-based adverse event data in a drug lifecycle.
  • Adverse Drug Reactions (ADRs)
  • the database of the present invention was developed based on clinical outcomes.
  • the system and method works backward from ADRs toward the statistical distribution of genotypes potentially associated with the event.
  • the grouping of SNPs and gene variants provides a “cluster” that can be associated with a higher-than-expected reaction. For example, it may be that roughly 1% of patients experience headache with a particular drug, but 30% of those patients sharing a particular genotype have experienced headache with the drug. This may potentially link that genotype to headaches associated with the drug.
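The headache example above can be written as a simple ratio of rates. This assumes the 30% figure is the headache rate among patients on the drug who share the genotype, and the 1% figure is the headache rate among all patients on the drug; the name "enrichment" is introduced here for illustration only.

```latex
\[
\text{enrichment}
  = \frac{P(\text{headache} \mid \text{drug},\ \text{genotype})}
         {P(\text{headache} \mid \text{drug})}
  = \frac{0.30}{0.01}
  = 30
\]
```

Under that reading, carriers of the genotype report headache about thirty times as often as the overall treated population, which is the kind of higher-than-expected rate that flags the cluster for further review.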
  • the system and method disclosed herein may be used during the research and pre-clinical trial stage of drug development for reviewing a set of drugs against the genetic background of a population in order to determine the ADR profile.
  • the system and method may also be used during clinical trials, comparing the actual experience of ADRs with the database, for example, using proportional analysis (or any of the above techniques) to see if the trial population exhibits unexpected adverse events.
  • the system and method may additionally be used during the diagnosis and prescription process at a point of care for checking a particular patient's genetic profile against a drug to assess or determine the probability of an adverse drug reaction for a drug (especially those considered serious by the healthcare provider) compared to other drugs.
  • the system and method may further be used on a continuing basis for collecting post-market data on drugs by retrieving the genetic profile of patients exhibiting adverse events for a drug, and updating the present invention database with this information.
  • phenotypic markers (e.g., blue eyes, blond hair, and fair skin, as in people of Nordic descent)
  • environmental factors such as diet, work, or smoking habit may be related as well.
  • the system and method further utilize a drug utilization review (“DUR”), incorporating the genomic dimension to the patient specific DUR.
  • The DUR application uses the link among the three elements discussed above (drug, reactions, and genotype) to create methods for avoiding ADRs for a given patient.
  • the DUR application of the database would assess the drug and rate its potential for adverse reactions for that individual.
  • Other factors such as environment, nutrition, foodstuffs, beverages, exposure to toxins, chemicals, supplements, herbal remedies, and the use of other drugs, would also continue to be considered, based on the best available evidence.
  • the system and method assign a risk parameter that combines drug, genetic, and outcome information of drugs on the drug label, to create a rapid means to improve drugs for an individual patient, or assess the risk of a specific drug for that patient.
  • the system may assign a PIN to an individual, and then provide a means by which to query the database on the relative risk for a drug, as described below.
  • By using a PIN-based system, patient privacy is maintained.
  • the method and system described herein may be embodied in a network environment.
  • a central or other network accessible means
  • a scale could be used (such as described below).
  • This method would allow any subscriber or enrollee to a healthcare program using the method to have genomic data input from any location, for example, from a genomics lab that examines the patient's or enrollee's profile. The data on other aspects of the patient would similarly be entered, all via PIN. Then, at any time, the DUR application would allow network access to the risk assessment.
  • the present method may use a parameter or coefficient specifically designed to measure risk. Although many weightings may be used, this parameter would provide, on a suitable scale (e.g., 0 to 1.0 or other normalized scale), a weighted assessment of genetic risk for the patient based on the probability that certain drug reactions are likely for the patient genotype.
  • the scale takes into account the closeness of the drug to the drug in the database (for example, it may share only a chemical class) and the closeness of the genetic profile of the patient to the average genetic profile of the database, again using a closeness fraction based on the number of standard deviations from the mean fit. This scale would then provide a nominal risk, with an error range, for the individual.
  • any actual scale that uses a linear, or logarithmic weighting (or other), that accounts for a closeness adjustment for the drug and for the genotype, will then be used to report the expected ADRs. Note, in the preferred embodiment, the most serious reaction would be used and weighted more heavily (using, for example, the FDA CDER Designated Medical Event list).
  • the scale would be refined over time as better tests and more population statistics are added. Thus, both the database and the precision of the scale will improve over time. The use of a numerical range that healthcare providers become accustomed to will allow ease of interpretation of the results.
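One way to picture the risk scale described above is the sketch below. Every formula in it is an assumption made for illustration, since the text does not specify the weighting: the genotype closeness fraction is taken as 1 / (1 + number of standard deviations), the nominal risk as the reaction probability times a seriousness weight (capped at 1.0), and the error range as widening when either closeness fraction falls.

```python
def risk_estimate(reaction_probability, seriousness_weight, drug_closeness, genotype_sd):
    """Return (nominal_risk, (low, high)) on a 0 to 1.0 scale.

    reaction_probability: probability of the reaction for the matched genotype.
    seriousness_weight:   multiplier > 1 for more serious reactions (assumed).
    drug_closeness:       fraction in [0, 1]; 1.0 means the same drug is in the database.
    genotype_sd:          standard deviations between the patient's genetic profile
                          and the database mean profile.
    """
    if not 0.0 <= drug_closeness <= 1.0:
        raise ValueError("drug_closeness must be between 0 and 1")
    # Assumed closeness fraction: 1.0 at the mean, shrinking with distance.
    genotype_closeness = 1.0 / (1.0 + abs(genotype_sd))
    confidence = drug_closeness * genotype_closeness
    nominal = min(1.0, reaction_probability * seriousness_weight)
    # Assumed error model: the range widens as confidence drops.
    half_width = (1.0 - confidence) * nominal
    low = max(0.0, nominal - half_width)
    high = min(1.0, nominal + half_width)
    return nominal, (low, high)
```

With an exact drug match and a patient profile at the database mean, the range collapses to the nominal value; a drug that only shares a chemical class, or a patient far from the mean profile, yields the same nominal risk with a wider range.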
  • the method described herein provides for personalized medicine application in drug safety.
  • the system and method draw a relationship between genetic type and predispositions to adverse reactions for certain drugs or drug types. Given a sufficient set of specific genetic information on an individual, there exists the potential to create a “profile” for that individual.
  • the realm of adverse events involves numerous other areas of an individual's metabolism. In fact, it goes beyond that to include the influence of environmental factors, such as whether the individual is a coffee drinker, smoker, or traveler, that change the individual's environment and the medical situation into which drugs are prescribed.
  • the system and method allow, for example, an individual's profile to be privately stored and accessed via a PIN. Then, given the drug a physician is considering, the patient's background can be checked for potential risk.
  • various embodiments of the system and method provide several benefits. These benefits include warning a patient or physician that certain drugs have produced reactions associated with the patient's genotype; a broad understanding of gene and SNP relationships to pathways, proteome and drugs in those pathways; a statistical understanding of the genetic behavior of drug classes; a structural database that allows a drill-down on genetic differences for more specific reactions; a database that makes ADR profile associations with proteome possible; a database that increases the potential to uncover the multiple ways certain ADRs could develop; a method for scoring the genomic based risk of certain adverse events for drugs or drug classes; a method for preserving patient privacy while permitting clinical labs, physicians, etc., to access the information for a particular patient; adjustment of the genetic risk by correlation with environmental factors; a method for adding details to the genomic database as new information is made available; a broad, statistical understanding of population genetic impact on the percentage risk of certain adverse events with certain drugs or drug classes; and a method for adding data from sources that may have been based on different vocabularies.

Abstract

A method for assessing and analyzing one or more drugs, adverse effects and associated risks, and patient characteristics resulting from the use of at least one drug of interest is disclosed. The method comprises the steps of selecting one or more cases for analysis, said cases describing the behavior between at least one drug of interest and a patient genotype; profiling statistically derived values from multiple cases related to the safety of the at least one drug, wherein at least one filter is employed for deriving said values; at least one data mining engine; and an output device for displaying the analytic results from the data mining engine. A system for performing the method is likewise disclosed.

Description

    RELATED APPLICATIONS
  • This application is a continuation application of U.S. Ser. No. 10/229,119, filed Aug. 28, 2002, which claims the benefit of U.S. Provisional Application No. 60/315,525, filed Aug. 29, 2001, and is related to each of the following applications: U.S. patent application Ser. No. 09/681,586, filed May 2, 2001; U.S. patent application Ser. No. 09/681,587, filed May 2, 2001; U.S. patent application Ser. No. 09/681,583, filed May 2, 2001; and U.S. patent application Ser. No. 09/845,722, filed May 2, 2001, the disclosures of which are incorporated herein by reference in their entireties.
  • TECHNICAL FIELD
  • The present invention relates to a drug safety database. More particularly, the invention relates to a method and system for creating and utilizing a database that relates drugs, adverse events, patient characteristics, and in particular, genetic information.
  • BACKGROUND OF THE INVENTION
  • Publicly and privately developed pharmacological data is readily available from both reference data and source data. Statistical information has been collected for many years on adverse reactions to drugs, including information on prescriptions, nutraceuticals, and over-the-counter medications. With this information, databases have been created that provide both reporting and data analysis of adverse drug reactions. Typically, this data is provided in a format that is not amenable to searching, such as in ASCII format.
  • Additionally, these databases are often in different structures and language formats, decreasing efficiency and impeding effective use. Further, the variations in terminology and software languages employed by these disparate databases complicate conventional queries, making the results unreliable.
  • Various methods and techniques have been developed to address the need to provide ready access to pharmacological data and adverse events. However, none of these partial solutions, such as the Freedom of Information data provided by the FDA (which relies on “flat files”) or standard dictionaries such as the Medical Dictionary for Regulatory Activities (MedDRA™), have been integrated to allow consistent analysis and results.
  • Pharmacogenetics is the study of individual response to drugs as a function of genetic differences. These responses relate to how a drug functions in any given individual, how it is metabolized, its toxicity and dosage requirements. With the human genome project, pharmacogenetics has expanded into pharmacogenomics. Pharmacogenomics goes beyond pharmacogenetics, with potential uses in drug discovery and development, target discovery and validation, and clinical trials, and with the potential to bring that information into the doctor's office so that the right medicine is given to the right patient at the right time.
  • For pharmacogenomics to be effective, markers are needed that are indicative of the connection between drug response and genetic makeup. One such marker that is being diligently pursued is the single-nucleotide polymorphism (SNP). Databases are presently available that furnish a map of over a million SNPs. From this data, information has been collected regarding the allelic frequency of a SNP within an ethnically diverse population. There are also other databases that are more narrowly tailored and focus only on particular groups of SNPs, such as those that code for proteins, or that provide a data set related to ADME (absorption, distribution, metabolism and excretion) genes and SNPs associated with how the body responds to drugs. Instead of single SNPs, some databases focus on SNPs that are found in haplotypes, which work together to cause a particular drug response.
  • Pharmacogenomics is being applied to pharmacodynamics, how a drug affects a disease. Additionally, pharmacogenomics is being applied to pharmacokinetics, or how the body processes a drug. While the pathway for drug intervention is usually well known, there are two important mechanisms to consider in pharmacokinetics—the pathway that metabolizes the drug itself, and other pathways that drugs and their metabolites may inadvertently and adversely affect. It is in these two areas that drug safety comes into play. In the first, there are distinctions among human genotypes in the ability to metabolize drugs. If a drug is not metabolized as predicted in the clinical trials, it could potentially build up to toxic levels. Different segments of the population metabolize drugs differently, providing a variety of potential reactions to a drug which can impact the dosage, safety, and efficacy of that drug and its usefulness for an individual patient.
  • A drug and its metabolites may affect other pathways for varying genotypes (or phenotypes). Currently available data on drug safety, as collected by regulators around the world, does not address genetic variances, although hundreds of different reactions are reported to occur in many body systems.
  • Accordingly, what is needed is an understanding of the impact of differing rates of metabolism on adverse drug events. Additionally, there is a need for a database providing drug safety data as collected by regulators around the world, particularly from a genetic perspective. Further, there is a need for a relational database that can assimilate and correlate these two sets of data, particularly from a genomic perspective.
  • SUMMARY OF THE INVENTION
  • Using data, meta-analysis, standardization of terminology and sophisticated association algorithms, the present invention addresses the needs in the above area of drug safety by providing a system of relating drug safety data, adverse reactions, pharmacogenetics, pharmacogenomics and demographic data.
  • The system described herein is in the area of genomic drug safety, and accordingly provides a method to evaluate existing and/or potential drugs from a genomic point of view. This method takes the drug dimension, with its different characteristics (e.g. chemical class) and adds information regarding the metabolic pathway of drugs (including current and historical drugs), thereby creating genetically relevant taxonomies of drugs. The method also includes demographic information such as the phenotype and genotype of a particular patient involved with a particular reaction (case). These extensions allow the resulting application to correlate drugs with the metabolizing characteristics of specific patient genotypes. As such, the method shows how these drug/genotype interactions lead to increased or decreased chances for particular adverse reactions.
  • The method creates and utilizes a database which correlates drugs, adverse events and patient characteristics. The method focuses primarily on genetic information, particularly SNPs and gene variants that relate to drugs or drug classes and adverse reactions. Whereas drugs generally metabolize in one or two pathways, adverse events can occur in any of the body systems or pathways, even those not originally believed to be impacted by the drug, leading to extreme complexity.
  • In accordance with the method, data is collected, for example, from a multitude of sources, and is then analyzed and stored in a standardized data structure. A method of developing “mappings” allows data to be consistently compared and analyzed even if the original sources use incompatible language.
  • The method takes existing drug data on efficacy and safety, such as is found in drug labels, monographs, and post market information, and solicits new data, such as through clinical trial data in the literature, or from patient and medical histories, and then associates those relationships with patient characteristics including genetic and environmental factors (e.g., diet, age, sex, race, etc.).
  • The resulting databases can be used in drug research, where the analytic techniques provided herein can be used to sort among lead compounds to determine those with the fewest side effects based on population genetics. The databases can also be used in prescribing drugs, matching a patient not only to the most efficacious drug, but also to the most effective drug with the least side effects and the lowest required dosage based on the patient's specific genetic and environmental markers.
  • Since it is possible to check all combinations of genotypes against all drug information, an additional benefit of the present invention is the capability to prepare a database of known relationships and apply them prior to or at the point-of-care (POC). This allows the physician to check a particular patient profile against the drug to be prescribed to assess the risk of an adverse drug reaction for that patient.
  • The method described herein provides risk analysis of the use of particular drugs based on chemical, proteomic, genomic, and demographic information for both diagnostic and therapeutic purposes. The method uses an innovative complex of tools that associate historical drug safety data with both drug characteristics and patient genetic make-up. One embodiment of the method includes summary data on populations that have had adverse drug reactions along with their genetic profile. Another embodiment provides broad population genetics data for prescribing decisions made on drugs and therapeutics, based on the likelihood of adverse drug reactions. A further embodiment compares individuals with the drug reaction profile of a given population. For example, if a patient profile matches the genetic profile of those prone to liver disorders, then the physicians/pharmacists will avoid prescribing or dispensing drugs with potential adverse liver effects to that patient.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Certain embodiments of the disclosed invention will now be described in greater detail, and exemplarily shown in the associated drawings in which like reference numerals have been used to indicate like and similar components, arrangements of components, and functional features of the same. The illustrative drawings disclose exemplary and, in some cases, alternative embodiments of the invention, in which regard:
  • FIG. 1 is a schematic block diagram illustrating one embodiment of a method of mapping various data sources.
  • FIG. 2 is a block diagram illustrating one embodiment of the steps utilized in performing the method of the present invention.
  • FIG. 3 is a block diagram illustrating one embodiment of the present invention integrating information on drugs, demographics and adverse reactions.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The present invention comprises a system and method for creating, storing and using patient-specific and population-based genomic drug safety data including one or more integrated databases; a selector for selecting at least one drug for analysis (based on the generic, brand name or therapeutic category); a profiler for displaying statistics that describe one or more behaviors of the drug in multiple dimensions; a series of at least two filters and the means to control the filters individually and in combination; at least one data mining engine, preferably a correlator, a proportional analysis engine, and a comparator; and a graphical user interface for displaying the results of the analysis.
  • Dimensions such as age, sex, weight, diet, dates, reactions, doses, outcomes, illnesses, report source, and concomitant drugs can be analyzed in combinations of two dimensions, in combinations of three dimensions, in combinations of four dimensions, and other combinations. The method described herein also permits analysis of the association between outcomes and other dimensions.
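The combinations of dimensions mentioned above can be enumerated mechanically. The sketch below is illustrative only; the `DIMENSIONS` list simply restates the dimensions named in the text, and the function name is hypothetical.

```python
from itertools import combinations

# Dimensions named in the text; the list itself is only an illustration.
DIMENSIONS = [
    "age", "sex", "weight", "diet", "dates", "reactions", "doses",
    "outcomes", "illnesses", "report source", "concomitant drugs",
]

def dimension_combinations(dimensions, sizes=(2, 3, 4)):
    """Yield every combination of the given sizes, smallest size first."""
    for size in sizes:
        for combo in combinations(dimensions, size):
            yield combo

def combinations_with(dimensions, required, sizes=(2, 3, 4)):
    """Yield only the combinations that include a required dimension,
    e.g. 'outcomes' when studying associations with outcomes."""
    for combo in dimension_combinations(dimensions, sizes):
        if required in combo:
            yield combo
```

Each yielded tuple names one set of dimensions to cross-tabulate; the number of combinations grows quickly with the size, which is one reason filters are used to restrict the analysis.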
  • FIG. 1 shows the relationship among the various data sources and how data is drawn into a consistent data structure, one that is referenced to standard dictionaries, regardless of which dictionaries were used in the source. In the embodiment of the system and method illustrated in FIG. 1, source data is gathered to create a composite genomic relational drug safety database 10. This source data is collected from databases covering three general areas: adverse event database 20, drug information database 30, and patient or genomic database 40. Adverse event source database 20 includes data on drugs, reactions, demographics and outcomes. Sources of this data are databases such as, but not limited to, the FDA's Spontaneous Reporting System (“SRS”), the FDA's Adverse Event Reporting System (“AERS”), the World Health Organization adverse event database, or other country-specific regulatory or epidemiological databases, such as the UK Advert system and the General Practice Research Database (GPRD), as well as other databases and sources, which may be domestic, foreign and/or international in scope. Unexpected or previously unrecognized adverse drug effects can take the form of single reactions, groups of reactions, or increases in a labeled reaction. These adverse reactions may be due to the exposure of a greater number of people to the drug, or the reactions of a particular demographic group. Such information is continually updated as new cases are released and can be added to the information for use in the present method.
  • Adverse event data 20 may also be provided, for example, from pharmaceutical corporations, hospitals, physicians, health insurers, and state, federal and international agencies. A primary source of pharmaceutical industry data is the individual adverse event databases of the various pharmaceutical corporation safety departments. In each case, source data may be focused on clinical trails, post-market surveillance, research databases, or the like. Unedited data in each source database is referred to as “verbatim.” Clinical trial data available in literature includes safety data. Other information is collected and can be accessed from the World Health Organization (WHO), PEM, the General Practice Research Database (GPRD), and so forth.
  • Drug information source database 30 may include the type or class of drug, metabolic pathways, and drug pharmacokinetics and pharmacodynamics. Such drug source database 30 provides drug taxonomies that offer characteristics of drugs including metabolites, clearance rates, peak serum levels, pharmacodynamics, therapeutic category, chemical structure, or a way to group drugs and explore the relationship to both reactions and genotypes.
  • Patient source database 40, which holds both records with the name and patient identification removed and summarized records, may include genetic, demographic, and phenotypic data as well as drug and patient adverse drug reaction (“ADR”) history. This source data may be acquired by accessing, soliciting, or assembling data on patients experiencing adverse drug reactions, and comparing the data against data from a control set of a broad population who are not taking the drug/drugs in question in order to see the relationship between certain reactions and genotype/phenotype. For example, light skinned people (a kind of phenotype with genotypic background) are generally prone to sunburn and may additionally be particularly sensitive to certain drugs. Population genetics information includes a wide variety of sources including DNA samples solicited directly from people who have had documented adverse reactions to certain drugs.
  • In addition to source data, reference data 50 from various accepted canonical references, including dictionaries, thesauruses, taxonomies, and hierarchies, is gathered and used in the genomic drug safety database 10. Examples of such references include the Medical Dictionary for Regulatory Activities (“MedDRA™”), National Drug Code Directory (“NDCD”), and the FDA Orange Book. The method described herein has the capacity to substitute and manage both source and reference data.
  • The basic steps of the method are shown in FIG. 2. In step 100, data is assembled from sources that relate drugs to reactions and characteristics, and patients to medical history and genotype as described above with respect to FIG. 1. In step 200 of the method, data is implanted into a data structure, information that has been broken down to its most fundamental level is mapped to reference dictionaries, and source data is parsed into a relational database structure. For data sources not already in a relational database structure, transformation from raw source data to a relational structure preferably includes parsing each data source into an image with fields tailored to its corresponding source. Subsequently, the images are consolidated into a single safety table space. Since the database can be simple or complex, the method provides the ability to add many “dimensions” such as age, sex, dates, reactions, doses, outcomes, report source, and concomitant drugs. These dimensions can be entered as structured, narrative, numerical, or categorical variables. Hierarchies in all dimensions (in both preferred and custom paths) are defined as required by the particular end user.
  • Several of the most favored data sources are not published in a format that lends itself to direct query; e.g., SRS is available from the U.S. Government only as delimited ASCII data. Parsing such data into a relational database model allows the use of data management tools, which are ineffective on flat files. In preferred embodiments of the present method, the safety table space provides a common set of fields for the parsed source data.
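  • The parsing of delimited ASCII source data into a relational structure can be illustrated with a minimal sketch. The `$` delimiter, the field names, the table name `safety_case`, and the sample rows are all assumptions made for illustration; they are not taken from any actual SRS release or from the described system.

```python
import csv
import io
import sqlite3

# Hypothetical delimited ASCII extract; delimiter and field names are assumptions.
RAW = (
    "case_id$drug$reaction$age$sex\n"
    "1001$TYLENOL (500 MG)$HEADACHE$34$F\n"
    "1002$PROZAC$RASH$51$M\n"
)


def load_flat_file(text, delimiter="$"):
    """Parse delimited text into an in-memory relational table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE safety_case ("
        "case_id INTEGER PRIMARY KEY, drug_verbatim TEXT, "
        "reaction_verbatim TEXT, age INTEGER, sex TEXT)"
    )
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    rows = [
        (int(r["case_id"]), r["drug"], r["reaction"], int(r["age"]), r["sex"])
        for r in reader
    ]
    # Verbatim terms are stored unchanged; cleanup happens in later steps.
    conn.executemany("INSERT INTO safety_case VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
    return conn


conn = load_flat_file(RAW)
```

  • Once the data is in a table, ordinary SQL queries (counts, joins to reference dictionaries) become available, which is the benefit the paragraph above attributes to the relational model.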
  • In one method, an information management system integrates data from a plurality of interconnected local databases, providing users with access to a virtual database that ties drug, reaction and patient parameters together. In another method, a record repository is provided for accepting data from a medical service provider and linking it to a patient database, allowing documents for a patient to be retrieved through demographic data. An additional method provides an information extraction system wherein users ask questions about documents in a database and the system responds with information extracted from the documents. In yet another method, copious amounts of data regarding adverse events associated with a particular product are received and analyzed in view of known adverse events associated with the product, providing new uses for the product as well as a catalog of adverse event information for a large number of population sub-groups.
  • FIG. 3 shows the relationship of three information categories and their various parameters (characteristics). The category entitled “Reactions” or “Adverse Reactions” (20) has been structured by various means: COSTART, WHOART, and most recently MedDRA™. “Reactions” are additionally characterized by their seriousness (e.g. the FDA's “Designated Medical Events,” or DME). The method provides a means to integrate these various dictionaries. The second entity is the product or “drug”. It is classified by therapeutic class, chemical class, metabolic pathway, metabolites or structure. The method provides a system to integrate these parameters. The last entity, named “demographics,” has typically rested on either broad parameters such as age and sex, or on the more specific characteristics of weight, diet, climate, phenotype, proteome, genotype and environment. Recent work in genetics has identified a critical SNP or gene, a defining characteristic for an individual—the genotype.
  • In one embodiment, data cleanup may be performed independently of parsing source data into a safety database. This allows cleanup to be continual, ongoing, and iterative, either before or after one or more source databases are processed into the pharmacovigilance database which determines if the events are or are not due to the drug. Source database cleanup is an incremental process, proceeding from automated cleanup of certain errors, through human-assisted cleanup of ambiguous entries, to human correction of identified gross errors. Specific cleanup tasks include noise reduction (e.g., suppression of non-alpha characters, noise words, and combination words), adjustment for misspellings, adjustment for dislocations and transpositions, and resolution of possible redundant entries.
  • In a preferred embodiment, reactions, drugs, and counts of the occurrence (by case and absolute) of each are extracted from the parsed source data. The counts are then grouped. In this embodiment, grouping is by order of magnitude of the count.
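  • Grouping counts by order of magnitude can be sketched as follows. This is only an illustration: it counts absolute occurrences (not occurrences by case), and the function name and sample terms are hypothetical.

```python
from collections import Counter, defaultdict


def group_by_magnitude(terms):
    """Count each term, then bucket terms by the order of magnitude of the count."""
    counts = Counter(terms)
    groups = defaultdict(list)
    for term, n in counts.items():
        # Number of digits minus one: 1-9 -> 0, 10-99 -> 1, 100-999 -> 2, ...
        magnitude = len(str(n)) - 1
        groups[magnitude].append((term, n))
    return dict(groups)


# Hypothetical extracted reaction terms.
sample = ["headache"] * 120 + ["rash"] * 15 + ["nausea"] * 12 + ["vertigo"]
groups = group_by_magnitude(sample)
```

  • High-count buckets are natural places to start when nominating token candidates, since a single mapping decision there resolves many records.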
  • In a preferred embodiment, the bulk of data cleanup is performed on a computing platform separate from database storage. A spreadsheet application, such as Microsoft® Excel®, is used to track cleanup operations. For example, the first column in such a spreadsheet may contain the verbatim term; the second column may contain a noise-suppressed verbatim term; the third column may contain the spell-checked verbatim term, and so on. Other data cleanup applications, such as Metaphone (discussed below), also reside on this separate computing platform. However, cleanup applications need not reside on a separate computing platform, or may be accessible via the Internet or other computer network.
  • As a part of the data cleanup operation, noise reduction may be performed on the data. Noise reduction involves suppression of words and characters that are typically unnecessary in determining the correct name for drug or reaction verbatim. Noise words and characters include, but are not limited to, non-alpha characters (such as numbers, diacriticals, brackets, and control characters), words (e.g., “mg” or “tablet”), and combination words (e.g., “20mg” with no space). For example, both “Tylenol (500 mg)” and “Tylenol Capsules” would be reduced to “Tylenol.” A list of noise words and noise punctuation is stored in database tables associated with lexical processing. Non-alpha characters, such as control characters, are also suppressed at this stage.
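  • A minimal sketch of the noise reduction described above follows. The noise word list here is a small assumed sample held in memory; the text describes the real list as stored in database tables. The function returns a new string and leaves the verbatim term untouched.

```python
import re
import unicodedata

# Assumed sample of noise words; the described system keeps these in database tables.
NOISE_WORDS = {"mg", "ml", "tablet", "tablets", "capsule", "capsules"}


def suppress_noise(verbatim):
    """Return a noise-suppressed copy of a verbatim drug or reaction term."""
    # Fold diacriticals to their base letters.
    folded = unicodedata.normalize("NFKD", verbatim)
    folded = "".join(ch for ch in folded if not unicodedata.combining(ch))
    # Replace digits, brackets, punctuation and control characters with spaces;
    # this also splits combination words such as "20mg" into "mg".
    alpha_only = re.sub(r"[^A-Za-z]+", " ", folded)
    kept = [w for w in alpha_only.split() if w.lower() not in NOISE_WORDS]
    return " ".join(kept)


examples = [suppress_noise(t) for t in ("Tylenol (500 mg)", "Tylenol Capsules")]
```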
  • After noise reduction, misspellings are detected and corrected using known tools such as spell checkers, sound-alike suggestion programs, a verbatim replacement table, and human inspection. A preferred spell checker operates on noise-suppressed verbatim terms, making a series of spelling variations on terms not found in the reference sources. These variations are used as the basis for searching reference sources and suggesting candidate canonical terms. Reference sources include standard and special-purpose dictionaries. The variations introduced include:
      • adding an extra character to the term, e.g., allowing noise-suppressed verbatim such as “proza” to be searched as “prozac”;
      • removing a character from the term, e.g., allowing noise-suppressed verbatim such as “prozzac” to be searched as “prozac”;
      • swapping adjacent characters, e.g., allowing noise-suppressed verbatim such as “rpozac” to be searched as “prozac.”
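  • The three variations listed above (adding a character, removing a character, swapping adjacent characters) can be sketched as a candidate generator whose output is intersected with the reference terms. The function names and the tiny reference set are hypothetical.

```python
import string


def spelling_variations(term):
    """Generate all one-step variations: insertions, deletions, adjacent swaps."""
    splits = [(term[:i], term[i:]) for i in range(len(term) + 1)]
    inserts = {a + c + b for a, b in splits for c in string.ascii_lowercase}
    deletes = {a + b[1:] for a, b in splits if b}
    swaps = {a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1}
    return (inserts | deletes | swaps) - {term}


def suggest(term, reference_terms):
    """Return reference terms reachable from the noise-suppressed term in one step."""
    return sorted(spelling_variations(term.lower()) & set(reference_terms))


REFERENCE = {"prozac", "claritin"}  # assumed sample of canonical terms
candidates = [suggest(t, REFERENCE) for t in ("proza", "prozzac", "rpozac")]
```

  • Each of the three misspellings from the list resolves to the single candidate "prozac" against this sample reference set.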
  • In addition to a spelling suggester, a sound-alike program, such as Metaphone or Soundex, is employed to suggest variations. Metaphone is a published phonetic code algorithm similar to Soundex, which is a sound based indexing system. Every word has a four-letter Metaphone value that can be calculated. The Metaphone suggester calculates the Metaphone value for each entry in the reference sources and for each unresolved verbatim term. Those reference source terms having a Metaphone value matching that of an unresolved verbatim term will be offered as a suggestion to a database developer for resolution. For example, the Metaphone value for both “prosac” and “prozack” is “PRSK”; the Metaphone value for both “Claritin” and “Klariton” is “KLRT.” Where no candidates satisfy the developer, an option is provided for accepting a surrogate term from the developer.
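  • The sound-alike suggester can be sketched with Soundex, which the text names as an alternative to Metaphone. Soundex is used here only because it is short enough to show in full; it is not a Metaphone implementation, and the codes it produces differ from the Metaphone values quoted above. The function names and reference list are hypothetical.

```python
# Standard American Soundex consonant classes.
CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in group:
        CODES[ch] = digit


def soundex(word):
    """Return the four-character Soundex code of a word."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    encoded = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        if code and code != prev:
            encoded += code
        # 'h' and 'w' do not separate letters that share a code; vowels do.
        if ch not in "hw":
            prev = code
    return (encoded + "000")[:4]


def sound_alike_suggestions(verbatim, reference_terms):
    """Offer reference terms whose phonetic code matches the unresolved term."""
    target = soundex(verbatim)
    return [t for t in reference_terms if soundex(t) == target]


suggestions = sound_alike_suggestions("prosac", ["prozac", "paxil"])
```

  • Because Soundex always keeps the first letter, it would not pair “Claritin” with “Klariton”, a pair the text says Metaphone does match. That is one practical reason to prefer Metaphone for drug names.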
  • Various embodiments of the method described herein include steps for capturing and using domain specific lexical knowledge not easily applied through noise reduction or spell checking. At the basic level, this amounts to use of a replacement table, containing mappings from known errors to corrected canonical terms. On a more sophisticated level, as domain-specific knowledge is accumulated, autocoders are employed to capture human decision-making experience regarding cleanup.
  • Human interaction is particularly useful in identification and correction of dislocation errors, i.e., where a term valid in one field (e.g., headache/reaction) appears in a field where it is not valid (e.g., headache/drug). Dislocation errors are identified in preferred embodiments where a term does not fit the type of the field it is found in, but nonetheless exists in reference sources outside the scope of the particular field.
  • Redundant entries are identified and removed with operator assistance. A “case” may include all data regarding the adverse events experienced by one person taking a drug. A sequence of events regarding a single individual taking a drug should not be recorded as separate cases (potentially duplicating the adverse events associated with the case). This is important for correct statistical views of the data. The method provides tools to operators to allow identification and consolidation of redundant cases. In preferred embodiments of the present invention, multiple cases involving the same person over a contiguous period are presented to an operator for determination as to whether or not such entries actually represent one case with multiple (or possibly single-occurrence, multiple-reported) events.
  • For example, if a case concerning an “eye pain” reaction is amended fifteen times, only one instance of eye pain should be aggregated for this individual case. Through record linking, preferred embodiments of the method match successor reports with their predecessors using data inherent in the records, and compare other information in the records to gauge the quality of the match. For example, two cases may match on the “case identification” field, or a “drug manufacturer identification” field, or a “report date.” Those cases known to be redundant, and those cases showing a link between records, are presented to researchers for resolution. In alternate embodiments, resolution between likely redundant cases is accomplished via an expert system.
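  • Record linking for likely redundant cases can be sketched as a pairwise score over the fields mentioned above. The field names, the equal weighting of fields, and the threshold are assumptions; the sketch only nominates pairs for review and does not merge anything, in keeping with the operator-assisted resolution described.

```python
from itertools import combinations

# Assumed field names corresponding to the identifiers mentioned in the text.
LINK_FIELDS = ("case_id", "manufacturer_id", "report_date")


def link_score(a, b):
    """Count the linking fields on which two reports agree."""
    return sum(
        1 for f in LINK_FIELDS if a.get(f) is not None and a.get(f) == b.get(f)
    )


def candidate_duplicates(reports, min_score=1):
    """Return (index, index, score) for report pairs worth showing to a reviewer."""
    pairs = []
    for (i, a), (j, b) in combinations(enumerate(reports), 2):
        score = link_score(a, b)
        if score >= min_score:
            pairs.append((i, j, score))
    # Strongest matches first.
    return sorted(pairs, key=lambda p: -p[2])


REPORTS = [
    {"case_id": "A1", "manufacturer_id": "M9", "report_date": "2001-01-05"},
    {"case_id": "A1", "manufacturer_id": "M9", "report_date": "2001-02-10"},
    {"case_id": "B2", "manufacturer_id": "M7", "report_date": "2001-02-10"},
]
review_queue = candidate_duplicates(REPORTS)
```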
  • It should be understood that underlying verbatim terms are not changed by application of noise suppression, the use of spell checkers, the resolution of dislocations, or the resolution of redundant entries. Verbatim terms, such as drug and reaction terms, that have been parsed into a safety database and cleaned are then mapped to “tokens” from the reference data sources. The word “token” refers to the specific term(s), from one or more of the reference sources, that is associated with one or more of the verbatim terms in a manner that allows a search for the token to return results containing the verbatim term(s) linked to the token. Where an exact match exists between a verbatim term (source or cleaned) and a reference term, the verbatim term is mapped to the reference term as token. Where no exact match is found between verbatim (cleaned or otherwise) and reference data terms, the present invention presents a series of steps for resolving such unmatched terms.
  • In addition to corruption in verbatim data, valid variations in terminology may also be resolved through mapping to reference data tokens. For example, “PROZAC” and other trade names for fluoxetine are preferably mapped to the generic “fluoxetine.” In another example, luliberin, gonadotropin releasing hormone, GnRH, gonadotropin releasing factor, luteinizing hormone releasing hormone, LHRH, and LH-FSH RH are equivalents and may be considered as such for analyzing adverse effects. Furthermore, different chemical derivatives, such as esters, salts, or acidic or basic forms of the same drug may be grouped together, where a reference data term exists, under the same token in order to analyze adverse drug events.
  • In accordance with the method, source data verbatim terms may be nominated as token candidates; frequency of occurrence and absolute count being typical bases for nominating a term as a token candidate. Verbatim drug, patient and reaction terms may be grouped by order of magnitude of absolute count. For reactions, token candidates are chosen from accepted reference sources such as MedDRA™, Coding Symbols for a Thesaurus of Adverse Reaction Terms (“COSTART”), GPRD, and World Health Organization Adverse Drug Reaction Terminology (“WHOART”). For drugs, token candidates are chosen from corresponding canonical sources such as the National Drug Code Directory (“NDCD”), WHO Drug Dictionaries for drugs, and the FDA's Orange Book.
  • Individual verbatim terms are then mapped to the selected tokens. According to the present method, this process may be used for multiple database dimensions in addition to drug and reaction, e.g., outcomes, where the definition of “serious” outcomes can differ over time and between reference sources. This mapping enables those searches of the database focused on tokenized fields, e.g., drug, patient, and reaction fields, to be executed with greater confidence. Using the mapping approach, variability in source event data entry, typically a difficult-to-control aspect of data collection on a large scale, is mitigated as a source of error.
  • Corrected verbatim is mapped to reference canonical terms and structures. As noted earlier, where an exact match exists between a verbatim term (source or cleaned) and a reference term, the verbatim term is mapped to the reference term as token. Where no exact match is found between verbatim (cleaned or otherwise) and reference data terms, the method presents a series of steps for resolving such unmatched terms. In those instances where a user is presented with a number of assigned unresolved entries, the method presents the user with suggestions identified by lexical processing (e.g., Metaphone, fixed list) for each unresolved verbatim term. The user may then select from this list or enter a surrogate term. After selecting a candidate term or entering a surrogate term, a list of generic drug names will be shown (if the matched term was indeed a trade name rather than a generic). At this point, a user can either save the mapping or modify the list of generic terms. This last option will allow a user to override the list of generics or to enter new chemical compounds as they are developed.
  • Cleaned source data reaction terms may be mapped to standardized hierarchies such as WHOART, COSTART, and MedDRA. Specifically, cleaned source data reaction terms are mapped to multiple levels (and possibly multiple entries within a level) of the hierarchy. In preferred embodiments, mapping of cleaned verbatim reaction terms proceeds in a fashion similar to mapping of drug terms. While the preferred embodiments perform mapping on cleaned source data, it should be understood that mapping may be performed on uncleaned, or even unparsed, source data.
  • Transparency in the process of moving from source data verbatim terms to a cleaned safety database with verbatim terms mapped to tokens is important to both database developers/operators and to end users. The present method captures the way source data terms have been cleaned and mapped as the “pedigree” of each term. The pedigree of a term is the link between the mapped term and the decisions made during data cleanup. End users typically wish to verify the pedigree of the data they use. In those embodiments, retained data includes one or more of the following as appropriate: verbatim term, token mapped to, source of the verbatim term, number of occurrences of the verbatim term, number of cases in which the verbatim term appears, which type of cleanup (if any) was performed, a cross-reference to where the token is defined, and dates of the earliest and latest reported occurrence.
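  • The mapping of a verbatim term to a token, together with a retained pedigree, can be sketched as below. The `Pedigree` fields are a subset of the retained data listed above; the class name, function name, and the two small lookup tables are hypothetical stand-ins for the reference sources.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pedigree:
    verbatim: str   # original term, never altered
    token: str      # reference term it was mapped to
    source: str     # source database of the verbatim term
    cleanup: str    # which kind of cleanup or mapping was applied


# Assumed samples standing in for reference dictionaries.
REFERENCE_TOKENS = {"fluoxetine", "loratadine"}
TRADE_TO_GENERIC = {"prozac": "fluoxetine", "claritin": "loratadine"}


def map_to_token(verbatim: str, source: str) -> Optional[Pedigree]:
    """Map a verbatim drug term to a token, or return None if unresolved."""
    cleaned = verbatim.strip().lower()
    if cleaned in REFERENCE_TOKENS:
        return Pedigree(verbatim, cleaned, source, "exact match")
    if cleaned in TRADE_TO_GENERIC:
        return Pedigree(verbatim, TRADE_TO_GENERIC[cleaned], source,
                        "trade name mapped to generic")
    return None  # left for lexical suggestions or a user-entered surrogate


record = map_to_token(" PROZAC ", "AERS")
```

  • Keeping the verbatim term alongside the token is what lets a search on the token return the original records and lets an end user audit how each mapping was made.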
  • The method may be implemented on a single computer or across a network of computers, e.g., a local area network or on or across the Internet. Preferred embodiments include implementations on computer-readable media storing a computer program product performing one or more of the steps described herein. Such a computer program product contains modules implementing the steps as inter-related functions as described herein. In a networked implementation, the databases, data management software and analytical software may reside in any combination of one or more local workstations and one or more network servers.
  • As described above, the database used in the present method sets up different information sources in a composite relational data structure, as illustrated in FIG. 3. This composition correlates patient information 40 (demographics, genetic make-up, environment), drug information 30 (class, therapeutic use, structure), and events of adverse reactions 20 (at different levels of detail). Along with DNA sample references and detailed individual medical histories, these are assembled and referenced into cases. The information in these cases includes both the verbatim data from the original sources as well as a standard reference dictionary term for each element of data. The latter is critical to ensure the ability to compare cases across drugs, patients and events.
  • Referring again to FIG. 2, in step 300 of the method described herein, statistical associations are analyzed. The present method looks at cases and determines two outcomes:
      • a. Whether there is an indication of association among a drug, a reaction or group of reactions, and a genotype
      • b. The distribution of that association in a population.
        The first outcome is determined from one of four techniques—
      • 1. Proportional analysis against a variety of backgrounds,
      • 2. A correlation of two or more parameters (e.g., Pearson product-moment correlation coefficient),
      • 3. Differential analysis (i.e., changes over time or another dimension), and
      • 4. Neural networks (one of several paradigms that associate items and provide weights based on the statistical distribution of outcomes), or other machine learning algorithms such as Hidden Markov models, Bayesian networks and kernel methods among other methods known in the art.
  • Note that the method provides a way to apply these techniques, in a novel way, to categorical variables, that is, variables that take non-numeric values. Reactions (such as headache or rash), genotype, drug chemical class, etc. are the key parameters. The Pearson product-moment correlation is applied using a binary scale (0 or 1), in which each value indicates whether one of possibly hundreds of parameters is present or absent; this is a unique application of the Pearson technique. This application rests on the ability of the method to link data to consistent dictionaries and to calculate millions of pairs.
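  • The binary-scale use of the Pearson product-moment coefficient can be illustrated with a short sketch. Each case is reduced to a set of tokens, each token becomes a 0/1 indicator across cases, and the coefficient is computed on two indicator vectors. The case data and function names are hypothetical.

```python
import math


def indicator(cases, item):
    """1 if the item (drug, reaction, genotype, ...) is present in a case, else 0."""
    return [1 if item in case else 0 for case in cases]


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length numeric vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # an item present in all or no cases carries no signal
    return cov / math.sqrt(var_x * var_y)


# Hypothetical cases, each already mapped to dictionary tokens.
CASES = [
    {"drug_x", "headache"},
    {"drug_x", "headache"},
    {"drug_y", "rash"},
    {"drug_y"},
]
r = pearson(indicator(CASES, "drug_x"), indicator(CASES, "headache"))
```

  • The indicators are only comparable across cases if every case uses the same token for the same concept, which is why the consistent dictionary mapping is a precondition for this step.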
  • Various methods are available according to the present method for using the database to analyze relationships. The database provides a classification taxonomy structure for drug and genomic data. As taxonomies develop over time (typically based on new ways of grouping drugs or new ways of grouping SNPs and gene variants), the method provides parameters and hierarchies to enable richer correlations. As drugs or genes are grouped, they provide the basis for “coherent processing” in signal detection, i.e., they have the ability to group information to reinforce the “signal” that a particular drug or genotype is, in fact, a possible cause of a reaction. Similar structures are used for reactions and outcomes (e.g., hospitalization or death) as well as a drug classification template that allows mapping between drug and pathway, proteome, drug classification and others, in search of potential associations at the drug level; and a SNPs and gene classification template that allows a variety of high/medium/low level detail (e.g., broad SNPs region for certain reactions to specify SNPs related to Stevens-Johnson Syndrome).
  • The present method analyzes the multivariate relationships of drug safety, therefore making the database more usable to researchers and clinicians. Many aspects of the use of drug, reaction, and genetic/environmental information depend on having the ability to consolidate data across many different databases, with specific applications of standard categories and standard dictionaries allowing the consolidated data to provide meaning. These applications include: a database on demographics, drugs, reactions, and outcomes linked to standard dictionaries; a selector/profiler/filter that allows a researcher to hypothesize on groups of reactions or symptoms and to filter confounding elements (e.g., the known pathway effects) in order to enhance the ability to uncover unwanted effects, by analyzing similar drugs chemically, or pathways genetically; and a set of analytical engines that respectively analyze different aspects of the database (which is basically organized into historical or current “cases,” i.e., a patient with a certain drug or drugs and with certain reactions and outcomes). These engines include: proportional engines (PRR, OR) that look for anomalies against a background; correlators that look for associations; differential engines that compare different sets of data (e.g., one population or another) in search of variations; and neural network and other machine learning paradigms that apply heuristics to large databases in order to model and classify, e.g., by comparing weights of associations. The applications further include a set of data display, viewing, and visualization tools that help the researcher use his/her insight and pattern recognition capabilities to see, for example, genetic patterns or patterns of pathways that are involved for certain genotypes. The present method, then, uses both automated and semi-automated techniques to blend and create the best data mining and hypothesis testing.
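  • The proportional engines named above (PRR, OR) can be sketched from a two-by-two table of case counts. The text does not give formulas, so the conventional definitions of the proportional reporting ratio and the odds ratio are assumed here, and the counts are invented for illustration.

```python
import math


def prr(a, b, c, d):
    """Proportional reporting ratio.

    a: cases with the drug and the reaction
    b: cases with the drug, without the reaction
    c: background cases (other drugs) with the reaction
    d: background cases without the reaction
    """
    background_rate = c / (c + d)
    if background_rate == 0:
        return math.inf
    return (a / (a + b)) / background_rate


def odds_ratio(a, b, c, d):
    """Odds ratio for the same two-by-two table."""
    if b == 0 or c == 0:
        return math.inf
    return (a * d) / (b * c)


# Hypothetical counts: 30 of 100 drug cases vs 10 of 1000 background cases.
signal = (prr(30, 70, 10, 990), odds_ratio(30, 70, 10, 990))
```

  • Changing the background (all other drugs, drugs in the same class, a particular genotype group) changes `c` and `d`, which is what analysis “against a variety of backgrounds” amounts to in this formulation.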
  • Even though the efficacy of a given drug may only involve one or two metabolic pathways, and therefore body systems, adverse reactions may stem from one or several of the many different pathways inherent in the human organism. Thus, the present method combines detection as well as association algorithms to help identify the relationships among a drug, a given patient genotype, and the reactions and outcomes that could result.
  • Based upon the above, the present method accesses the system database, wherein a set of cases may be selected for analysis. Having selected the case-set of interest, the method then preferably proceeds to a profile, which preferably displays statistically-derived values that describe the behavior of the drug of interest based upon patient genotype. From the profile, the method can then preferably proceed to employ one or more filters that permit recalculation of the statistics by selecting among available variables. Once a set of cases is determined, for example, by the use of one or more filters, the cases can then preferably be submitted to one or more data mining engines. Such data mining engines may include a correlator engine, which provides information on analyses that have been previously completed—including date and time, task number, and generic drug. Each listing ends with a hyperlink that a user can employ to view the results of a search. A “delete” function is preferably provided to manage this list.
  • Step 400 of the method provides the data mining and extraction capabilities in which the results of statistical analysis in step 300 are compared against selected thresholds or criteria to extract the data of interest. Based on the above techniques, each dimension in the present invention will have a natural “filter” framework based on the number of parameter (dimension) variables and their number in the database. Within this framework, one or more of a combination of filters can be used to select “cases” and perform a restricted analysis of step 300. This interaction between steps 300 and 400 allows an individual researcher to use his/her hypothesis to adjust the analysis. For example, a set of adverse events may really be a reflection of non-efficacy of a certain drug, such as a reported adverse event of “depression” for a patient taking an anti-depressant. These reactions could be filtered out as part of step 300.
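  • Filters that can be controlled individually and in combination can be sketched as composable predicates over cases. The case fields, filter names, and sample data are hypothetical; the example excludes the “depression” reaction mentioned above along with an age restriction.

```python
def in_age_range(low, high):
    """Filter keeping cases whose age lies within [low, high]."""
    return lambda case: low <= case["age"] <= high


def without_reaction(term):
    """Filter dropping cases that report the given reaction token."""
    return lambda case: term not in case["reactions"]


def apply_filters(cases, filters):
    """Keep only the cases that pass every active filter."""
    return [c for c in cases if all(f(c) for f in filters)]


# Hypothetical case set for an anti-depressant.
CASES = [
    {"id": 1, "age": 34, "reactions": {"headache"}},
    {"id": 2, "age": 71, "reactions": {"rash"}},
    {"id": 3, "age": 45, "reactions": {"depression"}},
]
selected = apply_filters(CASES, [in_age_range(18, 65), without_reaction("depression")])
```

  • The selected subset would then be passed back to the statistical step, so each change of filters yields a recalculated profile for the restricted case set.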
  • Output from the data mining engines is preferably displayed using a graphical viewer, which permits the user to present the data in a variety of formats, including, but not limited to a sortable table, a sortable line listing, and a radar screen, thus, allowing rapid identification of signals and providing the user the ability to drill down to individual case details.
  • Alternatively, in another preferred embodiment, the method of the present invention permits choosing a profile, applying one or more filters, processing the set of cases using the data mining engines, and displaying the results for a user or viewer.
  • The present invention therefore includes a means to assemble data, create a database, and finally produce a summary and individual outputs based on the implied parameter “triplet” consisting of patient, drug, and adverse event data. The result is a structural database that combines a variety of drug characteristics, drug class and pathway characteristics, and population as well as individual genetics. The present invention provides a method for applying genomic-based adverse event data in a drug lifecycle. There are many research and medical situations where it is critical to associate individual patient's genotype or the general population's genetic distribution to Adverse Drug Reactions (“ADRs”). The ADR arena is complex because it involves many possible human metabolic pathways. Due to this complexity, and the extremely difficult tracing of hundreds or thousands of possible causes for thousands of reactions, the database of the present invention was developed based on clinical outcomes. In this respect, the system and method works backward from ADRs toward the statistical distribution of genotypes potentially associated with the event. The grouping of SNPs and gene variants provides a “cluster” that can be associated with a higher-than-expected reaction. For example, it may be that roughly 1% of patients experience headache with a particular drug, but 30% of those patients sharing a particular genotype have experienced headache with the drug. This may potentially link that genotype to headaches associated with the drug.
  • With efficient and effective analysis of adverse drug effects, pharmaceutical research and development professionals can learn more details of the reaction profiles of drugs and the at-risk populations who may be prescribed those drugs. This information would allow a more effective selection of lead compounds and would ultimately lead to development of drugs with reduced risk of adverse effects.
  • The system and method disclosed herein may be used during the research and pre-clinical trial stage of drug development for reviewing a set of drugs against the genetic background of a population in order to determine the ADR profile. The system and method may also be used during clinical trials, comparing the actual experience of ADRs with the database, for example, using proportional analysis (or any of the above techniques) to see if the trial population exhibits unexpected adverse events. The system and method may additionally be used during the diagnosis and prescription process at a point of care for checking a particular patient's genetic profile against a drug to assess or determine the probability of an adverse drug reaction for a drug (especially those considered serious by the healthcare provider) compared to other drugs. The system and method may further be used on a continuing basis for collecting post-market data on drugs by retrieving the genetic profile of patients exhibiting adverse events for a drug, and updating the present invention database with this information.
  • It should be appreciated that in all these genetic comparisons, there is potential to identify phenotypic markers (e.g. blue eyes, blond hair, fair skin—people of Nordic descent) with genotypes. In addition, environmental factors such as diet, work, or smoking habit may be related as well.
  • The system and method further utilize a drug utilization review (“DUR”), adding the genomic dimension to the patient-specific DUR. The DUR application uses the link among the three elements discussed above (drug, reactions, and genotype) to create methods for avoiding ADRs for a given patient. With the knowledge of the genomic drug safety database, there is an established association of certain SNPs and gene variants of an individual with a drug and its reactions. A person's genotype or phenotype may be established by having the person's genetic background tested, or may be understood from clinical evidence (for example, a person who does not metabolize and respond to fluoxetine can be presumed to poorly metabolize CYP2D6-related drugs). If the database shows certain ADRs with such a person, the DUR application of the database would assess the drug and rate its potential for adverse reactions for that individual. Other factors, such as environment, nutrition, foodstuffs, beverages, exposure to toxins, chemicals, supplements, herbal remedies, and the use of other drugs, would also continue to be considered, based on the best available evidence.
  • To formulate risk of drug safety, the system and method assign a risk parameter that combines drug, genetic, and outcome information of drugs on the drug label, to create a rapid means to improve drugs for an individual patient, or assess the risk of a specific drug for that patient.
  • The system may assign a PIN to an individual, and then provide a means by which to query the database on the relative risk for a drug, as described below. By using a PIN based system, there is provided the maintenance of privacy.
  • The method and system described herein may be embodied in a network environment. Such an implementation enables the DUR application in a web-based system, where a central (or other network accessible means) is used to allow access to the check of drugs from any location. A subscriber would be provided the information in the background. Others would enter the PIN and drug, and the resulting assessment of the risk would be returned. As a means for universal understanding of the results, a scale could be used (such as described below). This method would allow any subscriber or enrollee to a healthcare program using the method to input genomic data from any location, for example, a genomics lab that examines the patient's, or enrollee's, profile. The data on other aspects of the patient would similarly be entered, all via PIN. Then, at any time, the DUR application would allow network access to the risk assessment.
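  • The PIN-based flow described above can be sketched as follows. This is a minimal in-memory illustration, not the disclosed implementation: the class name, the method names, the variant and drug labels, the risk values, and the rule of reporting the highest matching risk are all assumptions.

```python
class ProfileStore:
    """In-memory stand-in for the central, network-accessible database (hypothetical)."""

    def __init__(self):
        # Profiles are keyed only by PIN; no name or other identifier is stored.
        self._profiles = {}

    def enter_genomic_data(self, pin, variants):
        """Add genomic variants for a PIN, e.g. as entered by a genomics lab."""
        self._profiles.setdefault(pin, set()).update(variants)

    def check_drug(self, pin, drug, risk_table):
        """Return the highest risk listed for this drug across the PIN's variants.

        risk_table maps (drug, variant) pairs to a risk on a 0 to 1.0 scale.
        Taking the maximum is an assumed aggregation rule.
        """
        if pin not in self._profiles:
            raise KeyError("unknown PIN")
        variants = self._profiles[pin]
        return max((risk_table.get((drug, v), 0.0) for v in variants), default=0.0)


# Illustrative labels and values only; they carry no clinical meaning.
store = ProfileStore()
store.enter_genomic_data("1234", {"VARIANT_A"})
risk_table = {("drug_x", "VARIANT_A"): 0.6, ("drug_x", "VARIANT_B"): 0.9}
print(store.check_drug("1234", "drug_x", risk_table))
```

Because the query takes only a PIN and a drug name, a caller at any location can obtain the assessment without handling the identity behind the profile.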
  • The present method may use a parameter or coefficient specifically designed to measure risk. Although many weightings may be used, this parameter would provide, on a suitable scale (e.g., 0 to 1.0 or other normalized scale), a weighted assessment of genetic risk for the patient based on the probability that certain drug reactions are likely for the patient genotype. The scale takes into account the closeness of the drug to the drug in the database (it could only share chemical class) and the closeness of the genetic profile of the patient to the average genetic profile of the database, again using a closeness fraction based on the number of standard deviations from the mean fit. This scale would then provide a nominal risk, with an error range, for the individual. Any actual scale that uses a linear, or logarithmic weighting (or other), that accounts for a closeness adjustment for the drug and for the genotype, will then be used to report the expected ADRs. Note, in the preferred embodiment, the most serious reaction would be used and weighted more heavily (using, for example, the FDA CDER Designated Medical Event list).
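  • One possible reading of the risk parameter described above is sketched below. The source leaves the weighting open (linear, logarithmic, or other), so every formula here is an assumption: the Gaussian-shaped mapping from standard deviations to a closeness fraction, the use of the maximum weighted reaction so that the most serious reaction dominates, and the way closeness scales the nominal risk and sets the error range. All names are hypothetical.

```python
import math


def closeness_from_sd(num_sd):
    """Map a distance in standard deviations to a closeness fraction in (0, 1].

    The Gaussian-shaped decay is an assumed choice, not specified by the source.
    """
    return math.exp(-0.5 * num_sd ** 2)


def risk_score(reactions, drug_closeness, genotype_sd):
    """Return (nominal, low, high) on a 0 to 1.0 scale.

    reactions: list of (probability, seriousness_weight) pairs, each in [0, 1]
    drug_closeness: 1.0 for the same drug, lower for a drug sharing only a class
    genotype_sd: distance of the patient profile from the database mean, in SDs
    """
    if not 0.0 <= drug_closeness <= 1.0:
        raise ValueError("drug_closeness must be between 0 and 1")
    # The most serious, most probable reaction drives the score.
    base = max((p * w for p, w in reactions), default=0.0)
    confidence = drug_closeness * closeness_from_sd(genotype_sd)
    nominal = base * confidence
    # Lower closeness means less transferable evidence, hence a wider range.
    margin = base * (1.0 - confidence)
    return nominal, max(0.0, nominal - margin), min(1.0, nominal + margin)


# Illustrative inputs only.
nominal, low, high = risk_score([(0.2, 1.0), (0.5, 0.3)], drug_closeness=0.8, genotype_sd=0.0)
print(round(nominal, 2), round(low, 2), round(high, 2))
```

In practice the seriousness weights could be drawn from a list such as the FDA CDER Designated Medical Event list mentioned above; the weights in this sketch are placeholders.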
  • The scale would be refined over time as better tests and more population statistics are added. Thus, both the database and the precision of the scale will improve over time. The use of a numerical range that healthcare providers become accustomed to will allow ease of interpretation of the results.
  • The method described herein provides for personalized medicine application in drug safety. The system and method draw a relationship between genetic type and predispositions to adverse reactions for certain drugs or drug types. Given a sufficient set of specific genetic information on an individual, there exists the potential to create a “profile” for that individual.
  • Unlike the relationship of genetic information to diseases, where only a few genes or SNPs may be involved, the realm of adverse events involves numerous other areas of an individual's metabolism. In fact, it goes beyond that to include the influence of environmental factors, such as being a coffee drinker, smoker, or traveler, that change an individual's environment and the medical situation into which drugs are prescribed. The system and method allow, for example, an individual's profile to be privately stored and accessed via a PIN. Then, given the drug a physician is considering, the patient's background can be checked for potential risk.
  • As disclosed herein above, various embodiments of the system and method provide several benefits. These benefits include warning a patient or physician that certain drugs have produced reactions associated with the patient's genotype; a broad understanding of gene and SNP relationships to pathways, proteome and drugs in those pathways; a statistical understanding of the genetic behavior of drug classes; a structural database that allows a drill-down on genetic differences for more specific reactions; a database that makes ADR profile associations with proteome possible; a database that increases the potential to uncover the multiple ways certain ADRs could develop; a method for scoring the genomic based risk of certain adverse events for drugs or drug classes; a method for preserving patient privacy while permitting clinical labs, physicians, etc., to access the information for a particular patient; adjustment of the genetic risk by correlation with environmental factors; a method for adding details to the genomic database as new information is made available; a broad, statistical understanding of population genetic impact on the percentage risk of certain adverse events with certain drugs or drug classes; and a method for adding data from sources that may have been based on different vocabularies.
  • Although the present invention has been described and illustrated in detail, it is to be clearly understood that the same is by way of illustration and example only, and is not to be taken as a limitation. The spirit and scope of the present invention are to be limited only by the terms of any claims presented hereinafter.

Claims (16)

1. A method for assessing and analyzing one or more drugs, adverse effects and associated risks, and patient demographics resulting from the use of at least one drug of interest, comprising the steps of:
(a) collecting data from a plurality of sources comprising drug information, adverse effects relating to drugs and patient demographics;
(b) generating a relational database for relating drug information, adverse effects and patient demographics;
(c) selecting at least one case for analysis, the at least one case describing the behavior between at least one drug of interest and a patient genotype;
(d) profiling statistically derived values from multiple cases related to the safety of the at least one drug, wherein at least one filter is employed for deriving the values;
(e) submitting the values to at least one data mining engine; and
(f) displaying the analytic results from the data mining engine through an output device.
2. The method of claim 1, wherein analyzing using a data mining engine comprises correlating or proportionally comparing any two data types, wherein the data types are drug, adverse effects, or patient demographics.
3. The method of claim 1, wherein the data is cleaned, wherein cleaning comprises removal of noise, spell checking, and removal of redundant entries.
4. The method of claim 1, wherein the relational database comprises stored data which is mapped to tokens, wherein a token comprises a standardized search term.
5. The method of claim 4 wherein a token is selected from the group of reference sources consisting of MEDRA, WHO Drug Directories, FDA Orange Book, COSTART, NDCD, GPRD and WHOART.
6. The method of claim 1, wherein steps (a)-(f) are performed in a network environment.
7. The method of claim 1, wherein the demographics are age, sex, weight, diet, reactions, environment, illness, dosage, genotype, outcome, report source or concomitant drugs.
8. The method of claim 1, wherein the data mining engine is a correlator, a proportional analysis engine, or a comparator.
9. A system for assessing and analyzing one or more drugs, adverse effects and associated risks, and patient demographics resulting from the use of at least one drug of interest, comprising:
(a) a selector for selecting one or more cases for analysis, the cases describing the behavior between the at least one drug of interest and a patient genotype;
(b) a profiler profiling statistically derived values from multiple cases related to the safety of the at least one drug, wherein at least one filter is employed for deriving said values;
(c) at least one data mining engine for submitting the values to; and
(d) an output device for displaying the analytic results from the data mining engine.
10. The method of claim 9, wherein analyzing using a data mining engine comprises correlating or proportionally comparing any two data types, wherein the data types are drug, adverse effects, or patient demographics.
11. The method of claim 9, wherein the data is cleaned, wherein cleaning comprises removal of noise, spell checking, and removal of redundant entries.
12. The method of claim 9, wherein the relational database comprises stored data which is mapped to tokens, wherein a token comprises a standardized search term.
13. The method of claim 9 wherein a token is selected from the group of reference sources consisting of MEDRA, WHO Drug Directories, FDA Orange Book, COSTART, NDCD, GPRD and WHOART.
14. The method of claim 9, wherein steps (a)-(d) are performed in a network environment.
15. The method of claim 9, wherein the demographics are age, sex, weight, diet, reactions, environment, illness, dosage, genotype, outcome, report source or concomitant drugs.
16. The method of claim 9, wherein the data mining engine is a correlator, a proportional analysis engine, or a comparator.
US12/271,224 2001-08-29 2008-11-14 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data Abandoned US20090076847A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US12/271,224 US20090076847A1 (en) 2001-08-29 2008-11-14 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US15/059,997 US20170061080A1 (en) 2001-08-29 2016-03-03 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US31552501P 2001-08-29 2001-08-29
US10/229,119 US7461006B2 (en) 2001-08-29 2002-08-28 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US12/271,224 US20090076847A1 (en) 2001-08-29 2008-11-14 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/229,119 Continuation US7461006B2 (en) 2001-08-29 2002-08-28 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/059,997 Continuation US20170061080A1 (en) 2001-08-29 2016-03-03 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data

Publications (1)

Publication Number Publication Date
US20090076847A1 true US20090076847A1 (en) 2009-03-19

Family

ID=23224826

Family Applications (3)

Application Number Title Priority Date Filing Date
US10/229,119 Active 2025-06-02 US7461006B2 (en) 2001-08-29 2002-08-28 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US12/271,224 Abandoned US20090076847A1 (en) 2001-08-29 2008-11-14 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US15/059,997 Abandoned US20170061080A1 (en) 2001-08-29 2016-03-03 Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data

Country Status (3)

Country Link
US (3) US7461006B2 (en)
AU (1) AU2002329901A1 (en)
WO (1) WO2003021389A2 (en)

US6209004B1 (en) * 1995-09-01 2001-03-27 Taylor Microtechnology Inc. Method and system for generating and distributing document sets using a relational database
US6219674B1 (en) * 1999-11-24 2001-04-17 Classen Immunotherapies, Inc. System for creating and managing proprietary product data
US6226564B1 (en) * 1996-11-01 2001-05-01 John C. Stuart Method and apparatus for dispensing drugs to prevent inadvertent administration of incorrect drug to patient
US6246975B1 (en) * 1996-10-30 2001-06-12 American Board Of Family Practice, Inc. Computer architecture and process of patient generation, evolution, and simulation for computer based testing system
US6253169B1 (en) * 1998-05-28 2001-06-26 International Business Machines Corporation Method for improvement accuracy of decision tree based text categorization
US6263329B1 (en) * 1997-07-25 2001-07-17 Claritech Method and apparatus for cross-linguistic database retrieval
US6273854B1 (en) * 1998-05-05 2001-08-14 Body Bio Corporation Medical diagnostic analysis method and system
US20010049673A1 (en) * 2000-03-24 2001-12-06 Bridge Medical, Inc. Method and apparatus for displaying medication information
US6331138B1 (en) * 1997-05-27 2001-12-18 Holland Industriele Diamantwerken B.V. Grinding machine
US20020010595A1 (en) * 1998-02-27 2002-01-24 Kapp Thomas L. Web-based medication management system
US20020012921A1 (en) * 2000-01-21 2002-01-31 Stanton Vincent P. Identification of genetic components of drug response
US20020040282A1 (en) * 2000-03-22 2002-04-04 Bailey Thomas C. Drug monitoring and alerting system
US20020073042A1 (en) * 2000-12-07 2002-06-13 Maritzen L. Michael Method and apparatus for secure wireless interoperability and communication between access devices
US20020082869A1 (en) * 2000-12-27 2002-06-27 Gateway, Inc. Method and system for providing and updating customized health care information based on an individual's genome
US6421665B1 (en) * 1998-10-02 2002-07-16 Ncr Corporation SQL-based data reduction techniques for delivering data to analytic tools
US20020120350A1 (en) * 2001-02-28 2002-08-29 Klass David B. Method and system for identifying and anticipating adverse drug events
US6446081B1 (en) * 1997-12-17 2002-09-03 British Telecommunications Public Limited Company Data input and retrieval apparatus
US20020129031A1 (en) * 2001-01-05 2002-09-12 Lau Lee Min Managing relationships between unique concepts in a database
US20020142815A1 (en) * 2000-12-08 2002-10-03 Brant Candelore Method for creating a user profile through game play
US6466923B1 (en) * 1997-05-12 2002-10-15 Chroma Graphics, Inc. Method and apparatus for biomathematical pattern recognition
US20020169771A1 (en) * 2001-05-09 2002-11-14 Melmon Kenneth L. System & method for facilitating knowledge management
US20020183965A1 (en) * 2001-05-02 2002-12-05 Gogolak Victor V. Method for analyzing drug adverse effects employing multivariate statistical analysis
US20020187483A1 (en) * 2001-04-20 2002-12-12 Cerner Corporation Computer system for providing information about the risk of an atypical clinical event based upon genetic information
US6507829B1 (en) * 1999-06-18 2003-01-14 Ppd Development, Lp Textual data classification method and apparatus
US20030040662A1 (en) * 2001-08-22 2003-02-27 Philip Keys System, method and computer program for monitoring and managing medications
US6578003B1 (en) * 1997-07-31 2003-06-10 Schering Corporation Method and apparatus for improving patient compliance with prescriptions
US20030124514A1 (en) * 2001-08-08 2003-07-03 Vingerhoets Johan Hendrika Jozef Methods of assessing HIV integrase inhibitor therapy
US6658396B1 (en) * 1999-11-29 2003-12-02 Tang Sharon S Neural network drug dosage estimation
US20040010511A1 (en) * 2002-07-11 2004-01-15 Gogolak Victor V. Method and system for drug utilization review
US20040015372A1 (en) * 2000-10-20 2004-01-22 Harris Bergman Method and system for processing and aggregating medical information for comparative and statistical analysis
US6684221B1 (en) * 1999-05-06 2004-01-27 Oracle International Corporation Uniform hierarchical information classification and mapping system
US20040030503A1 (en) * 1999-11-29 2004-02-12 Scott Arouh Neural-network-based identification, and application, of genomic information practically relevant to diverse biological and sociological problems, including susceptibility to disease
US6697783B1 (en) * 1997-09-30 2004-02-24 Medco Health Solutions, Inc. Computer implemented medical integrated decision support system
US6778994B2 (en) * 2001-05-02 2004-08-17 Victor Gogolak Pharmacovigilance database
US6789091B2 (en) * 2001-05-02 2004-09-07 Victor Gogolak Method and system for web-based analysis of drug adverse effects
US6876966B1 (en) * 2000-10-16 2005-04-05 Microsoft Corporation Pattern recognition training method and apparatus using inserted noise followed by noise reduction
US6950755B2 (en) * 2001-07-02 2005-09-27 City Of Hope Genotype pattern recognition and classification
US7461006B2 (en) * 2001-08-29 2008-12-02 Victor Gogolak Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US7542961B2 (en) * 2001-05-02 2009-06-02 Victor Gogolak Method and system for analyzing drug adverse effects
US20090158211A1 (en) * 2001-05-02 2009-06-18 Gogolak Victor V Method for graphically depicting drug adverse effect risks

Patent Citations (85)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4384329A (en) * 1980-12-19 1983-05-17 International Business Machines Corporation Retrieval of related linked linguistic expressions including synonyms and antonyms
US5642731A (en) * 1990-01-17 1997-07-01 Informedix, Inc. Method of and apparatus for monitoring the management of disease
US5371807A (en) * 1992-03-20 1994-12-06 Digital Equipment Corporation Method and apparatus for text classification
US5299121A (en) * 1992-06-04 1994-03-29 Medscreen, Inc. Non-prescription drug medication screening system
US5583758A (en) * 1992-06-22 1996-12-10 Health Risk Management, Inc. Health care management system for managing medical treatments and comparing user-proposed and recommended resources required for treatment
US5502576A (en) * 1992-08-24 1996-03-26 Ramsay International Corporation Method and apparatus for the transmission, storage, and retrieval of documents in an electronic domain
US5692171A (en) * 1992-11-20 1997-11-25 Bull S.A. Method of extracting statistical profiles, and use of the statistics created by the method
US5337919A (en) * 1993-02-11 1994-08-16 Dispensing Technologies, Inc. Automatic dispensing system for prescriptions and the like
US5594637A (en) * 1993-05-26 1997-01-14 Base Ten Systems, Inc. System and method for assessing medical risk
US5592668A (en) * 1993-08-25 1997-01-07 Asymetrix Corporation Method and apparatus for specifying a query to an information system using natural language-like constructs
US5495604A (en) * 1993-08-25 1996-02-27 Asymetrix Corporation Method and apparatus for the modeling and query of database structures using natural language-like constructs
US6317719B1 (en) * 1993-12-13 2001-11-13 Cerner Multum, Inc. Providing patient-specific drug information
US5833599A (en) * 1993-12-13 1998-11-10 Multum Information Services Providing patient-specific drug information
US6054268A (en) * 1994-06-17 2000-04-25 Perlin; Mark W. Method and system for genotyping
US5737539A (en) * 1994-10-28 1998-04-07 Advanced Health Med-E-Systems Corp. Prescription creation system
US5845255A (en) * 1994-10-28 1998-12-01 Advanced Health Med-E-Systems Corporation Prescription management system
US5758095A (en) * 1995-02-24 1998-05-26 Albaum; David Interactive medication ordering system
US5911132A (en) * 1995-04-26 1999-06-08 Lucent Technologies Inc. Method using central epidemiological database
US5664109A (en) * 1995-06-07 1997-09-02 E-Systems, Inc. Method for extracting pre-defined data items from medical service records generated by health care providers
US5659731A (en) * 1995-06-19 1997-08-19 Dun & Bradstreet, Inc. Method for rating a match for a given entity found in a list of entities
US6076083A (en) * 1995-08-20 2000-06-13 Baker; Michelle Diagnostic system utilizing a Bayesian network model having link weights updated experimentally
US5634053A (en) * 1995-08-29 1997-05-27 Hughes Aircraft Company Federated information management (FIM) system and method for providing data site filtering and translation for heterogeneous databases
US6209004B1 (en) * 1995-09-01 2001-03-27 Taylor Microtechnology Inc. Method and system for generating and distributing document sets using a relational database
US6112182A (en) * 1996-01-16 2000-08-29 Healthcare Computer Corporation Method and apparatus for integrated management of pharmaceutical and healthcare services
US6076088A (en) * 1996-02-09 2000-06-13 Paik; Woojin Information extraction system and method using concept relation concept (CRC) triples
US5804803A (en) * 1996-04-02 1998-09-08 International Business Machines Corporation Mechanism for retrieving information using data encoded on an object
US6120443A (en) * 1996-04-09 2000-09-19 Cohen-Laroque; Emmanuel-S. Device for determining the depth of anesthesia
US5978804A (en) * 1996-04-11 1999-11-02 Dietzman; Gregg R. Natural products information system
US6108635A (en) * 1996-05-22 2000-08-22 Interleukin Genetics, Inc. Integrated disease information system
US5864789A (en) * 1996-06-24 1999-01-26 Apple Computer, Inc. System and method for creating pattern-recognizing computer structures from example text
US5924074A (en) * 1996-09-27 1999-07-13 Azron Incorporated Electronic medical records system
US6246975B1 (en) * 1996-10-30 2001-06-12 American Board Of Family Practice, Inc. Computer architecture and process of patient generation, evolution, and simulation for computer based testing system
US6226564B1 (en) * 1996-11-01 2001-05-01 John C. Stuart Method and apparatus for dispensing drugs to prevent inadvertent administration of incorrect drug to patient
US6151581A (en) * 1996-12-17 2000-11-21 Pulsegroup Inc. System for and method of collecting and populating a database with physician/patient data for processing to improve practice quality and healthcare delivery
US5860917A (en) * 1997-01-15 1999-01-19 Chiron Corporation Method and apparatus for predicting therapeutic outcomes
US6098062A (en) * 1997-01-17 2000-08-01 Janssen; Terry Argument structure hierarchy system and method for facilitating analysis and decision-making processes
US6082776A (en) * 1997-05-07 2000-07-04 Feinberg; Lawrence E. Storing personal medical information
US6466923B1 (en) * 1997-05-12 2002-10-15 Chroma Graphics, Inc. Method and apparatus for biomathematical pattern recognition
US6331138B1 (en) * 1997-05-27 2001-12-18 Holland Industriele Diamantwerken B.V. Grinding machine
US6137911A (en) * 1997-06-16 2000-10-24 The Dialog Corporation Plc Test classification system and method
US5991729A (en) * 1997-06-28 1999-11-23 Barry; James T. Methods for generating patient-specific medical reports
US6263329B1 (en) * 1997-07-25 2001-07-17 Claritech Method and apparatus for cross-linguistic database retrieval
US6578003B1 (en) * 1997-07-31 2003-06-10 Schering Corporation Method and apparatus for improving patient compliance with prescriptions
US6000828A (en) * 1997-08-22 1999-12-14 Power Med Incorporated Method of improving drug treatment
US6697783B1 (en) * 1997-09-30 2004-02-24 Medco Health Solutions, Inc. Computer implemented medical integrated decision support system
US6446081B1 (en) * 1997-12-17 2002-09-03 British Telecommunications Public Limited Company Data input and retrieval apparatus
US6055538A (en) * 1997-12-22 2000-04-25 Hewlett Packard Company Methods and system for using web browser to search large collections of documents
US20020010595A1 (en) * 1998-02-27 2002-01-24 Kapp Thomas L. Web-based medication management system
US6014631A (en) * 1998-04-02 2000-01-11 Merck-Medco Managed Care, Llc Computer implemented patient medication review system and process for the managed care, health care and/or pharmacy industry
US6188988B1 (en) * 1998-04-03 2001-02-13 Triangle Pharmaceuticals, Inc. Systems, methods and computer program products for guiding the selection of therapeutic treatment regimens
US6092072A (en) * 1998-04-07 2000-07-18 Lucent Technologies, Inc. Programmed medium for clustering large databases
US6273854B1 (en) * 1998-05-05 2001-08-14 Body Bio Corporation Medical diagnostic analysis method and system
US6253169B1 (en) * 1998-05-28 2001-06-26 International Business Machines Corporation Method for improvement accuracy of decision tree based text categorization
US6421665B1 (en) * 1998-10-02 2002-07-16 Ncr Corporation SQL-based data reduction techniques for delivering data to analytic tools
US6067524A (en) * 1999-01-07 2000-05-23 Catalina Marketing International, Inc. Method and system for automatically generating advisory information for pharmacy patients along with normally transmitted data
US6128620A (en) * 1999-02-02 2000-10-03 Lemed Inc Medical database for litigation
US6684221B1 (en) * 1999-05-06 2004-01-27 Oracle International Corporation Uniform hierarchical information classification and mapping system
US6507829B1 (en) * 1999-06-18 2003-01-14 Ppd Development, Lp Textual data classification method and apparatus
US6219674B1 (en) * 1999-11-24 2001-04-17 Classen Immunotherapies, Inc. System for creating and managing proprietary product data
US6658396B1 (en) * 1999-11-29 2003-12-02 Tang Sharon S Neural network drug dosage estimation
US20040030503A1 (en) * 1999-11-29 2004-02-12 Scott Arouh Neural-network-based identification, and application, of genomic information practically relevant to diverse biological and sociological problems, including susceptibility to disease
US20020012921A1 (en) * 2000-01-21 2002-01-31 Stanton Vincent P. Identification of genetic components of drug response
US20020040282A1 (en) * 2000-03-22 2002-04-04 Bailey Thomas C. Drug monitoring and alerting system
US6542902B2 (en) * 2000-03-24 2003-04-01 Bridge Medical, Inc. Method and apparatus for displaying medication information
US20010049673A1 (en) * 2000-03-24 2001-12-06 Bridge Medical, Inc. Method and apparatus for displaying medication information
US6876966B1 (en) * 2000-10-16 2005-04-05 Microsoft Corporation Pattern recognition training method and apparatus using inserted noise followed by noise reduction
US20040015372A1 (en) * 2000-10-20 2004-01-22 Harris Bergman Method and system for processing and aggregating medical information for comparative and statistical analysis
US20020073042A1 (en) * 2000-12-07 2002-06-13 Maritzen L. Michael Method and apparatus for secure wireless interoperability and communication between access devices
US20020142815A1 (en) * 2000-12-08 2002-10-03 Brant Candelore Method for creating a user profile through game play
US20020082869A1 (en) * 2000-12-27 2002-06-27 Gateway, Inc. Method and system for providing and updating customized health care information based on an individual's genome
US20020129031A1 (en) * 2001-01-05 2002-09-12 Lau Lee Min Managing relationships between unique concepts in a database
US20020120350A1 (en) * 2001-02-28 2002-08-29 Klass David B. Method and system for identifying and anticipating adverse drug events
US20020187483A1 (en) * 2001-04-20 2002-12-12 Cerner Corporation Computer system for providing information about the risk of an atypical clinical event based upon genetic information
US20020183965A1 (en) * 2001-05-02 2002-12-05 Gogolak Victor V. Method for analyzing drug adverse effects employing multivariate statistical analysis
US6778994B2 (en) * 2001-05-02 2004-08-17 Victor Gogolak Pharmacovigilance database
US6789091B2 (en) * 2001-05-02 2004-09-07 Victor Gogolak Method and system for web-based analysis of drug adverse effects
US7539684B2 (en) * 2001-05-02 2009-05-26 Qed Solutions, Inc. Processing drug data
US7542961B2 (en) * 2001-05-02 2009-06-02 Victor Gogolak Method and system for analyzing drug adverse effects
US20090158211A1 (en) * 2001-05-02 2009-06-18 Gogolak Victor V Method for graphically depicting drug adverse effect risks
US20020169771A1 (en) * 2001-05-09 2002-11-14 Melmon Kenneth L. System & method for facilitating knowledge management
US6950755B2 (en) * 2001-07-02 2005-09-27 City Of Hope Genotype pattern recognition and classification
US20030124514A1 (en) * 2001-08-08 2003-07-03 Vingerhoets Johan Hendrika Jozef Methods of assessing HIV integrase inhibitor therapy
US20030040662A1 (en) * 2001-08-22 2003-02-27 Philip Keys System, method and computer program for monitoring and managing medications
US7461006B2 (en) * 2001-08-29 2008-12-02 Victor Gogolak Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US20040010511A1 (en) * 2002-07-11 2004-01-15 Gogolak Victor V. Method and system for drug utilization review

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Google patents search result, 11/24/2014 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100138161A1 (en) * 2001-05-02 2010-06-03 Victor Gogolak Method and system for analyzing drug adverse effects
US7925612B2 (en) 2001-05-02 2011-04-12 Victor Gogolak Method for graphically depicting drug adverse effect risks
US20090158211A1 (en) * 2001-05-02 2009-06-18 Gogolak Victor V Method for graphically depicting drug adverse effect risks
US7979373B2 (en) 2001-05-02 2011-07-12 Druglogic, Inc. Method and system for analyzing drug adverse effects
US8131769B2 (en) 2001-05-02 2012-03-06 Druglogic, Inc. Processing drug data
US20100063830A1 (en) * 2008-09-10 2010-03-11 Expanse Networks, Inc. Masked Data Provider Selection
US20120124051A1 (en) * 2009-07-29 2012-05-17 Wilfred Wan Kei Lin Ontological information retrieval system
US10089391B2 (en) * 2009-07-29 2018-10-02 Herbminers Informatics Limited Ontological information retrieval system
US20110161099A1 (en) * 2009-12-28 2011-06-30 Igor Igorevich Stukanov Low-cost method for reducing rates of side effects from using drugs, healing substances and medical procedures
US20120078601A1 (en) * 2010-09-27 2012-03-29 General Electric Company Drug treatment plans derived from holistic analysis
US20120143776A1 (en) * 2010-12-07 2012-06-07 Oracle International Corporation Pharmacovigilance alert tool
US8799022B1 (en) * 2011-05-04 2014-08-05 Strat ID GIC, Inc. Method and network for secure transactions
US20140350964A1 (en) * 2013-05-22 2014-11-27 Quantros, Inc. Probabilistic event classification systems and methods
US10269450B2 (en) * 2013-05-22 2019-04-23 Quantros, Inc. Probabilistic event classification systems and methods
CN103336914A (en) * 2013-05-31 2013-10-02 中国人民解放军国防科学技术大学 Method and device for extracting meta biomarkers
US9607266B2 (en) 2013-07-23 2017-03-28 Tata Consultancy Services Limited Systems and methods for signal detection in pharmacovigilance using distributed processing, analysis and representing of the signals in multiple forms
US20150186334A1 (en) * 2013-12-30 2015-07-02 Nice-Systems Ltd. System and method for automated generation of meaningful data insights
US9928516B2 (en) * 2013-12-30 2018-03-27 Nice Ltd. System and method for automated analysis of data to populate natural language description of data relationships

Also Published As

Publication number Publication date
US20170061080A1 (en) 2017-03-02
WO2003021389A2 (en) 2003-03-13
US7461006B2 (en) 2008-12-02
AU2002329901A1 (en) 2003-03-18
WO2003021389A3 (en) 2003-05-01
US20030046110A1 (en) 2003-03-06

Similar Documents

Publication Publication Date Title
US7461006B2 (en) Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US6778994B2 (en) Pharmacovigilance database
US11735323B2 (en) Computer implemented identification of genetic similarity
US7395222B1 (en) Method and system for identifying expertise
US6789091B2 (en) Method and system for web-based analysis of drug adverse effects
US7117198B1 (en) Method of researching and analyzing information contained in a database
US9129084B2 (en) Method and system for analyzing drug adverse effects
US20020183965A1 (en) Method for analyzing drug adverse effects employing multivariate statistical analysis
Moradi CIBS: A biomedical text summarizer using topic-based sentence clustering
US7925612B2 (en) Method for graphically depicting drug adverse effect risks
Francis Taming text: An introduction to text mining
Saiod et al. The Impact of Deep Learning on the Semantic Machine Learning Representation
Sung Prescription Drugs: From Paper to Database with Application to Air Pollution-Related Public Health Risk
Weinberg Topic Modeling for Prefocused Search of Open Health Datasets

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION
AS Assignment Owner name: DRUGLOGIC, INC., VIRGINIA. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:GOGOLAK, VICTOR;REEL/FRAME:046257/0466. Effective date: 20131104