Genetics

Overview

GNU Health medical genetics functionality is provided by the following packages:

  • health_genetics

  • health_genetics_uniprot

GNU Health genetic functionality combines the molecular and environmental bases of health and disease unique for each individual, foundation for precision medicine.

The GNU Health genetic functionality can be used both in the clinical and research contexts.

GNU Health incorporates extensive information from Hugo Gene Nomenclature Committee (HGNC) The National Center for Biotechnology Information NCBI and from the UniProt Consortium gene natural variants and protein-related human health conditions.

The screenshots provided on this section aim to facilitate the usage of the genetic packages, as basic workflows and data entry. We will be using two of our (fictional) family members Ana Betz and her son, Matt. The CFTR (Cystic Fibrosis transmembrane conductance regulator) will be used to illustrate different objects and contexts of the GH genetic functionality.

Core genetics package

GNU Health genetic related packages integrate family history and individual relevant genetic information.

The core genetics package provides main functionality, datamodel and relationships with other objects in the GNUHealth data dictionary.

The main models included in this package are:

  • Genes

  • Protein related diseases

  • Family History

  • Personal genetic information

  • Natural variants

  • Variants phenotypes.

The Genes model which its atributes will be detailed next wil hold the HGCN dataset, which is also included on the health_genetics package.

The Gene Model

The main attributes are:

  • HGNC Id: HUGO Gene Nomenclature Committee ID for the gene.

  • Official Symbol: Gene symbol dictated by HGNC (eg .. CFTR)

  • Long Name: Official long name (eg CF transmembrane conductance regulator)

  • Aliases: Alternative symbols (eg .. MRP7|ABC35|TNR-CFTR|dJ760C5.1|CFTR/MRP)

  • Locus: Location of the gene in the chromosome

  • Type: Type of gene (eg, protein-coding, ncRNA, pseudogene..)

  • Variants: List of natural variants associated to the gene (from UniProt)

  • Encoding protein (ID and URI)

There are other fields such as RefSeq, Ensembl and OMIM IDs that help referencing related resources.

Natural Variant

  • Gene

  • Change in amino acid

  • Phenotype

  • Clinical significance

    • LB/B: Likely benign or benign

    • LP/P: Likely pathogenic or pathogenic

    • US: Unknown Significance

Variant Phenotype(s)

Accessing the genes dataset

You can access the genes dataset via menu or via GH Command line.

Menu: Health -> Configuration -> Genetics -> Genes

Command Line : gdis

Filtering / refining the search criteria

As in any model in GNU Health, you can restrict the search combining search arguments.

Viewing details of the gene

Once you have selected the gene in particular from the list view, you can access the form / detailed view by double clicking on it. In our example of the CFTR gene:

Health Genetics UniProt Package

While the genetics core package defines the genomics and medical genetics models, the health_genetics_uniprot loads the meaningful data.

The datasets included in this package are:

  • Human genes natural variants

  • Variant phenotypes

  • Genetic and Protein Related Disorders

Human Genes Natural Variants

GNU Health HIS combines HGNC with the UniProt human natural variants datasets. These two excellent sources provide acurate information and links to uptodate online references to the particular variant related to the patient.

Previously we saw that when selecting a particular gene we can see the all its natural variants. In the tree view, you will see a number. In the CFTR gene, the shows that contains 191 natural variants.

Variant Phenotypes

GNU Health also provides a way to study the phenotypes associated to the gene natural variants. We can access the variant phenotypes via command line or menu:

  • Command line: “VARP”

  • Menu : Health -> Configuration -> Genetics -> Variant Phenotypes

We can search for a particular phenotype or disorder, a change in the aminoacid, the gene or the feature ID of the variant (FTid).

In our example of the CFTR gene, we highlight the fact of a particular variant responsible of more than one phenotype. In the example, variant with UniProt FTid VAR_000119 (aa change p.Asp110His) has been documented to be involved in two phenotypes:

  • DI-01389: Congenital bilateral absence of the vas deferens

  • DI-01466: Cystic fibrosis

Personal Genetic Information

GNU Health stores the details on each of the genes that are relevant for the individual. This information can be accessed from the “Genetics” tab on the patient main form.

  • Gene involved: Information about the gene. The gene code, official long name, chromosome and locus. It also provides information about the protein encoded and the link to UniProt protein code.

  • Natural Variant: The specific amino acid change.

  • Phenotype: This field is used if the person shows clinical signs associated to the expression of that gene variant.

  • Onset : Age (expressed in years) of first clinical signs.

  • Notes : Comments contextualized to the individual.

If we rather want to see the genetic information at population level, then we can use the menu:

Health -> Personal Genetic Information

Family History

GNU Health shows extra info related to diseases including the name, code and main disease category, genetic data and a free text box for adding relevant extra info.

Genetic testing and Medical history

So far we’ve seen how to register the genetic information and family history of the individual, and we know will see how to register a particular phenotype or disorder.

A positive result of a particular genetic test does not necessarily equates to disease, as we see in many late-onset disorders.

In our family example, Ana Betz is a (healthy) carrier for a pathogenic variant in the CFTR gene, involved in cystic fibrosis.

Her son, Matt, carries two pathogenic alleles of the CFTR and he shows clinical signs compatible with Cystic Fibrosis.

When both the molecular testing and the clinical signs are present, we can then encode the corresponding ICD11 (or whatever coding standard) corresponding with the disease.

Ana Betz has a child (Matt). Cystic fibrosis is has a autosomal recessive inheritance pattern, that is, two pathogenic alleles are required to develop different signs of cystic fibrosis.

Matt has inherited two dysfunctional alleles, one from the mother (var_000179) and one from the father (var_000267). Because both alleles have been identified by molecular testing, and he has shown clinical signs compatible with the cystic fibrosis, the corresponding ICD11 (CA25) has been incorporated to his medical history.

GNU Health Commands for genetics

  • gen: Personal Genetic Information

  • gdis: Genes

  • var: Natural Variants

  • varp: Variant Phenotypes

  • pdis: Genetic / Protein diseases

The GNU Health commands allow arguments. For example, if you want to get the information related to the CFTR gene, you can type in the command line widget:

gdis CFTR

To retrieve a CFTR natural variant its Feature ID (FTid) var_000179

var VAR_000179

If you want to see the genetic information for a particular patient:

gen GNU777ORG

To access the latest information about cystic fibrosis:

pdis DI-01466