Scenario 5: Genetic Analysis Case Study on the NRAS Gene

An Undiagnosed Child with Delayed Motor Development and Generalized Dysmorphism

On clinical examination, the child was found to have several dysmorphic features, in particular a broad neck and low-set ears. The child also shows delayed motor development.

Genetic testing by whole exome sequencing (WES) identified a likely pathogenic mutation in the NRAS gene.


Research Questions

Question 1: Gene Mutation Prevalence

Are there any other individuals with a mutation in the same gene?

To answer this question, the SPARQL query below finds the phenopackets that include at least one mutation in the NRAS gene. The gene descriptions are selected by the HGNC identifier of NRAS, HGNC:7989.

PREFIX ex: <https://example.org/>
PREFIX obo: <http://purl.obolibrary.org/obo/>
PREFIX sio: <http://semanticscience.org/resource/>
PREFIX dcterms: <http://purl.org/dc/terms/> 
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT ?phenopacketidval ?creatorname ?createddate
WHERE {
    GRAPH <http://example.org/MONARCH_PHENOPACKET_STORE_DATA> {
        # Find the phenopackets that contain at least one mutation in given gene
        ?genedescr a obo:NCIT_C16612 ;
                dcterms:identifier 'HGNC:7989' .
        ?vardescr sio:SIO_001403 ?genedescr .
        ?varinterpr sio:SIO_001403 ?vardescr .
        ?genomicinterp sio:SIO_001403 ?varinterpr .
        ?diagnosis sio:SIO_001403 ?genomicinterp .
        ?interpr sio:SIO_001403 ?diagnosis .
        ?phenopacket sio:SIO_001403 ?interpr ;
                    a obo:NCIT_C79269 ;
                    sio:SIO_000228 ?role .
        
        # Find phenopacket ID value
        ?phenopacketid sio:SIO_000020 ?role ;
                    sio:SIO_000300 ?phenopacketidval .
        
        # Find metadata
        ?phenopacket sio:SIO_001403 ?metadata .
        ?metadata a obo:NCIT_C52095 ;
                sio:SIO_000008 ?creator ;
                sio:SIO_000008 ?created .
        
        ?creator a dcterms:creator .
        ?creatorval sio:SIO_000628 ?creator ;
                    a obo:NCIT_C70856 ;
                    sio:SIO_000300 ?creatorname .
        
        ?created a dcterms:created .
        ?createdval sio:SIO_000628 ?created ;
                    a obo:NCIT_C70856 ;
                    sio:SIO_000300 ?createddate .
    }
}

As shown below, fourteen phenopackets containing phenotypic and genetic information related to mutations in the NRAS gene were found. According to the metadata, all of them were created by the same researcher at the same date and time.

    phenopacket ID                                   creator                    creation date
0   PMID_26467218_individual_6_Cirstea_et_al__14     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
1   PMID_26467218_individual_11_Kraoua_et_al__23     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
2   PMID_26467218_individual_3_Denayer_et_al__22     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
3   PMID_26467218_individual_9_Cirstea_et_al__14     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
4   PMID_26467218_individual_4_Denayer_et_al__22     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
5   PMID_26467218_individual_12_Present_study        ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
6   PMID_26467218_individual_1_De_Filippi_et_al__20  ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
7   PMID_26467218_individual_2_Runtuwene_et_al__21   ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
8   PMID_26467218_individual_9_M_Cirstea_et_al__14   ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
9   PMID_26467218_individual_5_Denayer_et_al__22     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
10  PMID_26467218_individual_12_F_Present_study      ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
11  PMID_26467218_individual_8_Cirstea_et_al__14     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
12  PMID_26467218_individual_10_Kraoua_et_al__23     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12
13  PMID_26467218_individual_7_Cirstea_et_al__14     ORCID:0000-0002-0736-9199  2024-05-20 20:56:12

Question 2: Phenotype and Disease Prevalence

What phenotypes and which diseases have been observed in these individuals and how often do they occur in this group of individuals?

The phenopackets found for the previous question can now be analysed to determine which phenotypes and diseases occur in this group of individuals. SPARQL queries are used to retrieve the associated phenotypes and diseases. The query below retrieves all diagnosed diseases in the relevant phenopackets.

PREFIX ex: <https://example.org/>
PREFIX obo: <http://purl.obolibrary.org/obo/>
PREFIX sio: <http://semanticscience.org/resource/>
PREFIX dcterms: <http://purl.org/dc/terms/> 
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT ?idvalue ?diseaseid ?diseaselabel
WHERE {
    GRAPH <http://example.org/MONARCH_PHENOPACKET_STORE_DATA> {
        # Find relevant phenopackets
        ?id a obo:IAO_0020000 ;
            sio:SIO_000300 ?idvalue .
        VALUES ?idvalue { "PMID_26467218_individual_6_Cirstea_et_al__14" "PMID_26467218_individual_11_Kraoua_et_al__23" "PMID_26467218_individual_3_Denayer_et_al__22" "PMID_26467218_individual_9_Cirstea_et_al__14" "PMID_26467218_individual_4_Denayer_et_al__22" "PMID_26467218_individual_12_Present_study" "PMID_26467218_individual_1_De_Filippi_et_al__20" "PMID_26467218_individual_2_Runtuwene_et_al__21" "PMID_26467218_individual_9_M_Cirstea_et_al__14" "PMID_26467218_individual_5_Denayer_et_al__22" "PMID_26467218_individual_12_F_Present_study" "PMID_26467218_individual_8_Cirstea_et_al__14" "PMID_26467218_individual_10_Kraoua_et_al__23" "PMID_26467218_individual_7_Cirstea_et_al__14" } .
        
        # Find diagnosis
        ?id sio:SIO_000020 ?role .
        ?phenopacket sio:SIO_000228 ?role ;
                    sio:SIO_001403 ?interpr .
        ?interpr a obo:NCIT_C41255 ;
                sio:SIO_001403 ?diagnosis .
        ?diagnosis a sio:SIO_000614 ;
                sio:SIO_001403 ?disease .
        ?diseaseval a obo:NCIT_C2991 ;
                    sio:SIO_000628 ?disease .
        ?disease rdfs:label ?diseaselabel ;
                dcterms:identifier ?diseaseid .
    }
}

According to the results of this query, shown below, all fourteen individuals with a mutation in the NRAS gene are diagnosed with Noonan syndrome 6 (OMIM:613224).

    phenopacket ID                                   disease ID   disease label
0   PMID_26467218_individual_6_Cirstea_et_al__14     OMIM:613224  Noonan syndrome 6
1   PMID_26467218_individual_11_Kraoua_et_al__23     OMIM:613224  Noonan syndrome 6
2   PMID_26467218_individual_3_Denayer_et_al__22     OMIM:613224  Noonan syndrome 6
3   PMID_26467218_individual_9_Cirstea_et_al__14     OMIM:613224  Noonan syndrome 6
4   PMID_26467218_individual_4_Denayer_et_al__22     OMIM:613224  Noonan syndrome 6
5   PMID_26467218_individual_12_Present_study        OMIM:613224  Noonan syndrome 6
6   PMID_26467218_individual_1_De_Filippi_et_al__20  OMIM:613224  Noonan syndrome 6
7   PMID_26467218_individual_2_Runtuwene_et_al__21   OMIM:613224  Noonan syndrome 6
8   PMID_26467218_individual_9_M_Cirstea_et_al__14   OMIM:613224  Noonan syndrome 6
9   PMID_26467218_individual_5_Denayer_et_al__22     OMIM:613224  Noonan syndrome 6
10  PMID_26467218_individual_12_F_Present_study      OMIM:613224  Noonan syndrome 6
11  PMID_26467218_individual_8_Cirstea_et_al__14     OMIM:613224  Noonan syndrome 6
12  PMID_26467218_individual_10_Kraoua_et_al__23     OMIM:613224  Noonan syndrome 6
13  PMID_26467218_individual_7_Cirstea_et_al__14     OMIM:613224  Noonan syndrome 6
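The queries in this section restrict the search to the phenopackets from Question 1 by listing their identifiers in a VALUES clause. Rather than typing that list by hand, the clause can be generated from the earlier result. The following is a minimal sketch in Python; the helper name `build_values_clause` is hypothetical and not part of the original analysis.

```python
def build_values_clause(variable, identifiers):
    """Return a SPARQL VALUES clause binding `variable` to string literals."""
    def quote(value):
        # Escape backslashes and double quotes so the literal stays valid.
        return '"' + value.replace("\\", "\\\\").replace('"', '\\"') + '"'

    literals = " ".join(quote(value) for value in identifiers)
    return f"VALUES ?{variable} {{ {literals} }}"


# Two of the fourteen identifiers returned for Question 1.
phenopacket_ids = [
    "PMID_26467218_individual_6_Cirstea_et_al__14",
    "PMID_26467218_individual_11_Kraoua_et_al__23",
]
values_clause = build_values_clause("idvalue", phenopacket_ids)
print(values_clause)
```

The resulting string can be placed inside the GRAPH block of a query template in place of the hand-written list.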

The query shown below searches for all phenotypes that are associated with at least one of the relevant phenopackets.

PREFIX ex: <https://example.org/>
PREFIX obo: <http://purl.obolibrary.org/obo/>
PREFIX sio: <http://semanticscience.org/resource/>
PREFIX dcterms: <http://purl.org/dc/terms/> 
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

SELECT ?idvalue ?phenofeatid ?phenofeatlabel
WHERE {
    GRAPH <http://example.org/MONARCH_PHENOPACKET_STORE_DATA> {
        # Find relevant phenopackets
        ?id a obo:IAO_0020000 ;
            sio:SIO_000300 ?idvalue .
        VALUES ?idvalue { "PMID_26467218_individual_6_Cirstea_et_al__14" "PMID_26467218_individual_11_Kraoua_et_al__23" "PMID_26467218_individual_3_Denayer_et_al__22" "PMID_26467218_individual_9_Cirstea_et_al__14" "PMID_26467218_individual_4_Denayer_et_al__22" "PMID_26467218_individual_12_Present_study" "PMID_26467218_individual_1_De_Filippi_et_al__20" "PMID_26467218_individual_2_Runtuwene_et_al__21" "PMID_26467218_individual_9_M_Cirstea_et_al__14" "PMID_26467218_individual_5_Denayer_et_al__22" "PMID_26467218_individual_12_F_Present_study" "PMID_26467218_individual_8_Cirstea_et_al__14" "PMID_26467218_individual_10_Kraoua_et_al__23" "PMID_26467218_individual_7_Cirstea_et_al__14" } .
        
        # Find phenotypic features
        ?id sio:SIO_000020 ?role .
        ?phenopacket sio:SIO_000228 ?role ;
                    sio:SIO_001403 ?phenofeat .
        ?phenofeatval sio:SIO_000628 ?phenofeat ;
                    a obo:NCIT_C16977 .
        ?phenofeat dcterms:identifier ?phenofeatid ;
                rdfs:label ?phenofeatlabel .

        # Find the phenotypic features that are not excluded
        OPTIONAL {
            ?phenofeat sio:SIO_000008 ?excluded .
            ?excluded a obo:HP_0040285 .
            ?excludedval sio:SIO_000628 ?excluded ;
                        sio:SIO_000300 ?excludedlabel . 
        }
        
        FILTER (!bound(?excluded) || ?excludedlabel = "false"^^xsd:boolean)
    }
}

A total of 157 phenotype observations, covering 29 distinct phenotypes, were found. A sample of these results is shown below:

     phenopacket ID                                   phenotype ID  phenotype label
8    PMID_26467218_individual_6_Cirstea_et_al__14     HP:0000358    Posteriorly rotated ears
11   PMID_26467218_individual_6_Cirstea_et_al__14     HP:0006699    Premature atrial contractions
30   PMID_26467218_individual_3_Denayer_et_al__22     HP:0000348    High forehead
31   PMID_26467218_individual_3_Denayer_et_al__22     HP:0000369    Low-set ears
36   PMID_26467218_individual_3_Denayer_et_al__22     HP:0000563    Keratoconus
44   PMID_26467218_individual_9_Cirstea_et_al__14     HP:0032152    Keratosis pilaris
58   PMID_26467218_individual_12_Present_study        HP:0000957    Cafe-au-lait spot
82   PMID_26467218_individual_2_Runtuwene_et_al__21   HP:0000348    High forehead
137  PMID_26467218_individual_10_Kraoua_et_al__23     HP:0001270    Motor delay
155  PMID_26467218_individual_7_Cirstea_et_al__14     HP:0000028    Cryptorchidism

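From these results, the occurrence of each phenotype is calculated as the number of phenopackets in which it is observed, expressed as a percentage of the fourteen phenopackets in the group: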

    phenotype ID  phenotype label                   count  occurrence percentage
0   HP:0000369    Low-set ears                      14     100.0%
1   HP:0000316    Hypertelorism                     14     100.0%
2   HP:0000494    Downslanted palpebral fissures    14     100.0%
3   HP:0000358    Posteriorly rotated ears          14     100.0%
4   HP:0000348    High forehead                     14     100.0%
5   HP:0000465    Webbed neck                       10     71.4%
6   HP:0000767    Pectus excavatum                  9      64.3%
7   HP:0001270    Motor delay                       9      64.3%
8   HP:0000028    Cryptorchidism                    6      42.9%
9   HP:0004322    Short stature                     5      35.7%
10  HP:0000957    Cafe-au-lait spot                 5      35.7%
11  HP:0002212    Curly hair                        5      35.7%
12  HP:0000256    Macrocephaly                      5      35.7%
13  HP:0001561    Polyhydramnios                    4      28.6%
14  HP:0032152    Keratosis pilaris                 4      28.6%
15  HP:0001639    Hypertrophic cardiomyopathy       3      21.4%
16  HP:0000545    Myopia                            3      21.4%
17  HP:0001642    Pulmonic stenosis                 3      21.4%
18  HP:0000508    Ptosis                            3      21.4%
19  HP:0000978    Bruising susceptibility           3      21.4%
20  HP:0000486    Strabismus                        2      14.3%
21  HP:0006699    Premature atrial contractions     1      7.1%
22  HP:0000563    Keratoconus                       1      7.1%
23  HP:0001631    Atrial septal defect              1      7.1%
24  HP:0000483    Astigmatism                       1      7.1%
25  HP:0012209    Juvenile myelomonocytic leukemia  1      7.1%
26  HP:0001655    Patent foramen ovale              1      7.1%
27  HP:0001680    Coarctation of aorta              1      7.1%
28  HP:0001195    Single umbilical artery           1      7.1%
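The counts and percentages above can be reproduced from the (phenopacket, phenotype) rows returned by the phenotype query. The sketch below shows one way to do this in plain Python; the function name and the toy rows (phenopackets `P1` to `P3`) are illustrative assumptions, not the code or data used for the table.

```python
from collections import Counter


def phenotype_occurrence(rows, n_phenopackets):
    """Count in how many phenopackets each phenotype occurs.

    `rows` is an iterable of (phenopacket_id, phenotype_id) pairs.
    Returns (phenotype_id, count, percentage) tuples, most frequent first.
    """
    # De-duplicate so a phenotype counts at most once per phenopacket.
    unique_pairs = set(rows)
    counts = Counter(phenotype_id for _, phenotype_id in unique_pairs)
    # Sort by descending count, then by identifier for a stable order.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [
        (phenotype_id, count, round(100 * count / n_phenopackets, 1))
        for phenotype_id, count in ordered
    ]


# Toy input: three hypothetical phenopackets, not the real result set.
toy_rows = [
    ("P1", "HP:0000369"), ("P2", "HP:0000369"), ("P3", "HP:0000369"),
    ("P1", "HP:0001270"), ("P2", "HP:0001270"),
    ("P1", "HP:0000465"),
]
occurrence = phenotype_occurrence(toy_rows, n_phenopackets=3)
for phenotype_id, count, percentage in occurrence:
    print(phenotype_id, count, f"{percentage}%")
```

With the real data the denominator is 14, so a phenotype seen in 10 phenopackets gives `round(100 * 10 / 14, 1)`, which is 71.4.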

Question 3: Matching Phenotypes

Do the found phenotypes overlap with the phenotypes observed in the undiagnosed child?

Three phenotypes have been observed in the undiagnosed child: broad neck, low-set ears and motor delay. To match them with the phenotypes recorded in the phenopackets, their Human Phenotype Ontology (HPO) identifiers need to be retrieved. These identifiers are obtained by querying the HPO and matching each observed phenotype against the labels of the HPO terms. The SPARQL query below searches for the identifier of the broad neck phenotype:

PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX obo: <http://purl.obolibrary.org/obo/>

SELECT ?iri
WHERE {
    GRAPH <http://example.org/HPO> {
        ?iri rdfs:label 'Broad neck' .
    }
}

Executing this SPARQL query for all observed phenotypes results in the following identifiers:

    HPO ID      label
0   HP:0000475  Broad neck
1   HP:0000369  Low-set ears
2   HP:0001270  Motor delay
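The label query returns term IRIs, whereas the table above lists compact identifiers such as HP:0000475. A small conversion step bridges the two. The sketch below assumes the HPO graph uses the standard OBO PURL pattern (`http://purl.obolibrary.org/obo/HP_0000475`); the function name `iri_to_curie` is hypothetical.

```python
OBO_BASE = "http://purl.obolibrary.org/obo/"


def iri_to_curie(iri):
    """Convert an OBO PURL such as .../obo/HP_0000475 to the CURIE HP:0000475."""
    if not iri.startswith(OBO_BASE):
        raise ValueError(f"not an OBO PURL: {iri}")
    local_name = iri[len(OBO_BASE):]
    # Split on the first underscore: ontology prefix, then the local ID.
    prefix, separator, local_id = local_name.partition("_")
    if not separator or not prefix or not local_id:
        raise ValueError(f"unexpected OBO local name: {local_name}")
    return f"{prefix}:{local_id}"


curie = iri_to_curie("http://purl.obolibrary.org/obo/HP_0000475")
print(curie)
```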

Next, the list of observed phenotypes is compared with the list of phenotypes associated with the selected phenopackets. Two of the observed phenotypes, Motor delay (HP:0001270) and Low-set ears (HP:0000369), are also found in the individuals represented in the phenopackets; Broad neck (HP:0000475) is not.
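This comparison amounts to a set membership check on HPO identifiers. The sketch below illustrates it with the three observed phenotypes and a small subset of the cohort identifiers taken from the occurrence table; the variable names are illustrative.

```python
# Phenotypes observed in the undiagnosed child.
observed = {
    "HP:0000475": "Broad neck",
    "HP:0000369": "Low-set ears",
    "HP:0001270": "Motor delay",
}

# Subset of the phenotype identifiers found in the NRAS phenopackets.
cohort_ids = {"HP:0000369", "HP:0000316", "HP:0000465", "HP:0001270"}

# Split the observed phenotypes by whether the cohort also shows them.
matched = {pid: label for pid, label in observed.items() if pid in cohort_ids}
unmatched = {pid: label for pid, label in observed.items() if pid not in cohort_ids}

print("matched:", matched)
print("unmatched:", unmatched)
```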

More information can still be gathered about the observed phenotype that was not found in any of the phenopackets. One step is to investigate how close the observed phenotype is to the phenotypes associated with Noonan syndrome 6, given their relative positions in the HPO. More explicitly, this closeness can be calculated as the shortest path length between phenotypes in a network. The network is built by adding each phenotype as a node; an edge between two nodes means that one phenotype is a subclass or superclass of the other.
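As an illustration of this distance measure, the sketch below builds an undirected network from (child, parent) subclass pairs and computes the shortest path length with a breadth-first search. The hierarchy is a made-up toy example with placeholder term names, not the real HPO structure, and the function names are hypothetical.

```python
from collections import deque


def build_graph(subclass_edges):
    """Build an undirected adjacency map from (child, parent) pairs."""
    graph = {}
    for child, parent in subclass_edges:
        graph.setdefault(child, set()).add(parent)
        graph.setdefault(parent, set()).add(child)
    return graph


def shortest_path_length(graph, source, target):
    """Breadth-first search; returns the edge count, or None if unreachable."""
    if source not in graph or target not in graph:
        return None
    queue = deque([(source, 0)])
    visited = {source}
    while queue:
        node, distance = queue.popleft()
        if node == target:
            return distance
        for neighbour in graph[node]:
            if neighbour not in visited:
                visited.add(neighbour)
                queue.append((neighbour, distance + 1))
    return None


# Hypothetical toy hierarchy (placeholder names, NOT real HPO terms).
toy_edges = [
    ("neck_a", "neck_abnormality"),
    ("neck_b", "neck_abnormality"),
    ("neck_abnormality", "root"),
    ("ear_a", "ear_abnormality"),
    ("ear_abnormality", "root"),
]
toy_graph = build_graph(toy_edges)
print(shortest_path_length(toy_graph, "neck_a", "neck_b"))
print(shortest_path_length(toy_graph, "neck_a", "ear_a"))
```

In this toy network, two siblings under the same parent are two edges apart, while terms in different branches are connected only through the root and are therefore further apart.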

Given the path lengths between each pair of phenotypes, the phenotypes can be placed on a two-dimensional plot by applying multidimensional scaling, so that phenotypes separated by short paths lie close together. This gives an overview of all phenotypes and their similarities, expressed as positions in two-dimensional space. The plot is shown below:

[Figure: two-dimensional multidimensional scaling plot of the phenotypes]

The phenotypes labeled in bold text are those observed in the undiagnosed child. The colors of the points again show that low-set ears and motor delay are also found in patients with Noonan syndrome 6. Interestingly, the plot also shows that the remaining observed phenotype, “broad neck”, lies close to “webbed neck”, which is also observed in the group of individuals with this diagnosis.
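The text does not state which multidimensional scaling variant produced the plot. As an illustration only, the sketch below implements classical (Torgerson) scaling in plain Python: the squared distances are double-centred and the leading eigenvectors are extracted by power iteration. The function name, the parameters and the example distances are assumptions. Path-length distances need not be Euclidean, in which case negative eigenvalues are clamped to zero here; a real analysis would more likely rely on a numerical library.

```python
import math
import random


def classical_mds(distances, dims=2, iterations=500, seed=0):
    """Embed points in `dims` dimensions from a symmetric distance matrix."""
    n = len(distances)
    squared = [[d * d for d in row] for row in distances]
    row_mean = [sum(row) / n for row in squared]
    grand_mean = sum(row_mean) / n
    # Double centring: B = -1/2 * J * D^2 * J. The matrix is symmetric,
    # so the row means also serve as column means.
    b = [
        [-0.5 * (squared[i][j] - row_mean[i] - row_mean[j] + grand_mean)
         for j in range(n)]
        for i in range(n)
    ]
    rng = random.Random(seed)
    coords = [[0.0] * dims for _ in range(n)]
    for axis in range(dims):
        # Power iteration from a random unit vector.
        v = [rng.uniform(-1.0, 1.0) for _ in range(n)]
        length = math.sqrt(sum(x * x for x in v))
        v = [x / length for x in v]
        for _ in range(iterations):
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break  # nothing left to extract along this axis
            v = [x / norm for x in w]
        # Rayleigh quotient of the unit vector gives its eigenvalue.
        bv = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
        eigenvalue = sum(v[i] * bv[i] for i in range(n))
        scale = math.sqrt(max(eigenvalue, 0.0))
        for i in range(n):
            coords[i][axis] = scale * v[i]
        # Deflate so the next axis finds the next eigenvector.
        for i in range(n):
            for j in range(n):
                b[i][j] -= eigenvalue * v[i] * v[j]
    return coords


# Example distances (a 3-4-5 right triangle, not phenotype data).
distances = [[0, 3, 4], [3, 0, 5], [4, 5, 0]]
points = classical_mds(distances)
for point in points:
    print([round(value, 3) for value in point])
```

For Euclidean input such as this triangle, the pairwise distances between the returned points match the input distances; the orientation of the embedding is arbitrary.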


Conclusion

Given that mutations in the NRAS gene were identified in individuals with Noonan syndrome 6, and given the overlap and similarity between the child's phenotypes and those of the individuals diagnosed with Noonan syndrome 6, there is reason to investigate Noonan syndrome 6 further as a likely diagnosis for the child.