Introduction

In January 2017, the Content Managers Advisory Group (CMAG) initiated a survey of the national extensions that were available. To date there has been limited information about these extensions available to other Members and, presumably, to SNOMED International. The responses to this survey are available here. The results show that a variety of extensions are produced, ranging from subsets/refsets and language translations to clinical content development. Subsequent activities investigating collaboration on subsets may be pursued, but the CMAG was also interested in exactly what clinical content was in each extension, and how this might be shared. There should be very little clinical content that is exclusive to any country; if content has been developed by one extension, it is likely to be globally relevant, and sharing it can reduce duplicated effort and the maintenance burden on extension builders.

The results described here combine objective metrics (size of content), crude identification of duplicated effort, and some incidental quality observations. Further analysis of the content is still under way using description logic techniques; those results will be made available separately at a later date.

SQL snippets are included in this document for the author's future reference, but are unlikely to be useful to the general readership.

A summary of the results is available in the conclusion section at the end of this paper.

The cooperation of all Members is appreciated. Every effort has been made to represent the extensions accurately; any inaccuracies are accidental.

Summary of Extensions

Fourteen NRCs responded to the survey, nine of which indicated that they create clinical content extensions.
The Australian Edition also includes its national drug extension, which has been excluded from this round of analysis (as no other extension appeared to include such content).
All extensions except one were based upon the July 2016 International release. This exception may produce some anomalies, but they are limited to that extension.

The table below gives a raw analysis of the active concepts within each extension.

Note: The UK extension (approximately 30,000 concepts) is about six times larger than the US extension (approximately 5,000) and has been excluded from the graph as an outlier. Raw values are shown in the table below.

NRC  Size   Primitive  Primitive %  Defined  Defined %
AU   1339   1056       78.9%        283      21.1%
CA   2040   2030       99.5%        10       0.5%
DK   290    290        100.0%       0        0.0%
LT   0      0          N/A          0        N/A
NL   1041   686        65.9%        355      34.1%
SE   1175   1060       90.2%        115      9.8%
UK   29793  29793      100.0%       0        0.0%
US   5100   2764       54.2%        2336     45.8%
UY   415    371        89.4%        44       10.6%

Ratio of active to inactive concepts

NRC                                                 Proportion currently active
SNOMED CT Netherlands NRC maintained module         95.3%
US National Library of Medicine maintained module   93.3%
módulo de la extensión de Uruguay                   95.6%
Canada Health Infoway English module                68.3%
SNOMED CT Sweden NRC maintained module              99.6%
Australian common model component extension         33.3%
Danish module                                       88.4%
SNOMED Clinical Terms Australian extension          97.0%
SNOMED CT United Kingdom clinical extension module  32.9%

All analysis was performed on active content only.

Extension changes against International Concept IDs

A total of 40 core concepts have been modified by extensions in some way.
Two were retired by an extension:

  • 384612007|pT4a: Tumor directly invades other organs or structures (colon/rectum) (finding)|
  • 384613002|pT4b: Tumor penetrates visceral peritoneum (colon/rectum) (finding)|
  • (A third concept was retired, but later reactivated)

One concept had a change to definition status (marked Defined) by an extension

  • 399733007|Excision of retroperitoneal lymph node (procedure)|

Eight of these appear to be attempts to address issues with module assignment in the International release (i.e. concepts inactivated in a different module from the one in which they were created, metadata vs core).

The remainder are simply changes to moduleId, and represent either content promotion from an extension to the International release or a possible error.

-- Listed concept ids that carry more than one moduleId across their versions
select id,count(*) from X_Concepts
where id in (246089008,246221002,260670006,263512003,263513008,447564002,449609005,700043003,11000119105,41000179103,441000119109,601000119109,1111000119100,1561000119105,4181000179103,4191000179101,4201000179104,4211000179102,4221000179107,4231000179109,4241000179101,4251000179103,4261000179100,4271000179106,4281000179108,4301000179109,4311000179106,4321000179101,4331000179104,4341000179107,4351000179105,5461000179100,5471000179106,5481000179108,5491000179105,5531000179105)
and moduleId != 161771000036108 -- exclude the Australian common model component extension
group by id
having count(distinct moduleId) > 1;

Extension Concepts

There appear to be around 52 unique semantic tags across the extension content, many of which are attributable to translations. Because not all extensions provide English FSNs for extension content [1], semantic tags were manually translated and merged.
After normalisation, this comes to 32 semantic tags. The distribution of content is shown below.

Semantic Tag                  Count
procedure                     11372
finding                       6925
observable entity             5609
disorder                      4783
situation                     2677
event                         2251
qualifier value               1903
record artifact               1361
regime/therapy                846
assessment scale              549
occupation                    543
substance                     503
foundation metadata concept   414
morphologic abnormality       321
product                       320
person                        134
navigational concept          132
environment / location        114
organism                      114
body structure                106
specimen                      95
administrative                66
physical object               59
link assertion                22
core metadata concept         19
ethnic group                  16
attribute                     7
social concept                5
religion/philosophy           4
linkage concept               3
tumor staging                 2
cell                          1
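
The tag normalisation described above can be sketched as follows. This is a minimal illustration, not the method actually used (the merging was done manually). It assumes the semantic tag is the final parenthesised group of the FSN. The translation map is hypothetical and would need an entry for every translated tag found in the extensions.

```python
import re
from collections import Counter

# Matches the final parenthesised group of an FSN, e.g. "(disorder)".
_TAG_RE = re.compile(r"\(([^()]+)\)\s*$")

# Hypothetical, illustrative mapping of translated tags to English tags.
TAG_TRANSLATIONS = {
    "procedimiento": "procedure",
    "hallazgo": "finding",
    "trastorno": "disorder",
}


def semantic_tag(fsn):
    """Return the normalised semantic tag of an FSN, or None if it has none."""
    match = _TAG_RE.search(fsn)
    if match is None:
        return None
    tag = match.group(1).strip().lower()
    # Merge translated tags into their English equivalent where one is known.
    return TAG_TRANSLATIONS.get(tag, tag)


def tag_distribution(fsns):
    """Count concepts per normalised semantic tag."""
    tags = (semantic_tag(fsn) for fsn in fsns)
    return Counter(tag for tag in tags if tag is not None)
```

Only the last parenthesised group is taken, so an FSN such as "pT4a: Tumor directly invades other organs or structures (colon/rectum) (finding)" yields "finding" rather than "colon/rectum".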

 

Nine modules are in use across the extensions.

ModuleId            FSN                                                 Country
11000146104         SNOMED CT Netherlands NRC maintained module         NL
731000124108        US National Library of Medicine maintained module   US
5631000179106       módulo de la extensión de Uruguay                   UY
20621000087109      Canada Health Infoway English module                CA
45991000052106      SNOMED CT Sweden NRC maintained module              SE
161771000036108     Australian common model component extension         AU
554471000005108     Danish module                                       DK
32506021000036107   SNOMED Clinical Terms Australian extension          AU
999000011000000103  SNOMED CT United Kingdom clinical extension module  UK

The type of content by hierarchy

Each top-level hierarchy is reviewed below for extension content.
Duplicates were found by comparing terms across extensions within a given hierarchy (for example, "look for duplicate terms within the procedure hierarchy"). Duplicates within a single module were ignored.

Analysis was done on the complete aggregate of extensions plus the International core.
The presence of duplication may indicate:

  1. Extension concepts that are also in the core, having been added either before or after the extension concept was created.
    • Those where the concept appears in the International release after its creation in an extension represent a maintenance burden for NRCs in the absence of a promotion process.
  2. At least two countries producing similar, if not the same, content, which would suggest that the content is not necessarily country specific.
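
The two cases above can be told apart from the set of modules in which a duplicated term appears. The following is a minimal sketch under stated assumptions: the country mapping is copied from the module table in this document, and the International module ids are those named in the "exclude international" comment of the SQL in this document. The function name and labels are illustrative only. Telling "before" from "after" would additionally require effective times, which this sketch does not use.

```python
# International modules, as named in the "exclude international" SQL comment.
INTERNATIONAL_MODULES = {900000000000207008, 900000000000012004}

# Module-to-country mapping copied from the module table in this document.
MODULE_COUNTRY = {
    11000146104: "NL",
    731000124108: "US",
    5631000179106: "UY",
    20621000087109: "CA",
    45991000052106: "SE",
    161771000036108: "AU",
    554471000005108: "DK",
    32506021000036107: "AU",
    999000011000000103: "UK",
}


def classify_duplicate(module_ids):
    """Label a duplicated term by the set of modules it appears in."""
    modules = set(module_ids)
    countries = {MODULE_COUNTRY[m] for m in modules if m in MODULE_COUNTRY}
    labels = []
    # Case 1: an extension term that also exists in the International release.
    if modules & INTERNATIONAL_MODULES and countries:
        labels.append("extension content also in International release")
    # Case 2: the same term authored by extensions from different countries.
    if len(countries) > 1:
        labels.append("content produced by more than one country")
    return labels
```

Mapping modules to countries first means that the two Australian modules count as one producer, so a term shared only between them is not reported as multi-country content.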

The initial analysis is agnostic of description type; however, the analysis was then repeated on FSNs only to increase the likelihood that a match represents a genuine duplicate.
A major limitation of the approach is that translations will almost inherently be unique, so the comparison depends on English terms.
It was discovered mid-analysis that a setting within the analysis database may have caused incorrect character rendering; however, this is not expected to affect the results of this analysis.

SET @Hierarchy = 404684003; -- Clinical finding; change to the hierarchy under review

-- FSNs that appear in more than one module within the chosen hierarchy
select term,count(distinct moduleId) from X_Descriptions
where conceptId in (select distinct id from X_Concepts where active)
and moduleId != 900062011000036108 -- exclude AMT module
-- and moduleId not in(900000000000207008,900000000000012004) -- exclude international
and typeId = 900000000000003001 -- FSNs only
and conceptId in (select sourceId from X_TransitiveClosure where destinationId = @Hierarchy)
and active = 1
group by term
having count(distinct moduleId) > 1;

-- candidates for consideration.
select * from X_Descriptions
-- active descriptions for active concepts
where active and conceptId in (select distinct id from X_Concepts where active)
-- target hierarchy
and conceptId in (select sourceId from X_TransitiveClosure where destinationId = @Hierarchy)
-- terms found in more than one module (no typeId filter: all description types)
and term in (select distinct term from X_Descriptions
             where conceptId in (select distinct id from X_Concepts where active)
             and moduleId != 900062011000036108 -- exclude AMT module
             -- and moduleId not in(900000000000207008,900000000000012004)
             and conceptId in (select sourceId from X_TransitiveClosure where destinationId = @Hierarchy)
             and active = 1
             group by term
             having count(distinct moduleId) > 1);
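
For readers without access to the analysis database, the grouping performed by the queries above can be sketched in Python. This is an illustration rather than the code used for the analysis. It assumes the descriptions have been loaded as dictionaries with `term`, `conceptId`, `moduleId` and `active` keys, and that `hierarchy_members` (a hypothetical name) holds the ids of the active concepts under the hierarchy root.

```python
from collections import defaultdict

AMT_MODULE = 900062011000036108  # excluded in the SQL above


def duplicated_terms(descriptions, hierarchy_members, excluded_modules=(AMT_MODULE,)):
    """Return {term: set of moduleIds} for terms found in more than one module.

    descriptions: iterable of dicts with keys term, conceptId, moduleId, active.
    hierarchy_members: set of ids of active concepts within the target hierarchy.
    """
    modules_by_term = defaultdict(set)
    for description in descriptions:
        if not description["active"]:
            continue  # active descriptions only
        if description["moduleId"] in excluded_modules:
            continue  # e.g. the AMT module
        if description["conceptId"] not in hierarchy_members:
            continue  # outside the target hierarchy
        modules_by_term[description["term"]].add(description["moduleId"])
    # Equivalent of "having count(distinct moduleId) > 1".
    return {term: mods for term, mods in modules_by_term.items() if len(mods) > 1}
```

Collecting the module ids per term in a set gives the same effect as `count(distinct moduleId)`: a term repeated within a single module is not reported.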

 

Clinical finding

      [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 185 (1.3%)]

      NRC                                          Concepts
      SNOMED CT Netherlands NRC maintained         681
      US National Library of Medicine maintained   3461
      Uruguay extension module                     14
      Canada Health Infoway English                73
      SNOMED CT Sweden NRC maintained              548
      Danish                                       24
      SNOMED Clinical Terms Australian extension   74
      SNOMED CT United Kingdom clinical extension  8586

      Potential Concept Duplication

      26 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
      The affected concepts are in the following extensions.

      • US National Library of Medicine maintained module
      • SNOMED CT United Kingdom clinical extension module
      • SNOMED CT Netherlands NRC maintained module
      • SNOMED Clinical Terms Australian extension
      • SNOMED CT Sweden NRC maintained module
      • Danish module

      All but the Danish module have some overlap with each other, as well as with the International release.
      These are the identified FSNs.

      Adverse reaction to rotavirus vaccine (disorder)
      Atypical atrial flutter (disorder)
      Chronic epiglottitis (disorder)
      Excessive bioactive substance intake (finding)
      Excessive enteral nutrition infusion (finding)
      Excessive growth rate (finding)
      Excessive parenteral nutrition infusion (finding)
      Family history of lactose intolerance (situation)
      Inadequate bioactive substance intake (finding)
      Inadequate enteral nutrition infusion (finding)
      Inadequate parenteral nutrition infusion (finding)
      Mild dementia (disorder)
      Patient identity verified (finding)
      Physical disability (finding)
      Predicted excessive energy intake (finding)
      Predicted inadequate energy intake (finding)
      Severe dementia (disorder)
      Subendocardial myocardial infarction (disorder)
      Suspected cerebrovascular accident (situation)
      Suspected sepsis (situation)
      Takotsubo cardiomyopathy (disorder)
      Thrombosis of internal jugular vein (disorder)
      Typical atrial flutter (disorder)
      Unintentional weight gain (finding)
      Ureterostomy present (finding)
      Viral meningoencephalitis (disorder)

      There are 6400 synonyms that are not unique across this set. There appear to be a number of reasons for this, though most seem to relate to translations.

      For example:

      • 371093006|Urosepsis (disorder)| has descriptions in the extensions from three countries that are the same as the 'en' description.
      • 27830001|Brachial radiculitis (disorder)| has translations in two extensions that differ from the 'en' description, but differ from each other only by the case of the first character.
      • 75049004|Jeune thoracic dystrophy (disorder)| has translations in two extensions that appear identical.

      These may have different character encoding or punctuation conventions, or the written languages may be genuinely similar (Danish and Swedish). A binary comparison (under which terms that differ only in case no longer match) halved the number of duplicate terms identified. It is unclear (to the author) what the standards and rules are concerning translations: are they complete (all concepts), partial (only concepts of interest), or as necessary (only where the translated term differs)?
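
The effect of the comparison rule on the duplicate count can be illustrated with a small sketch. It is not the database comparison used in the analysis. The `exact` mode stands in for a binary comparison and the `caseless` mode for a case-insensitive one. The sample rows and the module labels "A" and "B" are placeholders, not extension data.

```python
from collections import defaultdict


def comparison_key(term, mode="exact"):
    """Return the key under which a term is compared.

    mode="exact": terms match only if they are character-for-character identical.
    mode="caseless": terms that differ only in case also match.
    """
    if mode == "caseless":
        return term.casefold()
    return term


def duplicate_keys(rows, mode="exact"):
    """rows: iterable of (term, module); return keys seen in more than one module."""
    modules_by_key = defaultdict(set)
    for term, module in rows:
        modules_by_key[comparison_key(term, mode)].add(module)
    return {key for key, modules in modules_by_key.items() if len(modules) > 1}


# Placeholder rows: one exact duplicate, one that differs only in case.
rows = [
    ("Urosepsis", "A"),
    ("Urosepsis", "B"),
    ("brachial radiculitis", "A"),
    ("Brachial radiculitis", "B"),
]

exact_duplicates = duplicate_keys(rows, mode="exact")
caseless_duplicates = duplicate_keys(rows, mode="caseless")
```

With these rows the caseless comparison reports two duplicated terms and the exact comparison reports one, which is the kind of reduction described above.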

      Procedure

          [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 44 (0.4%)]

          NRC                                          Concepts
          SNOMED CT Netherlands NRC maintained         43
          US National Library of Medicine maintained   751
          Uruguay extension module                     176
          Canada Health Infoway English                1214
          SNOMED CT Sweden NRC maintained              483
          Danish                                       1
          SNOMED Clinical Terms Australian extension   960
          SNOMED CT United Kingdom clinical extension  9732

          Potential Concept Duplication

          16 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
          The affected concepts are in the following extensions.

          • SNOMED CT Netherlands NRC maintained module
          • US National Library of Medicine maintained module
          • SNOMED Clinical Terms Australian extension
          • SNOMED CT United Kingdom clinical extension module
          • Canada Health Infoway English module
          • SNOMED CT Sweden NRC maintained module

          These are the identified FSNs.

          Admission to nursing home (procedure)
          Alpha-1 microglobulin measurement (procedure)
          Autopsy planned (situation)
          Computed tomography of head, neck, abdomen and pelvis with contrast (procedure)
          Decompressive craniectomy (procedure)
          electrochemotherapy (procedure)
          Flexible ureteroscopy (procedure)
          Injection of platelet-rich plasma (procedure)
          Laparoscopic extended right hemicolectomy (procedure)
          Laparoscopic gastrectomy (procedure)
          Laparoscopic radical prostatectomy (procedure)
          Open reduction of fracture of ankle with internal fixation (procedure)
          Open repair of strangulated incisional hernia (procedure)
          Open repair of strangulated incisional hernia with prosthesis (procedure)
          Rigid ureteroscopy (procedure)
          Smoking assessment (procedure)


          Special concept

              [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 112 (1.5%)]

              NRC                                          Concepts
              SNOMED CT Netherlands NRC maintained         370
              US National Library of Medicine maintained   1823
              Uruguay extension module                     59
              Canada Health Infoway English                374
              SNOMED CT Sweden NRC maintained              254
              Danish                                       53
              SNOMED Clinical Terms Australian extension   308
              SNOMED CT United Kingdom clinical extension  4379

              Potential Concept Duplication

              15 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
              The affected concepts are in the following extensions.

              • SNOMED CT Netherlands NRC maintained module
              • US National Library of Medicine maintained module
              • SNOMED CT United Kingdom clinical extension module
              • Canada Health Infoway English module
              • SNOMED CT core module
              • Danish module
              • SNOMED Clinical Terms Australian extension

              These are the identified FSNs.

              5-Hydroxyhistamine (substance)
              Atypical atrial flutter (disorder)
              Autopsy planned (situation)
              Birch pollen (substance)
              Decompressive craniectomy (procedure)
              Deer dander (substance)
              Escherichia coli serogroup Orough (organism)
              Highlands j virus (organism)
              No diabetic retinopathy (situation)
              Subendocardial myocardial infarction (disorder)
              Suspected cerebrovascular accident (situation)
              Suspected sepsis (situation)
              Takotsubo cardiomyopathy (disorder)
              Thrombosis of internal jugular vein (disorder)
              Typical atrial flutter (disorder)

              The mix of semantic tags in this set suggests a possible issue with the transitive closure queries and the history of the "aggregate release". Further investigation is required.


              Situation with explicit context

                  [Pie chart: share of concepts per module]

                  NRC                                          Concepts
                  SNOMED CT Netherlands NRC maintained         71
                  US National Library of Medicine maintained   340
                  Uruguay extension module                     52
                  Canada Health Infoway English                376
                  SNOMED CT Sweden NRC maintained              194
                  SNOMED Clinical Terms Australian extension   114
                  SNOMED CT United Kingdom clinical extension  2877

                  Potential Concept Duplication

                  8 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                  The affected concepts are in the following extensions.

                  • SNOMED CT United Kingdom clinical extension module
                  • SNOMED CT Netherlands NRC maintained module
                  • SNOMED CT Sweden NRC maintained module
                  • US National Library of Medicine maintained module

                  These are the identified FSNs.

                  Autopsy planned (situation)
                  Family history of lactose intolerance (situation)
                  History of acute coronary syndrome (situation)
                  History of amaurosis fugax (situation)
                  History of supraventricular tachycardia (situation)
                  No diabetic retinopathy (situation)
                  Suspected cerebrovascular accident (situation)
                  Suspected sepsis (situation)


                  Observable entity

                      [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 92 (1.6%)]

                      NRC                                          Concepts
                      SNOMED CT Netherlands NRC maintained         20
                      US National Library of Medicine maintained   28
                      Uruguay extension module                     26
                      Canada Health Infoway English                3
                      SNOMED CT Sweden NRC maintained              62
                      Danish                                       11
                      SNOMED Clinical Terms Australian extension   4
                      SNOMED CT United Kingdom clinical extension  5538

                      Potential Concept Duplication

                      There are no FSNs duplicated across the content.
                      There are 476 duplicate synonyms across this set. The affected concepts are in the following extensions.

                      • SNOMED CT Netherlands NRC maintained module
                      • SNOMED CT core module
                      • Danish module
                      • SNOMED CT Sweden NRC maintained module
                      • SNOMED CT United Kingdom clinical extension module

                      Event

                          [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 19 (0.8%)]

                          NRC                                          Concepts
                          SNOMED CT Netherlands NRC maintained         2
                          US National Library of Medicine maintained   15
                          Uruguay extension module                     1
                          Danish                                       1
                          SNOMED CT United Kingdom clinical extension  2230

                          Potential Concept Duplication

                          No FSNs are duplicated across the content.
                          17 synonyms are duplicated. The affected concepts are in the following extensions.

                          • Danish module
                          • SNOMED CT core module
                          • SNOMED CT Sweden NRC maintained module
                          • SNOMED CT Netherlands NRC maintained module

                          Qualifier value

                              [Pie chart: share of concepts per module]

                              NRC                                          Concepts
                              SNOMED CT Netherlands NRC maintained         100
                              US National Library of Medicine maintained   217
                              Uruguay extension module                     104
                              Canada Health Infoway English                396
                              SNOMED CT Sweden NRC maintained              4
                              Danish                                       132
                              SNOMED Clinical Terms Australian extension   86
                              SNOMED CT United Kingdom clinical extension  2001

                              Potential Concept Duplication

                              28 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                              The affected concepts are in the following extensions.

                              • SNOMED CT United Kingdom clinical extension module
                              • Canada Health Infoway English module
                              • US National Library of Medicine maintained module

                              These are the identified FSNs.

                              Adolescent medicine service (qualifier value)
                              Adolescent psychiatry service (qualifier value)
                              Audiometry (qualifier value)
                              Cardiology service (qualifier value)
                              Chiropody (qualifier value)
                              Clinical allergy service (qualifier value)
                              Clinical genetics service (qualifier value)
                              Clinical immunology service (qualifier value)
                              Clinical neurophysiology service (qualifier value)
                              Dental hygiene service (qualifier value)
                              Dentistry service (qualifier value)
                              Dialysis service (qualifier value)
                              Endodontic service (qualifier value)
                              Genetics (qualifier value)
                              Health belief model (qualifier value)
                              Infectious diseases service (qualifier value)
                              Intensive care medicine (qualifier value)
                              Neonatal intensive care service (qualifier value)
                              Nephrology service (qualifier value)
                              Neurology service (qualifier value)
                              Physiotherapy service (qualifier value)
                              Prosthetics (qualifier value)
                              Prosthetics service (qualifier value)
                              Prosthodontic service (qualifier value)
                              Psychology (qualifier value)
                              Respiratory medicine service (qualifier value)
                              Respite care service (qualifier value)
                              Sports medicine (qualifier value)


                              Record artifact

                                  [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 5 (0.4%)]

                                  NRC                                          Concepts
                                  SNOMED CT Netherlands NRC maintained         202
                                  US National Library of Medicine maintained   4
                                  Uruguay extension module                     57
                                  SNOMED CT Sweden NRC maintained              1
                                  SNOMED CT United Kingdom clinical extension  1078

                                  Potential Concept Duplication

                                  8 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                                  The affected concepts are in the following extensions.

                                  • SNOMED CT Netherlands NRC maintained module
                                  • SNOMED CT United Kingdom clinical extension module

                                  These are the identified FSNs.

                                  Discharge letter (record artifact)
                                  Do not attempt cardiopulmonary resuscitation order (record artifact)
                                  Growth chart (record artifact)
                                  Letter (record artifact)
                                  Living will and advance directive record (record artifact)
                                  Medical photograph (record artifact)
                                  Referral letter (record artifact)
                                  Weight chart (record artifact)


                                  Social context

                                      [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 14 (1.7%)]

                                      NRC                                          Concepts
                                      SNOMED CT Netherlands NRC maintained         5
                                      US National Library of Medicine maintained   41
                                      Uruguay extension module                     56
                                      Canada Health Infoway English                56
                                      SNOMED CT Sweden NRC maintained              3
                                      Danish                                       21
                                      SNOMED Clinical Terms Australian extension   6
                                      SNOMED CT United Kingdom clinical extension  603

                                      Potential Concept Duplication

                                      5 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                                      The affected concepts are in the following extensions.

                                      • US National Library of Medicine maintained module
                                      • SNOMED Clinical Terms Australian extension
                                      • Canada Health Infoway English module

                                      These are the identified FSNs.

                                      Massage therapist (occupation)
                                      Maternal aunt (person)
                                      Maternal uncle (person)
                                      Paternal aunt (person)
                                      Paternal uncle (person)


                                      Substance

                                          [Pie chart: share of concepts per module; smaller modules are grouped as "Other": 7 (0.8%)]

                                          NRC                                          Concepts
                                          SNOMED CT Netherlands NRC maintained         4
                                          US National Library of Medicine maintained   418
                                          Canada Health Infoway English                292
                                          SNOMED CT Sweden NRC maintained              3
                                          Danish                                       166
                                          SNOMED Clinical Terms Australian extension   60
                                          SNOMED CT United Kingdom clinical extension  20

                                          Potential Concept Duplication

                                          15 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                                          The affected concepts are in the following extensions.

                                          • US National Library of Medicine maintained module
                                          • SNOMED Clinical Terms Australian extension
                                          • Canada Health Infoway English module
                                          • SNOMED CT core module
                                          • SNOMED CT United Kingdom clinical extension module

                                          These are the identified FSNs.

                                          5-Hydroxyhistamine (substance)
                                          Arugula (substance)
                                          Birch pollen (substance)
                                          Blue cheese (substance)
                                          Deer dander (substance)
                                          Flounder (substance)
                                          Ham (substance)
                                          Hickory nut (substance)
                                          Honeydew melon (substance)
                                          Jalapeno pepper (substance)
                                          Pneumococcal conjugate vaccine (product)
                                          Red onion (substance)
                                          Snail - dietary (substance)
                                          Tree nut (substance)
                                          White pepper (substance)
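
The duplicate check behind this list can be sketched in a few lines of Python. This is an illustration only, not the method used for the analysis: the `Description` tuple, the `duplicated_fsns` helper and the sample concept ids are hypothetical, and the pairing of sample terms with modules is invented for the example. The sketch groups active FSN terms and reports any term attached to more than one concept, together with the modules involved.

```python
from collections import defaultdict
from typing import NamedTuple

# RF2 typeId for "Fully specified name"
FSN_TYPE_ID = 900000000000003001


class Description(NamedTuple):
    concept_id: int
    module: str
    type_id: int
    term: str
    active: bool = True


def duplicated_fsns(descriptions):
    """Map each FSN used by more than one concept to the modules involved."""
    concepts_by_term = defaultdict(set)
    modules_by_term = defaultdict(set)
    for d in descriptions:
        if d.active and d.type_id == FSN_TYPE_ID:
            # Assumption: terms are compared case-insensitively.
            key = d.term.strip().lower()
            concepts_by_term[key].add(d.concept_id)
            modules_by_term[key].add(d.module)
    return {
        term: sorted(modules_by_term[term])
        for term, concepts in concepts_by_term.items()
        if len(concepts) > 1
    }


# Hypothetical sample rows; concept ids and module assignments are made up.
sample = [
    Description(1001, "US National Library of Medicine maintained module",
                FSN_TYPE_ID, "Arugula (substance)"),
    Description(2001, "Canada Health Infoway English module",
                FSN_TYPE_ID, "Arugula (substance)"),
    Description(3001, "SNOMED CT core module",
                FSN_TYPE_ID, "Ham (substance)"),
]
duplicates = duplicated_fsns(sample)
```

A term match of this kind is crude: it flags candidates for review and does not prove that two concepts mean the same thing.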


                                          Body structure

[Pie chart: Body structure concepts by NRC. US National Library of Medicine maintained: 21 (4.9%); SNOMED CT United Kingdom clinical extension: 393 (91.8%); Other: 14 (3.3%)]
                                              NRC Concepts
                                              SNOMED CT Netherlands NRC maintained 1
                                              US National Library of Medicine maintained 21
                                              Uruguay extension module 1
                                              Canada Health Infoway English 4
                                              SNOMED CT Sweden NRC maintained 2
                                              Danish 4
                                              SNOMED Clinical Terms Australian extension 2
                                              SNOMED CT United Kingdom clinical extension 393
                                              Potential Concept Duplication

                                              No FSNs duplicated across the content.

1,888 synonyms are duplicated across the content. The affected concepts are in the following extensions.

                                              • Danish module
                                              • SNOMED CT Sweden NRC maintained module
                                              • SNOMED CT core module
                                              • Lithuania
                                              • SNOMED Clinical Terms Australian extension
                                              • US National Library of Medicine maintained module
                                              • SNOMED CT United Kingdom clinical extension module

                                              Staging and scales

[Pie chart: Staging and scales concepts by NRC. SNOMED CT Sweden NRC maintained: 12 (2.2%); SNOMED CT United Kingdom clinical extension: 535 (96.2%); Other: 9 (1.6%)]
                                                  NRC Concepts
                                                  SNOMED CT Netherlands NRC maintained 1
                                                  US National Library of Medicine maintained 5
                                                  SNOMED CT Sweden NRC maintained 12
                                                  Danish 1
                                                  SNOMED Clinical Terms Australian extension 2
                                                  SNOMED CT United Kingdom clinical extension 535
                                                  Potential Concept Duplication

No FSNs are duplicated across the content.
                                                  464 synonyms are duplicated across the extensions. The affected concepts are in the following extensions.

                                                  • Danish module
                                                  • SNOMED CT core module
                                                  • SNOMED CT Sweden NRC maintained module
                                                  • SNOMED CT United Kingdom clinical extension module

                                                   

                                                  Pharmaceutical / biologic product

[Pie chart: Pharmaceutical / biologic product concepts by NRC. US National Library of Medicine maintained: 263 (36.5%); Canada Health Infoway English: 240 (33.3%); Danish: 141 (19.6%); SNOMED Clinical Terms Australian extension: 30 (4.2%); SNOMED CT United Kingdom clinical extension: 47 (6.4%)]
                                                      NRC Concepts
                                                      US National Library of Medicine maintained 263
                                                      Canada Health Infoway English 240
                                                      Danish 141
                                                      SNOMED Clinical Terms Australian extension 30
                                                      SNOMED CT United Kingdom clinical extension 47
                                                      Potential Concept Duplication

12 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                                                      The affected concepts are in the following extensions.

                                                      • Canada Health Infoway English module
                                                      • US National Library of Medicine maintained module
                                                      • SNOMED Clinical Terms Australian extension

                                                      These are the identified FSNs.

                                                      Arugula (substance)
                                                      Blue cheese (substance)
                                                      Flounder (substance)
                                                      Ham (substance)
                                                      Hickory nut (substance)
                                                      Honeydew melon (substance)
                                                      Jalapeno pepper (substance)
                                                      Pneumococcal conjugate vaccine (product)
                                                      Red onion (substance)
                                                      Snail - dietary (substance)
                                                      Tree nut (substance)
                                                      White pepper (substance)
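Most of these FSNs carry a (substance) semantic tag even though they were returned for the Pharmaceutical / biologic product hierarchy, so the semantic tags and the transitive queries clearly disagree. This may be a problem with the analysis or with the content.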
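
One way to investigate this kind of mismatch is to compare each FSN's semantic tag with the tag expected for the hierarchy being queried. The sketch below is a hypothetical illustration in Python: the helper names are invented, the tag is assumed to be the last parenthesised token of the FSN, and `product` is assumed to be the expected tag for this hierarchy.

```python
import re

# Assumption: the semantic tag is the last parenthesised token of the FSN.
_TAG = re.compile(r"\(([^()]+)\)\s*$")


def semantic_tag(fsn):
    """Return the trailing semantic tag of an FSN, or None if there is none."""
    match = _TAG.search(fsn)
    return match.group(1) if match else None


def tag_mismatches(fsns, expected_tags):
    """Return the FSNs whose semantic tag is not one of the expected tags."""
    return [fsn for fsn in fsns if semantic_tag(fsn) not in expected_tags]


# FSNs taken from the list above; the expected tag set is an assumption.
fsns = [
    "Arugula (substance)",
    "Pneumococcal conjugate vaccine (product)",
    "Snail - dietary (substance)",
]
unexpected = tag_mismatches(fsns, expected_tags={"product"})
```

Here `unexpected` holds the two (substance) FSNs, which is the pattern seen in the list above.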

                                                      Organism

[Pie chart: Organism concepts by NRC. SNOMED CT Netherlands NRC maintained: 73 (64.0%); US National Library of Medicine maintained: 24 (21.1%); Danish: 12 (10.5%); SNOMED CT United Kingdom clinical extension: 3 (2.6%); Other: 2 (1.8%)]
                                                          NRC Concepts
                                                          SNOMED CT Netherlands NRC maintained 73
                                                          US National Library of Medicine maintained 24
                                                          SNOMED CT Sweden NRC maintained 1
                                                          Danish 12
                                                          SNOMED Clinical Terms Australian extension 1
                                                          SNOMED CT United Kingdom clinical extension 3
                                                          Potential Concept Duplication

2 FSNs are duplicated across the content; these are almost certainly candidates for promotion.
                                                          The affected concepts are in the following extensions.

                                                          • US National Library of Medicine maintained module
                                                          • SNOMED CT core module
                                                          • Canada Health Infoway English module

                                                          These are the identified FSNs.

                                                          Escherichia coli serogroup Orough (organism)
                                                          Highlands j virus (organism)


                                                          Environment or geographical location

[Pie chart: Environment or geographical location concepts by NRC. US National Library of Medicine maintained: 11 (8.0%); Canada Health Infoway English: 4 (2.9%); Danish: 41 (29.9%); SNOMED CT United Kingdom clinical extension: 81 (59.2%)]
                                                              NRC Concepts
                                                              US National Library of Medicine maintained 11
                                                              Canada Health Infoway English 4
                                                              Danish 41
                                                              SNOMED CT United Kingdom clinical extension 81
                                                              Potential Concept Duplication

                                                              No FSNs duplicated across the content.
353 synonyms are duplicated across the extensions. The affected concepts are in the following extensions.

                                                              • Danish module
                                                              • SNOMED CT Sweden NRC maintained module
                                                              • SNOMED CT core module
                                                              • Lithuania

                                                              Specimen

[Pie chart: Specimen concepts by NRC. US National Library of Medicine maintained: 38 (40.4%); SNOMED CT United Kingdom clinical extension: 56 (59.6%)]
                                                                  NRC Concepts
                                                                  US National Library of Medicine maintained 38
                                                                  SNOMED CT United Kingdom clinical extension 56
                                                                  Potential Concept Duplication

                                                                  No FSNs duplicated across the content.
                                                                  Nine synonyms are duplicated. The affected concepts are in the following extensions.

                                                                  • Danish module
                                                                  • SNOMED CT Sweden NRC maintained module
                                                                  • SNOMED CT core module
                                                                  • US National Library of Medicine maintained module
                                                                  • SNOMED CT United Kingdom clinical extension module

                                                                  Physical object

[Pie chart: Physical object concepts by NRC. SNOMED CT Netherlands NRC maintained: 5 (8.5%); US National Library of Medicine maintained: 33 (55.9%); SNOMED CT Sweden NRC maintained: 1 (1.7%); SNOMED CT United Kingdom clinical extension: 20 (33.9%)]
                                                                      NRC Concepts
                                                                      SNOMED CT Netherlands NRC maintained 5
                                                                      US National Library of Medicine maintained 33
                                                                      SNOMED CT Sweden NRC maintained 1
                                                                      SNOMED CT United Kingdom clinical extension 20
                                                                      Potential Concept Duplication

No FSNs are duplicated across the content.
                                                                      399 synonyms are duplicated across the extensions. The affected concepts are in the following extensions.

                                                                      • Danish module
                                                                      • SNOMED CT Sweden NRC maintained module
                                                                      • SNOMED CT core module

                                                                      Physical force

Single concept: U-V radiation in diagnosis NOS (physical force)
                                                                       

                                                                      Extension Descriptions

Most of the description analysis was performed as part of identifying duplicates within concepts. Below is a summary of the translations, that is, extension descriptions for core concepts.

[Pie chart: translated concepts by NRC module. SNOMED CT Netherlands NRC maintained module: 1.1%; Canada Health Infoway French module: 5.3%; SNOMED CT Sweden NRC maintained module: 46.4%; Danish module: 39.6%; Lithuania: 6.6%; Other: 1%]

NRC Translated Concepts
SNOMED CT Netherlands NRC maintained module 6759
US National Library of Medicine maintained module 169
Uruguay extension module 227
Canada Health Infoway French module 34063
Canada Health Infoway English module 533
SNOMED CT Sweden NRC maintained module 298331
Danish module 254752
SNOMED Clinical Terms Australian extension 1929
Lithuania 42668
SNOMED CT United Kingdom clinical extension module 3799

[Pie chart: translated concepts by language. da: 39.6%; fr: 5.3%; lt: 6.6%; nl: 1.1%; sv: 46.4%; Other: 1%]

Language Concepts
da 254752
en 6327
es 227
fr 34063
lt 42668
nl 6756
sv 298331
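
A count of this kind can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the query used for the report: the function name, the row layout and the sample module ids are hypothetical. The two core module ids are the ones excluded in the SQL snippet later in this document. A concept counts as translated for a language if it is a core concept with at least one active description in a non-core module.

```python
from collections import defaultdict

# Core and model component module ids, as used in the SQL snippet in this document.
CORE_MODULES = {900000000000207008, 900000000000012004}

# Hypothetical extension module ids, for illustration only.
DA_MODULE, SV_MODULE = 1, 2


def translated_concepts_by_language(descriptions, core_concept_ids):
    """Count distinct core concepts with an active extension description, per language."""
    concepts = defaultdict(set)
    for concept_id, module_id, language_code, active in descriptions:
        if active and module_id not in CORE_MODULES and concept_id in core_concept_ids:
            concepts[language_code].add(concept_id)
    return {language: len(ids) for language, ids in concepts.items()}


# Rows are (conceptId, moduleId, languageCode, active); all values are made up.
rows = [
    (100, DA_MODULE, "da", True),
    (100, DA_MODULE, "da", True),            # second description, same concept
    (101, DA_MODULE, "da", True),
    (100, SV_MODULE, "sv", True),
    (102, SV_MODULE, "sv", False),           # inactive description, ignored
    (100, 900000000000207008, "en", True),   # core description, ignored
    (999, DA_MODULE, "da", True),            # extension concept, not a core concept
]
counts = translated_concepts_by_language(rows, core_concept_ids={100, 101, 102})
```

Counting distinct concepts rather than descriptions keeps a concept with several synonyms in one language from being counted more than once.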

                                                                              Extension Changes to Core Descriptions

178 International descriptions have been modified in some way by an extension. The associated modules are:

                                                                              • Australian common model component extension
                                                                              • SNOMED Clinical Terms Australian extension
                                                                              • US National Library of Medicine maintained module

                                                                              Relationship Extensions

5,136 core concepts have been changed within an extension. Some of these look like promotions; however, the majority do not appear to be.
                                                                               

[Bar chart: inferred relationships on core concepts, by extension module]

Extension Inferred Stated
Australian common model component extension (core metadata concept) 4 57
Canada Health Infoway English module (core metadata concept) 40 0
Danish module (core metadata concept) 12 6
Uruguay extension module (core metadata concept) 62 23
SNOMED Clinical Terms Australian extension (core metadata concept) 357 98
SNOMED CT Netherlands NRC maintained module (core metadata concept) 688 0
SNOMED CT Sweden NRC maintained module (core metadata concept) 17 0
SNOMED CT United Kingdom clinical extension module (core metadata concept) 2862 2848
US National Library of Medicine maintained module (core metadata concept) 1125 401

Note: Some of the numbers comparing stated and inferred look odd. This is likely a result of the crude aggregation of extensions and of some extension content already having been promoted to core.

                                                                                  Core relationships modified within an Extension

1,997 core relationships were modified by an extension, affecting 384 concepts.

                                                                                  A single concept, 425630003|Acute irritant contact dermatitis (disorder)| was modified by two NRCs.
                                                                                  Both inactivated all the relationships, but one recreated them in the subsequent release.
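
Overlaps like this one can be found by grouping the modified core relationships by source concept and keeping the concepts touched by more than one module. The Python sketch below is illustrative only: the function name and the module labels are hypothetical, and the input is assumed to be pairs of source concept id and modifying module that have already been extracted.

```python
from collections import defaultdict


def concepts_modified_by_multiple_modules(modified_rows):
    """Return {conceptId: [modules]} for concepts modified by more than one module."""
    modules_by_concept = defaultdict(set)
    for source_id, module in modified_rows:
        modules_by_concept[source_id].add(module)
    return {
        concept: sorted(modules)
        for concept, modules in modules_by_concept.items()
        if len(modules) > 1
    }


# Concept ids appear in this document; the module labels are placeholders.
rows = [
    (425630003, "module A"),
    (425630003, "module B"),
    (425630003, "module B"),  # several relationships from the same module
    (371040005, "module A"),
]
shared = concepts_modified_by_multiple_modules(rows)
```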

                                                                                  Other changes are summarised below.
                                                                                   

[Bar chart: core relationships changed, by NRC module]

NRC Relationships Changed
SNOMED CT Netherlands NRC maintained module 1570
Uruguay extension module 296
Canada Health Infoway English module 80
SNOMED CT Sweden NRC maintained module 16
Australian common model component extension 12
Danish module 8
SNOMED Clinical Terms Australian extension 15

                                                                                      Types of Relationships Modified

A large variety (43) of relationship types is involved in the edits. Most are IS A; some are not part of the approved concept model, or are attributes specific to an extension.
                                                                                       

[Bar chart: number of relationships by relationship type]

Relationship Type Number of relationships
After 12
Associated finding 69
Associated morphology 263
Associated procedure 50
Associated with 11
Causative agent 43
Clinical course 24
Component 1
Direct device 4
Direct morphology 17
Direct substance 1
Due to 24
Finding context 69
Finding site 331
Has active ingredient 3
Has definitional manifestation 17
Has dose form 2
Has focus 3
Has intent 11
Has interpretation 3
Has specimen 3
Interprets 27
Is a 2209
Method 90
MOVED FROM 32
Occurrence 94
Pathological process 37
PBCL flag true 2860
Procedure context 53
Procedure site 6
Procedure site - Direct 70
Procedure site - Indirect 3
Route of administration 1
Severity 7
Specimen source identity 1
Specimen source topography 1
Specimen substance 1
Subject relationship context 122
Surgical approach 1
Temporal context 122
Using access device 3
Using device 7
Using substance 6

-- Per extension module: number of active core concepts that have relationships in that module
select moduleId, count(distinct sourceId)
from X_Relationships
where moduleId not in (900000000000207008, 900000000000012004, 900062011000036108) -- exclude international + AMT
and sourceId in (select id from X_Concepts
                 where moduleId in (900000000000207008, 900000000000012004) and active = 1)
group by moduleId;
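
For readers who want to try the query above without a full RF2 load, the following Python sketch runs an equivalent query against an in-memory SQLite database. The table and column names follow the snippet; everything else (the sample rows, the extension module ids `EXT_A` and `EXT_B`, and the reduced table layout) is a hypothetical stand-in. The sketch also selects `moduleId`, so that each count is labelled with its module.

```python
import sqlite3

CORE = 900000000000207008    # core module
MODEL = 900000000000012004   # model component module
AMT = 900062011000036108     # excluded in the original query (AMT)
EXT_A, EXT_B = 1, 2          # hypothetical extension module ids

conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table X_Concepts (id integer, moduleId integer, active integer);
    create table X_Relationships (id integer, moduleId integer,
                                  sourceId integer, active integer);
""")
conn.executemany("insert into X_Concepts values (?, ?, ?)", [
    (10, CORE, 1), (11, CORE, 1), (12, CORE, 0), (20, EXT_A, 1),
])
conn.executemany("insert into X_Relationships values (?, ?, ?, ?)", [
    (1, EXT_A, 10, 1),
    (2, EXT_A, 10, 1),  # same source concept, counted once
    (3, EXT_A, 11, 1),
    (4, EXT_B, 10, 1),
    (5, EXT_A, 12, 1),  # source is an inactive core concept, ignored
    (6, EXT_A, 20, 1),  # source is an extension concept, ignored
    (7, CORE, 10, 1),   # core relationship, excluded
    (8, AMT, 10, 1),    # AMT relationship, excluded
])

rows = conn.execute("""
    select moduleId, count(distinct sourceId)
    from X_Relationships
    where moduleId not in (?, ?, ?)
      and sourceId in (select id from X_Concepts
                       where moduleId in (?, ?) and active = 1)
    group by moduleId
    order by moduleId
""", (CORE, MODEL, AMT, CORE, MODEL)).fetchall()
conn.close()
```

With the sample data, `rows` contains one entry per extension module: two core concepts for `EXT_A` and one for `EXT_B`.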

                                                                                          Comparison Examples - Published (inferred) relationships for Core concepts

                                                                                          Concept
                                                                                          Core
                                                                                          Extension
                                                                                          371040005

                                                                                          321000119108

Note: This example appears to be a promoted concept, but the local relationships have not been inactivated upon promotion. Examples such as this are a use case for promoting both stated and inferred relationships, so that the maintenance burden on NRCs is reduced and the authoring effort is recognised.

                                                                                          212385001

                                                                                          Additional Observations

The following observations are examples only and are by no means comprehensive.

                                                                                          Extensions vs Editions

                                                                                          Of the 9 releases looked at:

                                                                                          • Four publish Editions
                                                                                          • Three publish Extensions
                                                                                          • One publishes three separate extensions.
                                                                                          • One publishes an extension, "bundled" with the International Edition.

                                                                                          File naming

The file naming conventions do not appear to be consistent across the extensions.

• sct2_Concept_Snapshot_AU1000036_20161231.txt
• sct2_Concept_Snapshot_en-CanadianExtension_20161031.txt
                                                                                          • sct2_Concept_Snapshot_DK1000005_20161130.txt
                                                                                          • sct2_Concept_Snapshot_LT1000092_20151107.txt
                                                                                          • sct2_Concept_Snapshot_NL_20160930.txt
                                                                                          • sct2_Concept_Snapshot_SE1000052_20161130.txt
                                                                                          • sct2_Concept_Snapshot_GB1000000_20161001.txt
                                                                                          • sct2_Concept_Snapshot_US1000124_20160901.txt
                                                                                          • sct2_Concept_Snapshot_es-UruguayExtension_20161215.txt
                                                                                          • sct2_Concept_Snapshot_INT_20160731.txt
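
The inconsistency can be made explicit by splitting each name into its parts. The following is a minimal sketch, assuming the pattern sct2_<Content>_<ReleaseType>_<tag>_<YYYYMMDD>.txt as inferred from the names listed above (it is not taken from the technical specification). The style labels and the function name are illustrative only.

```python
import re
from collections import Counter

# Assumed pattern, inferred from the listed names:
#   sct2_<Content>_<ReleaseType>_<tag>_<YYYYMMDD>.txt
FILENAME = re.compile(
    r"^sct2_(?P<content>[A-Za-z]+)_(?P<release_type>Full|Snapshot|Delta)"
    r"_(?P<tag>.+)_(?P<date>\d{8})\.txt$"
)

# Two-letter country code followed by a 7-digit namespace identifier
NAMESPACE_TAG = re.compile(r"^[A-Z]{2}\d{7}$")


def classify(filename):
    """Return (tag, date, style) for a file name, or None if it does not
    fit the assumed pattern."""
    m = FILENAME.match(filename)
    if m is None:
        return None
    tag = m.group("tag")
    if NAMESPACE_TAG.match(tag):
        style = "country+namespace"
    elif tag == "INT":
        style = "international"
    else:
        style = "other"
    return tag, m.group("date"), style


LISTED = [
    "sct2_Concept_Snapshot_AU1000036_20161231.txt",
    "sct2_Concept_Snapshot_en-CanadianExtension_20161031.txt",
    "sct2_Concept_Snapshot_DK1000005_20161130.txt",
    "sct2_Concept_Snapshot_LT1000092_20151107.txt",
    "sct2_Concept_Snapshot_NL_20160930.txt",
    "sct2_Concept_Snapshot_SE1000052_20161130.txt",
    "sct2_Concept_Snapshot_GB1000000_20161001.txt",
    "sct2_Concept_Snapshot_US1000124_20160901.txt",
    "sct2_Concept_Snapshot_es-UruguayExtension_20161215.txt",
    "sct2_Concept_Snapshot_INT_20160731.txt",
]

# Tally how many listed names fall into each style
STYLE_COUNTS = Counter(classify(name)[2] for name in LISTED)
```

Applied to the ten names listed above, six use the country code plus namespace form, three use some other tag (a language-prefixed name or a bare country code), and one is the international release.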

                                                                                          Directory structure

Some variation was noticed in the directory structure within the published zip files.
Below are the paths to the snapshot concepts file in each release.

                                                                                          • \SnomedCT_Release_AU1000036_20161231\RF2Release\Snapshot\Terminology
                                                                                          • \SnomedCT_Canadian_EnglishExtension_Release_20161031\Snapshot\Terminology
                                                                                          • \SnomedCT_ManagedServiceDK_Production_DK1000005_20161130\Snapshot\Terminology
                                                                                          • \SnomedCT_RF2Release_LT1000092_20151107\Snapshot\Terminology
                                                                                          • \SnomedCT_Netherlands_EditionRelease_20160930\Snapshot\Terminology
                                                                                          • \SnomedCT_SE_Production_20161130T170000\Snapshot\Terminology
                                                                                          • \SnomedCT_RF2Release_GB1000000_20161001\Snapshot\Terminology
                                                                                          • \SnomedCT_RF2Release_US1000124_20160901\Snapshot\Terminology
                                                                                          • \SnomedCT_Uruguay_Extension_Release_20161215\Snapshot\Terminology
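
One way to make this variation visible is to list the folders that sit above Snapshot in each path. The following is a minimal sketch, assuming backslash-separated paths exactly as listed above. The function name is illustrative only.

```python
def folders_above_snapshot(path):
    """Return the folder names that precede 'Snapshot' in a
    backslash-separated path, outermost first."""
    parts = path.strip("\\").split("\\")
    return parts[: parts.index("Snapshot")]


# Two of the listed paths, for comparison
AU_PATH = r"\SnomedCT_Release_AU1000036_20161231\RF2Release\Snapshot\Terminology"
US_PATH = r"\SnomedCT_RF2Release_US1000124_20160901\Snapshot\Terminology"

AU_DEPTH = len(folders_above_snapshot(AU_PATH))  # root folder + RF2Release
US_DEPTH = len(folders_above_snapshot(US_PATH))  # root folder only
```

For the paths listed, only the first places an additional folder (RF2Release) between the root folder and Snapshot. The naming of the root folder itself also differs from release to release.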

                                                                                          Specific file inclusions

The international release includes 6 files within the "Terminology" folder: Concept, Description, Relationship, StatedRelationship, Identifier and TextDefinition.
These files are not consistently present in extensions.

                                                                                           
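Country       Concept  Description  Relationship  StatedRelationship  Identifier  TextDefinition
Australia     1        1            1             -                   1           -
Canada*       1        1            1             1                   -           1
Denmark**     1        2            1             1                   1           2
Lithuania     1        1            1             -                   -           -
Netherlands   1        1            1             -                   -           1
Sweden**      1        2            1             1                   1           -
UK            1        1            1             1                   -           -
USA           1        1            1             1                   1           1
Uruguay       1        1            1             1                   -           1

(Values are the number of files of each type; "-" marks a file that is not present.)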

* Canada includes a French and English bundle.
** Sweden and Denmark include both an English and a native-language Description file.

Denmark and the USA are the only countries to include all 6 files.
                                                                                          Further variations are present within the refset subdirectories.
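
A check of this kind is straightforward to automate. The following is a minimal sketch, assuming that every file in the Terminology folder begins with sct2_<Type>_ and that the six types are spelled as in the international release. The function names are illustrative, and any file names passed in are the caller's own.

```python
import re
from collections import Counter

# The six file types found in the international release
EXPECTED = ("Concept", "Description", "Relationship",
            "StatedRelationship", "Identifier", "TextDefinition")

# Leading "sct2_<Type>_" part of a file name (type assumed to be alphabetic)
PREFIX = re.compile(r"^sct2_([A-Za-z]+)_")


def file_type_counts(filenames):
    """Count how many files of each expected type appear in a listing."""
    counts = Counter()
    for name in filenames:
        m = PREFIX.match(name)
        if m and m.group(1) in EXPECTED:
            counts[m.group(1)] += 1
    return {file_type: counts[file_type] for file_type in EXPECTED}


def missing_types(filenames):
    """Return the expected file types that do not appear at all."""
    return [t for t, n in file_type_counts(filenames).items() if n == 0]
```

Run over the Terminology folder of each release, file_type_counts would yield one row of counts per release, and missing_types would name the files an extension does not ship.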

                                                                                          Miscellaneous QA issues

The description file for one extension was found to be missing the language code for 268 entries. (The country was notified and has rectified the issue.)

                                                                                          Conclusion

• The analysis described above reveals a wealth of information. There is evidence of duplicated content in almost every hierarchy, and the true extent is likely to be much greater given the primitive analysis techniques used.
• The whole SNOMED CT community would benefit from a process for promoting content through to core. This process should honour the identifiers issued by an extension builder, both to minimise the maintenance burden on the originating extension and to recognise its effort. The current process is prohibitive to promoting content; consequently, content is duplicated across extensions and the potential maintenance debt grows.
• Almost all NRCs are actively modifying core content, which demonstrates the importance of clarifying the issues raised in the Discussion Paper - Allowance of Extensions to Modify Core Content (SNOMED International Response). It seems Members have interpreted the license differently from the governing body, and this discrepancy has never been recognised.
• Variations exist in the artefacts published by NRCs. No comment is made about compliance with Technical Specifications, but such variances may affect the portability of software that consumes SNOMED CT. A certification/verification process asserting minimum conformance criteria would prove valuable.
• All the issues described here may equally apply to other (affiliate) extensions, but this will remain unknown without systematic investigation.
• The issues described are real, and Members are currently struggling to deal with them. Prolonging their resolution imposes a cost on all.

                                                                                          Endnotes

1. The author recalls a requirement that a US-spelling 'en' FSN be created for all concepts, but is unable to identify this in the current specification. Is this still a requirement?