Summary

SNOMED International propose to increase the maximum length of Fully Specified Name and Synonym descriptions from 255 characters to 4096 characters.   The requirement for this change has come from the Pharmaceutical / biologic product hierarchy, where the terming guidance for FSNs causes the current limit to be exceeded where there is a large number of ingredients in a medicinal product, which is particularly common in multivalent vaccines. 

There is some likelihood that implementers will have hardcoded database schemas to 255 characters, so a significant lead time is proposed before this change would take effect.   

Introduction

The RF2 Specification for SNOMED CT Descriptions states that the overall length limit for a description is 32Kb (understood to be Kilobits), equating to 4096 single byte characters.   This maximum length is then further restricted on a per description type basis, as specified in the Description Type Refset ( specifically: "to a maximum length, configurable for each description type as defined in the 900000000000538005 | Description format reference set| member associated with that description type - see the  Description Format Reference Set specifications document for more details." ).

At the October 2023 SNOMED International Business Meetings, the Modeling Advisory Group discussed a requirement (that had been brought up by the Editorial Advisory Group) to increase the limit for Fully Specified Name (FSN) and Synonym Descriptions from the current 255 characters, up to 4096 characters.    This represents the largest number of characters allowed by the specification, and would bring these two description types into line with the existing limit for the Text Definition description type.    The Modeling Group agreed that it would already be within the specification of RF2 to make this change just by increasing the size specified in the Description Type reference set, however, this would quite likely have an impact on implementers who may have created data storage structures that are not dynamically sized based on the values in the Description Type file.

Background and Rationale

Editorial Guidance for Medicinal Products, dictates that the FSN should include a concatenation of the preferred terms of all ingredients (in alphabetical order and separated by "and").   A worst case example of this currently active in the International Edition is  1162634005 |Pediatric vaccine product containing only acellular Bordetella pertussis, Clostridium tetani and Corynebacterium diphtheriae toxoids, Haemophilus influenzae type b conjugated, Hepatitis B virus and inactivated Human poliovirus antigens (medicinal product)|

which has been tweaked from what our automated systems would normally suggest, to fit in to the current limit of 255 characters.    If you look at the actual list of ingredients involved, it's clear that more characters would be required to correctly represent this product

Discussion on size limits, Editorial Guidance, and display of lengthy descriptions

The main intention with this change is to ensure that implementers allow for the maximum allowed description character length, but this does not necessarily mean that any particular release of SNOMED CT is going to feature large numbers of descriptions that get anywhere near this limit - they are expected to be quite exceptional.   In particular, this increase is intended to accommodate Fully Specified Names as their formation is often procedurally dictated by attribute values in the concept model.  Since user interfaces will generally display preferred terms, rather than FSNs, there is a low expectation of longer descriptions breaking current display formatting.   Also no use case for this length increase has yet been identified for the International Edition outside of the Medicinal Product Hierarchy.

That said, a 2024 release of the New Zealand extension is features this concept: 264031000210106 |Plasma cell neoplasm multiple myeloma relapsed systemic anti cancer therapy regimen using dexamethasone via oral route and bortezomib via subcutaneous route chemotherapy and daratumumab via subcutaneous route immunotherapy every three week followed by daratumab via subcutaneous route immunotherapy every four week (regime/therapy)| which clocks in at 331 characters!   So this proposed change is also required more widely.

This change in intended to ensure SNOMED CT remains flexible enough to accommodate future terming requirements.    By extending the 'physical' constraints on description lengths, we would suggest that the responsibility for setting reasonable term length limits be moved into Editorial Guidance and validation constraints presented to users as warnings, rather than enforcing this arbitrary limit at a database storage level.   For this reason, while limits of 512 and 1024 characters has been discussed, it was felt that - at some point in the future - we would have to revisit whatever limit was chosen, and therefore it would be expedient to just set the limit to be the largest number of characters currently allowed.

Proposed Changes to RF2

SNOMED International propose to modify the existing Description Type Refset such that all three description types would have a limit of 4096 characters.   Assuming this is done in time for the January 2026 release, this file would be named der2_ciRefset_DescriptionTypeSnapshot_INT_20250101.txt

ideffectiveTimeactivemoduleIdrefsetIdreferencedComponentIddescriptionFormatdescriptionLength
<UUID>2014013119000000000002070089000000000005380059000000000005500049000000000005400004096
<UUID>2026010119000000000002070089000000000005380059000000000000130099000000000005400004096
<UUID>2026010119000000000002070089000000000005380059000000000000030019000000000005400004096

Note that the line Text Definition is unchanged here since 20140131, having already been set to 4096 characters at that time.

Sample RF2 for testing

This delta zip archive contains the proposed changes to the Description Type Refset as well as a single description which is 331 characters in length - the FSN for Concept 1162634005 extended to conform to what our automated term generator would produce: "Pediatric vaccine product containing only acellular Bordetella pertussis antigen and Clostridium tetani toxoid antigen and Corynebacterium diphtheriae toxoid antigen and Haemophilus influenzae type b capsular polysaccharide conjugated antigen and Hepatitis B virus and inactivated whole Human poliovirus antigen (medicinal product)".    The effective time of the changes have been set to 20240505.    System administrators can try loading this file into test systems to check how the longer description length setting and usage will affect them.

xSnomedCT_InternationalRF2_TestDescLenDelta_20240505T120000Z.zip

Next Steps

SNOMED International request that users of SNOMED CT provide feedback on this proposal via this form. SNOMED International will respond to feedback received - on this page - until the end of this consultation exercise on 31 December 2024.  

Responses To Feedback Received

General Remarks

At time of writing (Oct 2024), 14 items of feedback have been received through the Google Form linked on this page.  By contrast, 34 items of feedback have been collated and returned - anonymised - by the UK NHS.  The feedback from the UK indicates a much higher level of concern about this proposal, and has a greater incidence of "Require more than 12 months to accommodate the proposed changes".   Most of the concerns addressed below relate to UK feedback.

Since this consultation started, another two extensions (Belgium and the LOINC Ontology), have found it necessary to create FSNs that exceed the current limit of 255 characters.   An example from LOINC is 

48642-3  486423 |562601010000101|  Volume rate per area of glomerular filtration rate in point in time in plasma or serum or whole blood by modification of diet in renal disease creatinine calculation formula relative to 1.73 square meters body surface area and adjusted for non-African race (observable entity)

As is the case in the Medicinal Products hierarchy, this FSN has been generated procedurally from the preferred terms coming from its modelled attributes.  It would be impossible to shorten this description without mis-representing the concept, leading to ambiguity between similar concepts eg the variant that is adjusted for 'African race'.

Process Transparency

Advice was received during the SNOMED International October Business Meetings 2024 that consultation feedback should be more visible to all concerned.  Publishing feedback received obviously requires careful consideration in terms of privacy and GDPR compliance, so SI will - in future - include additional checkboxes in feedback forms to ask respondents if they are willing to have their words published, and - separately - if they can be identified.    Specifically this request was made by Guillermo Reynoso (TermMed) and agreed with by Dion McMurtrie (AU) and Stuart Abbott (UK).

Choice of New Length Limit

Of the feedback received that expressed concerns about the proposed change, about half the feedback related to the fact that taking the limit from 255 characters to 4096 was too much.   To explain the choice of this new limit further:  the Modeling Advisory Group wanted to avoid the situation where some new limit were chosen - say 512 characters - having implementers go through the expense of making system changes, only to find ourselves in the situation of needing to revisit that decision at some point down the line.   It was felt to be preferable to pick the maximum value and to make it clear that, effectively, back end systems such as databases should not hard code field size limits.   The 4096 limit was thought to be sufficient to handle whatever might come along in the future.

The greatest danger - indeed, a medical risk - is what we see in existing systems where the current 255 limit has been coded into databases, such that descriptions are silently truncated when longer descriptions are encountered.  A key part of this change is to move the responsibility for enforcing description length limits into front end systems.   So the back end can store a longer description, and then the system can make a decision about what to do with that correctly stored value.    Something we have seen in systems is, for example, checks for excessive length descriptions fail to function, because only the first 255 characters of the description has been stored!

At 255 characters, an FSN of this length is already going to cause problems for User Interface designers.  Firstly, it is not expected that preferred terms would be become any longer than they are currently.   This change is only about making FSNs longer, and these would not generally be displayed to end users outside of some specific terminology function where such precision is required.   The suggestion is that unwieldily large descriptions should be dealt with using User Interface behaviour (eg tool tips) and Editorial Policy, not through hard coded limits in back end systems which can lead to unexpected consequences such as truncation.   Secondly, in the International Edition, there is no intention to use longer FSNs in any hierarchy outside of the Medicinal Products hierarchy.   We did not receive any feedback from implementers who expressed both a concern about the changes and also indicated that they currently display medicinal product concepts in user facing systems.

Conclusion

In response to concerns received during this consultation period, SNOMED International have agreed to delay the implementation of these changes to July 2026.



Page Contents


Feedback Form

SNOMED International request that users of SNOMED CT provide feedback on this proposal via this form. SNOMED International will respond to feedback received - on this page - until the end of this consultation exercise on 31 December 2024.  




Comments

    Add new comment