1 Introduction
IHTSDO SNOMED International passed a translation support policy for Members to assist in increasing translation and adoption of SNOMED CT.
1.1 Policy Assumptions:
- IHTSDO will SNOMED International will not assume managerial responsibility for any translations other than the existing Spanish translation
- The IP of any derivative work belongs to IHTSDOto SNOMED International
- The IP of any translated work belongs to IHTSDO and SNOMED International and is to be shared with all Members
- IHTSDO encourages SNOMED International encourages collaboration and will work to make national translation available to all Members
- Minimum core set should be translated for adoption purposes in non-English speaking Member countries
- The initial translation sets are provided in reference set (RF2) format. This means that the files contain conceptId's and descriptions ONLY
This document deals with the final assumption from the list above, that a minimum core set of terms will be made available, and that this set would be translated into a number of agreed languages to assist new implementers using SNOMED CT.
1.2 Release changes
This alpha release has been provided following some initial internal and external feedback. The changes to this release, compared to the April 2017 release are as follows:
- Removal of fully specified names (FSN)
- Update to Chinese descriptions following quality assurance feedback
- Update to French descriptions following quality assurance feedback
2 Background
2.1 Scope
The purpose of the release package is to provide a minimum core set of terms with a number of translations to assist in the implementation of SNOMED CT at a national level. The set is not intended to be a definitive list of required terms, but rather to provide a basic set of terms, which can then be extended in line with Member requirements. The primary use for the sets are to provide a view of SNOMED CT descriptions in languages which currently do not have a full SNOMED CT translation
2.2 Use cases:
The initial translation set and translation descriptions they contain, aim to assist end users to access SNOMED CT in languages other than English. This will enable a larger user base to begin to explore SNOMED CT by providing descriptions in four languages provided
3 Motivation
Translation is an essential part of ensuring the continued growth of SNOMED CT worldwide. The starter set and translations aim to support the increasing numbers of end-users who do not have English as their first language. The initial translation set does not attempt to provide a complete translation of SNOMED CT, but rather provide translations for commonly used terms.
4 Content
4.1 Minimum Core set (Starter set) creation
The minimum core set (initial translation set)) describes a set of terms identified as commonly used and high usage terms by existing SNOMED CT users. The set is not intended to be a comprehensive list, but rather to provide a defined list of common terms that will support new users of SNOMED CT. The approximate size of the minimum core set is 6,300 concepts.
4.2 Sources of content
To provide the content of the minimum core set, a number of sources have been identified. These are as follows:
▪ NLM top 2500 – 2,500 concepts
▪ Kaiser top 2500 – 2,500 concepts
▪ UK top 1000 terms used in Primary Care – 1,000 concepts
▪ IFP/GP subset – 4,346 concepts
NLM top 2500
Top 2500 problem list – created and maintained by the National Library of medicine (NLM). The list represents the top 2,500 diagnoses codes used in the United States
Kaiser top 2500
Top 2500 concepts used at Kaiser Permanente in the United States – created and maintained by the Kaiser Permanente (KP). The list includes SNOMED CT content created and maintained in the in the Kaiser Permanente SNOMED CT extension.
UK top 1000 terms used in Primary Care
Represent the top 1000 terms used in primary care in the UK. The usage figures represent the use of Read/SNOMED CT codes across multiple UK primary care system suppliers. The content includes mainly content created and maintained in the SNOMED CT UK extension.
IFP/GP (International Family Practice and General Practice) subset
Represents content used in primary care. The content originates from ICPC-2 (International Classification of primary care -2), maintained by the University of Sydney. The number of concepts is just under 4,500, and is reviewed on a release basis by the IFP/GP SIG.
4.3 Process for the initial creation of the initial translation set
The minimum core set is created through the identification of high use content from the specified data sources. The set uses the specified data sources, and any extension specific content is reconciled as specified in "Section 4.4". The list is then reconciled to remove duplicates, and the final reconciled list represents the minimum data set for publication.
4.4 Reconciliation of extension specific content
The use of the UK and Kaiser as sources of usage data creates an issue in the specification of an international set of terms, as both data sources include data from local SNOMED CT extensions. Whilst terms specified in extensions cannot be used internationally as they do not exist in the International Release, it is important that conceptually the clinical utterance is provided. For example in the UK extension, there is a concept “Activated partial thromboplastin time (observable entity)”. In the international release there is no equivalent term. But there are concepts representing findings related to “Activated partial thromboplastin time” which should be included in the minimum core set
The process to identify a clinically equivalent term in the International Release will be undertaken by a clinician, who is an experienced individual with extensive knowledge of SNOMED CT.
The approach to publication of the minimum core set is to publish the set with an agreed set of translations. The approach is aligned with the approach of both the United Nations (UN) and World Health Organization (WHO). The languages included represent a subset of the common global languages, and also target countries that are not yet members of the IHTSDO or SNOMED International or users of SNOMED CT.
4.5 Maintenance of the minimum core set
For the minimum core set to be viable and useful to users, it is essential that only current content is included. As the content of the International Release is updated currently on a six monthly basis, there may be content that becomes inactivated in the release, which can no longer be used in live systems. In terms of the minimum core set, any content that has been inactivated should be removed, to avoid confusion for potential users.
4.5.1 Release schedule
As the minimum core set, also includes translations and is based on usage data, it is not practical to update the set in line with the publication of the International Release. Updates should be undertaken on a two yearly basis, whereby new content can be identified from usage data, the content can be added to the minimum core set, and translations can be provided.
4.5.2 Regular maintenance
The regular maintenance of the minimum core set should be undertaken on a six monthly basis, in line with publication of the International Release. At this time, a difference report should be provided between the International Release and the minimum core set, where inactivated content is identified and removed from the set. In addition to the changes to the minimum core set, a change report will be produced documenting all removals and additions to the set since the previous formal released version.
4.5.3 Agreement of new terms for inclusion
On a two yearly basis, the minimum core set will be formally updated, and a new version published. At this time new content may be added to the set. New content will be identified from usage data, and reviewed through the internal governance process documented in "Section 5".
4.5.4 Agreement of language set
The criteria for agreeing these and future languages to be translated are as follows:
- Language use globally
- Interest in SNOMED CT
- IT infrastructure maturity in countries language is used
- Fiscal stability of the country – ability to pay
- A business case for SNOMED CT expansion
The proposed languages to be used initially are:
- German – Strong interest - used in multiple countries
- French – Strong interest – used in multiple countries
- Japanese – interest – volume
- Chinese – interest – volume
Future development of additional languages to be included should be based on the above criteria, and agreed through the governance process detailed further on in this document.
4.6 Process for translation
The IHTSDO does SNOMED International does not have access to translation expertise to allow high quality and timely delivery of translations into the required languages. The delivery of the required translations is therefore best delivered through a third party. There are a large number of providers of translations in the market; it is therefore suggested that we adopt an approach of publishing a request for proposals (RFP) to allow market forces to drive costs down. The process for publication and agreement of contract should take no longer than 3 weeks.
The contract for translation should include translation into the four identified languages, focusing on the preferred term. It should be noted that translations of synonyms are not recommended, as these tend to be driven by local user requirements and will not necessarily be of use, in addition to this SNOMED International has created a number of guidance documents relating to translation. These are available here
We acknowledge that there are language nuances relating to clinical terms that are sometimes difficult to identify. It is therefore recommended that where possible we attempt to identify a clinician fluent in the required language to review the translated product as a final quality assurance step.
4.7 Maintenance of the minimum core set
For the minimum core set to be viable and useful to users, it is essential that only current content is included. As the content of the International Release is updated currently on a six monthly basis, there may be content that becomes inactivated in the release, which can no longer be used in live systems. In terms of the minimum core set, any content that has been inactivated should be removed, to avoid confusion for potential users.
4.7.1 Release schedule
As the minimum core set, also includes translations and is based on usage data, it is not practical to update the set in line with the publication of the International Release. Updates should be undertaken on a two yearly basis, whereby new content can be identified from usage data, the content can be added to the minimum core set, and translations can be provided.
4.7.2 Regular maintenance
The regular maintenance of the minimum core set should be undertaken on a six monthly basis, in line with publication of the International Release. At this time, a difference report should be provided between the International Release and the minimum core set, where inactivated content is identified and removed from the set. In addition to the changes to the minimum core set, a change report will be produced documenting all removals and additions to the set since the previous formal released version.
4.7.3 Agreement of new terms for inclusion
On a two yearly basis, the minimum core set will be formally updated, and a new version published. At this time new content may be added to the set. New content will be identified from usage data, and reviewed through the internal governance process documented in "Section 5".
4.7.4 Updating of translations
When the updated version of the minimum core set has been agreed, the new terms for addition to the set will be identified from the change report. The list of new additions will then be identified within a statement of work, and the translation work outsourced to an external contractor, either through the publication of an RFP or through a formal contract with an existing supplier.
4.8 Quality assurance
The translations provided within this release have all undergone technical quality assurance, and were all created by clinicians fluent in each language. A secondary quality assurance has been undertaken on the French and Chinese translations by clinicians fluent in each language. The resulting feedback following the quality assurance has been included in the release files. The German and Japanese quality assurance are still underway, and will be included in the next release of the Initial Translation set.
5 Design
5.1 Version
The dependent version of SNOMED CT is the July 2017 International Edition release.
The initial translation set and translations are provided as an alpha release. The translations have undergone some quality assurance, but we would welcome feedback from end-users on the translations specifically. The initial translation set has been created as specified in "4 Content", and will be updated on a two yearly basis, based on the availability of usage figures from the sources specified.
5.2 Ownership
The initial translation set and translations are owned and updated by SNOMED International
5.3 Frequency
The production version of the SNOMED CT® Initial Translation Set Translation package will be published after the July 2017 International Edition release of SNOMED CT, based on the Alpha feedback content changes. Subsequently it will be maintained in line with each release of SNOMED CT International Edition.
6 Implementation
The initial translation set and translation descriptions can be used to provide a view of the concepts and descriptions. But we would recommend that the initial translation set and translation descriptions MUST be implemented with the complete SNOMED International release, thereby providing access to the relationships.
6.1 Implementation guidance
The release package includes the following:
- Starter set (Refset files)
- French language (Description and language files)
- German language (Description and language files)
- Chinese language (Description and language files)
- Japanese language (Description and language files)
We suggest that the implementation of the files should be in the following order:
- Implementation of the dependent International Release
- Implementation of the initial translation set files
- Implementation of the required Description and Language files
By following the order above, there should be no untoward occurrences linked to the implementation of the files
Please note that the number of records in the language and description files differ between languages. This is due to the addition of synonyms specific to the individual languages identified in the translation process, and therefore the number added will differ between languages sets.
Please note that in the initial translation set there are no language-en and description-en files, as all the required information will be found in the dependent International Release
7 Obtaining the Release
The SNOMED CT® Initial Translation Set package is available from SNOMED International and is maintained and updated in line with the SNOMED CT International release which is currently 6 monthly.
Access within IHTSDO member SNOMED International member countries is provided by the Member National Release Centre in each country, via the relevant Member page. Affiliates of IHTSDO in SNOMED International in non-member countries can access the refset through their Member Licensing and Distribution SERVICE (MLDS) account. Please contact info@snomed.org for more information if required.
8 Feedback
As stated in "5 Design", the Initial starter set and translations are provided as an Alpha release, and SNOMED International is actively seeking feedback on the translations specifically. Feedback can be provided through the SNOMED International Reset/Translation tool https://refset.ihtsdotools.org/#/refset, or through SNOMED International Helpdesk info@snomed.org, by marking the correspondence "Initial Translation Set".
PDF: Force Page Break |
---|
RF2 package format
The RF2 package convention dictates that it contains all relevant files, regardless of whether or not there is content to be included in each particular release. Therefore, the package contains a mixture of files which contain both header rows and content data, and also files that are intentionally left blank (including only a header record). The reason that these files are not removed from the package is to draw a clear distinction between:
- ...files that have been deprecated (and therefore removed from the package completely), due to the content no longer being relevant to RF2 in this or future releases, and
- ...files that just happen to contain no data in this particular release (and are therefore included in the package but left blank, with only a header record), but are still relevant to RF2, and could therefore potentially contain data in future releases.
This allows users to easily distinguish between files that have purposefully been removed or not, as otherwise if files in option 2 above were left out of the package it could be interpreted as an error, rather than an intentional lack of content in that release.
PDF: Hide in PDF Export | |||||||||
---|---|---|---|---|---|---|---|---|---|
Approvals
Download .pdf here:
|
PDF: Hide in PDF Export | ||||||||
---|---|---|---|---|---|---|---|---|
Draft Amendment History
|