ARCHIVED 3.3.4. Establishing a Monolingual Base List

 

Archived Content

Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please contact us to request a format other than those available.

Consult the Pavel in ...

Español Português Italiano Nederlands العربية

Previous page Next page

To create a monolingual base list, you must scan for terms all the texts that provide information about one or more of the concepts in the concept system under study. This means that you will highlight terminology units and note their contexts (sentences, paragraphs).

When term extraction is performed in more than one source dealing with the same topic, the lists resulting from the sources scanned can be merged so that more information is readily available for conceptual analysis and so the best textual supports can be selected for entry on the terminology record about a given concept.

In order to note authentic usage, it is recommended that original-language sources in the source and target languages be scanned for terms first, before any translated sources are scanned. You may be able to begin matching (apparently) equivalent terms at this stage, but conceptual analysis and further research are required to confirm your suppositions.

This list shows the bilingual matching of extracted terms.

thermoluminescent dosimeter; TLD

Bureau of Radiation and Medical Devices

radiation

Thermoluminescent Dosimetry Service

dosimetry

whole body dose

skin dose

inner plaque

plaque holder

lithium fluoride thermoluminescent chip

lithium fluoride

chip

thermoluminescent chip

electron

TLD reader

dosimètre thermoluminescent; DTL; dosimètre DTL

Bureau de la radioprotection et des instruments médicaux

 

service de dosimétrie thermoluminescent

dosimétrie

 

 

plaque intérieure

porte-plaque

cristal thermoluminescent au fluorure de lithium

fluorure de lithium

cristal

cristal thermoluminescent

électron

lecteur de DTL

The term extraction lists often include terms that belong to other subject fields or that designate concepts missing in your original diagram of the concept system. In addition to terminology units, automatic term-extraction software outputs a great deal of "noise" (that is, pseudo-terminological expressions or components that accidentally occur together in discourse but that do not designate concepts). By briefly reviewing the term’s contexts, you can eliminate noise, set aside terms that belong to other subject fields, and create a more complete graphical representation of the concept system through the insertion of missing concepts into the concept diagram.

The terminological base list is the list of terms created through term extraction. It contains all of the terms that you will be assigning to "nodes" of the concept diagram in order to group the textual supports by concept. In comparative terminology, the concept system is used to establish a terminological base list for each of the languages and is the main benchmark for matching base lists. A base list is usually subject to further terminological research.

Sometimes, your assignment may be to collect the terminology used in the bilingual documentation of a department or company. In this case, you can carry out bilingual term extraction in order to concurrently identify the terms and their contexts in both the source and target languages. In some terminology services, the translated equivalents are checked for authenticity by comparing them with the terms identified during term extraction in original-language texts. Such verification is not always possible in translation services where terminology files store source language terms and their target language equivalents without their textual supports or source references. Authentication usually involves further research work.

Term extraction not only reveals terminology units, but also co-occurrents of the terms. Collocations (two or more terms that typically appear together in a given subject field) show how the term is used in specialized discourse. This information is often recorded in the phraseologisms component of the record.

You can also use the terms that you identified during term extraction to perform a diagnosis of a database. The list of terms can be queried in order to determine the response rate of terms found in the database or file that is to be evaluated.

This is a base list retained following verification of the text on which automatic term extraction was performed.

actual dose

annual dose limit

average annual whole-body dose

average worker whole-body dose

Canadian Environmental

Assessment Agency

CANDU reactor

collective worker dose

concrete canister facility

conversion facility

dioxide pellet

dry-fuel storage

dry storage

estimated radiation dose

fuel bundle

fuel fabrication

mill maintenance worker

mill production worker

msv whole-body dose

occupational dose limit

prescribed substance

public dose limit

quarterly limit

radiation dose limit

regulatory dose limit

whole-body dose