TERMIUM Plus®

The Government of Canada’s terminology and linguistic data bank.

TOKENISER [2 records]

Record 1 2020-04-30

English

Subject field(s)
  • Computer Programs and Programming
  • Computer Processing of Language Data
  • Lexicology, Lexicography, Terminology
  • Applications of Automation
DEF

A software program that … determines boundaries for individual tokens [such as words, numbers or punctuation] in text.

Key term(s)
  • tokeniser

French

Domaine(s)
  • Programmes et programmation (Informatique)
  • Informatisation des données linguistiques
  • Lexicologie, lexicographie et terminologie
  • Automatisation et applications

Spanish

Save record 1

Record 2 2003-12-12

English

Subject field(s)
  • Computer Programs and Programming
  • Programming Languages
DEF

A utility program that translates lines of ... source code ...

OBS

In general, tokenising a text means merely identifying word forms and sentences. However, in highly technical documents such as the Unix man pages, this may become a formidable task. Apart from regular word forms, the ... tokeniser has to recognise ... Path names and absolute file names: /usr/bin/X11; usr/5bin/ls, /etc/hostname.le ...

French

Domaine(s)
  • Programmes et programmation (Informatique)
  • Langages de programmation

Spanish

Save record 2

Copyright notice for the TERMIUM Plus® data bank

© Public Services and Procurement Canada, 2024
TERMIUM Plus®, the Government of Canada's terminology and linguistic data bank
A product of the Translation Bureau

Features

Language Portal of Canada

Access a collection of Canadian resources on all aspects of English and French, including quizzes.

Writing tools

The Language Portal’s writing tools have a new look! Easy to consult, they give you access to a wealth of information that will help you write better in English and French.

Glossaries and vocabularies

Access Translation Bureau glossaries and vocabularies.

Date Modified: