TERMIUM Plus®
The Government of Canada’s terminology and linguistic data bank.
JOC CORPUS [1 record]
Record 1 - internal organization data 2001-03-09
Record 1, English
Record 1, Subject field(s)
- Titles of Documents and Works
- Language (General)
Record 1, Main entry term, English
- MULTEXT JOC Corpus
1, record 1, English, MULTEXT%20JOC%20Corpus
correct
Record 1, Abbreviations, English
Record 1, Synonyms, English
Record 1, Textual support, English
Record number: 1, Textual support number: 1 OBS
This CD-ROM contains a part of the corpus developed in the MULTEXT project financed by the European Commission (LRE 62-050). This part contains raw, tagged and aligned data from the Written Questions and Answers of the Official Journal of the European Community. The corpus contains approx. 5 million words in English, French, German, Italian and Spanish (approx. 1 million words per language). About 800,000 words were grammatically tagged and manually checked for English, French, Italian and Spanish, i.e. roughly 200,000 words per language. The same subset for French, German, Italian and Spanish was aligned to English at the sentence level. 1, record 1, English, - MULTEXT%20JOC%20Corpus
Record 1, Key term(s)
- JOC Corpus
Record 1, French
Record 1, Domaine(s)
- Titres de documents et d'œuvres
- Linguistique (Généralités)
Record 1, Main entry term, French
- Corpus JOC MULTEXT
1, record 1, French, Corpus%20JOC%20MULTEXT
correct
Record 1, Abbreviations, French
Record 1, Synonyms, French
Record 1, Textual support, French
Record number: 1, Textual support number: 1 OBS
Ce corpus contient une partie du corpus développé dans le cadre du projet MULTEXT financé par la Commission européenne (LRE 62-050). Cette partie comprend des données brutes, étiquetées et alignées des questions écrites et des réponses du Journal Officiel de la Communauté Européenne. Ce corpus contient environ 5 millions de mots en allemand, anglais, espagnol, français et italien (env. 1 million de mots par langue). Près de 800 000 mots ont été étiquetés grammaticalement et vérifiés manuellement pour l'anglais, le français, l'italien et l'espagnol (env. 200 000 mots par langue). Le même sous-ensemble pour le français, l'allemand, l'italien et l'espagnol a été aligné à l'anglais au niveau de la phrase. 1, record 1, French, - Corpus%20JOC%20MULTEXT
Record 1, Key term(s)
- Corpus JOC
Record 1, Spanish
Record 1, Textual support, Spanish
Copyright notice for the TERMIUM Plus® data bank
© Public Services and Procurement Canada, 2024
TERMIUM Plus®, the Government of Canada's terminology and linguistic data bank
A product of the Translation Bureau
Features
Language Portal of Canada
Access a collection of Canadian resources on all aspects of English and French, including quizzes.
Writing tools
The Language Portal’s writing tools have a new look! Easy to consult, they give you access to a wealth of information that will help you write better in English and French.
Glossaries and vocabularies
Access Translation Bureau glossaries and vocabularies.
- Date Modified: