tlCorpus is a Corpus Query Software application (concordance software). Features: easy to use, fully internationalized (full Unicode support), a "collocation summary" feature, multi-core utilization, automatic language and encoding detection, corpus statistics, large file support (files greater than 4GB), and support for BCP 47 language tags. Free download of tlCorpus Concordance 6.0, size 12.16 Mb.
Sassoon® Cambridge Joiner has been designed for simplicity of use. Joined text is created by running the program, setting your Preferences, entering your text in an Edit window and clicking 'Join'. The program calculates the joins and presents them for Saving, Printing or Copying. Now 'Copy' the text and switch to your. Free download of Sassoon Cambridge Joiner 1 3, size 8.21 Mb.
The Arabic Corpus, compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ), is composed of Arabic texts for text categorization. The corpus Khaleej-2004 contains 5690 documents. It is divided into 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use. Freeware download of Arabic Corpus 2004, size 14.42 Mb.
Construction of a Chinese-Korean bilingual corpus and search technology. Some auto-alignment programs and a search engine based on jung seong are provided. Freeware download of Chinese-Korean Bilingual Corpus 1.0, size 476.68 Kb.
WordMetry is a powerful tool for analysis of word statistics, stylometrics, author identification, corpus linguistics, opinion poll, media focus, and prediction. It supports web-based text retrieval and analysis as well as traditional locally-based static text statistics.
Multi-threads support easy, simple and fast download and. Free download of WordMetry 1 55, size 97.49 Mb.
Xaira is the current name for a new version of SARA, the text searching software originally developed at OUCS for use with the British National Corpus. This new version has been entirely re-written as a general purpose XML search engine, which will operate on any corpus of well-formed XML documents. It is however best used with TEI-conformant. Freeware download of xaira 1.25.8.23, size 0 b.
Unitex is a corpus processing system, based on automata-oriented technology. The concept of this software was born at LAD, under the direction of its director, Maurice Gross. With this tool, you can handle electronic resources such as electronic dictionaries and grammars and apply them. You can work at the levels of morphology, the lexicon and. Freeware download of Unitex 2.0, size 31.97 Mb.
This program is an advanced dictionary with dynamic word search. It is very simple to use: just type the word in the combo box and the program will show the words it finds. The most recently typed words are shown in the combo box list. For each character you type, you will see different word options. You can copy and paste. Free download of Cambridge Advanced Learner's 2.0.2.146, size 0 b.
Collocation Extract is designed to provide a list of collocations in the corpus. Users can search for collocates of a particular word in the range of 2 to 5 words, or search for all collocations of two-word chunks. Three statistical methods, namely Dunning's Log Likelihood, (pointwise) Mutual Information, and Chi-square, are used in this. Freeware download of Collocation Extract 3 7, size 2.37 Mb.
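As a rough illustration of the three association measures named in the Collocation Extract entry, the Python sketch below scores one adjacent word pair from a 2x2 contingency table. It is not Collocation Extract's own code; the function name `bigram_scores` and the toy sentence are assumptions made for this example.

```python
import math
from collections import Counter

def bigram_scores(tokens, w1, w2):
    """Return (PMI, log-likelihood G2, chi-square) for the adjacent pair (w1, w2).

    Hypothetical helper for illustration only.
    """
    bigrams = list(zip(tokens, tokens[1:]))
    n = len(bigrams)
    counts = Counter(bigrams)

    o11 = counts[(w1, w2)]                                   # w1 followed by w2
    r1 = sum(c for (a, _), c in counts.items() if a == w1)   # w1 in first slot
    c1 = sum(c for (_, b), c in counts.items() if b == w2)   # w2 in second slot

    # Observed cells of the 2x2 table and their row/column totals.
    observed = [o11, r1 - o11, c1 - o11, n - r1 - c1 + o11]
    rows = [r1, r1, n - r1, n - r1]
    cols = [c1, n - c1, c1, n - c1]
    # Expected counts under independence of the two slots.
    expected = [r * c / n for r, c in zip(rows, cols)]

    # Pointwise mutual information (base 2) of the pair.
    pmi = math.log2(o11 / expected[0]) if o11 else float("-inf")
    # Dunning's log-likelihood ratio statistic (G2).
    g2 = 2 * sum(o * math.log(o / e) for o, e in zip(observed, expected) if o > 0)
    # Pearson's chi-square statistic.
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return pmi, g2, chi2

# Toy usage on a made-up sentence.
tokens = "strong tea strong tea weak tea strong coffee".split()
print(bigram_scores(tokens, "strong", "tea"))
```

Higher values on any of the three measures suggest the pair co-occurs more often than chance would predict; on very small counts like these the scores are not statistically meaningful.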
No serious language specialist today works without the aid of Corpus Query Software to readily and rapidly study actual language usage. The tlCorpus Corpus Query Software brings the efficiency and professionalism of the TLex Lexicography Software to corpus work.
FEATURES:
· Easy-to-use
· Fully internationalized (full. Free download of tlCorpus for Mac OS X 6.1.0.395, size 12.16 Mb.
Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts. Freeware download of Bitextor 3.1.0, size 207.79 Kb.
This code is a parser that reads the CALBC corpus into Java objects. Freeware download of CALBC 1.0, size 13.78 Kb.
Labeled, segmented corpus of Italian digits, suitable for speech recognition and phonetic recognition experiments. Freeware download of corpuscifre 1.0, size 24.64 Mb.
An open-source corpus analysis class library written in C#. The GUI of Tenka Text 0.1.3 comes with Wordlister, an advanced, extremely fast graphical wordlist tool, and a simple regex concordance tool. Tenka Text - the open-source answer to WordSmith Tools. Freeware download of Corsis (formerly Tenka Text) 0.1.3.4, size 724.73 Kb.
Cunei is a data-driven machine translation system that builds dynamic, statistical models based on instances of known translations found in a corpus. Freeware download of Cunei Machine Translation Platform 2.0, size 181.73 Kb.
DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists. Freeware download of DWDS/Dialing Concordance 2.0.1, size 1.34 Mb.
Emdros is a corpus query system for storage and retrieval of linguistic analyses of text. It is especially applicable in corpus linguistics dealing with syntax, morphology, phonology, and/or discourse. It is also a generally useful text database engine. Freeware download of Emdros 3.3.0, size 8.74 Mb.
Get1T is a tool for filtering through the massive quantity of data available in the Web 1T corpus and extracting only the counts you need - including for simple wildcard patterns. Freeware download of Get 1T 0.3, size 28.81 Kb.
This project presents a system that extracts, from a corpus of documents, information about a subject area and a collection of pedagogical components. This information is packaged into fine-grained learning objects (metadata included). Freeware download of LookIng4LO 1.0, size 30.15 Mb.
TaCo is a tasty Palm application that enables you to use the Tanaka Corpus on your handheld. The Tanaka Corpus is a collection of Japanese/English sentence pairs that a student of the Japanese language can use as a source of example sentences. Freeware download of Palm TaCo 1.0, size 10.18 Mb.
Protects historic buildings in Cambridge, Massachusetts, marks historic sites, advises owners of historic buildings on preservation, researches and publishes on the city's architectural history.
An amateur society fieldwalking, excavating, and surveying within the Cambridge area. Recent activities, future program.
Cambridge, England based environmental and developmental economics consulting group offers services in economic and social impact studies, privatisation and enterprise reform.
Cambridge Circuit Company Web Site. Site describes printed circuit manufacturing capabilities, equipment and delivery times.
Corporate site. Offers products and programs, such as the Cambridge Diet, for weight loss.
Serving co-ops in and around Cambridge, UK. Includes a directory of co-ops and other social enterprises, sorted by name and business area.
A regional medical center with 86 licensed beds and a large multi-specialty clinic located in Cambridge.