Book
French
ID: <
10670/1.6zi07g>
Abstract
this volume contains the Acts of the 9th International Days of Statistical Analysis of Textile Data (JADT 2008), which took place from 12 to 14 March 2008 in Lyon. Every two years, since 1990, the JADT brings together researchers working in the various fields involved in automatic and statistical processing of textual data. Statisticians, linguists, sociologists, speech analysts, IT specialists, text mining specialists present their results, compare their tools and experiences; they submit and discuss innovative practical proposals such as state-of-the-art theoretical developments. After the meetings in Barcelona (1990), Montpellier (1993), Rome (1995), Nice (1998), Lausanne (2000), Saint-Malo (2002), Louvain-la-Neuve (2004) and Besançon (2006), the 2008 edition of the conference was an opportunity to launch a call for communications on the following non-exhaustive topics: — Textometry, textual statistics — Exploratory analysis of textual data — Text corpus, textual and hypertextual representations — Corpus Linguistic — Automatic processing of natural language: labelling, labelling, linguistic enrichment — Statistical analysis of answers to open questions — Text mining — Classification of texts, lexical and textual mapping — Documentary research, information research — Tool editing of digital texts — Software for textual analysis — Methodology and practices for analysing text corpus — Training in methods and tools for analysis of text corpus On the 140 submissions received (in the four working languages: French, Italian, English, Spanish), 76 oral communications and 26 displayed. Each tender has been reviewed by at least two proofreaders. Of the communications selected for the first evaluation, 24 % were subject to a second review. The oral communications were finally held in session on the following topics: — Lexical units, segmentation — lemmatisation and annotation — Cooccurrence — sequentiality and textual structure — lexical classification — Categorisation of texts — Visualisation, evaluation — Data model, architecture — Parallel corpus — Critical publishing — Alignment — New forms of textuality — Political corpus — Oral corpus — Surveys and undertakings — Surveys and society — Interviews — Methodology — Detailed position — Style, diachronia — lexical sequence — Terminology and translation — Data sheet — Information search