ID |
Title |
Authors |
4 |
Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation |
Danillo Rocha and Ivandré Paraboni |
7 |
Simple Large-scale Relation Extraction from Unstructured Text |
Christos Christodoulopoulos and Arpit Mittal |
9 |
Crowdsourced Multimodal Corpora Collection Tool |
Patrik Jonell, Catharine Oertel, Dimosthenis Kontogiorgos, Jonas Beskow and Joakim Gustafson |
10 |
BlogSet-BR: A Brazilian Portuguese Blog Corpus |
Henrique Santos, Vinicius Woloszyn and Renata Vieira |
11 |
Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus |
Abbas Ghaddar and Philippe Langlais |
12 |
Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges |
Halidanmu Abudukelimu, Abudoukelimu Abulizi, Boliang Zhang, Xiaoman Pan, Di Lu, Heng Ji and Yang Liu |
13 |
A Multilingual Approach to Question Classification |
Aikaterini-Lida Kalouli, Katharina Kaiser, Annette Hautli-Janisz, Georg A. Kaiser and Miriam Butt |
20 |
Discourse Coherence Through the Lens of an Annotated Text Corpus: A Case Study |
Eva Hajicova and Jiří Mírovský |
24 |
The Nautilus Speaker Characterization Corpus: Speech Recordings and Labels of Speaker Characteristics and Voice Descriptions |
Laura Fernández Gallardo and Benjamin Weiss |
25 |
SMILE Swiss German Sign Language Dataset |
Sarah Ebling, Necati Cihan Camgöz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss |
27 |
Building Parallel Monolingual Gan Chinese Dialects Corpus |
Fan Xu, Mingwen Wang and Maoxi Li |
28 |
Evaluation of Automatic Formant Trackers |
Florian Schiel and Thomas Zitzelsberger |
29 |
The Boarnsterhim Corpus: A Bilingual Frisian-Dutch Panel and Trend Study |
Marjoleine Sloos, Eduard Drenth and Wilbert Heeringa |
30 |
JESC: Japanese-English Subtitle Corpus |
Reid Pryzant, Denny Britz, Youngjoo Chung and Dan Jurafsky |
31 |
Building a Corpus for Personality-dependent Natural Language Understanding and Generation |
Ricelli Ramos, Georges Neto, Barbara Silva, Danielle Monteiro, Ivandré Paraboni and Rafael Dias |
32 |
Dataset for the First Evaluation on Chinese Machine Reading Comprehension |
Yiming Cui, Ting Liu, Zhipeng Chen, Wentao Ma, Shijin Wang and Guoping Hu |
33 |
Creating a Verb Synonym Lexicon Based on a Parallel Corpus |
Zdenka Uresova, Eva Fucikova, Eva Hajicova and Jan Hajic |
34 |
A Multi-Domain Framework for Textual Similarity. A Case Study on Question-to-Question and Question-Answering Similarity Tasks |
Amir Hazem, Basma El Amel Boussaha and Nicolas Hernandez |
35 |
Definite Description Lexical Choice: taking Speaker's Personality into account |
Alex Lan and Ivandré Paraboni |
36 |
Word Embedding approach for Synonym Extraction of Multi-Word Terms |
Amir Hazem, Béatrice Daille and Damien Cram |
37 |
A FrameNet for Cancer Information in Clinical Narratives: Schema and Annotation |
Kirk Roberts, Anshul Gandhi, Yuqi Si and Elmer Bernstam |
39 |
Referring Expression Generation in time-constrained communication |
André Mariotti and Ivandré Paraboni |
40 |
Coreference Resolution in FreeLing 4.0 |
Montserrat Marimón, Lluís Padró and Jordi Turmo |
41 |
Design and Development of Speech Corpora for Air Traffic Control Training |
Luboš Šmídl, Jan Švec, Daniel Tihelka, Jindrich Matousek, Jan Romportl and Pavel Ircing |
43 |
MOCCA: Measure of Confidence for Corpus Analysis - Automatic Reliability Check of Transcript and Automatic Segmentation |
Thomas Kisler and Florian Schiel |
45 |
DeepTC – An Extension of DKPro Text Classification for Fostering Reproducibility of Deep Learning Experiments |
Tobias Horsmann and Torsten Zesch |
46 |
LIdioms: A Multilingual Linked Idioms Data Set |
Diego Moussallem, Mohamed Ahmed Sherif, Diego Esteves, Marcos Zampieri and Axel-Cyrille Ngonga Ngomo |
48 |
BiLSTM-CRF for Persian Named-Entity Recognition ArmanPersoNERCorpus: the First Entity-Annotated Persian Dataset |
Hanieh Poostchi, Ehsan Zare Borzeshi and Massimo Piccardi |
49 |
SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts |
Thomas Proisl |
50 |
An application for building a Polish telephone speech corpus |
Bartosz Ziółko, Piotr Żelasko, Ireneusz Gawlik, Tomasz Pędzimąż and Tomasz Jadczyk |
52 |
Baselines and Test Data for Cross-Lingual Inference |
Željko Agić and Natalie Schluter |
54 |
Annotating Modality Expressions and Event Factuality for a Japanese Chess Commentary Corpus |
Suguru Matsuyoshi, Hirotaka Kameko, Yugo Murawaki and Shinsuke Mori |
55 |
A Fast and Accurate Vietnamese Word Segmenter |
Dat Quoc Nguyen, Dai Quoc Nguyen, Thanh Vu, Mark Dras and Mark Johnson |
57 |
Using Discourse Information for Education with a Spanish-Chinese Parallel Corpus |
Shuyuan Cao and Harritxu Gete |
58 |
Disambiguation of Verbal Shifters |
Michael Wiegand, Sylvette Loda and Josef Ruppenhofer |
60 |
A 2nd Longitudinal Corpus for Children's Writing with Enhanced Output for Specific Spelling Patterns |
Kay Berkling |
61 |
A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set |
Ian Wood, John Philip McCrae, Vladimir Andryushechkin and Paul Buitelaar |
62 |
TF-LM: TensorFlow-based Language Modeling Toolkit |
Lyan Verwimp, Hugo Van hamme and Patrick Wambacq |
66 |
A Recorded Debating Dataset |
Shachar Mirkin, Michal Jacovi, Tamar Lavee, Hong-Kwang Kuo, Samuel Thomas, Leslie Sager, Lili Kotlerman, Elad Venezian and Noam Slonim |
67 |
CPJD Corpus: Crowdsourced Parallel Speech Corpus of Japanese Dialects |
Shinnosuke Takamichi and Hiroshi Saruwatari |
69 |
BKTreebank: Building a Vietnamese Dependency Treebank |
Kiem-Hieu Nguyen |
70 |
Finite-state morphological analysis for Gagauz |
Francis Tyers, Sevilay Bayatli and Güllü Karanfil |
72 |
Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task |
Juyeon Kang and Jungyeul Park |
73 |
GeCoTagger: Annotation of German Verb Complements with Conditional Random Fields |
Roman Schneider and Monica Fürbacher |
75 |
Training and Adapting Multilingual NMT for Less-resourced and Morphologically Rich Languages |
Matīss Rikters, Rihards Krišlauks and Mārcis Pinnis |
78 |
Network Features Based Co-hyponymy Detection |
Abhik Jana and Pawan Goyal |
81 |
WorldTree: A Corpus of Explanation Graphs for Elementary Science Questions |
Peter Jansen, Elizabeth Wainwright, Steven Marmorstein and Clayton Morrison |
83 |
Annotating Attribution Relations in Arabic |
Amal Alsaif, Tasniem Alyahya, Madawi Alotaibi, Huda Almuzaini and Abeer Algahtani |
84 |
BASHI: A Corpus of Wall Street Journal Articles Annotated with Bridging Links |
Ina Roesiger |
85 |
A German Corpus for Fine-Grained Named Entity Recognition and Relation Extraction of Traffic and Industry Events |
Leonhard Hennig, Martin Schiersch, Veselina Mironova, Maximilian Schmitt and Philippe Thomas |
86 |
Extending the gold standard for a lexical substitution task: is it worth it? |
Ludovic Tanguy, Cécile Fabre and Laura Rivière |
88 |
A Corpus Study and Annotation Schema for Named Entity Recognition of Business Products |
Saskia Schön, Veselina Mironova and Leonhard Hennig |
89 |
Albanian Part-of-Speech Tagging: Gold Standard and Evaluation |
Besim Kabashi and Thomas Proisl |
90 |
A First South African Corpus of Multilingual Code-switched Soap Opera Speech |
Ewald van der Westhuizen and Thomas Niesler |
92 |
Collecting Code-Switched Data from Social Media |
Gideon Mendels, Victor Soto, Aaron Jaech and Julia Hirschberg |
93 |
Linking, Searching, and Visualizing Entities in Wikipedia |
Marcus Klang and Pierre Nugues |
94 |
Learning to Map Natural Language Statements into Knowledge Base Representations for Knowledge Base Construction |
Chin-Ho Lin, Hen-Hsen Huang and Hsin-Hsi Chen |
95 |
Bootstrapping Polar-Opposite Emotion Dimensions from Online Reviews |
Luwen Huangfu and Mihai Surdeanu |
96 |
Construction of a Japanese Word Similarity Dataset |
Yuya Sakaizawa and Mamoru Komachi |
98 |
Incorporating Semantic Attention in Video Description Generation |
Natsuda Laokulrat, Naoaki Okazaki and Hideki Nakayama |
100 |
All-words Word Sense Disambiguation Using Concept Embeddings |
Rui Suzuki, Kanako Komiya, Masayuki Asahara, Minoru Sasaki and Hiroyuki Shinnou |
101 |
English-Basque Statistical and Neural Machine Translation |
Inigo Jauregi Unanue, Lierni Garmendia Arratibel, Ehsan Zare Borzeshi and Massimo Piccardi |
102 |
IPSL: A Database of Iconicity Patterns in Sign Languages. Creation and Use |
Vadim Kimmelman, Anna Klezovich and George Moroz |
104 |
Multilingual Parallel Corpus for Global Communication Plan |
Kenji Imamura and Eiichiro Sumita |
105 |
A Web Service for Pre-segmenting Very Long Transcribed Speech Recordings |
Nina Poerner and Florian Schiel |
106 |
Teanga: A Linked Data based platform for Natural Language Processing |
Housam Ziad, John Philip McCrae and Paul Buitelaar |
107 |
Transc&Anno: A Graphical Tool for the Transcription and On-the-Fly Annotation of Handwritten Documents |
Nadezda Okinina, Lionel Nicolas and Verena Lyding |
108 |
A Real-life, French-accented Corpus of Air Traffic Control Communications |
Estelle Delpech, Marion Laignelet, Christophe Pimm, Céline Raynal and Michal Trzos |
110 |
Introducing a Lexicon of Verbal Polarity Shifters for English |
Marc Schulder, Michael Wiegand, Josef Ruppenhofer and Stephanie Köser |
111 |
DeModify: A Dataset for Analyzing Contextual Constraints on Modifier Deletion |
Vivi Nastase, Devon Fritz and Anette Frank |
112 |
Enhancing Modern Supervised Word Sense Disambiguation Models by Semantic Lexical Resources |
Stefano Melacci, Achille Globo and Leonardo Rigutini |
114 |
Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods |
Vivi Nastase and Julian Hitschler |
115 |
Evaluating the WordsEye Text-to-Scene System: Imaginative and Realistic Sentences |
Morgan Ulinski, Bob Coyne and Julia Hirschberg |
116 |
Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering |
Olga Majewska |
118 |
Constructing High Quality Sense-specific Corpus and Word Embedding via Unsupervised Elimination of Pseudo Multi-sense |
Haoyue Shi, Xihao Wang, Yuqi Sun and Junfeng Hu |
119 |
Undersampling Improves Hypernymy Prototypicality Learning |
Koki Washio and Tsuneaki Kato |
121 |
TQ-AutoTest – An Automated Test Suite for (Machine) Translation Quality |
Vivien Macketanz, Renlong Ai and Aljoscha Burchardt |
125 |
Morphology Injection for English-Malayalam Statistical Machine Translation |
Sreelekha S and Pushpak Bhattacharyya |
126 |
Sentiment-Stance-Specificity (SSS) Dataset: Identifying Support-based Entailment among Opinions |
Pavithra Rajendran, Danushka Bollegala and Simon Parsons |
127 |
A «Portrait» Approach to Multichannel Discourse |
Andrej Kibrik and Olga Fedorova |
129 |
Exploiting Pre-Ordering for Neural Machine Translation |
Yang Zhao, Jiajun Zhang and Chengqing Zong |
131 |
Open Subtitles Paraphrase Corpus for Six Languages |
Mathias Creutz |
133 |
Grapheme-level Awareness in Word Embeddings for Morphologically Rich Languages |
Suzi Park and Hyopil Shin |
135 |
Portable Spelling Corrector for a Less-Resourced Language: Amharic |
Andargachew Mekonnen Gezmu, Andreas Nürnberger and Binyam Ephrem Seyoum |
137 |
Improved Transcription and Indexing of Oral History Interviews for Digital Humanities Research |
Michael Gref, Joachim Köhler and Almut Leh |
139 |
Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages |
Gyu Hyeon Choi, Jong Hun Shin and Young Kil Kim |
141 |
Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank |
Deniz Zeyrek, Amália Mendes and Murathan Kurfalı |
142 |
Creating dialect sub-corpora by clustering: a case in Japanese for an adaptive method |
Yo Sato and Kevin Heffernan |
144 |
ScholarGraph: a Chinese Knowledge Graph of Chinese Scholars |
Shuo Wang, Zehui Hao, Xiaofeng Meng and Qiuyue Wang |
145 |
Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases |
Maria Moritz and David Steding |
146 |
Resource Creation Towards Automated Sentiment Analysis in Telugu (a low resource language) and Integrating Multiple Domain Sources to Enhance Sentiment Prediction |
Rama Rohit Reddy Gangula and Radhika Mamidi |
147 |
Building a Macro Chinese Discourse Treebank |
Xiaomin Chu and Feng Jiang |
148 |
Urdu Word Embeddings |
Samar Haider |
149 |
Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks |
Mohammed Attia, Younes Samih, Ali Elkahky and Laura Kallmeyer |
150 |
The Morpho-syntactic Annotation of Animacy for a Dependency Parser |
Mohammed Attia, Vitaly Nikolaev and Ali Elkahky |
153 |
Chinese Relation Classification using Long Short Term Memory Networks |
Linrui Zhang and Dan Moldovan |
154 |
Automatic Annotation of Semantic Term Types in the Complete ACL Anthology Reference Corpus |
Anne-Kathrin Schumann and Héctor Martínez Alonso |
157 |
CogCompNLP: Your Swiss Army Knife for NLP |
Daniel Khashabi, Mark Sammons, Christos Christodoulopoulos, Bhargav Mangipudi, Tom Redman, Ben Zhou, Guanheng Luo, Shaoshi Ling and Dan Roth |
158 |
Incorporating Global Contexts into Sentence Embedding for Relational Extraction at the Paragraph Level without Labeled Data |
Eun-kyung Kim and Key-Sun Choi |
159 |
Word Embedding Evaluation Datasets and Wikipedia Title Embedding for Chinese |
Chi-Yen Chen and Wei-Yun Ma |
160 |
A Large Self-Annotated Corpus for Sarcasm |
Mikhail Khodak, Nikunj Saunshi and Kiran Vodrahalli |
161 |
When ACE met KBP: End-to-End Evaluation of Knowledge Base Population with Component-level Annotation |
Bonan Min, Marjorie Freedman, Roger Bock and Ralph Weischedel |
163 |
Dynamic Oracle for Neural Machine Translation in Decoding Phase |
Zi-Yi Dou, Hao Zhou, Shu-Jian Huang, Xin-Yu Dai and Jia-Jun Chen |
164 |
Seq2Tree: A Tree-Structured Extension of LSTM Network |
Weicheng Ma, Kai Cao, Zhaoheng Ni, Peter Chin and Xiang Li |
166 |
Development of a Mobile Observation Support System for Students: FishWatchr Mini |
Masaya Yamaguchi, Masanori Kitamura and Naomi Yanagida |
167 |
What Causes the Differences in Communication Styles? A Multicultural Study on Directness and Elaborateness |
Juliana Miehle, Wolfgang Minker and Stefan Ultes |
168 |
Expert Evaluation of a Spoken Dialogue System in a Clinical Operating Room |
Juliana Miehle, Nadine Gerstenlauer, Daniel Ostler, Hubertus Feußner, Wolfgang Minker and Stefan Ultes |
172 |
Enhancing the AI2 Diagrams Dataset Using Rhetorical Structure Theory |
Tuomo Hiippala |
174 |
Cross-Lingual Generation and Evaluation of a Wide-Coverage Lexical Semantic Resource |
Attila Novák and Borbála Siklósi |
178 |
SACR: A Drag-and-Drop Based Tool for Coreference Annotation |
Bruno Oberle |
179 |
JAIST Annotated Free Conversation Corpus |
Kiyoaki Shirai and Tomotaka Fukuoka |
180 |
Classifying Sluice Occurrences in Dialogue |
Austin Baird, Anissa Hamza and Daniel Hardt |
182 |
Word Sense Disambiguation based on Automatically Induced Synsets |
Dmitry Ustalov, Denis Teslenko, Alexander Panchenko, Mikhail Chernoskutov and Chris Biemann |
183 |
Deep Neural Networks for Coreference Resolution for Polish |
Bartłomiej Nitoń, Paweł Morawiecki and Maciej Ogrodniczuk |
185 |
An Automatic Learning of an Algerian Dialect Lexicon by using Multilingual Word Embeddings |
Karima Abidi and Kamel Smaili |
186 |
The Metalogue Debate Trainee Corpus: Data Collection and Annotations |
Volha Petukhova, Andrei Malchanau, Youssef Oualil, Dietrich Klakow, Saturnino Luz, Fasih Haider, Nick Campbell, Dimitris Koryzis, Dimitris Spiliotopoulos, Pierre Albert, Nicklas Linz and Jan Alexandersson |
187 |
A Corpus for Modeling Word Importance in Spoken Dialogue Transcripts |
Sushant Kafle and Matt Huenerfauth |
188 |
Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it |
Andrei Malchanau, Volha Petukhova and Harry Bunt |
190 |
Building a Knowledge Graph from Natural Language Definitions for Text Entailment Recognition |
Vivian Silva, Siegfried Handschuh and André Freitas |
192 |
MYCanCor: A Video Corpus of spoken Malaysian Cantonese |
Andreas Liesenfeld |
193 |
ANKO – A Picture Postcard Corpus: Transcription, Annotation and Part-of-Speech Tagging |
Kyoko Sugisaki |
194 |
AET: Web-based Adjective Exploration Tool for German |
Tatiana Bladier, Esther Seyffarth, Oliver Hellwig and Wiebke Petersen |
195 |
One Sentence One Model for Neural Machine Translation |
Xiaoqing Li, Jiajun Zhang and Chengqing Zong |
196 |
Towards Processing of the Oral History Interviews and Related Printed Documents |
Zbynek Zajic, Lucie Skorkovska, Petr Neduchal, Pavel Ircing, Josef V. Psutka, Ales Prazak, Daniel Soutner, Jan Švec, Lukas Bures and Ludek Muller |
201 |
Open ASR for Icelandic: Resources and a Baseline System |
Anna Björk Nikulásdóttir, Inga Rún Helgadóttir, Matthías Pétursson and Jón Guðnason |
203 |
Automatic Prediction of Discourse Connectives |
Eric Malmi, Daniele Pighin, Sebastian Krause and Mikhail Kozhevnikov |
204 |
HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments |
Akari Asai, Sara Evensen, Behzad Golshan, Alon Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan and Yinzhan Xu |
208 |
The UIR Uncertainty Corpus: Annotating Chinese Microblog Corpus for Uncertainty Identification from Social Media |
Binyang Li, Jun Xiang, Le Chen, Tengjiao Wang and Kam-Fai Wong |
209 |
Annotating High-Level Structures of Short Stories and Personal Anecdotes |
Boyang Li, Beth Cardier, Tong Wang and Florian Metze |
210 |
Sentence Level Temporality Detection using an Implicit Time-sensed Resource |
Sabyasachi Kamila, Asif Ekbal and Pushpak Bhattacharyya |
212 |
Mapping Texts to Scripts: An Entailment Study |
Simon Ostermann, Hannah Seitz, Stefan Thater and Manfred Pinkal |
213 |
EventWiki: A Knowledge Base of Major Events |
Tao Ge, Lei Cui, Baobao Chang, Zhifang Sui and Ming Zhou |
214 |
Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters |
Marijn Schraagen, Feike Dietz and Marjo van Koppen |
215 |
Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl |
Alexander Panchenko, Eugen Ruppert, Stefano Faralli, Simone Paolo Ponzetto and Chris Biemann |
217 |
MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification |
Jeremy Barnes, Toni Badia and Patrik Lambert |
218 |
Text Annotation Graphs: Annotating Complex Natural Language Phenomena |
Angus Forbes, Kristine Lee, Gus Hahn-Powell, Marco A. Valenzuela-Escarcega and Mihai Surdeanu |
220 |
A Fast and Flexible Webinterface for Dialect Research in the Low Countries |
Roeland van Hout, Nicoline van der Sijs, Erwin Komen and Henk van den Heuvel |
221 |
A Speaking Atlas of the Regional Languages of France |
Philippe Boula de Mareüil, Albert Rilliard and Frédéric Vernier |
222 |
Candidate Ranking for Maintenance of an Online Dictionary |
Claire Broad, Helen Langone and David Guy Brizan |
223 |
The AnnCor CHILDES Treebank |
Jan Odijk, Alexis Dimitriadis, Martijn van der Klis, Marjo van Koppen, Meie Otten and Remco van der Veen |
224 |
Unsupervised Korean Word Sense Disambiguation using CoreNet |
Kijong Han, Sangha Nam, Jiseong Kim, Younggyun Hahm and Key-Sun Choi |
225 |
MCScript: A Novel Dataset for Assessing Machine Comprehension Using Script Knowledge |
Simon Ostermann, Ashutosh Modi, Michael Roth, Stefan Thater and Manfred Pinkal |
226 |
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions |
Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A Farrugia, Claudia Borg, Kenneth Camilleri, Mike Rosner and Lonneke van der Plas |
227 |
Language adaptation experiments via cross-lingual embeddings for related languages |
Serge Sharoff |
228 |
Semantic Equivalence Detection: Are Interrogatives Harder than Declaratives? |
João Rodrigues, Chakaveh Saedi, António Branco and João Silva |
229 |
A New Corpus to Support Text Mining for the Curation of Metabolites in the ChEBI Database |
Matthew Shardlow, Nhung Nguyen, Gareth Owen, John McNaught and Sophia Ananiadou |
231 |
Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish) |
Mateusz Lango, Magda Sevcikova and Zdeněk Žabokrtský |
232 |
Tools for Building an Interlinked Multilingual Synonym Lexicon Network |
Zdenka Uresova, Eva Fucikova, Eva Hajicova and Jan Hajic |
234 |
Improving Hypernymy Extraction with Distributional Semantic Classes |
Alexander Panchenko, Dmitry Ustalov, Stefano Faralli, Simone Paolo Ponzetto and Chris Biemann |
237 |
Arabic Dialect Identification in the Context of Bivalency and Code-Switching |
Mahmoud El-Haj, Paul Rayson and Mariam Aboelezz |
238 |
CEFR-based Lexical Simplification |
Satoru Uchida, Shohei Takada and Yuki Arase |
241 |
QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation |
Kordula De Kuthy, Nils Reiter and Arndt Riester |
242 |
A Corpus of Metaphor Novelty Scores for Syntactically-Related Word Pairs |
Natalie Parde and Rodney Nielsen |
243 |
Investigating the Influence of Bilingual MWU on Trainee Translation Quality |
Yu Yuan and Serge Sharoff |
244 |
JFCKB: Japanese Feature Change Knowledge Base |
Tetsuaki Nakamura and Daisuke Kawahara |
245 |
Cross-lingual Terminology Extraction for Translation Quality Estimation |
Yu Yuan, Yuze Gao, Yue Zhang and Serge Sharoff |
246 |
Very Large-Scale Lexical Resources to Enhance Chinese and Japanese Machine Translation |
Jack Halpern |
247 |
Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation |
Mika Hasegawa, Tetsunori Kobayashi and Yoshihiko Hayashi |
248 |
Manzanilla: An Image Annotation Tool for TKB Building |
Arianne Reimerink and Pilar León-Araúz |
249 |
WordKit: a Python Package for Orthographic and Phonological Featurization |
Stephan Tulkens, Dominiek Sandra and Walter Daelemans |
250 |
UFSAC: Unification of Sense Annotated Corpora and Tools |
Loïc Vial, Benjamin Lecouteux and Didier Schwab |
252 |
Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data |
Christopher Tauchmann, Thomas Arnold, Andreas Hanselowski, Christian M. Meyer and Margot Mieskes |
253 |
Classifying the Informative Behaviour of Emoji in Microblogs |
Giulia Donato and Patrizia Paggio |
254 |
MIsA: Multilingual "IsA" Extraction from Corpora |
Stefano Faralli, Els Lefever and Simone Paolo Ponzetto |
256 |
Content-Based Conflict of Interest Detection on Wikipedia |
Udochukwu Orizu and Yulan He |
258 |
Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data |
Askars Salimbajevs |
259 |
Interoperability of Language-related Information: Mapping the BLL Thesaurus to Lexvo and Glottolog |
Vanya Dimitrova, Christian Fäth, Christian Chiarcos, Heike Renner-Westermann and Frank Abromeit |
262 |
A Framework for the Needs of Different Types of Users in Multilingual Semantic Enrichment |
Jan Nehring and Felix Sasaki |
263 |
LOaDing: Adding Distributional-Semantics Features to Framester |
Stefano Faralli, Alexander Panchenko, Chris Biemann and Simone Paolo Ponzetto |
266 |
VSFC - Vietnamese Students' Feedback Corpus for Sentiment Analysis |
Kiet Nguyen, Vu Duc, Phu Nguyen, Tham Truong and Ngan Nguyen |
267 |
Towards a common dataset for research on conceptual pacts and alignment in task-oriented dialogue |
Todd Shore, Theofronia Androulakaki and Gabriel Skantze |
268 |
Evaluation of Domain-specific Word Embeddings using Knowledge Resources |
Farhad Nooralahzadeh, Lilja Øvrelid and Jan Tore Lønning |
270 |
Collection of Multimodal Dialog Data and Analysis of the Result of Annotation of Users' Interests |
Masahiro Araki, Sayaka Tomimasu, Mikio Nakano, Kazunori Komatani, Shogo Okada, Shinya Fujie and Hiroaki Sugiyama |
271 |
Annotating Chinese Light Verb Constructions according to PARSEME guidelines |
Menghan Jiang, Natalia Klyueva, Hongzhi Xu and Chu-Ren Huang |
272 |
Korean L2 Vocabulary Prediction: Can a Large Annotated Corpus be Used to Train Better Models for Predicting Unknown Words? |
Kevin Yancey and Yves Lepage |
274 |
Multi-layer Annotation of the Rigveda |
Oliver Hellwig, Heinrich Hettrich, Ashutosh Modi and Manfred Pinkal |
275 |
A New Annotated Portuguese/Spanish Corpus for the Multi-Sentence Compression Task |
Elvys Linhares Pontes, Juan-Manuel Torres-Moreno, Stéphane Huet and Andréa Carneiro Linhares |
276 |
Universal Dependencies Version 2 for Japanese |
Masayuki Asahara, Hiroshi Kanayama, Takaaki Tanaka, Yusuke Miyao, Sumire Uematsu, Shinsuke Mori, Yuji Matsumoto, Mai Omura and Yugo Murawaki |
277 |
Browsing and Supporting Pluricentric Global Wordnet, or just your Wordnet of Interest |
António Branco, Ruben Branco, Chakaveh Saedi and João Silva |
278 |
Annotating Spin in Biomedical Scientific Publications: the case of Random Controlled Trials (RCTs) |
Anna Koroleva and Patrick Paroubek |
279 |
Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach |
Sunayana Sitaram, Varun Manjunath, Varun Bharadwaj, Monojit Choudhury, Kalika Bali and Michael Tjalve |
281 |
Japanese Simplified Corpus with Core Vocabulary |
Takumi Maruyama and Kazuhide Yamamoto |
282 |
eSCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing |
Matteo Negri, Marco Turchi, Rajen Chatterjee and Nicola Bertoldi |
283 |
A High-Quality Gold Standard for Citation-based Tasks |
Michael Färber, Alexander Thiemann and Adam Jatowt |
285 |
Multi Modal Distance - An Approach to Stemma Generation With Weighting |
Armin Hoenen |
286 |
Crowdsourcing-based Annotation of the Accounting Registers of the Italian Comedy |
Adeline Granet, Benjamin Hervy, Geoffrey Roman-Jimenez, Marouane Hachicha, Emmanuel Morin, Harold Mouchère, Solen Quiniou, Guillaume Raschia, Françoise Rubellin and Christian Viard-Gaudin |
287 |
An Integrated Formal Representation for Terminological and Lexical Data included in Classification Schemes |
Thierry Declerck and Kseniya Egorova |
288 |
Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard |
Delphine Bernhard, Anne-Laure Ligozat, Fanny Martin, Myriam Bras, Pierre Magistry, Marianne Vergez-Couret, Lucie Steiblé, Pascale Erhart, Nabil Hathout, Dominique Huck, Christophe Rey, Sophie Rosset and Jean Sibille |
289 |
A Simple Approach to Incorporate Contextual Information for Language-Independent, Dynamic Disambiguation Tasks |
Tobias Staron, Özge Alacam and Wolfgang Menzel |
290 |
Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities |
Steffen Remus and Chris Biemann |
291 |
A Neural Network Based Model for Loanword Identification in Uyghur |
Chenggang Mi, Yating Yang, Lei Wang, Xi Zhou and Tonghai Jiang |
292 |
Improving Hate Speech Detection with Deep Learning Ensembles |
Steven Zimmerman, Udo Kruschwitz and Chris Fox |
294 |
OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora |
Pierre Lison, Jörg Tiedemann and Milen Kouylekov |
295 |
A Pragmatic Approach for Classical Chinese Word Segmentation |
Shilei Huang and Jiangqin Wu |
296 |
A Corpus of Natural Multimodal Spatial Scene Descriptions |
David Schlangen |
297 |
Building an Ellipsis-aware Chinese Dependency Treebank for Web Text |
Xuancheng Ren, Bingzhen Wei, Zhiyuan Zhang and Xu Sun |
298 |
Visualization of the occurrence trend of infectious diseases using Twitter |
Ryusei Matsumoto, Minoru Yoshida, Kazuyuki Matsumoto, Hironobu Matsuda and Kenji Kita |
300 |
LREMap, a song of Resources and Evaluation |
Riccardo Del Gratta, Sara Goggi, Gabriella Pardelli and Nicoletta Calzolari |
301 |
ZAP: An Open-Source Multilingual Annotation Projection Framework |
Alan Akbik and Roland Vollgraf |
302 |
Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German |
Pierre-Edouard Honnet, Andrei Popescu-Belis, Claudiu Musat and Michael Baeriswyl |
303 |
Distributional Term Set Expansion |
Amaru Cuba Gyllensten and Magnus Sahlgren |
305 |
On the Vector Representation of Utterances in Dialogue Context |
Louisa Pragst, Niklas Rach, Wolfgang Minker and Stefan Ultes |
306 |
A Taxonomy for In-depth Evaluation of Normalization for User Generated Content |
Rob van der Goot, Rik van Noord and Gertjan van Noord |
307 |
A Swedish Cookie-Theft Corpus |
Dimitrios Kokkinakis, Kristina Lundholm Fors, Kathleen Fraser and Arto Nordlund |
308 |
Cross-checking WordNet and SUMO Using Meronymy |
Javier Alvez and German Rigau |
309 |
CRF+LG: A Hybrid Approach for the Portuguese Named Entity Recognition |
Juliana Pirovani and Elias Oliveira |
310 |
Reusable workflows for gender prediction |
Matej Martinc and Senja Pollak |
311 |
Extending Search System based on Interactive Visualization for Speech Corpora |
Tomoko Ohsuga, Yuichi Ishimoto, Tomoko Kajiyama, Shunsuke Kozawa, Kiyotaka Uchimoto and Shuichi Itahashi |
312 |
Evaluation of Dictionary Creating Methods for Finno-Ugric Minority Languages |
Zsanett Ferenczi, Iván Mittelholcz, Eszter Simon and Tamás Váradi |
313 |
ELRA Data Management Plan under the GDPR |
Pawel Kamocki |
314 |
From Manuscripts to Archetypes through Iterative Clustering |
Armin Hoenen |
316 |
BabyCloud, a Technological Platform for Parents and Researchers |
Xuan-Nga Cao, Cyrille Dakhlia, Patricia Del Carmen, Mohamed-Amine Jaouani, Malik Ould-Arbi and Emmanuel Dupoux |
317 |
Live Blog Corpus for Summarization |
Avinesh PVS, Maxime Peyrard and Christian M. Meyer |
318 |
A Gold Anaphora Annotation Layer on an Eye Movement Corpus |
Olga Seminck and Pascal Amsili |
319 |
FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German |
Leonidas Lefakis, Alan Akbik and Roland Vollgraf |
320 |
German Radio Interviews: The GRAIN Release of the SFB732 Silver Standard Collection |
Katrin Schweitzer, Kerstin Eckart, Markus Gärtner, Agnieszka Falenska, Arndt Riester, Ina Roesiger, Antje Schweitzer, Sabrina Stehwien and Jonas Kuhn |
321 |
Quantifying Qualitative Data for Understanding Controversial Issues |
Michael Wojatzki, Saif Mohammad, Torsten Zesch and Svetlana Kiritchenko |
322 |
ES-Port: a Spontaneous Spoken Human-Human Technical Support Corpus for Dialogue Research in Spanish |
Laura García-Sardiña, Manex Serras and Arantza del Pozo |
323 |
Can Domain Adaptation be Handled as Analogies? |
Núria Bel and Joel Pocostales |
324 |
A corpus of German political speeches from the 21st century |
Adrien Barbaresi |
325 |
SzegedKoref: A Hungarian Coreference Corpus |
Veronika Vincze, Klára Hegedűs and Richárd Farkas |
326 |
Toward a Lightweight Solution for Less-resourced Languages: Creating a POS Tagger for Alsatian Using Voluntary Crowdsourcing |
Alice Millour and Karën Fort |
327 |
Crowdsourced Corpus of Sentence Simplification with Core Vocabulary |
Akihiro Katsuta and Kazuhide Yamamoto |
328 |
A Corpus to Learn Refer-to-as Relations for Nominals |
Wasi Ahmad and Kai-Wei Chang |
329 |
Word Affect Intensities |
Saif Mohammad |
330 |
The Effects of Unimodal Representation Choices on Multimodal Learning |
Fernando T. Ito, Helena de Medeiros Caseli and Jander Moreira |
332 |
MPST: A Corpus of Movie Plot Synopses with Tags |
Sudipta Kar, Suraj Maharjan, Adrian Pastor López Monroy and Thamar Solorio |
333 |
Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages |
Talha Javed, Nizar Habash and Dima Taji |
334 |
Dialog Intent Structure: A Hierarchical Schema of Linked Dialog Acts |
Silvia Pareti and Tatiana Lando |
335 |
EuroGames16: Evaluating Change Detection in Online Conversation |
Cyril Goutte, Yunli Wang, FangMing Liao, Zachary Zanussi, Samuel Larkin and Yuri Grinberg |
336 |
Metadata Collection Records for Language Resources |
Henk van den Heuvel, Erwin Komen and Nelleke Oostdijk |
337 |
The Natural Stories Corpus |
Richard Futrell, Edward Gibson, Harry J. Tily, Idan Blank, Anastasia Vishnevetsky, Steven Piantadosi and Evelina Fedorenko |
339 |
A Web-based System for Crowd-in-the-Loop Dependency Treebanking |
Stephen Tratz and Nhien Phan |
344 |
Tools for The Production of Analogical Grids and a Resource of N-gram Analogical Grids in 11 Languages |
Rashel Fam and Yves Lepage |
345 |
Analysis of Implicit Conditions in Database Search Dialogues |
Shun-ya Fukunaga, Hitoshi Nishikawa, Takenobu Tokunaga, Hikaru Yokono and Tetsuro Takahashi |
346 |
JDCFC: A Japanese Dialogue Corpus with Feature Changes |
Tetsuaki Nakamura and Daisuke Kawahara |
349 |
Knowing the Author by the Company His Words Keep |
Armin Hoenen |
350 |
MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction |
Ossama Obeid, Salam Khalifa, Nizar Habash, Houda Bouamor, Wajdi Zaghouani and Kemal Oflazer |
351 |
The MADAR Arabic Dialect Corpus and Lexicon |
Houda Bouamor, Nizar Habash, Mohammad Salameh, Wajdi Zaghouani, Owen Rambow, Dana Abdulrahim, Ossama Obeid, Salam Khalifa, Fadhl Eryani, Alexander Erdmann and Kemal Oflazer |
354 |
Author Profiling from Facebook Corpora |
Fernando Hsieh, Rafael Dias and Ivandré Paraboni |
355 |
Gaining and Losing Influence in Online Conversation |
Arun Sharma and Tomek Strzalkowski |
356 |
Semi-automatic Korean FrameNet Annotation over KAIST Treebank |
Younggyun Hahm, Jiseong Kim, Sunggoo Kwon and Key-Sun Choi |
357 |
Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text |
Géraldine Damnati, Jérémy Auguste, Alexis Nasr, Delphine Charlet, Johannes Heinecke and Frédéric Béchet |
358 |
A Morphological Analyzer for St. Lawrence Island / Central Siberian Yupik |
Emily Chen and Lane Schwartz |
363 |
A Corpus of English-Hindi Code-Mixed Social Media Texts for Humor Detection |
Ankush Khandelwal, Sahil Swami, Syed Sarfaraz Akhtar and Manish Shrivastava |
364 |
Combining Concepts and Their Translations from Structured Dictionaries of Uralic Minority Languages |
Mika Hämäläinen, Liisa Lotta Tarvainen and Jack Rueter |
366 |
Towards AMR-BR: A SemBank for Brazilian Portuguese Language |
Rafael Anchiêta and Thiago Pardo |
368 |
Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications |
Andrea Zielinski and Peter Mutschke |
370 |
A Large Parallel Corpus of Full-Text Scientific Articles |
Felipe Soares, Viviane Moreira and Karin Becker |
371 |
Building Literary Corpora for Computational Literary Analysis - A Prototype to Bridge the Gap between CL and DH |
Andrew Frank and Christine Ivanovic |
372 |
Towards Language Technology for Mi'kmaq |
Anant Maheshwari, Leo Bouscarrat and Paul Cook |
373 |
ASAP++: Enriching the ASAP Automated Essay Grading Dataset with Essay Attribute Scores |
Sandeep Mathias and Pushpak Bhattacharyya |
374 |
Building A Handwritten Cuneiform Character Imageset |
Kenji Yamauchi, Hajime Yamamoto and Wakaha Mori |
375 |
Strategic Planning for Creating Bilingual Dictionaries of Indonesian Ethnic Languages |
Arbi Haza Nasution, Yohei Murakami and Toru Ishida |
377 |
Transfer of Frames from English FrameNet to Construct Chinese FrameNet: A Bilingual Corpus-Based Approach |
Tsung-Han Yang, Hen-Hsen Huang, An-Zi Yen and Hsin-Hsi Chen |
378 |
Building Universal Dependency Treebanks in Korean |
Jayeol Chun, Na-Rae Han, Jena D. Hwang and Jinho D. Choi |
380 |
Automatic and Manual Web Annotations in an Infrastructure to handle Fake News and other Online Media Phenomena |
Georg Rehm, Julian Moreno-Schneider and Peter Bourgonje |
381 |
Evaluating Machine Translation Performance on Chinese Idioms with a Blacklist Method |
Yutong Shao, Rico Sennrich, Bonnie Webber and Federico Fancellu |
383 |
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database |
Yasuhiro Minami, Tessei Kobayashi and Yuko Okumura |
384 |
Using English Baits to Catch Serbian Multi-Word Terminology |
Cvetana Krstev, Branislava Šandrih and Ranka Stankovic |
385 |
MirasText: An Automatically Generated Text Corpus for Persian |
Behnam Sabeti, Hossein Abedi Firouzjaee, Ali Janalizadeh Choobbasti and Seyed Hani Elamahdi Mortazavi Najafabadi |
387 |
Developing the Bangla RST Discourse Treebank |
Debopam Das and Manfred Stede |
389 |
Building a Sentiment Corpus of Tweets in Brazilian Portuguese |
Henrico Brum and Maria das Graças Volpe Nunes |
390 |
Computer-assisted speaker diarization: how to evaluate human corrections? |
Pierre-Alexandre Broux, David Doukhan, Simon Petitrenaud, Sylvain Meignier and Jean Carrive |
391 |
A Bird’s-eye View of Language Processing Projects at Romanian Academy |
Dan Tufiș and Dan Cristea |
392 |
An Integrated Representation of Linguistic and Social Functions of Code-Switching |
Silvana Hartmann, Monojit Choudhury and Kalika Bali |
393 |
Joint Learning of Sense and Word Embeddings |
Mohammed Alsuhaibani and Danushka Bollegala |
394 |
Construction of Large-scale English Verbal Multiword Expression Annotated Corpus |
Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto |
395 |
Unified Guidelines and Resources for Arabic Dialect Orthography |
Nizar Habash, Salam Khalifa, Fadhl Eryani, Owen Rambow, Dana Abdulrahim, Alexander Erdmann, Reem Faraj, Wajdi Zaghouani, Houda Bouamor, Nasser Zalmout, Sara Hassan, Faisal Al shargi, Sakhar Alkhereyf, Basma Abdulkareem, Ramy Eskander, Mohammad Salameh and Hind Saddiki |
396 |
Sign Languages and the Online World Online Dictionaries & Lexicostatistics |
Shi Yu and Carlo Geraci |
397 |
We Are Depleting Our Research Subject as We Are Investigating It: In Language Technology, more Replication and Diversity Are Needed |
António Branco |
400 |
A Parallel Corpus of Arabic-Japanese News Articles |
Go Inoue, Nizar Habash, Yuji Matsumoto and Hiroyuki Aoyama |
402 |
Representation Mapping: A Novel Approach to Automatically Generate High-Quality Multi-Lingual Emotion Lexicons |
Sven Buechel and Udo Hahn |
403 |
Pronunciation Dictionaries for the Alsatian Dialects to Analyze Spelling and Phonetic Variation |
Lucie Steiblé and Delphine Bernhard |
405 |
EMTC: Multi-label Corpus in Movie Domain for Emotion Analysis in Conversational Text |
Phan Duc-Anh and Yuji Matsumoto |
406 |
Towards an ISO Standard for the Annotation of Quantification |
Harry Bunt |
407 |
The ADELE Corpus of Dyadic Social Text Conversations: Dialog Act Annotation with ISO 24617-2 |
Emer Gilmartin, Christian Saam, Brendan Spillane, Maria O'Reilly, Ketong Su, Arturo Calvo, Loredana Cerrato, Killian Levacher, Nick Campbell and Vincent Wade |
410 |
The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions |
José Lopes, Nils Hemmingsson and Oliver Åstrand |
411 |
Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models |
Zhao Meng, Lili Mou and Zhi Jin |
412 |
The Automatic Annotation of the Semiotic Type of Hand Gestures in Obama's Humorous Speeches |
Costanza Navarretta |
414 |
Revisiting Distant Supervision for Relation Extraction |
Tingsong Jiang, Jing Liu and Chin-Yew Lin |
416 |
Distribution of Emotional Reactions to News Articles in Twitter |
Omar Juárez Gambino, Hiram Calvo and Consuelo-Varinia García-Mendoza |
418 |
Preparing Data from Psychotherapy for Natural Language Processing |
Margot Mieskes and Andreas Stiegelmayr |
419 |
Towards Visual WordNet: A Lexical Database Organized around Images |
Shantipriya Parida and Ondřej Bojar |
421 |
Universal Morphologies for the Caucasus region |
Christian Chiarcos, Maxim Ionov, Frank Abromeit, Christian Fäth, Monika Rind-Pawlowski, Kathrin Donandt, Hasmik Sargsian and Jesse Wichers Schreur |
422 |
Linguistic Analysis in a TEI Nutshell: Principles, Design and Potential of a TEI-Extension for Basic Linguistic Inline Annotation |
Piotr Banski, Susanne Haaf and Martin Mueller |
423 |
The Reference Corpus of the Contemporary Romanian Language (CoRoLa) |
Verginica Barbu Mititelu, Dan Tufiș and Elena Irimia |
424 |
BioRo: The Biomedical Corpus for the Romanian Language |
Maria Mitrofan and Dan Tufiș |
426 |
A Corpus of Drug Usage Guidelines Annotated with Type of Advice |
Sarah Masud Preum, Md. Rizwan Parvez, Kai-Wei Chang and John Stankovic |
427 |
Semi-Supervised Clustering for Short Answer Scoring |
Andrea Horbach and Manfred Pinkal |
429 |
An Information-Providing Closed-Domain Human-Agent Interaction Corpus |
Jelte van Waterschoot, Guillaume Dubuisson Duplessis, Lorenzo Gatti and Merijn Bruijnes |
430 |
Discriminating between Similar Languages on Spoken Texts with Out-of-domain Data |
Junqing He, Xian Huang, Yonghong Yan and Yan Zhang |
432 |
Examining the Tip of the Iceberg: A Data Set for Idiom Translation |
Marzieh Fadaee, Arianna Bisazza and Christof Monz |
436 |
KRAUTS: A German Temporally Annotated News Corpus |
Jannik Strötgen, Anne-Lyse Minard, Lukas Lange, Manuela Speranza and Bernardo Magnini |
438 |
Konbitzul: an MWE-specific database for Spanish-Basque |
Uxoa Iñurrieta, Itziar Aduriz, Arantza Diaz de Ilarraza, Gorka Labaka and Kepa Sarasola |
439 |
EFLLex: A Graded Lexical Resource for Learners of English as a Foreign Language |
Luise Dürlich and Thomas Francois |
440 |
Moving TIGER beyond sentence-level |
Agnieszka Falenska, Kerstin Eckart and Jonas Kuhn |
441 |
Elicitation protocol and material for a corpus of long prepared monologues in Sign Language |
Michael Filhol and Mohamed Nassime Hadjadj |
442 |
Fine-grained Semantic Textual Similarity for Serbian |
Vuk Batanović, Miloš Cvetanović and Boško Nikolić |
443 |
MirasVoice: A bilingual (English-Persian) speech corpus |
Amir Vaheb, Ali Janalizadeh Choobbasti, Mahdi Mortazavi and Saeid Safavi |
445 |
Semantic Relatedness of Wikipedia Concepts -- Benchmark Data and a Working Solution |
Liat Ein Dor, Alon Halfon, Yoav Kantor, Ran Levy, Yosi Mass, Ruty Rinot, Eyal Shnarch and Noam Slonim |
446 |
Combining rule-based and embedding-based approaches to normalize textual entities with an ontology |
Arnaud Ferré, Louise Deléger, Pierre Zweigenbaum and Claire Nédellec |
449 |
Complex and Precise Movie and Book Annotations in French Language for Aspect Based Sentiment Analysis |
Stefania Pecore and Jeanne Villaneau |
450 |
GenDR: A Generic Deep Realizer with Complex Lexicalization |
François Lareau, Florie Lambrey, Ieva Dubinskaite, Daniel Galarreta-Piquette and Maryam Nejat |
451 |
Attention for Implicit Discourse Relation Recognition |
Andre Cianflone and Leila Kosseim |
452 |
A Multilingual Test Collection for the Semantic Search of Entity Categories |
Juliano Efson Sales, Siamak Barzegar, Wellington Franco, Bernhard Bermeitinger, Tiago Cunha, Brian Davis, Siegfried Handschuh and André Freitas |
456 |
From analysis to modeling of engagement as sequences of multimodal behaviors |
Soumia Dermouche and Catherine Pelachaud |
457 |
Lingmotif-lex: a Wide-coverage, State-of-the-art Lexicon for Sentiment Analysis |
Antonio Moreno-Ortiz and Chantal Pérez-Hernández |
458 |
Towards a Welsh Semantic Annotation System |
Scott Piao, Paul Rayson, Dawn Knight and Gareth Watkins |
460 |
Automatic Thesaurus Construction for Modern Hebrew |
Chaya Liebeskind, Ido Dagan and Jonathan Schler |
461 |
Constructing a Lexicon of Relational Nouns |
Edward Newell and Jackie Chi Kit Cheung |
462 |
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing |
Koichiro Yoshino, Yoko Ishikawa, Masahiro Mizukami, Yu Suzuki, Sakriani Sakti and Satoshi Nakamura |
463 |
ChAnot: An Intelligent Annotation Tool for Indigenous and Highly Agglutinative Languages in Peru |
Rodolfo Mercado, José Pereira, Marco Antonio Sobrevilla Cabezudo and Arturo Oncevay |
464 |
Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags |
Koichiro Yoshino, Hiroki Tanaka, Kyoshiro Sugiyama, Satoshi Nakamura and Makoto Kondo |
465 |
A Japanese Corpus for Analyzing Customer Loyalty Information |
Yiou Wang and Takuji Tahara |
466 |
An Evaluation Framework for Multimodal Interaction |
Nikhil Krishnaswamy and James Pustejovsky |
469 |
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report |
Hanae Koiso, Yasuharu Den, Yuriko Iseki, Wakako Kashino, Yoshiko Kawabata, Ken'ya Nishikawa, Yayoi Tanaka and Yasuyuki Usuda |
470 |
Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions |
Heike Brock and Kazuhiro Nakadai |
471 |
FooTweets: A Bilingual Parallel Corpus of World Cup Tweets |
Henny Sluyter-Gäthje, Pintu Lohar, Haithem Afli and Andy Way |
474 |
WASA: A Web Application for Sequence Annotation |
Fahad AlGhamdi and Mona Diab |
479 |
TAP-DLND 1.0: A Corpus for Document Level Novelty Detection |
Tirthankar Ghosal, Asif Ekbal, Pushpak Bhattacharyya, Amitra Salam and Swati Tiwary |
480 |
Augmenting Image Question Answering Dataset by Exploiting Image Captions |
Masashi Yokota and Hideki Nakayama |
481 |
Edit me: A Corpus and a Framework for Understanding Natural Language Image Editing |
Ramesh Manuvinakurike, Jacqueline Brixey, Trung Bui, Walter Chang, Doo Soon Kim, Ron Artstein and Kallirroi Georgila |
482 |
The Niki and Julie Corpus: Collaborative Multimodal Dialogues between Humans, Robots, and Virtual Agents |
Ron Artstein, Jill Boberg, Alesia Gainer, Jonathan Gratch, Emmanuel Johnson, Anton Leuski, Gale Lucas and David Traum |
483 |
Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM |
Randah Alharbi, Walid Magdy, Kareem Darwish, Ahmed Abdelali and Hamdy Mubarak |
485 |
Overcoming the Long Tail Problem: A Case Study on CO2-Footprint Estimation of Recipes using Information Retrieval |
Melanie Geiger and Martin Braschler |
486 |
A Deep Neural Network based Approach for Entity Extraction in Code-Mixed Indian Social Media Text |
Deepak Gupta, Asif Ekbal and Pushpak Bhattacharyya |
490 |
Phonemic Transcription of Low-Resource Tonal Languages |
Oliver Adams, Trevor Cohn, Graham Neubig, Steven Bird and Alexis Michaud |
493 |
AMeD: A Chinese Medical Dialogue Corpus Annotated with Conversation Structure and Actions |
Nan Wang, Yan Song and Fei Xia |
494 |
Adapting serious game for fallacious argumentation to German: Pitfalls, insights, and best practices |
Ivan Habernal, Patrick Pauli and Iryna Gurevych |
496 |
Discovering the Language of Wine Reviews: A Text Mining Account |
Els Lefever, Iris Hendrickx, Ilja Croijmans, Antal van den Bosch and Asifa Majid |
497 |
Handling Big Data and Sensitive Data Using EUDAT's Generic Execution Framework and the WebLicht Workflow Engine |
Claus Zinn, Wei Qui, Marie Hinrichs, Emanuel Dima and Alexandr Chernov |
498 |
Dysarthric speech evaluation: automatic and perceptual approaches |
Imed Laaridh, Christine Meunier and Corinne Fredouille |
499 |
Signbank: Software to Support Web Based Dictionaries of Sign Language |
Steve Cassidy, Onno Crasborn, Henri Nieminen, Wessel Stoop, Micha Hulsbosch, Susan Even, Erwin Komen and Trevor Johnson |
500 |
NegPar: A parallel corpus annotated for negation |
Qianchu Liu, Federico Fancellu and Bonnie Webber |
501 |
Predicting Nods by using Dialogue Acts in Dialogue |
Ryo Ishii, Ryuichiro Higashinaka and Junji Tomita |
502 |
SPADE: Evaluation Dataset for Monolingual Phrase Alignment |
Yuki Arase and Jun'ichi Tsujii |
504 |
Lessons Learned: On the Challenges of Migrating a Research Data Repository from a Research Institution to a University Library |
Thorsten Trippel and Claus Zinn |
505 |
J-MeDic: A Japanese Disease Name Dictionary based on Real Clinical Usage |
Kaoru Ito, Hiroyuki Nagai, Taro Okahisa, Shoko Wakamiya, Tomohide Iwao and Eiji Aramaki |
506 |
Carcinologic Speech Severity Index Project: A Database of Speech Disorders Productions to Assess Quality of Life Related to Speech After Cancer |
Corine Astésano, Mathieu Balaguer, Jérôme Farinas, Corinne Fredouille, Pascal Gaillard, Alain Ghio, Imed Laaridh, Muriel Lalain, Benoît Lepage, Julie Mauclair, Olivier Nocaudie, Julien Pinquier, Oriol Pont, Gilles Pouchoulin, Michèle Puech, Danièle Robert, Etienne Sicard and Virginie Woisard |
507 |
The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic |
Ahmed Abdelali, Irina Temnikova, Samy Hedaya and Stephan Vogel |
509 |
Biomedical term normalization of EHRs with UMLS |
Naiara Perez, Montse Cuadros and German Rigau |
510 |
New directions in ELRA activities |
Valérie Mapelli, Victoria Arranz, Hélène Mazo, Pawel Kamocki and Vladimir Popescu |
513 |
Recognizing Behavioral Factors while Driving: A Real-World Multimodal Corpus to Monitor the Driver’s Affective State |
Alicia Lotz, Klas Ihme, Audrey Charnoz, Pantelis Maroudis, Ivan Dmitriev and Andreas Wendemuth |
515 |
A Multilingual Wikified Data Set of Educational Material |
Iris Hendrickx, Eirini Takoulidou, Thanasis Naskos, Katia Lida Kermanidis, Vilelmini Sosoni, Hugo de Vos, Maria Stasimioti, Menno van Zaanen, Panayota Georgakopoulou, Valia Kordoni, Maja Popovic, Markus Egg and Antal van den Bosch |
516 |
TSix: A Human-involved-creation Dataset for Tweet Summarization |
Minh-Tien Nguyen, Dac Viet Lai, Huy-Tien Nguyen and Minh-Le Nguyen |
517 |
Crowdsourcing Regional Variables and Automatic Geolocalisation of Speakers of European French |
Jean-Philippe Goldman, Yves Scherrer, Julie Glikman, Mathieu Avanzi, Christophe Benzitoun and Philippe Boula de Mareüil |
518 |
An Assessment of Explicit Inter- and Intra-sentential Discourse Connectives in Turkish Discourse Bank |
Deniz Zeyrek and Murathan Kurfalı |
521 |
ArapTweet: A Large Multi-Dialect Twitter Corpus for Gender, Age and Language Variety Identification |
Wajdi Zaghouani and Anis Charfi |
522 |
Measuring Innovation in Speech and Language Processing Publications |
Joseph Mariani, Gil Francopoulo and Patrick Paroubek |
525 |
Annotation and Modelling of Discourse Compositionality: A Context-based Approach for Dialogue Act Recognition using Recurrent Neural Networks |
Chandrakant Bothe, Cornelius Weber, Sven Magg and Stefan Wermter |
527 |
Semantic Frame Parsing for Information Extraction: the CALOR corpus |
Gabriel Marzinotto, Jeremy Auguste, Frederic Bechet, Geraldine Damnati and Alexis Nasr |
528 |
Enriching LICO with Corpus-based Data |
Anna Feltracco, Elisabetta Jezek and Bernardo Magnini |
529 |
A Morphologically Annotated Corpus of Emirati Arabic |
Salam Khalifa, Nizar Habash, Fadhl Eryani, Ossama Obeid, Dana Abdulrahim and Meera Al Kaabi |
530 |
Compilation of Corpora for the Study of the Information Structure–Prosody Interface |
Alicia Burga, Monica Dominguez, Mireia Farrús and Leo Wanner |
532 |
Building a List of Synonymous Words and Phrases of Japanese Compound Verbs |
Kyoko Kanzaki and Hitoshi Isahara |
533 |
Building TOCFL Learner Corpus for Chinese Grammatical Error Diagnosis |
Lung-Hao Lee, Yuen-Hsien Tseng and Liping Chang |
534 |
BDPROTO: A Database of Phonological Inventories from Ancient and Reconstructed Languages |
Egidio Marsico, Sebastien Flavier, Annemarie Verkerk and Steven Moran |
535 |
Experiments with Convolutional Neural Networks for Multi-Label Authorship Attribution |
Dainis Boumber, Yifan Zhang and Arjun Mukherjee |
536 |
Evaluating EcoLexiCAT: a Terminology-Enhanced CAT Tool |
Pilar León-Araúz and Arianne Reimerink |
537 |
PMKI: an European Commission action for the interoperability, maintainability and sustainability of Language Resources |
Peter Schmitz, Enrico Francesconi, Najeh Hajlaoui and Brahim Batouche |
539 |
Towards an Automatic Assessment of Crowdsourced Data for NLU |
Patricia Braunger, Wolfgang Maier, Jan Wessling, Maria Schmidt and Jordan Koontz |
541 |
Automatic Enrichment of Terminological Resources: the IATE RDF Example |
Mihael Arcan, Elena Montiel-Ponsoda, John Philip McCrae and Paul Buitelaar |
542 |
SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain |
Carolina Scarton, Gustavo Paetzold and Lucia Specia |
543 |
Spanish HPSG Treebank based on the AnCora Corpus |
Luis Chiruzzo and Dina Wonsever |
547 |
Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model |
Yueh-Yin Shih and Wei-Yun Ma |
548 |
The Abkhaz National Corpus |
Paul Meurer |
551 |
Annotating Zero Anaphora for Question Answering |
Yoshihiko Asao, Ryu Iida and Kentaro Torisawa |
553 |
Evaluating Scoped Meaning Representations |
Rik van Noord, Lasha Abzianidze, Hessel Haagsma and Johan Bos |
558 |
MIAPARLE: Online training for the discrimination of stress contrasts |
Jean-Philippe Goldman |
559 |
Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition |
Michael Stadtschnitzer and Christoph Schmidt |
560 |
Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning |
Jinyoung Yeo, Gyeongbok Lee, Gengyu Wang, Seungtaek Choi, Hyunsouk Cho, Reinald Kim Amplayo and Seung-won Hwang |
562 |
Multi-Dialect Arabic POS Tagging: A CRF Approach |
Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, Mohamed Eldesouki, Younes Samih, Randah Alharbi, Mohammed Attia, Walid Magdy and Laura Kallmeyer |
564 |
The SSIX Corpus: A Trilingual Gold Standard Corpus for Sentiment Analysis in Financial Microblogs |
Thomas Gaillat, Manel Zarrouk, André Freitas and Brian Davis |
565 |
Universal Dependencies for Amharic |
Binyam Ephrem Seyoum |
566 |
Preliminary Analysis of Embodied Interactions between Science Communicators and Visitors Based on a Multimodal Corpus of Japanese Conversations in a Science Museum |
Rui Sakaida, Ryosaku Makino and Mayumi Bono |
567 |
A Parser for LTAG and Frame Semantics |
David Arps and Simon Petitjean |
568 |
Evaluating Domain Adaptation for Machine Translation Across Scenarios |
Thierry Etchegoyhen, Anna Fernández Torné, Andoni Azpeitia, Eva Martínez Garcia and Anna Matamala |
569 |
Ensemble Romanian Dependency Parsing with Neural Networks |
Radu Ion, Elena Irimia and Verginica Barbu Mititelu |
570 |
The First 100 Days: A Corpus Of Political Agendas on Twitter |
Nathan Green and Septina Larasati |
571 |
Using a Corpus of English and Chinese Political Speeches for Metaphor Analysis |
Kathleen Ahrens, Huiheng Zeng and Shun-han Rebekah Wong |
573 |
Diacritics Restoration Using Neural Networks |
Jakub Náplava, Milan Straka, Pavel Straňák and Jan Hajic |
574 |
Revisiting the Task of Scoring Open IE Relations |
William Lechelle and Phillippe Langlais |
575 |
CLARIN: Towards FAIR and Responsible Data Science |
Franciska de Jong, Bente Maegaard, Koenraad De Smedt, Darja Fišer and Dieter Van Uytvanck |
576 |
A Workbench for Rapid Generation of Cross-Lingual Summaries |
Nisarg Jhaveri, Manish Gupta and Vasudeva Varma |
577 |
Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System |
Shweta Yadav, Asif Ekbal, Sriparna Saha and Pushpak Bhattacharyya |
579 |
Comprehensive Annotation of Various Temporal Information on the Time Axis |
Tomohiro Sakaguchi, Daisuke Kawahara and Sadao Kurohashi |
580 |
Automatic Identification of Maghreb Dialects using a Dictionary-based Approach |
Houda Saadane, Hosni Seffih, Christian Fluhr, Khalid Choukri and Nasredine Semmar |
581 |
EmotionLines: An Emotion Corpus of Multi-Party Conversations |
Sheng-Yeh Chen, Chao-Chun Hsu, Chuan-Chun Kuo, Ting-Hao (Kenneth) Huang and Lun-Wei Ku |
582 |
Evaluation of Crowdsourcing WordNet Synset Localization |
Amarsanaa Ganbold and Altangerel Chagnaa |
583 |
Sarcasm Target Identification: Dataset and An Introductory Approach |
Aditya Joshi, Pranav Goel, Pushpak Bhattacharyya and Mark Carman |
585 |
Improving domain-specific SMT for low-resourced languages using data from different domains |
Fathima Farhath, Surangika Ranathunga, Sanath Jayasena, Gihan Dias and Uthayasanker Thayasivam |
586 |
A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier |
Bolette Pedersen, Sanni Nimb, Anders Søgaard, Mareike Hartmann and Sussi Olsen |
587 |
Comparison of Pun Detection Methods using Japanese Pun Corpus |
Motoki Yatsu and Kenji Araki |
590 |
ESCRITO - An NLP-Enhanced Educational Scoring Toolkit |
Andrea Horbach and Torsten Zesch |
591 |
TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations |
Philipp Helfrich, Elias Rieb, Giuseppe Abrami, Andy Lücking and Alexander Mehler |
592 |
Finely Tuned, 2 Billion Token Based Word Embeddings for Portuguese |
João Rodrigues and António Branco |
596 |
Modeling Collaborative Multimodal Behavior in Group Dialogues: The MULTISIMO Corpus |
Maria Koutsombogera and Carl Vogel |
598 |
A Gold Standard for Multilingual Automatic Term Extraction from Comparable Corpora: Term Structure and Translation Equivalents |
Ayla Rigouts Terryn, Veronique Hoste and Els Lefever |
599 |
The brWaC Corpus: A New Open Resource to Aid in the Processing of Brazilian Portuguese |
Jorge Alberto Wagner Filho, Rodrigo Wilkens and Aline Villavicencio |
600 |
Multilingual Dependency Parsing for Low-Resource Languages: Case Studies of North Saami and Komi-Zyrian |
KyungTae Lim, Niko Partanen and Thierry Poibeau |
601 |
A supervised approach to taxonomy extraction using word embeddings |
Rajdeep Sarkar, John Philip McCrae and Paul Buitelaar |
602 |
SLIDE - a Sentiment Lexicon of Common Idioms |
Charles Jochim, Francesca Bonin, Roy Bar-Haim and Noam Slonim |
603 |
The Circumstantial Event Ontology (CEO) and ECB+/CEO; an Ontology and Corpus for Implicit Causal Relations between Events |
Roxane Segers, Tommaso Caselli and Piek Vossen |
604 |
Discovering parallel language resources for training MT engines |
Vassilis Papavassiliou, Prokopis Prokopidis and Stelios Piperidis |
610 |
Toward An Epic Epigraph Graph |
Francis Bond and Graham Matthews |
611 |
A fine-grained error analysis of NMT, SMT and RBMT output for English-to-Dutch |
Laura Van Brussel, Arda Tezcan and Lieve Macken |
612 |
Framing Named Entity Linking Error Types |
Adrian Brasoveanu, Giuseppe Rizzo, Philipp Kuntschick, Albert Weichselbraun and Lyndon J.B. Nixon |
613 |
A Corpus with Negative Full Forms for General Abbreviation Prediction |
Yi Zhang and Sun Xu |
615 |
Is it worth it? Budget-related evaluation metrics for model selection |
Filip Klubička, Giancarlo D. Salton and John D. Kelleher |
616 |
A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language |
João Sequeira, Teresa Goncalves, Paulo Quaresma, Amália Mendes and Iris Hendrickx |
617 |
ForFun 1.0: Prague Database of Syntactic Forms and Functions -- An Invaluable Resource for Linguistic Research |
Marie Mikulová and Eduard Bejček |
618 |
MacauCorpus: A Chinese-Portuguese Parallel Corpus for Machine Translation |
Siyou Liu, Longyue Wang and Qun Liu |
619 |
A Leveled Reading Corpus of Modern Standard Arabic |
Muhamed Al Khalil, Hind Saddiki and Nizar Habash |
620 |
A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification |
Sanja Štajner and Sergiu Nisioi |
621 |
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation |
Ali Can Kocabiyikoglu, Laurent Besacier and Olivier Kraif |
623 |
Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development |
Yan Cao, Yasuhiro Minami, Yuko Okumura and Tessei Kobayashi |
624 |
Neural Models of Selectional Preferences for Implicit Semantic Role Labeling |
Minh Le and Antske Fokkens |
625 |
CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing |
Amir More, Özlem Çetinoğlu, Nizar Habash, Benoît Sagot, Djamé Seddah, Reut Tsarfaty, Dima Taji and Çağrı Çöltekin |
626 |
Annotation of Speaker Information in Novel Conversation Sentences and Its Quantitative Analysis |
Makoto Yamazaki, Yumi Miyazaki and Wakako Kashino |
627 |
Learning Word Vectors for 157 Languages |
Edouard Grave, Piotr Bojanowski, Prakhar Gupta, Armand Joulin and Tomas Mikolov |
628 |
Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features |
Gregor Wiedemann and Gerhard Heyer |
629 |
Multimodal Lexical Translation |
Chiraag Lala and Lucia Specia |
630 |
CATS: A Tool for Customised Alignment of Text Simplification Corpora |
Sanja Štajner, Marc Franco-Salvador, Paolo Rosso and Simone Paolo Ponzetto |
632 |
T-REx: A Large Scale Alignment of Natural Language with Knowledge Base Triples |
Hady Elsahar, Pavlos Vougiouklis, Arslen Remaci, Christophe Gravier, Jonathon Hare, Frederique Laforest and Elena Simperl |
635 |
Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville) |
Annie Rialland, Martine Adda-Decker, Guy-Noël Kouarata, Gilles Adda, Laurent Besacier, Lori Lamel, Elodie Gauthier, Pierre Godard and Jamison Cooper-Leavitt |
636 |
PoSTWITA-UD: an Italian Twitter Treebank in Universal Dependencies |
Manuela Sanguinetti, Cristina Bosco, Alberto Lavelli, Alessandro Mazzei and Fabio Tamburini |
639 |
The LREC Workshops Map |
Sara Goggi, Roberto Bartolini, Monica Monachini and Gabriella Pardelli |
640 |
Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations |
Yudai Kishimoto, Shinnosuke Sawada, Yugo Murawaki, Daisuke Kawahara and Sadao Kurohashi |
642 |
The LIA Treebank of Spoken Norwegian Dialects |
Lilja Øvrelid, Andre Kåsen, Kristin Hagen, Per Erik Solberg and Janne Bondi Johannessen |
644 |
Polish Corpus of Annotated Descriptions of Images |
Alina Wróblewska |
646 |
PronouncUR: An Urdu Pronunciation Lexicon Generator |
Haris Bin Zia, Agha Ali Raza and Awais Athar |
648 |
Managing Public Sector Data for Multilingual Applications Development |
Stelios Piperidis, Penny Labropoulou, Miltos Deligiannis and Maria Giagkou |
650 |
A Large Automatically-Acquired All-Words List of Multiword Expressions Scored for Compositionality |
Will Roberts and Markus Egg |
652 |
Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project |
Guillaume Wisniewski |
653 |
Chats and Chunks: Annotation and Analysis of Multiparty Long Casual Conversations |
Emer Gilmartin, Carl Vogel and Nick Campbell |
658 |
A Corpus for Multilingual Document Classification in Eight Languages |
Holger Schwenk and Xian Li |
660 |
A Semi-autonomous System for Creating a Human-Machine Interaction Corpus in Virtual Reality: Application to the ACORFORMed System for Training Doctors to Break Bad News |
Magalie Ochs, Philippe Blache, Grégoire de Montcheuil, Jean-Marie Pergandi, Jorane Saubesty, Daniel Francon and Daniel Mestre |
661 |
ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation |
Venelin Kovatchev, Toni Marti and Maria Salamo |
662 |
Bridging the LAPPS Grid and CLARIN |
Erhard Hinrichs, Nancy Ide, James Pustejovsky, Jan Hajic, Marie Hinrichs, Mohammad Fazleh Elahi, Keith Suderman, Marc Verhagen, Kyeongmin Rim, Pavel Stranak and Jozef Misutka |
664 |
Korean TimeBank Including Relative Temporal Information |
Chae-Gyun Lim, Young-Seob Jeong and Ho-Jin Choi |
666 |
Mining Biomedical Publications With The LAPPS Grid |
Nancy Ide and Keith Suderman |
667 |
From Data to Text: Capturing Long Tail Events through Microworlds and Reference Texts |
Piek Vossen, Marten Postma and Filip Ilievski |
668 |
A database of German definitory contexts from selected web sources |
Adrien Barbaresi, Lothar Lemnitzer and Alexander Geyken |
669 |
Building a Word Segmenter for Sanskrit Overnight |
Vikas Reddy, Amrith Krishna, Prateek Gupta, Vineeth M R and Pawan Goyal |
671 |
Czech Text Document Corpus v 2.0 |
Pavel Kral and Ladislav Lenc |
672 |
Dialogue Structure Annotation for Multi-Floor Interaction |
David Traum, Cassidy Henry, Stephanie Lukin, Ron Artstein, Felix Gervits, Kimberly Pollard, Claire Bonial, Su Lei, Clare Voss, Matthew Marge, Cory Hayes and Susan Hill |
673 |
Persian Discourse Treebank |
Azadeh Mirzaei and Pegah Safari |
674 |
Extracting an English-Persian Parallel Corpus from Comparable Corpora |
Akbar Karimi, Ebrahim Ansari and Bahram Sadeghi Bigham |
675 |
Manually Annotated Corpus of Polish Texts Published between 1830 and 1918 |
Witold Kieraś and Marcin Woliński |
676 |
A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks |
Arif Khan, Ingmar Steiner, Yusuke Sugano, Andreas Bulling and Ross Macdonald |
677 |
Translation Crowdsourcing: Creating a Multilingual Corpus of Online Educational Content |
Vilelmini Sosoni, Katia Lida Kermanidis, Maria Stasimioti, Thanasis Naskos, Eirini Takoulidou, Menno van Zaanen, Sheila Castilho, Panayota Georgakopoulou, Valia Kordoni and Markus Egg |
678 |
Upping the Ante: Towards a Better Benchmark for Chinese-to-English Machine Translation |
Christian Hadiwinoto and Hwee Tou Ng |
679 |
A Corpus of eRulemaking User Comments for Measuring Evaluability of Arguments |
Joonsuk Park and Claire Cardie |
680 |
PDFAnno: a Web-based Linguistic Annotation Tool for PDF Documents |
Hiroyuki Shindo and Yuji Matsumoto |
682 |
M-CNER: A Corpus for Chinese Named Entity Recognition in Multi-Domains |
Qi Lu, YaoSheng Yang, Zhenghua Li, Wenliang Chen and Min Zhang |
683 |
Statistical Analysis of Missing Translation in Simultaneous Interpretation Using A Large-scale Bilingual Speech Corpus |
Zhongxi Cai, Koichiro Ryu and Shigeki Matsubara |
684 |
The DLDP Survey on Digital Use and Usability of EU Regional and Minority Languages |
Claudia Soria, Valeria Quochi and Irene Russo |
687 |
SimLex-999 for Polish |
Agnieszka Mykowiecka, Malgorzata Marciniak and Piotr Rychlik |
688 |
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus |
Thanh-Le Ha, Jan Niehues, Matthias Sperber, Ngoc Quan Pham and Alexander Waibel |
689 |
Automated Evaluation of Out-of-Context Errors |
Patrick Huber, Jan Niehues and Alex Waibel |
691 |
A Lightweight Modeling Middleware for Corpus Processing |
Markus Gärtner and Jonas Kuhn |
693 |
Action Verb Corpus |
Stephanie Gross, Matthias Hirschmanner, Brigitte Krenn, Friedrich Neubarth and Michael Zillich |
694 |
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments |
Pierre Godard, Gilles Adda, Martine Adda-Decker, Juan Benjumea, Laurent Besacier, Jamison Cooper-Leavitt, Guy-Noel Kouarata, Lori Lamel, Hélène Maynard, Markus Mueller, Annie Rialland, Sebastian Stueker, François Yvon and Marcely Zanon Boito |
695 |
An Initial Test Collection for Ranked Retrieval of SMS Conversations |
Rashmi Sankepally and Douglas W. Oard |
701 |
Sharing Copies of Synthetic Clinical Corpora without Violating IPRs and Privacy Constraints — A Case Study Featuring the German JSYNCC Corpus |
Christina Lohr, Sven Buechel and Udo Hahn |
704 |
Modeling French Sign Language: a proposal for a semantically compositional system |
Mohamed Nassime Hadjadj, Michael Filhol and Annelies Braffort |
705 |
A multilingual collection of CoNLL-U-compatible morphological lexicons |
Benoît Sagot |
706 |
Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger |
Mahmoud El-Haj, Paul Rayson, Scott Piao and Jo Knight |
707 |
Preserving Workflow Reproducibility: The RePlay-DH Client as a Tool for Process Documentation |
Markus Gärtner, Uli Hahn and Sibylle Hermann |
709 |
FrNewsLink: a corpus linking TV Broadcast News Segments and Press Articles |
Nathalie Camelin, Géraldine Damnati, Abdessalam Bouchekif, Anais Landeau, Delphine Charlet and Yannick Esteve |
710 |
An Italian Twitter Corpus of Hate Speech against Immigrants |
Manuela Sanguinetti, Fabio Poletto, Cristina Bosco, Viviana Patti and Marco Stranisci |
711 |
K-QuAD: Semi-supervised Creation of Korean QA Dataset using Automated Translation |
Kyungjae Lee, Kyoungho Yoon, Sunghyun Park and Seung-won Hwang |
713 |
ARMI: An Architecture for Recording Multimodal Interactions |
Patrik Jonell, Mattias Bystedt, Per Fallgren, Dimosthenis Kontogiorgos, José Lopes, Zofia Malisz, Samuel Mascarenhas, Catharine Oertel, Eran Raveh and Todd Shore |
714 |
EMO&LY (EMOtion and AnomaLY): A new corpus for anomaly detection in an audiovisual stream with emotional context. |
Cédric Fayet, Arnaud Delhay, Damien Lolive and Pierre-François Marteau |
715 |
TriMED: A Multilingual Terminological Database |
Federica Vezzani, Giorgio Maria Di Nunzio and Geneviève Henrot |
716 |
Fluid Annotation: A Granularity-aware Annotation Tool for Chinese Word Fluidity |
Shu-Kai Hsieh, Yu-Hsiang Tseng, Chi-Yao Lee and Chiung-Yu Chiang |
717 |
A Multi-layer Annotated Corpus of Argumentative Text: From Argument Schemes to Discourse Relations |
Elena Musi, Manfred Stede, Leonard Kriese, Smaranda Muresan and Andrea Rocci |
719 |
Creating Large-Scale Argumentation Structures for Dialogue Systems |
Kazuki Sakai, Akari Inago, Ryuichiro Higashinaka, Yuichiro Yoshikawa, Hiroshi Ishiguro and Junji Tomita |
721 |
Advances in Pre-Training Distributed Word Representations |
Tomas Mikolov, Edouard Grave, Piotr Bojanowski, Christian Puhrsch and Armand Joulin |
723 |
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis. |
Aghilas Sini, Damien Lolive, Gaëlle Vidal, Marie Tahon and Élisabeth Delais-Roussarie |
724 |
Corpora of Typical Sentences |
Lydia Müller, Uwe Quasthoff and Maciej Sumalvico |
725 |
Neural Caption Generation for News Images |
Vishwash Batra, Yulan He and George Vogiatzis |
726 |
Annotating Opinions and Opinion Targets in Student Course Feedback |
Janaka Chathuranga, Shanika Ediriweera, Pranidhith Munasinghe, Ravindu Hasantha and Surangika Ranathunga |
728 |
An Annotation Language for Semantic Search of Legal Sources |
Adeline Nazarenko, Francois Levy and Adam Wyner |
729 |
Contextualized Usage-Based Material Selection |
Dirk De Hertog and Piet Desmet |
730 |
e-magyar -- A Digital Language Processing System |
Tamás Váradi, Eszter Simon, Bálint Sass, Attila Novák, Balázs Indig, Richárd Farkas and Veronika Vincze |
731 |
Huge Automatically Extracted Training-Sets for Multilingual Word Sense Disambiguation. |
Tommaso Pasini, Francesco Maria Elia and Roberto Navigli |
732 |
PyRATA, Python Rule-bAsed Text Analysis |
Nicolas Hernandez |
734 |
iLCM - A virtual research infrastructure for large-scale qualitative data |
Andreas Niekler, Christian Kahmann, Gregor Wiedemann and Gerhard Heyer |
736 |
fastSense: An Efficient Word Sense Disambiguation Classifier |
Tolga Uslu, Alexander Mehler, Daniel Baumartz and Alexander Henlein |
737 |
The German Reference Corpus DeReKo: New developments – new opportunities |
Marc Kupietz, Harald Lüngen, Pawel Kamocki and Andreas Witt |
740 |
Sanaphor++: Combining Deep Neural Networks with Semantics for Coreference Resolution |
Julien Plu, Roman Prokofyev, Alberto Tonon, Philippe Cudré-Mauroux, Djellel Eddine Difallah, Raphael Troncy and Giuseppe Rizzo |
741 |
'Aye' or 'No'? Speech-level Stance Detection on Hansard UK Parliamentary Debate Transcripts |
Gavin Abercrombie and Riza Batista-Navarro |
743 |
Annotating Abstract Meaning Representations for Spanish |
Noelia Migueles-Abraira, Rodrigo Agerri and Arantza Diaz de Ilarraza |
745 |
Evaluating Inflectional Complexity Crosslinguistically: a Processing Perspective |
Claudia Marzi, Marcello Ferro, Ouafae Nahli, Patrizia Belik, Stavros Bompolas and Vito Pirrelli |
746 |
Risamálheild: A Very Large Icelandic Text Corpus |
Steinþór Steingrímsson, Sigrún Helgadóttir and Eiríkur Rögnvaldsson |
749 |
ASR for documenting acutely under-resourced indigenous languages |
Robert Jimerson and Emily Prud'hommeaux |
751 |
Construction of English-French Multimodal Affective Conversational Corpus from Drama TV Series |
Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Lestari and Satoshi Nakamura |
752 |
Lexical Profiling of Environmental Corpora |
Benoît Robichaud and Marie-Claude L'Homme |
755 |
SandhiKosh: A Benchmark Corpus for Evaluating Sanskrit Sandhi Tools |
Shubham Bhardwaj, Neelamadhav Gantayat, Rahul Garg and Sumeet Agarwal |
757 |
SentEval: An Evaluation Toolkit for Universal Sentence Representations |
Alexis Conneau and Douwe Kiela |
758 |
PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering |
Andrei Dulceanu, Thang Le Dinh, Walter Chang, Trung Bui, Doo Soon Kim, Manh Chien Vu and Seokhwan Kim |
760 |
Scalable Visualisation of Sentiment and Stance |
Jon Chamberlain, Udo Kruschwitz and Orland Hoeber |
763 |
Exploring Conversational Language Generation for Rich Content about Hotels |
Marilyn Walker |
765 |
A New Version of Składnica Treebank of Polish Harmonised with the Valency Dictionary Walenty |
Marcin Woliński, Elżbieta Hajnicz and Tomasz Bartosiak |
769 |
A Corpus of Expert and Novice Writers of Academic Spanish for a Lexical Tool |
Marcos García Salido, Marcos Garcia, Milka Villayandre-Llamazares and Margarita Alonso-Ramos |
770 |
Creating a Translation Matrix of the Bible’s Names Across 591 Languages |
Winston Wu, Nidhi Vyas and David Yarowsky |
773 |
The LODeXporter: Flexible Generation of Linked Open Data Triples from NLP Frameworks for Automatic Knowledge Base Construction |
René Witte and Bahar Sateli |
774 |
A Comparative Study of Extremely Low-Resource Transliteration of the World’s Languages |
Winston Wu and David Yarowsky |
775 |
From ‘Solved Problems’ to New Challenges: A Report on LDC Activities |
Christopher Cieri, Mark Liberman, Stephanie Strassel, Denise DiPersio, Jonathan Wright and Andrea Mazzucchi |
777 |
Automating Document Discovery in the Systematic Review Process: How to Use Chaff to Extract Wheat |
Christopher Norman, Mariska Leeflang, Pierre Zweigenbaum and Aurélie Névéol |
780 |
A new Evaluation and Data Exploration Tool: Evalomatic |
Olivier Galibert, Guillaume Bernard, Agnes Delaborde, Sabrina Lecadre and Juliette Kahn |
782 |
Analyzing Citation-Distance Networks for Evaluating Publication Impact |
Drahomira Herrmannova, Petr Knoth and Robert Patton |
783 |
RDF2PT: Generating Brazilian Portuguese Texts from RDF Data |
Diego Moussallem, Thiago Ferreira, Marcos Zampieri, Maria Cláudia Cavalcanti, Geraldo Xexéo, Mariana Neves and Axel-Cyrille Ngonga Ngomo |
785 |
Multi-lingual Argumentative Corpora in English, Turkish, Greek, Albanian, Croatian, Serbian, Macedonian, Bulgarian, Romanian and Arabic |
Alfred Sliwa, Yuan Man, Ruishen Liu, Niravkumar Borad, Seyedeh Ziyaei, Mina Ghobadi, Firas Sabbah and Ahmet Aker |
786 |
Performance Impact Caused by Hidden Bias of Training Data for Recognizing Textual Entailment |
Masatoshi Tsuchiya |
787 |
One event, many representations. Mapping action concepts through visual features. |
Alessandro Panunzi, Lorenzo Gregori and Andrea Amelio Ravelli |
789 |
UniMorph 2.0: Universal Morphology |
Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Arya McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner and Mans Hulden |
790 |
A Dataset for Inter-Sentence Relation Extraction using Distant Supervision |
Angrosh Mandya, Danushka Bollegala, Frans Coenen and Katie Atkinson |
792 |
Towards a Conversation-Analytic Taxonomy of Speech Overlap |
Felix Gervits and Matthias Scheutz |
794 |
A Framework for Multi-Language Service Design with the Language Grid |
Donghui Lin, Yohei Murakami and Toru Ishida |
795 |
BioRead: A New Dataset for Biomedical Reading Comprehension |
Dimitris Pappas, Ion Androutsopoulos and Haris Papageorgiou |
798 |
Parser combinators for Tigrinya and Oromo morphology |
Patrick Littell, Tom McCoy, Na-Rae Han, Shruti Rijhwani, Zaid Sheikh, David R. Mortensen, Teruko Mitamura and Lori Levin |
799 |
Czech Legal Text Treebank 2.0 |
Vincent Kríž and Barbora Hladka |
800 |
Development of an Annotated Multimodal Dataset for the Investigation of Classification and Summarisation of Presentations using High-Level Paralinguistic Features |
Keith Curtis, Nick Campbell and Gareth Jones |
801 |
The French-Algerian Code-Switching Triggered audio corpus (FACST) |
Amazouz Djegdjiga, Martine Adda-Decker and Lori Lamel |
803 |
Preparation and Usage of Xhosa Lexicographical Data for a Multilingual, Federated Environment |
Sonja Bosch, Thomas Eckart, Bettina Klimek, Dirk Goldhahn and Uwe Quasthoff |
805 |
Translating Web Search Queries into Natural Language Questions |
Adarsh Kumar and Sandipan Dandapat |
806 |
C-HTS: A Concept-based Hierarchical Text Segmentation approach |
Mostafa Bayomi and Seamus Lawless |
811 |
Literality and cognitive effort: Japanese and Spanish |
Isabel Lacruz, Michael Carl and Masaru Yamada |
812 |
LiDo RDF: From a Relational Database to a Linked Data Graph of Linguistic Terms and Bibliographic Data |
Bettina Klimek, Robert Schädlich, Edwin Knese, Dustin Kröger and Benedikt Elßmann |
813 |
Towards faithfully visualizing global linguistic diversity |
Garland McNew and Steven Moran |
814 |
A Lexicon of Discourse Markers for Portuguese – LDM-PT |
Amália Mendes, Iria del Río Gayo and Manfred Stede |
815 |
Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs |
Georg Rehm and Stefanie Hegele |
816 |
A Computational Architecture for the Morphology of Upper Tanana |
Olga Lovick, Christopher Cox, Miikka Silfverberg, Antti Arppe and Mans Hulden |
819 |
SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems |
Kevin Bowden, Jiaqi Wu, Shereen Oraby, Amita Misra and Marilyn Walker |
820 |
Shami: A Corpus of Levantine Arabic Dialects |
Chatrine Qwaider, Motaz Saad, Stergios Chatzikyriakidis and Simon Dobnik |
822 |
Chahta Anumpa: A multimodal corpus of the Choctaw Language |
Jacqueline Brixey, Eli Pincus and Ron Artstein |
823 |
The ICoN Corpus of Academic Written Italian (L1 and L2) |
Mirko Tavosanis and Federica Cominetti |
824 |
Effects of Gender Stereotypes on Trust and Likeability in Spoken Human-Robot Interaction |
Matthias Kraus, Johannes Kraus, Martin Baumann and Wolfgang Minker |
825 |
SumeCzech: Large Czech News-Based Summarization Dataset |
Milan Straka, Nikita Mediankin, Tom Kocmi, Zdeněk Žabokrtský and Jan Hajic |
826 |
MMQA: A Multi-domain Multi-lingual Question-Answering Framework for English and Hindi |
Deepak Gupta, Surabhi Kumari, Asif Ekbal and Pushpak Bhattacharyya |
827 |
Simulating ASR errors for training SLU systems |
Edwin Simonnet, Sahar Ghannay, Nathalie Camelin and Yannick Estève |
828 |
Annotated Corpus of Scientific Conference's Homepages for Information Extraction |
Piotr Andruszkiewicz and Rafal Hazan |
829 |
Meet CLARIN’s Key Resource Families |
Darja Fišer, Jakob Lenardič and Tomaž Erjavec |
831 |
Towards a music-language mapping |
Michele Berlingerio and Francesca Bonin |
832 |
Two Multilingual Corpora Extracted from the Tenders Electronic Daily for Machine Learning and Machine Translation Applications. |
Oussama Ahmia, Nicolas Béchet and Pierre-François Marteau |
833 |
Pronunciation Variants and ASR of Colloquial Speech: A Case Study on Czech |
David Lukeš, Marie Kopřivová, Zuzana Komrsková and Petra Klimešová |
835 |
Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods |
Thomas Proisl, Stefan Evert, Fotis Jannidis, Christof Schöch, Leonard Konle and Steffen Pielström |
838 |
Unfolding the External Behavior and Inner Affective State of Teammates through Ensemble Learning: Experimental Evidence from a Dyadic Team Corpus |
Aggeliki Vlachostergiou, Mark Dennison, Catherine Neubauer, Stefan Scherer, Peter Khooshabeh and Andre Harrison |
840 |
Browsing the Terminological Structure of a Specialized Domain: A Method Based on Lexical Functions and their Classification |
Marie-Claude L'Homme, Benoît Robichaud and Nathalie Prévil |
841 |
Revita: a language learning platform at the intersection of ITS and CALL |
Anisia Katinskaia and Roman Yangarber |
844 |
One Language to rule them all: modelling Morphological Patterns in a Large Scale Italian Lexicon with SWRL |
Fahad Khan, Andrea Bellandi, Francesca Frontini and Monica Monachini |
845 |
The Distribution and Prosodic Realization of Verb Forms in German Infant-Directed Speech |
Bettina Braun and Katharina Zahner |
846 |
Generating a Gold Standard for a Swedish Sentiment Lexicon |
Jacobo Rouces, Nina Tahmasebi, Lars Borin and Stian Rødven Eide |
847 |
The IIT Bombay English-Hindi Parallel Corpus |
Anoop Kunchukuttan, Pratik Mehta and Pushpak Bhattacharyya |
848 |
Developing New Linguistic Resources and Tools for the Galician Language |
Rodrigo Agerri, Xavier Gómez Guinovart, German Rigau and Miguel Anxo Solla Portela |
849 |
Annotation and Analysis of Extractive Summaries for the Kyutech Corpus |
Takashi Yamamura and Kazutaka Shimada |
850 |
Parseme-it: Italian Resources for Verbal Multiword Expressions |
Johanna Monti, Maria Pia di Buono and Federico Sangati |
851 |
NoReC: The Norwegian Review Corpus |
Erik Velldal, Lilja Øvrelid, Eivind Alexander Bergem, Cathrine Stadsnes, Samia Touileb and Fredrik Jørgensen |
852 |
Using Adversarial Examples in Natural Language Processing |
Petr Bělohlávek, Ondřej Plátek, Zdeněk Žabokrtský and Milan Straka |
853 |
Evaluation of Machine Translation Performance Across Multiple Genres and Languages |
Marlies van der Wees, Arianna Bisazza and Christof Monz |
854 |
Parallel Corpora for the Biomedical Domain |
Aurélie Névéol, Antonio Jimeno Yepes, Mariana Neves and Karin Verspoor |
855 |
Improving Machine Translation of Educational Content via Crowdsourcing |
Maximiliana Behnke, Antonio Valerio Miceli Barone, Rico Sennrich, Vilelmini Sosoni, Thanasis Naskos, Eirini Takoulidou, Maria Stasimioti, Menno van Zaanen, Sheila Castilho, Federico Gaspari, Yota Georgakopoulou, Valia Kordoni, Markus Egg and Katia Lida Kermanidis |
856 |
Abstract Meaning Representation of Constructions: The More We Include, the Better the Representation |
Claire Bonial, Bianca Badarau, Kira Griffitt, Ulf Hermjakob, Kevin Knight, Tim O'Gorman, Martha Palmer and Nathan Schneider |
857 |
SenSALDO: Creating a Sentiment Lexicon for Swedish |
Jacobo Rouces, Nina Tahmasebi, Lars Borin and Stian Rødven Eide |
858 |
A Large Multilingual and Multi-domain Dataset for Recommender Systems |
Giorgia Di Tommaso and Paola Velardi |
861 |
Aggression-annotated Corpus of Hindi-English Code-mixed Data |
Ritesh Kumar, Aishwarya Adhikari and Akshit Bhatia |
862 |
Towards a Linked Open Data Edition of Sumerian Corpora |
Christian Chiarcos, Émilie Pagé-Perron, Ilya Khait and Lucas Reckling |
863 |
SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages |
Siamak Barzegar, Brian Davis, Siegfried Handschuh and André Freitas |
864 |
A Diachronic Corpus for Literary Style Analysis |
Carmen Klaussner and Carl Vogel |
865 |
A Review of the Interoperability between Similar or Complementary Linguistic Annotations in Overlapping Corpora: The Case of Events |
Chantal van Son, Lora Aroyo, Oana Inel, Roser Morante and Piek Vossen |
868 |
Web-based Annotation Tool for Inflectional Language Resources |
Abdulrahman Alosaimy and Eric Atwell |
869 |
The ACoLi CoNLL Libraries: Beyond Tab-Separated Values |
Christian Chiarcos and Niko Schenk |
870 |
HiNTS: A Tagset for Middle Low German |
Fabian Barteld, Sarah Ihden, Katharina Dreessen and Ingrid Schröder |
871 |
Improving Unsupervised Keyphrase Extraction using Background Knowledge |
Yang Yu |
873 |
Building a Constraint Grammar Parser for Plains Cree Verbs and Arguments |
Katherine Schmirler, Antti Arppe, Trond Trosterud and Lene Antonsen |
874 |
CBFC: a parallel L2 speech corpus for Korean and French learners |
Hiyon Yoo and Inyoung Kim |
877 |
Identification of Personal Information Shared in Chat-Oriented Dialogue |
Sarah Fillwock and David Traum |
878 |
Transfer Learning for Named-Entity Recognition with Neural Networks |
Ji Young Lee, Franck Dernoncourt and Peter Szolovits |
880 |
Systems’ Agreements and Disagreements in Temporal Processing: An Extensive Error Analysis of the TempEval-3 Task |
Tommaso Caselli and Roser Morante |
881 |
Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models |
Natalia Tomashenko and Yannick Estève |
883 |
SentiArabic: A Sentiment Analyzer for Standard Arabic |
Ramy Eskander |
884 |
Towards the Inference of Semantic Relations in Complex Nominals: a Pilot Study |
Melania Cabezas-García and Pilar León-Araúz |
885 |
Leveraging Lexical Resources and Constraint Grammar for Rule-Based Part-of-Speech Tagging in Welsh |
Steven Neale, Kevin Donnelly, Gareth Watkins and Dawn Knight |
886 |
What's Wrong, Python? -- A Visual Differ and Graph Library for NLP in Python |
Balázs Indig, András Simonyi and Noémi Ligeti-Nagy |
887 |
Cross-linguistically Small World Networks are Ubiquitous in Child-directed Speech |
Steven Moran, Danica Pajović and Sabine Stoll |
888 |
Increasing the Accessibility of Time-Aligned Speech Corpora with Spokes Mix |
Piotr Pęzik |
889 |
A Repository of Corpora for Summarization |
Franck Dernoncourt and Walter Chang |
890 |
Epitran: Precision G2P for Many Languages |
David R. Mortensen, Siddharth Dalmia and Patrick Littell |
891 |
Towards a Diagnosis of Textual Difficulties for Children with Dyslexia |
Solen Quiniou and Béatrice Daille |
892 |
Automatic Wordnet Mapping: from CoreNet to Princeton WordNet |
Jiseong Kim, Younggyun Hahm, Sunggoo Kwon and Key-Sun Choi |
894 |
Introducing NIEUW: Novel Incentives and Workflows for Eliciting Linguistic Data |
Christopher Cieri, James Fiumara, Mark Liberman, Chris Callison-Burch and Jonathan Wright |
899 |
ANCOR-AS: Enriching the ANCOR Corpus with Syntactic Annotations |
Loïc Grobol, Isabelle Tellier, Eric De La Clergerie, Marco Dinarelli and Frédéric Landragin |
901 |
L1-L2 Parallel Treebank of Learner Chinese: Overused and Underused Syntactic Structures |
Keying Li and John Lee |
903 |
RtGender: A Corpus of Responses to Gender for Studying Gender Bias |
Rob Voigt, David Jurgens, Vinodkumar Prabhakaran, Dan Jurafsky and Yulia Tsvetkov |
905 |
WikiDragon: A Java Framework For Diachronic Content And Network Analysis Of MediaWikis |
Rüdiger Gleim, Alexander Mehler and Sung Y. Song |
908 |
Parsivar: A Language Processing Toolkit for Persian |
Salar Mohtaj, Behnam Roshanfekr, Atefeh Zafarian and Habibollah Asghari |
909 |
Determining Trolling in Textual Comments |
Luis Gerardo Mojica de la Vega and Vincent Ng |
910 |
Event Reference Interpretation with Instance-Based Learning |
Jing Lu and Vincent Ng |
913 |
A Deep Neural Network Model for Part-Of-Speech Tagging of Tweets |
Sara Meftah and Nasredine Semmar |
914 |
Indra: A Word Embedding and Semantic Relatedness Server |
Juliano Efson Sales, Leonardo Souza, Siamak Barzegar, Brian Davis, Siegfried Handschuh and André Freitas |
917 |
Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task |
Dagmar Gromann and Thierry Declerck |
920 |
Strategies and Challenges for Crowdsourcing Regional Dialect Perception Data for Swiss German and Swiss French |
Jean-Philippe Goldman, Simon Clematide, Mathieu Avanzi and Raphaël Tandler |
922 |
Rollenwechsel-English: a large-scale semantic role corpus |
Asad Sayeed, Pavel Shkadzko and Vera Demberg |
923 |
Contextual Dependencies in Time-Continuous Multidimensional Affect Recognition |
Dmitrii Fedotov, Maxim Sidorov and Wolfgang Minker |
924 |
Modeling Northern Haida Verb Morphology |
Jordan Lachler, Lene Antonsen, Trond Trosterud, Sjur Moshagen and Antti Arppe |
925 |
The MonpAGE Database for the Documentation of Spoken French Throughout Adulthood |
Cecile Fougeron, Veronique Delvaux, Lucie Ménard and Marina Laganaro |
927 |
Up-cycling Data for Natural Language Generation |
Amy Isard |
929 |
You Tweet What You Speak: A City-Level Dataset of Arabic Dialects |
Muhammad Abdul-Mageed, Hassan Alhuzali and Mohamed Elaraby |
932 |
Graph Based Semi-Supervised Learning Approach for Tamil POS tagging |
Mokanarangan Thayaparan, Surangika Ranathunga and Uthayasanker Thayasivam |
933 |
The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners |
Roberts Darģis, Ilze Auziņa and Kristīne Levāne-Petrova |
934 |
Creating Large-Scale Multilingual Cognate Tables |
Winston Wu and David Yarowsky |
935 |
Creation of a Balanced State-of-the-Art Multilayer Corpus for NLU |
Normunds Gruzitis, Lauma Pretkalnina, Baiba Saulite, Laura Rituma, Gunta Nespore, Arturs Znotins, Roberts Dargis and Peteris Paikens |
936 |
Indian Language Wordnets and their Linkages with Princeton WordNet |
Diptesh Kanojia, Kevin Patel and Pushpak Bhattacharyya |
937 |
Towards a Standardized Dataset for Noun Compound Interpretation |
Girishkumar Ponkiya, Kevin Patel, Pushpak Bhattacharyya and Girish K. Palshikar |
938 |
A UIMA Database Interface for Managing NLP-related Text Annotations |
Giuseppe Abrami and Alexander Mehler |
940 |
Phonetically Balanced Code-Mixed Speech Corpus for Hindi-English Automatic Speech Recognition |
Ayushi Pandey, Brij Mohan Lal Srivastava and Suryakanth V Gangashetty |
941 |
ParCorFull: a Parallel Corpus Annotated with Full Coreference |
Ekaterina Lapshinova-Koltunski, Christian Hardmeier and Pauline Krielke |
942 |
A Vietnamese Dialog Act Corpus Based on Standard ISO 24617-2 |
Ngo Thi Lan, Pham Khac Linh and Takeda Hideaki |
947 |
PDF-to-Text Reanalysis for Linguistic Data Mining |
Michael Wayne Goodman, Ryan Georgi and Fei Xia |
948 |
BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools |
Fatima Hamlaoui, Emmanuel-Moselly Makasso, Markus Müller, Jonas Engelmann, Gilles Adda, Alex Waibel and Sebastian Stüker |
949 |
Handling Rare Word Problem using Synthetic Training Data for Sinhala and Tamil Neural Machine Translation |
Pasindu Tennage, Prabath Sandaruwan, Malith Thilakarathne, Achini Herath and Surangika Ranathunga |
951 |
Annotating Temporally-Anchored Spatial Knowledge by Leveraging Syntactic Dependencies |
Alakananda Vempala and Eduardo Blanco |
952 |
Three Dimensions of Reproducibility in Natural Language Processing |
K. Bretonnel Cohen, Jingbo Xia, Pierre Zweigenbaum, Tiffany Callahan, Foster Goss, Nancy Ide, Aurélie Névéol, Cyril Grouin and Lawrence E. Hunter |
954 |
Researching Less-Resourced Languages – the DigiSami Corpus |
Kristiina Jokinen |
955 |
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora |
Pierre Zweigenbaum, Serge Sharoff and Reinhard Rapp |
957 |
Understanding Emotions: A Dataset of Tweets to Study Interactions between Affect Categories |
Saif Mohammad and Svetlana Kiritchenko |
958 |
A Hybrid Approach to Build Bilingual Lexicons of Multiword Expressions from Parallel Corpora |
Nasredine Semmar |
959 |
Corpus Building and Evaluation of Aspect-based Opinion Summaries from Tweets in Spanish |
Daniel Peñaloza, Juanjosé Tenorio, Rodrigo López, Héctor Gomez, Arturo Oncevay and Marco Antonio Sobrevilla Cabezudo |
963 |
Semantic Supersenses for English Possessives |
Austin Blodgett and Nathan Schneider |
964 |
Expanding Abbreviations in a Strongly Inflected Language: Are Morphosyntactic Tags Sufficient? |
Piotr Żelasko |
965 |
Building Named Entity Recognition Taggers via Parallel Corpora |
Rodrigo Agerri, Yiling Chung, Itziar Aldabe, Nora Aranberri, Gorka Labaka and German Rigau |
966 |
A Dataset of Image Affect Intensities |
Saif Mohammad and Svetlana Kiritchenko |
967 |
Metaphor Suggestions based on a Semantic Metaphor Repository |
Gerard de Melo |
970 |
Generation of a Spanish Artificial Collocation Error Corpus |
Sara Rodríguez-Fernández, Roberto Carlini and Leo Wanner |
971 |
Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation |
Caitlin Richter, Matthew Wickes, Deniz Beser and Mitchell Marcus |
974 |
Cross-Document, Cross-Language Event Coreference Annotation Using Event Hoppers |
Zhiyi Song, Ann Bies, Justin Mott, Xuansong Li, Stephanie Strassel and Christopher Caruso |
977 |
Grounding Gradable Adjectives through Crowd-sourcing |
Rebecca Sharp, Ajay Nagesh, Dane Bell and Mihai Surdeanu |
978 |
Building an English Vocabulary Knowledge Dataset of Japanese English-as-a-Second-Language Learners Using Crowdsourcing |
Yo Ehara |
981 |
Test Sets for Chinese Nonlocal Dependency Parsing |
Manjuan Duan and William Schuler |
985 |
Structured Interpretation of Temporal Relations |
Yuchen Zhang and Nianwen Xue |
986 |
Annotating Reflections for Therapeutic Emotive Companion Robots |
Nishitha Guntakandla |
987 |
A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction |
Dimosthenis Kontogiorgos, Vanya Avramova, Simon Alexandersson, Patrik Jonell, Catharine Oertel, Jonas Beskow, Gabriel Skantze and Joakim Gustafson |
989 |
Studying Muslim Stereotyping through Microportrait Extraction |
Antske Fokkens, Nel Ruigrok, Camiel Beukeboom, Gagenstein Sarah and Wouter van Attveldt |
992 |
Adding Syntactic Annotations to Flickr30k Entities Corpus for Multimodal Ambiguous Prepositional-Phrase Attachment Resolution |
Sebastien Delecraz, Alexis Nasr, Frederic Bechet and Benoit Favre |
995 |
Introducing the CLARIN Knowledge Centre for Linguistic Diversity and Language Documentation |
Hanna Hedeland, Timm Lehmberg, Felix Rau, Mandana Seyfeddinipur and Andreas Witt |
996 |
Application and Analysis of a Multi-layered Scheme for Irony on the Italian Twitter Corpus TWITTIRÒ |
Alessandra Teresa Cignarella, Cristina Bosco, Viviana Patti and Mirko Lai |
997 |
Automatic Labeling of Problem-Solving Dialogues for Computational Microgenetic Learning Analytics |
Yuanliang Meng, Anna Rumshisky and Florence Sullivan |
998 |
Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction |
Paul Rodrigues, Valerie Novak, C. Anton Rytting, Julie Yelle, Jennifer Boutz and Tim Buckwalter |
999 |
Low Resource Methods for Medieval Charter Sections Analysis |
Petra Galuscakova and Lucie Neuzilova |
1000 |
Classifier-based Polarity Propagation in WordNet |
Jan Kocoń, Arkadiusz Janz and Maciej Piasecki |
1001 |
Annotating Educational Questions for Student Response Analysis |
Andreea Godea and Rodney Nielsen |
1002 |
Error annotation in a Learner Corpus of Portuguese |
Iria del Río Gayo and Amália Mendes |
1004 |
Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data |
Per Fallgren, Zofia Malisz and Jens Edlund |
1005 |
Visualizing the Dictionary of Regionalisms of France (DRF) |
Ada Wan |
1006 |
A Legal Perspective on Training Models for Natural Language Processing |
Richard Eckart de Castilho, Giulia Dore, Penny Labropoulou, Tom Margoni and Iryna Gurevych |
1008 |
CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task |
Robert Herms, Maria Wirzberger, Maximilian Eibl and Günter Daniel Rey |
1009 |
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles |
Daniel Ferrés, Horacio Saggion and Francesco Ronzano |
1010 |
SB-CH: A Swiss German Corpus with Sentiment Annotations |
Don Tuggener |
1011 |
Annotating If the Authors of a Tweet are Located at the Locations They Tweet About |
Vivek Reddy Doudagiri, Alakananda Vempala and Eduardo Blanco |
1012 |
SW4ALL: a CEFR Classified and Aligned Corpus for Language Learning |
Rodrigo Wilkens, Leonardo Zilio and Cédrick Fairon |
1013 |
DART: A Large Dataset of Dialectal Arabic Tweets |
Israa Alsarsour, Esraa Mohamed, Reem Suwaileh and Tamer Elsayed |
1015 |
VAST: A Corpus of Video Annotation for Speech Technologies |
Jennifer Tracey and Stephanie Strassel |
1016 |
Analyzing Middle High German syntax |
Christian Chiarcos, Maria Sukhareva and Benjamin Kosmehl |
1018 |
auto-hMDS: Automatic Construction of a Large Heterogeneous Multi-Document Summarization Corpus |
Markus Zopf |
1019 |
The Linguistic Category Model in Polish (LCM-PL) |
Aleksander Wawer and Justyna Sarzyńska |
1020 |
Simple Semantic Annotation and Situation Frames: Two Approaches to Basic Text Understanding in LORELEI |
Kira Griffitt, Jennifer Tracey, Ann Bies and Stephanie Strassel |
1021 |
NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating System |
Victoria Lin, Chenglong Wang, Luke Zettlemoyer and Michael D. Ernst |
1022 |
MGAD: Multilingual Generation of Analogy Datasets |
Mostafa Abdou, Artur Kulmizev and Vinit Ravishankar |
1024 |
The GermaParl Corpus of Parliamentary Protocols |
Andreas Blaette and Andre Blessing |
1027 |
DTFit: A Collection of Dynamically-Updated Thematic Fit Judgements |
Paolo Vassallo, Emmanuele Chersoni, Alessandro Lenci and Philippe Blache |
1028 |
Utilizing Large Twitter Corpora to Create Sentiment Lexica |
Valerij Fredriksen, Brage Jahren and Björn Gambäck |
1029 |
Modeling Diverse Word Compounding Processes Across Languages |
Winston Wu and David Yarowsky |
1030 |
A Survey on Automatically-Constructed WordNets and their Evaluation: Lexical and Word Embedding-based Approaches |
Steven Neale |
1032 |
Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions |
Kira Droganova and Daniel Zeman |
1033 |
Analyzing the Quality of Counseling Conversations: the Tell-Tale Signs of Effective Counseling |
Verónica Pérez-Rosas, Xuetong Sun, Christy Li, Yuchen Wang and Rada Mihalcea |
1034 |
Universal Dependencies for Ainu |
Hajime Senuma and Akiko Aizawa |
1035 |
Graphene: an Open Information Extraction Framework for Complex Sentences |
Matthias Cetto, Bernhard Bermeitinger, Siegfried Handschuh and André Freitas |
1036 |
Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction |
Adam Ek, Mats Wirén, Robert Östling, Kristina Nilsson Björkenstam, Gintare Grigonyte and Sofia Gustafson Capková |
1044 |
WordNet-Shp: Towards the Building of a Lexical Database for a Peruvian Minority Language |
Diego Maguiño Valencia, Arturo Oncevay and Marco Antonio Sobrevilla Cabezudo |
1045 |
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform |
Ingmar Steiner and Sébastien Le Maguer |
1046 |
Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus |
Injy Hamed, Mohamed Elmahdy and Slim Abdennadher |
1047 |
Laying the Groundwork for Knowledge Base Population: Nine Years of Linguistic Resources for TAC KBP |
Jeremy Getman, Joe Ellis, Stephanie Strassel, Zhiyi Song and Jennifer Tracey |
1048 |
Increasing Argument Annotation Reproducibility by Using Inter-annotator Agreement to Improve Guidelines |
Milagro Teruel, Cristian Cardellino, Laura Alonso Alemany and Serena Villata |
1049 |
BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages |
Benjamin Heinzerling and Michael Strube |
1050 |
An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures |
Leonardo Zilio, Rodrigo Wilkens and Cédrick Fairon |
1051 |
An Attribution Relations Corpus for Political News |
Edward Newell, Drew Margolin and Derek Ruths |
1052 |
No more beating about the bush: A Step towards Idiom Handling for Indian Language NLP |
Ruchit Agrawal, Vigneshwaran Muralidaran, Vighnesh Chenthil Kumar and Dipti Sharma |
1056 |
Integrating Generative Lexicon Event Structures into VerbNet |
Susan Brown, James Pustejovsky, Annie Zaenen and Martha Palmer |
1059 |
FontLex: A Typographical Lexicon based on Affective Associations |
Tugba Kulahcioglu and Gerard de Melo |
1063 |
Text Simplification from Professionally Produced Corpora |
Carolina Scarton, Gustavo Paetzold and Lucia Specia |
1064 |
Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task |
Akira Hayakawa, Carl Vogel, Saturnino Luz and Nick Campbell |
1065 |
Sentence and Clause Level Emotion Annotation, Detection, and Classification in a Multi-Genre Corpus |
Shabnam Tafreshi and Mona Diab |
1067 |
Automatic Identification of Study Fields in Scientific Corpora |
Eric Kergosien, Amin Farvardin, Maguelonne Teisseire, Marie-Noelle Bessagnet, Joachim Schöpfel, Stéphane Chaudiron, Bernard Jacquemin, Annig Lacayrelle, Mathieu Roche, Christian Sallaberry and Jean-Philippe Tonneau |
1069 |
Linguistically-driven Framework for Computationally Efficient and Scalable Sign Recognition |
Mark Dilsizian, Dimitri Metaxas and Carol Neidle |
1071 |
Improving a Neural-based Tagger for Multiword Expressions Identification |
Dušan Variš and Natalia Klyueva |
1072 |
Multilingual Word Segmentation: Training Many Language-Specific Tokenizers Smoothly Thanks to the Universal Dependencies Corpus |
Erwan Moreau and Carl Vogel |
1074 |
Part-of-speech Unification of Propbank corpora |
Tim O'Gorman, Sameer Pradhan, Martha Palmer, Julia Bonn and Kathryn Conger |
1075 |
CONDUCT: An Expressive Conducting Gesture Corpus for Sound Control |
Lei Chen, Sylvie Gibet and Camille Marteau |
1077 |
Building a Morphological Treebank for German from a Linguistic Database |
Petra Steiner and Josef Ruppenhofer |
1079 |
Build Fast and Accurate Lemmatization for Arabic |
Hamdy Mubarak |
1080 |
Tel(s)-Telle(s)-Signs: Highly Accurate Automatic Cross-Lingual Hypernym Discovery |
Ada Wan |
1081 |
Interpersonal Relationship Labels for the CALLHOME Corpus |
Denys Katerenchuk, David Guy Brizan and Andrew Rosenberg |
1084 |
Text Mining for History: first steps on building a large dataset |
Suemi Higuchi, Cláudia Freitas, Bruno Cuconato and Alexandre Rademaker |
1085 |
World Knowledge for Semantic Parsing with Abstract Meaning Representation |
Charles Welch, Jonathan K. Kummerfeld, Song Feng and Rada Mihalcea |
1086 |
Collecting Language Resources from Public Administrations in the Nordic and Baltic Countries |
Andrejs Vasiļjevs, Roberts Rozis and Rihards Kalniņš |
1087 |
Topical Intertextual Correspondence for Integrating Corpora |
Jacky Visser, Rory Duthie and John Lawrence |
1088 |
Medical Entity Corpus with PICO elements and Sentiment Analysis |
Linda Andersson, Markus Zlabinger, Allan Hanbury, Michael Andersson, Vanessa Quasnik and Jon Brassey |
1089 |
A vision-grounded dataset for predicting typical locations for verbs |
Nelson Mukuze, Anna Rohrbach, Vera Demberg and Bernt Schiele |
1094 |
Building A Tacit Knowledge Corpus for Weak Signals Detection |
Octavian Popescu and Ngoc Phuoc An Vo |
1095 |
Designing a Russian Idiom-Annotated Corpus |
Katsiaryna Aharodnik, Anna Feldman and Jing Peng |
1096 |
PyrEval: An Automated Method for Summary Content Analysis |
Yanjun Gao, Andrew Warner and Rebecca Passonneau |
1097 |
Manual vs Automatic Bitext Extraction |
Aibek Makazhanov, Zhenisbek Assylbekov and Bagdat Myrzakhmetov |
1099 |
Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level |
AbdelRahim Elmadany, Sherif Abdou and Mervat Gheith |
1100 |
Quest: A Natural Language Interface to Relational Databases |
Octavian Popescu |
1101 |
Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer |
Djamé Seddah, Eric De La Clergerie, Benoît Sagot, Héctor Martínez Alonso and Marie Candito |
1102 |
Retrieving Information from the French Lexical Network in RDF/OWL Format |
Alexsandro Fonseca, Fatiha Sadat and François Lareau |
1104 |
Classification of Closely Related Sub-dialects of Arabic Using Support-Vector Machines |
Samantha Wray |
1109 |
Acquiring Typological Evidence from Universal Dependency Treebanks |
Chiara Alzetta, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi |
1111 |
Evaluation of Croatian Word Embeddings |
Lukas Svoboda and Slobodan Beliga |
1115 |
Building Evaluation datasets for Cultural Microblog Retrieval |
Lorraine Goeuriot, Philippe Mulhem, Josiane Mothe and Eric San Juan |
1118 |
A Large Resource of Patterns for Verbal Paraphrases |
Octavian Popescu, Ngoc Phuoc An Vo and Vadim Sheinin |
1119 |
The European Language Resource Coordination: Collecting Language Resources for Public Sector Information Management |
Andrea Lösch and Valérie Mapelli |