PROCEEDINGS

A PDF book containing the full collection of papers is available here. The table of contents and the conference and workshop programs in the book use hyperlinks to the respective papers, so it should be convenient to follow the program and have the right paper in front of you. You will also receive the book on CD-ROM as you register at the conference desk.

Alternatively, all individual contributions to the conference and the workshops are available below.

Main conference

Invited talks

The statistical approach to natural language processing: Achievements and open problems
Hermann Ney

Compositionality in (high-dimensional) space
Marco Baroni

Oral presentations

Data-driven knowledge extraction for the food domain
Michael Wiegand, Benjamin Roth and Dietrich Klakow

Integrating viewpoints into newspaper opinion mining for a media response analysis
Thomas Scholz and Stefan Conrad

A supervised POS tagger for written Arabic social networking corpora
Rania Al-Sabbagh and Roxana Girju

NLP workflow for on-line definition extraction from English and Slovene text corpora
Senja Pollak, Anže Vavpetič, Janez Kranjc, Nada Lavrač and Špela Vintar

Projecting semantic roles via Tai mappings
Hector-Hugo Franco-Penya and Martin Emms

Automatic Identification of motion verbs in WordNet and FrameNet
Parvin Sadat Feizabadi and Sebastian Padó

WordNet-based lexical simplification of a document
S. Rebecca Thomas and Sven Anderson

Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs
Ines Rehbein, Josef Ruppenhofer, Caroline Sporleder and Manfred Pinkal

Semantic analysis in word vector spaces with ICA and feature selection
Tiina Lindh-Knuutila, Jaakko Väyrynen and Timo Honkela

Aggregating skip bigrams into key phrase-based vector space model for web person disambiguation
Jian Xu, Qin Lu and Zhengzhong Liu

Named entity recognition: Exploring features
Maksim Tkachenko and Andrey Simanovsky

Mention detection: First steps in the development of a Basque coreference resolution system
Ander Soraluze, Olatz Arregi, Xabier Arregi, Klara Ceberio and Arantza Díaz de Ilarraza

Phonetically aided syntactic parsing of spoken language
Zeeshan Ahmed, Peter Cahill and Julie Carson-Berndsen

Linguistic analyses of the LAST MINUTE corpus
Dietmar Rösner, Manuela Kunze, Mirko Otto and Jörg Frommer

S-restricted monotone alignments
Steffen Eger

Use of linguistic features for improving English-Persian SMT
Zakieh Shakeri, Neda Noormohammadi, Shahram Khadivi and Noushin Riahi

Poster presentations

A semantic similarity measure based on lexico-syntactic patterns
Alexander Panchenko, Olga Morozova and Hubert Naets

A multi-level annotation model for fine-grained opinion detection in German blog comments
Bianka Trevisan, Melanie Neunerdt and Eva-Maria Jakobs

Robust processing of noisy web-collected data
Jelke Bloem, Michaela Regneri and Stefan Thater

Navigating sense-aligned lexical-semantic resources: The web interface to UBY
Iryna Gurevych, Michael Matuschek, Tri-Duc Nghiem, Judith Eckle-Kohler, Silvana Hartmann and Christian Meyer

Using information retrieval technology for a corpus analysis platform
Carsten Schnober

Three approaches to finding valence compounds
Yannick Versley, Anne Brock, Verena Henrich and Erhard Hinrichs

A computational semantic analysis of gradable adjectives
Mantas Kasperavicius

Extending dependency treebanks with good sentences
Alexander Volokh and Günter Neumann

Using subcategorization frames to improve French probabilistic parsing
Anthony Sigogne and Matthieu Constant

Statistical denormalization for Arabic text
Mohammed Moussa, Mohammed Fakhr and Kareem Darwish

Automatic identification of language varieties: The case of Portuguese
Marcos Zampieri and Binyam Gebre

Extending the STTS for the annotation of spoken language
Ines Rehbein and Sören Schalowski

Comparing variety corpora with vis-à-vis - A prototype system presentation
Stefanie Anstein

Selecting features for domain-independent named entity recognition
Maksim Tkachenko and Andrey Simanovsky

Ambiguity in German connectives: A corpus study
Angela Schneider and Manfred Stede

Corpus-based acquisition of German event- and object-denoting nouns
Stefan Gorzitze and Sebastian Padó

PATHOS 2012: First Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis

Invited talks

Emotions and creative language
Carlo Strappavara

Contributions

Unsupervised sentiment analysis with a simple and fast Bayesian model using Part-of-Speech feature selection
Christian Scheible and Hinrich Schütze

Sentiment analysis for media reputation research
Samuel Läubli, Mario Schranz, Urs Christen and Manfred Klenner

Opinion analysis: The effect of negation on polarity and intensity
Lei Zhang, Stéphane Ferrari and Patrice Enjalbert

Domain-specific variation of sentiment expressions: A methodology of analysis for academic writing
Stefania Degaetano-Orlieb, Elke Teich and Ekaterina Lapshinova-Koltunski

Creating annotated resources for polarity classification in Czech
Kateřina Veselovská, Jan Hajič Jr. and Jana Šindlerová

A phrase-based opinion list for the German language
Sven Rill, Sven Adolph, Johannes Drescher, Dirk Reinel, Jörg Scheidt, Oliver Schütz, Florian Wogenstein, Roberto V. Zicari and Nikolaos Korfiatis

Opinion mining in an informative corpus: Building lexicons
Patrice Enjalbert, Lei Zhang and Stéphane Ferrari

What is the meaning of 5 *'s? An investigation of the expression and rating of sentiment
Daniel Hardt and Julie Wulff

LThist 2012: First International Workshop on Language Technology for Historical Text(s)

Contributions

Rule-based normalisation of historical text - A diachronic study
Eva Pettersson, Beáta Megyesi and Joakim Nivre

Manual and semi-automatic normalization of historical spelling - case studies from Early New High German
Marcel Bollmann, Stefanie Dipper, Julia Krasselt and Florian Petran

The Sketch Engine as infrastructure for historical corpora
Adam Kilgarriff, Miloš Husák and Robyn Woodrow

Evaluating a post-editing approach for handwriting transcription
Verónica Romero, Joan Andreu Sánchez, Nicolás Serrano and Enrique Vidal

bokstaffua, bokstaffwa, bokstafwa, bokstaua, bokstawa ... Towards lexical link-up for a corpus of Old Swedish
Yvonne Adesam, Malin Ahlberg and Gerlof Bouma

Automatic extraction of potential examples of semantic change using lexical sets
Karin Cavallin

Automatic classification of folk narrative genres
Dong Nguyen, Dolf Trieschnigg, Theo Meder and Mariët Theune

The DTA 'base format': A TEI-subset for the compilation of interoperable corpora
Alexander Geyken, Susanne Haaf and Frank Wiegand

Building an old Occitan corpus via cross-Language transfer
Olga Scrivner and Sandra Kübler

Digitised historical text: Does it have to be mediOCRe?
Bea Alex, Claire Grover, Ewan Klein and Richard Tobin

Comparison of named entity recognition tools for raw OCR text
Kepa Joseba Rodriquez, Mike Bryant, Tobias Blanke and Magdalena Luszczynska

From semi-automatic to automatic affix extraction in Middle English corpora: Building a sustainable database for analyzing derivational morphology over time
Hagen Peukert

The reference corpus of Late Middle English scientific prose
Javier Calle-Martín, Laura Esteban-Segura, Teresa Marqués-Aguado and Antonio Miranda-García

LexSem 2012: Workshop on Recent Developments and Applications of Lexical-Semantic Resources

Contributions

The construction of a catalog of Brazilian Portuguese verbs
Márcia Cançado, Luisa Godoy and Luana Amaral

Comparing Czech and Russian valency on the material of VALLEX
Natalia Klyueva

Adding a constructicon to the Swedish resource network of Språkbanken
Benjamin Lyngfelt, Lars Borin, Markus Forsberg, Julia Prentice, Rudolf Rydstedt, Emma Sköldberg and Sofia Tingsell

A distributional memory for German
Sebastian Padó and Jason Utt

Towards high-accuracy bilingual phrase acquisition from parallel corpora
Lionel Nicolas, Egon Stemle and Klara Kranebitter

SFLR 2012: Workshop on Standards on Standards for Language Resources

Invited talks

Standards for language technology – Relevance and impact
Gerhard Budin

Standards for the technical infrastructure of language resource repositories: Persistent Identifiers
Oliver Schonefeld and Andreas Witt

Standards for the formal representation of linguistic data: An exchange format for feature structures
Rainer Osswald

Towards standardized descriptions of linguistic features: ISOcat and procedures for using common data categories
Menzo Windhouwer

Standardizing metadata descriptions of language resources: The Common Metadata Initiative, CMDI
Thorsten Trippel and Andreas Witt

Standardizing lexical-semantic resources - Fleshing out the abstract standard LMF
Judith Eckle-Kohler and Iryna Gurevych

A standardized general framework for encoding and exchange of corpus annotations: The Linguistic Annotation Framework, LAF
Kerstin Eckart

Standard for morphosyntactic and syntactic corpus annotation: The Morphosyntactic and the Syntactic Annotation Framework, MAF and SynAF
Laurent Romary

Towards standardized lexical semantic corpus annotation: Components of the Semantic Annotation Framework, SemAF
Tibor Kiss

Towards standards for corpus query: Work on a Lingua Franca for corpus query
Elena Frick, Piotr Bański and Andreas Witt

Getting involved into language resource standardization: The map of standards and ways to contribute to ongoing work
Gottfried Herzog