PROCEEDINGS
A PDF book containing the full collection of papers is available here. The table of contents and the conference and workshop programs in the book are hyperlinked to the respective papers, so you can follow the program with the right paper in front of you. You will also receive the book on CD-ROM when you register at the conference desk.
Alternatively, all individual contributions to the conference and the workshops are available below.
Main conference
Invited talks
The statistical approach to natural language processing: Achievements and open problems
Hermann Ney
Compositionality in (high-dimensional) space
Marco Baroni
Oral presentations
Data-driven knowledge extraction for the food domain
Michael Wiegand, Benjamin Roth and Dietrich Klakow
Integrating viewpoints into newspaper opinion mining for a media response analysis
Thomas Scholz and Stefan Conrad
A supervised POS tagger for written Arabic social networking corpora
Rania Al-Sabbagh and Roxana Girju
NLP workflow for on-line definition extraction from English and Slovene text corpora
Senja Pollak, Anže Vavpetič, Janez Kranjc, Nada Lavrač and Špela Vintar
Projecting semantic roles via Tai mappings
Hector-Hugo Franco-Penya and Martin Emms
Automatic identification of motion verbs in WordNet and FrameNet
Parvin Sadat Feizabadi and Sebastian Padó
WordNet-based lexical simplification of a document
S. Rebecca Thomas and Sven Anderson
Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs
Ines Rehbein, Josef Ruppenhofer, Caroline Sporleder and Manfred Pinkal
Semantic analysis in word vector spaces with ICA and feature selection
Tiina Lindh-Knuutila, Jaakko Väyrynen and Timo Honkela
Aggregating skip bigrams into key phrase-based vector space model for web person disambiguation
Jian Xu, Qin Lu and Zhengzhong Liu
Named entity recognition: Exploring features
Maksim Tkachenko and Andrey Simanovsky
Mention detection: First steps in the development of a Basque coreference resolution system
Ander Soraluze, Olatz Arregi, Xabier Arregi, Klara Ceberio and Arantza Díaz de Ilarraza
Phonetically aided syntactic parsing of spoken language
Zeeshan Ahmed, Peter Cahill and Julie Carson-Berndsen
Linguistic analyses of the LAST MINUTE corpus
Dietmar Rösner, Manuela Kunze, Mirko Otto and Jörg Frommer
S-restricted monotone alignments
Steffen Eger
Use of linguistic features for improving English-Persian SMT
Zakieh Shakeri, Neda Noormohammadi, Shahram Khadivi and Noushin Riahi
Poster presentations
A semantic similarity measure based on lexico-syntactic patterns
Alexander Panchenko, Olga Morozova and Hubert Naets
A multi-level annotation model for fine-grained opinion detection in German blog comments
Bianka Trevisan, Melanie Neunerdt and Eva-Maria Jakobs
Robust processing of noisy web-collected data
Jelke Bloem, Michaela Regneri and Stefan Thater
Navigating sense-aligned lexical-semantic resources: The web interface to UBY
Iryna Gurevych, Michael Matuschek, Tri-Duc Nghiem, Judith Eckle-Kohler, Silvana Hartmann and Christian Meyer
Using information retrieval technology for a corpus analysis platform
Carsten Schnober
Three approaches to finding valence compounds
Yannick Versley, Anne Brock, Verena Henrich and Erhard Hinrichs
A computational semantic analysis of gradable adjectives
Mantas Kasperavicius
Extending dependency treebanks with good sentences
Alexander Volokh and Günter Neumann
Using subcategorization frames to improve French probabilistic parsing
Anthony Sigogne and Matthieu Constant
Statistical denormalization for Arabic text
Mohammed Moussa, Mohammed Fakhr and Kareem Darwish
Automatic identification of language varieties: The case of Portuguese
Marcos Zampieri and Binyam Gebre
Extending the STTS for the annotation of spoken language
Ines Rehbein and Sören Schalowski
Comparing variety corpora with vis-à-vis - A prototype system presentation
Stefanie Anstein
Selecting features for domain-independent named entity recognition
Maksim Tkachenko and Andrey Simanovsky
Ambiguity in German connectives: A corpus study
Angela Schneider and Manfred Stede
Corpus-based acquisition of German event- and object-denoting nouns
Stefan Gorzitze and Sebastian Padó
PATHOS 2012: First Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis
Invited talks
Emotions and creative language
Carlo Strapparava
Contributions
Unsupervised sentiment analysis with a simple and fast Bayesian model using Part-of-Speech feature selection
Christian Scheible and Hinrich Schütze
Sentiment analysis for media reputation research
Samuel Läubli, Mario Schranz, Urs Christen and Manfred Klenner
Opinion analysis: The effect of negation on polarity and intensity
Lei Zhang, Stéphane Ferrari and Patrice Enjalbert
Domain-specific variation of sentiment expressions: A methodology of analysis for academic writing
Stefania Degaetano-Ortlieb, Elke Teich and Ekaterina Lapshinova-Koltunski
Creating annotated resources for polarity classification in Czech
Kateřina Veselovská, Jan Hajič Jr. and Jana Šindlerová
A phrase-based opinion list for the German language
Sven Rill, Sven Adolph, Johannes Drescher, Dirk Reinel, Jörg Scheidt, Oliver Schütz, Florian Wogenstein, Roberto V. Zicari and Nikolaos Korfiatis
Opinion mining in an informative corpus: Building lexicons
Patrice Enjalbert, Lei Zhang and Stéphane Ferrari
What is the meaning of 5 *'s? An investigation of the expression and rating of sentiment
Daniel Hardt and Julie Wulff
LThist 2012: First International Workshop on Language Technology for Historical Text(s)
Contributions
Rule-based normalisation of historical text - A diachronic study
Eva Pettersson, Beáta Megyesi and Joakim Nivre
Manual and semi-automatic normalization of historical spelling - case studies from Early New High German
Marcel Bollmann, Stefanie Dipper, Julia Krasselt and Florian Petran
The Sketch Engine as infrastructure for historical corpora
Adam Kilgarriff, Miloš Husák and Robyn Woodrow
Evaluating a post-editing approach for handwriting transcription
Verónica Romero, Joan Andreu Sánchez, Nicolás Serrano and Enrique Vidal
bokstaffua, bokstaffwa, bokstafwa, bokstaua, bokstawa ... Towards lexical link-up for a corpus of Old Swedish
Yvonne Adesam, Malin Ahlberg and Gerlof Bouma
Automatic extraction of potential examples of semantic change using lexical sets
Karin Cavallin
Automatic classification of folk narrative genres
Dong Nguyen, Dolf Trieschnigg, Theo Meder and Mariët Theune
The DTA 'base format': A TEI-subset for the compilation of interoperable corpora
Alexander Geyken, Susanne Haaf and Frank Wiegand
Building an Old Occitan corpus via cross-language transfer
Olga Scrivner and Sandra Kübler
Digitised historical text: Does it have to be mediOCRe?
Bea Alex, Claire Grover, Ewan Klein and Richard Tobin
Comparison of named entity recognition tools for raw OCR text
Kepa Joseba Rodriquez, Mike Bryant, Tobias Blanke and Magdalena Luszczynska
The reference corpus of Late Middle English scientific prose
Javier Calle-Martín, Laura Esteban-Segura, Teresa Marqués-Aguado and Antonio Miranda-García
LexSem 2012: Workshop on Recent Developments and Applications of Lexical-Semantic Resources
Contributions
The construction of a catalog of Brazilian Portuguese verbs
Márcia Cançado, Luisa Godoy and Luana Amaral
Comparing Czech and Russian valency on the material of VALLEX
Natalia Klyueva
Adding a constructicon to the Swedish resource network of Språkbanken
Benjamin Lyngfelt, Lars Borin, Markus Forsberg, Julia Prentice, Rudolf Rydstedt, Emma Sköldberg and Sofia Tingsell
A distributional memory for German
Sebastian Padó and Jason Utt
Towards high-accuracy bilingual phrase acquisition from parallel corpora
Lionel Nicolas, Egon Stemle and Klara Kranebitter
SFLR 2012: Workshop on Standards for Language Resources
Invited talks
Standards for language technology – Relevance and impact
Gerhard Budin
Standards for the technical infrastructure of language resource repositories: Persistent Identifiers
Oliver Schonefeld and Andreas Witt
Standards for the formal representation of linguistic data: An exchange format for feature structures
Rainer Osswald
Towards standardized descriptions of linguistic features: ISOcat and procedures for using common data categories
Menzo Windhouwer
Standardizing metadata descriptions of language resources: The Common Metadata Initiative, CMDI
Thorsten Trippel and Andreas Witt
Standardizing lexical-semantic resources - Fleshing out the abstract standard LMF
Judith Eckle-Kohler and Iryna Gurevych
A standardized general framework for encoding and exchange of corpus annotations: The Linguistic Annotation Framework, LAF
Kerstin Eckart
Standard for morphosyntactic and syntactic corpus annotation: The Morphosyntactic and the Syntactic Annotation Framework, MAF and SynAF
Laurent Romary
Towards standards for corpus query: Work on a Lingua Franca for corpus query
Elena Frick, Piotr Bański and Andreas Witt
Getting involved into language resource standardization: The map of standards and ways to contribute to ongoing work
Gottfried Herzog