RapidMiner Information Extraction Plugin

Description:

More and more information is spread across the internet and other large document collections.
It appears on websites (as plain text or embedded in HTML code), in documents such as PDF files, in log files, and so on.
Processing this huge and daily growing amount of information manually is impossible.

Therefore, information extraction (IE) techniques are used to automatically identify selected types of entities, relations, or events in free text.
While some IE systems handle IE tasks such as Named Entity Recognition (NER) as a black box, we present a highly modular system that can easily be adjusted and extended for both known and new tasks.

Link:

http://sourceforge.net/projects/ieplugin4rm/

Authors:

Jungermann, Felix

Publications:

Jungermann/2009a	Jungermann, Felix. Information Extraction with RapidMiner. In Wolfgang Hoeppner (editor), Proceedings of the GSCL Symposium 'Sprachtechnologie und eHumanities', pages 50-61, Universität Duisburg-Essen, Abteilung für Informatik und Angewandte Kognitionswissenschaft, Fakultät für Ingenieurwissenschaften, 2009. jungermann_2009a.pdf [342 KB]

Jungermann/2010a	Jungermann, Felix. An Information Extraction Plugin for RapidMiner 5. In Proceedings of the RapidMiner Community Meeting And Conference (RCOMM 2010), pages 67-72, 2010. jungermann_2010a.pdf [233 KB]

Jungermann/2011a	Jungermann, Felix. Handling Tree-Structured Values in RapidMiner. In Proceedings of the 2nd RapidMiner Community Meeting and Conference (RCOMM 2011), pages 151-162, 2011. jungermann_2011a.pdf [312 KB]

Jungermann/2011b	Jungermann, Felix. Tree Kernel Usage in Naive Bayes Classifiers. In Proceedings of the LWA 2011, 2011. jungermann_2011b.pdf [308 KB]

Jungermann/2011c	Jungermann, Felix. Documentation of the Information Extraction Plugin for RapidMiner. 2011. jungermann_2011c.pdf [394 KB]
